Thesis details
Real-time video scene analysis with heterogeneous processors
QA75 Electronic computers. Computer science; QA76 Computer software; TK Electrical engineering. Electronics. Nuclear engineering
Blair, Calum Grahame ; Robertson, Neil M
University: University of Glasgow
Keywords: FPGA, GPU, heterogeneous architecture, object detection, computer vision
Other: http://theses.gla.ac.uk/5061/8/2014BlairEngD.pdf
Source: University of Glasgow
【 Abstract 】

Field-Programmable Gate Arrays (FPGAs) and General Purpose Graphics Processing Units (GPUs) allow acceleration and real-time processing of computationally intensive computer vision algorithms. The decision to use either architecture in any application is determined by task-specific priorities such as processing latency, power consumption and algorithm accuracy. This choice is normally made at design time on a heuristic or fixed algorithmic basis; here we propose an alternative method for automatic runtime selection.

In this thesis, we describe our PC-based system architecture containing both platforms; this provides greater flexibility and allows dynamic selection of processing platforms to suit changing scene priorities. Using the Histograms of Oriented Gradients (HOG) algorithm for pedestrian detection, we comprehensively explore algorithm implementation on FPGA, GPU and a combination of both, and show that the effect of data transfer time on overall processing performance is significant. We also characterise the performance of each implementation and quantify tradeoffs between power, time and accuracy when moving processing between architectures, then specify the optimal architecture to use when prioritising each of these.

We apply this new knowledge to a real-time surveillance application representative of anomaly detection problems: detecting parked vehicles in videos. Using motion detection and car and pedestrian HOG detectors implemented across multiple architectures to generate detections, we use trajectory clustering and a Bayesian contextual motion algorithm to generate an overall scene anomaly level. This is in turn used to select the architectures on which to run the compute-intensive detectors for the next frame, with higher anomaly levels selecting faster, higher-power implementations.
Comparing dynamic context-driven prioritisation of system performance against a fixed mapping of algorithms to architectures shows that our dynamic mapping method is 10% more accurate at detecting events than the power-optimised version, at the cost of 12 W higher power consumption.
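
The runtime selection step described in the abstract can be illustrated with a minimal sketch. This is not the thesis's actual selection rule: the weighted-cost formula, the `Implementation` type, the candidate names and all power and latency figures below are hypothetical placeholders chosen only to show how a scene anomaly level in [0, 1] could shift the choice from a low-power implementation towards a faster, higher-power one.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Implementation:
    """One detector implementation (hypothetical type, not from the thesis)."""
    name: str
    power_w: float     # placeholder value, not a measured figure
    latency_ms: float  # placeholder value, not a measured figure


# Illustrative candidates only; the names and numbers are assumptions.
CANDIDATES = (
    Implementation("low_power", power_w=10.0, latency_ms=80.0),
    Implementation("balanced", power_w=20.0, latency_ms=50.0),
    Implementation("fast", power_w=30.0, latency_ms=30.0),
)


def select_implementation(anomaly_level, candidates=CANDIDATES):
    """Pick the implementation for the next frame.

    Assumed rule: minimise a weighted sum of normalised power and latency,
    where the weight on latency grows with the anomaly level.
    """
    if not 0.0 <= anomaly_level <= 1.0:
        raise ValueError("anomaly_level must lie in [0, 1]")
    max_power = max(c.power_w for c in candidates)
    max_latency = max(c.latency_ms for c in candidates)

    def cost(c):
        return ((1.0 - anomaly_level) * c.power_w / max_power
                + anomaly_level * c.latency_ms / max_latency)

    return min(candidates, key=cost)


if __name__ == "__main__":
    for level in (0.0, 0.5, 1.0):
        print(level, select_implementation(level).name)
```

With these placeholder figures, a quiet scene (level 0.0) selects the lowest-power candidate and a highly anomalous scene (level 1.0) selects the lowest-latency one; a real system would substitute measured characteristics for each implementation.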

【 Preview 】
Attachments
Files Size Format View
Real-time video scene analysis with heterogeneous processors 25728 KB PDF download