In recent years, advancements in machine learning techniques, and specifically, deep learning methods, have started to create a great impact in the world. With the advent of deep neural network, we are able to achieve unprecedented results in previously unsolvable computer vision tasks. Face recognition, one of the critical computer vision tasks, also sees breakthrough in terms of accuracy. This thesis presents an accelerated and optimized end-to-end face recognition pipeline. Such a pipeline consists of three stages: face detection, alignment, and face recognition/verification. Algorithms for these jobs are extremely computation intensive and thus real-time application was not attainable. In order to bring about the goal of high definition real-time multi-face recognition, we leverage different types of hardware to accelerate detection and recognition stages, which are the most time-consuming stages of the recognition pipeline. To achieve this goal, we leverage an embedded Graphic Processing Unit (GPU) platform as the front end, to perform video capture and face detection. For the back end, we employ a powerful Field-Programming Gate Arrays (FPGA) equipped server, which runs a state-of-the-art deep neural network to recognize faces streamed from the front end with low latency. With the two acceleration schemes targeting GPUs and FPGAs, respectively, we are able to achieve real-time performance for the overall task, and such face recognition system can be widely adopted for various applications.
【 预 览 】
附件列表
Files
Size
Format
View
Acceleration of real-time face recognition pipeline on heterogeneous hardware platforms