| EURASIP Journal on Advances in Signal Processing | |
| CenterTransFuser: radar point cloud and visual information fusion for 3D object detection | |
| Research | |
| Kai Zeng [1], Tao Shen [1], Yan Li [1] | |
| [1] School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, China | |
| Keywords: Cross-transformer; Depth threshold filtering; 3D detection; Cross-modal fusion; Contextual interaction | |
| DOI : 10.1186/s13634-022-00944-6 | |
| Received 2022-02-22, accepted 2022-11-02, published 2022 | |
| Source: Springer | |
【 Abstract 】
Sensor fusion is an important component of the perception system in autonomous driving, and fusing radar point cloud information with camera visual information can improve the perception capability of autonomous vehicles. However, most existing studies ignore the extraction of local neighborhood information and consider only shallow fusion of the two modalities based on extracted global information, so they cannot achieve deep fusion through cross-modal contextual interaction. Moreover, in data preprocessing, noise in the radar data is usually filtered using only the depth information predicted from image features; such methods reduce the accuracy with which the radar branch generates regions of interest and cannot effectively filter out irrelevant radar points. This paper proposes the CenterTransFuser model, which makes full use of millimeter-wave radar point cloud information and visual information to enable cross-modal fusion of these two heterogeneous sources. Specifically, a new interaction module called the cross-transformer is explored, which cooperatively exploits cross-modal cross-multiple attention and joint cross-multiple attention to mine complementary information from radar and images. In addition, an adaptive depth threshold filtering method is designed to reduce the noise from modality-independent radar information projected onto the image. CenterTransFuser is evaluated on the challenging nuScenes dataset and achieves excellent performance. In particular, detection accuracy improves significantly for pedestrians, motorcycles, and bicycles, demonstrating the effectiveness of the proposed model.
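To make the idea of cross-modal attention concrete, the following is a minimal sketch of generic scaled dot-product cross-attention, in which image tokens act as queries over radar tokens. It is not the authors' cross-transformer: the function names, the toy feature vectors, and the omission of learned projection matrices and multiple heads are all assumptions made for illustration.

```python
import math

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax(row):
    """Numerically stable softmax over one list of scores."""
    peak = max(row)
    exps = [math.exp(v - peak) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def cross_attention(queries, keys, values):
    """Generic scaled dot-product attention (hypothetical helper).

    Queries come from one modality, keys and values from the other.
    Learned projections are omitted to keep the sketch short.
    """
    dim = len(queries[0])
    keys_t = [list(col) for col in zip(*keys)]
    scores = matmul(queries, keys_t)
    weights = [softmax([s / math.sqrt(dim) for s in row]) for row in scores]
    return matmul(weights, values), weights

# Toy features (assumed values): 3 image tokens and 2 radar tokens, 2 dims each.
image_tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
radar_tokens = [[1.0, 0.0], [0.0, 1.0]]

# Each image token gathers a weighted mix of radar features.
fused, attn = cross_attention(image_tokens, radar_tokens, radar_tokens)
```

Swapping the roles of the two token sets gives attention in the opposite direction, with radar tokens querying image tokens. How the paper combines the two directions is not specified in the abstract.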
【 License 】
CC BY
© The Author(s) 2023