Google Scholar

J Mao, S Shi, X Wang, H Li�- International Journal of Computer Vision, 2023 - Springer

Autonomous driving, in recent years, has been receiving increasing attention for its potential
to relieve drivers' burdens and improve the safety of driving. In modern autonomous driving�…

Save Cite Cited by 94 Related articles All 8 versions

[PDF] ieee.org

Delving into the devils of bird's-eye-view perception: A review, evaluation and recipe

H Li, C Sima, J Dai, W Wang, L Lu…�- …�on Pattern Analysis�…, 2023 - ieeexplore.ieee.org

Learning powerful representations in bird's-eye-view (BEV) for perception tasks is trending
and drawing extensive attention both from industry and academia. Conventional�…

Save Cite Cited by 83 Related articles All 9 versions

[PDF] arxiv.org

Bytetrack: Multi-object tracking by associating every detection box

Y Zhang, P Sun, Y Jiang, D Yu, F Weng, Z Yuan…�- European conference on�…, 2022 - Springer

Multi-object tracking (MOT) aims at estimating bounding boxes and identities of objects in
videos. Most methods obtain identities by associating detection boxes whose scores are�…

Save Cite Cited by 1183 Related articles All 12 versions

[PDF] arxiv.org

Bevfusion: Multi-task multi-sensor fusion with unified bird's-eye view representation

Z Liu, H Tang, A Amini, X Yang, H Mao…�- …�on robotics and�…, 2023 - ieeexplore.ieee.org

Multi-sensor fusion is essential for an accurate and reliable autonomous driving system.
Recent approaches are based on point-level fusion: augmenting the LiDAR point cloud with�…

Save Cite Cited by 581 Related articles All 4 versions

[PDF] ieee.org

Multimodal learning with transformers: A survey

P Xu, X Zhu, DA Clifton�- IEEE Transactions on Pattern Analysis�…, 2023 - ieeexplore.ieee.org

Transformer is a promising neural network learner, and has achieved great success in
various machine learning tasks. Thanks to the recent prevalence of multimodal applications�…

Save Cite Cited by 370 Related articles All 9 versions

[PDF] neurips.cc

Fully convolutional one-stage 3d object detection on lidar range images

Z Tian, X Chu, X Wang, X Wei…�- Advances in Neural�…, 2022 - proceedings.neurips.cc

We present a simple yet effective fully convolutional one-stage 3D object detector for LiDAR
point clouds of autonomous driving scenes, termed FCOS-LiDAR. Unlike the dominant�…

Save Cite Cited by 5916 Related articles All 16 versions View as HTML

[PDF] thecvf.com

Voxelnext: Fully sparse voxelnet for 3d object detection and tracking

Y Chen, J Liu, X Zhang, X Qi…�- Proceedings of the IEEE�…, 2023 - openaccess.thecvf.com

Abstract 3D object detectors usually rely on hand-crafted proxies, eg, anchors or centers,
and translate well-studied 2D frameworks to 3D. Thus, sparse voxel features need to be�…

Save Cite Cited by 142 Related articles All 6 versions View as HTML

[PDF] neurips.cc

Bevfusion: A simple and robust lidar-camera fusion framework

T Liang, H Xie, K Yu, Z Xia, Z Lin…�- Advances in�…, 2022 - proceedings.neurips.cc

Fusing the camera and LiDAR information has become a de-facto standard for 3D object
detection tasks. Current methods rely on point clouds from the LiDAR sensor as queries to�…

Save Cite Cited by 241 Related articles All 7 versions View as HTML

[PDF] thecvf.com

Bevformer v2: Adapting modern image backbones to bird's-eye-view recognition via perspective supervision

C Yang, Y Chen, H Tian, C Tao, X Zhu…�- Proceedings of the�…, 2023 - openaccess.thecvf.com

We present a novel bird's-eye-view (BEV) detector with perspective supervision, which
converges faster and better suits modern image backbones. Existing state-of-the-art BEV�…

Save Cite Cited by 159 Related articles All 9 versions View as HTML

[PDF] mdpi.com

A survey of visual transformers

Y Liu, Y Zhang, Y Wang, F Hou, J Yuan…�- …�on Neural Networks�…, 2023 - ieeexplore.ieee.org

Transformer, an attention-based encoder–decoder model, has already revolutionized the
field of natural language processing (NLP). Inspired by such significant achievements, some�…

Save Cite Cited by 307 Related articles All 22 versions

Create alert

Cite

Advanced search

Saved to My library

Transfusion: Robust lidar-camera fusion for 3d object detection with transformers

3D object detection for autonomous driving: A comprehensive survey