3D visual perception tasks, including 3D detection and map segmentation based on multi-camera images, are essential for autonomous driving systems. In this work, we present a new framework termed BEVFormer, which learns unified BEV representations with spatiotemporal transformers to support multiple autonomous driving perception tasks. In a nutshell, BEVFormer exploits both spatial and temporal information by interacting with spatial and temporal space through predefined grid-shaped BEV queries. To aggregate spatial information, we design spatial cross-attention, in which each BEV query extracts spatial features from regions of interest across camera views. For temporal information, we propose temporal self-attention to recurrently fuse history BEV information. Our approach achieves a new state-of-the-art of 56.9% NDS on the nuScenes `test` set, 9.0 points higher than the previous best art and on par with the performance of LiDAR-based baselines.
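As a rough illustration of how these pieces fit together, below is a minimal PyTorch sketch of one encoder layer operating on grid-shaped BEV queries. It is not the released implementation: standard multi-head attention stands in for the deformable attention used in the paper, and the class name, grid size, and dimensions are assumptions chosen for readability.

```python
# Minimal sketch (not the official implementation) of one BEVFormer encoder
# layer: grid-shaped BEV queries attend to the previous frame's BEV features
# (temporal self-attention) and then to multi-camera image features (spatial
# cross-attention). Standard multi-head attention stands in for the deformable
# attention used in the paper; all names and sizes are illustrative.
import torch
import torch.nn as nn


class BEVFormerLayerSketch(nn.Module):
    def __init__(self, embed_dim=256, num_heads=8, bev_h=200, bev_w=200):
        super().__init__()
        self.bev_h, self.bev_w = bev_h, bev_w
        # Learnable grid-shaped BEV queries, one per BEV cell.
        self.bev_queries = nn.Parameter(torch.randn(bev_h * bev_w, embed_dim))
        self.temporal_attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.spatial_attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)
        self.ffn = nn.Sequential(
            nn.Linear(embed_dim, embed_dim * 4), nn.ReLU(), nn.Linear(embed_dim * 4, embed_dim)
        )
        self.norm1 = nn.LayerNorm(embed_dim)
        self.norm2 = nn.LayerNorm(embed_dim)
        self.norm3 = nn.LayerNorm(embed_dim)

    def forward(self, img_feats, prev_bev=None):
        # img_feats: (B, N_cam * H * W, C) flattened multi-camera features.
        b = img_feats.size(0)
        bev = self.bev_queries.unsqueeze(0).expand(b, -1, -1)
        # Temporal self-attention: fuse history BEV (fall back to self-attention
        # over the current queries when no history is available).
        history = prev_bev if prev_bev is not None else bev
        bev = self.norm1(bev + self.temporal_attn(bev, history, history)[0])
        # Spatial cross-attention: each BEV query aggregates image features.
        bev = self.norm2(bev + self.spatial_attn(bev, img_feats, img_feats)[0])
        return self.norm3(bev + self.ffn(bev))
```

In the paper, several such layers are stacked, and the history BEV is first aligned to the current frame according to ego motion before temporal self-attention fuses it.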
Features
- Cutting-edge Baseline for Camera-based Detection
- In this work, the authors present a new framework termed BEVFormer
- BEVFormer exploits both spatial and temporal information
- The proposed approach achieves a new state-of-the-art of 56.9% NDS on the nuScenes test set
- To aggregate spatial information, the authors design spatial cross-attention, in which each BEV query extracts spatial features from regions of interest across camera views (see the projection sketch after this list)
- On par with the performance of LiDAR-based baselines
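Since the feature list references the spatial cross-attention design, a small sketch of the underlying geometry may help: each BEV cell is lifted to a few 3D reference points and projected into every camera, and only cameras whose projection lands inside the image contribute features for that query. Everything below (function name, point-cloud range, tensor shapes) is a hypothetical illustration, not the repository's API.

```python
# Illustrative sketch of the geometry behind spatial cross-attention: each BEV
# grid cell is lifted to a few 3D points (a "pillar") and projected into every
# camera with its lidar-to-image matrix, so a query only samples features from
# views where the projection is valid. Names and ranges are assumptions.
import torch


def project_bev_points(lidar2img, bev_h=200, bev_w=200, num_z=4,
                       pc_range=(-51.2, -51.2, -5.0, 51.2, 51.2, 3.0)):
    """Return pixel coords (N_cam, bev_h*bev_w*num_z, 2) and a validity mask."""
    xs = torch.linspace(pc_range[0], pc_range[3], bev_w)
    ys = torch.linspace(pc_range[1], pc_range[4], bev_h)
    zs = torch.linspace(pc_range[2], pc_range[5], num_z)
    grid = torch.stack(torch.meshgrid(xs, ys, zs, indexing="ij"), dim=-1)  # (W, H, Z, 3)
    pts = grid.reshape(-1, 3)
    pts_h = torch.cat([pts, torch.ones(pts.size(0), 1)], dim=-1)  # homogeneous coords
    # lidar2img: (N_cam, 4, 4) projection matrices, one per camera.
    cam = torch.einsum("nij,pj->npi", lidar2img, pts_h)
    depth = cam[..., 2:3].clamp(min=1e-5)
    uv = cam[..., :2] / depth        # pixel coordinates in each camera
    valid = cam[..., 2] > 0          # point lies in front of the camera
    return uv, valid
```

Restricting each BEV query to the camera views it actually projects into is what keeps the cross-attention sparse and affordable for large BEV grids.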