6 Jan 2021 | Tianwei Yin, Xingyi Zhou, Philipp Krähenbühl
The paper "CenterPoint: Center-based 3D Object Detection and Tracking" by Tianwei Yin proposes a novel framework for 3D object detection and tracking using point clouds. The main challenge in 3D object detection is the representation of objects, which are often represented as axis-aligned bounding boxes in 2D but pose significant difficulties in 3D due to the lack of orientation information and the complexity of rotated objects. To address these challenges, CenterPoint represents objects as points, simplifying the detection and tracking process.
**Key Contributions:**
1. **CenterPoint Framework:** CenterPoint uses a keypoint detector to find the centers of objects and regress other attributes such as 3D size, orientation, and velocity.
2. **Two-Stage Refinement:** A second stage refines the estimates using additional point features, improving the accuracy of object localization.
3. **Efficiency and Effectiveness:** The method is simple, efficient, and achieves state-of-the-art performance on the nuScenes and Waymo Open datasets.
**Methods:**
- **Center Heatmap Head:** Predicts a heatmap peak at the center of detected objects, using a focal loss to handle sparse supervision.
- **Regression Heads:** predicts sub-voxel location refinement, height-above-ground, 3D size, and orientation.
- **Velocity Head and Tracking:** Predicts velocity estimates to track objects over time using greedy closest-point matching.
**Experiments:**
- **Waymo Open Dataset:** Achieves 71.8 mAP and 66.4 mAPH for vehicle and pedestrian detection, respectively.
- **nuScenes Dataset:** Outperforms previous methods with a single model, achieving 65.5 NDS and 63.8 AMOTA for 3D detection and tracking.
**Ablation Studies:**
- **Center-based vs Anchor-based:** CenterPoint outperforms anchor-based methods, especially for rotated and large objects.
- **One-stage vs Two-stage:** The two-stage refinement module improves accuracy with minimal overhead.
- **Feature Components:** Bird-eye view features are sufficient and more efficient than voxel features.
**Conclusion:**
CenterPoint is a robust and efficient framework for 3D object detection and tracking, demonstrating superior performance on both the Waymo and nuScenes datasets.The paper "CenterPoint: Center-based 3D Object Detection and Tracking" by Tianwei Yin proposes a novel framework for 3D object detection and tracking using point clouds. The main challenge in 3D object detection is the representation of objects, which are often represented as axis-aligned bounding boxes in 2D but pose significant difficulties in 3D due to the lack of orientation information and the complexity of rotated objects. To address these challenges, CenterPoint represents objects as points, simplifying the detection and tracking process.
**Key Contributions:**
1. **CenterPoint Framework:** CenterPoint uses a keypoint detector to find the centers of objects and regress other attributes such as 3D size, orientation, and velocity.
2. **Two-Stage Refinement:** A second stage refines the estimates using additional point features, improving the accuracy of object localization.
3. **Efficiency and Effectiveness:** The method is simple, efficient, and achieves state-of-the-art performance on the nuScenes and Waymo Open datasets.
**Methods:**
- **Center Heatmap Head:** Predicts a heatmap peak at the center of detected objects, using a focal loss to handle sparse supervision.
- **Regression Heads:** predicts sub-voxel location refinement, height-above-ground, 3D size, and orientation.
- **Velocity Head and Tracking:** Predicts velocity estimates to track objects over time using greedy closest-point matching.
**Experiments:**
- **Waymo Open Dataset:** Achieves 71.8 mAP and 66.4 mAPH for vehicle and pedestrian detection, respectively.
- **nuScenes Dataset:** Outperforms previous methods with a single model, achieving 65.5 NDS and 63.8 AMOTA for 3D detection and tracking.
**Ablation Studies:**
- **Center-based vs Anchor-based:** CenterPoint outperforms anchor-based methods, especially for rotated and large objects.
- **One-stage vs Two-stage:** The two-stage refinement module improves accuracy with minimal overhead.
- **Feature Components:** Bird-eye view features are sufficient and more efficient than voxel features.
**Conclusion:**
CenterPoint is a robust and efficient framework for 3D object detection and tracking, demonstrating superior performance on both the Waymo and nuScenes datasets.