Skip to content

benchun123/monocular_object_detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Monocular Object Detection - Sample-Score Method

Related Paper:

  • Benchun Zhou, Maximilian Gilles, Yongqi Meng. Structure SLAM with Points, Planes, and Objects[J]//Advanced Robotics 36.20 (2022): 1060-1075. [Link] [PDF] [Slide] [Youtube] [Bilibili]

  • If you use the code in your academic work, please cite the above paper.

Taking RGB images as input, the sample-score method employs a 2D object detector to get object classes and 2D bounding boxes. For each bounding box, we sample the dimension and rotation of the object to generate many cuboid candidates. These candidates are scored by 3D and 2D constraints, and the best candidate with the highest score is selected as the representation of the object.

1. Prerequisites

  • Ubuntu (18.04.5)
  • CMake (3.10.2)
  • Eigen (3)
  • OpenCV (3.2.0)
  • PCL (1.8.1)

2. Running

Clone the repository:

    git clone https://github.com/benchun123/monocular_object_detection.git

build the project:

    cd monocular_object_detection
    mkdir build
    cd build
    cmake ..
    make -j4

3. Thirdparty

Maybe you need to rebuild the library in the folder /Thirdparty, use the following instruction:

    cd Thirdparty/line_lbd
    mkdir build
    cd build
    cmake ..
    make -j4
    cd Thirdparty/tictoc_profiler
    mkdir build
    cd build
    cmake ..
    make -j4

4. Run it as follows:

    ./det_rgbd_opti_sun_node  ~/path/to/sunrgbd_dataset

A small dataset from SUN RGB-D dataset can be found here.

5. Acknowledgement

Thanks for the great work: Deep3DBox, Cube SLAM, and RoomLayout

  • Mousavian, A., Anguelov, D., Flynn, J., & Kosecka, J. 3D Bounding Box Estimation Using Deep Learning and Geometry[C]. In Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, 2017. [PDF], [Code]
  • Yang S, Scherer S. Cubeslam: Monocular 3-d object slam[J]. IEEE Transactions on Robotics, 2019, 35(4): 925-938. [PDF], [Code]
  • Gupta, A., Hebert, M., Kanade, T., & Blei, D. Estimating spatial layout of rooms using volumetric reasoning about objects and surfaces.[J]. Advances in neural information processing systems, 2010. [PDF],

About

Monocular 3D Object Detection

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published