Intelligent Systems
Note: This research group has relocated.

Physics-Based Rigid Body Object Tracking and Friction Filtering From RGB-D Videos

2024

Conference Paper

ev


Physics-based understanding of object interactions from sensory observations is an essential capability in augmented reality and robotics. It enables to capture the properties of a scene for simulation and control. In this paper, we propose a novel approach for real-to-sim which tracks rigid objects in 3D from RGB-D images and infers physical properties of the objects. We use a differentiable physics simulation as state-transition model in an Extended Kalman Filter which can model contact and friction for arbitrary mesh-based shapes and in this way estimate physically plausible trajectories. We demonstrate that our approach can filter position, orientation, velocities, and concurrently can estimate the coefficient of friction of the objects. We analyze our approach on various sliding scenarios in synthetic image sequences of single objects and colliding objects. We also demonstrate and evaluate our approach on a real-world dataset. We make our novel benchmark datasets publicly available to foster future research in this novel problem setting and comparison with our method.

Author(s): Rama Krishna Kandukuri and Michael Strecke and Joerg Stueckler
Book Title: Proceedings of the International Conference on 3D Vision (3DV)
Year: 2024

Department(s): Embodied Vision
Research Project(s): Physics-based Scene Understanding
Object-Level Scene Understanding
Bibtex Type: Conference Paper (inproceedings)
Paper Type: Conference

DOI: 10.1109/3DV62453.2024.00111

State: Published
URL: https://doi.org/10.1109/3DV62453.2024.00111

Links: preprint
supplemental video
dataset
Video:

BibTex

@inproceedings{kandukuri2023_ekfphys,
  title = {Physics-Based Rigid Body Object Tracking and Friction Filtering From RGB-D Videos},
  author = {Kandukuri, Rama Krishna and Strecke, Michael and Stueckler, Joerg},
  booktitle = {Proceedings of the International Conference on 3D Vision (3DV)},
  year = {2024},
  doi = {10.1109/3DV62453.2024.00111},
  url = {https://doi.org/10.1109/3DV62453.2024.00111}
}