|
ReVLA: Reverting Visual Domain Limitation of Robotic Foundation Models
Sombit Dey, Jan-Nico Zaech, Nikolay Nikolov, Danda Pani Paudel, Luc Van Gool
Accepted at ICRA 2025
arxiv / website
We study the visual generalization capabilities of three existing robotic foundation models, and propose a corresponding evaluation framework.
|
|
Fine-Grained Spatial and Verbal Losses for 3D Visual Grounding
Sombit Dey, Ozan Unal, Christos Sakaridis, Luc Van Gool
WACV 2025
arxiv / code / website
We introduce two novel losses for 3D visual grounding: a visual-level offset loss on regressed vector offsets from each instance to the ground-truth referred instance, and a language-related span loss on predictions for the word-level span of the referred instance in the description.
|
|
Learning whom to trust in navigation: dynamically switching between classical and neural planning
Sombit Dey, Assem Sadek, Gianluca Monaci, Boris Chidlovskii, Christian Wolf
IROS 2023
arxiv / code
We learn when to trust each planner during navigation, dynamically switching between a classical planner and a neural planner.
|
|