Abstract: Recent real-time detection transformers (DETRs) have gained popularity due to their simplicity and efficiency. However, these detectors do not explicitly model object rotation, especially in ...
We introduce TASTE-Rob: 1) a dataset with 100,856 task-oriented hand-object interaction videos, and 2) a three-stage pose-refinement video generation pipeline. With the above contributions, TASTE-Rob is ...
Abstract: Object detection is a foundational process in computer vision, with widespread applications in autonomous driving, medical diagnostics, and security monitoring. Recent advancements and ...