Abstract: Recent real-time detection transformers (DETRs) have gained popularity due to their simplicity and efficiency. However, these detectors do not explicitly model object rotation, especially in ...
We introduce TASTE-Rob: 1) a dataset with 100,856 task-oriented hand-object interaction videos, 2) a three-stage pose-refinement video generation pipeline. With the above contributions, TASTE-Rob is ...
Abstract: Object detection is a foundation process in computer vision having widespread applications in autonomous driving, medical diagnostics and security monitoring. Recent advancements and ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results