TAP-Vid is a benchmark which contains both real-world videos with accurate human annotations of point tracks, and synthetic videos with perfect ground-truth point tracks. This is designed for a new task called tracking any point.
Jul15. Jul70717273 Filter: untagged Edit Leaderboard RankModelAverage JaccardAverage PCKOcclusion AccuracyPaperCodeResultYearTags 1 BootsTAPIR 72.4 83.1 91.2 BootsTAP: Bootstrapped Training for Tracking-Any-Point 2024 2 LocoTrack-B 70.8 83.2 84.1 Local All-Pair Correspondence for Point Tracking 2024...