Ground truth was built by a panoramic camera placed on a tripod in the center of the scene, and a large moving rigid target board marked with an AprilTag (https://april.eecs.umich.edu/software/apriltag). A new video dataset for object referring, with 30,000 objects over 5000 stereo ...