In this work we propose a novel dataset which contains transcribed ADs, which are temporally aligned to full length HD movies. In addition we also collected the aligned movie scripts which have been used in prior work and compare the two different sources of descriptions. In total the MPII ...
MSR-VTT: A Large Video Description Dataset for Bridging Video and Language Jun Xu , Tao Mei , Ting Yao and Yong Rui Microsoft Research, Beijing, China {v-junfu, tmei, tiyao, yongrui}@microsoft.com Abstract While there has been increasing interest in the task of describing video with ...
To the best of our knowledge, MovieNet is the largest dataset with richest annotations for comprehensive movie understanding. Based on MovieNet, we set up several benchmarks for movie understanding from different angles. Extensive experiments are executed on these benchmarks to show the immeasurable...
A holistic dataset for movie understanding. 1.1K Movies, 60K trailers. Non-commercial can only be used for research and educational purposes. Commercial use is prohibited.Image ETH-XGaze 2020 ETH-XGaze, consisting of over one million high-resolution images of varying gaze under extreme head...
Very recently the authors of MovieNet, the massive project described in [18], totally annotated 47K shots from movies and trailers, each with one tag of view scale and one tag of camera movement. Although remarkable for its variety, the size of such dataset is still very limited if ...
Direct URL to data: https://github.com/airtlab/A-Dataset-for-Automatic-Violence-Detection-in-Videos Value of the Data 1. Data Description The pervasiveness of video surveillance cameras and the need of watching footages and making decisions in a very short time [1] boosted the interest of ...
A JSON based anime dataset containing the most important meta data as well as cross references to various anime sites such as MAL, ANIDB, ANILIST, KITSU and more... - manami-project/anime-offline-database
The resulting movie dataset is ~130MB (about half of which is Postgres indices) and should fit in most free-tier Postgres instances. Some fields likebudgetandrevenueuse strings instead of numbers, because they can overflow Postgres integers. ...
separates, or filters data items.Ameasureis an attribute that addresses the question of interest and that the analyst expects to vary across the dimensions. Both the measures and the dimensions might be attributes directly found in the dataset or derived attributes calculated from the existing data...
TALC: Time-Aligned Captions for Multi-Scene Text-to-Video Generation Team: UCLA, Google. Hritik Bansal, Yonatan Bitton, Michal Yarom, et al., Kai-Wei Chang arXiv, 2024.05 [Paper], [PDF], [Code], [Dataset], [Pretrained Model], [Home Page] ...