Section 2 gives an overview of the recent literature on deep generative models, image encoders, and diagram-based tasks and datasets. Section 3 describes Paper2Fig100k, a novel dataset of research figures and
plate recognition based on YOLOv8, Easy‑OCR, and CNN Open Access Amany Sarhan1*, Rowyda Abdel‑Rahem2, Bassel Darwish2, Arwa Abou‑Attia2, Ahmed Sneed2, Shahd Hatem2, Awatef Badran2 and Mohamed Ramadan2 *Correspondence: amany_sarhan@f-eng....
CDLA: Chinese document layout analysis dataset for Chinese literature (paper) scenarios, including 10 categories: Text, Title, Figure, Figure caption, Table, Table caption, Header, Footer, Reference, Equation
DocBank: Large-scale dataset (500K document pages) constructed using weakly supervised methods...