The RVL-CDIP dataset consists of scanned document images belonging to 16 classes such as letter, form, email, resume, memo, etc. The dataset has 320,000 training, 40,000 validation and 40,000 test images. The images are characterized by low quality, nois
We find that models trained on the\nsmaller Tobacco-3482 dataset perform poorly on our new out-of-distribution\ndata, while text classification models trained on the larger RVL-CDIP exhibit\nsmaller performance drops.doi:10.48550/arXiv.2108.02684Stefan Larson...
RVL-CDIP(瑞尔森视觉实验室复杂文档信息处理)数据集由 16 类 400,000 张灰度图像组成,每类 25,000 张图像。有 320,000 张训练图像、40,000 张验证图像和 40,000 张测试图像。图像的大小使其最大尺寸不超过 1000 像素。 - 飞桨AI Studio
in Beyond Document Page Classification: Design, Datasets, and Challenges RVL-CDIP_MP is our first contribution to retrieve the original documents of the IIT-CDIP test collection which were used to create RVL-CDIP. Some PDFs or encoded images were corrupt, which explains that we have around ...