README (Apache-2.0 license). The Powerful Multi-modal LLM Family for OCR-free Document Understanding. Alibaba Group. 📢 News: [2023.07.10] The demo of mPLUG-DocOwl on ModelScope is available. ...
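Code: https://github.com/X-PLUG/mPLUG-DocOwl/tree/main/DocOwl2 Model architecture: Text summarization and compression have already been studied extensively in NLP. Considering that the main information in a document image is its layout and text, and that existing multimodal large models generally use a vision-to-text module to ...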
DocOwl1.5/evaluation/benchmarks_eval.py (3 changes: 0 additions & 3 deletions)
@@ -1,9 +1,6 @@
import jsonlines
import json
from icecream import ic
import sys
sys.path.append('/nas-alinlp/anwenhu/code/mPLUG_github/mPLUG-Do...
@@ -128,6 +128,7 @@ prepare python environments as [mPLUG-Owl2](https://github.com/X-PLUG/mPLUG-Owl/
from docowl_infer import DocOwlInfer
# e.g. model_path = './mPLUG/DocOwl1.5-Chat'
model_path = $YOUR_LOCAL_MODEL_PATH
...
```
0 comments on commit 457327e. © 2024 GitHub, Inc.
GitHub: https://github.com/X-PLUG/mPLUG-DocOwl Paper: https://arxiv.org/abs/2307.02499 Demo: https://modelscope.cn/studios/damo/mPLUG-DocOwl/summary Trying the ModelScope demo: open https://modelscope.cn/studios/damo/mPLUG-DocOwl/summary to try it right away.
* 🔥🔥🔥 [2024.9.06] We release the arXiv paper of [mPLUG-DocOwl 2](https://arxiv.org/abs/2409.03420), a SOTA 8B multimodal LLM for OCR-free multipage document understanding. Each document image is encoded with just 324 tokens!
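To make the 324-tokens-per-image figure concrete, here is a minimal arithmetic sketch of how the visual token count scales with page count. Only the 324 figure comes from the announcement above; the function names and the 8192-token context window and 512-token prompt used in the example are hypothetical values chosen for illustration, not properties of the model.

```python
# Per-image visual token count stated in the mPLUG-DocOwl 2 announcement.
TOKENS_PER_IMAGE = 324


def visual_token_budget(num_pages: int, tokens_per_image: int = TOKENS_PER_IMAGE) -> int:
    """Total visual tokens for a document made of `num_pages` page images."""
    if num_pages < 0:
        raise ValueError("num_pages must be non-negative")
    return num_pages * tokens_per_image


def max_pages(context_tokens: int, text_tokens: int = 0,
              tokens_per_image: int = TOKENS_PER_IMAGE) -> int:
    """Number of page images that fit once `text_tokens` are reserved for text.

    `context_tokens` is a hypothetical budget supplied by the caller.
    """
    available = max(context_tokens - text_tokens, 0)
    return available // tokens_per_image


if __name__ == "__main__":
    # A 10-page document costs 10 * 324 visual tokens.
    print(visual_token_budget(10))
    # Hypothetical 8192-token window with 512 tokens reserved for the prompt.
    print(max_pages(8192, text_tokens=512))
```

Under these assumed numbers, a 10-page document needs 3240 visual tokens, and 23 pages fit in the remaining 7680 tokens.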
limitations under the License. 0 comments on commit 2975250.
sys.path.append('/nas-alinlp/anwenhu/code/mPLUG_github/mPLUG-DocOwl2/evaluation')
print(sys.path)
import re
from evaluator import doc_evaluate
import os
from tqdm import tqdm
import random
from pathlib import Path

def parser_line(line):
    image = line['image'][0]
    assert len(line['messag...
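The script above appends a machine-specific absolute path to `sys.path`, which will not exist on other checkouts. One possible alternative, sketched here under the assumption that the caller knows where the `evaluation` directory lives, is a small helper that resolves a directory and registers it once. The helper name `add_to_sys_path` is hypothetical and is not part of the repository.

```python
import sys
from pathlib import Path


def add_to_sys_path(directory) -> str:
    """Resolve `directory` to an absolute path and append it to sys.path once.

    Hypothetical helper for illustration; returns the resolved path string.
    """
    resolved = str(Path(directory).expanduser().resolve())
    # Skip the append if the path is already registered, so repeated calls
    # do not grow sys.path.
    if resolved not in sys.path:
        sys.path.append(resolved)
    return resolved
```

Inside a script, a call such as `add_to_sys_path(Path(__file__).parent)` would register the script's own directory, so that a sibling module like `evaluator` can be imported without a hard-coded path.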
Repository files: .gitignore (commit github, Jul 7, 2023); LICENSE (Initial commit, Jul 4, 2023); README.md (update readme: dDocOwl2 is now supported by ms-swift, Dec 24, 2024). ...