demo configs images README.md image_demo.py video_demo.py ext figs omg_llava seg tools .gitattributes .gitignore DATASET.md EMB.md INSTALL.md LICENSE OMG_Seg_README.md README.mdBreadcrumbs OMG-Seg /demo / video_demo.py Latest commit...
Demo Scripts Run the visualization scripts on COCO ./tools/dist.sh test seg/configs/m2ov_val/eval_m2_convl_300q_ov_coco.py 1 --checkpoint model_path --show-dir vis Run the visualization scripts on VIPSeg ./tools/dist.sh test seg/configs/m2ov_val/eval_m2_convl_300q_ov_vipseg.py ...
The first open-sourced codebase for multiple multimodal understanding tasks, including training, inference and demo. Key Features of OMG-Seg $\color{#2F6EBA}{Universal\ Image, Video, Open-Vocabulary, Segmentation\ Model}$ Anew unifiedsolution forover ten different segmentation tasks: PS, IS, VSS...
We present OMG-LLaVA, a new and elegant framework combining powerful pixel-level vision understanding with reasoning abilities. 0 comments on commit c655c80 Please sign in to comment. Footer © 2024 GitHub, Inc. Footer navigation Terms...