qilin-med-vl

2025-03-25 21:46:58

拼音 [ 拼音 ]

Qilin-Med-VL: Towards Chinese Large Vision-Language Model for...

In response, this study introduces Qilin-Med-VL, the first Chinese large vision-language model designed to integrate the analysis of textual and visual data. Qilin-Med-VL combines a pre-trained Vision Transformer (ViT) with a foundational LLM. It undergoes a thorough two-stage curriculum ...