visual-llm+zero-shot+classification

2025-02-15 23:56:29

拼音 [ 拼音 ]

LLM2CLIP: Powerful Language Model Unlocks Richer Visual...

The rich supervision signals provided by natural language — the carrier of human knowledge — shape a powerful cross-modal representation space. As a result, CLIP supports a variety of tasks, including zero-shot classification, detection, segmentation, and cross-modal retrieval, significantly influenci...
...datasets, tuning techniques, in-context learning, visual...

Benchmarking zero-shot text classification: Datasets, evaluation and entailment approach, EMNLP 2019. Paper Language Models are Few-Shot Learners, NIPS 2020. Paper Does Synthetic Data Generation of LLMs Help Clinical Text Mining? Arxiv 2023 Paper Test data/user data Shortcut learning of large lang...
LLM2CLIP: Powerful Language Model Unlocks Richer Visual...

The rich supervision signals provided by natural language — the carrier of human knowledge — shape a powerful cross-modal representation space. As a result, CLIP supports a variety of tasks, including zero-shot classification, detection, segmentation, and cross-modal retrieval, significantly influenci...