liutongxuan/vllmPublic forked fromvllm-project/vllm NotificationsYou must be signed in to change notification settings Fork0 Star0 Apache-2.0 license starsforks NotificationsYou must be signed in to change notification settings Code Pull requests ...
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding - liutongxuan/DeepSeek-VL2
A high-throughput and memory-efficient inference and serving engine for LLMs - vllm/requirements-dev.txt at main · liutongxuan/vllm
liutongxuan/vllmPublic forked fromvllm-project/vllm NotificationsYou must be signed in to change notification settings Fork0 Star0 Code Pull requests Actions Projects Security Insights Additional navigation options Files main .buildkite .github
A high-throughput and memory-efficient inference and serving engine for LLMs - vllm/requirements-build.txt at main · liutongxuan/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs - vllm/requirements-common.txt at main · liutongxuan/vllm
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production. - liutongxuan/
Merged guocuimi merged 1 commit into vectorch-ai:main from liutongxuan:features/fixbug Jul 2, 2024 Merged bugfix: fix invalid max_cache_size when device is cpu. #259 guocuimi merged 1 commit into vectorch-ai:main from liutongxuan:features/fixbug Jul 2, 2024 Conversation...
OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version. - GitHub - liutongxuan/OpenBLAS: OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version.
A high-throughput and memory-efficient inference and serving engine for LLMs - vllm/requirements-rocm.txt at main · liutongxuan/vllm