GitHub is where people build software. More than 150 million people use GitHub to discover, fork, and contribute to over 420 million projects.
bin/api/run_docker_eval.sh generate \ --model-name lfm-3b-jp \ --model-url https://inference-1.liquid.ai/v1 \ --model-api-key <API-KEY> bin/api/run_docker_eval.sh judge \ --model-name lfm-3b-jp \ --openai-api-key <OPENAI-API-KEY> Run Evaluation without Docker (click to...
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. - Archon/archon/mt_bench/README.md at main · ScalingIntelligence/Archon
Low contributions. Medium-low contributions. Medium-high contributions. High contributions. More 2024 Contribution activity December 2024 mtbench101 has no activity yet for this period. Loading Show more activity Seeing something unexpected? Take a look at the GitHub profile guide. Footer...
xingyuanbu temporarily deployed to prod June 3, 2024 05:31 — with GitHub Actions Inactive Collaborator bittersweet1999 commented Jun 3, 2024 Please add a introduction for your dataset, and remember to release the DataSet bittersweet1999 approved these changes Jun 3, 2024 View reviewed changes...
OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets. - MT-Bench-101 (#1215) · triple-Mu/opencompass@02a0a4e
git clone https://github.com/lm-sys/FastChat.gitcdFastChat If you are running on Mac: brew install rust cmake Install Package pip3 install --upgrade pip#enable PEP 660 supportpip3 install -e".[model_worker,webui]" Vicunais based on Llama 2 and should be used under Llama'smodel lice...
No description provided.MichaelClifford closed this as completed Nov 6, 2024 Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment Assignees No one assigned Labels None yet Projects None yet Milestone No milestone Development No branches or pull...
Load diff This file was deleted. 0 comments on commit a5d4ac2 Please sign in to comment. Footer © 2024 GitHub, Inc. Footer navigation Terms Privacy Security Status Docs Contact Manage cookies Do not share my personal information ...
Project of llm evaluation to Japanese tasks. Contribute to Stability-AI/llm-leaderboard development by creating an account on GitHub.