Hugging Face | meta-llama/Llama-2-7b, tiiuae/falcon-180B, bigscience/bloom, gpt2, baichuan-inc/Baichuan2-13B-Chat, THUDM/chatglm2-6b | Hugging Face
Claude | claude-2.0, claude-2.1, claude-instant-1.2, claude-3-sonnet-20240229, claude-3-opus-20240229, claude-3-haiku-20240307 | Claude ...
@@ -164,7 +164,7 @@ Compared to ChatGLM's [P-Tuning](https://github.com/THUDM/ChatGLM2-6B/tree/main/ | [OLMo](https://huggingface.co/allenai) | 1B/7B | q_proj,v_proj | - | | [PaliGemma](https://huggingface.co/google) | 3B | q_proj,v_proj | gemma | | [Phi-1.5...
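1. **Stronger performance**: Building on the development experience of the first-generation ChatGLM model, we fully upgraded the base model of ChatGLM2-6B. ChatGLM2-6B uses the hybrid objective function of [GLM](https://github.com/THUDM/GLM) and has undergone pre-training on 1.4T Chinese and English tokens followed by human preference alignment training. The [evaluation results](#评测结果) show that, compared with the first-generation model, ChatGLM2-6B improves on MMLU (+23%), CEv...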
{ + "cell_type": "markdown", + "id": "bfd6bdfb-c8cc-4fb6-8969-153a97dfb576", + "metadata": {}, + "source": [ + "In this tutorial, we will see how we can leverage Mistral 7B to train a classifier. We need:\n", + "- The mistral codebase: `https://github.com/...
treehollow/thuhole-index -
Clojure
tonsky/FiraCode - Free monospaced font with programming ligatures
logseq/logseq - A privacy-first, open-source platform for knowledge management and collaboration. Download link: http://github.com/logseq/logseq/releases. Roadmap: http://trello.com/b/8txSM12G/road...
python3 -m torch.distributed.launch --nproc_per_node 4 \
    --nnodes=1 --node_rank=0 --master_addr=xxx --master_port=yyy \
    uniform_finetune.py --model_type chatglm --model_name_or_path THUDM/chatglm-6b \
    --data alpaca-belle-cot --lora_target_modules query_key_value \
    --lora_...
4 0.841 0.979 0.960 -1.0 -1.0 0.841 0.995671 0.996 0.060269 0.165117 25.0 5 0.847 0.978 0.960 -1.0 -1.0 0.847 0.995671 0.996 0.073501 0.173967 30.0 6 0.848 0.979 0.960 -1.0 -1.0 0.848 0.995671 0.996 0.067540 0.170206 35.0 7 0.846 0.979 0.960 -1.0 -1.0 0.846 0.995671 0.996 0.057506 0.158793 40...
ALLStarRepo
All the repos I have starred are shown here.
Num | Name | Author | Description | Stars | Last update
1 | klee | klee | KLEE Symbolic Execution Engine | 2401 | 2024-01-25
2 | CVPR2023-Pap...
4 5 0 3 Allen, Mr. William Henry male 35.0 0 0 373450 8.0500 NaN S\n", + "" + ], + "text/plain": [ + " PassengerId Survived Pclass \\\n", + "0 1 0 3 \n", + "1 2 1 1 \n", + "2 3 1 3 \n", + "3 4 1 1 \n", + "4 5 0 3 \n", + "\n", +...
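GitHubDaily + +### Rank 17: /rasbt/LLMs-from-scratch +- Description: Implement a ChatGPT-like LLM from scratch, step by step +- URL: https://github.com/rasbt/LLMs-from-scratch +- Author: rasbt + +### Rank 18: /zed-industries/zed +- Description: Code at the speed of thought. Zed, from the creators of Atom and Tree-sitter, is a high-performance, multiplayer code...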