This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model. - Shenzhi-Wang/Llama3-Chinese-Chat
This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model. - Llama3-Chinese-Chat/README.md at main · Shenzhi-Wang/Llama3-Chinese-Chat
shenzhi-wang/Llama3-70B-Chinese-Chat · Hugging Face 自动总结: - Llama3-70B-Chinese-Chat是一个针对中文和英文用户的LLM模型,具有多种能力。 - Llama3-70B-Chinese-Chat在中文表现上超过了ChatGPT,与GPT-4相媲美。 - Llama3-70B-Chinese-Chat的训练使用了ORPO算法和大量中英文数据集。 - Llama3-70B-...
"model.layers.3.self_attn.k_proj.weight": "model-00002-of-00030.safetensors", "model.layers.3.self_attn.o_proj.weight": "model-00002-of-00030.safetensors", "model.layers.3.self_attn.q_proj.weight": "model-00002-of-00030.safetensors", "model.layers.3.self_attn.v_proj.weigh...
你好作者,我在使用djl环境下运行了Llama3-8B-Chinese-Chat-q4_0-v2_1.gguf模型,推理的结果部分单词会发生乱码, 我通过debug代码发现一个UTF8的字节数组拆成两个字节数组,正常'奋'的UFT8字节编码为[-27, -91, -117]。
Shenzhi WangShenzhi-Wang Block or Report PinnedLoading Llama3-Chinese-ChatLlama3-Chinese-ChatPublic This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model. 30717
--output_dir /nfsdata/gpu007/dingfei/commondata/huggingface/Meta-Llama-3-8B-Instruct-saves/shenzhi-wang/Llama3-8B-Chinese-Chat-v1-test1 --per_device_train_batch_size 2 --per_device_eval_batch_size 2 --gradient_accumulation_steps 4 --lr_scheduler_type cosine --log_level info ...
This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model. - 有gradio那种网页版的部署方式吗? · Issue #4 · Shenzhi-Wang/Llama3-Chinese-Chat
This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model. - context_length是微调成2k的了吗 · Issue #3 · Shenzhi-Wang/Llama3-Chinese-Chat
This is the first Chinese chat model specifically fine-tuned for Chinese through ORPO based on the Meta-Llama-3-8B-Instruct model. - History for README.md - Shenzhi-Wang/Llama3-Chinese-Chat