DATA_LICENSE Dockerfile LICENSE README.md alpaca_data.json alpaca_data_cleaned_archive.json alpaca_data_gpt4.json docker-compose.yml export_hf_checkpoint.py export_state_dict_checkpoint.py finetune.py generate.p
alpaca_legacy.json alpaca_short.json vigogne.json utils .dockerignore .gitignore DATA_LICENSE Dockerfile LICENSE README.md alpaca_data.json alpaca_data_cleaned_archive.json alpaca_data_gpt4.json docker-compose.yml export_hf_checkpoint.py export_state_dict_checkpoint.py finetune.py generate.py inf...
3 files changed (README.md, alpaca_data_cleaned_archive.json, alpaca_data_gpt4.json), +260013 -0 lines changed. README.md +1, @@ -2,6 +2,7 @@: - 🤗 **Try the pretrained model out [here](https://huggingface.co/...
The primary goal of this project is to provide a cleaned and curated version of the Alpaca dataset, so that natural language processing models trained on it perform better. Removing errors and inconsistencies is intended to improve the performance of the fine-tuned LLaMA model...
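As a rough illustration of what "removing errors and inconsistencies" could involve, here is a minimal sketch that flags suspicious records. It assumes the usual Alpaca record layout (a JSON list of objects with `instruction`, `input`, and `output` string fields). The function names `load_records` and `find_problem_records` and the specific checks are hypothetical, chosen for illustration. They are not the project's actual cleaning procedure.

```python
import json


def load_records(path):
    """Load a JSON list of records, e.g. from alpaca_data.json."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def find_problem_records(records):
    """Return indices of records that look malformed or duplicated.

    Hypothetical checks only: empty instruction, empty output, and
    exact duplicates of an earlier (instruction, input) pair.
    """
    problems = {"empty_instruction": [], "empty_output": [], "duplicate": []}
    seen = set()
    for i, rec in enumerate(records):
        instruction = rec.get("instruction", "").strip()
        output = rec.get("output", "").strip()
        if not instruction:
            problems["empty_instruction"].append(i)
        if not output:
            problems["empty_output"].append(i)
        key = (instruction, rec.get("input", "").strip())
        if key in seen:
            problems["duplicate"].append(i)
        seen.add(key)
    return problems


# Small in-memory sample so the sketch runs without the real dataset file.
sample = [
    {"instruction": "Add 2 and 3.", "input": "", "output": "5"},
    {"instruction": "Add 2 and 3.", "input": "", "output": "5"},
    {"instruction": "Name a color.", "input": "", "output": ""},
]
report = find_problem_records(sample)
print(report)
```

To try it on a real file, one could call `find_problem_records(load_records("alpaca_data.json"))`, assuming the file is in the working directory. Flagged indices are candidates for manual review rather than automatic deletion.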
alpaca_data.json: initial commit (Mar 14, 2023)
alpaca_data_cleaned_archive.json: Add LLaMA-GPT4 dataset (Apr 7, 2023)
alpaca_data_gpt4.json: Add LLaMA-GPT4 dataset (Apr 7, 2023)
docker-compose.yml: Added Dockerfile and docker-compose.yml (#207) ...
alpaca_data_cleaned.json: about 52K English instruction-following training samples.
CoT_data.json: 9 CoT datasets comprising about 75K samples (published by FLAN [7]).
belle_data_cn.json: about 0.5M Chinese instruction-following training samples (published by BELLE [8]).
Ablation of CoT and ...
alpaca-lora / alpaca_data_gpt4.json: latest commit by tloen, Add LLaMA-...
DATA_LICENSE Dockerfile LICENSE README.md alpaca_data.json alpaca_data_cleaned_archive.json alpaca_data_gpt4.json docker-compose.yml export_hf_checkpoint.py export_state_dict_checkpoint.py finetune.py generate.py inference.py lengths.ipynb pyproject.toml requirements.txt