InfiniTensor development is based on the pull request on Github. Before requesting for merging, a PR should satisfy the following requirements Pass all tests. Now CI on Github will test everything that can be tested in the ci environment, including code format. So, script test/script/clang_...
Hi, welcome to InfiniTensor! The following repos are deep learning inference frameworks developed by our organization: Any new ideas to share? Just crate a discussion to tell us! Popular repositories Loading InfiniTensor Public C++ 229 57 InfiniLM Public Rust 103 29 RefactorGraph Public ...
InfiniTUI 支持自定义键绑定。您可以通过更新 config 配置文件中的[key_bindings]部分来修改一些默认键绑定。这是一个包含默认键绑定的示例 [key_bindings] show_help = '?' show_history = 'h' new_chat = 'n' save_chat = 's' ℹ️ 注意 为避免与vim键绑定重叠,您需要使用ctrl + 键,除了帮助?外...
Last commit date Latest commit YdrMaster fix(llama-cuda): 支持根据空闲内存计算可能的 kv cache 容量 Feb 8, 2025 e73b0fb·Feb 8, 2025 History 527 Commits .github/workflows feat: 初步实现 llama-nv 单体 Oct 15, 2024 common perf(llama): 优化分布式切分和参数加载 ...
another tokenizer crate. Contribute to InfiniTensor/tokeneer development by creating an account on GitHub.
Github 讨论|常见问题 开营🎉 2024 冬季大模型与人工智能系统训练营开营仪式已于 2025 年 1 月 5 日晚 20:00 举行,可查看回放, 密码 L8RF。 人工智能系统专业知识直播课程已于 1 月 6 日下午正式开始,配套算力资源也已陆续上线。专业知识课程将提供配套交流群供学员交流讨论。为降低管理压力,要求学员完成系...
本项目使用safetensors模型格式,初始代码只支持单个文件的模型。 本项目自带两个微型的语言模型,分别用于文本生成和AI对话(模型来自于Hugginface上的raincandy-u/TinyStories-656K和Felladrin/Minueza-32M-UltraChat)。对话模型比较大,需要到github页面的release里下载。
九格大模型-适配前端. Contribute to InfiniTensor/jiuge-front development by creating an account on GitHub.
.cargo publish: 准备发布 gguf-utils Nov 25, 2024 .github/workflows build: upgrade Rust to 2024 edition Feb 25, 2025 ggml-quants build: upgrade Rust to 2024 edition Feb 25, 2025 ggus build: upgrade Rust to 2024 edition Feb 25, 2025 ...
昇腾加速卡驱动(rust binding). Contribute to InfiniTensor/ascendcl development by creating an account on GitHub.