Trending LLM Projects TinyZero- Clean, minimal, accessible reproduction of DeepSeek R1-Zero open-r1- Fully open reproduction of DeepSeek-R1 DeepSeek-R1- First-generation reasoning models from DeepSeek. Qwen2.5-Max- Exploring the Intelligence of Large-scale MoE Model. ...
Trending LLM Projects TinyZero- Clean, minimal, accessible reproduction of DeepSeek R1-Zero open-r1- Fully open reproduction of DeepSeek-R1 DeepSeek-R1- First-generation reasoning models from DeepSeek. Qwen2.5-Max- Exploring the Intelligence of Large-scale MoE Model. ...
27 Nov, Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens, https://arxiv.org/abs/2411.17691 27 Nov, Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration, https://arxiv.org/abs/2411.17686 29 Nov...
QLoRA: Efficient Finetuning of Quantized LLMs:4-bit is all you need* (*Plus double quantization and paged optimizers) DPR: Dense Passage Retrieval for Open-Domain Question Answering:Dense embeddings are all you need* (*Also, high precision retrieval) ...
4357 FEDAQT: ACCURATE QUANTIZED TRAINING WITH FEDERATED LEARNING 2827 FEDERATED CINN CLUSTERING FOR ACCURATE CLUSTERED FEDERATED LEARNING 2915 Federated Dataset Dictionary Learning for Multi-Source Domain Adaptation 8131 Federated Learning of Tensor Generalized Linear Models with Low Separation Rank 1592 Federat...
(98%)Elad Sofer; Tomer Shaked; Caroline Chaux; Nir Shlezinger Graph of Attacks: Improved Black-Box and Interpretable Jailbreaks for LLMs. (92%)Mohammad Akbar-Tajari; Mohammad Taher Pilehvar; Mohammad Mahmoody Test It Before You Trust It: Applying Software Testing for Trustworthy In-context ...
ML-SpecQD: Multi-Level Speculative Decodingwith Quantized DraftsEvangelos Georganas, Dhiraj Kalamkar, Alexander Kozlov, Alexander HeineckeIntel CorporationAbstract—Speculative decoding (SD) has emerged as a methodto accelerate LLM inference without sacrif i cing any accuracyover the 16-bit model inferen...
We also show you how to copy the ONNX quantized INT4 model to the project and add the C++ API to generate text. This is a preliminary exploration of deploying generative AI on mobile devices, but it provides a good starting point for further development. kinfeyMay 06, ...
Model Spec | Open AI | Reading,New The Rise and Rise of A.I. LLMs & their associated bots like ChatGPT | Visualization Opening up ChatGPT: tracking openness of instruction-tuned LLMs Generative AI exists because of the transformer | Visualization ...
A list of awesome papers and resources of recommender system on large language model (LLM). - WLiK/LLM4Rec-Awesome-Papers