ChatGPT is a powerful artificial intelligence (AI) language model that has demonstrated significant improvements in various natural language processing (NLP) tasks. However, like any technology, it presents potential security risks that need to be carefu
Since the introduction of the Transformer [Vaswani et al., 2017] architecture by Google in 2017, Language Models (LMs) are generally pre-trained with either discriminative or generative objectives. Discriminative pre-training uses a masked language model to predict the next sentence and features an ...
Large Language Model (LLM) LLM security LLM privacy ChatGPT LLM attacks LLM vulnerabilities 1. Introduction A large language model is the language model with massive parameters that undergoes pretraining tasks (e.g., masked language modeling and autoregressive prediction) to understand and process hum...
researchers are faced with a challenging question: how to determine which model is the best suited for a particular machine learning problem. A good method for selecting a model is to create benchmark tasks – typical problems that can
[27] Duan Y, Gong S. DIKWP-TRIZ method: an innovative problem-solving method that combines the DIKWP model and classic TRIZ. DOI: 10.13140/RG.2.2.12020.53120.https://www.researchgate.net/publication/375380084_DIKWP-TRIZfangfazongheDIKWPmoxinghejingdianTRIZdechuangxinwentijiejuefangfa. 2023. ...
Large language models (LLMs) have broad medical knowledge and can reason about medical information across many domains, holding promising potential for diverse medical applications in the near future. In this study, we demonstrate a concerning vulnerabil
[2023/10] Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond Liang Chen et al. arXiv. [paper] [code] This work proposes PCA-EVAL, which benchmarks embodied decision making via MLLM-based End-to-End method and LLM...
openaimachine-learning-courserallmfinetuning-llmslargelanguagemodelsfinetuning-large-language-models UpdatedJan 21, 2024 Star0 new method for discovering vulnerabilities that employs a variety of methodologies. This method combines self-attention with convolutional networks to record both local, position-spec...
Large Language Model Meta AI (Llama) is Meta's LLM which was first released in 2023. The Llama 3.1 models were released in July 2024, including both a 405 billion and 70 billion parameter model. The most recent version is Llama 3.2 which was released in September 2024, initially with smal...
Thus, it is promising to construct large language model-empowered agents (Wang et al.,2024b; Xi et al.,2023) due to their human-like intelligence in perceiving the environment and making decisions. In the following, we have a short summary of the motivations to apply large language models ...