Embedding-Informed Adaptive Retrieval-Augmented Generation of Large Language Models, by Chengkai Huang and 6 other authors
Abstract: Retrieval-augmented large language models (LLMs) have proven remarkably competent at various NLP tasks. However, previous works have observed that retrieval is not always helpful, especially when the LLM already possesses the knowledge needed to answer the query. Motivated by this, Adaptive Retrieval-Augmented Generation (ARAG) studies retrieving only when the knowledge required by the query is absent from the LLM. Prior ARAG approaches either require access to the pre-training corpus or rely on prompting with additional model inferences. To avoid these drawbacks, we propose to determine whether the model is knowledgeable about a query by inspecting the (contextualized) pre-trained token embeddings of LLMs. We hypothesize that such embeddings capture rich information about the model's intrinsic knowledge, enabling an efficient judgment of whether retrieval from an external corpus is necessary. Extensive experiments demonstrate our ARAG approach's superior performance across various benchmarks.
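The abstract does not spell out the scoring mechanism, but a minimal sketch of one plausible embedding-based retrieval gate might look as follows. The encoder choice, mean pooling, linear gate, and threshold below are all illustrative assumptions, not the paper's actual method:

```python
# Hedged sketch of an embedding-informed retrieval gate, inferred from the
# abstract only; the paper's scoring function, pooling, and threshold are
# not given here, so every concrete choice below is an assumption.
import torch
from transformers import AutoModel, AutoTokenizer

MODEL_NAME = "bert-base-uncased"  # placeholder encoder; the paper inspects LLM embeddings
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModel.from_pretrained(MODEL_NAME)

# Hypothetical linear gate mapping a pooled query embedding to a
# "retrieval needed" score; in practice such a gate would be trained on
# queries labeled by whether retrieval improved the LLM's answers.
gate = torch.nn.Linear(model.config.hidden_size, 1)

def should_retrieve(query: str, threshold: float = 0.5) -> bool:
    """Return True if the gate judges the model's intrinsic knowledge insufficient."""
    inputs = tokenizer(query, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state  # contextualized token embeddings
        pooled = hidden.mean(dim=1)                 # mean-pool over tokens (an assumption)
        score = torch.sigmoid(gate(pooled)).item()  # gate score in [0, 1]
    return score >= threshold

# Usage: query the external corpus only when the gate says the
# knowledge is likely absent from the model.
if should_retrieve("Who won the 2023 Nobel Prize in Physics?"):
    pass  # call the retriever, then generate with the retrieved context
```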
Submission history
From: Chengkai Huang
[v1] Thu, 4 Apr 2024 15:21:22 UTC (8,985 KB)
[v2] Fri, 13 Dec 2024 02:45:14 UTC (262 KB)