Embedding-Informed Adaptive Retrieval-Augmented Generation of Large Language Models

stp2yDecember 16, 20240 Comments

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning

[Submitted on 4 Apr 2024 (v1), last revised 13 Dec 2024 (this version, v2)]

View a PDF of the paper titled Embedding-Informed Adaptive Retrieval-Augmented Generation of Large Language Models, by Chengkai Huang and 6 other authors

Abstract:Retrieval-augmented large language models (LLMs) have been remarkably competent in various NLP tasks. However, it was observed by previous works that retrieval is not always helpful, especially when the LLM is already knowledgeable on the query to answer. Motivated by this, Adaptive Retrieval-Augmented Generation (ARAG) studies retrieving only when the knowledge asked by the query is absent in the LLM. Previous works of ARAG either require accessing the pre-training corpus or prompting with additional model inferences. Aiming to avoid such drawbacks, we propose to determine whether the model is knowledgeable on a query via inspecting the (contextualized) pre-trained token embeddings of LLMs. We hypothesize that such embeddings capture rich information on the model’s intrinsic knowledge base, which enables an efficient way of judging the necessity to retrieve from an external corpus. Extensive experiments demonstrate our ARAG approach’s superior performance across various benchmarks.

Submission history

From: Chengkai Huang [view email]
[v1]
Thu, 4 Apr 2024 15:21:22 UTC (8,985 KB)
[v2]
Fri, 13 Dec 2024 02:45:14 UTC (262 KB)

Source link
lol

By stp2y