Knowledge in Triples for LLMs: Enhancing Table QA Accuracy with Semantic Extraction, by Hossein Sholehrasa and 3 other authors
Abstract: Integrating structured knowledge from tabular formats poses significant challenges in natural language processing (NLP), particularly for complex, semi-structured tables such as those in the FeTaQA dataset. Interpreting these tables and generating accurate, meaningful responses requires advanced methods. Traditional approaches, such as SQL and SPARQL, often fail to fully capture the semantics of such data, especially when table structures are irregular, as in web tables. This paper addresses these challenges by proposing a novel approach that extracts triples directly from tabular data and integrates them with a retrieval-augmented generation (RAG) model to enhance the accuracy, coherence, and contextual richness of responses generated by a fine-tuned GPT-3.5-turbo-0125 model. Our approach significantly outperforms existing baselines on the FeTaQA dataset, particularly on the Sacre-BLEU and ROUGE metrics. It generates contextually accurate and detailed long-form answers from tables, demonstrating its strength in complex data interpretation.
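The abstract does not spell out how triples are extracted or retrieved, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that each row has a key column serving as the subject, that column headers act as predicates, and that cell values are objects. The names `table_to_triples` and `retrieve`, the token-overlap scoring, and the sample table are all hypothetical and chosen for illustration.

```python
from typing import List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


def table_to_triples(header: List[str], rows: List[List[str]], key_col: int = 0) -> List[Triple]:
    """Turn each non-empty cell into a (row key, column header, cell value) triple.

    Assumption: the cell in `key_col` identifies the row and serves as the subject.
    """
    triples: List[Triple] = []
    for row in rows:
        subject = row[key_col]
        for col, (predicate, value) in enumerate(zip(header, row)):
            # Skip the key column itself and empty cells.
            if col == key_col or value == "":
                continue
            triples.append((subject, predicate, value))
    return triples


def retrieve(question: str, triples: List[Triple], k: int = 3) -> List[Triple]:
    """Rank triples by naive token overlap with the question (a stand-in for a real retriever)."""
    q_tokens = set(question.lower().split())

    def overlap(triple: Triple) -> int:
        return len(q_tokens & set(" ".join(triple).lower().split()))

    return sorted(triples, key=overlap, reverse=True)[:k]


# Hypothetical sample table, not taken from FeTaQA.
header = ["Player", "Team", "Goals"]
rows = [["Ada", "Reds", "12"], ["Bo", "Blues", ""]]

triples = table_to_triples(header, rows)
top = retrieve("Which team does Bo play for?", triples, k=1)
```

In a retrieval-augmented setup, the retrieved triples would be placed in the prompt alongside the question before calling the language model; a production retriever would more plausibly use embeddings than token overlap.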
Submission history
From: Hossein Sholehrasa [view email]
[v1]
Sat, 21 Sep 2024 16:46:15 UTC (1,137 KB)
[v2]
Tue, 29 Oct 2024 21:10:59 UTC (1 KB) (withdrawn)