Knowledge in Triples for LLMs: Enhancing Table QA Accuracy with Semantic Extraction

This paper has been withdrawn by Hossein Sholehrasa

[Submitted on 21 Sep 2024 (v1), last revised 29 Oct 2024 (this version, v2)]

View a PDF of the paper titled Knowledge in Triples for LLMs: Enhancing Table QA Accuracy with Semantic Extraction, by Hossein Sholehrasa and 3 other authors

No PDF available, click to view other formats

Abstract:Integrating structured knowledge from tabular formats poses significant challenges within natural language processing (NLP), mainly when dealing with complex, semi-structured tables like those found in the FeTaQA dataset. These tables require advanced methods to interpret and generate meaningful responses accurately. Traditional approaches, such as SQL and SPARQL, often fail to fully capture the semantics of such data, especially in the presence of irregular table structures like web tables. This paper addresses these challenges by proposing a novel approach that extracts triples straightforward from tabular data and integrates it with a retrieval-augmented generation (RAG) model to enhance the accuracy, coherence, and contextual richness of responses generated by a fine-tuned GPT-3.5-turbo-0125 model. Our approach significantly outperforms existing baselines on the FeTaQA dataset, particularly excelling in Sacre-BLEU and ROUGE metrics. It effectively generates contextually accurate and detailed long-form answers from tables, showcasing its strength in complex data interpretation.