Utilizing Large Language Models in an iterative paradigm with Domain feedback for Zero-shot Molecule optimization

stp2yNovember 7, 20240 Comments

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning

[Submitted on 17 Oct 2024 (v1), last revised 6 Nov 2024 (this version, v5)]

View a PDF of the paper titled Utilizing Large Language Models in an iterative paradigm with Domain feedback for Zero-shot Molecule optimization, by Khiem Le and 1 other authors

View PDF
HTML (experimental)

Abstract:Molecule optimization is a critical task in drug discovery to optimize desired properties of a given molecule through chemical modification. Despite Large Language Models (LLMs) holding the potential to efficiently simulate this task by using natural language to direct the optimization, straightforwardly utilizing shows limited performance. In this work, we facilitate utilizing LLMs in an iterative paradigm by proposing a simple yet highly effective domain feedback provider, namely $text{Re}^3$DF. In detail, $text{Re}^3$DF harnesses an external toolkit, RDKit, to handle the molecule hallucination, if the modified molecule is chemically invalid. Otherwise, its desired properties are computed and compared to the original one, establishing reliable domain feedback with correct direction and distance towards the objective, followed by a retrieved example, to explicitly guide the LLM to refine the modified molecule. We conduct experiments across both single- and multi-property objectives with 2 thresholds, where $text{Re}^3$DF shows significant improvements. Particularly, for 20 single-property objectives, $text{Re}^3$DF enhances Hit ratio by 16.95% and 20.76% under loose and strict thresholds, respectively. For 32 multi-property objectives, $text{Re}^3$DF enhances Hit ratio by 6.04% and 5.25%.

Submission history

From: Khiem Le [view email]
[v1]
Thu, 17 Oct 2024 02:04:57 UTC (138 KB)
[v2]
Mon, 21 Oct 2024 00:38:07 UTC (140 KB)
[v3]
Sat, 26 Oct 2024 01:31:08 UTC (141 KB)
[v4]
Wed, 30 Oct 2024 14:54:25 UTC (141 KB)
[v5]
Wed, 6 Nov 2024 05:18:04 UTC (141 KB)

Source link
lol

By stp2y