Multi-intention Inverse Q-learning for Interpretable Behavior Representation

stp2ySeptember 11, 20240 Comments

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning

[Submitted on 23 Nov 2023 (v1), last revised 10 Sep 2024 (this version, v4)]

View a PDF of the paper titled Multi-intention Inverse Q-learning for Interpretable Behavior Representation, by Hao Zhu and 6 other authors

View PDF
HTML (experimental)

Abstract:In advancing the understanding of natural decision-making processes, inverse reinforcement learning (IRL) methods have proven instrumental in reconstructing animal’s intentions underlying complex behaviors. Given the recent development of a continuous-time multi-intention IRL framework, there has been persistent inquiry into inferring discrete time-varying rewards with IRL. To address this challenge, we introduce the class of hierarchical inverse Q-learning (HIQL) algorithms. Through an unsupervised learning process, HIQL divides expert trajectories into multiple intention segments, and solves the IRL problem independently for each. Applying HIQL to simulated experiments and several real animal behavior datasets, our approach outperforms current benchmarks in behavior prediction and produces interpretable reward functions. Our results suggest that the intention transition dynamics underlying complex decision-making behavior is better modeled by a step function instead of a smoothly varying function. This advancement holds promise for neuroscience and cognitive science, contributing to a deeper understanding of decision-making and uncovering underlying brain mechanisms.

Submission history

From: Gabriel Kalweit [view email]
[v1]
Thu, 23 Nov 2023 09:27:08 UTC (737 KB)
[v2]
Fri, 2 Feb 2024 12:37:02 UTC (733 KB)
[v3]
Wed, 19 Jun 2024 07:55:34 UTC (2,946 KB)
[v4]
Tue, 10 Sep 2024 10:12:56 UTC (3,070 KB)

Source link
lol

By stp2y