RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions

stp2yJanuary 16, 20250 Comments

AmazUtah_NLP at SemEval-2024 Task 9: A MultiChoice Question Answering System for Commonsense Defying Reasoning

[Submitted on 11 Dec 2023 (v1), last revised 15 Jan 2025 (this version, v4)]

View a PDF of the paper titled RoME: A Robust Mixed-Effects Bandit Algorithm for Optimizing Mobile Health Interventions, by Easton K. Huch and 5 other authors

View PDF
HTML (experimental)

Abstract:Mobile health leverages personalized and contextually tailored interventions optimized through bandit and reinforcement learning algorithms. In practice, however, challenges such as participant heterogeneity, nonstationarity, and nonlinear relationships hinder algorithm performance. We propose RoME, a Robust Mixed-Effects contextual bandit algorithm that simultaneously addresses these challenges via (1) modeling the differential reward with user- and time-specific random effects, (2) network cohesion penalties, and (3) debiased machine learning for flexible estimation of baseline rewards. We establish a high-probability regret bound that depends solely on the dimension of the differential-reward model, enabling us to achieve robust regret bounds even when the baseline reward is highly complex. We demonstrate the superior performance of the RoME algorithm in a simulation and two off-policy evaluation studies.

Submission history

From: Easton Huch [view email]
[v1]
Mon, 11 Dec 2023 14:24:24 UTC (878 KB)
[v2]
Fri, 15 Dec 2023 01:29:21 UTC (853 KB)
[v3]
Fri, 7 Jun 2024 01:18:16 UTC (2,348 KB)
[v4]
Wed, 15 Jan 2025 15:21:46 UTC (1,353 KB)

Source link
lol

By stp2y