Variational Quantum Circuits in Offline Contextual Bandit Problems

Lukas Schulte, Daniel Hein, Steffen Udluft, Thomas A. Runkler

公開日: 2025/9/9

Abstract

This paper explores the application of variational quantum circuits (VQCs) for solving offline contextual bandit problems in industrial optimization tasks. Using the Industrial Benchmark (IB) environment, we evaluate the performance of quantum regression models against classical models. Our findings demonstrate that quantum models can effectively fit complex reward functions, identify optimal configurations via particle swarm optimization (PSO), and generalize well in noisy and sparse datasets. These results provide a proof of concept for utilizing VQCs in offline contextual bandit problems and highlight their potential in industrial optimization tasks.

Variational Quantum Circuits in Offline Contextual Bandit Problems | SummarXiv | SummarXiv