Author Archives: Josep Lumbreras
Multi-armed quantum bandits
The multi-armed bandit problem is a simple model of decision-making under uncertainty that belongs to the class of classical reinforcement learning problems. Given a set of arms, a learner interacts sequentially with these arms, sampling a reward at each round …