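Multi-Armed Bandits § Group: 5 Relations § Reinforcement Learning: The multi-armed bandit is a simplified reinforcement learning problem, typically with a single state and no state transitions, used to study the exploration-exploitation tradeoff: whether to pull the arm with the best reward estimate so far or to try other arms to improve those estimates.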
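
The exploration vs exploitation tradeoff can be illustrated with epsilon-greedy, one common bandit strategy among several. The following is a minimal sketch and not a reference implementation. It assumes Bernoulli arms (reward 1 or 0), and the function name `epsilon_greedy` and the parameters `true_means`, `steps`, `epsilon`, and `seed` are hypothetical names chosen for illustration.

```python
import random


def epsilon_greedy(true_means, steps=5000, epsilon=0.1, seed=0):
    """Run epsilon-greedy on Bernoulli arms.

    true_means: assumed success probability of each arm (hidden from the agent).
    Returns (estimates, counts): the estimated mean reward and the number of
    pulls for each arm.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # how often each arm was pulled
    estimates = [0.0] * n_arms   # running average reward per arm

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate
            # (ties go to the lowest index).
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Sample a Bernoulli reward from the chosen arm.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0

        # Incremental update of the running average for that arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return estimates, counts


if __name__ == "__main__":
    estimates, counts = epsilon_greedy([0.2, 0.5, 0.8])
    print("estimates:", [round(e, 3) for e in estimates])
    print("pull counts:", counts)
```

Setting `epsilon=0.0` gives pure exploitation, which can lock onto the first arm it tries and never discover a better one. Setting `epsilon=1.0` gives pure exploration, which learns good estimates but never uses them. Values in between trade one against the other.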