Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request
Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request
Academic Journal

A minimax and asymptotically optimal algorithm for stochastic bandits

Subjects: Stochastic multi-armed bandits; regret analysis; upper confidence bound (UCB)

  • Source: Algorithmic Learning Theory ; https://hal.science/hal-01475078 ; Algorithmic Learning Theory, 2017, 2017 Algorithmic Learning Theory Conference 76

تفاصيل العنوان

×
  • 1-1 of  1 نتائج ل ""Ménard, Pierre""