Skip to content

Epsilon-greedy solution to the multiarmed-bandit problem, with recency bias

License

Notifications You must be signed in to change notification settings

zbroyar/multiarmed

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This is an OCaml library that provides an implementation of the epsilon-greedy solution to the multiarmed-bandit problem, using exponential moving averages for a faster adaptation to new trends.

Practical use: finding out which variant of a user interface is most often successful while maximizing the overall success rate as the system is learning.

See also:

Installation:

$ make
$ make install

Uninstallation:

$ make uninstall

About

Epsilon-greedy solution to the multiarmed-bandit problem, with recency bias

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • OCaml 100.0%