Skip to content
/ aixi Public

Implementation of the Monte-Carlo CTW AIXI approximation as described by Joel Veness et al.

License

Notifications You must be signed in to change notification settings

jjoller/aixi

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Monte-Carlo AIXI approximation

Implementation of the Monte-Carlo AIXI approximation using CTW as described by Joel Veness et al.

Veness, Joel, et al. "A monte-carlo aixi approximation." Journal of Artificial Intelligence Research 40.1 (2011): 95-142.

Example code

Have a look at the RunAgentTest.java file for an example of how to set up and run the intelligent agent.

// set up the biased coin environment. The coin lands on one side with a probability of 0.7.
Environment env = new CoinFlipEnv(0.7);

// set up a world model
WorldModelSettings modelSettings = new WorldModelSettings();
modelSettings.setFacContextTree(true);
modelSettings.setDepth(3);
Worldmodel model = Worldmodel.getInstance("CoinFlipTraining", modelSettings);

AgentController controller = new AgentController(env, new ControllerSettings(), new UCTSettings(), model);


// Play 1000 rounds against the biased coin environment
// A smart agent should learn to always choose the biased side and should come close to an average reward of 0.7
controller.play(1000, false);

About

Implementation of the Monte-Carlo CTW AIXI approximation as described by Joel Veness et al.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages