Skip to content

Markov Decision Process and Temporal Difference algorithms

Notifications You must be signed in to change notification settings

florianvazelle/unity-rl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Shenjun LIN, Antoine THENEVIN, Florian VAZELLE

Note cours

Facteur de dévaluation gamma

gamma = 0 -> récompense immédiate (myope) gamma = 1 -> récompense futur 0 < gamma < 1