Markov Decision Processes (MDPs) are a fundamental mathematical abstraction for modelling sequential decision making under uncertainty, and serve as the standard discrete-time model in stochastic control and reinforcement learning (RL). Central to the practical application of MDPs is planning: computing an optimal policy that maps each state of the MDP to the action to be taken in that state. The goal is to find a policy that maximises the utility of traversing the MDP. The utility, or reward model, can be discounted, undiscounted, finite-horizon, etc., chosen according to the application at hand.
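As a minimal illustration of planning under a discounted reward model, the sketch below runs value iteration on a small tabular MDP. The transition matrix P, reward matrix R, and discount factor are hypothetical placeholders chosen for this example, not taken from the report.

```python
import numpy as np

n_states, n_actions = 3, 2
gamma = 0.9  # discount factor (hypothetical)

# P[a][s, s'] = probability of moving from state s to s' under action a
P = np.array([
    [[0.8, 0.2, 0.0], [0.0, 0.8, 0.2], [0.1, 0.0, 0.9]],  # action 0
    [[0.2, 0.8, 0.0], [0.0, 0.2, 0.8], [0.9, 0.0, 0.1]],  # action 1
])
# R[s, a] = expected immediate reward for taking action a in state s
R = np.array([[0.0, 1.0], [0.0, 0.0], [1.0, 0.0]])

V = np.zeros(n_states)
for _ in range(1000):
    # Bellman optimality backup: Q(s,a) = R(s,a) + gamma * sum_s' P(s'|s,a) V(s')
    Q = R + gamma * np.einsum('ast,t->sa', P, V)
    V_new = Q.max(axis=1)
    if np.max(np.abs(V_new - V)) < 1e-8:
        break
    V = V_new

policy = Q.argmax(axis=1)  # greedy (optimal) action at each state
print("V* =", V, "policy =", policy)
```

Value iteration repeatedly applies the Bellman optimality backup until the value function stops changing; the policy that acts greedily with respect to the converged values is optimal for the discounted objective.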
In the particular MDPs we study in this report, the simulators used to replicate the behaviour of the underlying problem are highly accurate, and hence very expensive to run. Consequently, the time required to solve these MDPs is dominated by the number of calls to the simulator. A good MDP planning algorithm in such domains should minimise the number of simulator calls, yet terminate with a policy that is approximately optimal with high probability. This requirement is referred to as being PAC-RL (Probably Approximately Correct for RL).
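For concreteness, one common way to formalise such a guarantee is sketched below; the precise definition of PAC-RL used later in the report may differ. Given an accuracy parameter $\epsilon > 0$ and a failure probability $\delta \in (0, 1)$, the algorithm must return a policy $\hat{\pi}$ satisfying
\[
\Pr\left[ V^{\hat{\pi}}(s) \ge V^{*}(s) - \epsilon \ \text{ for all states } s \right] \ge 1 - \delta,
\]
where $V^{\hat{\pi}}$ and $V^{*}$ denote the values of $\hat{\pi}$ and of an optimal policy, respectively, while keeping the number of simulator calls as small as possible (ideally polynomial in the problem parameters, $1/\epsilon$, and $\log(1/\delta)$).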