My programs during CS747 (Foundations of Intelligent and Learning Agents) Autumn 2021-22
-
Updated
Apr 17, 2022 - Python
My programs during CS747 (Foundations of Intelligent and Learning Agents) Autumn 2021-22
Inventory Control with Lateral Transshipment Using Proximal Policy Optimization, DOCS2023
Applied MDP with Value Iteration to optimally choose path for an agent in a Stochastic Environment, in order to maximize its rewards
example for a presentation about RL.
Using Tabular RL, Value Iteration to train a tic-tac-toe agent
Value Iteration (Exact RL method) implmeneted in basic python
A mouse finds the cheese with the help of reinforcement learning (value iteration).
this repository contains my codes for fundamentals of AI course projects
Simple program to solve Markov Decision Processes using policy iteration and value iteration.
Please don't feed a gamblers addiction
A CANDECOMP-PARAFAC tensor decomposition method to solve a Markov Decision Process (MDP) gridworld problem.
This assignment is based on the concept of the Bellman equation on the basis of the value iteration algorithm for solving MDPs.
This repository has the code I wrote for Markovian Pacman
Solving Taxi-v3 problem of python Gym library.
MDPs for Frozen Lake (Open AI Gym) environment
Using Deep Reinforcement Learning and Search for the Rubik's cube
Implementation of path planning algorithms.
implementations of basic RL algorithms
🤖 Implementation and short explanation of basic RL algorithms, reproducing the simulations from Andrej Kaparthy's REINFORCEjs library.
Computing optimal MDP policy using Value Iteration Algorithm and Linear Programming
Add a description, image, and links to the value-iteration topic page so that developers can more easily learn about it.
To associate your repository with the value-iteration topic, visit your repo's landing page and select "manage topics."