GitHub - jayxsinha/RL-Project: Implementation of REINFORCE with Baseline and Monte-Carlo Tree Search algorithms along with Multi-Armed Bandits.

We present implementations of REINFORCE with Baseline and Monte-Carlo Tree Search algorithms on three MDPs: Cartpole, CS687-Gridworld and Mountain Car. For extra-credits, we have implemented a yet unexplored MDP: Mountain Car and we present different algorithms: Epsilon Greedy, Epsilon Decreasing Greedy, Upper Confidence Bound (UCB) and Thompson sampling performance analysis on multi-armed bandits.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
mcts_figs		mcts_figs
multi_armed_bandits		multi_armed_bandits
reinforce_with_baseline		reinforce_with_baseline
.gitignore		.gitignore
CS687-Gridworld.png		CS687-Gridworld.png
CS687FinalReport.pdf		CS687FinalReport.pdf
README.md		README.md
cartpole.py		cartpole.py
cs687_gridworld.py		cs687_gridworld.py
mab_algorithms.py		mab_algorithms.py
mcts.py		mcts.py
mcts_main.py		mcts_main.py
mountain_car.py		mountain_car.py
multi_armed_bandits.py		multi_armed_bandits.py
reinforce_baseline.py		reinforce_baseline.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages