grid_world

Classic grid world

'b' - robots 'h' - humans 'p' - pellets (aisle/ food/ reward points) 'f' - human standing over a pellet

The robot is running MinMax Human takes random actions

The robot overestimates human's adversarial behaviour. Hence is afraid to eat the last pellet

Need to change this to ExpectiMax to see the difference

Evaluation function : 0.1 * min_human_dist - 0.3 * min_pellet_dist + 0.05 * min_obstacle_dist + 100 * state.pellets_eaten + 500 * state.win_status - 5000 * state.lose_status;

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
alpha_beta.py		alpha_beta.py
grid_world.py		grid_world.py
run_agent.py		run_agent.py
util.py		util.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

grid_world

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

grid_world

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages