18020287 - Nguyễn Tiến Đạt

Reinforcement-Pacman

Question 1

Value function: for each state, compute the total score of every available action, then take the largest of these as the state's value.
computeQValueFromValues: similar to the function above, but it only computes the value of a single (state, action) pair and does not take the max.
computeActionFromValues: returns the action with the best score.
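The three functions above can be sketched on a toy problem. This is a minimal illustration, not the code from valueIterationAgents.py: the `TRANSITIONS` table, the state names, and the snake_case function names are hypothetical stand-ins for the project's MDP interface.

```python
# Hypothetical toy MDP (not part of the project):
# TRANSITIONS[state][action] = [(next_state, probability, reward), ...]
TRANSITIONS = {
    "A": {"stay": [("A", 1.0, 0.0)],
          "go": [("B", 0.8, 0.0), ("A", 0.2, 0.0)]},
    "B": {"exit": [("END", 1.0, 10.0)]},
    "END": {},  # terminal state: no actions
}


def compute_q_value(values, state, action, discount):
    """Expected score of one (state, action) pair; no max is taken."""
    return sum(prob * (reward + discount * values[nxt])
               for nxt, prob, reward in TRANSITIONS[state][action])


def compute_action(values, state, discount):
    """Action with the best Q-value, or None in a terminal state."""
    actions = TRANSITIONS[state]
    if not actions:
        return None
    return max(actions, key=lambda a: compute_q_value(values, state, a, discount))


def value_iteration(discount=0.9, iterations=100):
    """Batch value iteration: each sweep reads only the previous sweep's values."""
    values = {s: 0.0 for s in TRANSITIONS}
    for _ in range(iterations):
        new_values = {}
        for s in TRANSITIONS:
            qs = [compute_q_value(values, s, a, discount) for a in TRANSITIONS[s]]
            new_values[s] = max(qs) if qs else 0.0
        values = new_values
    return values
```

The state value is the max over Q-values, while `compute_q_value` stays a plain expectation, which mirrors the split between the value function and computeQValueFromValues described above.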

Question 2

We choose answerDiscount = 0.9 and answerNoise = 0: only with noise = 0 can the agent's moves be determined easily, and the agent always finishes its run whenever a new agent is started.

Question 3

Question 4

The approach is the same as in Question 1, except for the update function.
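Since the update function is the part that differs from Question 1, here is a minimal sketch of a standard Q-learning update. The class `TinyQLearner`, its method names, and the explicit `next_legal_actions` argument are assumptions made for a self-contained example; they are not the signatures used in qlearningAgents.py.

```python
from collections import defaultdict


class TinyQLearner:
    """Hypothetical stand-alone learner illustrating the Q-learning update."""

    def __init__(self, alpha=0.5, discount=0.9):
        self.alpha = alpha          # learning rate
        self.discount = discount    # discount factor
        self.q = defaultdict(float) # (state, action) -> Q-value, 0.0 if unseen

    def get_q_value(self, state, action):
        return self.q[(state, action)]

    def compute_value_from_q_values(self, state, legal_actions):
        """Max Q-value over legal actions; 0.0 when there are none (terminal)."""
        if not legal_actions:
            return 0.0
        return max(self.get_q_value(state, a) for a in legal_actions)

    def update(self, state, action, next_state, reward, next_legal_actions):
        """Blend the old estimate with one observed sample of the return."""
        sample = reward + self.discount * self.compute_value_from_q_values(
            next_state, next_legal_actions)
        old = self.get_q_value(state, action)
        self.q[(state, action)] = (1 - self.alpha) * old + self.alpha * sample
```

Unlike value iteration, this update never reads transition probabilities; it learns only from the observed (state, action, next_state, reward) samples.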

Question 5

A random action is chosen only when the check against self.epsilon succeeds (that is, with probability self.epsilon); otherwise the action returned by computeActionFromQValues is used.
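The selection rule above is the usual epsilon-greedy scheme. A minimal sketch follows, assuming a hypothetical helper `epsilon_greedy` that receives the greedy action as an argument; `random.random()` stands in for whatever coin-flip helper the project itself uses.

```python
import random


def epsilon_greedy(legal_actions, greedy_action, epsilon, rng=random):
    """Return a random legal action with probability epsilon, else the greedy one.

    `greedy_action` plays the role of the result of computeActionFromQValues.
    """
    if not legal_actions:
        return None  # terminal state: nothing to choose
    if rng.random() < epsilon:
        return rng.choice(legal_actions)  # explore
    return greedy_action                  # exploit
```

With epsilon = 0 the agent is purely greedy, and with epsilon = 1 it acts uniformly at random over the legal actions.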

Question 6

We choose the values answerEpsilon = 0.1 and answerLearningRate = 0.8. The optimal path cannot be found with 99% reliability: 50 episodes is too few, so more episodes are needed for the search.

