-
Notifications
You must be signed in to change notification settings - Fork 90
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
4 changed files
with
38 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,2 +1,37 @@ | ||
第1章 简介 | ||
=========== | ||
|
||
当我们思考学习的本质时,我们首先想到的是通过与环境互动来学习。 | ||
当一个婴儿玩耍,挥动手臂或环顾四周时,他没有明确的老师,但他确实通过直接的感觉与环境联系。 | ||
他可以通过这种联系获得大量关于因果关系、行动的结果以及如何实现目标的信息。 | ||
在我们的生活中,这种互动无疑是我们环境和自身知识的主要来源。 | ||
无论我们是学习驾驶汽车还是进行交谈,我们都敏锐地意识到我们的环境如何响应我们的行为,并且我们试图通过我们的行为来影响所发生的事情。 | ||
从互动中学习是几乎所有学习和智能理论的基本思想。 | ||
|
||
在本书中,我们探索了一种从交互中学习的 *计算* 方法。 | ||
我们主要探索理想化的学习情境并评估各种学习方法的有效性,而不是直接理解人或动物的学习方式 [#学习方式]_ 。也就是说,我们采用人工智能研究员或工程师的观点。 | ||
我们探索在解决科学或经济利益的学习问题方面有效的机器设计,通过数学分析或计算实验评估设计。 | ||
我们探索的方法称为 *强化学习*,更侧重于从交互中进行目标导向的学习,而不是其他机器学习方法。 | ||
|
||
.. [#学习方式] 第14章和第15章总结了心理学和神经科学的关系。 | ||
1.1 强化学习 | ||
------------ | ||
|
||
1.2 例子 | ||
-------- | ||
|
||
1.3 强化学习的要素 | ||
------------------ | ||
|
||
1.4 Limitations and Scope | ||
-------------------------- | ||
|
||
1.5 An Extended Example: Tic-Tac-Toe | ||
-------------------------------------- | ||
|
||
1.6 Summary | ||
----------- | ||
|
||
1.7 Early History of Reinforcement Learning | ||
-------------------------------------------- |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,4 @@ | ||
================================= | ||
第一部分 Tabular Solution Methods | ||
================================= | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,4 @@ | ||
===================================== | ||
第二部分 Approximate Solution Methods | ||
===================================== | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,4 @@ | ||
======================== | ||
第三部分 Looking Deeper | ||
======================== | ||
|
||
|