- Machine Learning with Swift
- Alexander Sosnovshchenko
- 138字
- 2021-06-24 18:54:52
Reinforcement learning
Reinforcement learning is special in the sense that it doesn't require a dataset (see the following diagram). Instead, it involves an agent who takes actions, changing the state of the environment. After each step, it gets a reward or punishment, depending on the state and previous actions. The goal is to obtain a maximum cumulative reward. It can be used to teach the computer to play video games or drive a car. If you think about it, reinforcement learning is the way our pets train us humans: by rewarding our actions with tail-wagging, or punishing with scratched furniture.
One of the central topics in reinforcement learning is the exploration-exploitation dilemma—how to find a good balance between exploring new options and using what is already known:
![](https://epubservercos.yuewen.com/A3C592/19470396901583706/epubprivate/OEBPS/Images/b58d2ea2-831a-4509-b5d4-0df5cb8f3ad3.png?sign=1739683565-nuFJ4YT68FwyKZ56F222mNfecSOIcGmP-0-22dd5404530963cfb4debfc1077572d0)
Table 1.3: ML tasks:
![](https://epubservercos.yuewen.com/A3C592/19470396901583706/epubprivate/OEBPS/Images/003.jpg?sign=1739683565-5qp2M7h9vzjqL4SKOe5zHM9NmWYApPo8-0-75e1f863a6f35e182390ae8baf88b01c)