27. 非監督式學習
機器學習模型
Beijing is the capital of China.
As China's capital, Beijing is a large and
vibrant city.
Tokyo is the capital of Japan.
As Japan’s capital, Tokyo is a large and
vibrant city.
…….
資料
結果
正向反饋(Positive Reward)
負向反饋(Negative Reward)
「明天的世界只和今天有關、和昨天無關了。」(The future is independent of the past given the present.)
多拉桿吃角子老虎機(Multi-armed Bandit)
最重要的目標只有探索(Explore)和採集(Exploit)的平衡