https://codingnote.cc/zh-hans/p/244756/
First contact reinforcement learning