The system used deep neural networks and reinforcement learning AlphaGo gained fame after defeating 18-time world champion Lee Sedol A documentary was made on AlphaGo’s match vs Lee Sedol in 2017 ...
An AI agent reads its own source code, forms a hypothesis for improvement (such as changing a learning rate or an architecture depth), modifies the code, runs the experiment, and evaluates the results ...