China did reinforcement learning at scale in the real world. They essentially de...

China did reinforcement learning at scale in the real world. They essentially dedicated a decade for exploration, inviting experts from all countries to some local area (each expert/country of origin combo in a different one) and developed it according to what experts advised. Then they evaluated the changes, took the best performing ones and went into full exploitation mode, spreading those lessons throughout the China. They also had a large part of Africa for additional experiments on what worked.