S15-261 Reinforcement learning with bootstrapped value function randomization Stanford researchers have developed a new algorithm for reinforcement learning, which can learn to take good actions with potentially long term consequences in a general unknown complex system. Ian Osband Benjamin Van Roy