Wolfram Computation Meets Knowledge

Wolfram Summer School

Alumni

Ian Fan

Technology and Innovation

Class of 2018

Bio

Ian Fan is a final-year student at the University of Melbourne, Australia. He is pursuing a bachelor’s in computer science. He is interested in finding elegant and efficient algorithms to solve problems. The Wolfram Summer School is able to help him to gain experience and lead him to a deeper understanding of computer science.

Computational Essay

MinMax Algorithm »

Project: Reinforcement Q-Learning for Atari Games

Goal

This project aims to create a neural network agent that plays Atari games. This agent is trained using Q-learning. The agent will not have any a priori knowledge of the game and will be able to learn by playing the game and only being told when it loses.

Main Results in Detail

Successfully implemented a classical Q-learning agent on the CartPole environment and was able to achieve an average performance of 195 episodes in 300 games. Tried to add time information in the input, and the agent achieved 8,000 episodes within 300 games.

Future Work

Make the Q-learning agent with multiple observations as input, make the performance more stable and implement different techniques that increase the network’s performance, like DDQN, NoisyNet, etc.