PinnedCheng Xi TsouinNerd For TechGenetic Algorithm: 8 Queens ProblemIn my recent lecture on AI (CS4100), I came across an interesting concept: a genetic algorithm. As described in “Artificial Intelligence: A…May 18, 2021May 18, 2021
PinnedCheng Xi TsouinGeek CulturePolicy Optimizations: TRPO/PPOIn this post, I will be talking about policy optimization methods from the papers Trust Region Policy Optimization (Schulman et al. 2015)…Sep 17, 2021Sep 17, 2021
Cheng Xi TsouinGeek CultureIntroduction to Deterministic Policy Gradient (DPG)In this post, I will be exploring the concepts following the paper Deterministic Policy Gradient Algorithms (Silver et al.), implementing…Aug 26, 20211Aug 26, 20211
Cheng Xi TsouinGeek CultureActor-Critic: Off-Policy Actor-Critic AlgorithmIn this post, I will be exploring the ideas behind the paper Off-Policy Actor-Critic (Degris et al.) submitted to the ICML 2012. The paper…Aug 18, 2021Aug 18, 2021
Cheng Xi TsouinGeek CulturePolicy Parameterization for a Continuous Action SpaceIn the past few Policy Gradient and Actor-Critic algorithms I’ve implemented, I’ve been using the classical control environment, CartPole…Aug 9, 20211Aug 9, 20211
Cheng Xi TsouinGeek CultureActor-Critic: Implementing Actor-Critic MethodsIn this post, I’ll be implementing some Actor-Critic methods using the policy gradients methods and value function approximations from my…Aug 3, 2021Aug 3, 2021
Cheng Xi TsouinGeek CultureActor-Critic: Value Function ApproximationsIn my previous post, I discussed a way to reduce variance by using the generalized policy update equation, which is derived from the policy…Jul 23, 2021Jul 23, 2021
Cheng Xi TsouinNerd For TechPolicy Gradients: REINFORCE with BaselineAfter an introduction to the REINFORCE algorithm, I wanted to explore a little bit further this simple algorithm derived from the policy…Jul 17, 2021Jul 17, 2021
Cheng Xi TsouinNerd For TechReinforcement Learning: Introduction to Policy GradientsIn the previous posts, I have been working on a form of Reinforcement learning, Q learning, where the agent finds an optimal policy that…Jul 14, 20211Jul 14, 20211
Cheng Xi TsouinNerd For TechReinforcement Learning: Deep Q-Learning with Atari gamesIn my previous post A First Look at Reinforcement Learning, I attempted to use Deep Q learning to solve the CartPole problem. In this post…Jul 8, 20211Jul 8, 20211