Cheng Xi Tsou – Medium

Cheng Xi Tsou

Pinned

Published in
Nerd For Tech

Genetic Algorithm: 8 Queens Problem

In my recent lecture on AI (CS4100), I came across an interesting concept: a genetic algorithm. As described in “Artificial Intelligence: A…

May 18, 2021

Genetic Algorithm: 8 Queens Problem

May 18, 2021

Pinned

Published in
Geek Culture

Policy Optimizations: TRPO/PPO

In this post, I will be talking about policy optimization methods from the papers Trust Region Policy Optimization (Schulman et al. 2015)…

Sep 17, 2021

Policy Optimizations: TRPO/PPO

Sep 17, 2021

Published in
Geek Culture

Introduction to Deterministic Policy Gradient (DPG)

In this post, I will be exploring the concepts following the paper Deterministic Policy Gradient Algorithms (Silver et al.), implementing…

Aug 26, 2021

Introduction to Deterministic Policy Gradient (DPG)

Aug 26, 2021

Published in
Geek Culture

Actor-Critic: Off-Policy Actor-Critic Algorithm

In this post, I will be exploring the ideas behind the paper Off-Policy Actor-Critic (Degris et al.) submitted to the ICML 2012. The paper…

Aug 18, 2021

Actor-Critic: Off-Policy Actor-Critic Algorithm

Aug 18, 2021

Published in
Geek Culture

Policy Parameterization for a Continuous Action Space

In the past few Policy Gradient and Actor-Critic algorithms I’ve implemented, I’ve been using the classical control environment, CartPole…

Aug 9, 2021

Policy Parameterization for a Continuous Action Space

Aug 9, 2021

Published in
Geek Culture

Actor-Critic: Implementing Actor-Critic Methods

In this post, I’ll be implementing some Actor-Critic methods using the policy gradients methods and value function approximations from my…

Aug 3, 2021

Actor-Critic: Implementing Actor-Critic Methods

Aug 3, 2021

Published in
Geek Culture

Actor-Critic: Value Function Approximations

In my previous post, I discussed a way to reduce variance by using the generalized policy update equation, which is derived from the policy…

Jul 23, 2021

Actor-Critic: Value Function Approximations

Jul 23, 2021

Published in
Nerd For Tech

Policy Gradients: REINFORCE with Baseline

After an introduction to the REINFORCE algorithm, I wanted to explore a little bit further this simple algorithm derived from the policy…

Jul 17, 2021

Policy Gradients: REINFORCE with Baseline

Jul 17, 2021

Published in
Nerd For Tech

Reinforcement Learning: Introduction to Policy Gradients

In the previous posts, I have been working on a form of Reinforcement learning, Q learning, where the agent finds an optimal policy that…

Jul 14, 2021

Reinforcement Learning: Introduction to Policy Gradients

Jul 14, 2021

Published in
Nerd For Tech

Reinforcement Learning: Deep Q-Learning with Atari games

In my previous post A First Look at Reinforcement Learning, I attempted to use Deep Q learning to solve the CartPole problem. In this post…

Jul 8, 2021

Reinforcement Learning: Deep Q-Learning with Atari games

Jul 8, 2021

Cheng Xi Tsou

Cheng Xi Tsou

Interested in Web Dev, AI/ML, specifically RL. Github: github.com/chengxi600

Following

Help
Status
About
Careers
Press
Blog
Privacy
Rules
Terms
Text to speech