Open in app

Sign in

Write

Sign in

Cheng Xi Tsou
Cheng Xi Tsou

95 Followers

Home

About

Pinned
Cheng Xi Tsou

Cheng Xi Tsou

in

Nerd For Tech

Genetic Algorithm: 8 Queens Problem

In my recent lecture on AI (CS4100), I came across an interesting concept: a genetic algorithm. As described in “Artificial Intelligence: A…

May 18, 2021
Genetic Algorithm: 8 Queens Problem
Genetic Algorithm: 8 Queens Problem
May 18, 2021
Pinned
Cheng Xi Tsou

Cheng Xi Tsou

in

Geek Culture

Policy Optimizations: TRPO/PPO

In this post, I will be talking about policy optimization methods from the papers Trust Region Policy Optimization (Schulman et al. 2015)…

Sep 17, 2021
Policy Optimizations: TRPO/PPO
Policy Optimizations: TRPO/PPO
Sep 17, 2021
Cheng Xi Tsou

Cheng Xi Tsou

in

Geek Culture

Introduction to Deterministic Policy Gradient (DPG)

In this post, I will be exploring the concepts following the paper Deterministic Policy Gradient Algorithms (Silver et al.), implementing…

Aug 26, 2021
1
Introduction to Deterministic Policy Gradient (DPG)
Introduction to Deterministic Policy Gradient (DPG)
Aug 26, 2021
1
Cheng Xi Tsou

Cheng Xi Tsou

in

Geek Culture

Actor-Critic: Off-Policy Actor-Critic Algorithm

In this post, I will be exploring the ideas behind the paper Off-Policy Actor-Critic (Degris et al.) submitted to the ICML 2012. The paper…

Aug 18, 2021
Actor-Critic: Off-Policy Actor-Critic Algorithm
Actor-Critic: Off-Policy Actor-Critic Algorithm
Aug 18, 2021
Cheng Xi Tsou

Cheng Xi Tsou

in

Geek Culture

Policy Parameterization for a Continuous Action Space

In the past few Policy Gradient and Actor-Critic algorithms I’ve implemented, I’ve been using the classical control environment, CartPole…

Aug 9, 2021
1
Policy Parameterization for a Continuous Action Space
Policy Parameterization for a Continuous Action Space
Aug 9, 2021
1
Cheng Xi Tsou

Cheng Xi Tsou

in

Geek Culture

Actor-Critic: Implementing Actor-Critic Methods

In this post, I’ll be implementing some Actor-Critic methods using the policy gradients methods and value function approximations from my…

Aug 3, 2021
Actor-Critic: Implementing Actor-Critic Methods
Actor-Critic: Implementing Actor-Critic Methods
Aug 3, 2021
Cheng Xi Tsou

Cheng Xi Tsou

in

Geek Culture

Actor-Critic: Value Function Approximations

In my previous post, I discussed a way to reduce variance by using the generalized policy update equation, which is derived from the policy…

Jul 23, 2021
Actor-Critic: Value Function Approximations
Actor-Critic: Value Function Approximations
Jul 23, 2021
Cheng Xi Tsou

Cheng Xi Tsou

in

Nerd For Tech

Policy Gradients: REINFORCE with Baseline

After an introduction to the REINFORCE algorithm, I wanted to explore a little bit further this simple algorithm derived from the policy…

Jul 17, 2021
Policy Gradients: REINFORCE with Baseline
Policy Gradients: REINFORCE with Baseline
Jul 17, 2021
Cheng Xi Tsou

Cheng Xi Tsou

in

Nerd For Tech

Reinforcement Learning: Introduction to Policy Gradients

In the previous posts, I have been working on a form of Reinforcement learning, Q learning, where the agent finds an optimal policy that…

Jul 14, 2021
1
Reinforcement Learning: Introduction to Policy Gradients
Reinforcement Learning: Introduction to Policy Gradients
Jul 14, 2021
1
Cheng Xi Tsou

Cheng Xi Tsou

in

Nerd For Tech

Reinforcement Learning: Deep Q-Learning with Atari games

In my previous post A First Look at Reinforcement Learning, I attempted to use Deep Q learning to solve the CartPole problem. In this post…

Jul 8, 2021
1
Reinforcement Learning: Deep Q-Learning with Atari games
Reinforcement Learning: Deep Q-Learning with Atari games
Jul 8, 2021
1
Cheng Xi Tsou

Cheng Xi Tsou

95 Followers

Interested in Web Dev, AI/ML, specifically RL. Github: github.com/chengxi600

Following
  • Pictureframe

    Pictureframe

  • Towards Data Science

    Towards Data Science

  • Barack Obama

    Barack Obama

  • Saahil Kumar

    Saahil Kumar

  • @PatrickYoon

    @PatrickYoon

See all (6)

Help

Status

About

Careers

Press

Blog

Privacy

Terms

Text to speech

Teams