Skip to content

Releases: CUN-bjy/gym-td3-keras

TD3 basis implemented.

17 Jan 10:10
Compare
Choose a tag to compare

What is differences from DDPG

  1. Overestimation Bias Problem Solved

    • double Q-network
    • clipped double q-update
  2. Addressing Variance

    • delayed policy update
    • target policy smoothing