TianShou
0.1

Tutorials:

  • DDPG (Deep Deterministic Policy Gradient) with TianShou
    • Make an Environment
    • Build the Networks
    • Construct Optimization Methods
    • Specify Data Acquisition
    • Start Training!

API Docs

  • tianshou.core.policy
    • Base class
    • Deterministic policy
    • Distributional policy
    • DQN policy
  • tianshou.core.value_function
    • Base class
    • State value
    • Action value
  • tianshou.core.losses
  • tianshou.core.opt
  • tianshou.core.random
  • tianshou.core.utils
  • tianshou.data.data_buffer
    • Base class
    • Batch set
    • Replay buffer base
    • Vanilla replay buffer
  • tianshou.data.advantage_estimation
  • tianshou.data.data_collector
  • tianshou.data.tester
TianShou
  • Docs »
  • Welcome to TianShou’s documentation!
  • View page source

Welcome to TianShou’s documentation!¶

Tutorials:

  • DDPG (Deep Deterministic Policy Gradient) with TianShou
    • Make an Environment
    • Build the Networks
    • Construct Optimization Methods
    • Specify Data Acquisition
    • Start Training!

API Docs

  • tianshou.core.policy
  • tianshou.core.value_function
  • tianshou.core.losses
  • tianshou.core.opt
  • tianshou.core.random
  • tianshou.core.utils
  • tianshou.data.data_buffer
  • tianshou.data.advantage_estimation
  • tianshou.data.data_collector
  • tianshou.data.tester

Indices and tables¶

  • Index
  • Module Index
  • Search Page
Next

© Copyright 2018, TSAIL.

Built with Sphinx using a theme provided by Read the Docs.