PyTorch implementation of Value Iteration Networks (VIN) (NIPS '16 best paper)
Last updated: almost 5 years ago
Self-Consistent Trajectory Autoencoder: Hierarchical Reinforcement Learning with Trajectory Embeddings
Last updated: almost 5 years ago
PyTorch implementation of DQN / DDQN / prioritized replay / noisy networks / distributional values / Rainbow / hierarchical RL
Last updated: almost 5 years ago