Applying Deep Reinforcement Learning to control physically-based models.
This repository contains:
- Custom gymnasium environments
- Implementations of RL agents to solve these environments
These custom environments are built with Gymnasium and MuJoCo. They are typically more complex than the environments Gymnasium ships with and are designed to require more sophisticated control strategies. Feel free to use them to test your own RL agent implementations.
All custom environments are registered in the Gymnasium registry, so you can create them with `gym.make()`. They inherit from the `MujocoEnv` base class provided by Gymnasium, so the standard methods and attributes of a MuJoCo environment are available, and any custom parameters supported by `MujocoEnv` can be passed through as well.
To use these environments, you will need the following dependencies:
- `gymnasium`
- `mujoco`
The provided test PPO agent has additional dependencies of its own (see the imports in ppo.py).
You can refer to ppo.py for an example of training an RL agent on these environments. With the dependencies installed, you can run the provided test PPO agent with:

```shell
python src/ppo.py
```

Here are some of the custom environments included in this repository.
The goal of this environment is to move the robot arm to grab the box and place it at the target location. It uses the ViperX 300 6DOF robot arm model from Trossen Robotics, provided by the mujoco-menagerie repository. The full parameter list can be found in viperx.py.
```python
import gymnasium as gym

import envs  # the module containing the src/envs directory in this repository

models_path = "models"  # should be the absolute path to the models directory

env = gym.make(
    "ViperX-v0",
    render_mode="human",
    frame_skip=5,
    max_episode_steps=20000,  # each env step runs 5 physics steps, due to frame_skip
)
```

This environment simulates a simple quadruped robot walking towards a target location. You can use either the Spot quadruped robot model from Boston Dynamics or the ANYmal B quadruped robot model from ANYbotics, both provided by the mujoco-menagerie repository. The full parameter lists can be found in the following files:
- Forward Locomotion: forward.py
  Caution! A commonly achieved gait is the quadruped performing a series of backflips to move forward, learned under sub-optimal environment parameters 😅. Train the robot to walk / run instead for more stable performance.
- Targeted Locomotion: target.py
- Targeted Locomotion with non-uniform terrain: terrain.py
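As a note on the `frame_skip` / `max_episode_steps` comments in the snippets in this README: each call to `step()` advances the physics `frame_skip` times, so the episode length in physics steps and the effective control timestep both scale with `frame_skip`. A quick sketch (the model timestep below is an assumed common MuJoCo default, not taken from this repository's XML files):

```python
# Assumed typical MuJoCo model timestep; check the model XML for the real value
model_timestep = 0.002     # seconds per physics step
frame_skip = 5             # physics steps per env.step() call
max_episode_steps = 20000  # env steps before the episode is truncated

control_dt = model_timestep * frame_skip        # effective control timestep (s)
physics_steps = max_episode_steps * frame_skip  # total physics steps per episode
print(physics_steps)  # 100000
```

So with these values the policy acts at 100 Hz while the simulation runs at 500 Hz.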
These environments provide a set of observation / reward functions that you can use to train a quadruped robot towards each environment's goal. For environments like terrain.py, it is best to supply your own custom scene files to generate your own terrain. The example scene files are listed below for reference.
```python
import os

import gymnasium as gym

import envs  # the module containing the src/envs directory in this repository

models_path = "models"  # should be the absolute path to the models directory

env = gym.make(
    "LeggedForwardEnv",
    render_mode="human",
    frame_skip=5,
    max_episode_steps=20000,  # each env step runs 5 physics steps, due to frame_skip
    xml_file=os.path.join(models_path, "boston_dynamics_spot/scene_gap.xml"),  # forward locomotion with gaps
)

env = gym.make(
    "LeggedTargetEnv",
    render_mode="human",
    frame_skip=5,
    max_episode_steps=20000,
    xml_file=os.path.join(models_path, "anybotics_anymal_b/target.xml"),
)

env = gym.make(
    "LeggedTerrainEnv",
    render_mode="human",
    frame_skip=5,
    max_episode_steps=20000,
    xml_file=os.path.join(models_path, "anybotics_anymal_b/scene_terrain.xml"),  # bumpy terrain generated with Perlin noise
)
```

This environment simulates a simple bipedal robot walking forward. It uses the Cassie bipedal robot model from Agility Robotics, provided by the mujoco-menagerie repository.
```python
import os

import gymnasium as gym

import envs  # the module containing the src/envs directory in this repository

models_path = "models"  # should be the absolute path to the models directory

env = gym.make(
    "Cassie-v0",
    render_mode="human",
    frame_skip=5,
    max_episode_steps=20000,  # each env step runs 5 physics steps, due to frame_skip
    xml_file=os.path.join(models_path, "agility_cassie/scene.xml"),  # forward locomotion
)
```
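Whichever environment you create, interaction follows the standard Gymnasium `reset()` / `step()` loop. The sketch below uses a minimal hypothetical stub environment (not one of this repository's MuJoCo environments) so the control flow runs anywhere:

```python
class StubEnv:
    """Hypothetical stand-in mimicking the Gymnasium reset/step signature."""

    def __init__(self, horizon=10):
        self.horizon = horizon  # plays the role of max_episode_steps
        self.t = 0

    def reset(self, seed=None):
        self.t = 0
        return [0.0, 0.0, 0.0], {}  # observation, info

    def step(self, action):
        self.t += 1
        obs = [float(self.t)] * 3
        reward = -sum(a * a for a in action)  # e.g. penalize control effort
        terminated = False                    # task success/failure condition
        truncated = self.t >= self.horizon    # time limit reached
        return obs, reward, terminated, truncated, {}

env = StubEnv()
obs, info = env.reset(seed=0)
total_reward, done = 0.0, False
while not done:
    action = [0.0, 0.0]  # replace with your policy's output, e.g. agent.act(obs)
    obs, reward, terminated, truncated, info = env.step(action)
    total_reward += reward
    done = terminated or truncated
```

The same loop works unchanged against the environments above, since they return the standard five-tuple from `step()`.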

