Code Monkey home page Code Monkey logo

finetuning-octo's Introduction

Finetuning Octo to a new Observation/Action Space

Installation

The following instructions were tested on Ubuntu 22.04 and Linux 5.13.0 using Python 3.10

Create a virtual environment using Anaconda:

conda create -n robot-learning python=3.10
conda activate robot-learning

Clone and install Octo:

git clone https://github.com/octo-models/octo.git
cd octo 
pip install -e .
pip install -r requirements.txt

Install JAX (GPU or TPU):

pip install --upgrade "jax[cuda11_pip]==0.4.20" -f https://storage.googleapis.com/jax-releases/jax_cuda_releases.html
pip install --upgrade "jax[tpu]==0.4.20" -f https://storage.googleapis.com/jax-releases/libtpu_releases.html

Clone Tony Z. Zhao's ACT (Action Chunking with Transformers) for its simulated ALOHA environment Transfer Cube:

git clone https://github.com/tonyzhaozh/act.git
pip3 install opencv-python modern_robotics pyrealsense2 h5py_cache pyquaternion pyyaml rospkg pexpect mujoco==2.3.3 dm_control==1.0.9 einops packaging h5py ipython

Finetuning

Finetuning Octo to a new observation space and a new action space on simulated ALOHA cube handover data.

Download and extract the dataset:

wget https://rail.eecs.berkeley.edu/datasets/example_sim_data.zip
unzip example_sim_data.zip

Run scripts/finetune.py:

python scripts/finetune.py \
--pretrained_path=hf://rail-berkeley/octo-small \
--data_dir=PATH/TO/aloha_sim_dataset  \
--save_dir=PATH/TO/CHECKPOINT/DIR

finetuning-octo's People

Contributors

j4nn1k avatar

Stargazers

Angus Stewart avatar

Watchers

 avatar

Forkers

corneliamelon

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.