To support other cameras, you need to customize the data capture pipeline (similar to here) and modify the code here. You also need to perform undistortion yourself.
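For the undistortion, OpenCV's `cv2.undistort` works given the camera's calibrated intrinsics and distortion coefficients; below is a minimal sketch (the intrinsic and distortion values, as well as the file names, are placeholders, not real calibration results):

```python
import cv2
import numpy as np

# Placeholder intrinsics and distortion coefficients; substitute the values
# from your own camera calibration.
K = np.array([[605.0,   0.0, 320.0],
              [  0.0, 605.0, 240.0],
              [  0.0,   0.0,   1.0]])
dist = np.array([0.10, -0.20, 0.0, 0.0, 0.05])  # k1, k2, p1, p2, k3

img = cv2.imread("capture.png")           # image from your capture pipeline
undistorted = cv2.undistort(img, K, dist)
cv2.imwrite("capture_undistorted.png", undistorted)
```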
To support other robot arms, you need to do the following:
1) Since we don't have other types of robot arms in our lab, you need to a) tune the textures of the robot model, b) define the `mesh_mapping`, c) modify the control code, and d) write the `render_api` for your robot (a hypothetical sketch of such an interface follows).
2) Obtain the segmentation network.
3) Initialize the camera pose.
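As a purely hypothetical illustration of 1d): a render API typically takes the robot's joint angles and a candidate camera pose and returns a rendered robot mask. The function name, signature, and conventions below are assumptions, not the repository's actual interface:

```python
import numpy as np

def render_api(joint_angles: np.ndarray, cam_pose: np.ndarray,
               K: np.ndarray, height: int, width: int) -> np.ndarray:
    """Hypothetical interface: render a binary robot mask.

    Set your robot model to ``joint_angles``, place a camera with intrinsics
    ``K`` at ``cam_pose`` (assumed here to be a 4x4 camera-to-base transform),
    render the scene, and return a (height, width) boolean mask that is True
    on robot pixels.
    """
    raise NotImplementedError("Implement with your renderer, e.g. SAPIEN.")
```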
We provide two approaches to obtain the segmentation mask: 1) train a PointRend model, or 2) manually label with SAM.
- Render data in SAPIEN.

  a) The main script for the rendering is `tools/simulate/gen_data_for_mask_training.py`.

  b) Split the generated data into train/val splits with `tools/simulate/split_mask_training_data.py`.
- Train PointRend using the synthetic data.

  a) Register the dataset at `builtin.py` (see the registration sketch after this list).

  b) Link the generated dataset:

  ```bash
  ln -s data/xarm7/simulate/mask_data third_party/detectron2/datasets/xarm7
  ```

  c) Train PointRend:

  ```bash
  cd third_party/detectron2/projects/PointRend/
  python train_net.py -c configs/InstanceSegmentation/pointrend_rcnn_R_50_FPN_3x_xarm7.yaml
  ```
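For step a), if your rendered data is exported in COCO format, it can be registered with detectron2's `register_coco_instances`; below is a minimal sketch (the dataset name and paths are assumptions that must match your config and the symlink above):

```python
from detectron2.data.datasets import register_coco_instances

# Assumed dataset name and paths: the name must match the DATASETS entry in
# the PointRend config, and the paths must point at the symlinked data.
register_coco_instances(
    "xarm7_train",                            # name referenced by the config
    {},                                       # extra metadata
    "datasets/xarm7/train/annotations.json",  # COCO-format annotation file
    "datasets/xarm7/train",                   # image root
)
```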
- Finetune the model on some real data (optional, but recommended).

  a) Capture real data and organize it like the example data. Download the example data with the following commands:

  ```bash
  cd data/xarm7/simulate/mask_data/
  gdown 1FhVb-Ba154x8jdQxXiq_520EXnKkFJhN
  unzip xarm7_finetune.zip && rm xarm7_finetune.zip
  cd ../../../../
  ```

  b) Register the dataset at `builtin.py`.

  c) Train PointRend:

  ```bash
  python train_net.py -c configs/InstanceSegmentation/pointrend_rcnn_R_50_FPN_3x_xarm7_finetune.yaml
  ```
- To manually label with SAM, download the SAM checkpoint:

  ```bash
  mkdir -p models/sam/ && wget https://dl.fbaipublicfiles.com/segment_anything/sam_vit_h_4b8939.pth -O models/sam/sam_vit_h_4b8939.pth
  ```
- Run EasyHeC with the following command. The prompt drawer released on 2023.7.11 allows you to draw multiple boxes and take their union as the robot mask. Press "z" to undo, "p" to switch to the point prompt mode, and "b" to switch to the bounding box prompt mode. Although the point prompt is supported, we recommend using the bounding box prompt since it is more stable.

  ```bash
  python tools/run_easyhec.py -c configs/xarm7/example_sam.yaml
  ```
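For reference, the union-of-boxes labeling above can also be reproduced programmatically with the `segment_anything` API; below is a minimal sketch (the image path and box coordinates are placeholders):

```python
import cv2
import numpy as np
from segment_anything import SamPredictor, sam_model_registry

# Load the checkpoint downloaded above and wrap it in a predictor.
sam = sam_model_registry["vit_h"](checkpoint="models/sam/sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

image = cv2.cvtColor(cv2.imread("frame.png"), cv2.COLOR_BGR2RGB)
predictor.set_image(image)

# Placeholder boxes in (x0, y0, x1, y1) pixel coordinates; union the per-box
# masks to get the final robot mask.
boxes = [np.array([100, 80, 400, 460]), np.array([380, 200, 520, 330])]
robot_mask = np.zeros(image.shape[:2], dtype=bool)
for box in boxes:
    masks, _, _ = predictor.predict(box=box, multimask_output=False)
    robot_mask |= masks[0].astype(bool)
```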
To initialize the camera pose for robots other than XArm7, we provide the following approaches:
- Train a PVNet as described in our paper. This is time-consuming but generalizes well to different camera poses.
- Initialize with the results of other methods, such as traditional marker-based methods. This is convenient if you already have such methods implemented.
- Manually initialize the camera pose by tuning.
- Initialize the camera pose with the last calibration result. This is convenient if the camera pose is roughly the same each time (see the sketch below).
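For the last option, here is a minimal sketch of saving a calibration result and reloading it as the next run's initial pose (the file path and variable names are assumptions, not part of the codebase):

```python
import numpy as np

# After a successful run, persist the optimized 4x4 camera-to-base transform.
Tc_c2b = np.eye(4)  # placeholder; use the pose produced by your last run
np.save("data/last_calibration.npy", Tc_c2b)

# On the next run, load it back as the initial camera pose.
init_pose = np.load("data/last_calibration.npy")
```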
Here, we provide an example of training the PVNet.
- Generate data in SAPIEN.

  ```bash
  python tools/simulate/gen_data_for_pvnet.py -c configs/xarm7/simulate/pvnet_data.yaml
  python tools/simulate/convert_pvnet_data_to_pvnet_format.py -c configs/xarm7/simulate/pvnet_data.yaml
  cd third_party/pvnet/
  python run.py --type custom custom.data_dir ../../data/xarm7/simulate/pvnet_data/pvnet_format/train
  python run.py --type custom custom.data_dir ../../data/xarm7/simulate/pvnet_data/pvnet_format/test
  ```
- Register the dataset here.
- Train PVNet.

  ```bash
  python train_net.py -c third_party/pvnet/configs/xarm7/10k.yaml
  ```