Skip to content

pzhren/InfiniteWorld

Repository files navigation

🌍 InfiniteWorld

A Unified Scalable Simulation Framework for General Visual-Language Robot Interaction

Pengzhen Ren*, Min Li*, Zhen Luo*, Xinshuai Song*, Ziwei Chen*, Weijia Liufu*, Yixuan Yang*, Hao Zheng*

Rongtao Xu, Zitong Huang, Tongsheng Ding, Luyang Xie, Kaidong Zhang, Changfei Fu, Yang Liu, Liang Lin, Feng Zheng📧, Xiaodan Liang📧

* equal contribution. 📧 corresponding author.

[Paper.] [Workshop]

20241202_215955

🚀 Introduction

  • We have built a unified and scalable simulation framework that integrates various improved and latest embodied asset reconstruction methods. This has greatly alleviated the community's plight of lacking high-quality embodied assets.
  • We build a complete web-based smart point cloud automatic annotation framework that supports distributed collaboration, AI assistance, and optional human-in-the-loop features. This provides strong support for complex robot interactions.
  • We designed systematic benchmarks for robot interaction, including scene graph collaborative exploration and open-world social mobile manipulation. This provides a comprehensive and systematic evaluation of the capabilities of embodied agents in perception, planning, execution, and communication.

📄 Simulator

20241202_221116

Overview of the functions of InfiniteWorld simulator. Our simulation platform supports different sensors, robot platforms, and teleoperation. In addition, it also realizes unlimited expansion of scene and object assets through generative and Sim2Real methods, and we have also built an annotation platform to reduce annotation costs and improve annotation quality.

Language-Driven Automatic Scene Generation and Editing

20241202_221141Language-driven automatic scene generation and editing framework based on HOLODECK [77]. It can easily generate various interactive high-fidelity scenes that meet the requirements of users, including scene style replacement, object editing (e.g., adding/removing a specific number of objects), and replacement (that is, replacing similar objects), etc.

Depth-Prior-Constrained Real2Sim

We use the depth prior to constrain the generation of the 3D model, which enhances simulation realism by integrating depth priors and normal priors into the Planar-based Gaussian Splatting Reconstruction (PGSR) framework.

Additionally, this project contains Python scripts for post-processing reconstructed mesh scene ply files, which could help you optimize, repair, and enhance the 3D mesh. Especially, an example of a conda environment real2sim/post-process/env.yaml is provided, which has been tested to successfully run all scripts on Windows 11.

Annot8-3D

The 3D annotation framework is open-sourced at: https://github.com/zh-plus/annot8-3d.

Unified 3D Asset

We provide code for converting 3D datasets into different formats in 3D format convert. We have the scene converter and object converter. The scene converter requires installing the pymeshlab library. The script ./object_format_convert/obj_usd.py allows you to convert objects in OBJ format to USD format for use in Isaac Sim. For batch conversions, you can use obj_usd_multi.py, though its compatibility with Isaac Sim is limited. Additionally, ./object_format_convert/pkl_obj.py converts files from PKL to OBJ format, which is the first step in adapting the Objaverse dataset for use in Isaac Sim.

The example of object converter is in the corresponding file folders. Due to the large size of the scene, the examples are in the link here with the [password:1234].

And here is the link for our 3D object assets list:

Dataset Categories Count Scenes Links Password
3D Front 21 5,172 Furniture scenes Download qprj
PartNet 24 26,671 Articulated objects Download parn
Objaverse (Holodeck) 940 4,042,937 Small objects Download etmn
ClothesNet 11 3,051 Clothing Download Link -

Generation Scene

We provide scripts in generation scene to import from the scene generated json file. You can convert them to usd format using ./multi_place.py. Note: you need to change 'scene_files' 'output_files' and 'obj_data_usd' to your own paths in code.

Benchmark

20241202_220957 Detailed benchmark descriptions can be found here.

Citation

If you find this code useful in your work, please consider citing

@misc{ren2024infiniteworld,
    title={InfiniteWorld: A Unified Scalable Simulation Framework for General Visual-Language Robot Interaction},
    author={Pengzhen Ren and Min Li and Zhen Luo and Xinshuai Song and Ziwei Chen and Weijia Liufu and Yixuan Yang and Hao Zheng and Rongtao Xu and Zitong Huang and Tongsheng Ding and Luyang Xie and Kaidong Zhang and Changfei Fu and Yang Liu and Liang Lin and Feng Zheng and Xiaodan Liang},
    year={2024},
    eprint={2412.05789},
    archivePrefix={arXiv},
    primaryClass={cs.RO}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published