Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
JasonMa2016 committed May 3, 2024
1 parent 88dca77 commit 3ee3d07
Showing 1 changed file with 7 additions and 4 deletions.
11 changes: 7 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,7 @@

<div align="center">

[[Website]]( https://eureka-research.github.io/dr-eureka/)
[[Website]](https://eureka-research.github.io/dr-eureka/)
[[arXiv (coming soon!)]](https://arxiv.org/abs/2310.12931)
[[PDF]](https://eureka-research.github.io/dr-eureka/assets/eureka_paper.pdf)

Expand All @@ -18,9 +18,12 @@
[![GitHub license](https://img.shields.io/github/license/eureka-research/Eureka)](https://github.com/eureka-research/Eureka/blob/main/LICENSE)
______________________________________________________________________

<!-- https://github.com/eureka-research/Eureka/assets/21993118/1abb960d-321a-4de9-b311-113b5fc53d4a

![](images/eureka.png) -->

https://github.com/eureka-research/DrEureka/assets/21993118/61b25aa1-e39d-4e13-b6c3-2a58e3b23492


https://github.com/eureka-research/DrEureka/assets/21993118/47b1b4a2-0ba8-488b-8ae0-2fdf9865bbbe
</div>

Transferring policies learned in simulation to the real world is a promising strategy for acquiring robot skills at scale. However, sim-to-real approaches typically rely on manual design and tuning of the task reward function as well as the simulation physics parameters, rendering the process slow and human-labor intensive. In this paper, we investigate using Large Language Models (LLMs) to automate and accelerate sim-to-real design. Our LLM-guided sim-to-real approach requires only the physics simulation for the target task and automatically constructs suitable reward functions and domain randomization distributions to support real-world transfer. We first demonstrate our approach can discover sim-to-real configurations that are competitive with existing human-designed ones on quadruped locomotion and dexterous manipulation tasks. Then, we showcase that our approach is capable of solving novel robot tasks, such as quadruped balancing and walking atop a yoga ball, without iterative manual design.
Expand Down Expand Up @@ -163,4 +166,4 @@ If you find our work useful, please consider citing us!
author = {Yecheng Jason Ma and William Liang and Hungju Wang and Sam Wang and Yuke Zhu and Linxi Fan and Osbert Bastani and Dinesh Jayaraman}
year = {2024},
}
```
```

0 comments on commit 3ee3d07

Please sign in to comment.