DI-engine Agent Demo List

This issue is a collection of various interesting agent demonstration trained by DI-engine, it will be updated continually.

Mario 1-1

https://user-images.githubusercontent.com/33195032/184529481-f55642f7-e9e8-45d3-9b8c-80e5bf3e177b.mp4
Mario 1-2

https://user-images.githubusercontent.com/33195032/184669648-6309ba32-4d33-4b07-8ce0-cce89c82bf97.mp4
rocket landing

https://user-images.githubusercontent.com/33195032/204750411-2c3c4fb3-de6c-474a-a4d3-dd380a605de1.mp4
SMAC 5m VS 6m

https://user-images.githubusercontent.com/33195032/125911080-2fdbd8ce-1107-455b-9772-eac20b717dab.mp4
SMAC MMM

https://user-images.githubusercontent.com/33195032/125910360-de914769-f0fc-4073-85a7-b12b1ef32fa7.mp4
SMAC MMM2

https://user-images.githubusercontent.com/33195032/126481161-9127f790-bfca-4985-bed3-7adaab519b46.mp4
SMAC 3s5z

https://user-images.githubusercontent.com/33195032/125908605-d694bb6a-b10a-4b58-9ec2-b4eb92cfe7a3.mp4
lunarlander

https://user-images.githubusercontent.com/33195032/127827210-cbc1e23c-6ed4-4d5b-a3c4-34a010119993.mp4
gfootball
- rule-based bot vs rule-based bot
https://user-images.githubusercontent.com/33195032/125933672-db8a6229-c2ea-4ab6-a53e-355a2258a0f3.mp4
- trained agent vs rule-based bot
https://user-images.githubusercontent.com/33195032/125933646-e24df3ec-9ec3-41d7-a693-136331f6a27a.mp4
slime_volley
- rule-base bot vs trained agent

Jul 09 '21 08:07 PaParaZz1

gym-pubulley-drones Flythrugate random bot VS trained agent

https://user-images.githubusercontent.com/93968541/207054782-6cea7a97-9a0d-41bb-8d58-4484c877c6e0.mp4

Dec 01 '22 12:12 zjowowen

Evogym Carrier-v0 random bot VS trained agent

https://user-images.githubusercontent.com/93968541/210569397-ad55fd85-71ee-48a3-889d-93e3e85ebf4f.mp4

agent during early training VS trained agent

https://user-images.githubusercontent.com/93968541/210812515-bb4a10af-b319-4238-89bf-514a0a4a473c.mp4

Jan 04 '23 13:01 zjowowen

Procgen Bigfish random bot VS trained agent

https://user-images.githubusercontent.com/93968541/212252949-250e96fd-cc03-4565-aae0-355c85ffe7f1.mp4

Jan 13 '23 06:01 zjowowen

Metadrive env with PPO algorithm demonstration. Map Sequence: Straight + Intersection + Straight + Roundabout

https://user-images.githubusercontent.com/49814804/218450981-4a72e55d-ec4d-4980-b52b-3bef6dbc72a0.mp4

Feb 13 '23 11:02 timothijoe

cliffwalking agent trained by DQN random agent vs trained agent

https://github.com/opendilab/DI-engine/assets/93968541/756cbba6-4405-4882-8753-6e4288e40a40

Dec 14 '23 09:12 zjowowen

Hi @zjowowen, thank you for your sharing. I would like to ask about fly through gate simulation, did you change the compute reward and terminal function or you use the same with original work.

Dec 31 '23 09:12 phuongboi