A Vision-Language-Action Flow Model for General Robot Control

Diffusion model which tries to match expert behaviour with a transformer

see also

Type:
Tags:
Status:
Location:
Created: 23-01-25 16:24

https://www.youtube.com/watch?v=bemrcQcHmMk

Source

Black, K., Brown, N., Driess, D., Esmail, A., Equi, M., Finn, C., Fusai, N., Groom, L., Hausman, K., Ichter, B., Jakubczak, S., Jones, T., Ke, L., Levine, S., Li-Bell, A., Mothukuri, M., Nair, S., Pertsch, K., Shi, L. X., … Zhilinsky, U. (n.d.). π0: A Vision-Language-Action Flow Model for General Robot Control.