Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> we are able to achieve over 80% success on these tasks with only 50 human demonstrations per task

The challenge is that 80% accuracy is a far cry from being actually useful in real-world situations to the point where humans can rely on them to perform the desired task correctly. They have to "just work". And closing the gap on that last 20% becomes exponentially difficult.



Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: