Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
A Visual Guide to Gemma 4 (maartengrootendorst.com)
4 points by mariuz 13 days ago | hide | past | favorite | 1 comment
 help



Incredibly detailed! The vision transformer stuff in particular is very useful to know. It's interesting that the token budgets are so much higher (up to 1120) than GPT, which uses 170 tokens per 512x512 tile. I wonder if that will lead to more granular spatial vision, something GPT struggles with.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: