Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Nobody is deploying 3+GB models to iOS beyond some enthusiast “because you can” apps. Amazing tech but not feasible for any mainstream use yet.

Eg: https://apps.apple.com/app/id6444050820



If you have played any large mobile games, then you would not be surprised to see apps downloading massive files during first open.


A small download + an in-app weights download (and a space requirement warning) is probably sane, right?


I agree, we're too far down a chain of hypotheticals motivated by "but ONNX must be bad compared to $MODELX.cpp?"

Wouldn't make sense to deploy 4-bit quantization as a product either.


The size makes it tough for App Store deployment, but I could imagine using a local LLM on-device for an enterprise app.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: