
Yeah, a lot of people don't realize you could spend $2k on a 5090 to run some of the large models.

Or spend $20 a month for models even a 5090 couldn't run, and not have to pay for your own electricity, hardware, maintenance, updates, etc.
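
For rough scale, a minimal break-even sketch in Python (the power draw, daily usage, and electricity rate are assumed values for illustration, not measured figures):

    # Back-of-envelope: upfront GPU cost vs. a $20/month subscription.
    # The power draw, usage, and electricity rate are assumptions for illustration.
    gpu_cost = 2000          # USD, roughly a 5090
    sub_monthly = 20         # USD/month for the hosted plan

    watts = 500              # assumed average draw under load
    hours_per_day = 4        # assumed daily usage
    kwh_rate = 0.15          # assumed USD per kWh

    elec_monthly = watts / 1000 * hours_per_day * 30 * kwh_rate
    breakeven_months = gpu_cost / (sub_monthly - elec_monthly)
    print(f"electricity ~= ${elec_monthly:.2f}/month")
    print(f"break-even ~= {breakeven_months:.0f} months, ignoring maintenance and upgrades")

Under those assumptions the card takes well over a decade to pay for itself, and that's before counting hardware failures or the models outgrowing it.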



$20 a month for a commercial model is price dumping financed by investors. For Ollama it's hopefully a sustainable price.


The $20-a-month models definitely aren't sustainable.

This is why everyone needs to grab every flavour and speedrun building all the tools they need before the infinite money faucets are turned off.

At some point companies will start raising prices or moving towards per-token pricing (which is sustainable, but expensive).


Depends. API pricing from OSS-model inference providers basically has to be sustainable, because of competition in the space.

And with that in mind, I definitely don't use more than a couple of bucks a month in API refills (not that I'm really a power user or anything).

So if you consider the $20 to be balanced between power and non-power users, and factor in the existing rate limits, it's probably not that far off from being profitable, at least on the pure inference side.
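
As a rough sketch of that back-of-envelope argument (the per-million-token price and the average usage are assumed values for illustration, not any provider's actual numbers):

    # Rough margin check on a $20/month plan, pure inference side.
    # Both numbers below are assumptions for illustration, not real provider figures.
    price_per_m_tokens = 1.00        # assumed blended USD per million tokens at OSS-provider rates
    avg_monthly_tokens = 5_000_000   # assumed average tokens per subscriber, power and casual mixed

    inference_cost = avg_monthly_tokens / 1_000_000 * price_per_m_tokens
    print(f"average inference cost ~= ${inference_cost:.2f} against a $20 subscription")

If the blended average lands anywhere near those assumptions, the subscription covers raw inference with room to spare; it's the training and R&D spend that the $20 doesn't touch.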




