

The “always free” banner is back on the website, FWIW


The “always free” banner is back on the website, FWIW


It didn’t take long to go from “corporations are people” to “the only people that matter are corporations”


That’s why you put those devices in a separate VLAN with no routing access to the rest of your network


In general, you take the model size in billions of parameters (eg: 397B), divide it by 2 and add a bit for overhead, and that’s how much RAM/VRAM it takes to run it at a “normal” quantization level. For Qwen3.5-397B, that’s about 220 GB. Ideally that would be all VRAM for speed, but you can offload some or all of that to normal RAM on the CPU, you’ll just take a speed hit.
So for something like Qwen3.5-397B, it takes a pretty serious system, especially if you’re trying to do it all in VRAM.


One can hope at least
It’s financial obesity, and should be treated as such