@suicidaleggroll

suicidaleggroll@lemmy.world · 28 days ago

It’s financial obesity, and should be treated as such

suicidaleggroll@lemmy.world · 1 month ago

The “always free” banner is back on the website, FWIW

suicidaleggroll@lemmy.world · 2 months ago

It didn’t take long to go from “corporations are people” to “the only people that matter are corporations”

suicidaleggroll@lemmy.world · 2 months ago

That’s why you put those devices in a separate VLAN with no routing access to the rest of your network

suicidaleggroll@lemmy.world · 3 months ago

In general, you take the model size in billions of parameters (e.g. 397B), divide it by 2, and add a bit for overhead; that’s roughly how many GB of RAM/VRAM it takes to run it at a “normal” quantization level. For Qwen3.5-397B, that’s about 220 GB. Ideally that would be all VRAM for speed, but you can offload some or all of it to system RAM and run those parts on the CPU; you’ll just take a speed hit.

So for something like Qwen3.5-397B, it takes a pretty serious system, especially if you’re trying to do it all in VRAM.
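
If it helps, here's that rule of thumb as a quick Python sketch. The 0.5 bytes per parameter is just the "divide by 2" step (roughly what a 4-bit quant works out to), and the 10% overhead is my own guess at "a bit for overhead", not a measured number. The function names and the 96 GB VRAM figure are made up for illustration.

```python
def estimate_memory_gb(params_billions, bytes_per_param=0.5, overhead=0.10):
    """Rough RAM/VRAM needed to run a model, in GB.

    bytes_per_param=0.5 is the "divide by 2" step (about 4 bits per weight).
    overhead=0.10 is an assumed allowance for context, buffers, etc.
    """
    weights_gb = params_billions * bytes_per_param
    return weights_gb * (1 + overhead)


def split_vram_ram(total_gb, vram_gb):
    """Return (GB kept in VRAM, GB offloaded to system RAM)."""
    in_vram = min(total_gb, vram_gb)
    return in_vram, total_gb - in_vram


total = estimate_memory_gb(397)  # Qwen3.5-397B
print(f"{total:.0f} GB total")   # 218 GB, in line with the ~220 GB above

# Hypothetical box with 96 GB of VRAM: the rest spills to system RAM.
in_vram, in_ram = split_vram_ram(total, vram_gb=96)
print(f"{in_vram:.0f} GB in VRAM, {in_ram:.0f} GB offloaded to RAM")
```

Treat the result as a ballpark only. Actual usage depends on the exact quant and how much context you run with.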

suicidaleggroll@lemmy.world · 3 months ago

Tesla is a ~~car company~~
Tesla is a ~~battery company~~
Tesla is a robot company <---- you are here
Tesla is a ~~company~~

One can hope at least