You'd have to be doing something where the unified memory is specifically necessary and where it's okay that it's slow. If all you want is to run large LLMs slowly, you can already do that with split CPU/GPU inference on a normal desktop with a 3090, with the added benefit that a smaller model that fits entirely in the 3090 will be blazing fast compared to the same model on the Spark.
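Something like this with llama-cpp-python's layer offloading, just as a sketch (the model path and the 40-layer split are placeholders, tune the split to whatever fits in the 24 GB):

    from llama_cpp import Llama

    # Quantized GGUF model on disk; offload as many layers as fit in the 3090's
    # 24 GB of VRAM, the rest run on the CPU out of system RAM.
    llm = Llama(
        model_path="models/llama-2-70b.Q4_K_M.gguf",  # placeholder path
        n_gpu_layers=40,   # bump this until you hit the VRAM limit
        n_ctx=4096,
    )

    out = llm("Q: Why is unified memory slow for inference? A:", max_tokens=64)
    print(out["choices"][0]["text"])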
Eh, this is way overblown IMO. The product page claims this is for training, and as long as you crank your batch size high enough, you won't run into memory bandwidth constraints.
I've fine-tuned diffusion models streaming from an SSD with no noticeable speed penalty at a high enough batch size.
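For a rough picture of what that setup looks like in PyTorch (the paths, shapes, and tiny stand-in model are all made up): each example is read off the SSD on demand, and a large batch plus a few prefetching workers keeps the GPU busy while the disk catches up.

    import torch
    from torch import nn
    from torch.utils.data import Dataset, DataLoader

    class LatentsOnDisk(Dataset):
        """Streams one .pt tensor per example straight from the SSD."""
        def __init__(self, paths):
            self.paths = paths
        def __len__(self):
            return len(self.paths)
        def __getitem__(self, i):
            return torch.load(self.paths[i])  # hits the disk every time, nothing cached in RAM

    paths = [f"latents/{i:06d}.pt" for i in range(100_000)]    # hypothetical file layout
    loader = DataLoader(LatentsOnDisk(paths), batch_size=256,  # big batch amortizes per-sample I/O
                        num_workers=8, pin_memory=True)        # workers prefetch during the GPU step

    model = nn.Linear(4096, 4096).cuda()                       # stand-in for the diffusion model
    opt = torch.optim.AdamW(model.parameters(), lr=1e-4)
    for batch in loader:
        loss = model(batch.cuda(non_blocking=True)).pow(2).mean()
        loss.backward(); opt.step(); opt.zero_grad()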