Unfortunately we need to be careful what we wish for. As AI processing becomes more and more popular, if a card like the RTX 4070 had 20 GB vram they would probably all be snatched up by companies to use for AI processing on the cheap, and suddenly 4070 and higher all go for $2000 like the mining craze all over again.
I doubt any big, mainstream western corporations would use GeForce cards for AI, even if they have the same amount of vRAM as workstation cards (and they’d never have the same memory as AI-specific cards such as the H100). Hobbyists will for sure, but they make up such a tiny portion of the market that I doubt it would change the overall demand. Chinese and Russian companies will, I guess, but I don’t think that would have the same effect as crypto mining, which has immediate returns without any expertise for individual users, unlike AI applications.
Nvidia has things like the A series for this sort of stuff, and at their level of use they’ll be buying at a distributor or direct from manufacturer level depending on size.
Some small places will want 4090s and things though, and then yeah maybe they’ll buy from stores and that but I doubt that’ll be enough to kick demand up and cause prices like the crypto boom.
There’s no advantage of the A series unless you plan to use the unique virtual GPU features. Nvidia added a toggle for the 4090 to make use of ECC, which apparently the memory can already do, to turn it into a professional card with less VRAM. It’s otherwise nearly identical to the A series cards, minus the NVLINK to connector two A series cards together.
Hobbyists, sure. Startups, maybe (from what I’ve seen, they’re far more likely to rent a server or something). Mainstream corporations, definitely not.
NVLink is no longer supported on the Ada Lovelace GPU architecture that powers Nvidia’s flagship RTX 4090 graphics. Replacing NVLink is the PCIe Gen 5 standard. Nvidia will use the freed up space from the removal of NVLink to cram in more AI processing capabilities.
Well there’s still only a finite number of them being fabbed and manufactured. Whether they custom order them in bulk or not it still reduces available supply for other customers.
I’m tempted, the frankendriver project fixes the driver issue with these cards (at least for now) and it seems like most people have a good experience with them…
If they were readily available they would be a tempting option! But no, Nvidia continue to be frugal with vram in the consumer space…
Unfortunately we need to be careful what we wish for. As AI processing becomes more and more popular, if a card like the RTX 4070 had 20 GB vram they would probably all be snatched up by companies to use for AI processing on the cheap, and suddenly 4070 and higher all go for $2000 like the mining craze all over again.
I doubt any big, mainstream western corporations would use GeForce cards for AI, even if they have the same amount of vRAM as workstation cards (and they’d never have the same memory as AI-specific cards such as the H100). Hobbyists will for sure, but they make up such a tiny portion of the market that I doubt it would change the overall demand. Chinese and Russian companies will, I guess, but I don’t think that would have the same effect as crypto mining, which has immediate returns without any expertise for individual users, unlike AI applications.
Why wouldn’t they?
Official support, drivers, bulk orders direct from Nvidia, staying on Nvidia’s good side
Nvidia has things like the A series for this sort of stuff, and at their level of use they’ll be buying at a distributor or direct from manufacturer level depending on size.
Some small places will want 4090s and things though, and then yeah maybe they’ll buy from stores and that but I doubt that’ll be enough to kick demand up and cause prices like the crypto boom.
There’s no advantage of the A series unless you plan to use the unique virtual GPU features. Nvidia added a toggle for the 4090 to make use of ECC, which apparently the memory can already do, to turn it into a professional card with less VRAM. It’s otherwise nearly identical to the A series cards, minus the NVLINK to connector two A series cards together.
Plenty of people using 4090s tho
Hobbyists, sure. Startups, maybe (from what I’ve seen, they’re far more likely to rent a server or something). Mainstream corporations, definitely not.
Exactly, even though Nvidia removed NVLink it’s still a popular card for deep learning coz it’s relatively cheap for small labs.
NVLink died because PCIE Gen5 can handle incorrect now, no external hardware needed.
PCIe 5.0 cannot do what NVLink does, and NVLink isn’t dead at all.
https://www.windowscentral.com/hardware/computers-desktops/nvidia-kills-off-nvlink-on-rtx-4090#:~:text=NVLink is no longer supported,in more AI processing capabilities.
The RTX A6000 ADA also drops nvlink. No idea why the downvotes
Doesn’t matter what western companies would do, matters that countries accounting for a billion+ people and second largest economy would use them.
these companies they pre-order them, they arent like your average miners that go to AIB order a bulk like black market.
Well there’s still only a finite number of them being fabbed and manufactured. Whether they custom order them in bulk or not it still reduces available supply for other customers.
Frankenstein 3080M 16GB is interesting, 3080 mobile so roughly on par with desktop 3070 but with 16GB VRAM…and they’re pretty cheap.
https://www.alibaba.com/product-detail/Graphics-Card-RTX3080m-GPU-GeForce-RTX_1600880058343.html?spm=a2700.shop_plgr.41413.35.39f45071W40035
I’m tempted, the frankendriver project fixes the driver issue with these cards (at least for now) and it seems like most people have a good experience with them…