imaginary_num6er@alien.topB to HardwareEnglish · 1 year agoDell reportedly restricts exports of AMD's fastest gaming GPUs to China — Radeon RX 7900 XTX, RX 7900, Pro W7900 purportedly listed as sanctioned techwww.tomshardware.comexternal-linkmessage-square50fedilinkarrow-up11arrow-down10
arrow-up11arrow-down1external-linkDell reportedly restricts exports of AMD's fastest gaming GPUs to China — Radeon RX 7900 XTX, RX 7900, Pro W7900 purportedly listed as sanctioned techwww.tomshardware.comimaginary_num6er@alien.topB to HardwareEnglish · 1 year agomessage-square50fedilink
minus-squareFrom-UoM@alien.topBlinkfedilinkEnglisharrow-up1·1 year agoYou cant compare using using two different impelementations. You compare only on A1111 or only on SHARK. SHARK doesnt even seem be taking any adavantage of the 4090 being significatly slower than the 7900xtx. The recent A1111 Olive branch made the performance of it almost equal SHARK model. A1111 also full uses the 4090. The new results on the same A1111 implention are here - https://www.pugetsystems.com/labs/articles/amd-microsoft-olive-optimizations-for-stable-diffusion-performance-analysis/ You can divide the 4090’s perf by half if you want no Tensor RT which is 35. Thats still significantly higher than the 7900xtx’s 23
minus-squarenoiserr@alien.topBlinkfedilinkEnglisharrow-up1·1 year ago You compare only on A1111 or only on SHARK That’s seems like an arbitrary handicap. You should use whichever solution runs best on the respective hardware.
minus-squarebubblesort33@alien.topBlinkfedilinkEnglisharrow-up1·1 year agoIt mentions Olive. I don’t know what that is, but it’s suggesting it could cause AMD to catch back up. Is that true? Or is it more likely going to get them an extra 10% performance instead of the extra 110% they need to catch up?
You cant compare using using two different impelementations. You compare only on A1111 or only on SHARK.
SHARK doesnt even seem be taking any adavantage of the 4090 being significatly slower than the 7900xtx.
The recent A1111 Olive branch made the performance of it almost equal SHARK model. A1111 also full uses the 4090.
The new results on the same A1111 implention are here -
https://www.pugetsystems.com/labs/articles/amd-microsoft-olive-optimizations-for-stable-diffusion-performance-analysis/
You can divide the 4090’s perf by half if you want no Tensor RT which is 35. Thats still significantly higher than the 7900xtx’s 23
That’s seems like an arbitrary handicap. You should use whichever solution runs best on the respective hardware.
It mentions Olive. I don’t know what that is, but it’s suggesting it could cause AMD to catch back up. Is that true? Or is it more likely going to get them an extra 10% performance instead of the extra 110% they need to catch up?