Hardware Watch
  • Communities
  • Create Post
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
imaginary_num6er@alien.topB to HardwareEnglish · 1 year ago

Dell reportedly restricts exports of AMD's fastest gaming GPUs to China — Radeon RX 7900 XTX, RX 7900, Pro W7900 purportedly listed as sanctioned tech

www.tomshardware.com

external-link
message-square
50
fedilink
1
external-link

Dell reportedly restricts exports of AMD's fastest gaming GPUs to China — Radeon RX 7900 XTX, RX 7900, Pro W7900 purportedly listed as sanctioned tech

www.tomshardware.com

imaginary_num6er@alien.topB to HardwareEnglish · 1 year ago
message-square
50
fedilink
Dell has reportedly asked its customers not to supply AMD's latest graphics cards to China and 22 other locations.
  • upbeatchief@alien.topB
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    I know that the xtx kept up with the 4090 in stable diffusion before the tensorRT update, so there might be some places where the xtx can be a replacement when you build software from the grounds up and willing to lose performance for the benefit of less eyes and hassle on Amd products

    • From-UoM@alien.topB
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      Got a source for that keeping up?

      • noobitom@alien.topB
        link
        fedilink
        arrow-up
        1
        ·
        1 year ago

        https://www.pugetsystems.com/labs/articles/stable-diffusion-performance-nvidia-geforce-vs-amd-radeon/

        • From-UoM@alien.topB
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 year ago

          You cant compare using using two different impelementations. You compare only on A1111 or only on SHARK.

          SHARK doesnt even seem be taking any adavantage of the 4090 being significatly slower than the 7900xtx.

          The recent A1111 Olive branch made the performance of it almost equal SHARK model. A1111 also full uses the 4090.

          The new results on the same A1111 implention are here -

          https://www.pugetsystems.com/labs/articles/amd-microsoft-olive-optimizations-for-stable-diffusion-performance-analysis/

          You can divide the 4090’s perf by half if you want no Tensor RT which is 35. Thats still significantly higher than the 7900xtx’s 23

          • bubblesort33@alien.topB
            link
            fedilink
            English
            arrow-up
            1
            ·
            1 year ago

            It mentions Olive. I don’t know what that is, but it’s suggesting it could cause AMD to catch back up. Is that true? Or is it more likely going to get them an extra 10% performance instead of the extra 110% they need to catch up?

          • noiserr@alien.topB
            link
            fedilink
            English
            arrow-up
            1
            ·
            1 year ago

            You compare only on A1111 or only on SHARK

            That’s seems like an arbitrary handicap. You should use whichever solution runs best on the respective hardware.

        • Qesa@alien.topB
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 year ago

          That is, unfortunately, sorely outdated. Particularly with the advent of tensorRT. Best case vs best case the 4080 is about twice as fast today

          https://www.tomshardware.com/pc-components/gpus/stable-diffusion-benchmarks#section-stable-diffusion-512x512-performance

          • From-UoM@alien.topB
            link
            fedilink
            English
            arrow-up
            1
            ·
            1 year ago

            The gap would be even larger if, or to be precise WHEN, Fp8 and/or sparisity will be used on the Ada Lovelace cards.

          • moofunk@alien.topB
            link
            fedilink
            arrow-up
            1
            ·
            1 year ago

            Of note, TensorRT doesn’t support SDXL yet.

            • DuranteA@alien.topB
              link
              fedilink
              English
              arrow-up
              1
              ·
              1 year ago

              This is no longer true.
              If you use NV’s TensorRT plugin with the A1111 development branch, TensorRT works very well with SDXL (it’s actually much less painful to use than SD1.5 TensorRT was initially).

              The big constraint is VRAM capacity. I can use it for 1024x1024 (and similar-total-pixel-count) SDXL generations on my 4090, but can’t go much beyond that without tiling (though that is generally what you do anyway for larger resolutions).

              Just like for SD1.5, TensorRT speeds up generation by almost a factor of 2 for SDXL (compared to an “optimized” baseline using SDP).

              • moofunk@alien.topB
                link
                fedilink
                English
                arrow-up
                1
                ·
                1 year ago

                Alright thanks. This stuff is moving very fast, and I was only looking at the master branch.

Hardware

hardware

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !hardware@hardware.watch

A place for quality hardware news, reviews, and intelligent discussion.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 1 user / day
  • 1 user / week
  • 26 users / month
  • 76 users / 6 months
  • 1 local subscriber
  • 68 subscribers
  • 506 Posts
  • 4.91K Comments
  • Modlog
  • mods:
  • communick
  • rglullis@communick.news
  • BE: 0.19.8
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org