GPU Performance limit thermal.

Good to see hotspot added in the latest beta now.

I wonder if in this case you are hitting a vrm limit?
Will be interesting to see if new thermal pads fixes it.
 
This is very likely a driver bug or something. I have the MSI Gaming X Trio 3090 and I have managed to flip both power and thermal limits to yes by simply alt-tabbing in and out of Guild Wars 2 and believe me, Guild Wars 2 is DEFINITELY not pushing this card anywhere near thermals or power limits. My card's GPU core doesn't even go above 54°C while playing it, memory junction hovers around 62°C ~ 64°C and hot spot temp around 65°C give or take. Power is usually around 200W ~ 220W with a limit of 380W. I also read in other places that games which use Vulkan or DX12 can also cause this to happen when you alt-tab out of them, but Guild Wars 2 is DX9 so it doesn't seem like it is limited to just those two graphics APIs.

EDIT: Adding image. This time I even got max operating voltage to flip too, even though my GPU is slightly undervolted. Yes, with Guild Wars 2 and alt-tabbing in and out of it.
 

Attachments

  • HWiNFO64_2021-04-10_03-15-35.jpg
    HWiNFO64_2021-04-10_03-15-35.jpg
    169.9 KB · Views: 25
Last edited:
I also see "fake/impossible" readings for thermal limit.
MSI 3080 Suprim X water cooled, latest nvidia drivers 466.11.
MSI Afterburner temp limit is also monitored, but its graph shows flat 0 all the time.


1619648032788.png
 
Okay will do that. It make take days before thermal flip happen. I will set alarm for it in order to see it.
 
Oh, that's too long, the DBG file will grow extremely large. It would be much better if this could be more quickly triggered.
When it takes so long to hit that limit, I'm wondering how have you been able to check in MSI Afterburner that there wasn't a short spike in the past? Have you logged all values?
 
No I have not logged.
Yesterday I had in happen it less than hour. Yesterday evening I rebooted my pc for installing something, and some minutes later I played ACC and after quit I noticed that thermal has been triggered in hwinfo and then I just opened the AB monitor and extended the window maximal width. My monitor is large and it fits about one hour of data :)
Here is sample screenshot from today taken few minutes ago :)
1619680875794.png
 
I will try to force it with debug enabled. Will experiment .. play a game do some alt-tab, run the quickminer, switch MSI AB profiles.
Is it a good idea to enable sensors logging in hwinfo together with debug? My guess is that thermal limit could be flipped only for a moment and is possible not to not be logged in the csv.
 
There's no need to enable CSV logging in HWiNFO, that won't provide me more details. I need to see the raw data in DBG when it happens, the only issue is that long runs will produce a very large file.
 
The difference in capturing of the thermal limit is most likely because it's a very short spike and it depends on the polling interval in a particular tool.
The fact that MSI AB sees this too confirms that it's correctly reported. Why it happens - only NVIDIA knows exactly as this is what the drivers say. Might be a bug there...
 
Back
Top