What to do when machine fails a Prime95 stress test?

5

2

In an effort to diagnose the cause of random resets and freezes I have been running a series of stress tests on my two year old computer. Thus far the machine has passed nine hours of the Prime95 Small FFT test, thirty minutes of the standard burn-in FurMark GPU stress test and thirteen hours (eleven passes) of MemTest86+. The random resets and freezing have been experienced under clean installs of both Windows 7 Ultimate and Windows 8 Pro. According to SpeedFan all case, GPU and CPU temperatures report within normal ranges.

Nevertheless, my computer fails both the Prime95 In-place large FFTs and the Blend stress tests. Sometimes the machine resets immediately. While other times it will run up to ten minutes. Regardless, it fails and either resets or freezes when running these tests.

These are my machine's specs:

I have never overclocked the machine. The BIOS is set to the default configuration which results in Auto being set for most options.

This is the machine's BIOS configuration (screenshots):

BIOS

  • BIOS Version: U1d (this is a UEFI BIOS though the issue was experienced under the original BIOS)

CPU

  • CPU Clock Ratio: 33
  • CPU Frequency: 3.30GHz
  • Internal CPU PLL Overvoltage

Memory

  • Extreme Memory Profiler: disabled
  • System Memory Profiler: 16.00
  • Memory Frequency: 1600MHz
  • Performance Enhacne: Turbo
  • Voltage: 1.5V
  • Profile VTT Voltage: 1.05V
  • Timing: 9-9-9-28 Newegg lists timing as 9-9-9-24 fails the same regardless of setting

Voltage Settings

  • CPU Vcore: 1.85V
  • PCH Core: 1.050V
  • PCH Core: 1.050V
  • CPU PLL: 1.800V
  • System Agent Voltage: 0.920V
  • DRAM Voltage: 1.500V
  • DRAM Voltage Reference: 0.750V
  • DRAM Termination: 0.750V
  • Data Reference (CH A): 0.750V
  • Address Reference (CH A): 0.750V
  • Data Reference (CH B): 0.750V
  • Address Reference (CH B): 0.750V

While the failure has given me hope that I have begun to isolate the issue I am still unclear as to what to test from here. Given recent reviews of the motherboard I am wondering if I just have a bad board and need to replace it. Could the processor actually be bad? Are there further tests I should run? What's my next step when my computer fails a Prime95 large FFT stress test?

ahsteele

Posted 2013-08-29T18:26:17.250

Reputation: 1 730

Programs like Prime95 can often make voltage configuration changes if allowed to so. Why is EMP disabled? – Ramhound – 2013-08-29T18:44:29.843

@Ramhound I have just taking the default of the Prime95 tests. It is affecting the BIOS I am doing nothing to stop it. EMP is disabled by default. I can certainly turn it on if you think it would help. – ahsteele – 2013-08-29T19:07:53.293

Try the other suggestion first be sure you turn adaptive voltage off – Ramhound – 2013-08-29T21:08:45.643

You might a flaky device driver and/or flaky hardware, the only thing to do is to replace hardware one at a time, and see if the problem goes away, very tedious. – Chris O – 2013-08-30T02:00:28.690

@Ramhound it doesn't appear that Gigabyte motherboards (or at least this model) have Adaptive Voltage. I have added additional BIOS configuration information to the question. Unfortunately, the suggestions in nvuono's answer have not yet been fruitful. Any other suggestions would be incredibly helpful.

– ahsteele – 2013-08-30T02:02:34.967

@ChrisO Yeah I thought I might end up having to do that. My goal with the stress testing was to isolate what hardware I should replace first. I have a limited budget so I'd like to try and replace whatever makes the most sense first. – ahsteele – 2013-08-30T02:05:17.667

Right, make perfect sense. Can you borrow a friend's video board temporarily for testing purposes? That's my first guess as to what to try. – Chris O – 2013-08-30T02:08:28.197

@ChrisO The GPU was my first thought as well, but I moved on from blaming it after the FurMark GPU stress test ran for 30 minutes (supposedly a sufficient amount of time). With the failure occuring during a Prime95 stress test what leads you to believe it is the GPU? Is there a better stress test I can run on the GPU to isolate it as the issue? (Truth be told I would love this to be a GPU issue as it is the easiest component to replace).

– ahsteele – 2013-08-30T02:21:05.910

Oops, my little brain just assumed that Prime95 was GPU-accelerated (it's not). Sooooo... wacky crashes with large memory sets? Try swapping the RAM sticks first. Have you tried a thorough memory diagnostic? – Chris O – 2013-08-30T02:28:44.573

@ChrisO I ran MemTest86+ for thirteen hours (eleven passes) without incident. I have not yet swapped the sticks or isolated them.

– ahsteele – 2013-08-30T02:35:34.200

Answers

1

Ultimately, the issue was bad memory. I finally determined that I was dealing with faulty memory by placing the DIMMs in a different motherboard and running the Prime95 In-place large FFTs and the Blend stress tests which resulted in the same system faults.

When I provided this information to G.SKILL they honored their warranty and sent me new modules.

ahsteele

Posted 2013-08-29T18:26:17.250

Reputation: 1 730

3

I see no issues with BIOS settings or running your memory at rated specs. However, assuming your BIOS temp readings are accurate you likely have thermal issues. 61°C at idle in BIOS is extremely high for optimized defaults. Even your ambient of 48°C seems high. Likely ,your cooling in general needs upgrading. For sure , the heat sink needs upgrade or a remount at least and its possible that your case needs additional fans or cleaning those dust bunnies.

Large fft stress testing produces the most heat and will reveal thermal issues and has no doubt happened to you.

owcraftsman

Posted 2013-08-29T18:26:17.250

Reputation: 31

I think it is heat related as my CPU fan is acting up. However, after putting the CPU fan on a case fan connection (so the fan runs at full speed all the time) and leaving the case open the machine is still failing the blend stress test. Definitely the CPU temperature is being kept down, so not sure what it is causing the failure. – ahsteele – 2014-02-09T19:42:49.730

1

There is an answer on a related question BSODs and Prime95 failures where a user, ryantmer, experiencing the same issues increased the RAM voltage from the AUTO setting of 1.5V up to 1.65V and it solved the BSOD issues for them.

You don't have the same RAM but I think it's worth a shot:

I have the exact same RAM kit, and the problem is that it will automatically set itself at 1.5V, whereas it's made to run at 1.65V. Change this in your BIOS, and you should be golden! (Ha.)

Also try searching through the feedback on newegg for related voltage issues. Most people don't have any problems but you'll find some good suggestions in the reviews there:

Other Thoughts: I'm writing this for people who are experiencing random BSODs with this ram in dual channel mode. It seems this RAM, or maybe my motherboard or CPU (i5 2500K), really does not like even as little as ~.05 VCCSA/IO overvoltage. The motherboard defaults to 1.058v VCC on stock settings and this turned out to be too much! The only thing that made the BSODs stop was reducing the voltage down from default to 1.01v. That's contrary to most of the advice about memory errors, which suggests increasing the VCC voltage. Wish I had known this earlier! Hope this helps someone!

And even manufacturer responses:

Manufacturer Response: . Dear Customer

Thank you for your G.Skill purchase and review. Glad to hear the memory is working well in your computer. With an Intel i3 CPU, it does not fully support DDR3-1600 CL9, but you can still utilize the performance of the DRAM by lowering frequency and timings. This high performance memory is unlike standard plug and play memory; input DDR3-1333 8-8-8-24 1.50V in BIOS, and it will perform quicker. With a slightly higher DRAM voltage, it may be capable of even better. This is the tweaking ability of good G.Skill RAM. Should you have any further questions or concerns, please feel free to contact us directly.

Thank you GSKILL SUPPORT

Quality and customer service are our top priorities.

Tech Support Email: ustech@gskillusa.com RMA Dept Email: rma@gskillusa.com G.Skill Forum: http://www.gskill.us/forum/

nvuono

Posted 2013-08-29T18:26:17.250

Reputation: 734