Missing memory (BIOS detects 8 x 2GBs but says only 12 GBs installable)

2

OS: OpenSuse 12.3 (x64)

Problem: Unable to use all detected memory

I recently bought some more RAM for my workstation (2Rx4). The motherboard has 8 slots, and while BIOS recognizes that all 8 have a 2GB stick of RAM, the "Total Installable" is only 12GBs.

dmidecode tells me the maximum supported RAM is 32GBs, so I suspect this may be a rank issue, although I'm not super familiar with memory.

Update

To be explicit - does anyone have any ideas why I am unable to use all the detected memory?

I had 8GBs before in a 4x2GB configuration.

RAM details

2GB 240p PC2-5300 CL5 36c 128x4 DDR2-667 2Rx4 1.8V ECC FBDIMM

All 8 slots are registered as active (see dmidecode output) and the BIOS detects all 8 sticks

enter image description here

Motherboard details

Base Board Information
    Manufacturer: Dell Inc.          
    Product Name: 0GU083
    Version: A00
    Serial Number: ..CN1374073H00UP.

Available memory (BIOS also says 12GB installed memory, but lists 8 2GB sticks)

             total       used       free     shared    buffers     cached
Mem:         12039      11888        151          0        549      10825
-/+ buffers/cache:        512      11527
Swap:         1992        573       1419

Full dmidecode output here

UPDATE 2 memtest x86 only detects 12GBs of memory too;

enter image description here

Alex

Posted 2013-10-04T17:55:29.693

Reputation: 255

You forgot to ask a question. – David Schwartz – 2013-10-04T18:00:46.413

2I think his question is why is only 12GBs install-able...and not the full 16GBs... – TheXed – 2013-10-04T18:04:56.890

How much memory did you have before the upgrade? – Ramhound – 2013-10-04T18:06:02.870

There is always a possibility some of his RAM sticks are faulty.. hence why 8 x 2GB ends up as 12GB.. which means 2 RAM stick may be faulty or simply loose connection. Try checking on your RAM (using memtest+ or simply pull and reseat the RAM) and see how you go. If Memtest / Bios shows all 8 slots are used, thats when things goes weird :) – Darius – 2013-10-04T18:09:29.560

Can you confirm/test that of the FBDIMMs work? Or at least see if the Descriptor 4: Multi-bit ECC memory error error clears? – Hennes – 2013-10-04T20:19:21.120

Is all the memory ecc compatible? – Canadian Luke – 2013-10-04T21:15:09.810

Regarding the BIOS screenshot you added: See the 5000X's manual, page 378, FB-DIMMs SPD interface, SM buses 1,2,3, and 4. That will allow the BIOS to detect which memory modules are installed without accessing the memory directly. – Hennes – 2013-10-05T20:34:00.270

Answers

2

I am going out on a limb here. Please let me know if I am wrong and I will either correct or delete the post:

Pre-amble:

  • Your workstation is a Dell precision 490 with a 0GU083 motherboard.
  • The board has an Intel 5000X chipset memory controller which uses fully buffered DIMMs.
  • You have 8 DIMMs installed. All 8 are FB DIMMs with ECC support. (See your DMI decode log, lines 561 till 578 for DIMM #1, followed by the information of the other 7 DIMMS.
  • 8x2 GiB is 16 GiB. The BIOS succesfully detects that 16 GiB is present.

During POST / memory checks an error is encountered: See line 549: Descriptor 4: Multi-bit ECC memory error and line 927: Status: Firmware-detected hardware failure.

Your motherboard uses tries to use the fully buffered DIMMs (FBDs) in sets of four. (See the manual at ftp://ftp.dell.com/Manuals/all-products/esuprt_desktop/esuprt_dell_precision_workstation/precision-490-dt_user%27s%20guide_en-us.pdf).

End_preamble:

This means that:

  1. You do not need to worry about ranks. (?) Not 100% sure here, but ranks is about the electrical path between the memory controller and the paralel connected DIMMs. FB-DIMMs use a serial interface specifically designed to avoid rank problems.
  2. At least one of the DIMMs posted an error. That error is either still present or needs to be cleared.
  3. Linux probably did not add that faulty 2 GiB DIMM.
    New information added to the POST now makes it clean that while the BIOS detects 16GiB it also only 'installs' 12 GiB. Which is what Linux will use. So the part about Linux memory adding or skipping memory area's is not relevant.

What I do not grok is why that results in 12GiB memory rather than 14 GiB. The only reason I can think of of is if it gave up when it ran into an error after sequentially adding the memory regions. Something like this:

Add 2GiB DIMM.    2GiB now available.
Add 2GiB DIMM.    4iB now available.
Add 2GiB DIMM.    6GiB now available.
Add 2GiB DIMM.    8GiB now available.
Add 2GiB DIMM.   10GiB now available.
Add 2GiB DIMM.   12GiB now available.
Add 2GiB DIMM.   ERROR. STOP ADDING.  Remain at 12GiB.
Stopped. Do not try the possibly working remaining DIMM.

Confirmation of that from someone who knows a lot more about how memory gets added by the kernel would be great. Help on how to test for the broken area and how to avoid it would also be great.


Another potential problem is a damaged memory socket or a damaged trace on the motherboard. This can even affect multiply sockets since FB-DIMMs are chained rather than accessed in parallel.

If you check the datasheet for the Intel 5000X memory controller and look at Figure 1.1 then you will see this:

Graphical representation of a 5000X MCH

The figure shows up to 16 FBDIMM's. Up to is a keyword. Your has 8. The sane way for 8 sockets while keeping quad channel memory access would be 4 channels with 2 sockets each.

If one such channel is not working then you would loose two DIMMs, resulting in 12 GiB.

Hennes

Posted 2013-10-04T17:55:29.693

Reputation: 60 739

If its a specific stick, maybe try the dimms one by one, or hit it with memtest? – Journeyman Geek – 2013-10-04T23:45:42.590

1Swapping what is detected at DIMM #7 and DIMM #8 would/should prove that with the detection of 14GiB of RAM rather than the current 12 GiB. – Hennes – 2013-10-04T23:50:46.323

This is such an excellent answer - thank you. I'll try switching the sticks around tomorrow and see if that makes a difference. As far as I can tell all the accessible memory is working fine (I've put literally all 12 GBs to use in some high-memory application code). – Alex – 2013-10-05T04:09:32.227

1Switched 7 & 8 - no change. Then switched 6->7 7->8 8->6 - also no change. The thing is even in the BIOS I'm told there are only 12 GB of installed of memory, despite 8 2GB DIMMs being listed (all of which are identical). – Alex – 2013-10-05T15:54:11.600

1

OK, so the DIMMs themselves are fine. Damn, those were the easy replaceable parts. The only other Idea which I have is due to the serial nature of FBDIMMS (e.g. see the explanation at http://www.simmtester.com/page/news/showpubnews.asp?num=113) or a damaged channel on the motherboard or in the 5000X. You can rule out the last by confirming quad channel memory access in the old configuration.

– Hennes – 2013-10-05T16:45:08.033

The old configuration was using slots 1-4 only - how would I confirm quad channel memory access in this configuration? – Alex – 2013-10-07T02:23:09.433

In the BIOS screenshot you posted your memory is only running in dual channel configuration. Does that change to quad when you remove the newly added DIMMs? – Hennes – 2013-10-07T14:39:59.490

0

I got exactly the same problem on a 12 gb machine but only reading 4gb. After checking the resource monitor in task manager, it was showing 8gb reserved for BIOS and other devices. After unplugging various USB devices rebooting, all 12 Gb came back. Hope this helps.

tron049

Posted 2013-10-04T17:55:29.693

Reputation: 1

-1

I just found this thread because I had a similar problem. BIOS showed 4 GB in each slot but only had 8 GB available in Dual mode after upgrading to Dual Xeon 5365 and 8 x 4 GB at the same time. Before I had 4 x 2 GB with no issues.

I swapped DIMMs around with no real change ... I ended up cleaning the contacts of all DIMMs and when I put them back in it showed 32 GB available in Quad mode and it is now running MemTest86+ on it.

So, never forget that sometimes it may just be a good cleaning is needed.

David Martin

Posted 2013-10-04T17:55:29.693

Reputation: 1

While this helped you, it doesn't seem likely to help the asker, as the BIOS already shows all the installed RAM, just that it was added up wrong. – Canadian Luke – 2014-08-11T19:42:25.577

That is the way mine was ... all 8 showed as 4 GB DIMMs but the total available did not show 32 GB ... not matter how I switched the DIMMs around. What also caught my attention was that it said Dual mode in the BIOS instead of Quad. – David Martin – 2014-08-12T20:22:32.967

Ahhh, sorry, I misread that. MY BAD! – Canadian Luke – 2014-08-12T20:26:41.537