Error Correcting Code (ECC memory) is used in most computers where data corruption cannot be tolerated under any circumstances.
Questions tagged [ecc]
79 questions
33
votes
5 answers
How do I get notified of ECC errors in Linux?
How do I get notified, when a Linux machine equipped with ECC memory recognizes a memory failure? I'm interested in both correctable and uncorrectable errors.
if a message is written to dmesg/the syslog, this is already fine, but I'd love to know…
Jens Erat
- 1,400
- 2
- 11
- 26
31
votes
4 answers
What is ECC ram and why is it better?
I've seen a dicussion about ECC ram use on servers. Why is it better?
Toby Allen
- 747
- 2
- 10
- 23
27
votes
10 answers
Would you use ECC RAM in a workstation?
Is ECC RAM recommended for use in workstations, or is it something that only gets used in servers? If non-ECC RAM works in PCs, why would we need ECC RAM at all?
Jon Tackabury
- 540
- 1
- 7
- 14
20
votes
2 answers
What is the Rowhammer DRAM bug and how should I treat it?
DRAM chips are very tightly packed. Research has shown that neighboring bits can be flipped at random.
What is the probability of the bug triggering at random in a server-grade DRAM chip with ECC (the CMU-Intel paper cites e.g. the number…
Deer Hunter
- 1,070
- 7
- 17
- 25
20
votes
3 answers
Non-ECC memory with ZFS: a stupid idea?
I have a new server and am planning to upgrade the paltry 2 GB of memory to the maximum of 16 GB. (Theoretically 8 GB is the limit, but empirically 16 GB has been shown to work.) Some guides advise that ECC memory is not that important, but I'm not…
iconoclast
- 1,688
- 2
- 18
- 30
18
votes
2 answers
What RAM options do I need to know before buying Server RAM?
This is a proposed Canonical Question about Server Memory.
I have to buy a Dell R420 server and there are various combinations (1600 and 1333 MHz RDIMMS and UDIMMS) and Performance Optimized vs. Advanced ECC with and without sparing. I noticed…
Stasik
- 325
- 2
- 7
17
votes
5 answers
How to check if RAM is running in ECC mode?
I updated this post since I replaced the processor, but the core of my question (and unfortunately the results as well) are the same.
I built my first FreeNAS box and wanted to use ECC RAM since I want to store critical data. Because I am on a…
comfreak
- 1,451
- 1
- 21
- 32
15
votes
1 answer
Should I use bios "Advanced ECC" in Dell PowerEdge R710 Bios with ECC DIMMs?
I have a Dell PowerEdge R710 with dual Intel Xeon E5503 CPUs. It has 96GB(12x8GB) of ECC DIMMs.
In its BIOS, memory is configured for "Advanced ECC".
My question is if my DIMMs are already ECC, does it make sense to enable this "Advanced ECC" mode…
Mxx
- 2,312
- 2
- 26
- 40
11
votes
5 answers
The importance of ECC memory
Are ECC memory modules important to have on a non-critical server?
I was thinking about getting myself a toy dedicated server for lots of random, non-critical stuff. Sporadic reboots are no big deal. I'm looking at one provider but the prices are…
PJK
- 221
- 2
- 5
10
votes
1 answer
How to force ECC error
I'm looking for a way to force an ECC error in a DRAM DIMM to test some code associated with recovering from these errors. I believe Intel makes a test jig for several thousand dollars, but I'm looking for something a bit cheaper.
I've tried buying…
albiglan
- 1,033
- 8
- 11
9
votes
2 answers
What does ECC RAM failure look like
For Non-ECC memory I have a decent idea of what a failure looks like; certain random things start going wrong (e.g. PNG checksums fail validation once and then not the next time), that sort of thing. But I'm relatively new to ECC RAM. What do I…
arcticmac
- 291
- 3
- 11
9
votes
1 answer
How seriously should I take ECC correctable error warnings?
I have a pile of Sun X2200-M2 servers. These servers have ECC memory.
In some of these servers, I am getting warnings in the eLOM about "correctable ECC errors detected", eg:
# ssh regress11 ipmitool sel elist
1 | 05/20/2010 | 14:20:27 | Memory…
David Mackintosh
- 14,223
- 6
- 46
- 77
8
votes
2 answers
ECC chipkill errors: which DIMM?
We often get DIMMs in our servers going bad with the following errors in syslog:
May 7 09:15:31 nolcgi303 kernel: EDAC k8 MC0: general bus error: participating processor(local node response), time-out(no timeout) memory transaction type(generic…
markdrayton
- 2,429
- 1
- 20
- 24
6
votes
1 answer
Which browsers and OSes supports ECC based SSL certificates?
We are evaluating whether to buy a RSA based certificate or a ECC based certificate.
RSA is older and is supported by all browsers.
ECC is newer, they state it is faster due requiring smaller key for a similar security etc.
Unfortunately, I could…
alik
- 369
- 1
- 4
- 11
5
votes
3 answers
Evaluating uncorrectable ECC errors and fallback methods
I run a server which has just experienced an error I've not encountered before. It emitted a few beeps, rebooted, and got stuck at the startup screen (the part where the bios shows its logo and begins listing information) with the error:
Node0:…
Zhro
- 339
- 2
- 11