Corsair Community

Bad CM72SD512RLP-3200/W



Hello, I think I have one or more bad modules in a set of four, though I am not sure how many. I got the following machine check exceptions from the Linux /dev/mcelog:

 

MCE 0
CPU 1 4 northbridge TSC ba5f616a4fd4
ADDR 64389cd0
Northbridge Chipkill ECC error
Chipkill ECC syndrome = 45d1
bit32 = err cpu0
bit46 = corrected ecc error
bus error 'local node origin, request didn't time out
generic read mem transaction
memory access, level generic'
STATUS 9468c00145080813 MCGSTATUS 0

MCE 1
CPU 1 4 northbridge TSC bb7593d9ce06
ADDR 64389cd0
Northbridge Chipkill ECC error
Chipkill ECC syndrome = 45d1
bit32 = err cpu0
bit46 = corrected ecc error
bus error 'local node origin, request didn't time out
generic read mem transaction
memory access, level generic'
STATUS 9468c00145080813 MCGSTATUS 0

MCE 2
CPU 1 4 northbridge TSC bda1f63a84f4
ADDR 63f69cd0
Northbridge Chipkill ECC error
Chipkill ECC syndrome = 45d1
bit32 = err cpu0
bit46 = corrected ecc error
bus error 'local node origin, request didn't time out
generic read mem transaction
memory access, level generic'
STATUS 9468c00145080813 MCGSTATUS 0
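For reference, the STATUS value in these records can be unpacked by hand to check it against the decoded lines. The following is a minimal Python sketch, not the mcelog implementation. Bits 32 and 46 are the ones the log itself names; bits 63, 61 and 58 follow the usual x86 machine-check status layout; the syndrome layout (high byte in bits 31-24, low byte in bits 54-47) is an assumption that reproduces the logged syndrome 45d1. The function name `decode_status` is made up for this example.

```python
def decode_status(status: int) -> dict:
    """Unpack a few fields of a northbridge MCE STATUS word (sketch)."""
    syndrome_hi = (status >> 24) & 0xFF  # assumed: syndrome bits 15..8
    syndrome_lo = (status >> 47) & 0xFF  # assumed: syndrome bits 7..0
    return {
        "valid": bool((status >> 63) & 1),          # record is valid
        "uncorrected": bool((status >> 61) & 1),    # error was not corrected
        "addr_valid": bool((status >> 58) & 1),     # ADDR field is meaningful
        "corrected_ecc": bool((status >> 46) & 1),  # "bit46 = corrected ecc error"
        "err_cpu0": bool((status >> 32) & 1),       # "bit32 = err cpu0"
        "syndrome": (syndrome_hi << 8) | syndrome_lo,
    }


if __name__ == "__main__":
    fields = decode_status(0x9468C00145080813)
    for name, value in fields.items():
        shown = format(value, "04x") if name == "syndrome" else value
        print(f"{name}: {shown}")
```

Under these assumptions the value decodes to a valid, corrected (not uncorrected) ECC error with a valid address, which agrees with the text mcelog printed.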

 

There were many more like these, and they occurred all day. Luckily the server was lightly loaded and mostly idle. I have ample logs to post if necessary.
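With a long log, it can help to tally how often each address and syndrome pair recurs. The sketch below is one possible way to do that in Python; the `tally` helper and the abridged `SAMPLE` text (only the ADDR and syndrome lines of the three records shown above) are illustrative. Note that a physical address alone does not identify a DIMM; that mapping depends on the board's memory layout, which is not given here.

```python
import re
from collections import Counter

# Abridged copy of the three records quoted above (ADDR and syndrome lines only).
SAMPLE = """\
MCE 0
ADDR 64389cd0
Chipkill ECC syndrome = 45d1
MCE 1
ADDR 64389cd0
Chipkill ECC syndrome = 45d1
MCE 2
ADDR 63f69cd0
Chipkill ECC syndrome = 45d1
"""


def tally(log_text: str) -> Counter:
    """Count (address, syndrome) pairs in mcelog-style text."""
    counts = Counter()
    addr = None
    for line in log_text.splitlines():
        line = line.strip()
        m = re.match(r"ADDR\s+([0-9a-fA-F]+)", line)
        if m:
            addr = m.group(1)  # remember the address until its syndrome line
            continue
        m = re.search(r"syndrome\s*=\s*([0-9a-fA-F]+)", line)
        if m and addr is not None:
            counts[(addr, m.group(1))] += 1
            addr = None
    return counts


if __name__ == "__main__":
    for (addr, syndrome), n in tally(SAMPLE).most_common():
        print(f"ADDR {addr} syndrome {syndrome}: {n}")
```

On the three records above, all share syndrome 45d1 and two share the address 64389cd0.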

 

I replaced the memory and everything has been fine ever since.

 

I would like to RMA these for the same parts, or at least have them tested.

I ran memtest86 when I first got these modules, maybe a year or a year and a half ago, and got intermittent errors. The errors did not always show up, so I couldn't say for sure at the time that the memory was bad. I did the usual things: checked and reseated each module, then placed the modules in another identical server and tested them, but everything came out OK.

I even tested other memory in the same server (another set of four CM72SD512RLP-3200) and got OK results.

 

I always suspected something was wrong with these modules. :(

 

The memory was in a dual Opteron 246 Tyan Thunder system.

Latest BIOS, etc. I tried everything.

 

Thanks

Ernie


  • Corsair Employees

These modules do not support Chipkill, so please make sure it is disabled in your BIOS, then test the modules one at a time with http://www.memtest.org. I would also set the DIMM voltage to 2.7 volts. If you do find one failing, then let's get it or them replaced. Please follow the link in my signature, “I think I have a bad part!”, and we will be happy to replace it or them!

However, if you get errors with all of the modules, that would suggest some other problem, and I would try testing the memory in another system to be sure.
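The one-at-a-time procedure above boils down to a simple decision on the per-module results. As a rough Python sketch (the `triage` function and the module labels A to D are hypothetical, and the error counts would come from your own test runs):

```python
def triage(errors_by_module: dict) -> str:
    """Map per-module memtest error counts to the next step suggested above."""
    failing = sorted(name for name, errors in errors_by_module.items() if errors > 0)
    if not failing:
        return "no module failed on its own"
    if len(failing) == len(errors_by_module):
        # Errors on every module point away from the memory itself.
        return "every module failed: suspect another problem, retest in another system"
    return "replace: " + ", ".join(failing)


if __name__ == "__main__":
    # Hypothetical counts from testing four modules individually.
    print(triage({"A": 0, "B": 3, "C": 0, "D": 0}))
```

What to do when no module fails on its own is not covered by the advice above, so the sketch only reports that case.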


Archived

This topic is now archived and is closed to further replies.
