olego Posted January 14, 2011 Share Posted January 14, 2011 Hello! I have the desktop with following hardware: 1. MB: ASUS P5QL/EPU 2. RAM: CM2X2048-8500-C5D ver1.2 2.1V 5-5-5-15 (2 x 2 GB) 3. CPU: Intel Dual-Core E6500 4. GPU: nVidia GeForce 9400 GT with binary driver This machine is running for more than two years without any problem, with a default BIOS settings related to RAM, but during the last 2 or 3 months I get deadlock hangs once or twice a week. This machine runs Slackware-current 32 bit with custom compiled vanilla kernel with two additional patches - BFS and TuxOnIce. Deadlocks usually occur when there is no user activity - only rtorrent is running and two KDE4 sessions is open (with firefox, okular, claws-mail, goldendict, virtualbox, rxvt-unicode and other memory consuming apps). This is excerpt from my syslog: Jan 6 20:47:13 oleg2 mcelog: failed to prefill DIMM database from DMI data Jan 6 20:47:13 oleg2 mcelog: Kernel does not support page offline interface Jan 6 20:47:13 oleg2 mcelog: HARDWARE ERROR. This is *NOT* a software problem! Jan 6 20:47:13 oleg2 mcelog: Please contact your hardware vendor Jan 6 20:47:13 oleg2 mcelog: MCE 0 Jan 6 20:47:13 oleg2 mcelog: CPU 0 BANK 0 Jan 6 20:47:13 oleg2 mcelog: TIME 1294339633 Thu Jan 6 20:47:13 2011 Jan 6 20:47:13 oleg2 mcelog: MCG status: Jan 6 20:47:13 oleg2 mcelog: MCi status: Jan 6 20:47:13 oleg2 mcelog: Error overflow Jan 6 20:47:13 oleg2 mcelog: Uncorrected error Jan 6 20:47:13 oleg2 mcelog: Error enabled Jan 6 20:47:13 oleg2 mcelog: Processor context corrupt Jan 6 20:47:13 oleg2 mcelog: MCA: BUS Level-0 Local-CPU-originated-request Generic Memory-access Request-timeout Error Jan 6 20:47:13 oleg2 mcelog: BQ_DCU_READ_TYPE BQ_ERR_HARD_TYPE BQ_ERR_HARD_TYPE Jan 6 20:47:13 oleg2 mcelog: timeout BINIT (ROB timeout). No micro-instruction retired for some time Jan 6 20:47:13 oleg2 mcelog: failure that caused IERR Jan 6 20:47:13 oleg2 mcelog: STATUS f200084000000800 MCGSTATUS 0 Jan 6 20:47:13 oleg2 mcelog: MCGCAP 806 APICID 0 SOCKETID 0 Jan 6 20:47:13 oleg2 mcelog: CPUID Vendor Intel Family 6 Model 23 Jan 6 20:47:13 oleg2 mcelog: HARDWARE ERROR. This is *NOT* a software problem! Jan 6 20:47:13 oleg2 mcelog: Please contact your hardware vendor Jan 6 20:47:13 oleg2 mcelog: MCE 1 Jan 6 20:47:13 oleg2 mcelog: CPU 0 BANK 5 Jan 6 20:47:13 oleg2 mcelog: TIME 1294339633 Thu Jan 6 20:47:13 2011 Jan 6 20:47:13 oleg2 mcelog: MCG status: Jan 6 20:47:13 oleg2 mcelog: MCi status: Jan 6 20:47:13 oleg2 mcelog: Error overflow Jan 6 20:47:13 oleg2 mcelog: Uncorrected error Jan 6 20:47:13 oleg2 mcelog: Error enabled Jan 6 20:47:13 oleg2 mcelog: Processor context corrupt Jan 6 20:47:13 oleg2 mcelog: MCA: BUS Level-3 Generic Generic Other-transaction Request-timeout Error Jan 6 20:47:13 oleg2 mcelog: BQ_DCU_READ_TYPE BQ_ERR_AERR2_TYPE BQ_ERR_AERR2_TYPE Jan 6 20:47:13 oleg2 mcelog: received parity error on response transaction Jan 6 20:47:13 oleg2 mcelog: MCE driven MCE is observed Jan 6 20:47:13 oleg2 mcelog: STATUS f200001034000e0f MCGSTATUS 0 Jan 6 20:47:13 oleg2 mcelog: MCGCAP 806 APICID 0 SOCKETID 0 Jan 6 20:47:13 oleg2 mcelog: CPUID Vendor Intel Family 6 Model 23 Jan 6 20:47:13 oleg2 mcelog: HARDWARE ERROR. This is *NOT* a software problem! Jan 6 20:47:13 oleg2 mcelog: Please contact your hardware vendor Jan 6 20:47:13 oleg2 mcelog: MCE 2 Jan 6 20:47:13 oleg2 mcelog: CPU 1 BANK 5 Jan 6 20:47:13 oleg2 mcelog: TIME 1294339633 Thu Jan 6 20:47:13 2011 Jan 6 20:47:13 oleg2 mcelog: MCG status: Jan 6 20:47:13 oleg2 mcelog: MCi status: Jan 6 20:47:13 oleg2 mcelog: Error overflow Jan 6 20:47:13 oleg2 mcelog: Uncorrected error Jan 6 20:47:13 oleg2 mcelog: Error enabled Jan 6 20:47:13 oleg2 mcelog: Processor context corrupt Jan 6 20:47:13 oleg2 mcelog: MCA: BUS Level-3 Generic Generic Other-transaction Request-timeout Error Jan 6 20:47:13 oleg2 mcelog: BQ_DCU_READ_TYPE BQ_ERR_HARD_TYPE BQ_ERR_HARD_TYPE Jan 6 20:47:13 oleg2 mcelog: received parity error on response transaction Jan 6 20:47:13 oleg2 mcelog: MCE driven Jan 6 20:47:13 oleg2 mcelog: STATUS f200001010000e0f MCGSTATUS 0 Jan 6 20:47:13 oleg2 mcelog: MCGCAP 806 APICID 1 SOCKETID 0 Jan 6 20:47:13 oleg2 mcelog: CPUID Vendor Intel Family 6 Model 23 I just would like to get a clue - is this error related to the memory? By default my BIOS set this memory to 5-5-5-18 and 1.8V @ 1066 MHz. I changed settings to 2.1V and 5-5-5-15. My BIOS doesn't allow me to change 1T/2T timings. Anyway in both configs (default and custom) I get NO errors at all with memtest86+ 4.10 and get a lot of errors in test #7 (random number sequence) with memtest86 3.5a. 1. Should I try my memory @ 2.2V ? Is it safe? It's labeled as 2.1V. 2. Which test is more reliable - memtest86 or memtest86+ ? Since I rose DRAM voltage (a week ago) 1.8V -> 2.1V I got NO errors in my OS. But I'm confused by memtest86 errors. Thanks in advance! Link to comment Share on other sites More sharing options...
vinark Posted January 14, 2011 Share Posted January 14, 2011 Did you disable legacy usb in the bios? It is a known to cause errors in memtest. Link to comment Share on other sites More sharing options...
olego Posted January 14, 2011 Author Share Posted January 14, 2011 Did you disable legacy usb in the bios? It is a known to cause errors in memtest. Nope :confused:. I will try to do it tonight. Thank you for the clue! UPD: I disabled and got no differences - errors with memtest86. Link to comment Share on other sites More sharing options...
Wired Posted January 14, 2011 Share Posted January 14, 2011 Test with the latest version of Memtest86+ (link and instructions are on the sidebar). Link to comment Share on other sites More sharing options...
olego Posted January 14, 2011 Author Share Posted January 14, 2011 Test with the latest version of Memtest86+ (link and instructions are on the sidebar). Ok, let's decide that memtest86+ is more reliable and 100% trusted. I like this version because I have no errors with it =) Link to comment Share on other sites More sharing options...
Recommended Posts
Archived
This topic is now archived and is closed to further replies.