October 24th, 2016 10:00

A bus fatal error was detected on a component at bus 0 device 7 function 0

Hi all,

I'm trying to diagnose a hardware issue with my Dell PowerEdge R410. The issue occurred after applying a Linux kernel security patch and restarting (I'm running Ubuntu 14.04 on Linux 3.13.0-100-generic). Initially, the machine would crash while booting into the OS, but I somehow managed to get it to boot up consistently.

I'm seeing the following errors:

--------------------------------------------------

Severity : Critical
Date and Time : Sat Oct 22 05:50:30 2016
Description : A bus fatal error was detected on a component at bus 0 device 7 function 0.

Severity : Critical
Date and Time : Sat Oct 22 05:50:30 2016
Description : A bus fatal error was detected on a component at slot 1.

----------------------------------------------------

I'm having a hard time figuring out which component this refers to exactly. Here is the lspci output:

------------------------------------------------------------------------------------------------

00:00.0 Host bridge: Intel Corporation 5500 I/O Hub to ESI Port (rev 13)
00:01.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 1 (rev 13)
00:03.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 3 (rev 13)
00:07.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 7 (rev 13)
00:14.0 PIC: Intel Corporation 7500/5520/5500/X58 I/O Hub System Management Registers (rev 13)
00:14.1 PIC: Intel Corporation 7500/5520/5500/X58 I/O Hub GPIO and Scratch Pad Registers (rev 13)
00:14.2 PIC: Intel Corporation 7500/5520/5500/X58 I/O Hub Control Status and RAS Registers (rev 13)
00:1a.0 USB controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #4
00:1a.1 USB controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #5
00:1a.7 USB controller: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #2
00:1d.0 USB controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #1
00:1d.1 USB controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #2
00:1d.2 USB controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #3
00:1d.3 USB controller: Intel Corporation 82801JI (ICH10 Family) USB UHCI Controller #6
00:1d.7 USB controller: Intel Corporation 82801JI (ICH10 Family) USB2 EHCI Controller #1
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev 90)
00:1f.0 ISA bridge: Intel Corporation 82801JIR (ICH10R) LPC Interface Controller
00:1f.2 IDE interface: Intel Corporation 82801JI (ICH10 Family) 4 port SATA IDE Controller #1
00:1f.5 IDE interface: Intel Corporation 82801JI (ICH10 Family) 2 port SATA IDE Controller #2
01:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5716 Gigabit Ethernet (rev 20)
01:00.1 Ethernet controller: Broadcom Corporation NetXtreme II BCM5716 Gigabit Ethernet (rev 20)
03:00.0 Serial Attached SCSI controller: LSI Logic / Symbios Logic SAS2008 PCI-Express Fusion-MPT SAS-2 [Falcon] (rev 03)
04:03.0 VGA compatible controller: Matrox Electronics Systems Ltd. MGA G200eW WPCM450 (rev 0a)
fe:00.0 Host bridge: Intel Corporation Xeon 5600 Series QuickPath Architecture Generic Non-core Registers (rev 02)
fe:00.1 Host bridge: Intel Corporation Xeon 5600 Series QuickPath Architecture System Address Decoder (rev 02)
fe:02.0 Host bridge: Intel Corporation Xeon 5600 Series QPI Link 0 (rev 02)
fe:02.1 Host bridge: Intel Corporation Xeon 5600 Series QPI Physical 0 (rev 02)
fe:02.2 Host bridge: Intel Corporation Xeon 5600 Series Mirror Port Link 0 (rev 02)
fe:02.3 Host bridge: Intel Corporation Xeon 5600 Series Mirror Port Link 1 (rev 02)
fe:02.4 Host bridge: Intel Corporation Xeon 5600 Series QPI Link 1 (rev 02)
fe:02.5 Host bridge: Intel Corporation Xeon 5600 Series QPI Physical 1 (rev 02)
fe:03.0 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Registers (rev 02)
fe:03.1 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Target Address Decoder (rev 02)
fe:03.2 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller RAS Registers (rev 02)
fe:03.4 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Test Registers (rev 02)
fe:04.0 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 0 Control (rev 02)
fe:04.1 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 0 Address (rev 02)
fe:04.2 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 0 Rank (rev 02)
fe:04.3 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 0 Thermal Control (rev 02)
fe:05.0 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 1 Control (rev 02)
fe:05.1 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 1 Address (rev 02)
fe:05.2 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 1 Rank (rev 02)
fe:05.3 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 1 Thermal Control (rev 02)
fe:06.0 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 2 Control (rev 02)
fe:06.1 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 2 Address (rev 02)
fe:06.2 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 2 Rank (rev 02)
fe:06.3 Host bridge: Intel Corporation Xeon 5600 Series Integrated Memory Controller Channel 2 Thermal Control (rev 02)

------------------------------------------------------------------------------------------------

So one of the components appears to be "00:07.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 7 (rev 13)", but I'm unclear as to what this component is and what it connects to. Also, the slot in "A bus fatal error was detected on a component at slot 1." seems vague. What is slot 1 in this case?
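
For anyone matching these messages against lspci by hand, here is a minimal Python sketch of the lookup. It assumes the bus, device and function numbers in the event log are decimal, while lspci prints them in hexadecimal as bb:dd.f. The function names and the regular expression are illustrative only and are not part of any Dell or OMSA tool.

```python
import re

# Wording taken from the log entries quoted above; the numbers are assumed to be decimal.
SEL_PATTERN = re.compile(r"bus (\d+) device (\d+) function (\d+)")


def sel_to_lspci_address(description):
    """Return the lspci-style 'bb:dd.f' address named in a log description, or None."""
    match = SEL_PATTERN.search(description)
    if match is None:
        return None
    bus, device, function = (int(group) for group in match.groups())
    # lspci prints bus and device as two hex digits and the function as one digit.
    return f"{bus:02x}:{device:02x}.{function:x}"


def find_device(address, lspci_lines):
    """Return the first lspci line that starts with the given address, or None."""
    for line in lspci_lines:
        if line.startswith(address + " "):
            return line
    return None


description = "A bus fatal error was detected on a component at bus 0 device 7 function 0."
lspci_lines = [
    "00:03.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 3 (rev 13)",
    "00:07.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI Express Root Port 7 (rev 13)",
]

address = sel_to_lspci_address(description)
print(address, "->", find_device(address, lspci_lines))
```

The hex conversion matters once a number goes above 9: under the same decimal assumption, "bus 0 device 28 function 7" would show up in lspci as 00:1c.7.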

My specific questions are:

Which components do the critical errors apply to?

Should I do things in the following order:

- Unplug power cord and hold power button for 20 seconds
- Firmware updates
- Try reseating components
- Replace parts

PS. I'm no longer under warranty

Thanks,

Dave

Moderator

8.6K Posts

October 25th, 2016 11:00

Hi,

Slot 1 should reference PCIe slot 1. Most likely this is the slot the PERC controller is in. Your troubleshooting order is correct. If this is the only occurrence, reseating and updating firmware are probably enough.
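
One way to check from Linux which card actually sits behind a root port is to look at the bridge's directory in sysfs, where child devices appear as subdirectories named after their PCI address. The sketch below is only an illustration under that assumption; `devices_behind` is a hypothetical helper name, and the address 0000:00:07.0 is the root port from the lspci listing earlier in the thread.

```python
import os
import re

# Matches sysfs PCI device directory names such as "0000:03:00.0".
PCI_ADDRESS = re.compile(r"^[0-9a-f]{4}:[0-9a-f]{2}:[0-9a-f]{2}\.[0-7]$")


def devices_behind(bridge_address, sysfs_root="/sys/bus/pci/devices"):
    """List PCI addresses that appear directly under a bridge's sysfs directory."""
    bridge_dir = os.path.join(sysfs_root, bridge_address)
    if not os.path.isdir(bridge_dir):
        return []
    # Keep only entries that look like PCI addresses; skip attributes such as "power".
    return sorted(name for name in os.listdir(bridge_dir) if PCI_ADDRESS.match(name))


if __name__ == "__main__":
    # Root port named in the first error message (domain 0000 assumed).
    print(devices_behind("0000:00:07.0"))
```

If the list contains the address of the storage controller, that would support the idea that the slot 1 message and the root port message describe the same card.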

5 Posts

October 25th, 2016 17:00

Thanks for the response, Josh. 

I updated the firmware + OMSA and that seems to have done it. No errors showing, and rebooting seems fine.

Also, you're right, slot 1 was the PERC controller.

- Dave

1 Message

October 28th, 2016 08:00

I'm experiencing the same problem with my R410 server. I was just wondering whether it is worth upgrading the firmware if the version on my server is the same as the one available on Dell's website, and whether you think updating OMSA actually helped with this problem (or if it was the firmware upgrade).

-TC 

5 Posts

October 28th, 2016 09:00

In my case, the firmware upgrade seemed to do it. I doubt the OMSA helped but I figured I'd upgrade it anyway.

I used the Dell Repository Manager with help from this post.

If you're absolutely sure that your firmware is up to date then I guess there's no point, but if you have any doubt, I'd just do it anyway to rule it out.

Next step seems to be reseating the physical components. 

1 Message

January 23rd, 2019 23:00

We have replaced the NIC with a new card. Will the same issue repeat in the future?

1 Message

March 3rd, 2019 23:00

Do you have a manual or other documentation that confirms the solution to this error?

We need it to justify updating the applicable firmware; my team is asking for it.

A fatal error was detected on a component at bus 0 device 28 function 7.

A fatal error was detected on a component at bus 7 device 0 function 0.


1 Message

June 18th, 2019 23:00

@John Bang : Did you resolve your problem?

I have a bus fatal error on the same bus, device and function:

A bus fatal error was detected on a component at bus 0 device 28 function 7.

A bus fatal error was detected on a component at bus 7 device 0 function 0.
