Start a Conversation

Unsolved

R

1 Rookie

 • 

41 Posts

21

January 16th, 2025 20:19

PCIe - Fatal Error - How to find what device SEL references?

So I've been wrestling with my 3 PowerEdge racks for awhile now. I'm farely attuned to working on these servers by now, but this one is getting by me. Anyway, I have a R730XD that I'm trying to get up and running, and I keep getting this PCIe error.

Here is the output in SEL - I have iDRAC access on a rack KVM only, no web GUI. Even though the system will assign a static IP, there's no throughput.

I have removed all of the PCI cards except the 3 risers, network daughter card, and PERC. I have replaced the network card with a known working one as well as the PERC and it still gives the same errors. Also, I have cleared the logs, reset iDRAC, cleared nvram, flea power drained, reseated all ram and CPUs, double and triple checked all connections.. I'm sure I've done more, I just can't remember it all atm.

Anyway, I can't for the life of me find any reference to what 'Bus 0 Device 1 Function 0' or 'Bus 0 device 0 Function 0" is. Also, I'm very hesitant to reset iDRAC right now because just 3 days ago my R730 just fried itself (iDRAC anyway) resetting iDRAC from the Lifecycle Controller. Not long before that, another R730 in my care fried itself during an update with the Platform Bootable ISO. This will be my 3rd motherboard in less than 2 weeks if this is DOA.

My apologies for the poor quality of the picture, it was taken from a phone on a low res console screen.

Any help is much appreciated, thank you!

1 Rookie

 • 

41 Posts

January 16th, 2025 20:23

Just to clarify, I have not reset iDRAC to factory settings, that is what I am hesitant to do. I have cleared iDRAC from the Lifecycle Controller as well as the System ID button on the server itself. I have updated all firmware and drivers to the latest releases too.

(edited)

1 Rookie

 • 

41 Posts

January 17th, 2025 05:07

Update:   

I have reset iDRAC to factory settings as well as ran the retire and repurpose system option in Lifecycle controller (this doesn't seem to actually clear anything, iDRAC still has the same settings as well as the BIOS) and the PCIe errors are still there. At this point, I'm willing to try anything.

Thanks

Moderator

 • 

4.4K Posts

January 17th, 2025 05:22

Hello,

Bus 0 Device 1 Function 0' or 'Bus 0 device 0 Function 0"

This is usually the CPU.

Image


I'd ask to try a different CPU with min to post config or replace the mobo.

 

https://dell.to/3DS8tBV

1 Rookie

 • 

41 Posts

January 17th, 2025 06:18

@DELL-Young E​ 

I can try swapping them out tomorrow.

I did run hardware diagnostics via LC and both passed without issue. Also, oddly enough, the results of hardware diagnostics said PCIe was fine.

(edited)

No Events found!

Top