linux kernel panic


log

Jul 23 18:04:14 10.16.73.45 [19259390.583100] {1}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 3  
Jul 23 18:04:14 10.16.73.45 [19259390.587458] {1}[Hardware Error]: event severity: fatal
Jul 23 18:04:14 10.16.73.45 [19259390.589789] {1}[Hardware Error]:  Error 0, type: fatal
Jul 23 18:04:14 10.16.73.45 [19259390.592093] {1}[Hardware Error]:   section_type: PCIe error
Jul 23 18:04:14 10.16.73.45 [19259390.594490] {1}[Hardware Error]:   port_type: 4, root port
Jul 23 18:04:14 10.16.73.45 [19259390.596754] {1}[Hardware Error]:   version: 1.16
Jul 23 18:04:14 10.16.73.45 [19259390.599014] {1}[Hardware Error]:   command: 0x0547, status:0x4010
Jul 23 18:04:14 10.16.73.45 [19259390.601243] {1}[Hardware Error]:   device_id: 0000:00:02.0
Jul 23 18:04:14 10.16.73.45 [19259390.603379] {1}[Hardware Error]:   slot: 6
Jul 23 18:04:14 10.16.73.45 [19259390.605558] {1}[Hardware Error]:   secondary_bus: 0x03
Jul 23 18:04:14 10.16.73.45 [19259390.607726] {1}[Hardware Error]:   vendor_id: 0x8086, device_id:0x6f04
Jul 23 18:04:14 10.16.73.45 [19259390.610003] {1}[Hardware Error]:   class_code: 000406
Jul 23 18:04:14 10.16.73.45 [19259390.612194] {1}[Hardware Error]:   bridge: secondary_status:0x0000, control: 0x0003
Jul 23 18:04:14 10.16.73.45 [19259390.616976] Kernel panic - not syncing: Fatal hardware error!


分析如下:

1.  APEI  GHES

2.  PCIe

3.  slot: 6

4.  vendor_id: 0x8086(Intel Corporation), device_id: 0x6f04(Xeon E7 v4/Xeon E5 v4/Xeon E3 v4/Xeon D PCI Express Root Port 2)

     seach site: https://pci-ids.ucw.cz/read/PC?restrict=

5.  class_code: 000406 (Haswell Integrated Graphics Controller)

     search site: https://raw.githubusercontent.com/pciutils/pciids/master/pci.ids

 so, GPU has problem!!! 显卡坏了