Tuxedo Pulse 15 Gen1 AMD Ryzen 7 4800H with Radeon Graphics: Intermittent Crashes and Worrying Messages in the Journal

Hi everybody,

I’ve been testing the above-mentioned laptop for a month with different firmware versions (A02, A03, and A05). The result is always the same:

  • slowly flashing power button (as if the machine were in suspend mode)
  • in the journal: C-state out of order messages
  • intermittent, non-reproducible crashes

The only way out is a hard shutdown.

The Tuxedo Computers guys have modified the firmware. On GitHub they provide the Tuxedo-Keyboard driver bundle, which consists of:

  • tuxedo_keyboard
  • clevo_wmi
  • clevo_acpi
  • tuxedo_io
  • uniwill_wmi

These modules work on Arch Linux-based distros. I ran Manjaro with them myself (that wasn’t a problem, since Arch has the bundle in its repos). Hence I believe/hope that installing the bundle could be the solution, but looking at the logs I doubt that it’s that simple.
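
To check whether any of these modules are actually loaded in dom0, a small sketch like the following could help. It assumes standard `lsmod` output (module name in the first column); the function name `missing_tuxedo_modules` is made up for illustration.

```shell
# Hypothetical helper: print every module of the Tuxedo-Keyboard bundle
# that does NOT appear in lsmod-style output read from stdin.
missing_tuxedo_modules() {
    loaded=$(cat)
    for m in tuxedo_keyboard clevo_wmi clevo_acpi tuxedo_io uniwill_wmi; do
        # The first lsmod column is the module name; match it exactly.
        if ! printf '%s\n' "$loaded" | awk -v m="$m" '$1 == m { found = 1 } END { exit !found }'; then
            echo "$m"
        fi
    done
}

# Usage in dom0 (not run here):
#   lsmod | missing_tuxedo_modules
```

For comparison, none of the five modules appears in the "Modules linked in" line of the gntdev warning trace quoted in this post.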

Related issue: https://github.com/QubesOS/qubes-issues/issues/7648

Journal entries:
C-State warning:

-- Logs begin at Wed 2022-07-20 14:05:48 IST, end at Mon 2022-07-25 17:03:14 IST. --
Jul 25 16:10:46 dom0 kernel: [Firmware Bug]: ACPI MWAIT C-state 0x0 not supported by HW (0x0)
Jul 25 16:10:46 dom0 kernel: ACPI: FW issue: working around C-state latencies out of order
Jul 25 16:10:46 dom0 kernel: [Firmware Bug]: ACPI MWAIT C-state 0x0 not supported by HW (0x0)
Jul 25 16:10:46 dom0 kernel: ACPI: FW issue: working around C-state latencies out of order
Jul 25 16:10:46 dom0 kernel: [Firmware Bug]: ACPI MWAIT C-state 0x0 not supported by HW (0x0)
Jul 25 16:10:46 dom0 kernel: ACPI: FW issue: working around C-state latencies out of order
Jul 25 16:10:46 dom0 kernel: [Firmware Bug]: ACPI MWAIT C-state 0x0 not supported by HW (0x0)
Jul 25 16:10:46 dom0 kernel: ACPI: FW issue: working around C-state latencies out of order
Jul 25 16:10:46 dom0 kernel: [Firmware Bug]: ACPI MWAIT C-state 0x0 not supported by HW (0x0)
Jul 25 16:10:46 dom0 kernel: ACPI: FW issue: working around C-state latencies out of order
Jul 25 16:10:46 dom0 kernel: [Firmware Bug]: ACPI MWAIT C-state 0x0 not supported by HW (0x0)
Jul 25 16:10:46 dom0 kernel: ACPI: FW issue: working around C-state latencies out of order
Jul 25 16:10:46 dom0 kernel: [Firmware Bug]: ACPI MWAIT C-state 0x0 not supported by HW (0x0)
Jul 25 16:10:46 dom0 kernel: ACPI: FW issue: working around C-state latencies out of order
Jul 25 16:10:46 dom0 kernel: [Firmware Bug]: ACPI MWAIT C-state 0x0 not supported by HW (0x0)
Jul 25 16:10:46 dom0 kernel: ACPI: FW issue: working around C-state latencies out of order
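
To see how often these two messages show up per boot, the kernel log can be filtered and counted. A minimal sketch, assuming `journalctl -k -b` (kernel messages of the current boot) is available in dom0; the function name is illustrative only.

```shell
# Hypothetical helper: count the C-state firmware complaints in
# kernel log text read from stdin.
count_cstate_warnings() {
    # grep -c prints the number of matching lines (0 if there are none).
    grep -c -E 'ACPI MWAIT C-state|C-state latencies out of order'
}

# Usage in dom0 (not run here):
#   journalctl -k -b | count_cstate_warnings
```

In the excerpt above each message appears eight times, which would fit one pair per CPU on this machine (Xen reports 8 CPUs with smt=off), though that reading is an assumption.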

Warnings while running YouTube fullscreen, opening and closing windows, and starting and stopping qubes (testing); no crash:

Jul 30 18:19:12 dom0 kernel: ------------[ cut here ]------------
Jul 30 18:19:12 dom0 kernel: WARNING: CPU: 3 PID: 4712 at drivers/xen/gntdev.c:399 __unmap_grant_pages_done+0xfe/0x110 [xen_gntdev]
Jul 30 18:19:12 dom0 kernel: Modules linked in: loop vfat fat intel_rapl_msr wmi_bmof intel_rapl_common snd_hda_codec_realtek snd_hda_codec_generic snd_sof_amd_renoir snd_sof_amd_acp snd_hda_codec_hdmi snd_sof_pci snd_sof snd_sof_utils snd_hda_intel ledtrig_audio pcspkr joydev snd_intel_dspcfg snd_intel_sdw_acpi snd_soc_core snd_hda_codec iwlwifi snd_compress ac97_bus snd_hda_core snd_pcm_dmaengine snd_pci_acp6x snd_hwdep iwlmei k10temp snd_seq sp5100_tco snd_seq_device i2c_piix4 snd_pcm cfg80211 snd_pci_acp5x snd_timer snd_rn_pci_acp3x snd_acp_config snd rfkill snd_soc_acpi soundcore snd_pci_acp3x mei r8169 wmi video fuse xenfs ip_tables dm_thin_pool dm_persistent_data dm_bio_prison dm_crypt amdgpu drm_ttm_helper ttm iommu_v2 crct10dif_pclmul crc32_pclmul xhci_pci gpu_sched nvme crc32c_intel hid_multitouch xhci_pci_renesas ghash_clmulni_intel serio_raw xhci_hcd drm_dp_helper ccp nvme_core i2c_hid_acpi i2c_hid xen_acpi_processor xen_privcmd xen_pciback xen_blkback xen_gntalloc xen_gntdev xen_evtchn
Jul 30 18:19:12 dom0 kernel:  uinput
Jul 30 18:19:12 dom0 kernel: CPU: 3 PID: 4712 Comm: Xorg Not tainted 5.18.9-1.fc32.qubes.x86_64 #1
Jul 30 18:19:12 dom0 kernel: Hardware name: TUXEDO TUXEDO Pulse 15 Gen1/PULSE1501, BIOS N.1.07.A05 04/25/2022
Jul 30 18:19:12 dom0 kernel: RIP: e030:__unmap_grant_pages_done+0xfe/0x110 [xen_gntdev]
...
Jul 30 18:19:12 dom0 kernel: Call Trace:
Jul 30 18:19:12 dom0 kernel:  <TASK>
Jul 30 18:19:12 dom0 kernel:  unmap_grant_pages.part.0+0x121/0x1c0 [xen_gntdev]
Jul 30 18:19:12 dom0 kernel:  gntdev_mmap+0x240/0x2d0 [xen_gntdev]
Jul 30 18:19:12 dom0 kernel:  mmap_region+0x4dc/0x6d0
Jul 30 18:19:12 dom0 kernel:  do_mmap+0x33d/0x530
Jul 30 18:19:12 dom0 kernel:  ? request_trusted_key+0x60/0x60
Jul 30 18:19:12 dom0 kernel:  ? security_mmap_file+0x81/0xd0
Jul 30 18:19:12 dom0 kernel:  vm_mmap_pgoff+0xe2/0x180
Jul 30 18:19:12 dom0 kernel:  ksys_mmap_pgoff+0x186/0x1f0
Jul 30 18:19:12 dom0 kernel:  do_syscall_64+0x5c/0x80
Jul 30 18:19:12 dom0 kernel:  ? exit_to_user_mode_prepare+0xc9/0xe0
Jul 30 18:19:12 dom0 kernel:  ? syscall_exit_to_user_mode+0x17/0x30
Jul 30 18:19:12 dom0 kernel:  ? do_syscall_64+0x69/0x80
Jul 30 18:19:12 dom0 kernel:  ? syscall_exit_to_user_mode+0x17/0x30
Jul 30 18:19:12 dom0 kernel:  ? do_syscall_64+0x69/0x80
Jul 30 18:19:12 dom0 kernel:  ? do_syscall_64+0x69/0x80
Jul 30 18:19:12 dom0 kernel:  ? do_syscall_64+0x69/0x80
Jul 30 18:19:12 dom0 kernel:  entry_SYSCALL_64_after_hwframe+0x44/0xae
Jul 30 18:19:12 dom0 kernel: RIP: 0033:0x78e0a858f2e6
Jul 30 18:19:12 dom0 kernel: Code: 01 00 66 90 f3 0f 1e fa 41 f7 c1 ff 0f 00 00 75 2b 55 48 89 fd 53 89 cb 48 85 ff 74 37 41 89 da 48 89 ef b8 09 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 62 5b 5d c3 0f 1f 80 00 00 00 00 48 8b 05 79
Jul 30 18:19:12 dom0 kernel: RSP: 002b:00007ffecf5a8ed8 EFLAGS: 00000246 ORIG_RAX: 0000000000000009
Jul 30 18:19:12 dom0 kernel: RAX: ffffffffffffffda RBX: 0000000000000001 RCX: 000078e0a858f2e6
Jul 30 18:19:12 dom0 kernel: RDX: 0000000000000001 RSI: 00000000001e2000 RDI: 0000000000000000
Jul 30 18:19:12 dom0 kernel: RBP: 0000000000000000 R08: 0000000000000009 R09: 0000000000003000
Jul 30 18:19:12 dom0 kernel: R10: 0000000000000001 R11: 0000000000000246 R12: 00007ffecf5a8ef0
Jul 30 18:19:12 dom0 kernel: R13: 0000000000000001 R14: 0000000000000009 R15: 00000000000001e2
Jul 30 18:19:12 dom0 kernel:  </TASK>
Jul 30 18:19:12 dom0 kernel: ---[ end trace 0000000000000000 ]---
Jul 30 18:19:12 dom0 kernel: ------------[ cut here ]------------
Jul 30 18:19:12 dom0 kernel: WARNING: CPU: 3 PID: 4712 at drivers/xen/gntdev.c:405 __unmap_grant_pages_done+0x105/0x110 [xen_gntdev]

System Crash 3:

Jul 31 12:39:25 dom0 sudo[24111]: pam_unix(sudo:session): session opened for user root by (uid=1000)
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:222 vmid:11 pasid:0, for process  pid 0 thread  pid 0)
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:   in page starting at address 0x0000be000000c000 from IH client 0x1b (UTCL2)
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00B009BC
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:          Faulty UTCL2 client ID: CPF (0x4)
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:          MORE_FAULTS: 0x0
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:          WALKER_ERROR: 0x6
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:          PERMISSION_FAULTS: 0xb
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:          MAPPING_ERROR: 0x1
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:          RW: 0x0
Jul 31 12:39:28 dom0 kernel: ------------[ cut here ]------------
Jul 31 12:39:28 dom0 kernel: Bug: No PASID in KFD interrupt
...
Jul 31 12:39:28 dom0 kernel:  uinput
Jul 31 12:39:28 dom0 kernel: CPU: 0 PID: 0 Comm: swapper/0 Not tainted 5.18.9-1.fc32.qubes.x86_64 #1
Jul 31 12:39:28 dom0 kernel: Hardware name: TUXEDO TUXEDO Pulse 15 Gen1/PULSE1501, BIOS N.1.07.A05 04/25/2022
Jul 31 12:39:28 dom0 kernel: RIP: e030:event_interrupt_isr_v9+0x1a9/0x1b0 [amdgpu]
Jul 31 12:39:28 dom0 kernel: Code: c7 e9 ea fe ff ff 44 0f b6 3d 0a f6 59 00 45 84 ff 0f 85 d6 fe ff ff 48 c7 c7 b0 b1 7c c0 c6 05 f3 f5 59 00 01 e8 38 db 95 c1 <0f> 0b e9 bf fe ff ff 0f 1f 44 00 00 41 54 55 53 48 8b 9f c8 00 00
Jul 31 12:39:28 dom0 kernel: RSP: e02b:ffffc90040003d60 EFLAGS: 00010086
Jul 31 12:39:28 dom0 kernel: RAX: 0000000000000000 RBX: ffff88810eea5160 RCX: 0000000000000000
Jul 31 12:39:28 dom0 kernel: RDX: 0000000000010004 RSI: ffffffff8268ddd2 RDI: 00000000ffffffff
Jul 31 12:39:28 dom0 kernel: RBP: ffff88810b2a6400 R08: 0000000000000000 R09: 00000000ffffdfff
Jul 31 12:39:28 dom0 kernel: R10: ffffc90040003ba8 R11: ffffffff829450e8 R12: 000000000000000b
Jul 31 12:39:28 dom0 kernel: R13: 000000000bde0000 R14: 0000000000000000 R15: 0000000000000000
Jul 31 12:39:28 dom0 kernel: FS:  0000000000000000(0000) GS:ffff888134400000(0000) knlGS:0000000000000000
Jul 31 12:39:28 dom0 kernel: CS:  10000e030 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 31 12:39:28 dom0 kernel: CR2: 00007dff41592080 CR3: 0000000021e56000 CR4: 0000000000050660
Jul 31 12:39:28 dom0 kernel: Call Trace:
Jul 31 12:39:28 dom0 kernel:  <IRQ>
Jul 31 12:39:28 dom0 kernel:  kgd2kfd_interrupt+0xd2/0x170 [amdgpu]
Jul 31 12:39:28 dom0 kernel:  amdgpu_irq_dispatch+0xf0/0x210 [amdgpu]
Jul 31 12:39:28 dom0 kernel:  amdgpu_ih_process+0x80/0xf0 [amdgpu]
Jul 31 12:39:28 dom0 kernel:  amdgpu_irq_handler+0x21/0x90 [amdgpu]
Jul 31 12:39:28 dom0 kernel:  __handle_irq_event_percpu+0x46/0x180
Jul 31 12:39:28 dom0 kernel:  handle_irq_event+0x34/0x70
Jul 31 12:39:28 dom0 kernel:  handle_edge_irq+0x9f/0x240
Jul 31 12:39:28 dom0 kernel:  handle_irq_desc+0x36/0x40
Jul 31 12:39:28 dom0 kernel:  consume_one_event+0xfd/0x110
Jul 31 12:39:28 dom0 kernel:  __evtchn_fifo_handle_events+0x72/0xb0
Jul 31 12:39:28 dom0 kernel:  __xen_evtchn_do_upcall+0x72/0xc0
Jul 31 12:39:28 dom0 kernel:  __xen_pv_evtchn_do_upcall+0x39/0x60
Jul 31 12:39:28 dom0 kernel:  xen_pv_evtchn_do_upcall+0xd7/0x100
Jul 31 12:39:28 dom0 kernel:  </IRQ>
Jul 31 12:39:28 dom0 kernel:  <TASK>
Jul 31 12:39:28 dom0 kernel:  exc_xen_hypervisor_callback+0x8/0x10
Jul 31 12:39:28 dom0 kernel: RIP: e030:xen_hypercall_sched_op+0xa/0x20
Jul 31 12:39:28 dom0 kernel: Code: 51 41 53 b8 1c 00 00 00 0f 05 41 5b 59 c3 cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc 51 41 53 b8 1d 00 00 00 0f 05 <41> 5b 59 c3 cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc cc
Jul 31 12:39:28 dom0 kernel: RSP: e02b:ffffffff82803dd0 EFLAGS: 00000246
Jul 31 12:39:28 dom0 kernel: RAX: 0000000000000000 RBX: ffffffff8281a940 RCX: ffffffff81dbc3aa
Jul 31 12:39:28 dom0 kernel: RDX: ffffffff8281a940 RSI: 0000000000000000 RDI: 0000000000000001
Jul 31 12:39:28 dom0 kernel: RBP: 0000000000000000 R08: 000001b36dc0eb24 R09: 0000000000000000
Jul 31 12:39:28 dom0 kernel: R10: 0000000000000400 R11: 0000000000000246 R12: 0000000000000000
Jul 31 12:39:28 dom0 kernel: R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
Jul 31 12:39:28 dom0 kernel:  ? xen_hypercall_sched_op+0xa/0x20
Jul 31 12:39:28 dom0 kernel:  ? xen_safe_halt+0xc/0x20
Jul 31 12:39:28 dom0 kernel:  ? default_idle+0xa/0x10
Jul 31 12:39:28 dom0 kernel:  ? default_idle_call+0x32/0xe0
Jul 31 12:39:28 dom0 kernel:  ? cpuidle_idle_call+0x13b/0x170
Jul 31 12:39:28 dom0 kernel:  ? do_idle+0x7e/0xe0
Jul 31 12:39:28 dom0 kernel:  ? cpu_startup_entry+0x19/0x20
Jul 31 12:39:28 dom0 kernel:  ? rest_init+0xcb/0xd0
Jul 31 12:39:28 dom0 kernel:  ? arch_call_rest_init+0xa/0x3d
Jul 31 12:39:28 dom0 kernel:  ? start_kernel+0x6a8/0x6e6
Jul 31 12:39:28 dom0 kernel:  ? xen_start_kernel+0x5bb/0x5d9
Jul 31 12:39:28 dom0 kernel:  ? startup_xen+0x3e/0x3e
Jul 31 12:39:28 dom0 kernel:  </TASK>
Jul 31 12:39:28 dom0 kernel: ---[ end trace 0000000000000000 ]---
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:40 vmid:1 pasid:32770, for process Xorg pid 4771 thread X:cs0 pid 4792)
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:   in page starting at address 0x00008001022c0000 from IH client 0x1b (UTCL2)
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: VM_L2_PROTECTION_FAULT_STATUS:0x00141051
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:          Faulty UTCL2 client ID: TCP (0x8)
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:          MORE_FAULTS: 0x1
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:          WALKER_ERROR: 0x0
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:          PERMISSION_FAULTS: 0x5
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:          MAPPING_ERROR: 0x0
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:          RW: 0x1
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: [gfxhub0] no-retry page fault (src_id:0 ring:40 vmid:1 pasid:32770, for process Xorg pid 4771 thread X:cs0 pid 4792)
Jul 31 12:39:28 dom0 kernel: amdgpu 0000:04:00.0: amdgpu:   in page starting at address 0x00008001022c0000 from IH client 0x1b (UTCL2)
...
Jul 31 12:39:49 dom0 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered
Jul 31 12:39:49 dom0 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=1095, emitted seq=1097
Jul 31 12:39:49 dom0 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process  pid 0 thread  pid 0
Jul 31 12:39:49 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: GPU reset begin!
Jul 31 12:39:49 dom0 kernel: [drm] free PSP TMR buffer
Jul 31 12:39:49 dom0 kernel: CPU: 5 PID: 23293 Comm: kworker/u16:0 Tainted: G        W         5.18.9-1.fc32.qubes.x86_64 #1
Jul 31 12:39:49 dom0 kernel: Hardware name: TUXEDO TUXEDO Pulse 15 Gen1/PULSE1501, BIOS N.1.07.A05 04/25/2022
Jul 31 12:39:49 dom0 kernel: Workqueue: amdgpu-reset-dev drm_sched_job_timedout [gpu_sched]
Jul 31 12:39:49 dom0 kernel: Call Trace:
Jul 31 12:39:49 dom0 kernel:  <TASK>
Jul 31 12:39:49 dom0 kernel:  dump_stack_lvl+0x45/0x5a
Jul 31 12:39:49 dom0 kernel:  amdgpu_reset_reg_dumps.isra.0+0x13/0x93 [amdgpu]
Jul 31 12:39:49 dom0 kernel:  amdgpu_do_asic_reset+0x27/0x3a6 [amdgpu]
Jul 31 12:39:49 dom0 kernel:  amdgpu_device_gpu_recover_imp.cold+0x524/0x65b [amdgpu]
Jul 31 12:39:49 dom0 kernel:  amdgpu_job_timedout+0x17a/0x1b0 [amdgpu]
Jul 31 12:39:49 dom0 kernel:  drm_sched_job_timedout+0x76/0x110 [gpu_sched]
Jul 31 12:39:49 dom0 kernel:  process_one_work+0x1e5/0x3b0
Jul 31 12:39:49 dom0 kernel:  worker_thread+0x49/0x2e0
Jul 31 12:39:49 dom0 kernel:  ? rescuer_thread+0x3a0/0x3a0
Jul 31 12:39:49 dom0 kernel:  kthread+0xe7/0x110
Jul 31 12:39:49 dom0 kernel:  ? kthread_complete_and_exit+0x20/0x20
Jul 31 12:39:49 dom0 kernel:  ret_from_fork+0x22/0x30
Jul 31 12:39:49 dom0 kernel:  </TASK>
Jul 31 12:39:49 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: MODE2 reset
Jul 31 12:39:49 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: GPU reset succeeded, trying to resume
Jul 31 12:39:49 dom0 kernel: [drm] PCIE GART of 1024M enabled.
Jul 31 12:39:49 dom0 kernel: [drm] PTB located at 0x000000F400900000
Jul 31 12:39:49 dom0 kernel: [drm] PSP is resuming...
Jul 31 12:39:49 dom0 kernel: [drm] reserve 0x400000 from 0xf47f800000 for PSP TMR
Jul 31 12:39:49 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: RAS: optional ras ta ucode is not available
Jul 31 12:39:49 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: RAP: optional rap ta ucode is not available
Jul 31 12:39:49 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
Jul 31 12:39:49 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: SMU is resuming...
Jul 31 12:39:49 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: SMU is resumed successfully!
Jul 31 12:39:49 dom0 kernel: [drm] DMUB hardware initialized: version=0x01010020
Jul 31 12:39:50 dom0 kernel: [drm] kiq ring mec 2 pipe 1 q 0
Jul 31 12:39:50 dom0 kernel: amdgpu 0000:04:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring sdma0 test failed (-110)
Jul 31 12:39:50 dom0 kernel: [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <sdma_v4_0> failed -110
Jul 31 12:39:50 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: GPU reset(3) failed
Jul 31 12:39:50 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: GPU reset end with ret = -110
Jul 31 12:39:50 dom0 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -110
Jul 31 12:40:00 dom0 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=1097, emitted seq=1097
Jul 31 12:40:00 dom0 kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process  pid 0 thread  pid 0
Jul 31 12:40:00 dom0 kernel: amdgpu 0000:04:00.0: amdgpu: GPU reset begin!
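
Because the machine needs a hard shutdown after such a crash, the relevant lines end up in the journal of the previous boot. A sketch for pulling out just the amdgpu fault, timeout, and reset messages, assuming a persistent journal so that `journalctl -k -b -1` works; the function name and the pattern list are illustrative choices, not an official tool.

```shell
# Hypothetical helper: keep only amdgpu fault/timeout/reset lines from
# kernel log text read from stdin.
filter_gpu_errors() {
    grep -E 'no-retry page fault|ring [a-z0-9]+ timeout|GPU reset|GPU Recovery Failed'
}

# Usage in dom0 after rebooting from a crash (not run here):
#   journalctl -k -b -1 | filter_gpu_errors
```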

inxi -Fxxxz:

System:
  Kernel: 5.18.9-1.fc32.qubes.x86_64 x86_64 bits: 64 compiler: gcc 
  v: 2.34-6.fc32 Desktop: Xfce 4.14.3 tk: Gtk 3.24.23 info: xfce4-panel 
  wm: xfwm4 vt: 1 dm: LightDM 1.30.0 Distro: Qubes release 4.1.1 (R4.1) 
Machine:
  Type: Laptop System: TUXEDO product: TUXEDO Pulse 15 Gen1 v: Standard 
  serial: <filter> 
  Mobo: TUXEDO s model: PULSE1501 v: Standard serial: <filter> 
  UEFI: American Megatrends v: N.1.07.A05 date: 04/25/2022 
Battery:
  ID-1: BAT0 charge: 68.4 Wh (100.0%) condition: 68.4/91.6 Wh (74.7%) 
  volts: 12.6 min: 11.6 model: standard type: Li-ion serial: <filter> 
  status: Full 
CPU:
  Info: 8-Core model: AMD Ryzen 7 4800H with Radeon Graphics bits: 64 
  type: MCP arch: Zen 2 rev: 1 cache: L2: 4 MiB 
  flags: avx avx2 lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 
  bogomips: 46313 
  Speed: 2895 MHz min/max: N/A Core speeds (MHz): 1: 2895 2: 2895 3: 2895 
  4: 2895 5: 2895 6: 2895 7: 2895 8: 2895 
Graphics:
  Device-1: AMD Renoir vendor: Tongfang Hongkong Limited driver: amdgpu 
  v: kernel bus-ID: 04:00.0 chip-ID: 1002:1636 class-ID: 0300 
  Display: x11 server: Fedora Project X.org 1.20.11 driver: 
  loaded: ati,modesetting unloaded: fbdev,vesa alternate: amdgpu 
  resolution: 1920x1080~60Hz s-dpi: 96 
  OpenGL: 
  renderer: AMD RENOIR (DRM 3.46.0 5.18.9-1.fc32.qubes.x86_64 LLVM 10.0.1) 
  v: 4.6 Mesa 20.2.3 direct render: Yes 
Audio:
  Device-1: AMD vendor: Tongfang Hongkong Limited driver: snd_hda_intel 
  v: kernel bus-ID: 04:00.1 chip-ID: 1002:1637 class-ID: 0403 
  Device-2: AMD Raven/Raven2/FireFlight/Renoir Audio Processor 
  vendor: Tongfang Hongkong Limited driver: N/A bus-ID: 04:00.5 
  chip-ID: 1022:15e2 class-ID: 0480 
  Device-3: AMD Family 17h HD Audio vendor: Tongfang Hongkong Limited 
  driver: snd_hda_intel v: kernel bus-ID: 04:00.6 chip-ID: 1022:15e3 
  class-ID: 0403 
  Sound Server-1: ALSA v: k5.18.9-1.fc32.qubes.x86_64 running: yes 
Network:
  Device-1: Intel Wi-Fi 6 AX200 driver: pciback v: N/A bus-ID: 01:00.0 
  chip-ID: 8086:2723 class-ID: 0280 
  Device-2: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet 
  vendor: Tongfang Hongkong Limited driver: pciback v: N/A port: f000 
  bus-ID: 02:00.0 chip-ID: 10ec:8168 class-ID: 0200 
Drives:
  Local Storage: total: 1.82 TiB used: 6.04 GiB (0.3%) 
  ID-1: /dev/nvme0n1 vendor: Samsung model: SSD 970 EVO Plus 2TB 
  size: 1.82 TiB speed: 31.6 Gb/s lanes: 4 rotation: SSD serial: <filter> 
  rev: 2B2QEXM7 scheme: GPT 
Partition:
  ID-1: / size: 19.52 GiB used: 5.8 GiB (29.7%) fs: ext4 dev: /dev/dm-4 
  mapped: qubes_dom0-root 
  ID-2: /boot size: 973.4 MiB used: 194.6 MiB (20.0%) fs: ext4 
  dev: /dev/nvme0n1p2 
  ID-3: /boot/efi size: 598.8 MiB used: 49.5 MiB (8.3%) fs: vfat 
  dev: /dev/nvme0n1p1 
Swap:
  ID-1: swap-1 type: partition size: 3.94 GiB used: 0 KiB (0.0%) 
  priority: -2 dev: /dev/dm-5 mapped: qubes_dom0-swap 
Sensors:
  System Temperatures: cpu: 58.0 C mobo: N/A gpu: amdgpu temp: 46.0 C 
  Fan Speeds (RPM): N/A 
Info:
  Processes: 375 Uptime: 4h 19m wakeups: 6 Memory: 3.8 GiB 
  used: 969.7 MiB (24.9%) Init: systemd v: 245 runlevel: 5 
  target: graphical.target Compilers: gcc: N/A Packages: rpm: 1055 
  Shell: Bash v: 5.0.17 running-in: xfce4-terminal inxi: 3.3.03

xl info:

host                   : dom0
release                : 5.18.9-1.fc32.qubes.x86_64
version                : #1 SMP PREEMPT_DYNAMIC Tue Jul 5 21:08:12 CEST 2022
machine                : x86_64
nr_cpus                : 8
max_cpu_id             : 15
nr_nodes               : 1
cores_per_socket       : 8
threads_per_core       : 1
cpu_mhz                : 2894.575
hw_caps                : 178bf3ff:76d8320b:2e500800:244037ff:0000000f:219c91a9:00400004:00000500
virt_caps              : pv hvm hvm_directio pv_directio hap
total_memory           : 63403
free_memory            : 45473
sharing_freed_memory   : 0
sharing_used_memory    : 0
outstanding_claims     : 0
free_cpus              : 0
xen_major              : 4
xen_minor              : 14
xen_extra              : .5
xen_version            : 4.14.5
xen_caps               : xen-3.0-x86_64 hvm-3.0-x86_32 hvm-3.0-x86_32p hvm-3.0-x86_64 
xen_scheduler          : credit2
xen_pagesize           : 4096
platform_params        : virt_start=0xffff800000000000
xen_changeset          : 
xen_commandline        : placeholder console=none dom0_mem=min:1024M dom0_mem=max:4096M ucode=scan smt=off gnttab_max_frames=2048 gnttab_max_maptrack_frames=4096 no-real-mode edd=off
cc_compiler            : gcc (GCC) 10.3.1 20210422 (Red Hat 10.3.1-1)
cc_compile_by          : mockbuild
cc_compile_domain      : [unknown]
cc_compile_date        : Tue Jul 12 00:00:00 UTC 2022
build_id               : 93ffa4957d0db1e724babda3e618271e9c4b09be
xend_config_format     : 4

xl dmesg:

(XEN) Built-in command line: ept=exec-sp spec-ctrl=unpriv-mmio
(XEN) parameter "no-real-mode" unknown!
 Xen 4.14.5
(XEN) Xen version 4.14.5 (mockbuild@[unknown]) (gcc (GCC) 10.3.1 20210422 (Red Hat 10.3.1-1)) debug=n  Tue Jul 12 00:00:00 UTC 2022
(XEN) Latest ChangeSet: 
(XEN) Bootloader: GRUB 2.04
(XEN) Command line: placeholder console=none dom0_mem=min:1024M dom0_mem=max:4096M ucode=scan smt=off gnttab_max_frames=2048 gnttab_max_maptrack_frames=4096 no-real-mode edd=off
(XEN) Xen image load base address: 0xc4e00000
(XEN) Video information:
(XEN)  VGA is graphics mode 1920x1080, 32 bpp
(XEN) Disc information:
(XEN)  Found 0 MBR signatures
(XEN)  Found 1 EDD information structures
(XEN) EFI RAM map:
(XEN)  [0000000000000000, 000000000009ffff] (usable)
(XEN)  [00000000000a0000, 00000000000fffff] (reserved)
(XEN)  [0000000000100000, 0000000009bfefff] (usable)
(XEN)  [0000000009bff000, 0000000009ffffff] (reserved)
(XEN)  [000000000a000000, 000000000a1fffff] (usable)
(XEN)  [000000000a200000, 000000000a20cfff] (ACPI NVS)
(XEN)  [000000000a20d000, 00000000c9dc3fff] (usable)
(XEN)  [00000000c9dc4000, 00000000c9fd3fff] (ACPI NVS)
(XEN)  [00000000c9fd4000, 00000000cb09efff] (usable)
(XEN)  [00000000cb09f000, 00000000cc5bbfff] (reserved)
(XEN)  [00000000cc5bc000, 00000000cc606fff] (ACPI data)
(XEN)  [00000000cc607000, 00000000cc697fff] (ACPI NVS)
(XEN)  [00000000cc698000, 00000000cc698fff] (reserved)
(XEN)  [00000000cc699000, 00000000cc6f9fff] (ACPI NVS)
(XEN)  [00000000cc6fa000, 00000000cc6fafff] (reserved)
(XEN)  [00000000cc6fb000, 00000000cc987fff] (ACPI NVS)
(XEN)  [00000000cc988000, 00000000cd1fefff] (reserved)
(XEN)  [00000000cd1ff000, 00000000cdffffff] (usable)
(XEN)  [00000000ce000000, 00000000cfffffff] (reserved)
(XEN)  [00000000f0000000, 00000000f7ffffff] (reserved)
(XEN)  [00000000fd000000, 00000000ffffffff] (reserved)
(XEN)  [0000000100000000, 0000000faf33ffff] (usable)
(XEN)  [0000000faf340000, 00000010501fffff] (reserved)
(XEN) ACPI: RSDP CC900014, 0024 (r2 ALASKA)
(XEN) ACPI: XSDT CC8FF728, 00E4 (r1 ALASKA   A M I   1072009 AMI   1000013)
(XEN) ACPI: FACP CC5FD000, 0114 (r6 ALASKA   A M I   1072009 AMI     10013)
(XEN) ACPI: DSDT CC5F4000, 86D9 (r2 ALASKA   A M I   1072009 INTL 20120913)
(XEN) ACPI: FACS CC8CD000, 0040
(XEN) ACPI: SSDT CC5FF000, 7216 (r2    AMD AmdTable        2 MSFT  4000000)
(XEN) ACPI: IVRS CC5FE000, 01A4 (r2  AMD   AmdTable        1 AMD         0)
(XEN) ACPI: FIDT CC5F3000, 009C (r1 ALASKA    A M I  1072009 AMI     10013)
(XEN) ACPI: MCFG CC5F2000, 003C (r1 ALASKA    A M I  1072009 MSFT    10013)
(XEN) ACPI: HPET CC5F1000, 0038 (r1 ALASKA    A M I  1072009 AMI         5)
(XEN) ACPI: SSDT CC5F0000, 0228 (r1    AMD     STD3        1 INTL 20120913)
(XEN) ACPI: VFCT CC5E2000, D484 (r1 ALASKA   A M I         1  AMD 31504F47)
(XEN) ACPI: BGRT CC5E1000, 0038 (r1 ALASKA   A M I   1072009 AMI     10013)
(XEN) ACPI: TPM2 CC5E0000, 004C (r4 ALASKA   A M I         1 AMI         0)
(XEN) ACPI: SSDT CC5DC000, 39F4 (r1    AMD AmdTable        1 AMD         1)
(XEN) ACPI: CRAT CC5DB000, 0F28 (r1    AMD AmdTable        1 AMD         1)
(XEN) ACPI: CDIT CC5DA000, 0029 (r1    AMD AmdTable        1 AMD         1)
(XEN) ACPI: SSDT CC5D9000, 0139 (r1    AMD AmdTable        1 INTL 20120913)
(XEN) ACPI: SSDT CC5D8000, 00B9 (r1    AMD AmdTable        1 INTL 20120913)
(XEN) ACPI: SSDT CC5D7000, 028D (r1    AMD AmdTable        1 INTL 20120913)
(XEN) ACPI: SSDT CC5D6000, 0C78 (r1    AMD AmdTable        1 INTL 20120913)
(XEN) ACPI: SSDT CC5D4000, 10A5 (r1    AMD AmdTable        1 INTL 20120913)
(XEN) ACPI: SSDT CC5D0000, 30C8 (r1    AMD AmdTable        1 INTL 20120913)
(XEN) ACPI: WSMT CC5CF000, 0028 (r1 ALASKA   A M I   1072009 AMI     10013)
(XEN) ACPI: APIC CC5CE000, 00DE (r3 ALASKA   A M I   1072009 AMI     10013)
(XEN) ACPI: SSDT CC5CD000, 007D (r1    AMD AmdTable        1 INTL 20120913)
(XEN) ACPI: SSDT CC5CC000, 0517 (r1    AMD AmdTable        1 INTL 20120913)
(XEN) ACPI: FPDT CC5CB000, 0044 (r1 ALASKA   A M I   1072009 AMI   1000013)
(XEN) System RAM: 63403MB (64925064kB)
(XEN) Domain heap initialised
(XEN) ACPI: 32/64X FACS address mismatch in FADT - cc8cd000/0000000000000000, using 32
(XEN) IOAPIC[0]: apic_id 17, version 33, address 0xfec00000, GSI 0-23
(XEN) IOAPIC[1]: apic_id 18, version 33, address 0xfec01000, GSI 24-55
(XEN) Enabling APIC mode:  Phys.  Using 2 I/O APICs
(XEN) CPU0: 1400 ... 2900 MHz
(XEN) xstate: size: 0x380 and states: 0x207
(XEN) Speculative mitigation facilities:
(XEN)   Hardware hints: IBRS_FAST IBRS_SAME_MODE
(XEN)   Hardware features: IBPB IBRS STIBP SSBD
(XEN)   Compiled-in support: INDIRECT_THUNK
(XEN)   Xen settings: BTI-Thunk RETPOLINE, SPEC_CTRL: IBRS- STIBP+ SSBD-, Other: BRANCH_HARDEN
(XEN)   Support for HVM VMs: MSR_SPEC_CTRL RSB IBPB-entry
(XEN)   Support for PV VMs: IBPB-entry
(XEN)   XPTI (64-bit PV only): Dom0 disabled, DomU disabled (without PCID)
(XEN)   PV L1TF shadowing: Dom0 disabled, DomU disabled
(XEN) Using scheduler: SMP Credit Scheduler rev2 (credit2)
(XEN) Initializing Credit2 scheduler
(XEN) Platform timer is 14.318MHz HPET
(XEN) Detected 2894.575 MHz processor.
(XEN) AMD-Vi: IOMMU Extended Features:
(XEN) - Peripheral Page Service Request
(XEN) - x2APIC
(XEN) - NX bit
(XEN) - Invalidate All Command
(XEN) - Guest APIC
(XEN) - Performance Counters
(XEN) - Host Address Translation Size: 0x2
(XEN) - Guest Address Translation Size: 0
(XEN) - Guest CR3 Root Table Level: 0x1
(XEN) - Maximum PASID: 0xf
(XEN) - SMI Filter Register: 0x1
(XEN) - SMI Filter Register Count: 0x1
(XEN) - Guest Virtual APIC Modes: 0x1
(XEN) - Dual PPR Log: 0x2
(XEN) - Dual Event Log: 0x2
(XEN) - User / Supervisor Page Protection
(XEN) - Device Table Segmentation: 0x3
(XEN) - PPR Log Overflow Early Warning
(XEN) - PPR Automatic Response
(XEN) - Memory Access Routing and Control: 0
(XEN) - Block StopMark Message
(XEN) - Performance Optimization
(XEN) - MSI Capability MMIO Access
(XEN) - Guest I/O Protection
(XEN) - Enhanced PPR Handling
(XEN) - Attribute Forward
(XEN) - Invalidate IOTLB Type
(XEN) - VM Table Size: 0
(XEN) - Guest Access Bit Update Disable
(XEN) AMD-Vi: IOMMU 0 Enabled.
(XEN) I/O virtualisation enabled
(XEN)  - Dom0 mode: Relaxed
(XEN) Interrupt remapping enabled
(XEN) ENABLING IO-APIC IRQs
(XEN)  -> Using new ACK method
(XEN) Allocated console ring of 32 KiB.
(XEN) HVM: ASIDs enabled.
(XEN) SVM: Supported advanced features:
(XEN)  - Nested Page Tables (NPT)
(XEN)  - Last Branch Record (LBR) Virtualisation
(XEN)  - Next-RIP Saved on #VMEXIT
(XEN)  - VMCB Clean Bits
(XEN)  - DecodeAssists
(XEN)  - Virtual VMLOAD/VMSAVE
(XEN)  - Virtual GIF
(XEN)  - Pause-Intercept Filter
(XEN)  - Pause-Intercept Filter Threshold
(XEN)  - TSC Rate MSR
(XEN)  - MSR_SPEC_CTRL virtualisation
(XEN) HVM: SVM enabled
(XEN) HVM: Hardware Assisted Paging (HAP) detected
(XEN) HVM: HAP page sizes: 4kB, 2MB, 1GB
(XEN) Brought up 8 CPUs
(XEN) Scheduling granularity: cpu, 1 CPU per sched-resource
(XEN) xenoprof: Initialization failed. AMD processor family 23 is not supported
(XEN) TSC warp detected, disabling TSC_RELIABLE
(XEN) Dom0 has maximum 1096 PIRQs
(XEN)  Xen  kernel: 64-bit, lsb, compat32
(XEN)  Dom0 kernel: 64-bit, PAE, lsb, paddr 0x1000000 -> 0x4000000
(XEN) PHYSICAL MEMORY ARRANGEMENT:
(XEN)  Dom0 alloc.:   0000000af0000000->0000000af8000000 (1003797 pages to be allocated)
(XEN)  Init. ramdisk: 0000000fac315000->0000000faf1ff84d
(XEN) VIRTUAL MEMORY ARRANGEMENT:
(XEN)  Loaded kernel: ffffffff81000000->ffffffff84000000
(XEN)  Init. ramdisk: 0000000000000000->0000000000000000
(XEN)  Phys-Mach map: 0000008000000000->0000008000800000
(XEN)  Start info:    ffffffff84000000->ffffffff840004b8
(XEN)  Xenstore ring: 0000000000000000->0000000000000000
(XEN)  Console ring:  0000000000000000->0000000000000000
(XEN)  Page tables:   ffffffff84001000->ffffffff84026000
(XEN)  Boot stack:    ffffffff84026000->ffffffff84027000
(XEN)  TOTAL:         ffffffff80000000->ffffffff84400000
(XEN)  ENTRY ADDRESS: ffffffff831781c0
(XEN) Dom0 has maximum 8 VCPUs
(XEN) Initial low memory virq threshold set at 0x4000 pages.
(XEN) Scrubbing Free RAM in background
(XEN) Std. Loglevel: Errors and warnings
(XEN) Guest Loglevel: Nothing (Rate-limited: Errors and warnings)
(XEN) *** Serial input to DOM0 (type 'CTRL-a' three times to switch input)
(XEN) Freed 576kB init memory

Continuing the discussion from Missing packages: kernel-devel / kernel headers for 5.15.x and 5.18.x:

See https://github.com/QubesOS/qubes-issues/issues/7648 and https://github.com/QubesOS/updates-status/issues/2982#issuecomment-1168488839 - it is probably a problem in the latest linux-firmware package; try downgrading it and the problem might go away.

Hi ned, thanks. I missed that issue. But yes, that’s exactly what happens. I believe it’s also the missing Tuxedo-Keyboard bundle. You don’t, by any chance, know where I can get the matching kernel headers?

You mean for dom0? If so, then it seems like kernel-devel and kernel-latest-devel packages are regularly published and you just have to install them with something like this:

sudo qubes-dom0-update --enablerepo=qubes-*testing kernel-latest-devel

The actual package rpms can be seen here: Index of /r4.1/current/dom0/fc32/rpm/ and Index of /r4.1/current-testing/dom0/fc32/rpm/

Yep, that I know, but I don’t get the right version. The installed kernel is 5.18.9-1, while the testing kernel-latest-devel is 5.18.14-1. Installing from the stable repo gives me only 5.11.
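
The mismatch described here can be checked mechanically by comparing the running kernel release with the installed devel packages. A minimal sketch; `check_headers` is a made-up name, and it assumes the devel package's version-release.arch string equals the output of `uname -r`, which looks plausible for the Qubes kernels quoted in this thread but is not guaranteed.

```shell
# Hypothetical helper: $1 is the running kernel release, the remaining
# arguments are version strings of installed devel packages.
check_headers() {
    running="$1"
    shift
    for v in "$@"; do
        if [ "$v" = "$running" ]; then
            echo "match: $v"
            return 0
        fi
    done
    echo "no matching headers for $running"
    return 1
}

# Usage in dom0 (not run here):
#   check_headers "$(uname -r)" $(rpm -q --qf '%{VERSION}-%{RELEASE}.%{ARCH}\n' kernel-devel kernel-latest-devel | grep -v 'is not installed')
```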

Yesterday I created an additional post for the kernel-header issue:
Missing packages: kernel-devel / kernel headers for 5.15.x and 5.18.x. It is also linked at the end of my original post.

I downgraded linux-firmware and it looks good indeed. But Qubes Update / qubes-dom0-update wants to update linux-firmware again. So I tried to figure out how to tell dnf to hold this package back, but couldn’t find any option that would achieve that. Any idea?

qubes-dom0-update ... --exclude=linux-firmware --exclude=linux-firmware-whence
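
Typing the `--exclude` options on every update is easy to forget, so they can be generated from a list. A small sketch using only the `--exclude=` option shown above; `build_excludes` is a hypothetical helper, and it only helps with command-line updates, not with whatever the graphical Qubes Update tool does.

```shell
# Hypothetical helper: turn package names into one --exclude=NAME
# option per line.
build_excludes() {
    for pkg in "$@"; do
        printf '%s\n' "--exclude=$pkg"
    done
}

# Usage in dom0 (not run here); the unquoted $(...) is split on whitespace:
#   sudo qubes-dom0-update $(build_excludes linux-firmware linux-firmware-whence)
```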

Sorry, missed that topic, but I can’t help you with the problem. Maybe open a new issue or comment in https://github.com/QubesOS/updates-status/issues/2996 ?

Okay, no problem. Thanks.