So far, I’ve got two total freezes last week. In the first case, there were nothing in journalctl IIRC. In the second case, there was some info about soft lockup:
Nov 23 17:13:39 dom0 kernel: xen-blkback: backend/vbd/45/51712: using 2 queues, protocol 1 (x86_64-abi) persistent grants
Nov 23 17:13:39 dom0 kernel: xen-blkback: backend/vbd/45/51728: using 2 queues, protocol 1 (x86_64-abi) persistent grants
Nov 23 17:13:39 dom0 kernel: xen-blkback: backend/vbd/45/51744: using 2 queues, protocol 1 (x86_64-abi) persistent grants
Nov 23 17:14:04 dom0 kernel: watchdog: BUG: soft lockup - CPU#2 stuck for 22s! [Xorg:3761]
Nov 23 17:14:04 dom0 kernel: Modules linked in: snd_seq_dummy snd_hrtimer nct6775 nct6775_core hwmon_vid lm83 jc42 vfat fat snd_hda_codec_realtek snd_hda_codec_generic ledtrig_aud
io snd_hda_codec_hdmi intel_rapl_msr snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi intel_rapl_common snd_hda_codec snd_hda_core snd_hwdep snd_seq snd_seq_device joydev snd_pcm
snd_timer wmi_bmof pcspkr snd r8169 soundcore k10temp i2c_piix4 gpio_amdpt gpio_generic loop fuse xenfs dm_thin_pool dm_persistent_data dm_bio_prison dm_crypt amdgpu amdxcp iommu
_v2 drm_buddy gpu_sched hid_elecom radeon crct10dif_pclmul crc32_pclmul crc32c_intel polyval_clmulni polyval_generic drm_ttm_helper ttm video i2c_algo_bit drm_suballoc_helper drm_
display_helper xhci_pci ghash_clmulni_intel xhci_pci_renesas sha512_ssse3 cec ccp nvme xhci_hcd sp5100_tco nvme_core nvme_common wmi xen_acpi_processor xen_privcmd xen_pciback xen
_blkback xen_gntalloc xen_gntdev xen_evtchn scsi_dh_rdac scsi_dh_emc scsi_dh_alua uinput dm_multipath i2c_dev
Nov 23 17:14:04 dom0 kernel: CPU: 2 PID: 3761 Comm: Xorg Not tainted 6.5.10-1.qubes.fc37.x86_64 #1
Nov 23 17:14:04 dom0 kernel: Hardware name: ASUS System Product Name/TUF GAMING B550-PLUS, BIOS 3404 10/07/2023
Nov 23 17:14:04 dom0 kernel: RIP: e030:smp_call_function_many_cond+0x121/0x4f0
Nov 23 17:14:04 dom0 kernel: Code: 63 d0 e8 d2 e5 61 00 3b 05 2c 53 d1 01 73 25 48 63 d0 49 8b 37 48 03 34 d5 00 eb 9c 82 8b 56 08 83 e2 01 74 0a f3 90 8b 4e 08 <83> e1 01 75 f6 83 c0 01 eb c1 48 83 c4 40 5b 5d 41 5c 41 5d 41 5e
Nov 23 17:14:04 dom0 kernel: RSP: e02b:ffffc9004518f828 EFLAGS: 00000202
Nov 23 17:14:04 dom0 kernel: RAX: 0000000000000000 RBX: 0000000000000208 RCX: 0000000000000011
Nov 23 17:14:04 dom0 kernel: RDX: 0000000000000001 RSI: ffff888134c3bb80 RDI: ffff888100075ee0
Nov 23 17:14:04 dom0 kernel: RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000001
Nov 23 17:14:04 dom0 kernel: R10: 0000000000007ff0 R11: 0000000000000000 R12: ffff888134cb5100
Nov 23 17:14:04 dom0 kernel: R13: 0000000000000001 R14: 0000000000000002 R15: ffff888134cb5100
Nov 23 17:14:04 dom0 kernel: FS: 00007150d2fc4a80(0000) GS:ffff888134c80000(0000) knlGS:0000000000000000
Nov 23 17:14:04 dom0 kernel: CS: e030 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 23 17:14:04 dom0 kernel: CR2: 00007c7d0a39e020 CR3: 0000000128e2c000 CR4: 0000000000050660
Nov 23 17:14:04 dom0 kernel: Call Trace:
Nov 23 17:14:04 dom0 kernel: <IRQ>
Nov 23 17:14:04 dom0 kernel: ? watchdog_timer_fn+0x1b8/0x220
Nov 23 17:14:04 dom0 kernel: ? __pfx_watchdog_timer_fn+0x10/0x10
Nov 23 17:14:04 dom0 kernel: ? __hrtimer_run_queues+0x112/0x2b0
Nov 23 17:14:04 dom0 kernel: ? hrtimer_interrupt+0xf8/0x230
Nov 23 17:14:04 dom0 kernel: ? xen_timer_interrupt+0x22/0x30
Nov 23 17:14:04 dom0 kernel: ? __handle_irq_event_percpu+0x4a/0x1a0
Nov 23 17:14:04 dom0 kernel: ? handle_irq_event_percpu+0x13/0x40
Nov 23 17:14:04 dom0 kernel: ? handle_percpu_irq+0x3b/0x60
Nov 23 17:14:04 dom0 kernel: ? handle_irq_desc+0x3e/0x50
Nov 23 17:14:04 dom0 kernel: ? __evtchn_fifo_handle_events+0x1b4/0x1e0
Nov 23 17:14:04 dom0 kernel: ? __xen_evtchn_do_upcall+0x65/0xb0
Nov 23 17:14:04 dom0 kernel: ? __xen_pv_evtchn_do_upcall+0x21/0x30
Nov 23 17:14:04 dom0 kernel: ? xen_pv_evtchn_do_upcall+0x85/0xb0
Nov 23 17:14:04 dom0 kernel: </IRQ>
Nov 23 17:14:04 dom0 kernel: <TASK>
Nov 23 17:14:04 dom0 kernel: ? exc_xen_hypervisor_callback+0x8/0x20
Nov 23 17:14:04 dom0 kernel: ? smp_call_function_many_cond+0x121/0x4f0
Nov 23 17:14:04 dom0 kernel: ? smp_call_function_many_cond+0xfe/0x4f0
Nov 23 17:14:04 dom0 kernel: ? __pfx_do_flush_tlb_all+0x10/0x10
Nov 23 17:14:04 dom0 kernel: on_each_cpu_cond_mask+0x24/0x40
Nov 23 17:14:04 dom0 kernel: __purge_vmap_area_lazy+0xd6/0x7d0
Nov 23 17:14:04 dom0 kernel: ? srso_alias_return_thunk+0x5/0x7f
Nov 23 17:14:04 dom0 kernel: ? xa_find+0x90/0xe0
Nov 23 17:14:04 dom0 kernel: _vm_unmap_aliases+0x264/0x2d0
Nov 23 17:14:04 dom0 kernel: change_page_attr_set_clr+0xb4/0x1a0
Nov 23 17:14:04 dom0 kernel: _set_pages_array+0xc3/0x110
Nov 23 17:14:04 dom0 kernel: ttm_pool_alloc+0x410/0x540 [ttm]
Nov 23 17:14:04 dom0 kernel: ttm_tt_populate+0xa1/0x130 [ttm]
Nov 23 17:14:04 dom0 kernel: ttm_bo_handle_move_mem+0x162/0x170 [ttm]
Nov 23 17:14:04 dom0 kernel: ttm_bo_validate+0xe5/0x180 [ttm]
Nov 23 17:14:04 dom0 kernel: ? srso_alias_return_thunk+0x5/0x7f
Nov 23 17:14:04 dom0 kernel: ttm_bo_init_reserved+0x146/0x170 [ttm]
Nov 23 17:14:04 dom0 kernel: ttm_bo_init_validate+0x5a/0xe0 [ttm]
Nov 23 17:14:04 dom0 kernel: ? __pfx_radeon_ttm_bo_destroy+0x10/0x10 [radeon]
Nov 23 17:14:04 dom0 kernel: radeon_bo_create+0x153/0x1e0 [radeon]
Nov 23 17:14:04 dom0 kernel: ? __pfx_radeon_ttm_bo_destroy+0x10/0x10 [radeon]
Nov 23 17:14:04 dom0 kernel: radeon_gem_object_create+0xb7/0x1c0 [radeon]
Nov 23 17:14:04 dom0 kernel: ? ____sys_recvmsg+0xf5/0x1d0
Nov 23 17:14:04 dom0 kernel: radeon_gem_create_ioctl+0x77/0x130 [radeon]
Nov 23 17:14:04 dom0 kernel: ? __pfx_radeon_gem_create_ioctl+0x10/0x10 [radeon]
Nov 23 17:14:04 dom0 kernel: drm_ioctl_kernel+0xcd/0x170
Nov 23 17:14:04 dom0 kernel: drm_ioctl+0x267/0x4a0
Nov 23 17:14:04 dom0 kernel: ? __pfx_radeon_gem_create_ioctl+0x10/0x10 [radeon]
Nov 23 17:14:04 dom0 kernel: radeon_drm_ioctl+0x4d/0x80 [radeon]
Nov 23 17:14:04 dom0 kernel: __x64_sys_ioctl+0x97/0xd0
Nov 23 17:14:04 dom0 kernel: do_syscall_64+0x5f/0x90
Nov 23 17:14:04 dom0 kernel: ? do_syscall_64+0x6b/0x90
Nov 23 17:14:04 dom0 kernel: ? srso_alias_return_thunk+0x5/0x7f
Nov 23 17:14:04 dom0 kernel: ? do_syscall_64+0x6b/0x90
Nov 23 17:14:04 dom0 kernel: ? exit_to_user_mode_prepare+0xa7/0xd0
Nov 23 17:14:04 dom0 kernel: entry_SYSCALL_64_after_hwframe+0x6e/0xd8
Nov 23 17:14:04 dom0 kernel: RIP: 0033:0x7150d36b9e0f
Nov 23 17:14:04 dom0 kernel: Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 05 <89> c2 3d 00 f0 ff ff 77 18 48 8b 44 24 18 64 48 2b 04 25 28 00 00
Nov 23 17:14:04 dom0 kernel: RSP: 002b:00007ffe1404ad10 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Nov 23 17:14:04 dom0 kernel: RAX: ffffffffffffffda RBX: 0000000000000002 RCX: 00007150d36b9e0f
Nov 23 17:14:04 dom0 kernel: RDX: 00007ffe1404ade0 RSI: 00000000c020645d RDI: 0000000000000019
Nov 23 17:14:04 dom0 kernel: RBP: 00007ffe1404ade0 R08: 0000000000000011 R09: 0000000000000010
Nov 23 17:14:04 dom0 kernel: R10: 0000000000000002 R11: 0000000000000246 R12: 00000000c020645d
Nov 23 17:14:04 dom0 kernel: R13: 0000000000000019 R14: 0000000000080000 R15: 00007150c810a010
Nov 23 17:14:04 dom0 kernel: </TASK>
There are more messages like this, not sure if they are interesting.
Any ideas?
EDIT: I didn’t have swap. Can this be a root of the issue?