I am experiencing many random crashes on my ThinkPad L14 G2 with a Ryzen 7 PRO 5850U and a no-name USB-C dock. There are 4 monitors: the integrated display, one on the laptop's own HDMI port, and 2 on the dock's HDMI ports. A mouse and keyboard are attached to the dock behind a KVM switch.
System: Qubes 4.1, Kernel: 5.10.104-3
There is no indication of errors in any logs (at least I have not found any, more on this at the end).
When it happens:
Completely unpredictable, no pattern observable. I have not found a reliable way to trigger a crash, but I think it is slightly more likely to happen when starting many qubes as fast as possible right after booting. Or maybe not…
The frequency varies quite a bit. Sometimes it runs stable for many hours, sometimes it crashes within minutes. It crashed 13 times yesterday and 4 times today (so far).
How is it crashing?
Everything freezes for about 2 seconds, then unfreezes for 2 seconds, then the screens go black and the system reboots.
One time all external monitors went black for 1 second, then everything froze.
This did not happen on 4.0 as far as I can tell. I got the dock two weeks before upgrading to 4.1, so there is a very tiny chance of this happening on 4.0 too, but I honestly do not think so.
When booting with the dock attached, I cannot enter my FDE password in the GUI. When switching to a TTY, I see 4 characters already entered there before I type anything. Deleting them and entering my password there works fine, but only with the integrated keyboard. This does not occur without the dock.
Sometimes when booting with the dock attached, the ‘E’ key is virtually stuck in the pressed position. Disconnecting the keyboard or switching the KVM switch interrupts the ‘E-spree’, but it (sometimes) continues on reconnecting/switching back. This happens on roughly 10% of boots.
On 4.0 it was pretty stable, however i observed two weird behaviors:
- System freezing. Very infrequently, usually after around half a month of consecutive uptime.
- Virtually stuck keys: Sometimes keys on the keyboard seemed to get stuck and were pressed indefinitely until I rebooted. This happened about equally often, so roughly twice a month.
(As those problems were that infrequent, I have not investigated the cause.)
- Some naturally broken stuff like network manager sometimes not connecting to the wifi, monitor enumeration changing on reconnect, hibernation not working, crap battery runtime, broken CPU frequency scaling, and so on.
What i have checked
I have a dmesg -Tw window open at all times but was unable to see anything happening before it freezes.
I took a look at journalctl after the crashes, but nothing is logged at the moment of the crashes.
Here is a crash that happened while writing this, followed by the reboot:
Apr 07 17:05:58 dom0 qrexec-policy-daemon: qrexec: whonix.SdwdateStatus+: sys-whonix -> disp9438: allowed to disp9438
Apr 07 17:05:58 dom0 qrexec-policy-daemon: qrexec: whonix.SdwdateStatus+: sys-whonix -> disp3981: allowed to disp3981
Apr 07 17:05:58 dom0 qrexec-policy-daemon: qrexec: whonix.SdwdateStatus+: sys-whonix -> disp5343: allowed to disp5343
-- Reboot --
Apr 07 17:07:00 dom0 kernel: Linux version 5.10.104-3.fc32.qubes.x86_64 (mockbuild@build-fedora4) (gcc (GCC) 10.3.1 20210422 (Red Hat 10.3.1-1), GNU ld version 2.34-6.fc32) #1 SMP Fri Mar 1>
Apr 07 17:07:00 dom0 kernel: Command line: placeholder root=/dev/mapper/qubes_dom0-root ro rd.luks.uid=luks-<scrubbed> rd.lvm.lv=qubes_dom0/root rd.lvm.lv=qubes_d>
Apr 07 17:07:00 dom0 kernel: x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers'
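To compare what was logged right before each of the many crashes, something like the following could pull the last few journal lines ahead of every reboot marker. This is only a rough sketch: the function name `lines_before_reboots` and its parameters are made up for illustration, it expects journalctl output saved as plain text, and it assumes the separator is printed as "-- Reboot --" like in the excerpt above (other journalctl versions may print a different separator).

```python
from collections import deque


def lines_before_reboots(lines, context=3, marker="-- Reboot --"):
    """Return, for each reboot marker, the `context` lines logged just before it.

    `lines` is any iterable of journal lines (for example an open text file).
    The marker string is an assumption taken from the excerpt in this post.
    """
    recent = deque(maxlen=context)  # sliding window of the latest lines
    result = []
    for line in lines:
        line = line.rstrip("\n")
        if line.strip() == marker:
            result.append(list(recent))
            recent.clear()  # start fresh for the next boot
        else:
            recent.append(line)
    return result


# Small in-memory example modeled on the excerpt above.
sample = [
    "Apr 07 17:05:58 dom0 qrexec-policy-daemon: qrexec: whonix.SdwdateStatus+: sys-whonix -> disp3981: allowed to disp3981",
    "Apr 07 17:05:58 dom0 qrexec-policy-daemon: qrexec: whonix.SdwdateStatus+: sys-whonix -> disp5343: allowed to disp5343",
    "-- Reboot --",
    "Apr 07 17:07:00 dom0 kernel: x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating point registers'",
]

for i, tail in enumerate(lines_before_reboots(sample, context=2), start=1):
    print(f"--- before reboot {i} ---")
    for entry in tail:
        print(entry)
```

If the same message shows up before most reboots, that would at least be a pattern to look into; if the tails are unrelated (as they seem to be so far), the crash is probably happening below the level dom0 can log.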
I tried finding the Xen logs (tl dmesg command) but got "command not found". Here is where I got the idea from, maybe I am doing it wrong though?
I tried to read /var/log/xen/xend.log as mentioned here, but it is not there.
In /var/log/xen/console/hypervisor.log there is nothing special, but there are some IO page faults (those are normal, right? I have a lot of those). All of them look like the ones in this example, which covers the crash and the reboot:
[2022-04-07 17:04:23] (XEN) AMD-Vi: IO_PAGE_FAULT: 0000:03:00.0 d1 addr fffffffdf8000000 flags 0x8 I
[2022-04-07 17:04:59] (XEN) AMD-Vi: IO_PAGE_FAULT: 0000:03:00.0 d1 addr fffffffdf8000000 flags 0x8 I
[2022-04-07 17:05:38] (XEN) AMD-Vi: IO_PAGE_FAULT: 0000:03:00.0 d1 addr fffffffdf8000000 flags 0x8 I
^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^>
[2022-04-07 17:09:15] (XEN) Built-in command line: ept=exec-sp
[2022-04-07 17:09:15] (XEN) parameter "no-real-mode" unknown!
[2022-04-07 17:09:15] Xen 4.14.4
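Since there are a lot of these faults, a quick tally per device, domain and address might show whether it is always the same device (0000:03:00.0 in the lines above) or whether something else appears around the crashes. The following is a minimal sketch, not a proper log parser: the regular expression is only modeled on the three fault lines shown here, and the name `tally_io_page_faults` is made up for illustration.

```python
import re
from collections import Counter

# Pattern modeled on the fault lines in this post; other Xen versions
# may format these messages differently (assumption).
FAULT_RE = re.compile(
    r"AMD-Vi: IO_PAGE_FAULT: (?P<device>[0-9a-fA-F:.]+) "
    r"(?P<domain>d\d+) addr (?P<addr>[0-9a-fA-F]+)"
)


def tally_io_page_faults(lines):
    """Count IO_PAGE_FAULT entries per (device, domain, address)."""
    counts = Counter()
    for line in lines:
        match = FAULT_RE.search(line)
        if match:
            counts[(match["device"], match["domain"], match["addr"])] += 1
    return counts


# The three fault lines from the excerpt above, plus one unrelated line.
sample = [
    "[2022-04-07 17:04:23] (XEN) AMD-Vi: IO_PAGE_FAULT: 0000:03:00.0 d1 addr fffffffdf8000000 flags 0x8 I",
    "[2022-04-07 17:04:59] (XEN) AMD-Vi: IO_PAGE_FAULT: 0000:03:00.0 d1 addr fffffffdf8000000 flags 0x8 I",
    "[2022-04-07 17:05:38] (XEN) AMD-Vi: IO_PAGE_FAULT: 0000:03:00.0 d1 addr fffffffdf8000000 flags 0x8 I",
    "[2022-04-07 17:09:15] (XEN) Built-in command line: ept=exec-sp",
]

for (device, domain, addr), n in tally_io_page_faults(sample).most_common():
    print(f"{n:4d}  {device} {domain} addr {addr}")
```

To run it on the real file, the list could be replaced with an open file handle, for example `open("/var/log/xen/console/hypervisor.log", errors="replace")`, so that the NUL bytes written at crash time do not break decoding.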
So what is next?
Maybe it has something to do with the dock, but in that case I would expect this behavior to have happened on 4.0 as well.
Maybe my RAM is bad? But again: I would have noticed many more crashes and freezes on 4.0 then.
I will try to run it without the dock and see how this goes. If it still fails, I will run a memtest.
In the meantime, is there anything more I can do/troubleshoot?
Thanks for your help in advance