libera/#devuan/ Friday, 2024-03-22

amarsh04git bisecting a problem with hardware MIDI playback that started with kernel 6.5-rc1 kernels02:59
systemdleteI have a daedalus VM with a frozen desktop.  I can ssh into the VM from the host, and I can run htop, etc.  The desktop in the VM simply doesn't take input.  The mouse, however, does move normally.  What should I look at first?22:21
systemdleteI have another daedalus VM on a different host that does not freeze.22:22
systemdleteBoth hosts have 32GB ram, but one is a FX8350 and the other has an Athlon II X6.22:23
systemdleteThe host does not seem to have any noticeable problems with its own desktop.  The VM's desktop will run for many hours, then freeze.22:24
systemdleteWhile ssh'd into the daedalus VM, df shows me there is plenty of disk space.  And htop does not seem to be screaming any pain.22:25
gnarfacethe frozen xorg instance isn't taking up any cpu?22:26
systemdletegnarface, htop shows xorg using 2.0 MB and 0 cpu22:28
gnarfacesystemdlete: what about its children?22:28
gnarfacenot sure but i suspect something running inside that xorg instance is hung and needs to be killed, then the desktop will unfreeze22:29
systemdleteone child, using same cpu and mem, child marked D22:29
gnarfacehmmm22:30
systemdletemaybe restart xorg?22:30
gnarfaceyea that doesn't seem like enough children22:30
systemdleteor will that cause widespread mayhem22:30
gnarfaceusually you should have a window manager and some other stuff...22:30
gnarfacemine has one child that's just another Xorg instance22:31
systemdletexfwm4 is running22:31
gnarfacebut nothing inside it?22:32
gnarfaceno actual programs just an empty desktop?22:32
systemdletelooks like xfwm4 has 11 children22:33
systemdleteI have several shell windows open, but only 1 is max'd the others are minimized22:34
gnarfaceit's gotta be one of them, not necessarily a maxxed one22:34
systemdletebtw, I did that specifically to run a test on this problem.  I wanted to eliminate things like browsers etc that can mess things up22:34
systemdleteso maybe try killing one at a time?22:34
gnarfaceyea22:35
systemdleteI will start with the highest numbered pid and work backwards22:36
systemdletewhoa.22:37
systemdleteSIGTERM didn't work, so I tried SIGHUP.  That didn't work either.22:37
systemdleteSo I sent that pid SIGKILL, but it killed off all of them apparently.22:38
systemdleteVM desktop is still frozen though22:38
gnarfacebrutal22:39
systemdleteI wonder, gnarface, if maybe I should try killing off the windows themselves22:39
gnarfacemaybe yea22:39
systemdletestand by have to install wmctrl22:40
gnarfaceyou killed the processes, the processes are all gone, and the windows from them are still there?22:40
gnarfacethat's not right...22:41
gnarfacethat suggests xfce itself froze22:41
gnarfaceor went out to lunch somehow22:41
systemdletewell, the desktop still has not refreshed22:41
systemdleteso check panera bread?22:41
systemdlete:D22:41
systemdlete(sorry, getting punchy here.  Been scratching my head for days over this.)22:42
gnarfacetry perf top and radentop?22:42
systemdletenot familiar with those, but I can install them22:42
gnarfaceit's amd gpu too right?22:43
gnarfaceor is it nvidia?22:43
gnarfaceradeontop won't help for nvidia22:43
systemdletehold on...22:43
gnarfacebut perf might still22:43
systemdleteM5A78L/USB3 board iirc22:44
systemdleteand no external graphics card, so its all amd22:44
systemdlete(yes)22:44
* systemdlete looking through system logs inside VM for clues...22:46
gnarfacecheck what size the xorg log is22:46
gnarfacesee if it's got a lot of repeating warnings22:46
gnarface(or errors)22:47
systemdleteuh-oh...22:47
systemdlete.xsession-errors is dated Mar 3 and xorg log Mar 4.   I've had to do the nasty to reboot the VM, so probably those files were not updated22:47
systemdletewhich makes me wonder22:48
* systemdlete checks to see if file systems are mounted ro instead of rw22:48
systemdletenope.  FS are all mounted rw.22:48
systemdlete(was just a thought)22:48
gnarfaceyou're looking in ~/.local/share/xorg/ ?22:48
systemdleteoh, sorry no.  Was looking at /var/log/Xorg...22:49
systemdleteah, now that one is dated Mar 1422:49
systemdleteuptime is 8 days22:50
gnarfacebeing not nvidia, i would assume it would have migrated to running as the user instead of suid root, which would have moved the log to ~/.local/share/xorg/22:50
gnarfacethis is a new change22:50
systemdlete?22:50
gnarfacepretty much everything except nvidia runs xorg as the user now22:50
gnarfaceso the logs go in the home dir instead of /var/log/22:51
systemdleteI launch xfce from command line after log in.22:51
gnarfaceeven if you run startx, nvidia drivers are still wired to start as suid root, afaik22:51
systemdleteWas having a lot of problems with WMs22:51
systemdletewell, apparently that is not the problem here, as you said22:51
gnarfaceyes22:52
gnarfaceis that Xorg log very large?22:52
systemdleteAll I meant is that xorg is running as user22:52
systemdletethe one in ~/.local/share/xorg is 27132 bytes22:52
gnarfacenah that's not a problem then22:52
systemdletelast 2 messages are (EE) No surface to present from.22:53
gnarfaceoh, that's a problem22:53
gnarfaceare there more errors before that?22:53
systemdletelots of messages, one other error earlier:  "(EE) open /dev/fb0: Permission denied"22:54
systemdletebut that was at 52 secs or so, and the desktop had been working fine for a few days22:55
gnarfacea couple errors might be normal while auto-detect brute-force fails its way through every driver until it finds one that works22:55
gnarfacethe last one where it says your surface has gone missing though, that seems like a smoking gun22:55
gnarfacethis makes it seem more like a driver issue22:56
systemdleteThose are at 60 secs in22:56
systemdleteoooh22:56
systemdletemaybe I need to install some specific FW for this board?22:57
systemdlete(main board)22:57
gnarfacepossible, or maybe just make it use a different xorg driver22:57
systemdletehmmm.  I never installed firmware-amd-graphics22:59
systemdletedo I need that in a VM?22:59
gnarfaceno, i doubt that22:59
gnarfacebut you might need some vm drivers23:00
systemdleteThe VM drivers are installed23:00
gnarfacevirtio or something like that23:00
rustyaxeya fbdev is quite old, do any modern drivers use that?23:00
gnarfacearm hardware still maybe23:00
rustyaxeI think thats just xorg probing for the display and trying an old device (fbdev still exists after all but you probably dont want to use it instead of a more optimized driver)23:00
systemdleterustyaxe, gnarface:  Keep in mind this is 15+ year old tech.  AM3 platforms23:01
systemdleteusing a built-in video fw23:01
rustyaxefbdev still predates that23:01
rustyaxeyou're passing the video through to the vm? or using emulated video?23:02
rustyaxeThat'll decide which driver the guest needs23:02
systemdletewell, I'm wondering if parts of the system are starting to drop support for "older" hardware, esp. if the drivers from them might not be quite up  to the standard for u-know-what23:02
systemdletevirtualbox, using kvm virtualization23:03
rustyaxewe still have phenom ii machines running fine23:03
rustyaxeso no the guest shouldnt need the amd graphics stuff as it wont be talking to it, but rather the emulated video card23:03
systemdletebtw, there is just the host and two VMs, and one of the VMs is small (under 1MB)23:04
systemdletecool.23:04
gnarfaceif it's like qemu, you'll want to make sure you're loading the virtual driver modules inside the guest23:04
gnarfacei forget if you'll need to set xorg.conf too23:04
systemdleteso maybe gnarface's suggestion to switch to a different xorg driver?23:04
rustyaxeyea you can likely select which video card is emulated which will change which driver in the guest you need23:04
rustyaxeI dont use virtualbox, rather proxmox and virt-manager where needful, but im sure its similar to them23:05
systemdleteI have the VM set to use VMSVGA, which is the one recommended for most VMs23:05
systemdletevideo memory is 128K23:06
systemdleteooops23:06
systemdlete128M23:06
systemdleteand the VM has 8GB RAM23:06
gnarfacedoes it have its own system clock or do they all use the host system clock?23:07
systemdleteI have all of my VMs and hosts using one NTP server, which in turn uses an upstream NTP server23:08
gnarfacegrasping at straws here, but maybe emulated clock drift could be destabilizing it?23:08
systemdletegood point23:08
systemdletelet me see if it is off23:08
systemdleteno, not by more than a second or so23:08
systemdletebut good idea to check that23:08
systemdletetime sync can be a hazard, esp for network communications23:09
gnarfaceanother daedalus change was the forced migration to ntpsec from ntp, and in the merge of the new example ntp.conf, you might have, like me, accidentally inherited a "...minsec 3" line which, if you're using just one ntp server, will cause it to ignore that server23:09
gnarfaceand then it will drift if it doesn't have a real clock23:09
systemdleteVbox does provide a clock, but as far as I know, I don't use that (except for sync'ing up at VM boot).23:10
gnarfaceah23:10
systemdleteright23:10
gnarfacewith qemu i tell it to use the host's clock because when left to its own devices it screws up23:10
systemdletedo you mean minsane?  I have that set to 123:10
gnarfaceyea, meant minsane, sorry23:11
systemdletenp.  I knew what you meant.23:11
gnarfaceyes, it should be 123:11
systemdleteof course, I'm no longer sure just how much any of this means now that browsers and maybe other programs are using NTP over HTTP or something23:11
systemdleteat any rate, I am not noticing any huge amount of drift, at least not in this case23:13
systemdletealthough, maybe at the very moment that the freeze begins, there might be a lag.  The only problem with that theory, is that I have other VMs that do not have desktops freezing intermittently23:14
systemdleteI have developed an extensive checklist of gotchas for new VMs and hosts, exactly for this reason.   Updating the minsane value is just one of dozens23:16
gnarfacedo you have any shared mounts with the VMs?23:17
systemdleteyes!23:17
gnarfacei wonder if it could be file contention in a shared mount23:17
systemdleteI have a LAN server and just about every host and VM is normally mounted to it.23:17
gnarfacecache directory or something maybe...?23:17
systemdleteI use that as a sort of "clipboard" for passing files and data back and forth23:18
rustyaxeA hazard?23:18
rustyaxeYou mean a must23:18
systemdleteI'm not using it to boot from, nor for any ongoing file operations.  Just to transfer files around.23:18
rustyaxeMany network protocols wont work without good time sync23:19
systemdleterustyaxe, yes.  I have noticed that!23:19
rustyaxeGenerally just throw chrony in them and it'll do the right thing23:19
systemdleteN.B.:  elogind-daemon is in "D" state and does not seem to ever change.23:23
gnarfacedo you actually need elogind if you're starting Xorg with startx?23:24
systemdleteno, probably not.  I think it is an artifact of when I used to login using a WM23:25
systemdleteok, I have disabled elogind23:26
systemdleteI could kill off the remaining processes, one by one, hoping to narrow it down a bit.23:28
systemdleteBut if I kill them in the wrong order, it could kill dependent processes off also, so I won't get an accurate fix.23:28
systemdletenow this is whacked.  "service elogind stop" but elogind-daemon is still running23:28
gnarfacesuspicious23:28
systemdleteit won't die23:29
systemdletekill -KILL 1885 (pid of elogind daemon) doesn't do anything23:29
gnarfacegive it a few23:29
systemdleteits parent is PID 123:29
gnarfacemaybe Xorg is gonna have to be killed23:29
gnarfacei would still try to kill the running programs under xfce first though23:30
systemdletethere is still a xfce4-terminal running23:31
systemdleteI thought I'd killed that with the shells23:31
systemdletestopped dbus, but dbus-launch still running--is that right?23:35
systemdletedesktop is still frozen at this point23:35
systemdleteeven though I've killed off xfce4-* processes23:35
systemdlete(I am shelled into the VM as root, incidientally)23:36
systemdletekilled off at-spi-bus-launcher and now dbus-daemon is gone23:37
systemdleteand desktop still frozen23:37
gnarfaceis the window manager process still there?23:39
systemdleteps -ef |grep wm shows nothing23:40
systemdleteok, killed off all the vbox client processes, still frozen23:40
systemdletexinit, Xorg, xfsettingsd, and xfdesktop still running23:43

Generated by irclog2html.py 2.17.0 by Marius Gedminas - find it at https://mg.pov.lt/irclog2html/!