Bug#931267: times out and drops into useless emergency shell with fsck still ongoing
Michael Biebl
biebl at debian.org
Sun Jun 30 06:12:33 BST 2019
Control: tags -1 + moreinfo
Hi Steve!
Am 30.06.19 um 01:46 schrieb Steve McIntyre:
> [ Ignore the system info etc. below - I'm running reportbug on a
> different system. ]
>
> I've just dist-upgraded my headless home firewall/server from Stretch
> to Buster. I did the usual task of config file merging. and then
> rebooted the machine. It didn't come up on the network again
> afterwards.
>
> After rummaging around to connect a serial cable, there was no
> interaction on the console so I rebooted again. Now I see that the
> machine is running a fsck on the multiple large filesystems (Debian
> mirror, video/audio data etc.) Fine - the machine had not been
> rebooted in a long time, so the fsck was overdue. Then I see that
> systemd has decided to time out startup of services and drop me into
> an Emergency shell. With fsck going on and writing to the console, I
> cannot useful interact with the shell. The fsck completed
> successfully, but I had a headless machine that still needed
> interaction to make it work again.
>
> Several points/questions:
>
> * Why on earth do things have a short timeout when fsck is still
> running? It's normal for fsck on a large fs to take a long time,
> and this should not break bootup. Especially not on a headless box.
This is odd. /lib/systemd/system/systemd-fsckd at .service uses
"TimeoutSec=0", so it should not timeout.
I quickly gave this a test. See
https://people.debian.org/~biebl/boot.mp4
I artifically modified systemd-fsckd at .service to sleep for 200s to
simulate a long fsck, I chose this deliberate to be > 180s, which is the
default systemd timeout for services.
As you can see, after 30s the eye-of-cylon animation kicks in and after
300s the boot proceeds and successfully completes.
This is even a more elaborate setup with LVM and stuff.
So I guess we need more information to debug this. Do you remember which
job timed out? Do you have any logs from this failed boot which could
give a clue which service triggered the start of the emergency shell?
> * Why does systemd try to start an Emergency shell on an already-busy
> console? This is *not* useful.
The default output goes to /dev/console, i.e. the currently active. I
think what you are seeing here is another manifestation of
https://github.com/systemd/systemd/issues/1840
> * I haven't tried to reproduce this, but the initial interaction on
> the console seemed to show a hung machine. I had no useful
> interaction there. Is the Emergency shell setup meant to prompt
> again on password failure if I just hit <enter> several times?
It should, yes.
--
Why is it that all of the instruments seeking intelligent life in the
universe are pointed away from Earth?
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 833 bytes
Desc: OpenPGP digital signature
URL: <http://alioth-lists.debian.net/pipermail/pkg-systemd-maintainers/attachments/20190630/d12e9c3b/attachment.sig>
More information about the Pkg-systemd-maintainers
mailing list