• Bug#1102160: upgrade-reports: Bookworm to Trixie [amd64][EFI] initramfs

    From =?UTF-8?Q?Martin=2D=C3=89ric_Racine@21:1/5 to All on Mon Apr 7 07:40:01 2025
    On Sun, 6 Apr 2025 19:36:03 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    On Sun, 6 Apr 2025 13:02:29 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    On Sun, 6 Apr 2025 10:00:45 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    On Sat, 05 Apr 2025 22:56:46 +0300 =?utf-8?q?Martin-=C3=89ric_Racine?= <[email protected]> wrote:
    Package: upgrade-reports
    Severity: important
    X-Debbugs-Cc: [email protected]

    After upgrading an amd64 host from Bookworm to Trixie (2025-04-05 EEST), on reboot, the host gets a kernel panic with:

    initramfs unpacking failed invalid magic at start of compressed archive

    I just checked whether I can rescue the host using the Trixie mini.iso
    on a USB stick:

    Debian GNU/Linux 13 (trixie) amd64 - netboot mini.iso 20241227

    I get the same kernel panic as soon as GRUB loads the kernel.

    If I try with the Bookworm mini.iso, the kernel boots just fine and
    gets me to the rescue menu.

    Debian GNU/Linux 12 (bookworm) amd64 - netboot mini.iso 20230607+deb12u9

    However, using the Bookworm rescue mode is a PITA, because it doesn't know how to mount brtfs sub-volumes (not even with Debian's default @rootfs) so even just chroot'ing myself in using the above recipe requires a LOT of command typing inside the rescue mode's shell.

    As a further test, I've tried booting the following Trixie USB stick
    in BIOS mode via the rescue menu:

    Debian GNU/Linux trixie-DI-alpha1 "Trixie" - Official Alpha amd64
    NETINST with firmware 20241230-11:26

    This succesfully gets me to the rescue process and allows me to chroot
    into the host after typing the long commandline series above.

    Basically, a/b testing shows that Trixie kernels per-se work fine, but something fishy is going on with the grub-efi-amd64 in Trixie. The
    same host was originally installed with a Bookworm mini.iso on USB and booted fine via EFI, so the problem really does seem to be the grub-efi-amd64 in Trixie.

    Given the above, reassigned to bin:grub-efi-amd64.

    And, after checking, bumping severity to Critical, since this bug
    prevents common hardware (ASUS P8H61 motherboard via UEFI) from
    booting.

    Martin-Éric

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From =?UTF-8?Q?Martin=2D=C3=89ric_Racine@21:1/5 to All on Sun Apr 13 09:40:01 2025
    On Mon, 7 Apr 2025 08:28:01 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    On Sun, 6 Apr 2025 19:36:03 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    On Sun, 6 Apr 2025 13:02:29 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    On Sun, 6 Apr 2025 10:00:45 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    On Sat, 05 Apr 2025 22:56:46 +0300 =?utf-8?q?Martin-=C3=89ric_Racine?= <[email protected]> wrote:
    Package: upgrade-reports
    Severity: important
    X-Debbugs-Cc: [email protected]

    After upgrading an amd64 host from Bookworm to Trixie (2025-04-05 EEST), on reboot, the host gets a kernel panic with:

    initramfs unpacking failed invalid magic at start of compressed archive

    I just checked whether I can rescue the host using the Trixie mini.iso on a USB stick:

    Debian GNU/Linux 13 (trixie) amd64 - netboot mini.iso 20241227

    I get the same kernel panic as soon as GRUB loads the kernel.

    If I try with the Bookworm mini.iso, the kernel boots just fine and gets me to the rescue menu.

    Debian GNU/Linux 12 (bookworm) amd64 - netboot mini.iso 20230607+deb12u9

    However, using the Bookworm rescue mode is a PITA, because it doesn't know how to mount brtfs sub-volumes (not even with Debian's default @rootfs) so even just chroot'ing myself in using the above recipe requires a LOT of command typing inside the rescue mode's shell.

    As a further test, I've tried booting the following Trixie USB stick
    in BIOS mode via the rescue menu:

    Debian GNU/Linux trixie-DI-alpha1 "Trixie" - Official Alpha amd64
    NETINST with firmware 20241230-11:26

    This succesfully gets me to the rescue process and allows me to chroot into the host after typing the long commandline series above.

    Basically, a/b testing shows that Trixie kernels per-se work fine, but something fishy is going on with the grub-efi-amd64 in Trixie. The
    same host was originally installed with a Bookworm mini.iso on USB and booted fine via EFI, so the problem really does seem to be the grub-efi-amd64 in Trixie.

    Given the above, reassigned to bin:grub-efi-amd64.

    And, after checking, bumping severity to Critical, since this bug
    prevents common hardware (ASUS P8H61 motherboard via UEFI) from
    booting.

    Is there any way I can help the maintainers pinpoint the source of the problem?

    Martin-Éric

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From =?UTF-8?Q?Martin=2D=C3=89ric_Racine@21:1/5 to All on Wed Apr 16 14:30:02 2025
    On Wed, 16 Apr 2025 14:38:14 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    On Sun, 13 Apr 2025 10:23:17 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    On Mon, 7 Apr 2025 08:28:01 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    On Sun, 6 Apr 2025 19:36:03 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    On Sun, 6 Apr 2025 13:02:29 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]> wrote:
    On Sun, 6 Apr 2025 10:00:45 +0300 =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]> wrote:
    On Sat, 05 Apr 2025 22:56:46 +0300 =?utf-8?q?Martin-=C3=89ric_Racine?=
    <[email protected]> wrote:
    Package: upgrade-reports
    Severity: important
    X-Debbugs-Cc: [email protected]

    After upgrading an amd64 host from Bookworm to Trixie (2025-04-05 EEST), on reboot, the host gets a kernel panic with:

    initramfs unpacking failed invalid magic at start of compressed archive

    I just checked whether I can rescue the host using the Trixie mini.iso
    on a USB stick:

    Debian GNU/Linux 13 (trixie) amd64 - netboot mini.iso 20241227

    I get the same kernel panic as soon as GRUB loads the kernel.

    If I try with the Bookworm mini.iso, the kernel boots just fine and gets me to the rescue menu.

    Debian GNU/Linux 12 (bookworm) amd64 - netboot mini.iso 20230607+deb12u9

    However, using the Bookworm rescue mode is a PITA, because it doesn't
    know how to mount brtfs sub-volumes (not even with Debian's default @rootfs) so even just chroot'ing myself in using the above recipe requires a LOT of command typing inside the rescue mode's shell.

    As a further test, I've tried booting the following Trixie USB stick in BIOS mode via the rescue menu:

    Debian GNU/Linux trixie-DI-alpha1 "Trixie" - Official Alpha amd64 NETINST with firmware 20241230-11:26

    This succesfully gets me to the rescue process and allows me to chroot
    into the host after typing the long commandline series above.

    Basically, a/b testing shows that Trixie kernels per-se work fine, but
    something fishy is going on with the grub-efi-amd64 in Trixie. The same host was originally installed with a Bookworm mini.iso on USB and
    booted fine via EFI, so the problem really does seem to be the grub-efi-amd64 in Trixie.

    Given the above, reassigned to bin:grub-efi-amd64.

    And, after checking, bumping severity to Critical, since this bug

    Comparing the configs for Bookworm and Trixie kernels, I notice that
    FUSE is configred as a module on Bookworm, but built-in on Trixie.
    Could FUSE possibly be stepping on autofs' shoes and preventing rootfs detection? Could this be the kernel panic the above screenshot is
    telling about?

    Martin-Éric

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From =?UTF-8?Q?Martin=2D=C3=89ric_Racine@21:1/5 to All on Wed Apr 16 16:00:01 2025
    ke 16.4.2025 klo 15.41 Pascal Hambourg ([email protected]) kirjoitti:

    On 16/04/2025 at 13:38, Martin-Éric Racine wrote:

    As the attached screenshot shows, 'fuseblk' seemingly doesn't know
    anything about the 'subvol' option for btrfs, which is what causes the failure to mount root. As to whether this implies a GRUB or Linux
    module, I wouldn't know.

    The failure has nothing to do with fuse.
    I guess the kernel tries to mount the root filesystem with fuseblk as a
    last resort *after* the initramfs unpacking failure because it is the
    only built-in driver, all other block device/filesystem drivers being
    modules in the initramfs.

    Shouldn't a built-in autofs handle that part?

    Anyhow, that line about 'subvol' being an unrecognized option for
    fuseblk is significant.

    Martin-Éric

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Pascal Hambourg@21:1/5 to All on Wed Apr 16 19:50:01 2025
    On 16/04/2025 at 15:51, Martin-Éric Racine wrote:
    ke 16.4.2025 klo 15.41 Pascal Hambourg ([email protected]) kirjoitti:
    On 16/04/2025 at 13:38, Martin-Éric Racine wrote:

    As the attached screenshot shows, 'fuseblk' seemingly doesn't know
    anything about the 'subvol' option for btrfs, which is what causes the
    failure to mount root. As to whether this implies a GRUB or Linux
    module, I wouldn't know.

    The failure has nothing to do with fuse.
    I guess the kernel tries to mount the root filesystem with fuseblk as a
    last resort *after* the initramfs unpacking failure because it is the
    only built-in driver, all other block device/filesystem drivers being
    modules in the initramfs.

    Shouldn't a built-in autofs handle that part?

    I am not sure what you mean by "autofs". AFAIK autofs is an on-demand
    mount feature, so irrelevant for the root filesystem. The kernel image (vmlinuz) does not have built-in block/filesystem drivers and needs to
    load the related modules from the initramfs in order to be able to mount
    the root filesystem.

    Anyhow, that line about 'subvol' being an unrecognized option for
    fuseblk is significant.

    I don't think so. fuseblk is the wrong filesystem type and the "subvol"
    option is intended for btrfs so it is not surprising that it is not
    recognized.

    If you start grub shell and load the kernel and initramfs by hand, does
    it show an error when loading the initramfs ?

    No error.

    There may be some silent data corruption when grub reads the initramfs.
    Can you test with the initramfs on another filesystem type (fat, ext4) ?
    Can you test with grub for BIOS/legacy boot ?

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From =?UTF-8?Q?Martin=2D=C3=89ric_Racine@21:1/5 to All on Thu Apr 17 06:10:01 2025
    ke 16.4.2025 klo 20.40 Pascal Hambourg ([email protected]) kirjoitti:

    On 16/04/2025 at 15:51, Martin-Éric Racine wrote:
    ke 16.4.2025 klo 15.41 Pascal Hambourg ([email protected]) kirjoitti:
    On 16/04/2025 at 13:38, Martin-Éric Racine wrote:

    As the attached screenshot shows, 'fuseblk' seemingly doesn't know
    anything about the 'subvol' option for btrfs, which is what causes the >>> failure to mount root. As to whether this implies a GRUB or Linux
    module, I wouldn't know.

    The failure has nothing to do with fuse.
    I guess the kernel tries to mount the root filesystem with fuseblk as a
    last resort *after* the initramfs unpacking failure because it is the
    only built-in driver, all other block device/filesystem drivers being
    modules in the initramfs.

    Shouldn't a built-in autofs handle that part?

    I am not sure what you mean by "autofs". AFAIK autofs is an on-demand
    mount feature, so irrelevant for the root filesystem. The kernel image (vmlinuz) does not have built-in block/filesystem drivers and needs to
    load the related modules from the initramfs in order to be able to mount
    the root filesystem.

    Anyhow, that line about 'subvol' being an unrecognized option for
    fuseblk is significant.

    I don't think so. fuseblk is the wrong filesystem type and the "subvol" option is intended for btrfs so it is not surprising that it is not recognized.

    If you start grub shell and load the kernel and initramfs by hand, does
    it show an error when loading the initramfs ?

    No error.

    There may be some silent data corruption when grub reads the initramfs.
    Can you test with the initramfs on another filesystem type (fat, ext4) ?
    Can you test with grub for BIOS/legacy boot ?

    See above. GRUB-EFI is clearly the culprit. Booting the Trixie d-i in
    rescue mode works in BIOS mode, but not in EFI mode.

    Martin-Éric

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From =?UTF-8?Q?Martin=2D=C3=89ric_Racine@21:1/5 to All on Thu Apr 17 09:20:01 2025
    to 17.4.2025 klo 10.03 Pascal Hambourg ([email protected]) kirjoitti:

    On 17/04/2025 at 06:01, Martin-Éric Racine wrote:
    ke 16.4.2025 klo 20.40 Pascal Hambourg ([email protected]) kirjoitti:

    There may be some silent data corruption when grub reads the initramfs.
    Can you test with the initramfs on another filesystem type (fat, ext4) ? >> Can you test with grub for BIOS/legacy boot ?

    See above. GRUB-EFI is clearly the culprit. Booting the Trixie d-i in rescue mode works in BIOS mode, but not in EFI mode.

    d-i boots in BIOS mode with isolinux, not grub.

    This still points to GRUB as the culprit. AFAIK the Linux kernel
    loaded in both modes is the same. The failure to load initrd appears
    when booting the Trixie d-i via UEFI but not via BIOS. Both modes work
    fine when booting using the Bookworm d-i. UEFI no longer does with the
    Trixie d-i.

    Martin-Éric

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From =?UTF-8?Q?Martin=2D=C3=89ric_Racine@21:1/5 to All on Fri Apr 18 18:20:01 2025
    On Fri, 18 Apr 2025 11:17:06 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    On Thu, 17 Apr 2025 10:13:15 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    to 17.4.2025 klo 10.03 Pascal Hambourg ([email protected]) kirjoitti:

    On 17/04/2025 at 06:01, Martin-Éric Racine wrote:
    ke 16.4.2025 klo 20.40 Pascal Hambourg ([email protected]) kirjoitti:

    There may be some silent data corruption when grub reads the initramfs.
    Can you test with the initramfs on another filesystem type (fat, ext4) ?
    Can you test with grub for BIOS/legacy boot ?

    See above. GRUB-EFI is clearly the culprit. Booting the Trixie d-i in rescue mode works in BIOS mode, but not in EFI mode.

    d-i boots in BIOS mode with isolinux, not grub.

    This still points to GRUB as the culprit. AFAIK the Linux kernel
    loaded in both modes is the same. The failure to load initrd appears
    when booting the Trixie d-i via UEFI but not via BIOS. Both modes work
    fine when booting using the Bookworm d-i. UEFI no longer does with the Trixie d-i.

    As a further test, I booted into the host using the Bookworm rescue
    mode, chrooted myself in, then purposely downgraded the host's grub*
    to the Bookworm versions and APT-pinned them to 1000. The host boots
    fine now.

    $ dpkg -l | grep grub | awk '{print $2,$3}'
    grub-common 2.06-13+deb12u1
    grub-efi-amd64 2.06-13+deb12u1
    grub-efi-amd64-bin 2.06-13+deb12u1
    grub-efi-amd64-signed 1+2.06+13+deb12u1
    grub-ipxe 1.21.1+git20250317.42a29d56+dfsg-2
    grub2-common 2.06-13+deb12u1

    $ uname -a
    Linux p8h62 6.12.21-amd64 #1 SMP PREEMPT_DYNAMIC Debian 6.12.21-1 (2025-03-30) x86_64 GNU/Linux

    $ ls /sys/firmware/efi/efivars/
    (a very full directory indeed - I won't copy-paste it here)

    Again, this clearly point to Trixie's GRUB-EFI as the culprit.

    Meanwhile, my testing host (i386, GRUB-PC) boots fine:

    $ dpkg -l | grep grub | awk '{print $2,$3}'
    grub-common 2.12-7
    grub-ipxe 1.21.1+git20250317.42a29d56+dfsg-2
    grub-pc 2.12-7
    grub-pc-bin 2.12-7
    grub2-common 2.12-7

    Further googling shows that users of Ubuntu derivatives started
    experiencing similar issues on hosts with a btrfs rootfs about a year
    ago. Given how Ubuntu developers happen to be the key contributors to
    GRUB at Debian, I'd really hope to see a more proactive reaction from
    them to this bug report.

    The Debian changelog suggests what caused this:

    Someone enabled compiling GRUB with FUSE3 to enhance QEMU support.
    The problem with FUSE3 is that it only has very basic support for
    btrfs, lacking among other things support for subvolumes. This is a
    serious issue considering how widely spread btrfs has become and how debian-installer uses subvolumes by default whenever someone selects
    brtfs as the filesystem type. Rendering hosts unbootable is a big
    no-no. I would really hope the release team to step in on this one.

    Martin-Éric

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Felix Zielcke@21:1/5 to All on Fri Apr 18 18:50:01 2025
    Am Freitag, dem 18.04.2025 um 19:10 +0300 schrieb Martin-Éric Racine:
    The Debian changelog suggests what caused this:

    Someone enabled compiling GRUB with FUSE3 to enhance QEMU support.
    The problem with FUSE3 is that it only has very basic support for
    btrfs, lacking among other things support for subvolumes. This is a
    serious issue considering how widely spread btrfs has become and how debian-installer uses subvolumes by default whenever someone selects
    brtfs as the filesystem type. Rendering hosts unbootable is a big
    no-no. I would really hope the release team to step in on this one.

    Martin-Éric


    Hi Martin-Éric,

    FUSE support in GRUB 2 is only used for the grub-mount binary. And not
    at all in the actual booting code loaded from EFI/BIOS.

    You could though test the fs drivers of it with grub-mount:

    grub-mount /dev/sda2 /mnt (replace sda2 with your / partition)
    sha1sum /mnt/@rootfs/boot/initrd.img-6.12.20-amd64
    sha1sum /@rootfs/boot/initrd.img-6.12.20-amd64

    both commands should output the same hash sum.
    And yes, this is safe with a mounted fs because the GRUB fs code is
    100% complete read-only.

    Felix

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Felix Zielcke@21:1/5 to All on Sat Apr 19 07:50:01 2025
    Am Freitag, dem 18.04.2025 um 21:17 +0300 schrieb Martin-Éric Racine:
    Hey Felix,

    pe 18.4.2025 klo 19.39 Felix Zielcke ([email protected]) kirjoitti:

    Am Freitag, dem 18.04.2025 um 19:10 +0300 schrieb Martin-Éric
    Racine:
    The Debian changelog suggests what caused this:

    Someone enabled compiling GRUB with FUSE3 to enhance QEMU
    support.
    The problem with FUSE3 is that it only has very basic support for
    btrfs, lacking among other things support for subvolumes. This is
    a
    serious issue considering how widely spread btrfs has become and
    how
    debian-installer uses subvolumes by default whenever someone
    selects
    brtfs as the filesystem type. Rendering hosts unbootable is a big
    no-no. I would really hope the release team to step in on this
    one.

    FUSE support in GRUB 2 is only used for the grub-mount binary. And
    not
    at all in the actual booting code loaded from EFI/BIOS.

    You could though test the fs drivers of it with grub-mount:

    grub-mount /dev/sda2 /mnt (replace sda2 with your / partition)
    sha1sum /mnt/@rootfs/boot/initrd.img-6.12.20-amd64
    sha1sum /@rootfs/boot/initrd.img-6.12.20-amd64

    both commands should output the same hash sum.
    And yes, this is safe with a mounted fs because the GRUB fs code is
    100% complete read-only.

    Exactly how does the above address the issue I reported?

    Well, then we at least know that the bug is not in GRUB's btrfs
    filesystem code
    So your initramfs should be correctly read by it

    Martin-Éric


    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From =?UTF-8?Q?Martin=2D=C3=89ric_Racine@21:1/5 to All on Sat Apr 19 08:10:01 2025
    la 19.4.2025 klo 8.43 Felix Zielcke ([email protected]) kirjoitti:

    Am Freitag, dem 18.04.2025 um 21:17 +0300 schrieb Martin-Éric Racine:
    Hey Felix,

    pe 18.4.2025 klo 19.39 Felix Zielcke ([email protected]) kirjoitti:

    Am Freitag, dem 18.04.2025 um 19:10 +0300 schrieb Martin-Éric
    Racine:
    The Debian changelog suggests what caused this:

    Someone enabled compiling GRUB with FUSE3 to enhance QEMU
    support.
    The problem with FUSE3 is that it only has very basic support for btrfs, lacking among other things support for subvolumes. This is
    a
    serious issue considering how widely spread btrfs has become and
    how
    debian-installer uses subvolumes by default whenever someone
    selects
    brtfs as the filesystem type. Rendering hosts unbootable is a big no-no. I would really hope the release team to step in on this
    one.

    FUSE support in GRUB 2 is only used for the grub-mount binary. And
    not
    at all in the actual booting code loaded from EFI/BIOS.

    You could though test the fs drivers of it with grub-mount:

    grub-mount /dev/sda2 /mnt (replace sda2 with your / partition)
    sha1sum /mnt/@rootfs/boot/initrd.img-6.12.20-amd64
    sha1sum /@rootfs/boot/initrd.img-6.12.20-amd64

    both commands should output the same hash sum.
    And yes, this is safe with a mounted fs because the GRUB fs code is
    100% complete read-only.

    Same checksum. However, manually executing this tells us nothing. It's performed on a filesystem that's already available. It doesn't
    simulate GRUB attempting to cold mount a system on a booting host.

    Exactly how does the above address the issue I reported?

    Well, then we at least know that the bug is not in GRUB's btrfs
    filesystem code
    So your initramfs should be correctly read by it

    There seems to be two issues here:

    1) A magic error upon loading initrd. It also affects the Trixie
    rescue mode via UEFI (but not via BIOS). Bookworm rescue mode boots
    both ways.
    2) fuseblk reporting an unsupported subvol option on Trixie.

    It's a simply a/b test: Downgrading grub* to Bookworm made the host
    bootable again.

    Martin-Éric

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Pascal Hambourg@21:1/5 to All on Sat Apr 19 11:50:01 2025
    On 19/04/2025 at 10:19, Martin-Éric Racine wrote:

    Will the release team publish Trixie fully knowing that btrtfs hosts
    will no longer be bootable via UEFI?

    Not all btrfs hosts are affected. A fresh UEFI trixie btrfs virtual
    machine boots fine on bookworm QEMU+OVMF. IIUC you wrote that trixie
    installer is affected too on your host, but it does not use btrfs to
    boot so non-btrfs systems may be affected.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From =?UTF-8?Q?Martin=2D=C3=89ric_Racine@21:1/5 to All on Fri May 2 19:20:01 2025
    pe 2.5.2025 klo 19.44 Chris Hofstaedtler ([email protected]) kirjoitti:

    On Sun, Apr 13, 2025 at 10:23:17AM +0300, Martin-Éric Racine wrote:
    Is there any way I can help the maintainers pinpoint the source of the problem?

    This bug needs two things:

    1) A full reproducer step list, starting from "I have nothing but an empty amd64 VM".

    There isn't much to reproduce:
    1) Start with a fully functional host running Bookworm with GRUB-EFI
    on a brtfs filesystem created using d-i/Bookworm's default @rootfs
    subvolume name.
    2) Change APT sources from Bookworm to Trixie.
    3) Dist-upgrade.
    4) Reboot.
    5) Find the above kernel panic.
    6) Using Bookworm d-i's rescue mode via EFI, APT pin and downgrade
    grub* to Bookworm. All other packages remain at Trixie versions.
    7) Reboot.
    8) The host boots normally.

    Validation:
    1) Try using Trixie d-i's rescue mode via EFI. Same error as above.
    2) Try using Trixie d-i's rescue mode via BIOS. Boots fine, but
    obviously is useless as far as performing maintenance on an EFI host.

    2) A bisect of GRUB between 2.06 and 2.12.

    Too big of a delta.

    If someone had made regular uploads since 2.06, I could quickly go
    through what's in snapshot.d.o and report at which version it failed.
    While there's a stream of 2.12-RC releases there, it's still a jump
    from 2.06 to 2.12, and there's good chances are that the commit we're
    looking for happened between those 2 releases. As amazing as it might
    sound to some people, some of us have a life outside of FLOSS, and
    spending days bisecting and building incremental versions of the same
    package over code spanning a good 2 years is not on the agenda.

    Anyhow, from my perspective, someone ought to interpret my bug report
    as something better acted upon now before other users report the same
    issue once Trixie is out the door. Besides, the signs were already
    there at Ubuntu derivatives for a while, and could have been acted
    upon there much earlier, since the maintainers are the same.

    Martin-Éric

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Julian Andres Klode@21:1/5 to [email protected] on Sat May 3 16:50:02 2025
    On 3 May 2025 15:03:18 CEST, "Martin-Éric Racine" <[email protected]> wrote:
    la 3.5.2025 klo 14.42 Chris Hofstaedtler ([email protected]) kirjoitti:

    Control: tags -1 + moreinfo unreproducible

    On Fri, May 02, 2025 at 08:14:37PM +0300, Martin-Éric Racine wrote:
    pe 2.5.2025 klo 19.44 Chris Hofstaedtler ([email protected]) kirjoitti:

    On Sun, Apr 13, 2025 at 10:23:17AM +0300, Martin-Éric Racine wrote:
    Is there any way I can help the maintainers pinpoint the source of the problem?

    This bug needs two things:

    1) A full reproducer step list, starting from "I have nothing but an empty
    amd64 VM".

    There isn't much to reproduce:
    1) Start with a fully functional host running Bookworm with GRUB-EFI
    on a brtfs filesystem created using d-i/Bookworm's default @rootfs
    subvolume name.
    2) Change APT sources from Bookworm to Trixie.
    3) Dist-upgrade.
    4) Reboot.
    5) Find the above kernel panic.
    6) Using Bookworm d-i's rescue mode via EFI, APT pin and downgrade
    grub* to Bookworm. All other packages remain at Trixie versions.
    7) Reboot.
    8) The host boots normally.

    I've followed these steps today, and cannot reproduce the problem.
    The upgraded VM boots successfully.

    I beleive you. Again, googling this exact error mostly pulls up
    reports of this failing on ASUS motherboards of various models.


    Unfortunately some firmware is just broken and there isn't much that can be done about that.

    As we're moving towards more upstream solutions that use more parts of the firmware, such as EFI LoadFile2 protocols for loading the initrd, more hardware will break.

    We cannot commit to keep specific hardware working, whether it's between stable releases or even in security updates when we run out of ability to do backports.

    I suggest documenting no longer supported hardware in the release notes, and potentially say we don't support mainboards made before 2020.

    --
    sent from my phone, excuse the brevity, if any

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From =?UTF-8?Q?Martin=2D=C3=89ric_Racine@21:1/5 to All on Mon May 19 18:40:02 2025
    On Sat, 19 Apr 2025 12:54:07 +0300
    =?UTF-8?Q?Martin=2D=C3=89ric_Racine?= <[email protected]>
    wrote:
    la 19.4.2025 klo 12.43 Pascal Hambourg ([email protected]) kirjoitti:

    On 19/04/2025 at 10:19, Martin-Éric Racine wrote:

    Will the release team publish Trixie fully knowing that btrtfs hosts
    will no longer be bootable via UEFI?

    Not all btrfs hosts are affected. A fresh UEFI trixie btrfs virtual
    machine boots fine on bookworm QEMU+OVMF. IIUC you wrote that trixie installer is affected too on your host, but it does not use btrfs to
    boot so non-btrfs systems may be affected.

    Point taken. Trixe d-i doesn't boot in UEFI on this host, which would
    suggest that the filesystem type used for the root partition may well
    be a red herring.

    Trixie DI RC1 does boot via UEFI so there's hope. However, I have no
    idea of what change makes RC1 succeed where December's Alpha didn't.
    If anyone knows, please do tell.

    Martin-Éric

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Pascal Hambourg@21:1/5 to All on Wed May 21 13:20:01 2025
    On 19/05/2025 at 18:33, Martin-Éric Racine wrote

    Point taken. Trixe d-i doesn't boot in UEFI on this host, which would
    suggest that the filesystem type used for the root partition may well
    be a red herring.

    Trixie DI RC1 does boot via UEFI so there's hope. However, I have no
    idea of what change makes RC1 succeed where December's Alpha didn't.
    If anyone knows, please do tell.

    D-I Alpha1 used GRUB 2.12-5 and D-I RC1 uses GRUB 2.12-7, which is the
    version you reported this bug against, includes a number of security
    fixes and has been used in D-I weekly and daily builds for a couple of
    months. But I do not seen any change related with your issue.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)