
[Bug 2144522] Re: Dell Machines cannot boot into OS with 6.17.0-1012-oem

** Tags added: kernel-daily-bug

--
You received this bug notification because you are subscribed to linux
in Ubuntu.
Matching subscriptions: Bgg, Bmail, Nb
https://bugs.launchpad.net/bugs/2144522

Title:
Dell Machines cannot boot into OS with 6.17.0-1012-oem

Status in linux package in Ubuntu:
Invalid
Status in linux-oem-6.17 package in Ubuntu:
Invalid
Status in linux source package in Noble:
Invalid
Status in linux-oem-6.17 source package in Noble:
In Progress
Status in linux source package in Questing:
In Progress
Status in linux-oem-6.17 source package in Questing:
Invalid

Bug description:
[Impact]
Dell systems (CID: 202506-36819, 202506-36820, 202506-36823, 202506-36826) with
AMD GFX 11.0.4 (gfx11) graphics cannot boot into the OS after upgrading to
6.17.0-1012-oem. The machine becomes stuck during boot and cannot reach the
desktop. Booting with `nomodeset` works as a workaround, pointing to an amdgpu
driver initialization failure.
Failure rate: 4/4 (100%) on affected Dell systems.

[Fix]
Two patches fix this boot regression:
1. Raise the minimum MES firmware version for calling set_hw_resources_1 on
GC 11.0.4 from 0x50 to 0x52, so that the call is skipped on firmware that
cannot handle it properly.
upstream in mainline kernel v7.0:
1478a34470bf drm/amd: Set minimum version for set_hw_resource_1 on gfx11 to 0x52
2. Remove the MES LR compute workaround (enable_lr_compute_wa) from both
mes_v11_0.c and mes_v12_0.c, since the underlying issue was already fixed by
adjusting the VGPR size, and keeping the workaround causes instability with
newer GC microcode.
upstream in mainline kernel v7.0:
6b0d812971370 drm/amd: Disable MES LR compute W/A
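
The effect of patch 1 can be sketched as a simple version gate. This is an
illustrative Python model, not the kernel code (which is C in mes_v11_0.c).
The function names and the plain integer comparison are assumptions made for
illustration; only the thresholds 0x50 and 0x52 come from the description
above.

```python
# Illustrative model of the version gate described in patch 1.
# Hypothetical helpers: this is not the actual amdgpu implementation.

OLD_MIN_VERSION = 0x50  # threshold before the patch (from the bug description)
NEW_MIN_VERSION = 0x52  # threshold after the patch (from the bug description)


def should_call_set_hw_resources_1(mes_fw_version: int, min_version: int) -> bool:
    """Return True if the MES firmware version meets the minimum for the call."""
    return mes_fw_version >= min_version


def versions_changed_by_patch(candidates):
    """List firmware versions that made the call before the patch but not after."""
    return [
        v
        for v in candidates
        if should_call_set_hw_resources_1(v, OLD_MIN_VERSION)
        and not should_call_set_hw_resources_1(v, NEW_MIN_VERSION)
    ]
```

Under this model, only versions 0x50 and 0x51 change behavior, which matches
the range called out under [Where problems could occur].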

[Test Plan]
Boot the affected machine with kernel 6.17.0-1012-oem (or later oem kernel).
Without the patches: System gets stuck during boot and cannot reach the OS
(amdgpu fails to initialize).
With the patches: System boots normally to the desktop without requiring
`nomodeset`.
To verify:
1. Boot without `nomodeset`
2. Check that the desktop loads successfully
3. Confirm no amdgpu-related errors blocking boot in dmesg:
$ sudo dmesg | grep -i "amdgpu\|mes\|drm"
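
To make step 3 less manual, the dmesg output can be filtered for amdgpu lines
that look like failures. A minimal sketch, assuming the log text is passed in
as a string. The function name and the list of failure keywords are
illustrative guesses, not an official list of amdgpu error messages.

```python
import re

# Hypothetical keyword list: substrings that often indicate a failure.
# Adjust it to the messages actually seen on the affected machine.
FAILURE_HINTS = re.compile(r"error|fail|timeout|hang", re.IGNORECASE)


def suspicious_amdgpu_lines(dmesg_text: str) -> list:
    """Return dmesg lines that mention amdgpu and contain a failure hint."""
    return [
        line
        for line in dmesg_text.splitlines()
        if "amdgpu" in line.lower() and FAILURE_HINTS.search(line)
    ]
```

For example, save the log with `sudo dmesg > dmesg.txt`, read the file, and
pass its contents to `suspicious_amdgpu_lines`. An empty result is consistent
with a clean amdgpu initialization but does not prove it, since the keyword
list is only a heuristic.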

[Where problems could occur]
These changes affect the amdgpu MES initialization path for gfx11 (mes_v11_0.c)
and gfx12 (mes_v12_0.c) hardware.
For patch 1: If the version threshold 0x52 is incorrect, machines with MES
firmware version 0x50 or 0x51 that previously worked would stop calling
set_hw_resources_1, potentially causing degraded GPU performance or missing
hardware resource configuration. This would manifest as graphical glitches,
GPU compute failures, or silent capability loss after MES initialization.
For patch 2: If the LR compute workaround was actually needed for some
gfx11/gfx12 product beyond gfx1151, removing it could cause regressions on
those variants. Symptoms would include GPU hangs, DRM timeout errors, or
compute job failures under load.

[Other Info]
Both patches are upstream in Linux 7.0 (merged via drm-next-2026-02-11).


=================================================================

[Summary]
Several Dell Machines failed to boot into OS on 6.17.0-1012-oem.

[Reproduce Steps]
1. Install the GM image (6.11).
2. Run the dist-upgrade to 6.17
3. sudo reboot

On 6.11, the machine boots into the OS after reboot. After upgrading to
6.17, it becomes stuck during boot and cannot reach the OS. If `nomodeset`
is added to the kernel command line on 6.17, the machine boots into the OS
successfully.
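
Whether a given boot used the workaround can be checked from the kernel
command line, which Linux exposes at /proc/cmdline. A minimal sketch; the
function names are hypothetical, and the check only looks for the bare
`nomodeset` token.

```python
def has_nomodeset(cmdline: str) -> bool:
    """Return True if the kernel command line contains the nomodeset token."""
    return "nomodeset" in cmdline.split()


def current_boot_uses_nomodeset(path: str = "/proc/cmdline") -> bool:
    """Read the running kernel's command line and check it for nomodeset."""
    with open(path, encoding="utf-8") as f:
        return has_nomodeset(f.read())
```

Applied to the ProcKernelCmdLine value recorded in this report,
`has_nomodeset` returns True, so the attached system information was
collected with the workaround active.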

[Affected Machines]
https://certification.canonical.com/hardware/202506-36819/
https://certification.canonical.com/hardware/202506-36820/
https://certification.canonical.com/hardware/202506-36823/
https://certification.canonical.com/hardware/202506-36826/

[Failure Rate]
4/4 (100%)

ProblemType: Bug
DistroRelease: Ubuntu 24.04
Package: linux-image-6.17.0-1012-oem 6.17.0-1012.12
ProcVersionSignature: Ubuntu 6.17.0-1012.12-oem 6.17.9
Uname: Linux 6.17.0-1012-oem x86_64
ApportVersion: 2.28.1-0ubuntu3.8
Architecture: amd64
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC1: ubuntu 1271 F.... wireplumber
 /dev/snd/controlC0: ubuntu 1271 F.... wireplumber
 /dev/snd/seq: ubuntu 1266 F.... pipewire
CRDA: N/A
CasperMD5CheckMismatches: ./casper/initrd ./casper/vmlinuz ./casper/minimal.standard.live.hotfix.manifest ./casper/minimal.standard.live.hotfix.size ./casper/minimal.standard.live.size ./casper/minimal.manifest ./casper/minimal.standard.manifest ./casper/minimal.standard.size ./casper/minimal.hotfix.size ./casper/minimal.standard.live.hotfix.squashfs ./casper/minimal.standard.hotfix.squashfs ./casper/minimal.standard.hotfix.size ./casper/minimal.hotfix.squashfs ./casper/minimal.standard.live.manifest ./casper/minimal.size ./boot/grub/grub.cfg
CasperMD5CheckResult: fail
Date: Mon Mar 16 03:28:36 2026
DistributionChannelDescriptor:
 # This is the distribution channel descriptor for Ubuntu 24.04 for Dell
 # For more information see http://wiki.ubuntu.com/DistributionChannelDescriptor
 canonical-oem-somerville-noble-oem-24.04b-proposed-20250617-524
InstallationDate: Installed on 2026-03-16 (0 days ago)
InstallationMedia: Ubuntu OEM 24.04.2 LTS "Noble Numbat" - Release amd64 (20250615)
MachineType: Dell Inc. Dell 16 DC16255
ProcFB: 0 simpledrmdrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-6.17.0-1012-oem root=UUID=d1b47013-8d11-47bc-9f6a-8046300ab97d ro nomodeset quiet splash vt.handoff=7
RelatedPackageVersions:
 linux-restricted-modules-6.17.0-1012-oem N/A
 linux-backports-modules-6.17.0-1012-oem N/A
 linux-firmware 20240318.git3b128b60-0ubuntu2.25
SourcePackage: linux-oem-6.17
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 06/18/2025
dmi.bios.release: 5.3
dmi.bios.vendor: Dell Inc.
dmi.bios.version: 0.4.10
dmi.board.asset.tag: AFTPASS
dmi.board.vendor: Dell Inc.
dmi.chassis.asset.tag: AFTPASS
dmi.chassis.type: 10
dmi.chassis.vendor: Dell Inc.
dmi.chassis.version: 0.4.10
dmi.modalias: dmi:bvnDellInc.:bvr0.4.10:bd06/18/2025:br5.3:svnDellInc.:pnDell16DC16255:pvr:rvnDellInc.:rn:rvr:cvnDellInc.:ct10:cvr0.4.10:sku0D95:
dmi.product.family: Inspiron
dmi.product.name: Dell 16 DC16255
dmi.product.sku: 0D95
dmi.sys.vendor: Dell Inc.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2144522/+subscriptions
