-
Notifications
You must be signed in to change notification settings - Fork 5.2k
Kernels 5.15.26/5.16.12 64bit usb ssd issues #4930
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
I was able to get around this issue today with these 2 kernels by reverting a couple of files with this patch. If I remember right the 5.17 tree has these new commits also since I compiled last. https://gitlab.manjaro.org/manjaro-arm/packages/core/linux-rpi4/-/blob/master/xhci-revert.diff |
Hmm. Am I leaking temporary buffers? I borrowed some code from upstream that fixed a theoretical bug with a dwc3 controller, so maybe that's not bullet-proof. |
I tested the upstream 5.17-rc7 yesterday and had no issues. I only test each upstream -rc as it comes out on the pi4 but never go back and check the other upstream trees after that. I do know with this issue it ceases to write when installing another kernel package to my ssd drive and have to recover manually. I have also noticed the same happened when I switched our repo branches and did a complete upgrade with it's packages and the same thing happened. As it is now it can theoretically turn some ones complete install on a usb ssd into fruit salad. |
Ah it's a SG op that's getting linearised into a buffer bigger than a single SWIOTLB entry. usb-storage limits the max consecutive number of written sectors to 256K (exactly the size of an entry) but UAS has no such limit. Why is the swiotlb getting used at all? Please post the contents of /proc/cpuinfo and the output of |
|
@P33M "Why is the swiotlb getting used at all?" I have no clue but I do know when you do a "make bcm2711_defconfig" it gets built into the kernel: |
Can you report the output of |
This is using a good kernel. If you want the output with out the reverted patch it will take me @20 minutes to compile it.
|
That's fine. And one more (sorry) - |
|
From your output I think you have an 8GB Pi 4 rev 1.4. It appears to have a BCM2711B0 (you can confirm that by checking that the second line of the etched writing on the processor package ends in If you were to put |
I can not readily confirm the BO as I have an ice tower mounted on my processor. I will recompile the kernel and and add total_mem=3072 in config.txt and see what happens. I am a little concerned though of having to add this to the config.txt if total_mem=3072 works and limiting the ram if the code stands as is. It is not going to be pretty with people complaining in our forums. |
The 3GB RAM limit is only for diagnosis - it's not an acceptable workaround. |
It is compiling on a 16 core processor; it won't take long.... |
The chip-revision should be visible "sudo busybox devmem 0xfc404000" |
Revised: re-compiled busybox with devmem added:
|
@pelwell It works with the reduced mem. After booting into the test kernel I was able to install another kernel and reboot into it. I used the 5.15.27 kernel released today.
|
I am getting this on a Pi with 2GB ram, what's up with that then? |
I have no clue but the issue is growing as people do upgrades as time goes by. https://archlinuxarm.org/forum/viewtopic.php?f=65&t=15953&p=69197#p69197 |
I'm confused why only fat fs writes can trigger this |
I'm not for sure that it is only fat writes. I did a huge upgrade by switching branches and it was triggered also. |
that's weird, my usb ssd as well as an usb hdd see a lot of write activity, but it never happened except when writing to the boot partition with bootloader or kernel updates |
@Dark-Sky - This patch you are shipping for manjaro arm, is it reverting a single commit in rpi-5.15.y or multiples? Can you call the relevant ones out if applicable? |
@graysky2 I use it on RPi's latest kernel's 5.15 and above. I spent a couple of days narrowing down the issue when the issue first hit and I believe there was more than one commit. I can say I never released the kernels as it was with out reverting and no one has complained of an issue yet. Can you you them if applicable? Not sure what you are asking. |
I think it's the last commit and the one two before that in the pi kernel that touch drivers/usb/host/xhci.c |
And drivers/usb/host/xhci-ring.c |
It's kinda annoying 5.15.y is being rebased still, rewriting commit history. Now it seems like these changes are from 3 days ago, when this bug report is already almost two weeks old |
@Dark-Sky - Sorry, massive typo there. "Can you call the relevant ones commit/commits if applicable?" |
@pelwell - What is your take on the manjaro arm fix? Safe to use while this is sorted out upstream? https://gitlab.manjaro.org/manjaro-arm/packages/core/linux-rpi4/-/blob/master/xhci-revert.diff |
I'm currently doing weekendy things, but will take a look. |
commit a086698adad709fef9a0b73a5153a6f1f95d70b7 from https://github.com/raspberrypi/linux.git rpi-5.15.y This reverts commit 40686d87f87a46b3abf48a8dcaee5e0a031deafb. See: raspberrypi/linux#4930 Signed-off-by: Phil Elwell <[email protected]> Signed-off-by: Meng Li <[email protected]>
This reverts commit 40686d8. See: raspberrypi#4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: #4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: #4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: raspberrypi#4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: #4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: raspberrypi#4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: #4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: raspberrypi#4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: #4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: raspberrypi#4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: raspberrypi#4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: #4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: raspberrypi#4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: raspberrypi#4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: #4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: raspberrypi#4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: raspberrypi#4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: #4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: #4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: #4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: #4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: raspberrypi#4930 Signed-off-by: Phil Elwell <[email protected]>
This reverts commit 40686d8. See: #4930 Signed-off-by: Phil Elwell <[email protected]>
BugLink: https://bugs.launchpad.net/bugs/1967733 This reverts commit 40686d87f87a46b3abf48a8dcaee5e0a031deafb. See: raspberrypi/linux#4930 Signed-off-by: Phil Elwell <[email protected]> (cherry picked from commit a086698adad709fef9a0b73a5153a6f1f95d70b7 rpi-5.15.y) Signed-off-by: Juerg Haefliger <[email protected]>
Sure but it will be later. I am having family over.
…On Sun, Mar 20, 2022 at 1:42 PM graysky ***@***.***> wrote:
8f59528
<8f59528>
should be the only revert needed.
@Dark-Sky <https://github.com/Dark-Sky> - Since I believe you have a
system booting from usb, can you try that test? Revert that commit and
report back as to effectiveness for preventing the bug?
—
Reply to this email directly, view it on GitHub
<#4930 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AFLJZ54G4RK47T7WQTNDKMTVA5WQ3ANCNFSM5QBOZJNA>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
This reverts commit 40686d8. See: raspberrypi/linux#4930 Signed-off-by: Phil Elwell <[email protected]>
Uh oh!
There was an error while loading. Please reload this page.
Describe the bug
After booting with these new kernels it appears what I can tell while installing another kernel/headers packages or doing a large package upgrades from the repo; writes to my usb ssd ceases. After installing a different kernel there is a large amount of time goes by before returning to the prompt. After installing new kernel packages it will not boot. I can extract the kernel package files and copy them over to my ssd on my desktop and it boots.
There seems to be no issue if booting a sdcard.
Steps to reproduce the behaviour
Install 5.15.26 or 5.16.12 64bit kernel packages on a usb ssd and boot into it then try to install another kernel package then try to boot into the new kernel installed.
Device (s)
Raspberry Pi 4 Mod. B
System
[ray@pi4 ~]$ vcgencmd version
Mar 1 2022 14:21:38
Copyright (c) 2012 Broadcom
version 969fb9b1521fc7ac2b88b15a3a9e942da7678c4d (clean) (release) (start)
[ray@pi4 ~]$ sudo rpi-eeprom-update
BOOTLOADER: up to date
CURRENT: Tue Feb 8 05:24:46 PM UTC 2022 (1644341086)
LATEST: Tue Feb 8 05:24:46 PM UTC 2022 (1644341086)
RELEASE: stable (/lib/firmware/raspberrypi/bootloader/stable)
Use raspi-config to change the release.
VL805_FW: Using bootloader EEPROM
VL805: up to date
CURRENT: 000138a1
LATEST: 000138a1
Logs
journalctl (thousands of these lines):
Mar 04 12:19:13 pi4 kernel: sd 0:0:0:0: [sda] tag#8 CDB: opcode=0x2a 2a 00 00 02 55 ac 00 03 55 00
Mar 04 12:19:13 pi4 kernel: sd 0:0:0:0: [sda] tag#8 data out submit err -11 uas-tag 1 inflight: s-out a-cmd s-cmd work
Mar 04 12:28:42 pi4 kernel: swiotlb_tbl_map_single: 8631 callbacks suppressed
Mar 04 12:28:42 pi4 kernel: xhci_hcd 0000:01:00.0: swiotlb buffer is full (sz: 414208 bytes), total 32768 (slots), used 2 (slots)
Mar 04 12:28:42 pi4 kernel: sd 0:0:0:0: [sda] tag#25 data out submit err -11 uas-tag 1 inflight: s-out a-cmd s-cmd work
Mar 04 12:28:42 pi4 kernel: sd 0:0:0:0: [sda] tag#25 CDB: opcode=0x2a 2a 00 00 03 53 64 00 03 29 00
Mar 04 12:28:42 pi4 kernel: xhci_hcd 0000:01:00.0: swiotlb buffer is full (sz: 414208 bytes), total 32768 (slots), used 2 (slots)
Mar 04 12:28:42 pi4 kernel: sd 0:0:0:0: [sda] tag#25 data out submit err -11 uas-tag 1 inflight: s-out a-cmd s-cmd work
Mar 04 12:28:42 pi4 kernel: sd 0:0:0:0: [sda] tag#25 CDB: opcode=0x2a 2a 00 00 03 53 64 00 03 29 00
Mar 04 12:28:42 pi4 kernel: xhci_hcd 0000:01:00.0: swiotlb buffer is full (sz: 414208 bytes), total 32768 (slots), used 2 (slots)
Mar 04 12:28:42 pi4 kernel: sd 0:0:0:0: [sda] tag#25 data out submit err -11 uas-tag 1 inflight: s-out a-cmd s-cmd work
Mar 04 12:28:42 pi4 kernel: sd 0:0:0:0: [sda] tag#25 CDB: opcode=0x2a 2a 00 00 03 53 64 00 03 29 00
Mar 04 12:28:42 pi4 kernel: xhci_hcd 0000:01:00.0: swiotlb buffer is full (sz: 414208 bytes), total 32768 (slots), used 2 (slots)
Mar 04 12:28:42 pi4 kernel: sd 0:0:0:0: [sda] tag#25 data out submit err -11 uas-tag 1 inflight: s-out a-cmd s-cmd work
Mar 04 12:28:42 pi4 kernel: sd 0:0:0:0: [sda] tag#25 CDB: opcode=0x2a 2a 00 00 03 53 64 00 03 29 00
Mar 04 12:28:42 pi4 kernel: xhci_hcd 0000:01:00.0: swiotlb buffer is full (sz: 414208 bytes), total 32768 (slots), used 2 (slots)
Additional context
No response
The text was updated successfully, but these errors were encountered: