FS#384 - IPQ806X: CMD: swconfig on R7800 generate Kernel Panic! #5567
Comments
KerwinKoo: After reporting this bug I noticed some formatting errors in the text. If you need more information about this bug, email me at gukaiqiang@gmail.com and I will respond as soon as I can. |
hnyman: Confirmed with R7800 using two firmware versions: During testing I noticed an additional symptom:
The three minutes mentioned above is not exact: my router hung after 4 minutes too. (The error can also be triggered by visiting the switch page in LuCI.) This is self-built firmware, so no issues from shared/non-shared components should arise. I first tested with the router at an uptime of over 12 hours, and there were no problems: the swconfig command ran normally. Then I tested with a recently rebooted router: 14:42:39 up 4 min, load average: 0.12, 0.20, 0.09 root@lede: |
hnyman: I also tested with two older firmware versions: r2708-39d3a4117b of 2016-12-30: same hang, and similarly no reaction to WAN cable removal. So this is not a new problem for the R7800 device; it has been there for some time. |
dissent1: It's an old issue. I have seen it since the first commit that enabled support for the R7800 in LEDE, whenever I tried to configure the switch in LuCI right after a reboot. |
stintel: I'm seeing similar behavior on my DAP-2695, which also has the AR8337 rev. 2 switch. Running swconfig dev switch0 show shortly after boot seems to completely lock up the SoC. I do not see any kernel messages, however; the serial console freezes right after the last line here: I traced it back to this part of the code in ar8337.c: |
dissent1: Maybe this can help somehow? |
stintel: Workaround implemented in https://git.lede-project.org/ec1a695d |
dvlemplgk: Please see my pull request lede-project/source#838 |
reiffert: Hi. I backed out stintel's patch for testing with LEDE trunk on kernel 4.4.49, and I was unable to reproduce this issue on a Netgear R7500 (ipq806x) with ar8327. Guenther, would you mind trying again after manually backing out stintel's workaround and letting us know whether you still get it? Please find the logs attached. Thank you. |
dvlemplgk: Hi Thomas, we actually saw this bug on an OpenWrt-based kernel. My patch fixes the lockup there, so the workaround is not needed. Stijn has already reverted the workaround on the lede-17.01 branch. |
KerwinKoo:
Log of this issue:
As you can see from the log reported on /dev/ttyMSM0 (or by running dmesg), if the swconfig dev switch0 show command is run immediately after booting the LEDE kernel (less than 2 min after the Linux system has fully started), the kernel crashes and the router reboots.
This issue does not occur every time, but it occurs very frequently. It happens not only when swconfig dev switch0 show is run by hand, but also automatically. This part of the log shows the same kernel crash without the swconfig command being run:
The router runs OK only if the WAN port is disconnected or swconfig is run later than 3 min after kernel boot.
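The timing condition reported in this thread (crash shortly after boot, normal operation later) can be sketched as a small shell guard that delays the query until the system has been up for a while. This is only an illustration of the reported behaviour, not the workaround or the patch referenced in the comments. The function names and the MIN_UPTIME variable are hypothetical, and the 300-second default is an assumption chosen to sit above the 3 to 4 minutes mentioned by the reporters.

```sh
#!/bin/sh
# Hypothetical guard: do not query the switch until the system has been
# up for at least MIN_UPTIME seconds. 300 is an assumed value, not a
# figure confirmed anywhere in this thread.
MIN_UPTIME="${MIN_UPTIME:-300}"

# Print whole seconds since boot, taken from the first field of
# /proc/uptime (format: "254.31 900.12"). $1 optionally overrides the path.
uptime_seconds() {
    read -r up rest < "${1:-/proc/uptime}"
    echo "${up%%.*}"
}

# Print how many seconds remain until uptime $1 reaches threshold $2
# (prints 0 if the threshold has already been reached).
seconds_to_wait() {
    if [ "$1" -ge "$2" ]; then
        echo 0
    else
        echo $(( $2 - $1 ))
    fi
}

# Sleep for the remaining time, if any, then run the command from the report.
safe_swconfig_show() {
    remaining=$(seconds_to_wait "$(uptime_seconds)" "$MIN_UPTIME")
    if [ "$remaining" -gt 0 ]; then
        sleep "$remaining"
    fi
    swconfig dev switch0 show
}
```

After sourcing the file, calling safe_swconfig_show would wait out the remaining time and then run the query. Because the hang can also occur without swconfig being run at all, a delay like this can at best avoid the manually triggered case.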