- Status Closed
- Percent Complete
- Task Type Bug Report
- Category Base system
- Assigned To No-one
- Operating System All
- Severity Low
- Priority Very Low
- Reported Version Trunk
- Due in Version Undecided
-
Due Date
Undecided
- Private
Opened by Hannu Nyman - 29.07.2019
Last edited by Hauke Mehrtens - 18.08.2019
FS#2414 - ath10k crashes quickly on ipq806x R7800 and NBG6817
After the mac80211 version bump to 5.2-rc7 in 0b2c42ced two weeks weeks ago, ath10k based R7800 always crashes quickly after reboot if the old “non-ct” ath10k wifi driver and firmware is enabled. “wifi down” enables the router to stay alive and to work otherwise normally.
(Note, ath10k-ct works ok with the same OpenWrt build. So this is only about the traditional ath10k)
It looks like the wifi driver crashes right after any wifi client initiates contact with the router. Kernel serial log does not provide much more info except that the crash seems to always happen at ieee80211_sta_register_airtime
[ 69.854584] Unable to handle kernel paging request at virtual address fffff9e8 ... [ 70.050841] PC is at ieee80211_sta_register_airtime+0x24/0x148 [mac80211] [ 70.055225] LR is at ath10k_htt_t2h_msg_handler+0x678/0x10f4 [ath10k_core]
OpenWrt master, SNAPSHOT r10624-4dcef8263e
QCA9984
firmware ver 10.4-3.9.0.2-00021
I noticed this first two weeks ago, and when I mentioned it on the forum, and slh responded that he has seen the same with nbg6817
https://forum.openwrt.org/t/netgear-r7800-exploration-ipq8065-qca9984/285/1675?u=hnyman
I have since done several builds, but the situation has not corrected itself. The old ath10k just crashes the router, while -ct works ok.
I have not yet done proper regression testing to make sure that it really is the mac80211 version bump commit, but that seems quite likely, as my ath10k build one week earlier with r10443-ea4e1dac71 (20190708) worked quite ok and there were not that many suspects in the relevant time frame.
Any debugging advice?
log:
[ 75.991714] Unable to handle kernel paging request at virtual address fffff9e8 [ 75.991756] pgd = c0204000 [ 75.997955] [fffff9e8] *pgd=5fdfd861, *pte=00000000, *ppte=00000000 [ 76.000537] Internal error: Oops: 37 [#1] SMP ARM [ 76.006686] Modules linked in: pppoe ppp_async ath10k_pci ath10k_core ath pptp pppox ppp_mppe ppp_generic mac80211 iptable_nat ipt_REJECT ipt_MASQUERADE cfg80211 xt_time xt_tcpudp xt_tcpmss xt_statistic xt_state xt_recent xt_nat xt_multiport xt_mark xt_mac xt_limit xt_length xt_hl xt_helper xt_esp xt_ecn xt_dscp xt_conntrack xt_connmark xt_connlimit xt_connbytes xt_comment xt_TCPMSS xt_REDIRECT xt_LOG xt_HL xt_FLOWOFFLOAD xt_DSCP xt_CT xt_CLASSIFY usbserial slhc nf_reject_ipv4 nf_nat_redirect nf_nat_masquerade_ipv4 nf_conntrack_ipv4 nf_nat_ipv4 nf_log_ipv4 nf_flow_table_hw nf_flow_table nf_defrag_ipv4 nf_conntrack_rtcache nf_conntrack_netlink iptable_raw iptable_mangle iptable_filter ipt_ah ipt_ECN ip_tables crc_ccitt compat chaoskey fuse sch_cake sch_tbf sch_ingress sch_htb sch_hfsc em_u32 cls_u32 [ 76.059974] cls_tcindex cls_route cls_matchall cls_fw cls_flow cls_basic act_skbedit act_mirred ledtrig_usbport xt_set ip_set_list_set ip_set_hash_netportnet ip_set_hash_netport ip_set_hash_netnet ip_set_hash_netiface ip_set_hash_net ip_set_hash_mac ip_set_hash_ipportnet ip_set_hash_ipportip ip_set_hash_ipport ip_set_hash_ipmark ip_set_hash_ip ip_set_bitmap_port ip_set_bitmap_ipmac ip_set_bitmap_ip ip_set nfnetlink ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6t_NPT ip6t_MASQUERADE nf_nat_masquerade_ipv6 nf_nat nf_conntrack nf_log_ipv6 nf_log_common ip6table_mangle ip6table_filter ip6_tables ip6t_REJECT x_tables nf_reject_ipv6 msdos ip_gre gre ifb sit tunnel4 ip_tunnel tun vfat fat hfsplus cifs nls_utf8 nls_iso8859_15 nls_iso8859_1 nls_cp850 nls_cp437 nls_cp1250 sha1_generic md5 md4 [ 76.130634] usb_storage leds_gpio xhci_plat_hcd xhci_pci xhci_hcd dwc3 dwc3_of_simple ohci_platform ohci_hcd phy_qcom_dwc3 ahci ehci_platform sd_mod ahci_platform libahci_platform libahci libata scsi_mod ehci_hcd gpio_button_hotplug ext4 mbcache jbd2 exfat crc32c_generic [ 76.154772] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 4.14.132 #0 [ 76.177001] Hardware name: Generic DT based system [ 76.182990] task: c0b06d80 task.stack: c0b00000 [ 76.187832] PC is at ieee80211_sta_register_airtime+0x24/0x148 [mac80211] [ 76.192211] LR is at ath10k_htt_t2h_msg_handler+0x678/0x10f4 [ath10k_core] [ 76.199052] pc : [<bf75bfac>] lr : [<bf83e8b0>] psr: a0000113 [ 76.205820] sp : c0b01d54 ip : 00000002 fp : bf869c0c [ 76.211981] r10: 0000003c r9 : dbdca138 r8 : 00060002 [ 76.217192] r7 : 00000000 r6 : dabe1150 r5 : 00000000 r4 : dbdc95c0 [ 76.222401] r3 : 00000000 r2 : 00060002 r1 : 00000000 r0 : 00000000 [ 76.229003] Flags: NzCv IRQs on FIQs on Mode SVC_32 ISA ARM Segment none [ 76.235509] Control: 10c5787d Table: 5c94006a DAC: 00000051 [ 76.242716] Process swapper/0 (pid: 0, stack limit = 0xc0b00210) [ 76.248446] Stack: (0xc0b01d54 to 0xc0b02000) [ 76.254532] 1d40: dbdc95c0 00000000 dabe1150 [ 76.258808] 1d60: 00000001 dabe1150 dbdca138 0000003c bf869c0c bf83e8b0 00000002 c0314b10 [ 76.266969] 1d80: dbdc9c70 00000001 00000001 dabe114c 00010000 00000000 dbdcd724 bf88f3d8 [ 76.275126] 1da0: c0310d28 db393c00 dbdc95c0 00000000 c0b01dd0 c07fb4c4 dbdcd724 00000001 [ 76.283286] 1dc0: 00000022 bf88b09c db393c00 00000022 c0b01dd0 c0b01dd0 00000000 dbdcc5c0 [ 76.291445] 1de0: bf88f04c dbdcd654 dbdcd71c dbdc95c0 00000014 dbdcd724 dbdcc5c0 00000005 [ 76.299605] 1e00: 0004b400 bf85c360 00000000 bf87101c c0b01e24 00000006 00000000 dbdc95c0 [ 76.307764] 1e20: 00000001 00000040 0000012c c0b01e80 1cf51000 bf85c448 dbdcd440 dbdc95c0 [ 76.315925] 1e40: dbdca440 ffffa880 00000040 bf88cb68 dbdcd440 00000001 00000040 ffffa880 [ 76.324084] 1e60: c0b02d00 c06d72e0 dd990080 c0a3f080 c0b255dc c0b047e4 c090afac c090e80c [ 76.332244] 1e80: c0b01e80 c0b01e80 c0b01e88 c0b01e88 dd4cc200 00000000 00000003 c0b0208c [ 76.340405] 1ea0: c0b02080 40000003 ffffe000 00000100 c0b02080 c03015c8 00000000 00000001 [ 76.348564] 1ec0: dd408000 c0a38210 c0b2c7c0 0000000a ffffa880 c0b02d00 c07fb764 00200102 [ 76.356723] 1ee0: dd4cc268 c0a3e414 00000000 00000000 00000001 dd408000 de803000 00000000 [ 76.364883] 1f00: 00000000 c03247cc c0a3e414 c0368f1c c0b03f60 c0b153cc de80200c de802000 [ 76.373042] 1f20: c0b01f48 c0301488 c0308630 60000013 ffffffff c0b01f7c 00000000 c0b00000 [ 76.381204] 1f40: 00000000 c030c08c 00000001 00000000 00000000 c0315180 ffffe000 c0b03cc0 [ 76.389363] 1f60: c0b03c70 00000000 00000000 c0a2da28 00000000 00000000 c0b01f90 c0b01f98 [ 76.397522] 1f80: c030862c c0308630 60000013 ffffffff 00000051 00000000 ffffe000 c035dd18 [ 76.405681] 1fa0: 000000bf c0b03c40 00000000 c0b2c000 dddfce80 c035e060 c0b2c040 c0a00cf4 [ 76.413842] 1fc0: ffffffff ffffffff 00000000 c0a0067c c0a2da28 00000000 00000000 c0b2c1d4 [ 76.422001] 1fe0: c0b03c5c c0a2da24 c0b07ee0 4220406a 512f04d0 4220807c 00000000 00000000 [ 76.430335] [<bf75bfac>] (ieee80211_sta_register_airtime [mac80211]) from [<00000002>] (0x2) [ 76.438314] Code: e1cd81f0 e1a08002 e1cda1f8 e58de020 (e5102618) [ 76.446965] ---[ end trace 227a38ade964d642 ]--- [ 76.452783] Kernel panic - not syncing: Fatal exception in interrupt [ 76.457488] CPU1: stopping [ 76.463817] CPU: 1 PID: 61 Comm: kworker/1:1 Tainted: G D 4.14.132 #0 [ 76.466328] Hardware name: Generic DT based system [ 76.473916] Workqueue: events set_brightness_delayed [ 76.478698] [<c030f2c4>] (unwind_backtrace) from [<c030b4c0>] (show_stack+0x14/0x20) [ 76.483805] [<c030b4c0>] (show_stack) from [<c07dfc18>] (dump_stack+0x88/0x9c) [ 76.491524] [<c07dfc18>] (dump_stack) from [<c030e50c>] (handle_IPI+0xf4/0x1ac) [ 76.498550] [<c030e50c>] (handle_IPI) from [<c03014b8>] (gic_handle_irq+0x9c/0xb8) [ 76.505754] [<c03014b8>] (gic_handle_irq) from [<c030c08c>] (__irq_svc+0x6c/0x90) [ 76.513380] Exception stack(0xdcc5de28 to 0xdcc5de70) [ 76.520959] de20: dbdca138 00000000 000004a9 000004a8 dbdca138 dbdc9288 [ 76.526009] de40: dca54ca0 bf84cdf8 00000000 00000000 dbdc8be0 00000002 00000008 dcc5de78 [ 76.534147] de60: bf84ce24 c07fa42c 20000013 ffffffff [ 76.542300] [<c030c08c>] (__irq_svc) from [<c07fa42c>] (_raw_spin_lock_bh+0x40/0x58) [ 76.547434] [<c07fa42c>] (_raw_spin_lock_bh) from [<bf84ce24>] (ath10k_wmi_tx_beacons_iter+0x2c/0x124 [ath10k_core]) [ 76.555459] [<bf84ce24>] (ath10k_wmi_tx_beacons_iter [ath10k_core]) from [<bf78a304>] (ieee80211_delayed_tailroom_dec+0x144/0x1e0 [mac80211]) [ 76.565953] [<bf78a304>] (ieee80211_delayed_tailroom_dec [mac80211]) from [<bf78a3b0>] (ieee80211_iterate_active_interfaces_atomic+0x10/0x1c [mac80211]) [ 76.578485] [<bf78a3b0>] (ieee80211_iterate_active_interfaces_atomic [mac80211]) from [<bf84cf70>] (ath10k_wmi_cmd_send+0x54/0x1cc [ath10k_core]) [ 76.592143] [<bf84cf70>] (ath10k_wmi_cmd_send [ath10k_core]) from [<bf859a50>] (ath10k_leds_set_brightness_blocking+0x90/0xb0 [ath10k_core]) [ 76.604992] [<bf859a50>] (ath10k_leds_set_brightness_blocking [ath10k_core]) from [<c06a2164>] (set_brightness_delayed+0x98/0x100) [ 76.617580] [<c06a2164>] (set_brightness_delayed) from [<c0337f34>] (process_one_work+0x28c/0x444) [ 76.629113] [<c0337f34>] (process_one_work) from [<c0338430>] (worker_thread+0x344/0x58c) [ 76.638051] [<c0338430>] (worker_thread) from [<c033e478>] (kthread+0x164/0x16c) [ 76.646300] [<c033e478>] (kthread) from [<c0307d28>] (ret_from_fork+0x14/0x2c) [ 76.653758] Rebooting in 3 seconds..
Longer log:
[ 16.856238] Loading modules backported from Linux version v5.2-rc7-0-g6fbc7275c7a9 [ 16.860820] Backport generated by backports.git v5.2-rc7-1-0-g021a6ba1 [ 16.869804] ip_tables: (C) 2000-2006 Netfilter Core Team [ 16.877980] ctnetlink v0.93: registering with nfnetlink. [ 16.900229] usbcore: registered new interface driver usbserial [ 16.900283] usbcore: registered new interface driver usbserial_generic [ 16.905001] usbserial: USB Serial support registered for generic [ 16.924392] xt_time: kernel timezone is -0000 [ 16.954572] PPP generic driver version 2.4.2 [ 16.955096] PPP MPPE Compression module registered [ 16.958724] NET: Registered protocol family 24 [ 16.963210] PPTP driver version 0.8.5 [ 16.975413] ath10k_pci 0000:01:00.0: enabling device (0140 -> 0142) [ 16.976007] ath10k_pci 0000:01:00.0: pci irq msi oper_irq_mode 2 irq_mode 0 reset_mode 0 [ 17.139984] ath10k_pci 0000:01:00.0: Direct firmware load for ath10k/QCA9984/hw1.0/firmware-6.bin failed with error -2 [ 17.140014] ath10k_pci 0000:01:00.0: Falling back to user helper [ 17.510554] firmware ath10k!QCA9984!hw1.0!firmware-6.bin: firmware_loading_store: map pages failed [ 17.711192] ath10k_pci 0000:01:00.0: qca9984/qca9994 hw1.0 target 0x01000000 chip_id 0x00000000 sub 168c:cafe [ 17.711250] ath10k_pci 0000:01:00.0: kconfig debug 0 debugfs 1 tracing 0 dfs 1 testmode 1 [ 17.726722] ath10k_pci 0000:01:00.0: firmware ver 10.4-3.9.0.2-00021 api 5 features no-p2p,mfp,peer-flow-ctrl,btcoex-param,allows-mesh-bcast,no-ps crc32 9626782c [ 19.992079] ath10k_pci 0000:01:00.0: board_file api 2 bmi_id 0:1 crc32 cf58c3bc [ 25.882791] ath10k_pci 0000:01:00.0: unsupported HTC service id: 1536 [ 25.995168] ath10k_pci 0000:01:00.0: htt-ver 2.2 wmi-op 6 htt-op 4 cal pre-cal-file max-sta 512 raw 0 hwcrypto 1 [ 26.087064] ath10k_pci 0001:01:00.0: enabling device (0140 -> 0142) [ 26.087877] ath10k_pci 0001:01:00.0: pci irq msi oper_irq_mode 2 irq_mode 0 reset_mode 0 [ 26.249728] ath10k_pci 0001:01:00.0: Direct firmware load for ath10k/QCA9984/hw1.0/firmware-6.bin failed with error -2 [ 26.249768] ath10k_pci 0001:01:00.0: Falling back to user helper [ 26.521969] firmware ath10k!QCA9984!hw1.0!firmware-6.bin: firmware_loading_store: map pages failed [ 26.522200] ath10k_pci 0001:01:00.0: qca9984/qca9994 hw1.0 target 0x01000000 chip_id 0x00000000 sub 168c:cafe [ 26.530085] ath10k_pci 0001:01:00.0: kconfig debug 0 debugfs 1 tracing 0 dfs 1 testmode 1 [ 26.542340] ath10k_pci 0001:01:00.0: firmware ver 10.4-3.9.0.2-00021 api 5 features no-p2p,mfp,peer-flow-ctrl,btcoex-param,allows-mesh-bcast,no-ps crc32 9626782c [ 28.817512] ath10k_pci 0001:01:00.0: board_file api 2 bmi_id 0:2 crc32 cf58c3bc [ 34.722258] ath10k_pci 0001:01:00.0: unsupported HTC service id: 1536 [ 34.837088] ath10k_pci 0001:01:00.0: htt-ver 2.2 wmi-op 6 htt-op 4 cal pre-cal-file max-sta 512 raw 0 hwcrypto 1 [ 34.930704] kmodloader: done loading kernel modules from /etc/modules.d/* ... [ 44.931784] IPv6: ADDRCONF(NETDEV_CHANGE): br-lan: link becomes ready [ 46.178478] ath: EEPROM regdomain: 0x80f6 [ 46.178504] ath: EEPROM indicates we should expect a country code [ 46.181468] ath: doing EEPROM country->regdmn map search [ 46.187658] ath: country maps to regdmn code: 0x37 [ 46.192928] ath: Country alpha2 being used: FI [ 46.197625] ath: Regpair used: 0x37 [ 46.201949] ath: regdomain 0x80f6 dynamically updated by user [ 46.205361] ath: EEPROM regdomain: 0x80f6 [ 46.211342] ath: EEPROM indicates we should expect a country code [ 46.215236] ath: doing EEPROM country->regdmn map search [ 46.221397] ath: country maps to regdmn code: 0x37 [ 46.226772] ath: Country alpha2 being used: FI [ 46.231293] ath: Regpair used: 0x37 [ 46.235804] ath: regdomain 0x80f6 dynamically updated by user [ 56.261988] ath10k_pci 0001:01:00.0: unsupported HTC service id: 1536 [ 56.472614] ath10k_pci 0001:01:00.0: Unknown eventid: 36933 [ 56.476361] IPv6: ADDRCONF(NETDEV_UP): wlan1: link is not ready [ 62.469949] ath10k_pci 0000:01:00.0: unsupported HTC service id: 1536 [ 62.675788] ath10k_pci 0000:01:00.0: Unknown eventid: 36933 [ 62.679170] IPv6: ADDRCONF(NETDEV_UP): wlan0: link is not ready [ 62.692506] br-lan: port 2(wlan1) entered blocking state [ 62.692569] br-lan: port 2(wlan1) entered disabled state [ 62.697444] device wlan1 entered promiscuous mode [ 62.711382] br-lan: port 3(wlan0) entered blocking state [ 62.711407] br-lan: port 3(wlan0) entered disabled state [ 62.716036] device wlan0 entered promiscuous mode [ 63.132153] IPv6: ADDRCONF(NETDEV_CHANGE): wlan1: link becomes ready [ 63.132294] br-lan: port 2(wlan1) entered blocking state [ 63.137642] br-lan: port 2(wlan1) entered forwarding state [ 63.561502] IPv6: ADDRCONF(NETDEV_CHANGE): wlan0: link becomes ready [ 63.561645] br-lan: port 3(wlan0) entered blocking state [ 63.567005] br-lan: port 3(wlan0) entered forwarding state [ 69.854584] Unable to handle kernel paging request at virtual address fffff9e8 [ 69.854619] pgd = c0204000 [ 69.860830] [fffff9e8] *pgd=5fdfd861, *pte=00000000, *ppte=00000000 [ 69.863399] Internal error: Oops: 37 [#1] SMP ARM [ 69.869551] Modules linked in: pppoe ppp_async ath10k_pci ath10k_core ath pptp pppox ppp_mppe ppp_generic mac80211 iptable_nat ipt_REJECT ipt_MASQUERADE cfg80211 xt_time xt_tcpudp xt_tcpmss xt_statistic xt_state xt_recent xt_nat xt_multiport xt_mark xt_mac xt_limit xt_length xt_hl xt_helper xt_esp xt_ecn xt_dscp xt_conntrack xt_connmark xt_connlimit xt_connbytes xt_comment xt_TCPMSS xt_REDIRECT xt_LOG xt_HL xt_FLOWOFFLOAD xt_DSCP xt_CT xt_CLASSIFY usbserial slhc nf_reject_ipv4 nf_nat_redirect nf_nat_masquerade_ipv4 nf_conntrack_ipv4 nf_nat_ipv4 nf_log_ipv4 nf_flow_table_hw nf_flow_table nf_defrag_ipv4 nf_conntrack_rtcache nf_conntrack_netlink iptable_raw iptable_mangle iptable_filter ipt_ah ipt_ECN ip_tables crc_ccitt compat chaoskey fuse sch_cake sch_tbf sch_ingress sch_htb sch_hfsc em_u32 cls_u32 [ 69.922838] cls_tcindex cls_route cls_matchall cls_fw cls_flow cls_basic act_skbedit act_mirred ledtrig_usbport xt_set ip_set_list_set ip_set_hash_netportnet ip_set_hash_netport ip_set_hash_netnet ip_set_hash_netiface ip_set_hash_net ip_set_hash_mac ip_set_hash_ipportnet ip_set_hash_ipportip ip_set_hash_ipport ip_set_hash_ipmark ip_set_hash_ip ip_set_bitmap_port ip_set_bitmap_ipmac ip_set_bitmap_ip ip_set nfnetlink ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6t_NPT ip6t_MASQUERADE nf_nat_masquerade_ipv6 nf_nat nf_conntrack nf_log_ipv6 nf_log_common ip6table_mangle ip6table_filter ip6_tables ip6t_REJECT x_tables nf_reject_ipv6 msdos ip_gre gre ifb sit tunnel4 ip_tunnel tun vfat fat hfsplus cifs nls_utf8 nls_iso8859_15 nls_iso8859_1 nls_cp850 nls_cp437 nls_cp1250 sha1_generic md5 md4 [ 69.993498] usb_storage leds_gpio xhci_plat_hcd xhci_pci xhci_hcd dwc3 dwc3_of_simple ohci_platform ohci_hcd phy_qcom_dwc3 ahci ehci_platform sd_mod ahci_platform libahci_platform libahci libata scsi_mod ehci_hcd gpio_button_hotplug ext4 mbcache jbd2 exfat crc32c_generic [ 70.017635] CPU: 0 PID: 7566 Comm: iptables Not tainted 4.14.132 #0 [ 70.039864] Hardware name: Generic DT based system [ 70.045855] task: dbd67480 task.stack: da974000 [ 70.050841] PC is at ieee80211_sta_register_airtime+0x24/0x148 [mac80211] [ 70.055225] LR is at ath10k_htt_t2h_msg_handler+0x678/0x10f4 [ath10k_core] [ 70.062088] pc : [<bf75bfac>] lr : [<bf83e8b0>] psr: a0000113 [ 70.068856] sp : da975c2c ip : 00000002 fp : bf869c0c [ 70.075019] r10: 0000003b r9 : dbdc2138 r8 : 00060002 [ 70.080228] r7 : 00000000 r6 : dabfd650 r5 : 00000000 r4 : dbdc15c0 [ 70.085437] r3 : 00000000 r2 : 00060002 r1 : 00000000 r0 : 00000000 [ 70.092038] Flags: NzCv IRQs on FIQs on Mode SVC_32 ISA ARM Segment none [ 70.098546] Control: 10c5787d Table: 5dba806a DAC: 00000051 [ 70.105750] Process iptables (pid: 7566, stack limit = 0xda974210) [ 70.111480] Stack: (0xda975c2c to 0xda976000) [ 70.117572] 5c20: dbdc15c0 00000000 dabfd650 00000001 dabfd650 [ 70.122012] 5c40: dbdc2138 0000003b bf869c0c bf83e8b0 00000002 c0314b10 dbdc1c70 00000001 [ 70.130172] 5c60: 00000001 dabfd64c 00010000 00000000 dbdc5724 bf88f3d8 c0310d28 da8d2780 [ 70.138332] 5c80: dbdc15c0 00000000 da975ca8 c07fb4c4 dbdc5724 00000001 00000022 bf88b09c [ 70.146491] 5ca0: da8d2780 00000022 da975ca8 da975ca8 00000000 dbdc45c0 bf88f04c dbdc5654 [ 70.154652] 5cc0: dbdc571c dbdc15c0 00000014 dbdc5724 dbdc45c0 00000005 0004b400 bf85c360 [ 70.162812] 5ce0: 00000000 bf87101c da975cfc 00000006 00000000 dbdc15c0 00000001 00000040 [ 70.170970] 5d00: 0000012c da975d58 1cf51000 bf85c448 dbdc5440 dbdc15c0 dbdc2440 ffffa61a [ 70.179129] 5d20: 00000040 bf88cb68 dbdc5440 00000001 00000040 ffffa61a c0b02d00 c06d72e0 [ 70.187290] 5d40: dd990080 c0a3f080 c0b255dc c0b047e4 c090afac c090e80c da975d58 da975d58 [ 70.195451] 5d60: da975d60 da975d60 dd4cc200 00000000 00000003 c0b0208c c0b02080 40000003 [ 70.203608] 5d80: ffffe000 00000100 c0b02080 c03015c8 00000000 00000001 dd408000 c0a38210 [ 70.211769] 5da0: c0b2c7c0 0000000a ffffa61a c0b02d00 c07fb764 00400104 dd4cc268 c0a3e414 [ 70.219928] 5dc0: 00000000 00000000 00000001 dd408000 de803000 da975f08 c0914000 c03247cc [ 70.228091] 5de0: c0a3e414 c0368f1c c0b03f60 c0b153cc de80200c de802000 da975e20 c0301488 [ 70.236248] 5e00: c0315140 40000013 ffffffff da975e54 5c13375f da974000 da975f08 c030c08c [ 70.244407] 5e20: dcb1d12c 00000040 00000000 00000212 dcb1d130 b6e4c000 b6e4b000 ddd6bb2c [ 70.252567] 5e40: 5c13375f b6e4c000 da975f08 c0914000 0005fdf9 da975e70 c03f3060 c0315140 [ 70.260728] 5e60: 40000013 ffffffff 00000051 bf000000 38e38e39 b6e4c000 00000000 d94e3480 [ 70.268886] 5e80: c0b06a84 c0b589c0 dcdd01c0 dbbaadbc dcdd01f8 fffff000 00000000 dbbaadb8 [ 70.277047] 5ea0: b6e4c000 b6e4bfff 00000000 00000000 00000000 00000000 db535b48 d94e3480 [ 70.285205] 5ec0: ffffffff 00000000 da975f08 00000000 00000000 dcdd01fc 00000000 c03f37e8 [ 70.293367] 5ee0: 00000000 0000045c da8e6540 dcdd01c0 00000000 0000045c c0307e44 c03f9d14 [ 70.301525] 5f00: d9da8454 00000000 dcdd01c0 00000001 00000000 00000000 ffffffff 00000000 [ 70.309685] 5f20: befb4334 0000003b 00000400 dbfc3000 00000000 d9da1660 00000000 d9da16b0 [ 70.317845] 5f40: d9da83a0 dcdd01c0 00000000 ffffe000 dcdd01c0 00000000 ffffe000 c031cd90 [ 70.326006] 5f60: dcdd01c0 dbd67480 ffffe000 c03231fc 00000002 c041ddac 00000000 000000f8 [ 70.334166] 5f80: 00000000 c0323964 00000002 befb4920 00000003 000000f8 c0307e44 c0323978 [ 70.342323] 5fa0: 00000002 c0307c40 00000002 befb4920 00000002 befb4320 00000000 00000002 [ 70.350483] 5fc0: 00000002 befb4920 00000003 000000f8 00000000 b6e1fce0 00000006 00000000 [ 70.358645] 5fe0: 00000001 befb43b8 b6ebca0c b6ece0e8 60000010 00000002 00000000 00000000 [ 70.366945] [<bf75bfac>] (ieee80211_sta_register_airtime [mac80211]) from [<00000002>] (0x2) [ 70.374957] Code: e1cd81f0 e1a08002 e1cda1f8 e58de020 (e5102618) [ 70.383600] ---[ end trace b799a93daca2cf67 ]--- [ 70.389524] Kernel panic - not syncing: Fatal exception in interrupt [ 70.394141] CPU1: stopping [ 70.400465] CPU: 1 PID: 0 Comm: swapper/1 Tainted: G D 4.14.132 #0 [ 70.402979] Hardware name: Generic DT based system [ 70.410228] [<c030f2c4>] (unwind_backtrace) from [<c030b4c0>] (show_stack+0x14/0x20) [ 70.415074] [<c030b4c0>] (show_stack) from [<c07dfc18>] (dump_stack+0x88/0x9c) [ 70.422965] [<c07dfc18>] (dump_stack) from [<c030e50c>] (handle_IPI+0xf4/0x1ac) [ 70.429991] [<c030e50c>] (handle_IPI) from [<c03014b8>] (gic_handle_irq+0x9c/0xb8) [ 70.437195] [<c03014b8>] (gic_handle_irq) from [<c030c08c>] (__irq_svc+0x6c/0x90) [ 70.444822] Exception stack(0xdd461f80 to 0xdd461fc8) [ 70.452411] 1f80: 00000001 00000000 00000000 c0315180 ffffe000 c0b03cc0 c0b03c70 00000000 [ 70.457452] 1fa0: 00000000 512f04d0 00000000 00000000 dd461fc8 dd461fd0 c030862c c0308630 [ 70.465581] 1fc0: 60000013 ffffffff [ 70.473742] [<c030c08c>] (__irq_svc) from [<c0308630>] (arch_cpu_idle+0x38/0x44) [ 70.477054] [<c0308630>] (arch_cpu_idle) from [<c035dd18>] (do_idle+0xe8/0x1bc) [ 70.484686] [<c035dd18>] (do_idle) from [<c035e060>] (cpu_startup_entry+0x1c/0x20) [ 70.491712] [<c035e060>] (cpu_startup_entry) from [<423017cc>] (0x423017cc) [ 70.499346] Rebooting in 3 seconds..
18.08.2019 19:23
Reason for closing: Fixed
Additional comments about closing:
This was fixed in https:/ /git.openwrt.org/2ceee0e02362519f08be2a2 88bd8f42ca6cef257
The ath10k (non-ct) firmware has not been updated in a while in OpenWRT. You might want to try a newer version from https://github.com/kvalo/ath10k-firmware/tree/master/QCA9984/hw1.0 The newest 10.4-3.9.0.2 version is 10.4-3.9.0.2-00046, or take the 3.10 series, 10.4-3.10-00047 is now available.
I tested with both 10.4-3.9.0.2-00046 and 10.4-3.10-00047 formwares and neither helped.
This branch still contains some more backports version between 4.19 and 5.2, could you please check with which version this problem started:
https://git.openwrt.org/?p=openwrt/staging/hauke.git;a=shortlog;h=refs/heads/mac80211-5.0
Thanks for pointer to those intermediate commits in your staging tree. I checked it out and built minimal versions of the firmware. baseline and three first commits worked ok, but then the chrasing starts
The culprit is the bump to 5.1-rc2-1:
https://git.openwrt.org/?p=openwrt/staging/hauke.git;a=commit;h=a9e9bdb8ba26ae3cf9df0a230d8eacd92deac821
Pure guess, but that commit seems to remove some airtime related patches, and like I have said earlier, the crash always happens at:
PC is at ieee80211_sta_register_airtime+0x24/0x148 [mac80211]
Below are three consecutive crashes with that 5.1-rc2-1
I can reproduce this with r10810-552c48ea43, so using
on my ZyXEL NBG6817 (ipq8065) with QCA9984/ ath10k (firmware ver 10.4-3.10-00047 api 5 features no-p2p,mfp,peer-flow-ctrl,btcoex-param,allows-mesh-bcast,no-ps crc32 19ca6df2)
As soon as the first STA device tries associating, ath10k crashes the kernel and the watchdog reboots the router.
The following log is using ipq806x with kernel v4.19.67 and your hostapd v2.9 update, so slightly modified compared to stock HEAD; the router doesn't have a serial console attached - so I'm slightly limited to what logread can fetch:
I checked where this problem occurs:
This is probably in "if (sta→local→airtime_flags & AIRTIME_USE_TX)".
My assumption is that pubsta is NULL, becasue the invalid access happens at 0xfffff9e8 and container_of() will take the pointer to pubsta, which is probably NULL and substract something to the to the outer pointer.
Could you please try the attached patch and report back if it works better now.
I started testing the patch with the crashing commit from your 5.0 branch, which has the internediate commits. Is the patch supposed to work with that?
(I will test also with the current HEAD of master, but I wanted a test with no other changes.)
The patch helps the router to stay alive, but no wifi connectivity is reached. The firmware crashes and there are repetitive error message like this:
hauke, with the current HEAD of master (r10821-d1f207ecc9 including mac80211 commit "Update to version 5.3-rc4-1"), your patch seems to work almost ok
There is wifi connectivity both on 2.4. and 5 GHz radios. And the router stays alive.
There are intermittent bursts of "SWBA overrun on vdev 0, skipped old beacon" errors.
Like this:
Thank you for testing.
This patch was meant for current OpenWrt master or backports 5.1 or later.
This problem was introduced in mainline kernel 5.1. I will add this to OpenWrt master and already send it upstream.
I assume that the FW was too new for backports 5.0 and it crashed because of that, if it works with the current master now, I think this is fine.
This was fixed in https://git.openwrt.org/2ceee0e02362519f08be2a288bd8f42ca6cef257