Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FS#1349 - System is unstable under high network load. #6306

Closed
openwrt-bot opened this issue Feb 12, 2018 · 4 comments
Closed

FS#1349 - System is unstable under high network load. #6306

openwrt-bot opened this issue Feb 12, 2018 · 4 comments
Labels

Comments

@openwrt-bot
Copy link

ambientsummer:

OpenWrt/LEDE version: 184-r6069-a464fba7eb
Device: Xiaomi MiWiFi Mini (ramips/mt7620)

The problem occurs immediately after the starting fast downloads and uploads in the torrent-client on wired connected PC. On 184-r5993-1cb06d8907 everything is stable.

partial logread
Mon Feb 12 11:39:14 2018 daemon.info pppd[1130]: No response to 5 echo-requests
Mon Feb 12 11:39:14 2018 daemon.notice pppd[1130]: Serial link appears to be disconnected.
Mon Feb 12 11:39:14 2018 daemon.info pppd[1130]: Connect time 7.2 minutes.
Mon Feb 12 11:39:14 2018 daemon.info pppd[1130]: Sent 270076244 bytes, received 213831528 bytes.
Mon Feb 12 11:39:14 2018 daemon.err odhcp6c[1349]: Failed to send DHCPV6 message to ff02::1:2 (Permission denied)
Mon Feb 12 11:39:14 2018 daemon.notice netifd: Network device 'pppoe-wan' link is down
Mon Feb 12 11:39:14 2018 daemon.notice netifd: Network alias 'pppoe-wan' link is down
Mon Feb 12 11:39:14 2018 daemon.notice netifd: Interface 'wan_6' has link connectivity loss
Mon Feb 12 11:39:14 2018 daemon.err odhcp6c[1349]: Failed to send DHCPV6 message to ff02::1:2 (Permission denied)
Mon Feb 12 11:39:14 2018 daemon.notice netifd: Interface 'wan' has lost the connection
Mon Feb 12 11:39:14 2018 daemon.notice netifd: Interface 'vpn0' has lost the connection
Mon Feb 12 11:39:14 2018 daemon.notice netifd: Interface 'vpn0_addr' is now down
Mon Feb 12 11:39:14 2018 daemon.notice netifd: Interface 'vpn0_addr' is disabled
Mon Feb 12 11:39:14 2018 daemon.notice netifd: Network alias '' link is down
Mon Feb 12 11:39:14 2018 daemon.notice netifd: Interface 'vpn0_addr' has link connectivity loss
Mon Feb 12 11:39:14 2018 daemon.notice netifd: Interface 'wan_6' is disabled
Mon Feb 12 11:39:14 2018 daemon.notice netifd: tunnel 'gre4-vpn0' link is down
Mon Feb 12 11:39:14 2018 daemon.notice netifd: wan_6 (1349): Command failed: Permission denied
Mon Feb 12 11:39:14 2018 daemon.info dnsmasq[1690]: read /etc/hosts - 4 addresses
Mon Feb 12 11:39:14 2018 daemon.info dnsmasq[1690]: read /tmp/hosts/odhcpd - 2 addresses
Mon Feb 12 11:39:14 2018 daemon.info dnsmasq[1690]: read /tmp/hosts/dhcp.cfg01411c - 3 addresses
Mon Feb 12 11:39:14 2018 daemon.notice netifd: Interface 'vpn0' is now down
Mon Feb 12 11:39:14 2018 daemon.notice netifd: Interface 'vpn0' is setting up now
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.434847] ------------[ cut here ]------------
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.439613] WARNING: CPU: 0 PID: 1901 at net/sched/sch_generic.c:316 dev_watchdog+0x1dc/0x25c
Mon Feb 12 11:39:15 2018 kern.info kernel: [ 499.448335] NETDEV WATCHDOG: eth0 (mtk_soc_eth): transmit queue 0 timed out
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.455441] Modules linked in: rt2800soc rt2800pci rt2800mmio rt2800lib pppoe ppp_async rt2x00soc rt2x00pci rt2x00mmio rt2x00lib pppox ppp_generic nf_conntrack_ipv6 mt76x2e mt7603e mt76 mac80211 iptable_nat ipt_REJECT ipt_MASQUERADE cfg80211 xt_time xt_tcpudp xt_state xt_nat xt_multiport xt_mark xt_mac xt_limit xt_conntrack xt_comment xt_TCPMSS xt_REDIRECT xt_LOG slhc nf_reject_ipv4 nf_nat_redirect nf_nat_masquerade_ipv4 nf_conntrack_ipv4 nf_nat_ipv4 nf_nat nf_log_ipv4 nf_defrag_ipv6 nf_defrag_ipv4 nf_conntrack_rtcache nf_conntrack iptable_mangle iptable_filter ip_tables crc_itu_t crc_ccitt compat ip6t_REJECT nf_reject_ipv6 nf_log_ipv6 nf_log_common ip6table_mangle ip6table_filter ip6_tables x_tables ip_gre gre ip_tunnel eeprom_93cx6 leds_gpio ohci_platform ohci_hcd ehci_platform ehci_hcd gpio_button_hotplug
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.528734] usbcore nls_base usb_common
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.531424] CPU: 0 PID: 1901 Comm: gre.sh Not tainted 4.9.77 #0
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.537470] Stack : 8042751a 00000033 00000000 00000001 8723fb3c 803d9207 8038bd90 0000076d
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.546034] 80423660 0000013c ffffffff 00000200 00200000 8004d250 80391570 803d0000
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.554592] 00000003 0000013c 8038f738 87c0de2c 00200000 8007b3cc 00000000 00000006
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.563156] 00000006 00000000 00000000 00000000 00000000 00000000 00000000 00000000
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.571716] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.580285] ...
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.582783] Call Trace:
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.585315] [<8000e458>] show_stack+0x54/0x88
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.589784] [<800239d0>] __warn+0xe4/0x118
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.593966] [<80023a34>] warn_slowpath_fmt+0x30/0x3c
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.599051] [<8024aad4>] dev_watchdog+0x1dc/0x25c
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.603872] [<80057988>] call_timer_fn.isra.4+0x24/0x84
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.609222] [<80057b94>] run_timer_softirq+0x1ac/0x1f8
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.614477] [<8002653c>] __do_softirq+0x250/0x298
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.619295] [<800096f8>] handle_int+0x138/0x144
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.623937] [<801724e8>] __copy_user_common+0x160/0x2d8
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.629301] [<8017ab90>] copy_page_to_iter+0x118/0x2d4
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.634556] [<8007f228>] generic_file_read_iter+0x58c/0x9e8
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.640273] [<800b9b80>] __vfs_read+0x10c/0x14c
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.644908] [<800ba910>] vfs_read+0x98/0x140
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.649266] [<800bb934>] SyS_read+0x58/0xc4
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.653550] [<80012ccc>] syscall_common+0x34/0x58
Mon Feb 12 11:39:15 2018 kern.warn kernel: [ 499.658363] ---[ end trace 365777323827c701 ]---
Mon Feb 12 11:39:15 2018 kern.err kernel: [ 499.663085] mtk_soc_eth 10100000.ethernet eth0: transmit timed out
Mon Feb 12 11:39:15 2018 kern.info kernel: [ 499.669414] mtk_soc_eth 10100000.ethernet eth0: dma_cfg:80000065
Mon Feb 12 11:39:15 2018 kern.info kernel: [ 499.675564] mtk_soc_eth 10100000.ethernet eth0: tx_ring=0, base=07134000, max=128, ctx=0, dtx=0, fdx=82, next=0
Mon Feb 12 11:39:15 2018 kern.info kernel: [ 499.685882] mtk_soc_eth 10100000.ethernet eth0: rx_ring=0, base=07159000, max=128, calc=42, drx=46
Mon Feb 12 11:39:15 2018 daemon.notice pppd[1130]: Connection terminated.
Mon Feb 12 11:39:15 2018 daemon.info pppd[1130]: Connect time 7.2 minutes.
Mon Feb 12 11:39:15 2018 daemon.info pppd[1130]: Sent 270076244 bytes, received 213831528 bytes.
Mon Feb 12 11:39:15 2018 daemon.info pppd[1130]: Sent PADT
Mon Feb 12 11:39:15 2018 daemon.info pppd[1130]: Exit.
Mon Feb 12 11:39:15 2018 daemon.notice netifd: Interface 'wan' is now down
Mon Feb 12 11:39:15 2018 daemon.notice netifd: Interface 'wan' is setting up now
Mon Feb 12 11:39:15 2018 daemon.notice netifd: Interface 'vpn0' is now down

@openwrt-bot
Copy link
Author

moeller0:

Mon Feb 12 11:39:14 2018 daemon.info pppd[1130]: No response to 5 echo-requests
Mon Feb 12 11:39:14 2018 daemon.notice pppd[1130]: Serial link appears to be disconnected.

This indicates that part of the problem is that due to high a load too many of PPP's LCP packets were either dropped or queued up too long. As a work-around for that issue you might want to try installing luci-app-sqm and sqm-scripts and intantiate sqm on pppoe-wan with download/ingress bandwidth set to say 85% of the nominal download bandwidth and upload/egress set to 95% of the nominal upload bandwidth. That should leave enough slack for PPP's LCP packets to not be crowded out.
This will not help with the kernel warning (it might make triggering that less likely but it will not address the warnings root cause, AFAIKT).

@openwrt-bot
Copy link
Author

ambientsummer:

A problem with PPPD occurs when the system stops responding. On 184-r5993-1cb06d8907 no problems with PPPD.

@openwrt-bot
Copy link
Author

airend:

This looks related to FS#1333 (MTK Ethernet Issues After NAPI polling Changes).

@openwrt-bot
Copy link
Author

ambientsummer:

Ignore request to re-open. It was incorrect.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant