FS#804 - mt7621: kernel errors - rcu_sched detected stalls on CPUs/tasks #5932

Closed
openwrt-bot opened this issue May 25, 2017 · 71 comments

@openwrt-bot openwrt-bot commented May 25, 2017

camel:

Current trunk.
Hardware: zbt3526 (mt7621).
It produces these kernel errors more and more often
(two months ago they were much rarer).
Could this be related to the newer kernel on trunk?

Thu May 25 18:20:04 2017 user.notice root: Subject: [router.xxx.com] KERNEL error/warnings issue - 2017-05-25:18:20:01
Thu May 25 18:20:39 2017 kern.err kernel: [ 4797.640000] INFO: rcu_sched detected stalls on CPUs/tasks:
Thu May 25 18:20:39 2017 kern.err kernel: [ 4797.640000] 2-...: (0 ticks this GP) idle=dc4/0/0 softirq=370963/370963 fqs=0
Thu May 25 18:20:39 2017 kern.err kernel: [ 4797.650000] (detected by 1, t=6003 jiffies, g=119392, c=119391, q=19565)
Thu May 25 18:20:39 2017 kern.info kernel: [ 4797.650000] Task dump for CPU 2:
Thu May 25 18:20:39 2017 kern.info kernel: [ 4797.660000] swapper/2 R running 0 0 1 0x00100000
Thu May 25 18:20:39 2017 kern.warn kernel: [ 4797.660000] Stack : 00000000 00003a99 00000000 77de22c0 00000000 00000000 804df2a4 80490000
Thu May 25 18:20:39 2017 kern.warn kernel: [ 4797.660000] 8048c75c 00000001 00000000 8048c5e0 8048c724 80490000 00000000 800135e4
Thu May 25 18:20:39 2017 kern.warn kernel: [ 4797.660000] 00000000 814a37e0 8fc72000 8fc73ec0 80490000 8005ec74 1100fc03 00000002
Thu May 25 18:20:39 2017 kern.warn kernel: [ 4797.660000] 00000000 80490000 804df2a4 8005ec6 80490000 8001b1a8 1100fc03 00000000
Thu May 25 18:20:39 2017 kern.warn kernel: [ 4797.660000] 00000004 8048c4a0 000000a0 8001b1b0 e8c7e2d3 3a8bf07f 2cfde824 eeff5ebf
Thu May 25 18:20:39 2017 kern.warn kernel: [ 4797.660000] ...
Thu May 25 18:20:39 2017 kern.warn kernel: [ 4797.700000] Call Trace:
Thu May 25 18:20:39 2017 kern.warn kernel: [ 4797.700000] [<8000be98>] __schedule+0x574/0x758
Thu May 25 18:20:39 2017 kern.warn kernel: [ 4797.710000] [<800135e4>] r4k_wait_irqoff+0x0/0x20
Thu May 25 18:20:39 2017 kern.warn kernel: [ 4797.710000]
Thu May 25 18:20:39 2017 kern.err kernel: [ 4797.710000] rcu_sched kthread starved for 6009 jiffies! g119392 c119391 f0x0 s3 ->state=0x1

@openwrt-bot openwrt-bot commented Jun 26, 2017

RedDwarf:

It seems it has been happening for a while -> http://lists.infradead.org/pipermail/lede-dev/2017-February/006325.html

I don't fully understand it, but I think it's related to these messages: "cron.err crond[756]: time disparity of 1096 minutes detected". So it has real consequences: at the very least, it can make crond sleep() for 18 hours without running any job.

@openwrt-bot openwrt-bot commented Jun 26, 2017

RedDwarf:

I have found this on a SK-WB8 (MT7621 too) with a slightly modified (not the kernel) 17.01.2.

@openwrt-bot openwrt-bot commented Jun 29, 2017

john:

I pushed a patch to my staging tree that might fix this issue:
https://git.lede-project.org/?p=lede/blogic/staging.git;a=shortlog;h=refs/heads/lede-17.01

@openwrt-bot openwrt-bot commented Jun 30, 2017

camel:

Let me know when the patch is in LEDE trunk for the "ZBT-WG3526" image, and I will test it.
cu Erwin

@openwrt-bot openwrt-bot commented Jul 12, 2017

bjonglez:

You need to test the commit from john's staging tree before it gets merged into trunk.

By the way, ramips has been switched to linux 4.9, so it can also be worth testing the latest trunk image to see if it changes anything.

@openwrt-bot openwrt-bot commented Jul 12, 2017

camel:

Well, if it were in trunk, we could test it, e.g. on the WG-3526, which I can offer for stress testing :)
cu Erwin

@openwrt-bot openwrt-bot commented Jul 13, 2017

lister-wrt:

I'm having the same issue on Ubiquiti ERX. Can test patches but I don't know how to build it.

@openwrt-bot openwrt-bot commented Jul 13, 2017

bjonglez:

I don't really like this, but here are all mt7621 images with lede-17.01 + john's patch (r3464+1-82b20d74cb): https://pub.polyno.me/lede-ramips-FS804/

Please only use these images for testing!

@openwrt-bot openwrt-bot commented Jul 15, 2017

lister-wrt:

Thanks Baptiste, I'll try it out. I have a USB-TTL in case it goes horribly wrong.

The only way I know of reliably reproducing this issue is with SQM (errors start after ~5m after install) and it's not in your build. Kernel 4.9.37 merged just after you built this so I won't be able to use the LEDE packages.

Is there another way to test?

@openwrt-bot openwrt-bot commented Jul 16, 2017

dchard:

Kernel 4.9.37 is also affected. I am testing John's patched build, and so far I was not able to reproduce this bug with hours of torturing the CPU. Previously it took only 5-10 minutes, so this is good news so far.

@openwrt-bot openwrt-bot commented Jul 19, 2017

Mushoz:

This seems to be a duplicate or related to the following issue: https://bugs.lede-project.org/index.php?do=details&task_id=764

Unfortunately, during traffic shaping the Dir-860l still crashes with that patch applied. So it does not seem to be a complete fix. It does look like it takes longer for it to manifest, so I believe we're getting closer to the solution for our issues :)

@openwrt-bot openwrt-bot commented Jul 19, 2017

camel:

@Baptiste Jonglez
Is it possible to commit your patch to trunk?
I want to test it in a real environment, but for that I also need to install many extra kernel packages.
cu Erwin

@openwrt-bot openwrt-bot commented Jul 22, 2017

camel:

Hm, I tested current trunk (without the traffic shaping packages installed) and got the same result:
Sat Jul 22 13:55:48 2017 kern.err kernel: [ 6282.000000] INFO: rcu_sched detected stalls on CPUs/tasks:
Sat Jul 22 13:55:48 2017 kern.err kernel: [ 6282.000000] 2-...: (0 ticks this GP) idle=550/0/0 softirq=447446/447446 fqs=1
Sat Jul 22 13:55:48 2017 kern.err kernel: [ 6282.010000] (detected by 3, t=6003 jiffies, g=152339, c=152338, q=631052)
Sat Jul 22 13:55:48 2017 kern.info kernel: [ 6282.010000] Task dump for CPU 2:
Sat Jul 22 13:55:48 2017 kern.info kernel: [ 6282.020000] swapper/2 R running task 0 0 1 0x00100000
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.020000] Stack : 00000068 00000001 00000001 001bb9fe 00000000 778892c0 00000000 00000000
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.030000] 804e148c 00000004 8047f934 804f0000 00000001 00000001 bfbf8000 00000000
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.040000] 00000000 8000c1cc 00000000 00000000 8fc74000 8fc75f00 00000000 80062f44
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.050000] 1100fc03 00000003 00000001 804e148c 804e148c 80062f3c 00000005 00000000
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.060000] 00000000 00000001 00000005 00000000 00000000 00000001 00000001 803ec0b4
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.070000] ...
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.070000] Call Trace:
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.070000] [<803e8408>] __schedule+0x5d4/0x7a4
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.080000] [<8000c1cc>] r4k_wait_irqoff+0x0/0x20
Sat Jul 22 13:55:48 2017 kern.err kernel: [ 6282.080000] rcu_sched kthread starved for 3035 jiffies! g152339 c152338 f0x0 RCU_GP_WAIT_FQS(3) ->state=0x1
Sat Jul 22 13:55:48 2017 kern.info kernel: [ 6282.090000] rcu_sched S 0 7 2 0x00100000
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.100000] Stack : 814a9bc0 8fc52340 8fc6be50 814a8320 8fc6be50 804e0000 00000001 8007b548
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.100000] 814a8320 000914e6 814a8320 8fc6be50 804e0000 00000000 804f0000 804e0000
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.110000] 804e96b0 803e8644 000914e6 804e0000 8fc6be50 814a8320 000914e6 803eafa8
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.120000] 8fc6a000 8fc6be60 804e9580 804e96b0 814aa240 00000001 8e97f008 8e979408
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.130000] 000914e6 8007ba44 8fc52340 44800002 804e9580 00000001 804e0000 00000000
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.140000] ...
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.140000] Call Trace:
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.140000] [<803e8408>] __schedule+0x5d4/0x7a4
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.150000] [<803e8644>] schedule+0x6c/0x84
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.150000] [<803eafa8>] schedule_timeout+0x160/0x19c
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.160000] [<80078f80>] rcu_gp_kthread+0x7f4/0x7fc
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.160000] [<80044c68>] kthread+0xd8/0xec
Sat Jul 22 13:55:48 2017 kern.warn kernel: [ 6282.170000] [<8000a318>] ret_from_kernel_thread+0x14/0x1c

This happens about 30 times per day.
device: MT7621 - zbt3526
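
To put a number on that frequency, the stall messages can be counted per day from the system log. This is only a minimal sketch: `stalls_per_day` is a hypothetical helper name, and it assumes the timestamp layout seen in the log above (weekday, month, day, time, year).

```shell
#!/bin/sh
# Hypothetical helper: count "rcu_sched detected stalls" lines per day.
# Reads log lines on stdin and assumes the timestamp layout shown above,
# e.g. "Sat Jul 22 13:55:48 2017 kern.err kernel: ...".
stalls_per_day() {
    awk '/rcu_sched detected stalls/ {
             # fields 1-3 are weekday, month, day; field 5 is the year
             count[$1 " " $2 " " $3 " " $5]++
         }
         END { for (day in count) print day ": " count[day] }' | sort
}

# On the router, logread prints the system log, so a typical call would be:
#   logread | stalls_per_day
```

Note that the log buffer read by logread is limited in size, so on a busy router older days may already have been dropped.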

@openwrt-bot openwrt-bot commented Jul 22, 2017

john:

Could you check whether this still happens if you remove:
target/linux/ramips/patches-4.9/0005-MIPS-use-set_mode-to-enable-disable-the-cevt-r4k-irq.patch
target/linux/ramips/patches-4.9/0006-MIPS-ralink-add-cpu-frequency-scaling.patch

@openwrt-bot openwrt-bot commented Jul 22, 2017

camel:

Sorry, can't build own image.
If available on trunk to DL, I will test

@openwrt-bot openwrt-bot commented Jul 23, 2017

pparent76:

@john

I have the same problem.

I will try to compile without those patches tomorrow. For me the bug is not as easily reproducible as camel says; at least without traffic shaping and without the mt7603e driver, it's rare.

Also, if you want a ZBT-WG3526 so you can test it yourself, I can send you one for free.

@openwrt-bot openwrt-bot commented Jul 23, 2017

camel:

For me it is clearly related to 2.4 GHz WLAN:
currently I'm copying a lot of data over 2.4 GHz, and I get rcu_sched kernel errors every 5 minutes.

@openwrt-bot openwrt-bot commented Jul 24, 2017

pparent76:

@Camel:

No, it's probably not, because the problem also happens without the 2.4 GHz driver running.

That said, the 2.4 GHz driver can make the problem happen more often, and the mt76 driver itself has some specific issues independent of this problem.

Traffic shaping makes it happen more often even with the 2.4 GHz driver disabled.

@openwrt-bot openwrt-bot commented Jul 24, 2017

pparent76:

I compiled a version without these patches:
target/linux/ramips/patches-4.9/0005-MIPS-use-set_mode-to-enable-disable-the-cevt-r4k-irq.patch
target/linux/ramips/patches-4.9/0006-MIPS-ralink-add-cpu-frequency-scaling.patch

You can download it here:

https://www.own-mailbox.com/lede/lede-ramips-mt7621-zbt-wg3526-16M-squashfs-sysupgrade.bin

Can you please test? I will test on my side when I have time.

Edit: I have updated the firmware on my server with a few more packages, including kmod-mt76 and kmod-sched, at 8:40 GMT. md5sum: 9ac127a3d0bf49a8d452e51b2ff9b741

@openwrt-bot openwrt-bot commented Jul 24, 2017

camel:

I can't test, as I would need many other packages.
I can only test the trunk version, on which I can install my other mandatory packages.

@openwrt-bot openwrt-bot commented Jul 24, 2017

pparent76:

I have updated the firmware on my server with a few more packages, including kmod-mt76 and kmod-sched, at 8:40 GMT. md5sum: 9ac127a3d0bf49a8d452e51b2ff9b741

What packages would you need that you cannot install with opkg? (Only kernel related packages cannot be installed with opkg)

@openwrt-bot openwrt-bot commented Jul 24, 2017

camel:

It's too many, and I don't know exactly which packages would then be needed to match the kernel build ...

But if you're interested, this is the list of packages I install:

#disk & SD related stuff
opkg install block-mount # --force-reinstall
opkg install kmod-scsi-core kmod-usb-storage #--force-reinstall
opkg install kmod-fs-ext4 kmod-fs-vfat
opkg install kmod-nls-utf8 kmod-nls-cp437 kmod-nls-iso8859-1 #--force-reinstall
opkg install kmod-fs-nfs nfs-utils #--force-reinstall
opkg install kmod-fs-ext4 kmod-fs-vfat kmod-nls-utf8 kmod-nls-base kmod-nls-cp437 kmod-nls-iso8859-1 cfdisk e2fsprogs #--force-reinstall
opkg install kmod-fs-f2fs libf2fs f2fs-tools f2fsck mkf2fs
opkg install fdisk #--force-reinstall
opkg install rsync

#WEBIFACE:
opkg install luci #--force-reinstall
opkg install luci-ssl-openssl

#luci extra stuff:
opkg install luci-app-commands #--force-reinstall
opkg install luci-app-ddns #--force-reinstall
opkg install luci-app-diag-core #--force-reinstall
opkg install luci-app-firewall #--force-reinstall
opkg install luci-app-ntpc #--force-reinstall
opkg install luci-app-uhttpd #--force-reinstall

#3g/4g + luci (https://wiki.openwrt.org/doc/recipes/3gdongle)
opkg install chat #--force-reinstall
opkg install comgt #--force-reinstall
opkg install luci-proto-3g #--force-reinstall
opkg install kmod-usb-serial #--force-reinstall
opkg install kmod-usb-serial-option #--force-reinstall
opkg install kmod-usb-serial-wwan #--force-reinstall #(mostly added now in current release or trunk)

#3ginfo package
opkg install http://dl.eko.one.pl/chaos_calmer/ar71xx/packages/3ginfo-text_20170524_all.ipk
#opkg install http://dl.eko.one.pl/chaos_calmer/ar71xx/packages/3ginfo__20170524_all.ipk --force-checksum --force-reinstall

#pptp + pptpd =>
#=======
opkg install kmod-crypto-manager kmod-crypto-pcompress kmod-crypto-ecb
opkg install kmod-crypto-null kmod-crypto-aead kmod-iptunnel kmod-gre resolveip kmod-pptp kmod-crypto-sha1
opkg install ppp-mod-pptp kmod-nf-nathelper-extra
#is for routing vpn client to vpn client and lan to vpn-client
opkg install kmod-ipt-nat-extra
opkg install luci-proto-ppp

#VPN PPTP server:
opkg install pptpd kmod-mppe

#OPENVPN: (ca. 1MB space needed)
opkg install openvpn-mbedtls luci-app-openvpn
opkg install nano libustream-openssl ca-bundle ca-certificates

#logging + rotate

opkg install logrotate --force-reinstall

#monitoring ....
opkg install luci-app-statistics --force-reinstall
opkg install collectd-mod-conntrack --force-reinstall
opkg install collectd-mod-cpu --force-reinstall
opkg install collectd-mod-df --force-reinstall
opkg install collectd-mod-disk --force-reinstall
opkg install collectd-mod-exec --force-reinstall
opkg install collectd-mod-interface --force-reinstall
opkg install collectd-mod-iptables --force-reinstall
opkg install collectd-mod-iwinfo --force-reinstall
opkg install collectd-mod-load --force-reinstall
opkg install collectd-mod-memory --force-reinstall
opkg install collectd-mod-ping --force-reinstall
opkg install collectd-mod-processes --force-reinstall
opkg install collectd-mod-rrdtool --force-reinstall
opkg install collectd-mod-uptime --force-reinstall

#monitoring data traffic (optional) ~
opkg install luci-app-vnstat --force-reinstall

#ftp server:
opkg install --force-maintainer --force-reinstall --force-depends --force-overwrite vsftpd

#traffic shaping:
opkg install luci-app-sqm sqm-scripts kmod-sched-cake --force-reinstall

#loadbalancing:
opkg install luci-app-mwan3 --force-reinstall

#HIGH AVAILIBILITY
opkg install keepalived conntrackd --force-reinstall

#email + ssl (~1MB) - also see: https://wiki.openwrt.org/doc/howto/smtp.client
opkg install libopenssl ssmtp --force-reinstall

#smstools + web (~2,3MB needed)
opkg install smstools3 --force-reinstall

#to USB ...
opkg --add-dest opt:/opt install php7-cgi php7-mod-session php7-mod-json php7-mod-curl --force-reinstall
ln -s /opt/usr/bin/php-cgi /usr/bin/php-cgi
ln -s /opt/usr/lib/libstdc++.so.6 /usr/lib/libstdc++.so.6

#optional file systems tools
opkg install lsof fuse-utils --force-reinstall

#extra apps
opkg install libpcap --force-reinstall
opkg install terminfo --force-reinstall
opkg install rsync --force-reinstall
opkg install sudo --force-reinstall
opkg install shadow-useradd shadow-su --force-reinstall
opkg install coreutils-date --force-reinstall
opkg install glib2 --force-reinstall #libgcc
opkg install mc --force-reinstall

#debugging GSM modem:
opkg --add-dest opt:/opt install picocom --force-reinstall

#others

opkg --add-dest opt:/opt install screen --force-reinstall
opkg --add-dest opt:/opt install htop --force-reinstall
opkg --add-dest opt:/opt install procps-ng-watch --force-reinstall
opkg --add-dest opt:/opt install nmap --force-reinstall
opkg --add-dest opt:/opt install tcpdump-mini --force-reinstall
opkg --add-dest opt:/opt install sysstat --force-reinstall
opkg --add-dest opt:/opt install confuse --force-reinstall

#for speedtest: (in 3g package given or also on USB stick - speedtest dir)
opkg --add-dest opt:/opt install curl --force-reinstall
opkg --add-dest opt:/opt install bc --force-reinstall
opkg --add-dest opt:/opt install python # python-base --force-reinstall
opkg --add-dest opt:/opt install --force-reinstall --force-depends python python-base python-codecs python-compiler python-ctypes python-db python-decimal python-distutils python-email python-gdbm python-light python-logging python-multiprocessing python-ncurses python-openssl python-pydoc python-sqlite3 python-unittest python-xml

#console tools - measurement tools
opkg --add-dest opt:/opt install iperf --force-reinstall
opkg --add-dest opt:/opt install bmon libnl-route --force-reinstall
opkg --add-dest opt:/opt install iftop --force-reinstall
opkg --add-dest opt:/opt install iptraf-ng --force-reinstall

@openwrt-bot openwrt-bot commented Jul 24, 2017

camel:

Well, if wanted, I can try to install it and let you know which kernel-related packages I am missing.

Which link to your build can I use?

I tried it, but too much is missing to test it for longer:

no luci packages, no modem driver, etc.

I need to wait until it is in trunk.

I guess it cannot get any worse than it currently is in trunk,
and I hope that your patches will be committed soon. Thanks.
Then I will test ASAP.

@openwrt-bot openwrt-bot commented Jul 24, 2017

azuwis:

It's possible to force building the same kernel version as the upstream snapshot (https://downloads.lede-project.org/snapshots/targets/ramips/mt7621/packages/kernel_4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4_mipsel_24kc.ipk), e.g.:

$ make clean    
$ make LINUX_VERMAGIC=7f0de30d5b73958cb146494d8e5b2ef4

As long as you use the same code base and the same kernel config, kmods from the upstream snapshot should work fine.
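
The hash passed to LINUX_VERMAGIC is the last part of the version in the snapshot's kernel package file name. As a rough sketch (the helper name `vermagic_from_ipk` is made up for illustration, and it assumes the file name keeps the `kernel_<version>-<32 hex digits>_<arch>.ipk` shape seen in the link above), it could be extracted like this:

```shell
#!/bin/sh
# Hypothetical helper: print the 32-hex-digit hash embedded in a kernel
# package file name or URL, e.g.
# kernel_4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4_mipsel_24kc.ipk
vermagic_from_ipk() {
    basename "$1" |
        sed -n 's/^kernel_[^_]*-\([0-9a-f]\{32\}\)_.*\.ipk$/\1/p'
}

# Example call with the package name from the snapshot link:
vermagic_from_ipk kernel_4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4_mipsel_24kc.ipk
```

The printed value can then be used as in the make command above, for example `make LINUX_VERMAGIC="$(vermagic_from_ipk "$url")"`, where `$url` is whatever kernel package link the current snapshot offers.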

@openwrt-bot openwrt-bot commented Jul 24, 2017

pparent76:

I've started testing with qos enabled but mt76 (2.4ghz driver) disabled, with my image, for now it seems that I don't see any RCU_sched warning anymore but I would need confirmation since for me it was always very random, and not easily reproducible.

I guess if there is no one to test, I don't know whether anything will get into trunk soon, especially since this hack is about removing patches that impact all MIPS images. But john should know better than I do.

Here is the latest image:
https://www.own-mailbox.com/lede/lede-ramips-mt7621-zbt-wg3526-16M-squashfs-sysupgrade.bin

The image builder:
https://www.own-mailbox.com/lede/lede-imagebuilder-ramips-mt7621.Linux-x86_64.tar.xz

The Sdk:
https://www.own-mailbox.com/lede/lede-sdk-ramips-mt7621_gcc-5.4.0_musl.Linux-x86_64.tar.xz

@openwrt-bot openwrt-bot commented Jul 24, 2017

pparent76:

@zhong Jianxin: will it work even with a modified kernel (since we change patches used in upstream)?

@openwrt-bot openwrt-bot commented Jul 24, 2017

camel:

@pierre:
Could you please build with the same kernel version as the snapshot,
as described above?

It's possible to force building the same kernel version as upstream snapshot, e.g:

$ make clean
$ make LINUX_VERMAGIC=7f0de30d5b73958cb146494d8e5b2ef4
As long as you use the same code base and same kernel config, kmod from upstream snapshot should work fine

@openwrt-bot openwrt-bot commented Jul 24, 2017

pparent76:

I've updated the image+sdk+image-builder on my server compiled with the above command.

The md5sum of the image is 3298cb86e8ff7737fcad8bc4065914ec.

Please test whether you can install your packages with it.
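
Since the image at that URL has been replaced several times, it is worth checking the download against the md5sum given above before flashing. A minimal sketch, assuming `md5sum` is available on the machine doing the download; `verify_md5` is a hypothetical helper name:

```shell
#!/bin/sh
# Hypothetical helper: succeed only if the file's MD5 matches the expected one.
#   $1 = path to the downloaded image, $2 = expected md5sum
verify_md5() {
    # md5sum prints "<hash>  <file>"; keep only the hash
    actual=$(md5sum "$1" | cut -d' ' -f1)
    [ "$actual" = "$2" ]
}

# Typical use with the values from this comment:
#   verify_md5 lede-ramips-mt7621-zbt-wg3526-16M-squashfs-sysupgrade.bin \
#       3298cb86e8ff7737fcad8bc4065914ec && echo "image matches"
```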

@openwrt-bot openwrt-bot commented Jul 24, 2017

camel:

OK, first try.
When I tried to install
opkg install kmod-fs-nfs nfs-utils
it crashed, the image was bricked, and the device went into a boot loop.
I have now set it back to the Chinese firmware and am installing again, this time trying to use only what is really needed (ext4 + f2fs).

The rsync package seems to be completely missing from the trunk snapshot now; maybe the build failed for some new packages
(:( this needs to be reported to the devs anyway).

I will let you know as soon as I get it installed.

There may be some small issues; I'm not sure whether I can mount everything OK:
Mon Jul 24 13:46:48 2017 daemon.notice squid[1531]: Logfile: opening log stdio:/var/logs/netdb.state
Mon Jul 24 13:46:48 2017 daemon.notice squid[1531]: ERROR: logfileOpen stdio:/var/logs/netdb.state: (2) No such file or directory
Mon Jul 24 13:46:48 2017 daemon.notice squid[1531]: logfileOpen: stdio:/var/logs/netdb.state: couldn't open!
Mon Jul 24 13:46:48 2017 daemon.notice squid[1531]: netdbSaveState: stdio:/var/logs/netdb.state: (2) No such file or directory
Mon Jul 24 13:46:57 2017 daemon.info dnsmasq[2111]: read /etc/hosts - 4 addresses
Mon Jul 24 13:46:57 2017 daemon.info dnsmasq[2111]: read /tmp/hosts/dhcp.cfg02411c - 2 addresses
Mon Jul 24 13:46:57 2017 daemon.info dnsmasq-dhcp[2111]: read /etc/ethers - 0 addresses
Mon Jul 24 13:49:15 2017 kern.warn kernel: [ 267.690000] f2fs: Unknown symbol posix_acl_chmod (err 0)

@openwrt-bot openwrt-bot commented Jul 24, 2017

camel:

Hmm :(
f2fs can't be used, and I need it because all the log-analysis scripts are written to it :(
Can you try to include it in your image?
On mounting, it gives me this error:
Mon Jul 24 13:55:03 2017 kern.warn kernel: [ 615.690000] f2fs: Unknown symbol posix_acl_chmod (err 0)
Mon Jul 24 13:55:03 2017 daemon.err modprobe: 1 module could not be probed
Mon Jul 24 13:55:03 2017 daemon.err modprobe: - f2fs
Mon Jul 24 13:55:04 2017 kern.warn kernel: [ 615.790000] f2fs: Unknown symbol posix_acl_chmod (err 0)
Mon Jul 24 13:55:04 2017 daemon.err modprobe: 1 module could not be probed
Mon Jul 24 13:55:04 2017 daemon.err modprobe: - f2fs
Mon Jul 24 13:55:04 2017 kern.warn kernel: [ 615.910000] f2fs: Unknown symbol posix_acl_chmod (err 0)
Mon Jul 24 13:55:04 2017 daemon.err modprobe: 1 module could not be probed
Mon Jul 24 13:55:04 2017 daemon.err modprobe: - f2fs
Mon Jul 24 13:55:04 2017 kern.warn kernel: [ 616.010000] f2fs: Unknown symbol posix_acl_chmod (err 0)
Mon Jul 24 13:55:04 2017 daemon.err modprobe: 1 module could not be probed
Mon Jul 24 13:55:04 2017 daemon.err modprobe: - f2fs

@openwrt-bot openwrt-bot commented Jul 25, 2017

azuwis:

@Zhong Jianxin: will it work even with a modified kernel (since we change patches used in upstream)?

It depends on the modification, in this case, it should work.

The reason there are so many unknown symbol errors is that some kernel configs are only enabled when certain packages are selected, e.g. selecting kmod-fs-f2fs will enable these kernel configs (https://git.lede-project.org/?p=source.git;a=blob;f=package/kernel/linux/modules/fs.mk;h=610d1b58a1ae453409eea2704c5037074f86c4ae;hb=HEAD#l207):

CONFIG_F2FS_FS
CONFIG_F2FS_STAT_FS=y
CONFIG_F2FS_FS_XATTR=y
CONFIG_F2FS_FS_POSIX_ACL=n
CONFIG_F2FS_FS_SECURITY=n
CONFIG_F2FS_CHECK_FS=n

The upstream snapshot build selects many packages, but that is probably not the case for a custom build.
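
To see whether a custom build has a given symbol enabled, the kernel .config of that build can be checked directly. A minimal sketch: `kconfig_enabled` is a hypothetical helper name, and the path to the kernel .config inside the build tree is not stated in this thread, so it is passed in explicitly:

```shell
#!/bin/sh
# Hypothetical helper: succeed if a kernel config symbol is built in (=y)
# or built as a module (=m) in the given .config file.
#   $1 = path to the kernel .config, $2 = symbol, e.g. CONFIG_F2FS_FS
kconfig_enabled() {
    # Disabled symbols appear as "# CONFIG_FOO is not set" and do not match.
    grep -Eq "^$2=(y|m)\$" "$1"
}

# Typical use (the .config path is a placeholder):
#   kconfig_enabled path/to/kernel/.config CONFIG_F2FS_FS || echo "f2fs not built"
```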

Here is another way to build as close as possible to the upstream snapshot:

$ wget https://downloads.lede-project.org/snapshots/targets/ramips/mt7621/config.seed -O .config
$ make defconfig
$ make dirclean
$ make

But it will take much longer.

Just tested this: it built the same kernel version as the current upstream snapshot, without overriding LINUX_VERMAGIC.

@openwrt-bot openwrt-bot commented Jul 25, 2017

camel:

@pierre: is it possible to prepare a new build with the current kernel magic ID?

@openwrt-bot openwrt-bot commented Jul 25, 2017

pparent76:

I will not use the magic number technique again because:

1- It's useless, since modules that were not included in my build, and therefore not built, will not work, as @zhong Jianxin said. Those that are built I can include directly in the image.

2- It can lead to wrong diagnoses and wrong conclusions in our testing, since, as we saw, we can get kernel errors from incompatibilities between the kernel I built and the modules in upstream packages.

3- I will compile an image with traffic shaping in one hour, so that you can test with traffic shaping.

@openwrt-bot openwrt-bot commented Jul 25, 2017

camel:

Hmm, I'm not sure whether you can include the pptpd + pppd packages, too.
That would be mandatory for me.
I will try to use it for testing anyway.

@openwrt-bot openwrt-bot commented Jul 25, 2017

pparent76:

@Camel can you please test with QOS/SQM enabled on the version I just sent to my server?

As that would be mandatory for me.
Anyhow, I will try if I can use it for testing anyhow

Anyway, you should not use my images for anything other than testing.

@openwrt-bot openwrt-bot commented Jul 25, 2017

camel:

sure, i will try ...
i will try to install now

@openwrt-bot openwrt-bot commented Jul 25, 2017

camel:

Please give me the download link,
because the other link shows me a version identical to yesterday's.
I downloaded: https://www.own-mailbox.com/lede/lede-ramips-mt7621-zbt-wg3526-16M-squashfs-sysupgrade.bin
and it is the same as yesterday's.
Download link, please?

@openwrt-bot openwrt-bot commented Jul 25, 2017

pparent76:

Hmm, no, it should not be: it corresponds to the image I compiled today, with luci-app-sqm and sqm-scripts.

Md5sum: 71121b4a6a30abd6627595d01bd0374c

@openwrt-bot openwrt-bot commented Jul 25, 2017

camel:

Sorry, my mistake.
But you forgot to set the kernel magic,

so I can't install the pptp client + server packages:

root@LEDE:~# #pptp + pptpd =>
root@LEDE:~# #=======
root@LEDE:~# opkg install kmod-crypto-manager kmod-crypto-pcompress kmod-crypto-ecb

opkg install ppp-mod-pptp kmod-nf-nathelper-extra
#is for routing vpn client to vpn client and lan to vpn-client
opkg install kmod-ipt-nat-extra

#notfalls via:

opkg install ppp-mod-pptp kmod-nf-nathelper-extra #--force-depends --force-reinstall

#If LuCI support is desired, additionally install the protocol package:
opkg install luci-proto-ppp

#VPN PPTP server:
opkg install pptpd kmod-mppe
Package kmod-crypto-manager (4.9.37-1) installed in root is up to date.
Package kmod-crypto-pcompress (4.9.37-1) installed in root is up to date.
Package kmod-crypto-ecb (4.9.37-1) installed in root is up to date.
root@LEDE:~# opkg install kmod-crypto-null kmod-crypto-aead kmod-iptunnel kmod-gre resolveip kmod-pptp kmod-crypto-sha1
Package kmod-crypto-null (4.9.37-1) installed in root is up to date.
Package kmod-crypto-aead (4.9.37-1) installed in root is up to date.
Installing kmod-iptunnel (4.9.37-1) to root...
Downloading http://downloads.lede-project.org/snapshots/targets/ramips/mt7621/packages/kmod-iptunnel_4.9.37-1_mipsel_24kc.ipk
Installing kmod-gre (4.9.37-1) to root...
Downloading http://downloads.lede-project.org/snapshots/targets/ramips/mt7621/packages/kmod-gre_4.9.37-1_mipsel_24kc.ipk
Installing resolveip (2) to root...
Downloading http://downloads.lede-project.org/snapshots/packages/mipsel_24kc/base/resolveip_2_mipsel_24kc.ipk
Installing kmod-pptp (4.9.37-1) to root...
Downloading http://downloads.lede-project.org/snapshots/targets/ramips/mt7621/packages/kmod-pptp_4.9.37-1_mipsel_24kc.ipk
Installing kmod-crypto-sha1 (4.9.37-1) to root...
Downloading http://downloads.lede-project.org/snapshots/targets/ramips/mt7621/packages/kmod-crypto-sha1_4.9.37-1_mipsel_24kc.ipk
Configuring resolveip.
Collected errors:

  • satisfy_dependencies_for: Cannot satisfy the following dependencies for kmod-iptunnel:
  •  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *
    
  • opkg_install_cmd: Cannot install package kmod-iptunnel.
  • satisfy_dependencies_for: Cannot satisfy the following dependencies for kmod-gre:
  •  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *
    
  • opkg_install_cmd: Cannot install package kmod-gre.
  • satisfy_dependencies_for: Cannot satisfy the following dependencies for kmod-pptp:
  •  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *
    
  • opkg_install_cmd: Cannot install package kmod-pptp.
  • satisfy_dependencies_for: Cannot satisfy the following dependencies for kmod-crypto-sha1:
  •  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *
    
  • opkg_install_cmd: Cannot install package kmod-crypto-sha1.
    root@LEDE:#
    root@LEDE:
    # opkg install ppp-mod-pptp kmod-nf-nathelper-extra
    Installing ppp-mod-pptp (2.4.7-12) to root...
    Downloading http://downloads.lede-project.org/snapshots/packages/mipsel_24kc/base/ppp-mod-pptp_2.4.7-12_mipsel_24kc.ipk
    Installing kmod-nf-nathelper-extra (4.9.37-1) to root...
    Downloading http://downloads.lede-project.org/snapshots/targets/ramips/mt7621/packages/kmod-nf-nathelper-extra_4.9.37-1_mipsel_24kc.ipk
    Collected errors:
  • satisfy_dependencies_for: Cannot satisfy the following dependencies for ppp-mod-pptp:
  •  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *
    
  • opkg_install_cmd: Cannot install package ppp-mod-pptp.
  • satisfy_dependencies_for: Cannot satisfy the following dependencies for kmod-nf-nathelper-extra:
  •  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *
    
  • opkg_install_cmd: Cannot install package kmod-nf-nathelper-extra.
    root@LEDE:# #is for routing vpn client to vpn client and lan to vpn-client
    root@LEDE:
    # opkg install kmod-ipt-nat-extra
    Package kmod-ipt-nat-extra (4.9.37-1) installed in root is up to date.
    root@LEDE:#
    root@LEDE:
    # #notfalls via:
    root@LEDE:# # opkg install ppp-mod-pptp kmod-nf-nathelper-extra #--force-depends --force-reinstall
    root@LEDE:
    # #If LuCI support is desired, additionally install the protocol package:
    root@LEDE:# opkg install luci-proto-ppp
    Upgrading luci-proto-ppp on root from git-17.205.18138-17de308-1 to git-17.205.60871-6c8a9f1-1...
    Downloading http://downloads.lede-project.org/snapshots/packages/mipsel_24kc/luci/luci-proto-ppp_git-17.205.60871-6c8a9f1-1_all.ipk
    Configuring luci-proto-ppp.
    root@LEDE:
    #
    root@LEDE:# #VPN PPTP server:
    root@LEDE:
    # opkg install pptpd kmod-mppe
    Installing pptpd (1.4.0-3) to root...
    Downloading http://downloads.lede-project.org/snapshots/packages/mipsel_24kc/packages/pptpd_1.4.0-3_mipsel_24kc.ipk
    Installing kmod-mppe (4.9.37-1) to root...
    Downloading http://downloads.lede-project.org/snapshots/targets/ramips/mt7621/packages/kmod-mppe_4.9.37-1_mipsel_24kc.ipk
    Configuring kmod-mppe.
    1 module could not be probed
  • ppp_mppe
    Collected errors:
     * satisfy_dependencies_for: Cannot satisfy the following dependencies for pptpd:
     *  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *  kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4) *
     * opkg_install_cmd: Cannot install package pptpd.
    root@LEDE:~#

@openwrt-bot openwrt-bot commented Jul 25, 2017

pparent76:

but you forgot to set kernel magic ....

I did not forget; I purposely did not do it, for the reasons mentioned above.

@openwrt-bot openwrt-bot commented Jul 25, 2017

camel:

Hmm, but I really need the PPTP packages, otherwise I can't test this in more detail,
because I have to connect via PPTP there.
Can you add them?
Or can you make one build that is compiled with the kernel magic option set?

  • If possible, please leave out the SSL package for LuCI, as I need to use a different one; otherwise I can't install the OpenVPN OpenSSL package.

Or, if the kernel magic is set to match the LEDE trunk snapshot, I can remove it and install the one I actually need, like this:
#WEBIFACE:
opkg install luci #--force-reinstall
#opkg install luci-ssl #--force-reinstall
opkg install luci-ssl-openssl

#OPENVPN: (ca. 1MB space needed)
opkg install openvpn-openssl luci-app-openvpn
#opkg install openvpn-mbedtls luci-app-openvpn
opkg install nano libustream-openssl ca-bundle ca-certificates

@openwrt-bot openwrt-bot commented Jul 25, 2017

camel:

Meanwhile I tested it. Hmm, it seems I'm running into a memory issue.
I'm not sure what is included in the build, but I guess it is not related to the rcu_sched errors.

Maybe it was caused by a "speedtest.py" script that I run every 5 minutes to collect statistics about the router's internet speed.
I have also seen that you have squid installed, and so on.

Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.520000] luci invoked oom-killer: gfp_mask=0x27000c0(GFP_KERNEL_ACCOUNT|__GFP_NOTRACK), nodemask=0, order=1, oom_score_adj=0
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.530000] COMPACTION is disabled!!!
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.540000] CPU: 2 PID: 32685 Comm: luci Not tainted 4.9.37 #0
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.540000] Stack : 00000000 00000000 80537b2a 00000032 803f4084 00000000 00000000 80530000
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.550000] 81fa462c 804d7da7 8046dff0 00000002 00007fad 80533824 00000001 00200000
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.560000] 00001321 80069890 00000000 800696b0 00000000 00000004 80472c00 82745c3c
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.570000] 00000000 800a5d98 00000000 00000000 80537b2a 00000000 82745d28 00745c3c
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.580000] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.580000] ...
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.590000] Call Trace:
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.590000] [<8000f644>] show_stack+0x54/0x88
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.590000] [<801e5924>] dump_stack+0x84/0xc0
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.600000] [<800ea424>] dump_header.isra.4+0x84/0x1b4
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.600000] [<800ac264>] oom_kill_process+0xd0/0x484
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.610000] [<800acb40>] out_of_memory+0x3bc/0x3fc
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.610000] [<800b01c4>] __alloc_pages_nodemask+0x5e4/0xa58
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.620000] [<800281b0>] copy_process.isra.8.part.9+0x10c/0x1300
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.620000] [<80029520>] _do_fork+0xcc/0x2d8
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.630000] [<800297dc>] SyS_clone+0x20/0x2c
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.630000] [<80016558>] syscall_common+0x34/0x58
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.640000] Mem-Info:
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.640000] active_anon:13977 inactive_anon:1643 isolated_anon:0
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.640000] active_file:805 inactive_file:2995 isolated_file:0
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.640000] unevictable:1 dirty:2 writeback:0 unstable:0
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.640000] slab_reclaimable:3926 slab_unreclaimable:35491
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.640000] mapped:3433 shmem:8024 pagetables:227 bounce:0
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.640000] free:49832 free_pcp:28 free_cma:0
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.670000] Node 0 active_anon:55908kB inactive_anon:6572kB active_file:3220kB inactive_file:11980kB unevictable:4kB isolated(anon):0kB isolated(file):0kB mapped:13732kB dirty:8kB writeback:0kB shmem:32096kB writeback_tmp:0kB unstable:0kB pages_scanned:0 all_unreclaimable? no
Tue Jul 25 12:50:04 2017 kern.warn kernel: [ 1065.700000] Normal free:20008kB min:16384kB low:20480kB high:24576kB active_anon:0kB inactive_anon:0kB active_file:4kB inactive_file:84kB unevictable:0kB writepending:8kB present:262144kB managed:251688kB mlocked:0kB slab_reclaimable:15704kB slab_unreclaimable:141964kB kernel_stack:56192kB pagetables:908kB bounce:0kB free_pcp:232kB local_pcp:0kB free_cma:0kB
Tue Jul 25 12:50:05 2017 kern.emerg kernel: lowmem_reserve[]: 0 2048 2048
Tue Jul 25 12:50:05 2017 kern.warn kernel: [ 1065.730000] HighMem free:179216kB min:256kB low:4520kB high:8784kB active_anon:55908kB inactive_anon:6572kB active_file:3148kB inactive_file:11888kB unevictable:4kB writepending:0kB present:262144kB managed:262144kB mlocked:4kB slab_reclaimable:0kB slab_unreclaimable:0kB kernel_stack:0kB pagetables:0kB bounce:0kB free_pcp:52kB local_pcp:0kB free_cma:0kB
Tue Jul 25 12:50:05 2017 kern.emerg kernel: lowmem_reserve[]: 0 0 0
Tue Jul 25 12:50:05 2017 kern.warn kernel: [ 1065.770000] Normal: 23*4kB (UMH) 34*8kB (UMEH) 19*16kB (UMH) 25*32kB (UMEH) 16*64kB (UME) 9*128kB (UMEH) 4*256kB (UME) 6*512kB (UM) 6*1024kB (UMEH) 3*2048kB (UME) 0*4096kB = 20028kB
Tue Jul 25 12:50:05 2017 kern.emerg kernel: HighMem: 881*4kB (UM) 957*8kB (UM) 543*16kB (UM) 360*32kB (UM) 133*64kB (UM) 18*128kB (M) 8*256kB (UM) 3*512kB (M) 4*1024kB (M) 1*2048kB (U) 31*4096kB (M) = 178908kB
Tue Jul 25 12:50:05 2017 kern.emerg kernel: 11849 total pagecache pages
Tue Jul 25 12:50:05 2017 kern.warn kernel: [ 1065.800000] 0 pages in swap cache
Tue Jul 25 12:50:05 2017 kern.warn kernel: [ 1065.810000] Swap cache stats: add 0, delete 0, find 0/0
Tue Jul 25 12:50:05 2017 kern.warn kernel: [ 1065.810000] Free swap = 0kB
Tue Jul 25 12:50:05 2017 kern.warn kernel: [ 1065.810000] Total swap = 0kB
Tue Jul 25 12:50:05 2017 kern.warn kernel: [ 1065.820000] 131072 pages RAM
Tue Jul 25 12:50:05 2017 kern.warn kernel: [ 1065.820000] 65536 pages HighMem/MovableOnly
Tue Jul 25 12:50:05 2017 kern.warn kernel: [ 1065.820000] 2614 pages reserved
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.830000] [ pid ] uid tgid total_vm rss nr_ptes nr_pmds swapents oom_score_adj name
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.840000] [ 731] 0 731 298 149 3 0 0 0 ubusd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.840000] [ 738] 0 738 225 123 3 0 0 0 askfirst
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.850000] [ 1317] 0 1317 823 694 5 0 0 0 logd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.860000] [ 1318] 0 1318 336 198 4 0 0 0 logread
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.870000] [ 1330] 0 1330 381 213 4 0 0 0 rpcd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.880000] [ 1499] 0 1499 435 257 4 0 0 0 netifd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.890000] [ 1548] 0 1548 356 183 4 0 0 0 odhcpd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.900000] [ 1581] 0 1581 300 202 3 0 0 0 crond
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.910000] [ 1603] 0 1603 267 153 4 0 0 0 dropbear
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.910000] [ 1695] 0 1695 299 153 3 0 0 0 udhcpc
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.920000] [ 1703] 0 1703 256 139 3 0 0 0 odhcp6c
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.930000] [ 1762] 65536 1762 963 497 4 0 0 0 lighttpd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.940000] [ 1964] 0 1964 633 413 4 0 0 0 uhttpd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.950000] [ 1996] 0 1996 242 10 3 0 0 0 vsftpd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.960000] [ 2029] 0 2029 751 180 4 0 0 0 keepalived
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.970000] [ 2045] 0 2045 759 37 5 0 0 0 keepalived
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.980000] [ 2237] 0 2237 280 139 3 0 0 0 pppd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.990000] [ 2286] 453 2286 269 203 3 0 0 0 dnsmasq
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1065.990000] [ 2323] 0 2323 420 26 4 0 0 0 hostapd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.000000] [ 2325] 0 2325 420 26 3 0 0 0 hostapd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.010000] [ 2447] 0 2447 1462 564 5 0 0 0 collectd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.020000] [ 2488] 0 2488 228 9 3 0 0 0 ntpclient
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.030000] [ 2535] 65534 2535 300 220 4 0 0 0 collectd_ifacei
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.040000] [ 2541] 65534 2541 301 236 3 0 0 0 collectd_gsminf
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.050000] [ 2542] 65534 2542 300 235 3 0 0 0 collectd_mwan-s
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.060000] [ 2543] 65534 2543 300 236 4 0 0 0 collectd_speedt
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.070000] [ 2635] 0 2635 3576 2803 8 0 0 0 squid
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.080000] [ 3059] 0 3059 317 253 3 0 0 0 mwan3track
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.090000] [ 3945] 0 3945 333 227 4 0 0 0 luci-bwc
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.090000] [ 4547] 0 4547 284 9 5 0 0 0 dropbear
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.100000] [ 4683] 0 4683 300 235 4 0 0 0 ash
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.110000] [10425] 0 10425 306 216 3 0 0 0 smsd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.120000] [10429] 0 10429 315 64 4 0 0 0 smsd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.130000] [10631] 0 10631 244 167 3 0 0 0 vnstatd
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.140000] [12948] 0 12948 284 9 5 0 0 0 dropbear
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.150000] [12962] 0 12962 300 236 4 0 0 0 ash
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.150000] [18822] 0 18822 842 733 5 0 0 0 openvpn
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.160000] [19980] 0 19980 284 9 5 0 0 0 dropbear
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.170000] [20365] 0 20365 300 154 5 0 0 0 ash
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.180000] [11049] 65534 11049 300 10 4 0 0 0 collectd_speedt
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.190000] [11050] 0 11050 5266 4606 9 0 0 0 python
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.200000] [12184] 0 12184 300 139 4 0 0 0 sleep
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.210000] [12380] 65534 12380 301 10 3 0 0 0 collectd_gsminf
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.220000] [12381] 0 12381 303 154 4 0 0 0 3gsignal
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.230000] [12382] 65534 12382 299 138 3 0 0 0 sed
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.240000] [12430] 0 12430 243 145 4 0 0 0 gcom
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.240000] [16962] 65534 16962 299 137 3 0 0 0 sleep
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.250000] [19601] 65534 19601 299 154 3 0 0 0 sleep
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.260000] [24961] 0 24961 336 199 3 0 0 0 logread
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.270000] [25709] 0 25709 310 154 4 0 0 0 hotplug-call
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.280000] [25717] 0 25717 337 229 4 0 0 0 hotplug-call
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.290000] [25747] 0 25747 278 144 4 0 0 0 ipset
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.300000] [26138] 0 26138 842 728 4 0 0 0 openvpn
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.310000] [32683] 0 32683 667 419 3 0 0 0 luci
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.310000] [32684] 0 32684 667 416 4 0 0 0 luci
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.320000] [32685] 0 32685 667 418 6 0 0 0 luci
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.330000] [32687] 0 32687 257 176 4 0 0 0 modprobe
Tue Jul 25 12:50:05 2017 kern.info kernel: [ 1066.340000] [32740] 0 32740 299 138 3 0 0 0 sh
Tue Jul 25 12:50:05 2017 kern.err kernel: [ 1066.350000] Out of memory: Kill process 11050 (python) score 34 or sacrifice child
Tue Jul 25 12:50:05 2017 kern.err kernel: [ 1066.360000] Killed process 11050 (python) total-vm:21064kB, anon-rss:14176kB, file-rss:4244kB, shmem-rss:4kB

For traffic shaping I would need PPTP in any case, as the VPNs are shaped.
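
For reading the Mem-Info block above: the bare counters (active_anon, slab_unreclaimable, free and so on) are page counts, while the per-zone lines are given in kB. A minimal sketch of the conversion, assuming a 4 kB page size (the log does not state it, but active_anon:13977 against active_anon:55908kB in the same dump is consistent with it); `pages_to_kb` is a hypothetical helper name:

```shell
#!/bin/sh
# Assumed page size in kB; not stated in the log, but
# 13977 pages * 4 kB = 55908 kB matches the "Node 0" line.
PAGE_KB=4

# Convert a page count from the Mem-Info block to kB.
pages_to_kb() {
    echo $(( $1 * PAGE_KB ))
}

pages_to_kb 35491    # slab_unreclaimable
pages_to_kb 131072   # "pages RAM"
```

Under that assumption, slab_unreclaimable comes to 141964 kB, which matches the Normal zone line, and "131072 pages RAM" corresponds to 512 MB.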

@openwrt-bot openwrt-bot commented Jul 25, 2017

camel:

Meanwhile I tested with traffic shaping and produced a lot of traffic on:
.) LAN => ~10 GB transferred
.) WLAN 5 GHz => ~15 GB transferred
.) WLAN 2.4 GHz => ~5 GB transferred

and got a few errors:

Tue Jul 25 13:13:39 2017 kern.err kernel: [ 2479.840000] INFO: rcu_sched detected stalls on CPUs/tasks:
Tue Jul 25 13:13:39 2017 kern.err kernel: [ 2479.840000] 3-...: (0 ticks this GP) idle=c95/140000000000000/0 softirq=302510/302510 fqs=1
Tue Jul 25 13:13:39 2017 kern.err kernel: [ 2479.850000] (detected by 2, t=6003 jiffies, g=66650, c=66649, q=150961)
Tue Jul 25 13:13:39 2017 kern.info kernel: [ 2479.860000] Task dump for CPU 3:
Tue Jul 25 13:13:39 2017 kern.info kernel: [ 2479.860000] luci R running task 0 21284 1964 0x08100000
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.870000] Stack : 00000000 8baf8c38 80528b80 8c0a9a98 8c0a9a98 8fc02380 8d0fe180 000000bb
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.870000] 8fc02680 8ecc0080 8ecc2180 800e5fd0 8ecc2180 800e79c0 00000001 811dc800
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.880000] 8c0a9b20 800af53c 00000000 804e0000 8149499c ffffffff 00000001 00008159
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.890000] 8d5676a8 0000005a 00000058 800e55c0 024102c0 802d9e80 00210d00 804d55f4
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.900000] 00000000 800afce0 026012d0 80479f14 811dc800 0000000f 00210d00 804d55f4
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.910000] ...
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.910000] Call Trace:
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.910000] [<803dc338>] __schedule+0x5d4/0x7a4
Tue Jul 25 13:13:39 2017 kern.err kernel: [ 2479.920000] rcu_sched kthread starved for 2050 jiffies! g66650 c66649 f0x0 RCU_GP_WAIT_FQS(3) ->state=0x1
Tue Jul 25 13:13:39 2017 kern.info kernel: [ 2479.930000] rcu_sched S 0 7 2 0x00100000
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.930000] Stack : 814a9bc0 8fc61250 8fc89e50 814b4320 8fc89e50 00000000 00000001 8007b658
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.940000] 814b4320 00034b87 814b4320 8fc89e50 804d0000 00000000 804e0000 804d0000
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.950000] 804dd5f0 803dc574 00034b87 804d0000 8fc89e50 814b4320 00034b87 803deed8
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.960000] 804d55f4 804e0000 00000001 804d0000 814b6240 00000001 00000000 814b4710
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.970000] 00034b87 8007bb54 8fc61250 39800003 804dd4c0 00000001 804d0000 00000000
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.970000] ...
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.980000] Call Trace:
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.980000] [<803dc338>] __schedule+0x5d4/0x7a4
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.980000] [<803dc574>] schedule+0x6c/0x84
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.990000] [<803deed8>] schedule_timeout+0x160/0x19c
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2479.990000] [<80079090>] rcu_gp_kthread+0x7f4/0x7fc
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2480.000000] [<80044be8>] kthread+0xd8/0xec
Tue Jul 25 13:13:39 2017 kern.warn kernel: [ 2480.000000] [<8000a318>] ret_from_kernel_thread+0x14/0x1c

Tue Jul 25 13:17:39 2017 kern.err kernel: [ 2720.280000] INFO: rcu_sched detected stalls on CPUs/tasks:
Tue Jul 25 13:17:39 2017 kern.err kernel: [ 2720.280000] 3-...: (0 ticks this GP) idle=9c0/0/0 softirq=302510/302510 fqs=1
Tue Jul 25 13:17:39 2017 kern.err kernel: [ 2720.290000] (detected by 1, t=6003 jiffies, g=66656, c=66655, q=340412)
Tue Jul 25 13:17:39 2017 kern.info kernel: [ 2720.290000] Task dump for CPU 3:
Tue Jul 25 13:17:39 2017 kern.info kernel: [ 2720.300000] swapper/3 R running task 0 0 1 0x00100000
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.300000] Stack : 00000068 8fc62498 00000001 84ca6177 00000279 00000000 0000022e 009a3220
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.310000] 804d548c 00000008 80472554 804e0000 00000001 00000001 bfbf8000 00000003
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.320000] 00000000 803dc574 00000000 00000000 8fc94000 804d548c 00000008 800c5534
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.330000] 804d548c 00000008 80472554 804e0000 804d548c 8006304c 00000001 00000001
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.340000] 00000001 00000001 00000001 00000001 00000001 00000001 00000001 9fc0046c
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.350000] ...
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.350000] Call Trace:
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.350000] [<803dc338>] __schedule+0x5d4/0x7a4
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.360000] [<803dc574>] schedule+0x6c/0x84
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.360000] [<800c5534>] quiet_vmstat+0x54/0x80
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.370000] [<8006304c>] cpu_startup_entry+0xec/0x150
Tue Jul 25 13:17:39 2017 kern.err kernel: [ 2720.370000] rcu_sched kthread starved for 2496 jiffies! g66656 c66655 f0x0 RCU_GP_WAIT_FQS(3) ->state=0x1
Tue Jul 25 13:17:39 2017 kern.info kernel: [ 2720.380000] rcu_sched S 0 7 2 0x00100000
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.390000] Stack : 814a9bc0 8fc61250 8fc89e50 814b4320 8fc89e50 804d0000 00000001 8007b658
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.390000] 814b4320 0003a7b6 814b4320 8fc89e50 804d0000 00000000 804e0000 804d0000
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.400000] 804dd5f0 803dc574 0003a7b6 804d0000 8fc89e50 814b4320 0003a7b6 803deed8
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.410000] 804d55f4 804e0000 00000001 804d0000 814b6240 00000001 00000000 814b46c8
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.420000] 0003a7b6 8007bb54 8fc61250 35000003 804dd4c0 00000001 804d0000 00000000
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.430000] ...
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.430000] Call Trace:
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.430000] [<803dc338>] __schedule+0x5d4/0x7a4
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.440000] [<803dc574>] schedule+0x6c/0x84
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.440000] [<803deed8>] schedule_timeout+0x160/0x19c
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.450000] [<80079090>] rcu_gp_kthread+0x7f4/0x7fc
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.450000] [<80044be8>] kthread+0xd8/0xec
Tue Jul 25 13:17:39 2017 kern.warn kernel: [ 2720.460000] [<8000a318>] ret_from_kernel_thread+0x14/0x1c

Tue Jul 25 13:19:54 2017 kern.err kernel: [ 2855.490000] INFO: rcu_sched detected stalls on CPUs/tasks:
Tue Jul 25 13:19:54 2017 kern.err kernel: [ 2855.490000] 3-...: (0 ticks this GP) idle=69c/0/0 softirq=302510/302510 fqs=1
Tue Jul 25 13:19:54 2017 kern.err kernel: [ 2855.500000] (detected by 1, t=6003 jiffies, g=66673, c=66672, q=275515)
Tue Jul 25 13:19:54 2017 kern.info kernel: [ 2855.500000] Task dump for CPU 3:
Tue Jul 25 13:19:54 2017 kern.info kernel: [ 2855.510000] swapper/3 R running task 0 0 1 0x00100000
Tue Jul 25 13:19:54 2017 kern.warn kernel: [ 2855.510000] Stack : 00000068 00000001 00000001 001fd953 00000000 77edc2c0 00000000 00000000
Tue Jul 25 13:19:54 2017 kern.warn kernel: [ 2855.520000] 804d548c 00000008 80472554 804e0000 00000001 00000001 bfbf8000 00000003
Tue Jul 25 13:19:54 2017 kern.warn kernel: [ 2855.530000] 00000000 8000c1cc 00000000 00000000 8fc94000 8fc95f00 00000000 80063054
Tue Jul 25 13:19:54 2017 kern.warn kernel: [ 2855.540000] 1100fc03 00000001 00000001 804d548c 804d548c 8006304c 00000001 00000001
Tue Jul 25 13:19:54 2017 kern.warn kernel: [ 2855.550000] 00000001 00000001 00000001 00000001 00000001 00000001 00000001 9fc0046c
Tue Jul 25 13:19:54 2017 kern.warn kernel: [ 2855.560000] ...
Tue Jul 25 13:19:54 2017 kern.warn kernel: [ 2855.560000] Call Trace:
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.560000] [<803dc338>] __schedule+0x5d4/0x7a4
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.570000] [<8000c1cc>] r4k_wait_irqoff+0x0/0x20
Tue Jul 25 13:19:55 2017 kern.err kernel: [ 2855.570000] rcu_sched kthread starved for 3051 jiffies! g66673 c66672 f0x0 RCU_GP_WAIT_FQS(3) ->state=0x1
Tue Jul 25 13:19:55 2017 kern.info kernel: [ 2855.580000] rcu_sched S 0 7 2 0x00100000
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.590000] Stack : 814b5bc0 8fc61250 8fc89e50 814b4320 8fc89e50 804d0000 00000001 8007b658
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.590000] 814b4320 0003da5b 814b4320 8fc89e50 804d0000 00000000 804e0000 804d0000
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.600000] 804dd5f0 803dc574 0003da5b 804d0000 8fc89e50 814b4320 0003da5b 803deed8
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.610000] 804d55f4 804e0000 00000001 804d0000 814b6240 00000001 00000000 814b4870
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.620000] 0003da5b 8007bb54 8fc61250 4f800003 804dd4c0 00000001 804d0000 00000000
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.630000] ...
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.630000] Call Trace:
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.630000] [<803dc338>] __schedule+0x5d4/0x7a4
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.640000] [<803dc574>] schedule+0x6c/0x84
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.640000] [<803deed8>] schedule_timeout+0x160/0x19c
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.650000] [<80079090>] rcu_gp_kthread+0x7f4/0x7fc
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.650000] [<80044be8>] kthread+0xd8/0xec
Tue Jul 25 13:19:55 2017 kern.warn kernel: [ 2855.660000] [<8000a318>] ret_from_kernel_thread+0x14/0x1c

Tue Jul 25 13:20:21 2017 daemon.notice squid[2635]: Logfile: opening log stdio:/var/logs/netdb.state
Tue Jul 25 13:20:21 2017 daemon.notice squid[2635]: ERROR: logfileOpen stdio:/var/logs/netdb.state: (2) No such file or directory
Tue Jul 25 13:20:21 2017 daemon.notice squid[2635]: logfileOpen: stdio:/var/logs/netdb.state: couldn't open!
Tue Jul 25 13:20:21 2017 daemon.notice squid[2635]: netdbSaveState: stdio:/var/logs/netdb.state: (2) No such file or directory

@openwrt-bot openwrt-bot commented Jul 25, 2017

pparent76:

We need a better patch...
Hope John will be able to help us...

@openwrt-bot openwrt-bot commented Jul 25, 2017

john:

I've done nothing all day but play around with this. Unfortunately I am not able to reproduce the issue. I've just sent my latest version of the patch to someone for testing. Let's hope for the best.

@openwrt-bot openwrt-bot commented Jul 25, 2017

pparent76:

@john thanks a lot.

Maybe if you send us the latest version of your patch, we can test it too.

@openwrt-bot openwrt-bot commented Jul 25, 2017

dchard:

@john: I agree with Pierre: if you can send us a build with your (latest) patches inside, we are happy to test. Like we did with previous versions :-)

@openwrt-bot openwrt-bot commented Jul 25, 2017

pparent76:

If you don't have a build, I can do the build as I did for the last hint.

@openwrt-bot openwrt-bot commented Jul 25, 2017

john:

Drop this file into target/linux/ramips/patches-4.9/ on current trunk, ignoring all previous patches. I have had an iperf test running for 2 hours now at near-gigabit speed, using an SQM/cake/piece.of.cake setup to rate-limit at 600 Mbit, and have not seen any oopses.

@openwrt-bot openwrt-bot commented Jul 25, 2017

camel:

Please commit it ASAP.
We will test trunk; it will be an improvement in any case.
Please let me know when the trunk snapshot has been compiled.
Then I will test more.

@openwrt-bot openwrt-bot commented Jul 25, 2017

pparent76:

Thanks,

I will try to compile it tomorrow for all boards and I will try to add pptpd

@openwrt-bot openwrt-bot commented Jul 25, 2017

camel:

Thanks.
Normally, if the build uses the same modules and the same kernel magic as the trunk build, I can install the packages myself.
I don't think that would be a big problem.
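
To illustrate the "kernel magic" being discussed: in the opkg errors earlier in this thread, each kmod depends on `kernel (= 4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4)`, and the trailing hex string has to match the kernel package on the router. A minimal sketch for comparing the two, assuming the hash is always the last dash-separated field and that `opkg list-installed kernel` prints a line of the form `kernel - <version>`; the function names are hypothetical:

```shell
#!/bin/sh
# Print the trailing build hash of a kernel version string,
# e.g. "4.9.37-1-<hash>" -> "<hash>" (assumes the hash is the last field).
kernel_hash() {
    printf '%s\n' "${1##*-}"
}

# Succeed if two kernel version strings carry the same build hash.
same_kernel_build() {
    [ "$(kernel_hash "$1")" = "$(kernel_hash "$2")" ]
}

# On the router (not run here): compare the installed kernel against the
# version a package asks for.
#   installed=$(opkg list-installed kernel | awk '{print $3}')
#   same_kernel_build "$installed" "4.9.37-1-7f0de30d5b73958cb146494d8e5b2ef4" \
#       && echo "kernel build matches the snapshot feed" \
#       || echo "kernel build differs; kmods will be refused"
```

A custom image built without the matching hash fails this check, which is the situation shown in the opkg output above.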

@openwrt-bot openwrt-bot commented Jul 26, 2017

pparent76:

Here are the images:

https://www.own-mailbox.com/lede/

@Camel: to avoid running out of memory, don't download files to /tmp/ during your tests; send them to /dev/null instead.
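
A minimal sketch of that advice, assuming the tests fetch over HTTP with wget (the `-q` and `-O` options exist in both GNU and BusyBox wget). On OpenWrt/LEDE, /tmp is a RAM-backed tmpfs, so a large file saved there counts against free memory. `fetch_discard` is a hypothetical helper name and the URL is a placeholder:

```shell
#!/bin/sh
# Download a URL and throw the payload away, so the transfer exercises the
# network path without filling RAM-backed /tmp.
fetch_discard() {
    wget -q -O /dev/null "$1"
}

# Example (placeholder URL, not run here):
#   fetch_discard "http://192.0.2.1/testfile.bin"
```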

@openwrt-bot openwrt-bot commented Jul 26, 2017

bjonglez:

The fix has been pushed to master.

@openwrt-bot openwrt-bot commented Jul 26, 2017

pparent76:

@baptiste Jonglez ok great!

(Not sure if it is included in the latest build yet though, since it was committed 1 hour ago.)

@openwrt-bot openwrt-bot commented Jul 26, 2017

camel:

As far as I can see, it seems that it is already in the build, too :) ?

https://git.lede-project.org/?p=source.git;a=summary

and the build is from:
Timing:
Start Wed Jul 26 00:13:54 2017
End Wed Jul 26 00:31:31 2017
Elapsed 17 mins, 37 secs

see:
http://phase1.builds.lede-project.org/builders/ramips%2Fmt7621/builds/345

So we have to wait one more day (if the build with the commit is done tonight) to use the trunk snapshot :)

I'm very eager to test and can't wait any longer :)

@Baptiste: Thanks

@openwrt-bot openwrt-bot commented Jul 31, 2017

dchard:

@john:

I have been testing your fixes (in trunk) for 5 days, and so far there has been no crash, warning, or any other indication of the problem. The kernel and system logs are also clean.

I tested as follows:

  1. Run an openssl speed test to load all cores/threads to 100%:

openssl speed md5 sha1 sha256 sha512 des des-ede3 aes-128-cbc aes-192-cbc aes-256-cbc rsa2048 dsa2048 rsa4096 -multi 4

  2. At the same time, start iperf3 as a server on the router and run iperf3 tests over gigabit LAN in parallel with the openssl tests above. This is how I simulated high processor load and network traffic at the same time.

Previously it took only a few minutes to reproduce the errors above; now they seem to be gone completely. I will look into the logs every few days to see if anything happens.
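
A minimal sketch for that periodic log check, assuming the kernel log is available through `logread` (as in the syslog excerpts earlier in this thread) and that the stall message keeps the wording shown there; `count_rcu_stalls` is a hypothetical helper name:

```shell
#!/bin/sh
# Count RCU stall reports in log text read from stdin.
# "|| true" keeps the exit status zero when grep finds no match
# (grep -c still prints 0 in that case).
count_rcu_stalls() {
    grep -c 'rcu_sched detected stalls' || true
}

# On the router (not run here):
#   logread | count_rcu_stalls
```

A count that stays at 0 across several days of load would support the observation that the stalls are gone.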

Thanks for your hard work!

@openwrt-bot openwrt-bot commented Sep 18, 2017

pparent76:

Not sure if it is related, but today I got this with the latest version:

[ 281.257967] BUG: Bad page map in process hotplug-call pte:00ffffff pmd:00000000
[ 281.267040] page:81022fe0 count:0 mapcount:-1 mapping: (null) index:0x0[ 281.271831] device br-lan entered promiscuous mode
[ 281.278495] flags: 0x14(referenced|dirty)
[ 281.282487] page dumped because: bad pte
[ 281.286439] addr:77e78000 vm_flags:00100077 anon_vma:8e8dd9d8 mapping:8f85ca0c index:12
[ 281.294460] file:libgcc_s.so.1 fault:filemap_fault mmap:generic_file_readonly_mmap readpage:squashfs_readpage
[ 281.304379] CPU: 3 PID: 21248 Comm: hotplug-call Not tainted 4.9.49 #0
[ 281.310880] Stack : 00000000 00000000 80557b2a 0000003a 80406a44 00000000 00000000 80550000
[ 281.319232] 8fd988fc 804ebea7 80482a8c 00000003 00005300 80553824 8f85ca0c 804e90c0
[ 281.327580] 804e90c0 8006b878 00000000 8006b698 00000000 00000001 80487674 8e869bd4
[ 281.335926] 00000000 800a9fe0 00000000 00000000 80557b2a 00000000 8f85ca0c 00869bd4
[ 281.344268] 00000000 00000000 00000000 00000000 00000000 00000000 00000000 00000000
[ 281.352612] ...
[ 281.355051] Call Trace:
[ 281.357509] [<8000f7c4>] show_stack+0x54/0x88
[ 281.361864] [<801df11c>] dump_stack+0x8c/0xd0
[ 281.366213] [<800d0430>] print_bad_pte+0x1d8/0x208
[ 281.370983] [<800d2eb0>] unmap_page_range+0x5a4/0x760
[ 281.376013] [<800d3398>] unmap_vmas+0x54/0x74
[ 281.380358] [<800d861c>] exit_mmap+0x6c/0x138
[ 281.384696] [<8002994c>] mmput+0x44/0x120
[ 281.388708] [<800f635c>] flush_old_exec+0x59c/0x740
[ 281.393587] [<8013c360>] load_elf_binary+0x3d4/0x1174
[ 281.398618] [<800f56a0>] search_binary_handler+0xc4/0x25c
[ 281.403991] [<800f6f90>] do_execveat_common+0x490/0x600
[ 281.409302] Disabling lock debugging due to kernel taint
[ 281.414938] BUG: Bad rss-counter state mm:8eaf8700 idx:0 val:-1
[ 281.420833] BUG: Bad rss-counter state mm:8eaf8700 idx:1 val:1

After that the router did not respond, even over UART, until I rebooted it.

@openwrt-bot openwrt-bot commented Sep 18, 2017

bjonglez:

This looks like an entirely different issue, please open a new bug report.
