New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Import: Cat_links fails for "small wikis" (one-or-few databases) (Can't import current en wiktionary dump) #543
Comments
Thanks for the log excerpt. I was able to reproduce this on my side. Let me investigate further. Keep in mind that en.wiktionary.org still works: it's just the category system will be missing. Thanks |
Something similar happens when I try to import wiki commons: 20190812_180426.977 gui.splash.bgn Impressive job I have to say if you can get it working. You got much farther at an offline wikipedia than I ever did with my thousands of lines of sed and bash scripts |
Thanks. I was able to figure out a dirty fix for this. I'll try to clean it up by Wednesday or the weekend. Will check back here then. Thanks! |
If it helps, here's a thread dump: Full thread dump OpenJDK 64-Bit Server VM (25.212-b04 mixed mode): "Attach Listener" #32 daemon prio=9 os_prio=0 tid=0x00007f7088001000 nid=0x76e9 runnable [0x0000000000000000] java.lang.Thread.State: RUNNABLE "JDWP Transport Listener: dt_socket" #31 daemon prio=10 os_prio=0 tid=0x00007f708c005000 nid=0x62db runnable [0x0000000000000000] java.lang.Thread.State: RUNNABLE "wiki.import" #22 prio=5 os_prio=0 tid=0x00007f7129fa3000 nid=0x1f18 waiting on condition [0x00007f70df0ac000] java.lang.Thread.State: TIMED_WAITING (sleeping) at java.lang.Thread.sleep(Native Method) at gplx.core.threads.Thread_adp_.Sleep(Thread_adp_.java:20) at gplx.xowa.bldrs.installs.Xoi_cmd_mgr.Process_async(Xoi_cmd_mgr.java:38) at gplx.xowa.bldrs.installs.Xoi_cmd_mgr.Invk(Xoi_cmd_mgr.java:137) at gplx.Gfo_invk_.Invk_by_msg(Gfo_invk_.java:34) at gplx.core.threads.Thread_adp.run(Thread_adp.java:33) at java.lang.Thread.run(Thread.java:748) "AWT-XAWT" #14 daemon prio=6 os_prio=0 tid=0x00007f7128e6a800 nid=0x1e9b runnable [0x00007f70df3ad000] java.lang.Thread.State: RUNNABLE at sun.awt.X11.XToolkit.waitForEvents(Native Method) at sun.awt.X11.XToolkit.run(XToolkit.java:580) at sun.awt.X11.XToolkit.run(XToolkit.java:544) at java.lang.Thread.run(Thread.java:748) "Java2D Disposer" #12 daemon prio=10 os_prio=0 tid=0x00007f7128e53000 nid=0x1e9a in Object.wait() [0x00007f70df4ae000] java.lang.Thread.State: WAITING (on object monitor) at java.lang.Object.wait(Native Method) - waiting on <0x00000001dc08fef8> (a java.lang.ref.ReferenceQueue$Lock) at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:144) - locked <0x00000001dc08fef8> (a java.lang.ref.ReferenceQueue$Lock) at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:165) at sun.java2d.Disposer.run(Disposer.java:148) at java.lang.Thread.run(Thread.java:748) "Service Thread" #11 daemon prio=9 os_prio=0 tid=0x00007f712827a800 nid=0x1e97 runnable [0x0000000000000000] java.lang.Thread.State: RUNNABLE "C1 CompilerThread3" #10 daemon prio=9 os_prio=0 tid=0x00007f7128277800 nid=0x1e96 waiting on condition [0x0000000000000000] java.lang.Thread.State: RUNNABLE "C2 CompilerThread2" #9 daemon prio=9 os_prio=0 tid=0x00007f7128275800 nid=0x1e95 waiting on condition [0x0000000000000000] java.lang.Thread.State: RUNNABLE "C2 CompilerThread1" #8 daemon prio=9 os_prio=0 tid=0x00007f7128273800 nid=0x1e94 waiting on condition [0x0000000000000000] java.lang.Thread.State: RUNNABLE "C2 CompilerThread0" #7 daemon prio=9 os_prio=0 tid=0x00007f7128270800 nid=0x1e93 waiting on condition [0x0000000000000000] java.lang.Thread.State: RUNNABLE "JDWP Event Helper Thread" #6 daemon prio=10 os_prio=0 tid=0x00007f712826e800 nid=0x1e92 runnable [0x0000000000000000] java.lang.Thread.State: RUNNABLE "Signal Dispatcher" #4 daemon prio=9 os_prio=0 tid=0x00007f7128252000 nid=0x1e90 waiting on condition [0x0000000000000000] java.lang.Thread.State: RUNNABLE "Finalizer" #3 daemon prio=8 os_prio=0 tid=0x00007f7128229000 nid=0x1e8f in Object.wait() [0x00007f70e7010000] java.lang.Thread.State: WAITING (on object monitor) at java.lang.Object.wait(Native Method) at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:144) - locked <0x00000001db20e1c0> (a java.lang.ref.ReferenceQueue$Lock) at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:165) at java.lang.ref.Finalizer$FinalizerThread.run(Finalizer.java:216) "Reference Handler" #2 daemon prio=10 os_prio=0 tid=0x00007f7128224800 nid=0x1e8e in Object.wait() [0x00007f70e7111000] java.lang.Thread.State: WAITING (on object monitor) at java.lang.Object.wait(Native Method) at java.lang.Object.wait(Object.java:502) at java.lang.ref.Reference.tryHandlePending(Reference.java:191) - locked <0x00000001db215768> (a java.lang.ref.Reference$Lock) at java.lang.ref.Reference$ReferenceHandler.run(Reference.java:153) "main" #1 prio=5 os_prio=0 tid=0x00007f712800b800 nid=0x1e80 runnable [0x00007f71361d9000] java.lang.Thread.State: RUNNABLE at org.eclipse.swt.internal.gtk.OS.Call(Native Method) at org.eclipse.swt.widgets.Display.sleep(Unknown Source) at gplx.gfui.kits.core.Swt_kit.Kit_run(Swt_kit.java:104) at gplx.xowa.guis.Xoa_gui_mgr.Run(Xoa_gui_mgr.java:139) at gplx.xowa.apps.boots.Xoa_boot_mgr.Run_app(Xoa_boot_mgr.java:137) at gplx.xowa.apps.boots.Xoa_boot_mgr.Run(Xoa_boot_mgr.java:28) at gplx.xowa.Xoa_app_.Run(Xoa_app_.java:24) at gplx.xowa.Xowa_main.main(Xowa_main.java:19) "VM Thread" os_prio=0 tid=0x00007f712821a800 nid=0x1e8d runnable "GC task thread#0 (ParallelGC)" os_prio=0 tid=0x00007f7128021000 nid=0x1e81 runnable "GC task thread#1 (ParallelGC)" os_prio=0 tid=0x00007f7128023000 nid=0x1e82 runnable "GC task thread#2 (ParallelGC)" os_prio=0 tid=0x00007f7128024800 nid=0x1e83 runnable "GC task thread#3 (ParallelGC)" os_prio=0 tid=0x00007f7128026800 nid=0x1e86 runnable "GC task thread#4 (ParallelGC)" os_prio=0 tid=0x00007f7128028000 nid=0x1e87 runnable "GC task thread#5 (ParallelGC)" os_prio=0 tid=0x00007f712802a000 nid=0x1e88 runnable "GC task thread#6 (ParallelGC)" os_prio=0 tid=0x00007f712802b800 nid=0x1e89 runnable "GC task thread#7 (ParallelGC)" os_prio=0 tid=0x00007f712802d800 nid=0x1e8a runnable "GC task thread#8 (ParallelGC)" os_prio=0 tid=0x00007f712802f000 nid=0x1e8b runnable "GC task thread#9 (ParallelGC)" os_prio=0 tid=0x00007f7128031000 nid=0x1e8c runnable "VM Periodic Task Thread" os_prio=0 tid=0x00007f712827d000 nid=0x1e98 waiting on condition JNI global references: 5464 Heap PSYoungGen total 87040K, used 22376K [0x00000005c9100000, 0x00000005cf400000, 0x00000007c0000000) eden space 86528K, 25% used [0x00000005c9100000,0x00000005ca6da178,0x00000005ce580000) from space 512K, 0% used [0x00000005cf380000,0x00000005cf380000,0x00000005cf400000) to space 512K, 0% used [0x00000005cf300000,0x00000005cf300000,0x00000005cf380000) ParOldGen total 666112K, used 32343K [0x00000001db200000, 0x0000000203c80000, 0x00000005c9100000) object space 666112K, 4% used [0x00000001db200000,0x00000001dd195e78,0x0000000203c80000) Metaspace used 33727K, capacity 33989K, committed 34432K, reserved 1081344K class space used 3315K, capacity 3388K, committed 3456K, reserved 1048576K |
Thanks for the thread dump. I fixed it with the commit above. I'm planning to do a build around 8-25. If you want it earlier, let me know which OS/bitness (EX: Linux 64-bit), and I'll upload a version to my google drive. Also, I like to give credit to users for finding bugs. Right now, there's a line in the Change Log like this: Thanks! |
Importing enwiktionary-latest-pages-articles.xml.bz2 fails:
The text was updated successfully, but these errors were encountered: