Print page name during zimcheck --main #219

Closed
lidel opened this issue Feb 8, 2021 · 3 comments · Fixed by #220
Labels: enhancement, good first issue, IPFS (Necessary for proper IPFS version of Wikipedia), question, zimdump

Comments

lidel (Contributor) commented Feb 8, 2021

Would it be possible to print the name of the "main page" of the ZIM during zimcheck --main?

For example, wikipedia_tr_all_maxi_2021-01.zim has its main page set to Kullanıcı:The_other_Kiwix_guy/Landing, so the output could include it like this:

$ zimcheck --main ./wikipedia_tr_all_maxi_2021-01.zim
  [INFO] Checking zim file ./wikipedia_tr_all_maxi_2021-01.zim
  [INFO] Searching for main page...
+ [INFO] Main page: Kullanıcı:The_other_Kiwix_guy/Landing
  [INFO] Overall Test Status: Pass
  [INFO] Total time taken by zimcheck: 0 seconds.

It would enable ipfs/distributed-wikipedia-mirror to automatically set the correct landing page.
It is not a hard blocker, but right now we need to eyeball it for every language by opening a ZIM file in some Kiwix reader.

cc @kelson42

kelson42 (Contributor) commented Feb 9, 2021

@lidel zimdump is the proper tool for that:

$ zimdump info /tmp/pokepedia_fr_all_maxi_2019-05.zim
count-entries: 142378
uuid: 3b22e149-0d9a-4f0e-ba5a-420aa4650552
cluster count: 3677
checksum: ba148c887e7c47090ec16a2bcd138d0a
main page: 22308
favicon: 5

Is the index useful, or do you need the URL?
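
For scripting, the `key: value` lines shown above could be parsed with a few lines of Python. This is only a sketch: it assumes the output keeps the plain-text format of the sample in this comment, and `parse_zimdump_info` is a hypothetical helper name, not part of zimdump. With this sample the "main page" value is the entry index (22308), not the article name.

```python
def parse_zimdump_info(text: str) -> dict:
    """Parse 'key: value' lines (format assumed from the sample above) into a dict."""
    info = {}
    for line in text.splitlines():
        # Split on the first colon only, so a value that itself contains
        # a colon (e.g. a namespaced page name) is kept whole.
        key, sep, value = line.partition(":")
        if sep:  # ignore lines that have no colon at all
            info[key.strip()] = value.strip()
    return info


# Sample copied from the `zimdump info` output quoted in this thread.
SAMPLE = """\
count-entries: 142378
uuid: 3b22e149-0d9a-4f0e-ba5a-420aa4650552
cluster count: 3677
checksum: ba148c887e7c47090ec16a2bcd138d0a
main page: 22308
favicon: 5
"""

info = parse_zimdump_info(SAMPLE)
print(info["main page"])  # prints 22308
```

In a real pipeline the text would presumably come from running `zimdump info <file>` and capturing its standard output rather than from a hard-coded string.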

kelson42 added the zimdump and IPFS labels Feb 9, 2021

lidel (Contributor, Author) commented Feb 9, 2021

@kelson42 I need the URL (article name). Can I get that from the index somehow?

kelson42 (Contributor) commented

@lidel I believe zimdump should just print the main page URL in place of the index.

2 participants