Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with
or
.
Download ZIP

Loading…

how to make data of e-book #20

Open
sumomo2010 opened this Issue · 6 comments

2 participants

@sumomo2010

I dump wikibooks xml data and make data for wikireader.
but in searching phrase, all data looked.
when searching enguten data, only book title was looked.
why?

i

@wikireader
Owner

The present index files are only generated from the article title.
So only the title and any redirects will show up on the search screen.
Does that answer your question?

@sumomo2010

Thank you for the answer.
Then, is the index files to be made for all articles?
In enguten, for example "heidi", Book is divided some files. heidi,heid-0,and heid-1,etc. , and there seems to be only an index of heidi.
What can I do how to make such index files?

@wikireader
Owner

For the Wikipedia files there are two kinds of items:
1. an article - the title is used as the index entry
and the text is assigned an article number
2. a redirect the text is like: #REDIRECT[[another article]]
the title is used as the index entry and associated with the
article number of "another article"

The Gutenberg files were produced by Tom Bachmann
as mentioned in http://dev.thewikireader.com/2010/06/26/project-gutenberg/

Perhaps you can look at his script:
http://gitorious.org/wikireader-ness/wikireader-ness/blobs/master/host-tools/offline-renderer/BookIndex.py
to see ho he generated the enguten index

@sumomo2010

Thank you very much for the answer.
I search these things , but I can find it.

@wikireader
Owner

I forgot to mention:
host-tools/offline-renderer/DumpFnd.py
which will display a readable version of an index file

@sumomo2010

Thank you.
I will look it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Something went wrong with that request. Please try again.