PDF Index

I want an index of my pdf files stored in Zotero. Other repositories that may be of interest include a server that responds to /query?q=search terms and a client that debounces requests to that server and displays results on the fly.

See this blog post for more information about the project: https://jcuenod.github.io/bibletech/2021/07/26/full-text-search-for-pdfs/

Usage

Build the Index

Be sure to set the root variable in config.json to the root folder under which pdfindex should look for pdfs.

node create-index.js

Query the Index

SELECT
    name,
    page,
    snippet(pages, 2, '<b>', '</b>', '', 15)
FROM
    pages,
    metadata
WHERE
    content MATCH 'NEAR("sabbath" "fire")'
AND
    pages.id = metadata.rowid;

The format for snippet is:

snippet(pages, 2, '<b>', '</b>', '', 15)

pages: Name of the table
2: Index of the column (zero based)
<b>: String to inject before a matching token
</b>: String to inject after a matching token
'': String to inject at beginning and end of string if there is not a matching token there
15: Number of tokens to return (i.e. length of snippet)

Output Example

Weinfeld_1981_Sabbath, temple, and the enthronement of the Lord.pdf|3.0|shall not burn <b>fire</b> in any of your dwellings on the <b>Sabbath</b>״ (Ex. 35:5

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
README.md		README.md
config.json		config.json
create-index.js		create-index.js
get-pdf-text.js		get-pdf-text.js
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitignore

.gitignore

README.md

README.md

config.json

config.json

create-index.js

create-index.js

get-pdf-text.js

get-pdf-text.js

package-lock.json

package-lock.json

package.json

package.json

Repository files navigation

PDF Index

Usage

Output Example

About

Releases

Packages

Languages

jcuenod/pdfindex-create-index

Folders and files

Latest commit

History

Repository files navigation

PDF Index

Usage

Output Example

About

Resources

Stars

Watchers

Forks

Languages