Skip to content

jkytoela/PDFSearch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

The project is essentially a POC and really incomplete. The main purpose is to learn more about docker-compose, MinIO and ElasticSearch.

You can upload PDF files to the application. Their basic information is stored in a database and their text content is indexed. Once uploaded, the files can be searched by their text content.

Technologies used:

  • Docker
  • TypeScript
  • Express.js
  • React
  • MinIO (S3 compatible object storage)
  • Apache Tika
  • ElasticSearch
  • PostgreSQL
  • Redis
  • nginx

TODO:

  • The front-end search functionality
  • TBD...

About

Extract contents from PDF files with Apache Tika, index them with ElasticSearch and allow searching files by their content

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published