An efficient and lightweight full-text search engine built in Java, designed to index and retrieve documents with high accuracy and speed. It supports multiple file formats and integrates core information retrieval techniques such as TF-IDF scoring, boolean query operations, and inverted indexing for optimized performance.
- Full-text search with relevance ranking
- Query term highlighting and snippet generation
- Configurable stopword filtering
- Stemming and lemmatization for language normalization
- Concurrent indexing for improved scalability
- Java JDK 11+
- Maven 3.6+
- Clone the repository:
git clone https://github.com/yourusername/java-search-engine.git cd java-search-engine - Build the project:
mvn clean install
- Basic Usage:
SearchEngine engine = new SearchEngine(); engine.addDocument("doc1", "path/to/document1.txt"); engine.addDocument("doc2", "path/to/document2.txt"); SearchResults results = engine.search("your query here");