Search

A powerful and flexible text search library for JavaScript that enables you to build a simple text search engine. It provides a set of classes to tokenize, parse, and interpret queries using a binary AST (Abstract Syntax Tree). The library supports various grouping operators (and/or/&/|) and any degree of parenthesis nesting.

Features

Tokenization of search queries
Parsing to Abstract Syntax Trees (AST)
Interpretation to evaluate search queries against text
Normalization of text and query strings
Abstract factory for easy extension

Installation

Install the package with:

npm install @basd/search

Usage

First, import the Search library.

import Search from '@basd/search'

or

const Search = require('@basd/search')

Quick Start

Here's how to create a simple search evaluator and use it.

const Search = require('@basd/search')

const search = new Search()
const evaluator = search.evaluator('apple AND orange')

const result = evaluator('I have an apple and an orange.')
// Returns true

Here's a basic example of how you can use @basd/search to perform a text search:

const { Tokenizer, Parser, Interpreter } = require('@basd/search')

const query = 'apple AND orange OR pear'
const tokenizer = new Tokenizer()
const tokens = tokenizer.tokenize(query)

const parser = new Parser(tokens)
const ast = parser.parse()

const interpreter = new Interpreter(ast)
const result = interpreter.interpret('apple orange') // true

Documentation

API Reference

API Reference

Classes

`SearchFactory`

Factory class to produce instances of Tokenizer, Parser, and Interpreter.

const factory = new SearchFactory(registry)

Methods

createTokenizer(...args): Creates a SearchTokenizer instance.
createParser(...args): Creates a SearchParser instance.
createInterpreter(...args): Creates a SearchInterpreter instance.

`SearchNormalizer`

Normalizes text to be used in tokenization and interpretation.

const normalizedText = SearchNormalizer.normalize('some text')

`SearchTokenizer`

Tokenizes the normalized query.

const tokenizer = new SearchTokenizer()
const tokens = tokenizer.tokenize('apple AND orange')

`SearchParser`

Parses the tokens into an AST.

const parser = new SearchParser(tokens)
const ast = parser.parse()

`SearchInterpreter`

Interprets the AST against a given text.

const interpreter = new SearchInterpreter(ast)
const result = interpreter.interpret('I have an apple.')

`Search`

The main class that combines all the functionalities.

const search = new Search()

Methods

evaluator(needle): Returns an evaluator function for a given search query.
evaluate(needle, haystack): Evaluates a search query against a given text.

Extending the Library

The library is designed to be easily extendable. You can extend SearchTokenizer, SearchParser, and SearchInterpreter to add additional functionalities.

Classes

`TextNormalizer`

Normalizes text by removing punctuations, converting to uppercase, and replacing multiple spaces with a single space.

`Tokenizer`

Tokenizes a query into distinct elements such as words, operators, and parentheses.

`Parser`

Takes the tokens and turns them into a binary AST.

`Interpreter`

Takes the AST and matches a given text string against it.

API Reference

`Tokenizer.tokenize(query: string): Token[]`

Takes a query string and returns an array of tokens.

`Parser.parse(): ASTNode`

Takes an array of tokens and returns a binary AST.

`Interpreter.interpret(data: string): boolean`

Takes a string of text and returns a boolean indicating whether it matches the AST.

Tests

In order to run the test suite, simply clone the repository and install its dependencies:

git clone https://gitlab.com/frenware/framework/plaindb/search.git
cd search
npm install

To run the tests:

npm test

Contributing

Thank you! Please see our contributing guidelines for details.

Donations

If you find this project useful and want to help support further development, please send us some coin. We greatly appreciate any and all contributions. Thank you!

Bitcoin (BTC):

1JUb1yNFH6wjGekRUW6Dfgyg4J4h6wKKdF

Monero (XMR):

46uV2fMZT3EWkBrGUgszJCcbqFqEvqrB4bZBJwsbx7yA8e2WBakXzJSUK8aqT4GoqERzbg4oKT2SiPeCgjzVH6VpSQ5y7KQ

License

@basd/search is MIT licensed.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
docs		docs
lib		lib
test		test
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
LICENSE		LICENSE
README.md		README.md
package.json		package.json
types.d.ts		types.d.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Search

Features

Installation

Usage

Quick Start

Documentation

API Reference

Classes

`SearchFactory`

Methods

`SearchNormalizer`

`SearchTokenizer`

`SearchParser`

`SearchInterpreter`

`Search`

Methods

Extending the Library

Classes

`TextNormalizer`

`Tokenizer`

`Parser`

`Interpreter`

API Reference

`Tokenizer.tokenize(query: string): Token[]`

`Parser.parse(): ASTNode`

`Interpreter.interpret(data: string): boolean`

Tests

Contributing

Donations

License

About

Releases

Packages

Languages

License

basedwon/search

Folders and files

Latest commit

History

Repository files navigation

Search

Features

Installation

Usage

Quick Start

Documentation

API Reference

Classes

SearchFactory

Methods

SearchNormalizer

SearchTokenizer

SearchParser

SearchInterpreter

Search

Methods

Extending the Library

Classes

TextNormalizer

Tokenizer

Parser

Interpreter

API Reference

Tokenizer.tokenize(query: string): Token[]

Parser.parse(): ASTNode

Interpreter.interpret(data: string): boolean

Tests

Contributing

Donations

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

`SearchFactory`

`SearchNormalizer`

`SearchTokenizer`

`SearchParser`

`SearchInterpreter`

`Search`

`TextNormalizer`

`Tokenizer`

`Parser`

`Interpreter`

`Tokenizer.tokenize(query: string): Token[]`

`Parser.parse(): ASTNode`

`Interpreter.interpret(data: string): boolean`

Packages