Distributed Query is the PISTIS platform component that lets users discover datasets by searching the stored data directly.

PISTIS-Platform/distributed-query

Distributed Query

The main purpose of this component is to query unstructured or semi-structured data directly, in order to discover datasets that cannot be retrieved by querying their metadata in the Distributed Data Catalogue.

However, the volume of data stored in the Data Factories rules out exhaustive search. Locality Sensitive Hashing (LSH) techniques are therefore employed to quickly obtain a list of matches.
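
To illustrate the general idea, here is a minimal sketch of one common LSH scheme, MinHash with banding, using only the Python standard library. This README does not state which LSH family the component uses, so the choice of MinHash, the parameter values, and every name below (`LSHIndex`, `minhash`, `shingles`, the dataset IDs) are assumptions made for illustration, not the component's actual API.

```python
import hashlib
from collections import defaultdict

NUM_HASHES = 32  # signature length (illustrative value)
BANDS = 16       # signature is split into 16 bands of 2 rows each


def shingles(text, k=4):
    """Return the set of k-character shingles of whitespace-normalised text."""
    text = " ".join(text.lower().split())
    return {text[i:i + k] for i in range(max(1, len(text) - k + 1))}


def _hash(seed, token):
    """Seeded 64-bit hash; each seed acts as one independent hash function."""
    digest = hashlib.blake2b(
        token.encode("utf-8"), digest_size=8, salt=seed.to_bytes(8, "little")
    ).digest()
    return int.from_bytes(digest, "little")


def minhash(text):
    """MinHash signature: the minimum hash of the shingle set, per seed."""
    tokens = shingles(text)
    return tuple(min(_hash(seed, t) for t in tokens) for seed in range(NUM_HASHES))


class LSHIndex:
    """Hypothetical in-memory index mapping signature bands to dataset IDs."""

    def __init__(self):
        self.buckets = defaultdict(set)

    def _band_keys(self, signature):
        rows = NUM_HASHES // BANDS
        for band in range(BANDS):
            yield band, signature[band * rows:(band + 1) * rows]

    def add(self, dataset_id, text):
        for key in self._band_keys(minhash(text)):
            self.buckets[key].add(dataset_id)

    def query(self, text):
        """Return IDs sharing at least one band with the query (candidates only)."""
        candidates = set()
        for key in self._band_keys(minhash(text)):
            candidates |= self.buckets.get(key, set())
        return candidates


index = LSHIndex()
index.add("dataset-a", "hourly temperature readings from berlin sensors")
index.add("dataset-b", "monthly invoices issued to retail customers")
print(index.query("hourly temperature readings from berlin sensors"))
```

A query touches only the buckets its own bands hash to, so the cost does not grow with a full scan of the stored data. The result is a set of candidates: similar items collide with high probability rather than with certainty, so a real deployment would typically verify candidates with an exact similarity check.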
