- Luxembourg
- UTC +01:00
- https://costezki.eu/
- in/costezki
- https://meaningfy.ws/
- https://gravatar.com/costezki
Deduplication
A powerful and modular toolkit for record linkage and duplicate detection in Python
🆔 A Python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
A list of free data matching and record linkage software.
PyTorch library for transforming entities like companies, products, etc. into vectors to support scalable Record Linkage / Entity Resolution using Approximate Nearest Neighbors.
Record Linkage ToolKit (Find and link entities)
ReFinED is an efficient and accurate entity linking (EL) system.
A Python script for generating duplicate data to test the performance of record linkage and master data management systems.
🐍 Python Implementation and Extension of RDF2Vec
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for docum…
OpenRefine reconciliation services for VIAF, ORCID, and Open Library + framework for creating more.
An interpretable machine learning pipeline over knowledge graphs
Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of entities like persons, organizations and places for (semi)aut…
Open Source alternative to Algolia + Pinecone and an Easier-to-Use alternative to ElasticSearch ⚡ 🔍 ✨ Fast, typo tolerant, in-memory fuzzy Search Engine for building delightful search experiences
Entity Disambiguation as text extraction (ACL 2022)
An open-source library that leverages Python’s data science ecosystem to build powerful end-to-end Entity Resolution workflows.





