jaepil

🎯

Working from home | Focusing

Jaepil Jeong jaepil

🎯

Working from home | Focusing

Working on a multi-purpose database engine that integrates document store, real-time full-text search, dense vector similarity search, and ML pipeline.

93 followers · 143 following

@cognica-io
San Francisco, CA
https://segfaults.co

Achievements

x3 x2

Achievements

x3 x2

Highlights

Developer Program Member

Organizations

Stars

NLP

49 repositories

lovit / KR-WordRank

비지도학습 방법으로 한국어 텍스트에서 단어/키워드를 자동으로 추출하는 라이브러리입니다

Python 354 55 Updated Apr 13, 2022

haven-jeon / PyKoSpacing

Automatic Korean word spacing with Python

Python 423 115 Updated Jul 4, 2024

JohnSnowLabs / nlu

1 line for thousands of State of The Art NLP models in hundreds of languages The fastest and most accurate way to solve text problems.

Python 958 138 Updated Jan 28, 2025

google / compact_enc_det

compact_enc_det - Compact Encoding Detection

C 242 84 Updated Feb 12, 2024

hunspell / hunspell

The most popular spellchecking library.

C++ 2,441 265 Updated Feb 21, 2026

huggingface / sentence-transformers

State-of-the-Art Text Embeddings

Python 18,293 2,759 Updated Feb 20, 2026

SKTBrain / KoBERT

Korean BERT pre-trained cased (KoBERT)

Python 1,402 380 Updated Jun 14, 2025

pytorch / translate

Translate - a PyTorch Language Library

Python 834 197 Updated Apr 27, 2023

pytorch / text

Models, data loaders and abstractions for language processing, powered by PyTorch

Python 3,565 812 Updated Sep 10, 2025

neuml / txtai

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

Python 12,195 781 Updated Feb 21, 2026

nlp-with-transformers / notebooks

Jupyter notebooks for the Natural Language Processing with Transformers book

Jupyter Notebook 4,716 1,466 Updated Aug 21, 2024

sebastian-hofstaetter / teaching

Open-Source Information Retrieval Courses @ TU Wien

Python 696 95 Updated Jun 12, 2023

lgalke / vec4ir

Word Embeddings for Information Retrieval

Python 226 41 Updated Oct 4, 2023

Agrover112 / awesome-semantic-search

A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.

361 29 Updated Dec 9, 2025

featureform / featureform

The Virtual Feature Store. Turn your existing data infrastructure into a feature store.

Go 1,965 102 Updated Jul 3, 2025

google-research / prompt-tuning

Original Implementation of Prompt Tuning from Lester, et al, 2021

Python 698 63 Updated Mar 6, 2025

koursaros-ai / nboost

NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms (i.e. Elasticsearch)

Python 674 70 Updated Sep 30, 2020