🎯
Working from home | Focusing

Organizations

@EpicGames @open-korean-text

Stars

NLP

49 repositories

A library that automatically extracts words/keywords from Korean text using unsupervised learning.

Python 354 55 Updated Apr 13, 2022

Automatic Korean word spacing with Python

Python 423 115 Updated Jul 4, 2024

1 line for thousands of State of The Art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems.

Python 958 138 Updated Jan 28, 2025

compact_enc_det - Compact Encoding Detection

C 242 84 Updated Feb 12, 2024

The most popular spellchecking library.

C++ 2,441 265 Updated Feb 21, 2026

State-of-the-Art Text Embeddings

Python 18,293 2,759 Updated Feb 20, 2026

Korean BERT pre-trained cased (KoBERT)

Python 1,402 380 Updated Jun 14, 2025

Translate - a PyTorch Language Library

Python 834 197 Updated Apr 27, 2023

Models, data loaders and abstractions for language processing, powered by PyTorch

Python 3,565 812 Updated Sep 10, 2025

💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows

Python 12,195 781 Updated Feb 21, 2026

Jupyter notebooks for the Natural Language Processing with Transformers book

Jupyter Notebook 4,716 1,466 Updated Aug 21, 2024

Open-Source Information Retrieval Courses @ TU Wien

Python 696 95 Updated Jun 12, 2023

Word Embeddings for Information Retrieval

Python 226 41 Updated Oct 4, 2023

A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.

361 29 Updated Dec 9, 2025

The Virtual Feature Store. Turn your existing data infrastructure into a feature store.

Go 1,965 102 Updated Jul 3, 2025

Original Implementation of Prompt Tuning from Lester, et al, 2021

Python 698 63 Updated Mar 6, 2025

NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms (e.g. Elasticsearch)

Python 674 70 Updated Sep 30, 2020
Jupyter Notebook 89 35 Updated Apr 3, 2025

Implementation of HNSW that supports online updates

C++ 67 20 Updated Dec 17, 2017

Code for ECCV2018 paper: Revisiting the Inverted Indices for Billion-Scale Approximate Nearest Neighbors

C++ 220 27 Updated May 7, 2020

Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)

Python 190 10 Updated Jun 24, 2022

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch

Python 11,330 1,085 Updated May 11, 2024

Implementation of RETRO, DeepMind's Retrieval based Attention net, in PyTorch

Python 879 110 Updated Oct 30, 2023

Easily compute clip embeddings and build a clip retrieval system with them

Jupyter Notebook 2,728 239 Updated Aug 15, 2025

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Jupyter Notebook 1,001 112 Updated Jan 3, 2024
Python 30 4 Updated Nov 25, 2021
Python 112 10 Updated Aug 5, 2021

Automatically create Faiss knn indices with the most optimal similarity search parameters.

Python 893 78 Updated Nov 4, 2025

Repo for external large-scale work

Python 6,544 723 Updated Apr 27, 2024

A deep learning NLP repository that uses TensorFlow to organize everything from text preprocessing to downstream tasks for recent models such as topic models, BERT, GPT, and LLMs.

Jupyter Notebook 572 289 Updated Jun 27, 2025