Skip to content
View niranjanaryan's full-sized avatar
🤿
Deeply Involved
🤿
Deeply Involved

Block or report niranjanaryan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Memory Networks implementations

Lua 1,754 375 Updated Jul 28, 2020

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages

Python 7,392 898 Updated Mar 5, 2025

Python wrapper for Stanford CoreNLP.

Python 922 199 Updated Dec 7, 2021

The CoreNLP package is a thin Elixir client for the Stanford CoreNLP Server.

Elixir 15 1 Updated Feb 15, 2019

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Java 9,810 2,712 Updated Mar 9, 2025

A Python library for quantum programming using Quil.

Python 1,448 349 Updated Mar 6, 2025

JDsearch: A Personalized Product Search Dataset with Real Queries and Full Interactions

31 Updated May 8, 2023

An open-source dataset for e-commerce user intent detection and embedding retrieval

23 2 Updated Aug 22, 2022

The Kyoto Text Analysis Toolkit for word segmentation and pronunciation estimation, etc.

C++ 206 26 Updated Apr 3, 2020

indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2

Jupyter Notebook 124 32 Updated Jan 2, 2024

Resources and tools for Indian language Natural Language Processing

Python 576 160 Updated Jun 7, 2024

Moses, the machine translation system

Roff 1,594 779 Updated Feb 4, 2025

Supplementary material for "When and Why Are Pre-trained Word Embeddings Useful for Neural Machine Translation?" at NAACL 2018

Python 122 19 Updated Apr 27, 2020

Hanzi Converter for Traditional and Simplified Chinese

Python 184 44 Updated Mar 28, 2020

Factoid Question Answering System - An advanced Open-domain Question Answering (ODQA) project that automatically answers factoid questions in Arabic and English languages using NLP and machine lear…

Python 5 Updated Jul 24, 2024

Comostional question answering

Python 17 8 Updated Jun 18, 2021

Metadata and versioning details for the Common Voice dataset

JavaScript 146 15 Updated Dec 17, 2024
Jupyter Notebook 1 Updated Dec 2, 2024

Lexical data at Unicode

Clojure 69 16 Updated Sep 1, 2024

Crawler for linguistic corpora

Python 204 55 Updated Dec 5, 2023

Multilingual parallel corpus based on OPUS Opensubtitles and corpus compiler

Python 2 Updated Jun 27, 2023

Parallel data mining with Mann-ki-Baat

Python 5 1 Updated Mar 2, 2020

Easy Bootstrap Resampling and Approximate Randomization for BLEU, METEOR, and TER using Multiple Optimizer Runs. This implements "Better Hypothesis Testing for Statistical Machine Translation: Cont…

Groff 203 39 Updated Feb 25, 2023

Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons

Python 1,109 166 Updated Jan 6, 2025

A tool that locates, downloads, and extracts machine translation corpora

Python 151 23 Updated Mar 6, 2025

The Open Parallel Corpus

Makefile 66 9 Updated Feb 27, 2025

Stand-alone language identification system

Python 2,366 321 Updated Jan 1, 2020

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 2,923 378 Updated May 8, 2024

Tooling to play around with multilingual machine translation for Indian Languages.

Python 22 4 Updated Mar 5, 2022
Next
Showing results