Skip to content

Nativeatom/NaturalLanguageProcessing

Repository files navigation

Natural Language Procesing

This repository includes basic concepts of Natural Language Processing, textbooks and blogs of good reputation, popular papers and so on.

This is also the Natural Language Processing part of Machine Learning Resources created by a group of people including jindongwang.

Contributors are welcomed to work together and make it BETTER!

Resource of Textbooks and Lectures

Mathemetical and Statistical Foundation

  • Linear Algebra

  • Matrix Analysis

  • Convex Optimization

    • EE364A Stanford(Stephen Boyd)[pdf][website]
    • Introductory Lectures on Convex Programming(Yu.Nesterov)[pdf]

Machine Learning

Deep Learning

Natural Language Processing

Models and Applications

Blog and Tutorials

Topics and Tasks

Category of areas is based on tracks in ACL 2018, ACL 2020, EMNLP 2020

  • Model
    • Word2Vec
  • Pre-trained Embedding
    • Glove
    • word2vec
    • FastText
  • Contextual Word Embedding
    • ELMo
    • GPT
    • BERT
    • XLNet
    • BART
    • T-5

Sentimental Analysis and Argument Mining

Name Entity Recognition

Tagging, Chunking

  • Task
    • Word Segmentation
    • Syntactic Parsing
  • Model
    • Hidden Markov Model (HMM)
    • Conditional Random Fields (CRFs)
    • Finetuned Language Models

Syntax, Parsing

Document Analysis

  • Tasks

    • Semantic Parsing
      • AMR-to-text
      • Text-to-AMR
      • Table-to-text
      • Code Generation
  • Model

  • Dataset

Semantics: Lexical

  • Tasks
    • Word Sense Disambiguation
  • Tasks
    • Topic Extraction
    • Sentimental Extraction
    • Aspect Extraction

Machine Translation

Text Generation

Text Classification

Dialogue and Interactive Systems

Question Answering

Resources and Evaluation

Linguistic Theories and Cognitive Modeling

Textual Inference

Vision, Robotics, Speech, Multimodal

Language Modeling

Computational Social Science and Social Media

Discourse and Pragmatics

Information Retrieval and Text Mining

Language Grounding to Vision, Robotics and Beyond

Machine Learning for NLP

Theory and Formalism in NLP

Ethics in NLP

Commonsense Knowledge

Interpretability

NLP Applications

Resources and Benchmarks

Interesting NLP

Package

  • Machine Learning Package and Framework
    • sciki-learn
    • Tensorflow
    • Caffe2
    • Pytorch
    • MXNet
  • NLTK
  • gensim
  • jieba
  • Stanford NLP
  • Transformers (huggingface)

如何加入 How to contribute

如果你对本项目感兴趣,非常欢迎你加入!

  • 正常参与:请直接fork、pull都可以
  • 如果要上传文件:请不要直接上传到项目中,否则会造成git版本库过大。正确的方法是上传它的超链接。如果你要上传的文件本身就在网络中(如paper都会有链接),直接上传即可;如果是自己想分享的一些文件、数据等,鉴于国内网盘的情况,请按照如下方式上传:
    • (墙内)目前没有找到比较好的方式,只能通过链接,或者自己网盘的链接来做。
    • (墙外)首先在UPLOAD直接上传(需要注册账号);上传成功后,在DOWNLOAD里找到你刚上传的文件,共享链接即可。

如何开始项目协同合作

快速了解github协同工作 Learn how to collaborate through github

及时更新fork项目 Update through fork

如何使用git提交 How to submit in git

Fetch and Merge in Git