# 📖 Chronological Evolution of NLP Subfields (1900–2025)

---

## 1. 🧩 Foundations of Computational Linguistics (1900–1960s)

- 1900s–1930s – Early structural and quantitative linguistics (Saussure’s structuralism, Zipf’s law on word frequency).  
- 1940s – First work on machine translation (Warren Weaver, 1949 memo).  
- 1950 – Alan Turing, “Computing Machinery and Intelligence” → Turing Test.  
- 1954 – Georgetown-IBM experiment: automatic Russian → English MT demonstration on a small vocabulary and rule set.  
- 1957 – Chomsky’s *Syntactic Structures* introduces transformational grammar.  
- 1960s – ELIZA (Weizenbaum, 1966): first chatbot (rule-based).  

---

## 2. 📚 Rule-Based & Symbolic NLP (1960s–1980s)

- 1968 onward – SYSTRAN: rule-based MT system (later used by the US Air Force, NASA, and the European Commission).  
- 1970 – SHRDLU (Winograd): natural language understanding in constrained microworlds.  
- 1970s – Knowledge-based systems (frames, semantic networks).  
- 1980s – Expert systems & symbolic parsing (DCGs, building on the ATNs of the 1970s).  

Hand-written rules scaled poorly beyond narrow domains, which motivated the shift to statistical methods.  

---

## 3. 📊 Statistical NLP Era (1980s–1990s)

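- Late 1980s – IBM Candide project: statistical MT using aligned bilingual corpora.  
- 1988 – Church: stochastic part-of-speech tagger, carrying HMM-style methods from speech recognition over to text.  
- Early 1990s – ARPA MT evaluations score output with human adequacy and fluency judgments, precursors to automatic metrics such as BLEU.  
- 1993 – Brown et al.: IBM Models 1–5 for word alignment; SMT becomes the leading research paradigm.  
- 1996 – Maximum Entropy models for NLP (Berger, Della Pietra & Della Pietra).  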

---

## 4. 🗣️ Speech Recognition & Spoken NLP (1970s–2010s)

- 1970s – HARPY (CMU): early speech recognition.  
- 1980s – HMMs become standard for ASR.  
- 1990s – Dragon Dictate: commercial speech-to-text.  
- 1990s–2000s – GMM-HMM systems for large-vocabulary continuous speech recognition.  
- 2009–2012 – Deep learning-based ASR: DNN acoustic models replace GMMs (Hinton et al., 2012).  

---

## 5. 🌐 Machine Translation (MT)

- 1950s–1970s – Rule-based MT (SYSTRAN, Georgetown).  
- Early 2000s – Phrase-based SMT (Koehn, Och & Marcu, 2003).  
- 2013 – RCTM (Kalchbrenner & Blunsom): convolution + RNN LM for MT.  
- 2014 – Seq2Seq (Sutskever, Vinyals, Le): RNN encoder-decoder.  
- 2015 – Bahdanau attention → dynamic context vectors.  
- 2017 – Transformer (*Attention is All You Need*, Vaswani et al.).  
- 2018 – Ott et al. *Scaling NMT*: large-batch Transformer training.  
- 2019–2025 – GPT/ChatGPT-style LLMs handle translation as one of many prompted tasks rather than as a dedicated system.  

---

## 6. 📖 Language Modeling & Representation Learning

- 1980s – N-gram language models.  
- 2003 – Bengio et al.: neural probabilistic language model, an influential early neural LM.  
- 2013 – Word2Vec (Mikolov et al.): distributed word embeddings.  
- 2014 – GloVe (Pennington et al.).  
- 2018 – BERT (Devlin et al.): bidirectional transformers.  
- 2019 – XLNet, RoBERTa → improved pretraining.  
- 2020–2025 – GPT-3/4/5, PaLM, LLaMA, Mixtral, DeepSeek models.  
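
To make the starting point of this timeline concrete, here is a minimal sketch of a count-based bigram language model. It is an illustration only: the toy corpus, the function names `train_bigram_lm` and `bigram_prob`, and the choice of add-one (Laplace) smoothing are assumptions made for this example, not details taken from any system listed above.

```python
from collections import Counter

def train_bigram_lm(sentences):
    """Count unigrams and bigrams over tokenized sentences with boundary markers."""
    unigrams, bigrams = Counter(), Counter()
    for tokens in sentences:
        padded = ["<s>"] + tokens + ["</s>"]
        unigrams.update(padded)
        bigrams.update(zip(padded, padded[1:]))
    return unigrams, bigrams

def bigram_prob(prev, word, unigrams, bigrams):
    """Estimate P(word | prev) with add-one (Laplace) smoothing."""
    vocab_size = len(unigrams)  # includes the <s> and </s> markers
    return (bigrams[(prev, word)] + 1) / (unigrams[prev] + vocab_size)

# Hypothetical toy corpus, for illustration only.
corpus = [["the", "cat", "sat"], ["the", "dog", "sat"]]
unigrams, bigrams = train_bigram_lm(corpus)

p_cat = bigram_prob("the", "cat", unigrams, bigrams)  # seen bigram
p_sat = bigram_prob("the", "sat", unigrams, bigrams)  # unseen bigram
print(p_cat, p_sat)
```

The seen bigram receives a higher probability than the unseen one, and smoothing keeps the unseen one above zero. Neural language models replace these counts with learned, dense representations.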

---

## 7. 🤖 Question Answering & Information Retrieval

- 1960s – BASEBALL QA system.  
- 1987–1997 – MUC (Message Understanding Conferences): shared evaluations for information extraction.  
- 1999–2000s – TREC QA tracks.  
- 2011 – IBM Watson (Jeopardy!) → DeepQA.  
- 2018 – BERT → SQuAD benchmark breakthroughs.  
- 2020s – Open-domain QA with GPT-3/ChatGPT.  

---

## 8. 📝 Text Classification & Sentiment Analysis

- 1990s – Naive Bayes, SVMs for classification.  
- 2002 – Pang et al.: sentiment analysis benchmark.  
- 2010s – RNNs/CNNs dominate sentiment analysis.  
- 2018 – BERT fine-tuning → state-of-the-art accuracy on standard classification benchmarks (e.g., GLUE).  
- 2020s – Zero-shot classification with LLMs.  

---

## 9. ✂️ Summarization

- 1950s–70s – Early extractive approaches (Luhn, 1958; Edmundson, 1969).  
- 2004 – Graph-based extractive summarization (LexRank, TextRank).  
- 2015–2017 – Seq2Seq + attention for abstractive summarization (Rush et al., 2015; See et al., 2017).  
- 2019 – BERTSum, PEGASUS (Google).  
- 2020s – LLM-based summarization (e.g., ChatGPT) with prompt-controlled length and style.  

---

## 10. 🗣️ Dialogue Systems & Chatbots

- 1966 – ELIZA.  
- 1972 – PARRY (Colby): simulated a patient with paranoid schizophrenia.  
- 1995 – ALICE (AIML-based; Loebner Prize winner in the early 2000s).  
- 2015 – Sequence-to-sequence conversational models.  
- 2020 – Meena, BlenderBot.  
- 2022–2025 – ChatGPT, Gemini, Claude → general-purpose conversational AI.  

---

## 11. 🧠 Semantics & Pragmatics

- 1970s – Montague grammar.  
- 1990s – WordNet lexical database.  
- 1990 – Latent Semantic Analysis (LSA; Deerwester et al.).  
- 2003 – Latent Dirichlet Allocation (LDA; Blei et al.) for topic modeling.  
- 2019 – Sentence-BERT: semantic embeddings.  
- 2020s – Embedding-based evaluation (BERTScore, COMET).  

---

## 12. 📑 Evaluation Metrics in NLP

- 2002 – BLEU introduced (Papineni et al.).  
- 2004 – ROUGE (summarization).  
- 2005–2015 – METEOR (2005), TER (2006), chrF (2015).  
- 2019–2020s – BERTScore, BLEURT, COMET, MoverScore.  
- 2025 – Hybrid metrics combining automatic + human preference models.  
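
As a rough illustration of how overlap metrics in the BLEU family work, the sketch below computes clipped ("modified") n-gram precision, the core quantity inside BLEU. It is a simplified, assumption-laden example: it uses a single reference, whitespace tokenization, and one n-gram order, and it omits the brevity penalty and the geometric mean over orders, so it is not a full BLEU implementation. The names `ngrams` and `modified_precision` are chosen for this example.

```python
from collections import Counter

def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def modified_precision(candidate, reference, n):
    """Clipped n-gram precision of a candidate against a single reference."""
    cand_counts = Counter(ngrams(candidate, n))
    ref_counts = Counter(ngrams(reference, n))
    # Each candidate n-gram is credited at most as often as it occurs in the reference.
    clipped = sum(min(count, ref_counts[gram]) for gram, count in cand_counts.items())
    total = sum(cand_counts.values())
    return clipped / total if total else 0.0

# A degenerate candidate that repeats one common word.
candidate = "the the the the the the the".split()
reference = "the cat is on the mat".split()

p1 = modified_precision(candidate, reference, 1)
print(p1)  # 2 of the 7 candidate unigrams are credited
```

Clipping is what stops a candidate from scoring well by repeating a frequent word: without it, all seven unigrams here would count as matches.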

---

## 13. ⚡ Recent Cutting-Edge Subfields (2020–2025)

- Multilingual LLMs (building on mBERT, 2018, and XLM-R, 2019).  
- Low-resource NLP with transfer + few-shot learning.  
- Code LLMs (Codex, AlphaCode, StarCoder).  
- Multimodal NLP (CLIP, Flamingo, GPT-4V, Gemini).  
- Responsible NLP → bias, fairness, toxicity mitigation.  
- Efficiency research → distillation, quantization, retrieval-augmented LMs (RAG).  

---

## 🎓 Conclusion
From rule-based linguistics in the 1950s to transformer-based LLMs in the 2020s, NLP has evolved across distinct subfields: MT, LM, QA, IR, summarization, dialogue, semantics, evaluation, and multimodality. Each milestone reflects a shift in paradigm:

**Symbolic → Statistical → Neural RNNs → Attention → Transformers → LLMs.**