# GenAI Introduction

## What is GenAI?

- GenAI (Generative AI) refers to a category of artificial intelligence technologies designed to generate new content from input data. That content can be text, images, music, video, code, or structured data.
- GenAI differs from traditional AI in that it does not simply analyze or classify existing data; it creates new data based on learned patterns.

- Generative AI models, such as GPT (Generative Pre-trained Transformer, for text) and DALL·E (for images), are trained on large datasets and use deep learning techniques to learn the underlying structures and patterns in the data. Once trained, these models can generate new content that follows those learned structures.
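
To make the "learn patterns, then generate" idea concrete, here is a minimal sketch using a word-level bigram model. This is a deliberately tiny stand-in and not how GPT or DALL·E work internally; the function names `train_bigram_model` and `generate`, the toy corpus, and the `seed` parameter are all assumptions made for illustration.

```python
import random
from collections import defaultdict


def train_bigram_model(text):
    """Learn which words follow which: the 'patterns' in the training data."""
    words = text.split()
    model = defaultdict(list)
    for current, following in zip(words, words[1:]):
        model[current].append(following)
    return dict(model)


def generate(model, start, max_words=10, seed=0):
    """Generate a word sequence by repeatedly sampling a learned successor."""
    rng = random.Random(seed)  # fixed seed so runs are repeatable
    output = [start]
    while len(output) < max_words:
        candidates = model.get(output[-1])
        if not candidates:  # no known successor for this word: stop early
            break
        output.append(rng.choice(candidates))
    return " ".join(output)


# Hypothetical toy corpus; real models train on vastly larger datasets.
corpus = "the cat sat on the mat and the dog sat on the rug"
model = train_bigram_model(corpus)
print(generate(model, start="the", max_words=8))
```

The output is a sequence that may never appear verbatim in the corpus, yet every adjacent word pair was seen during training. Large generative models apply the same principle with neural networks in place of a lookup table.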

## Where Does GenAI Stand?

- Generative AI is evolving rapidly, driven largely by advances in machine learning (especially deep neural networks) and natural language processing (NLP).

- It stands at the intersection of several fields:

    - Artificial Intelligence & Machine Learning: GenAI is built on machine learning models (such as neural networks) that learn from vast datasets.

    - Natural Language Processing (NLP): Language models such as GPT focus on understanding and generating human-like language.

    - Creative Fields: GenAI is being used in creative industries (e.g., content creation, gaming, design, music) to generate new ideas and outputs.

    - Business & Data Automation: It’s also finding applications in automating tasks such as data analysis, report writing, and customer service.

## GenAI Impact Areas

1. Content Creation
2. Business Automation
3. Personalization (Marketing)
4. Healthcare
5. Education

## Problems with GenAI

1. Bias: Since GenAI models are trained on data collected from various sources, they can inadvertently learn and reproduce societal biases present in that data, leading to biased or harmful outputs.

2. Ethical Concerns:

    - Misinformation: GenAI models can generate text or images that are highly convincing but false, leading to the spread of misinformation.

    - Deepfakes: The ability to generate realistic videos or audio can be misused for creating misleading content or impersonating individuals.

3. Copyright & Intellectual Property: As AI generates new content based on existing works, questions arise about ownership and copyright protection for AI-generated works.

4. Job Displacement: Automation powered by GenAI may lead to displacement in certain industries, especially in content creation, customer service, and repetitive tasks.

5. Quality Control: AI-generated content can be high-quality, but it may still lack coherence or contain errors, which makes quality control a challenge.

6. Security Risks: Malicious actors may use GenAI to create sophisticated phishing attacks, commit fraud, or carry out other cyberattacks.

## GenAI Mental Model


- Foundation Models
    - User Perspective
    - Builder Perspective

### Builder Perspective

1. Transformer architecture
2. Types of Transformer architecture
    - Encoder Only (BERT)
    - Decoder Only (GPT)
    - Encoder and Decoder (T5)
3. Pretraining Pipeline
    - Training Objectives
    - Tokenization Strategies
    - Training Strategies
    - Challenge Handling
4. Optimization Techniques
    - Training Optimization
    - Model Compression
    - Inference Optimization
5. Fine-tuning Approaches
    - Task-specific Tuning
    - Instruction Tuning
    - RLHF (Reinforcement Learning from Human Feedback)
    - PEFT (Parameter-Efficient Fine-Tuning)
6. Evaluation Framework
7. Deployment


### User Perspective

1. Building Basic LLM Applications
    - Open Source vs Closed Source LLMs
    - Using LLM APIs
    - LangChain
    - HuggingFace
    - Ollama
2. Prompt Engineering Techniques
3. RAG Implementation
4. Fine-Tuning Strategies
5. Agent Development
6. LLMOps
7. Miscellaneous