## What is AI?

##### Artificial intelligence is a field of science concerned with building computers and machines that can reason, learn, and act in such a way that would normally require human intelligence or that involves data whose scale exceeds what humans can analyze. 

##### AI is a broad field that encompasses many different disciplines, including computer science, data analytics and statistics, hardware and software engineering, linguistics, neuroscience, and even philosophy and psychology. 

##### On an operational level for business use, AI is a set of technologies that are based primarily on machine learning and deep learning, used for data analytics, predictions and forecasting, object categorization, natural language processing, recommendations, intelligent data retrieval, and more.

##### (Src: https://cloud.google.com/learn/what-is-artificial-intelligence)

## Types of artificial intelligence
##### Artificial intelligence can be organized in several ways, depending on the stage of development or the actions being performed. Broadly, it can be categorized as:

### Predictive AI
##### In predictive AI, a machine learning system analyses patterns and relationships in historical data to make predictions about future results.

##### For example, companies can use such a system to analyse the purchasing patterns of different demographic groups of users. With this information, they can find out which types of customers are more likely to buy a certain product. This type of technology can also be used to classify language or image data.
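##### The purchasing-pattern example above can be sketched in a few lines. This is a deliberately simple illustration, not a production method: it estimates the purchase probability for each demographic group as the observed frequency in past records. The `history` data, the group labels, and the `purchase_rates` function are all hypothetical.

```python
from collections import defaultdict

# Hypothetical historical records: (demographic group, bought the product?)
history = [
    ("18-25", True), ("18-25", False), ("18-25", True), ("18-25", True),
    ("26-40", False), ("26-40", False), ("26-40", True),
    ("41-60", False), ("41-60", False), ("41-60", False), ("41-60", True),
]

def purchase_rates(records):
    """Estimate P(buy | group) as the observed purchase frequency per group."""
    bought = defaultdict(int)
    total = defaultdict(int)
    for group, did_buy in records:
        total[group] += 1
        bought[group] += int(did_buy)
    return {group: bought[group] / total[group] for group in total}

rates = purchase_rates(history)
most_likely = max(rates, key=rates.get)  # group with the highest estimated rate
print(rates)
print("Most likely to buy:", most_likely)
```

##### A real predictive system would normally use a trained model with many more features, but the idea is the same: learn from past outcomes, then score new cases.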

### Generative AI
##### Generative AI is a group of systems that are used to create new data. 
##### Generative AI tools can be used, for example, to create new images, text or music. 
##### Generative models are trained on large volumes of existing data. After seeing and analysing a large number of examples, they can use what they have learned to generate something new. 
##### That is why they are called “generative”.

##### For example, Gemini can help you draft an email, provided you describe precisely what you want it to write.

## The Inner Workings of Generative AI
#### Simply put, GenAI uses neural networks to learn patterns from vast datasets and generate new content.


#### Here's a more detailed look at how these models work:
#### 1. Training with Vast Datasets:
##### Generative AI models are trained on massive amounts of data, much of it unlabeled, using unsupervised or semi-supervised learning.
##### This training process allows them to learn the underlying structures and patterns within the data.
##### Foundation models such as GPT-3 and Stable Diffusion are trained on enormous text and image datasets, respectively. 

#### 2. Neural Networks and Deep Learning:
##### These models rely on sophisticated neural networks, which are computational models inspired by the human brain. 
##### Deep learning algorithms within these networks identify and encode the patterns and relationships in the training data. 
##### They use different learning approaches, like unsupervised or semi-supervised learning, to train the models. 
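##### To make the idea of a neural network concrete, here is a minimal sketch of a forward pass through a tiny two-layer network. The weights are hypothetical, hand-picked values; a trained network would learn them from data, and real models have millions or billions of parameters.

```python
import math

def relu(x):
    return max(0.0, x)

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def dense(inputs, weights, biases, activation):
    """One fully connected layer: activation(w . x + b) for each output unit."""
    return [
        activation(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]

# Hypothetical weights and biases, chosen by hand for illustration.
W1 = [[0.5, -0.2], [0.3, 0.8]]   # hidden layer: 2 inputs -> 2 units
b1 = [0.0, -0.1]
W2 = [[1.0, -1.0]]               # output layer: 2 units -> 1 output
b2 = [0.0]

x = [1.0, 2.0]
hidden = dense(x, W1, b1, relu)
output = dense(hidden, W2, b2, sigmoid)
print(hidden, output)
```

##### Training consists of adjusting `W1`, `b1`, `W2`, and `b2` so that the outputs match the patterns in the data; that step is omitted here.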

#### 3. Foundation Models and LLMs:
##### Foundation models, like those behind ChatGPT, are general-purpose AI systems that can perform various tasks.
##### Large Language Models (LLMs), like GPT, are a type of foundation model specifically focused on language-based tasks like summarization, text generation, and conversation. 

#### 4. Generating New Content:
##### Once trained, these models can generate new content based on a prompt or input.
##### For example, a text-based model might generate a new essay based on a short text request.
##### An image generation model could create a photorealistic image based on a text input. 

#### 5. Probability Distributions and Inference:
##### Many generative models learn to predict the next item in a sequence, whether it's the next word in a sentence or, in some image models, the next pixel in an image. 
##### They produce a probability distribution over the possible next elements, based on the learned patterns and relationships, and select or sample the next element from it. 
##### "Inference" is the process of running the trained model on new input to produce output. It is distinct from training, which is when the model's parameters are adjusted. 
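##### Next-item prediction can be illustrated with a toy bigram model, a minimal sketch that counts which word follows which in a tiny made-up corpus and turns the counts into probabilities. Real generative models use neural networks rather than counts; the corpus and function names here are hypothetical.

```python
import random
from collections import Counter, defaultdict

corpus = "the cat sat on the mat . the cat ate the fish .".split()

# Count how often each word follows each other word.
follows = defaultdict(Counter)
for current, nxt in zip(corpus, corpus[1:]):
    follows[current][nxt] += 1

def next_word_distribution(word):
    """Return P(next word | word) as relative frequencies."""
    counts = follows[word]
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}

def sample_next(word, rng):
    """Draw the next word at random according to the learned distribution."""
    dist = next_word_distribution(word)
    words = list(dist)
    return rng.choices(words, weights=[dist[w] for w in words], k=1)[0]

rng = random.Random(0)  # seeded so repeated runs draw the same word
dist_the = next_word_distribution("the")
print(dist_the)
print(sample_next("the", rng))
```

##### In this corpus "the" is followed by "cat" twice and by "mat" and "fish" once each, so "cat" gets probability 0.5 and the other two 0.25 each.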

#### 6. Generative Adversarial Networks (GANs) and Other Models:
##### GANs are a specific type of generative model that uses two networks: a generator and a discriminator. 
##### The generator attempts to create realistic data, while the discriminator tries to distinguish between real and generated data. 
##### Other generative approaches, such as Variational Autoencoders (VAEs), diffusion models (the family Stable Diffusion belongs to), and transformer-based models, are also used in various generative AI applications. 
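##### The generator/discriminator contest in a GAN is commonly written as a minimax objective. This is the standard formulation from the original GAN paper by Goodfellow et al., not something taken from this notebook's source. The symbols are introduced here for illustration: $D(x)$ is the discriminator's estimated probability that $x$ is real, $G(z)$ is the generator's output for random noise $z$, and $p_{\text{data}}$ and $p_z$ are the data and noise distributions.

```latex
\min_{G} \max_{D} \; V(D, G) =
  \mathbb{E}_{x \sim p_{\text{data}}}\big[\log D(x)\big]
  + \mathbb{E}_{z \sim p_{z}}\big[\log\big(1 - D(G(z))\big)\big]
```

##### The discriminator tries to make both terms large (real data scored near 1, generated data near 0), while the generator tries to make the second term small by producing samples the discriminator scores as real.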

#### 7. Examples of Generative AI in Action:
##### Image Generation: Models like Stable Diffusion create photorealistic images from text prompts. 
##### Text Generation: Models like ChatGPT can generate essays, stories, and code based on user input. 
##### Audio Generation: Models can create music, sound effects, and even human speech. 
##### Video Generation: Models can generate new videos or edit existing ones. 

## Architecture of Gen AI

#### 1. User Input / Prompt
##### The user provides a natural language prompt, which could be a question, instruction, or partial content (e.g., “Generate a stock analysis report”).

##### Input can also include context: documents, images, or structured data.

#### 2. Prompt Processing / Tokenization
##### The input text is tokenized: it is split into tokens (words or subwords), and each token is mapped to a numerical ID.

##### Example:
##### "Generate text" → [1357, 2021, 98]

##### A tokenizer (for example, one based on Byte-Pair Encoding) breaks the text down into units the model understands.
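##### The sketch below shows the text-to-IDs round trip with a toy subword tokenizer. It is not Byte-Pair Encoding itself (BPE learns its merges from data); it uses a simpler greedy longest-match lookup over a small hypothetical vocabulary, so the IDs differ from the illustrative ones above.

```python
# Hypothetical toy vocabulary; real tokenizers learn tens of thousands of entries.
VOCAB = {"gen": 0, "erate": 1, " ": 2, "text": 3, "t": 4, "e": 5, "x": 6, "<unk>": 7}
ID_TO_TOKEN = {i: t for t, i in VOCAB.items()}

def tokenize(text):
    """Greedy longest-match: at each position take the longest vocabulary entry."""
    text = text.lower()
    ids = []
    pos = 0
    while pos < len(text):
        for end in range(len(text), pos, -1):
            piece = text[pos:end]
            if piece in VOCAB:
                ids.append(VOCAB[piece])
                pos = end
                break
        else:
            # No vocabulary entry starts here: emit the unknown token.
            ids.append(VOCAB["<unk>"])
            pos += 1
    return ids

def detokenize(ids):
    """Map IDs back to their text pieces and join them."""
    return "".join(ID_TO_TOKEN[i] for i in ids)

ids = tokenize("Generate text")
print(ids)              # [0, 1, 2, 3]
print(detokenize(ids))  # generate text
```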

#### 3. Model Inference (Core GenAI Model)
##### The tokenized input is fed into a transformer-based model (e.g., GPT-4, Claude, Gemini).

##### Internally, the model:

###### a. Uses self-attention mechanisms to understand relationships between tokens.

###### b. Predicts the next token, one step at a time, using learned probabilities.

##### Output: A sequence of predicted tokens (words) forming a coherent response.
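##### Self-attention can be sketched in plain Python. This minimal version is scaled dot-product attention under a simplifying assumption: the queries, keys, and values are the input vectors themselves, whereas a real transformer applies learned projection matrices, uses many attention heads, and (in GPT-style models) masks future tokens. The input `X` is a hypothetical set of three 2-dimensional token vectors.

```python
import math

def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(X):
    """Scaled dot-product self-attention with Q = K = V = X (no learned projections)."""
    d = len(X[0])
    outputs, all_weights = [], []
    for q in X:
        # Similarity of this token to every token, scaled by sqrt(dimension).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in X]
        weights = softmax(scores)
        # Output is the attention-weighted average of all token vectors.
        out = [sum(w * v[j] for w, v in zip(weights, X)) for j in range(d)]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights

X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs, weights = self_attention(X)
for row in weights:
    print([round(w, 3) for w in row])
```

##### Each row of `weights` shows how strongly one token attends to every token in the sequence; the first token attends more to itself and the third token than to the second, because their dot products are larger.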

#### 4. Decoding and Generation
##### The model generates text through a decoding strategy:

###### a. Greedy decoding: Picks the highest probability token.

###### b. Beam search: Explores multiple likely paths.

###### c. Top-k / Top-p sampling: Adds randomness for more creative outputs.

##### Tokens are then detokenized into human-readable text.
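##### The decoding strategies can be compared on a single step. The sketch below applies greedy decoding, top-k sampling, and top-p (nucleus) sampling to one hypothetical next-token distribution; beam search is omitted because it needs scores across several steps. The `probs` values are made up for illustration.

```python
import random

# Hypothetical next-token probabilities produced by a model for one step.
probs = {"the": 0.40, "a": 0.25, "this": 0.15, "that": 0.10, "one": 0.10}

def greedy(probs):
    """Always pick the single most probable token."""
    return max(probs, key=probs.get)

def top_k_sample(probs, k, rng):
    """Sample among the k most probable tokens, in proportion to their probability."""
    top = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:k]
    tokens, weights = zip(*top)
    return rng.choices(tokens, weights=weights, k=1)[0]

def top_p_candidates(probs, p):
    """Smallest set of most probable tokens whose cumulative probability reaches p."""
    kept, cumulative = [], 0.0
    for token, prob in sorted(probs.items(), key=lambda kv: kv[1], reverse=True):
        kept.append((token, prob))
        cumulative += prob
        if cumulative >= p:
            break
    return kept

def top_p_sample(probs, p, rng):
    """Sample among the nucleus returned by top_p_candidates."""
    tokens, weights = zip(*top_p_candidates(probs, p))
    return rng.choices(tokens, weights=weights, k=1)[0]

rng = random.Random(42)  # seeded so the run is repeatable
print(greedy(probs))
print(top_k_sample(probs, k=2, rng=rng))
print(top_p_sample(probs, p=0.7, rng=rng))
```

##### Greedy decoding always returns "the" here. Top-k with `k=2` can return "the" or "a", and top-p with `p=0.7` draws from "the", "a", and "this", which is where the extra variety comes from.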

#### 5. Post-Processing / Tool Use (Optional)
##### The agent or framework might:

###### a. Invoke external tools (e.g., calculators, APIs).

###### b. Use retrieval-augmented generation (RAG) to pull relevant data from external sources.

###### c. Call function APIs (like fetching stock prices or running SQL queries).
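##### Tool use can be sketched as a small dispatcher: the model emits a structured request, and the surrounding framework looks up the named tool and runs it. The JSON request format, the tool names, and the price data below are all hypothetical; real frameworks each define their own schema.

```python
import json
import operator

# Hypothetical tools; a real system would call external services here.
def calculator(expression):
    """Evaluate a simple 'a op b' expression such as '6 * 7'."""
    a, op, b = expression.split()
    ops = {"+": operator.add, "-": operator.sub,
           "*": operator.mul, "/": operator.truediv}
    return ops[op](float(a), float(b))

def get_stock_price(ticker):
    fake_prices = {"ACME": 123.45}  # made-up data for illustration only
    return fake_prices.get(ticker)

TOOLS = {"calculator": calculator, "get_stock_price": get_stock_price}

def run_tool_call(model_output):
    """Parse a JSON tool request emitted by the model and dispatch it."""
    request = json.loads(model_output)
    tool = TOOLS.get(request["tool"])
    if tool is None:
        return {"error": f"unknown tool: {request['tool']}"}
    return {"result": tool(**request["arguments"])}

# Stand-in for text the model would generate when it decides to use a tool.
model_output = '{"tool": "calculator", "arguments": {"expression": "6 * 7"}}'
result = run_tool_call(model_output)
print(result)  # {'result': 42.0}
```

##### The tool result is typically fed back to the model as additional context so it can compose the final answer.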

#### 6. Memory & Context Handling (Optional in Agents)
##### Memory allows persistence across multiple conversations or tasks.

##### It stores past messages, summaries, and preferences.

##### Long context windows (e.g., 128K tokens) allow the model to process entire documents in a single prompt.
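##### One simple way to handle memory within a limited context window is to keep only the most recent messages that fit a token budget. The sketch below is one possible approach, not how any particular product works; the `ConversationMemory` class is hypothetical, and it counts whitespace-separated words as a stand-in for real tokens.

```python
from collections import deque

class ConversationMemory:
    """Keeps the most recent messages that fit in a fixed token budget."""

    def __init__(self, max_tokens):
        self.max_tokens = max_tokens
        self.messages = deque()

    @staticmethod
    def count_tokens(text):
        # Assumption: whitespace-separated words stand in for real tokens.
        return len(text.split())

    def total_tokens(self):
        return sum(self.count_tokens(text) for _, text in self.messages)

    def add(self, role, text):
        self.messages.append((role, text))
        # Drop the oldest messages until the history fits the budget again,
        # always keeping at least the newest message.
        while self.total_tokens() > self.max_tokens and len(self.messages) > 1:
            self.messages.popleft()

    def as_prompt(self):
        """Render the remembered messages as context for the next model call."""
        return "\n".join(f"{role}: {text}" for role, text in self.messages)

memory = ConversationMemory(max_tokens=8)
memory.add("user", "What is generative AI?")
memory.add("assistant", "Models that create new content.")
memory.add("user", "Give one example.")
print(memory.as_prompt())
```

##### With a budget of 8 "tokens", the first user message is dropped once the assistant reply arrives, so the prompt contains only the last two messages. Agent frameworks often summarize old messages instead of discarding them.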

#### 7. Response Delivery
##### The final generated response is rendered back to the user.

##### Optionally includes:

###### a. Citations or references.

###### b. Structured output (e.g., tables, graphs, JSON).

###### c. Follow-up questions or clarification prompts.

#### Under the Hood – Key Components
| Layer | Description |
| --- | --- |
| Transformer Architecture | Deep learning backbone with self-attention and feed-forward networks. |
| Pretraining | Model trained on vast public data (code, books, papers, web). |
| Fine-tuning | Domain-specific training (e.g., finance, legal, customer support). |
| RLHF (Reinforcement Learning from Human Feedback) | Aligns model outputs with human preferences. |
| Inference Layer | Optimized backend to serve the model fast (via GPUs, TPUs). |
| Middleware/Agents | Coordinates tools, memory, and task planning in agentic systems. |

In [1]:
# Architecture of Gen AI: print the processing pipeline as a text diagram.
stages = [
    "[User Prompt]",
    "[Tokenizer]",
    "[Transformer Model]",
    "[Decoder → Generated Tokens]",
    "[Detokenizer → Natural Language Output]",
    "[Optional: Tools / RAG / Memory / API Calls]",
]
# Join the stages with a downward arrow between consecutive steps.
print("\n     ↓\n".join(stages))

## Limitations of GenAI
##### 1. Hallucinations of AI: 
###### AI tools can generate responses that appear correct but contain errors or are out of context, because the model lacks an understanding of the real world or because of limitations in its training data. It is crucial to review and verify the information obtained.

##### 2. Data Quality: 
###### The results of generative AI depend on the quality of the training data. If the data contain errors or biases or are incomplete, relying on the model's output can be risky. Google is working continuously to assess and mitigate these risks.

##### 3. Confidentiality of Business Data: 
###### It is important to be prudent when entering confidential business information into public tools. Read the terms and conditions and, if possible, use paid versions that guarantee confidentiality, such as Google Cloud, and make sure that you establish the necessary control measures.