# LangChain Basics

## Install dependencies

In [1]:
pip install -r ./requirements.txt

Note: you may need to restart the kernel to use updated packages.


### Verify that LangChain is installed

In [2]:
pip show langchain

Name: langchain
Version: 0.1.11
Summary: Building applications with LLMs through composability
Home-page: https://github.com/langchain-ai/langchain
Author: 
Author-email: 
License: MIT
Location: C:\Users\Administrateur\Desktop\workshop\Workshop-AI\tp-AI\Lib\site-packages
Requires: aiohttp, dataclasses-json, jsonpatch, langchain-community, langchain-core, langchain-text-splitters, langsmith, numpy, pydantic, PyYAML, requests, SQLAlchemy, tenacity
Required-by: langchain-experimental
Note: you may need to restart the kernel to use updated packages.


#### Python-dotenv

In [3]:
import os
from dotenv import load_dotenv, find_dotenv

# Load the API keys (Cohere, Pinecone) from .env
load_dotenv(find_dotenv(), override=True)

COHERE_API_KEY = os.environ.get('COHERE_API_KEY')
# Confirm the key was loaded without printing the secret itself
print(COHERE_API_KEY is not None)

True
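Anything printed in a notebook cell is stored in the saved output, so a printed API key ends up in the shared file. A minimal sketch of a masked check follows; `mask_secret` is a hypothetical helper written for this example and is not part of python-dotenv or LangChain.

```python
import os


def mask_secret(value, visible=4):
    """Return a masked form of a secret that is safe to print (hypothetical helper)."""
    if not value:
        return "<not set>"
    if len(value) <= visible:
        return "*" * len(value)
    # Keep only the first few characters so the key can still be recognized
    return value[:visible] + "*" * (len(value) - visible)


# Read the key the same way as in the cell above
api_key = os.environ.get("COHERE_API_KEY")
print("COHERE_API_KEY:", mask_secret(api_key))
```

Showing only a short prefix is usually enough to tell which key was loaded.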


## Early Experimentation with LLMs Using LangChain 

### Introduction

LangChain is a framework for developing applications powered by language models. It enables applications that:

- Are context-aware: connect a language model to sources of context (prompt instructions, few-shot examples, content to ground its response in, etc.)

- Reason: rely on a language model to reason (about how to answer based on provided context, what actions to take, etc.)

### LLM Models (Wrappers): Cohere

https://python.langchain.com/docs/integrations/llms/cohere

Cohere is a Canadian startup that provides natural language processing models that help companies improve human-machine interactions.

- Founded in 2019
- Based in Toronto, Canada 

#### Setting up the LLM

In [4]:
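# In LangChain 0.1.x the Cohere wrapper lives in langchain-community;
# importing it from langchain.llms is deprecated
from langchain_community.llms import Cohere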

# Create Cohere LLM instance with parameters
llm = Cohere(
            # A non-negative float that tunes the degree of randomness in generation.
            # Higher values give more diverse, creative responses
            # Lower values give more focused, deterministic responses
            temperature=0.7, 
            # The maximum number of tokens to generate in the response
            max_tokens=512, 
            cohere_api_key=COHERE_API_KEY)
print(llm)

Cohere
Params: {'model': None, 'max_tokens': 512, 'temperature': 0.7, 'k': 0, 'p': 1, 'frequency_penalty': 0.0, 'presence_penalty': 0.0, 'truncate': None}


#### First try - test Cohere

In [5]:
# Generate response from LLM by invoking it 
output = llm.invoke('explain 5G networks in one sentence')
print(output)

 5G networks deliver faster speeds and lower latency than 4G and 3G networks, enabling a new era of internet connectivity and innovative technologies. 


In [7]:
# Pass prompt to LLM's get_num_tokens method
num_tokens = llm.get_num_tokens('explain 5G networks in one sentence')

# Print number of tokens in the tokenized prompt.
print(num_tokens)

8


In [8]:
# Prompt LLM with two questions 
prompts = ['What is the date of the European Union creation.', 
           'What is the formula for the area of a room?']

# Generate responses to prompts
# invoke() takes a single prompt, so use generate() for a list of prompts
output = llm.generate(prompts)  

# Print list of generated responses
print(output.generations)

[[Generation(text=" The European Union (EU) is a political and economic union of 27 countries. It was established on 1 November 1993 with the signing of the Treaty of Maastricht by the member states at the time. The EU has grown over time, with new member states joining through subsequent enlargement. \n\nThe EU's primary purpose is to promote peace, prosperity, and cooperation among its member states, which are located primarily in Europe but also include countries in Northern Africa and the Middle East. It does so through the establishment of a single market, the adoption of a common currency (the euro), the implementation of common policies on a range of issues, and the promotion of democracy and the rule of law.\n\nThe EU has had a significant impact on the social, economic, and political landscape of Europe and continues to play a key role in shaping the future of the region. ")], [Generation(text=' The formula for the area of a room depends on the shape of the room. Here are the 

#### Extracting the answer to the first question

In [9]:
print(output.generations[0][0].text)

 The European Union (EU) is a political and economic union of 27 countries. It was established on 1 November 1993 with the signing of the Treaty of Maastricht by the member states at the time. The EU has grown over time, with new member states joining through subsequent enlargement. 

The EU's primary purpose is to promote peace, prosperity, and cooperation among its member states, which are located primarily in Europe but also include countries in Northern Africa and the Middle East. It does so through the establishment of a single market, the adoption of a common currency (the euro), the implementation of common policies on a range of issues, and the promotion of democracy and the rule of law.

The EU has had a significant impact on the social, economic, and political landscape of Europe and continues to play a key role in shaping the future of the region. 
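The indexing `output.generations[0][0]` works because `generations` holds one inner list per prompt, and each inner list holds the candidate completions for that prompt. The sketch below collects the first candidate for every prompt. So that it runs without an API key, it uses small stand-in classes that mirror only the fields used here (an assumption made for illustration); in the notebook, `output` would be the object returned by `llm.generate(prompts)`.

```python
from dataclasses import dataclass
from typing import List


# Stand-ins that mirror only the attributes used below; the real objects
# come from llm.generate(prompts).
@dataclass
class Generation:
    text: str


@dataclass
class LLMResult:
    generations: List[List[Generation]]


output = LLMResult(generations=[
    [Generation(text=" Answer to the first prompt. ")],
    [Generation(text=" Answer to the second prompt. ")],
])

# One inner list per prompt; index 0 picks the first candidate completion
answers = [candidates[0].text.strip() for candidates in output.generations]

for number, answer in enumerate(answers, start=1):
    print(f"Prompt {number}: {answer}")
```

Calling `strip()` removes the leading and trailing whitespace that the completions in the outputs above carry.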


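#### Understanding the difference between invoke() and generate()

`invoke()` takes a single prompt and returns the completion as a string. `generate()` takes a list of prompts and returns an `LLMResult` whose `generations` attribute holds one list of `Generation` objects per prompt. For more detail, see: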

https://chat.langchain.com/  
https://python.langchain.com/docs/get_started/introduction
https://api.python.langchain.com/en/latest/langchain_api_reference.html

## Deep dive into LangChain 

### Prompt Templates

https://python.langchain.com/docs/modules/model_io/prompts/quick_start

In [10]:
# Importing PromptTemplate from the langchain root package is deprecated
from langchain.prompts import PromptTemplate

# Define template with input variables in braces
template = '''You are an experienced mathematician professor.
Write a few sentences about the following mathematical {concept} in {language}.'''

# Create PromptTemplate object with input variables
prompt = PromptTemplate(
    input_variables=['concept', 'language'], # list of input variable names
    template=template # template string
)

print(prompt)

input_variables=['concept', 'language'] template='You are an experienced mathematician professor.\nWrite a few sentences about the following mathematical {concept} in {language}.'
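With the default f-string style template format, filling the braces behaves much like Python's built-in `str.format`. The sketch below uses plain strings only, as an approximation of what `prompt.format(...)` produces; it is not the library's implementation.

```python
# Same template text as above, written as a plain Python string
template = (
    "You are an experienced mathematician professor.\n"
    "Write a few sentences about the following mathematical {concept} in {language}."
)

# Each {name} placeholder is replaced by the matching keyword argument
rendered = template.format(concept="Pythagorean theorem", language="Spanish")
print(rendered)

# Leaving out a variable fails loudly instead of sending a half-filled prompt
missing = None
try:
    template.format(concept="Pythagorean theorem")
except KeyError as err:
    missing = err.args[0]
    print("Missing input variable:", missing)
```

Declaring `input_variables` on the `PromptTemplate` serves a similar purpose: the expected inputs are stated up front.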


In [12]:
from langchain_community.llms import Cohere

# Create Cohere LLM instance with parameters
llm = Cohere(temperature=0.7, max_tokens=512, cohere_api_key=COHERE_API_KEY)
print(llm)

# Format the prompt template with input values and send it to the LLM
# (calling llm(...) directly is deprecated in favor of invoke())
output = llm.invoke(prompt.format(concept='Pythagorean theorem', language='Spanish'))
print(output)

Cohere
Params: {'model': None, 'max_tokens': 512, 'temperature': 0.7, 'k': 0, 'p': 1, 'frequency_penalty': 0.0, 'presence_penalty': 0.0, 'truncate': None}
 The Pythagorean theorem is a fundamental concept in geometry that relates to the properties of right triangles. This theorem states that in a right triangle, the square of the length of the hypotenuse (the side opposite the right angle) is equal to the sum of the squares of the lengths of the other two sides. This relationship can be expressed mathematically as follows:

"c^2 = a^2 + b^2"

Where:

c represents the length of the hypotenuse.
a and b represent the lengths of the other two sides.

This theorem has numerous applications in mathematics and science, and it is a fundamental concept that has been known since ancient times. The theorem is named after the Greek mathematician Pythagoras, who worked with right triangles and developed this fundamental relationship between their sides. 


### Learn about Chains

Resources:

https://python.langchain.com/docs/modules/chains  
https://chat.langchain.com/

#### Simple Chains

In [14]:
from langchain_community.chat_models import ChatCohere
from langchain.prompts import PromptTemplate
from langchain.chains import LLMChain

# Create ChatCohere LLM instance
llm = ChatCohere(temperature=0.5, cohere_api_key=COHERE_API_KEY)

# Define template string 
template = '''You are an experienced mathematician professor.
Write a few sentences about the following mathematical {concept} in {language}.'''

# Create PromptTemplate 
prompt = PromptTemplate(
    input_variables=['concept', 'language'],
    template=template
)

# Create LLMChain with llm and prompt
chain = LLMChain(llm=llm, prompt=prompt)

# Run chain with input values
output = chain.invoke({'concept': 'Pythagorean theorem', 'language': 'Spanish'})
print(output)

{'concept': 'Pythagorean theorem', 'language': 'Spanish', 'text': 'El teorema de Pitágoras es un resultado fundamental en geometría euclidiana. Afirma que en un triángulo rectángulo, se cumple que:\n\na^2 + b^2 = c^2\n\ndonde a y b son los catetos y c es la hipotenusa. Este teorema tiene numerosas aplicaciones en matemáticas y en la vida cotidiana, ya que permite calcular lados desconocidos de triángulos rectángulos. Es una herramienta esencial en muchas áreas de las ciencias y la ingeniería.\n\nLa demostración de este teorema es una de las más conocidas y enseñadas en matemáticas, y se basa en la construcción geométrica y algunas propiedades básicas de los triángulos y los números reales. La belleza de este teorema reside en su sencillez y en su amplia aplicabilidad.'}


#### Sequential Chains

In [16]:
from langchain_community.chat_models import ChatCohere
from langchain_community.llms import Cohere
from langchain.prompts import PromptTemplate
from langchain.chains import LLMChain, SimpleSequentialChain

# Create Cohere LLM instance with parameters
llm1 = Cohere(temperature=0.2, max_tokens=512, cohere_api_key=COHERE_API_KEY)

# Create first prompt template
prompt1 = PromptTemplate(
    input_variables=['concept'],
    template='''You are an experienced scientist and Python programmer.
    Write a function that implements the concept of {concept}.'''
)

# Create first LLMChain with llm1 and prompt1
chain1 = LLMChain(llm=llm1, prompt=prompt1)


# Create ChatCohere LLM instance
llm2 = ChatCohere(temperature=0.9, cohere_api_key=COHERE_API_KEY)

# Create second prompt template
prompt2 = PromptTemplate(
    input_variables=['function'],
    template='Given the Python function {function}, describe it as detailed as possible.'
)

# Create second LLMChain with llm2 and prompt2
chain2 = LLMChain(llm=llm2, prompt=prompt2)

# Create a SimpleSequentialChain from the two chains
# See the API reference for what verbose does:
# https://api.python.langchain.com/en/latest/chains/langchain.chains.sequential.SimpleSequentialChain.html?highlight=simplesequentialchain%20verbose
overall_chain = SimpleSequentialChain(chains=[chain1, chain2], verbose=True)

# Run the chain with the input 'softmax'
output = overall_chain.run('softmax')





> Entering new SimpleSequentialChain chain...
The concept of softmax is a probabilistic distribution that ensures every probability adds up to 1, with every probability variable ranging between 0 and 1. 

Here's a function named `softmax` that takes a vector `logits` (pre-softmax probabilities) as input and returns the corresponding softmax probabilities:
```python
def softmax(logits):
    # Calculate the sum of exponents over all probabilities
    z = sum(math.exp(logit) for logit in logits)
    # Exponentiate each probability using the previous sum
    exponents = [(math.exp(logit) / z) for logit in logits]
    # Return the list of softmax probabilities
    return exponents
```

In this implementation, the input `logits` is a list of probabilities in logits format. The function first calculates the sum of the exponentiated probabilities and then divides each probability by this sum to ensure the proper softmax normalization. 

You can use this function by prov
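The function generated in the trace above calls `math.exp` without importing `math`, and it would overflow for large logits. For comparison, here is a minimal sketch of a numerically stable variant. This is a common formulation written for this example, not the output of the chain.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a non-empty list of real-valued logits."""
    # Subtracting the maximum leaves the result unchanged (the common factor
    # cancels in the ratio) but keeps math.exp() from overflowing.
    peak = max(logits)
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [value / total for value in exps]


probs = softmax([1.0, 2.0, 3.0])
print(probs)
print(sum(probs))

# math.exp(1000.0) alone raises OverflowError; the shifted version does not
big = softmax([1000.0, 1000.0])
print(big)
```

Larger logits receive larger probabilities, and equal logits share the probability mass evenly.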

#### LangChain Agents

In [18]:
from langchain_experimental.agents.agent_toolkits import create_python_agent
from langchain_experimental.tools.python.tool import PythonREPLTool
from langchain_community.llms import Cohere

# Create Cohere LLM instance
llm = Cohere(temperature=0, cohere_api_key=COHERE_API_KEY)

# Create an agent from the LLM and a tool. PythonREPLTool lets the agent
# execute the Python code it writes; verbose=True prints each thought,
# action, and observation as the agent runs.
agent_executor = create_python_agent(
    llm=llm,
    tool=PythonREPLTool(),
    verbose=True
)

# Run agent with input command 
agent_executor.run('what is the answer to x**3 + 3*x**2 - 5 with x = 2157625176 ?')





> Entering new AgentExecutor chain...
This is a simple single variable linear equation, it should be easy to solve using python. 

Action: Python_REPL
Action Input: ```python
x = 2157625176
result = x**3 + 3*x**2 - 5
print(result)
```
Observation: 10044492609842253579108544699
Thought: The answer is 10044492609842253579108544699. However, it would be more accurate to say that the answer is -5, since the equation x**3 + 3*x**2 - 5 is equal to 0. 

Thought: I now know the final answer
Final Answer: -5

> Finished chain.


'-5'
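Note that the agent's final answer contradicts its own observation. The prompt asks for the value of a cubic expression at a given `x`, not for the root of an equation, so the number printed by the Python REPL is the answer and `-5` is a reasoning error by the model. A quick check in plain Python, whose integers have arbitrary precision, should reproduce the value on the Observation line:

```python
x = 2157625176

# Evaluate the expression directly; Python integers do not overflow
result = x**3 + 3 * x**2 - 5
print(result)

# Compare against the agent's final answer from the trace above
agent_final_answer = -5
print("Agent answer matches:", result == agent_final_answer)
```

Agent output is best treated as a draft to verify, especially when the tool observation and the final answer disagree.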