# Bedrock with LangChain - Explain/Interpret a code snippet or program 
> *This notebook should work well with the **`Data Science 3.0`** kernel in SageMaker Studio*

## Introduction

In this notebook we show you how to explain or interpret a given code snippet or program.

[LangChain](https://python.langchain.com/docs/get_started/introduction.html) is a framework for developing applications powered by language models. The key aspects of this framework allow us to augment the Large Language Models by chaining together various components to create advanced use cases.

In this notebook we will use the Bedrock API provided by LangChain. This example builds a custom LangChain prompt template that adds context to the code explanation request.

**Note:** *This notebook can be run inside or outside an AWS environment.*

#### Context
In this notebook we will use the LangChain framework and explore the Bedrock API with the help of `PromptTemplates`. `PromptTemplates` let you create generic shells that can be populated with information later, so you can get model outputs for different scenarios.

We will also explore the Amazon Bedrock integration within the LangChain framework and how it can be used to generate or explain code with the help of `PromptTemplate`.

#### Pattern
We pass the LangChain implementation of the Amazon Bedrock API an input consisting of a task, an instruction, and the input itself, and let the underlying model generate an output without any additional examples (zero-shot prompting). The aim is to show how capable LLMs can understand the task at hand and produce compelling output.

![](./images/code-interpret-langchain.png)
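
The pattern above can be sketched in plain Python. This is only an illustration of how a task, an instruction, and an input combine into one zero-shot prompt; the helper name `build_zero_shot_prompt` and the example strings are assumptions made for this sketch and are not part of LangChain or Amazon Bedrock.

```python
def build_zero_shot_prompt(task: str, instruction: str, model_input: str) -> str:
    """Combine task, instruction, and input into one prompt with no worked examples."""
    sections = [
        task.strip(),
        instruction.strip(),
        # Wrap the input in tags so the model can tell it apart from the instructions.
        "<code>\n" + model_input.strip() + "\n</code>",
    ]
    return "\n\n".join(sections)


prompt_text = build_zero_shot_prompt(
    task="You will be acting as an expert software developer in C++.",
    instruction="Explain the code below and point out any red flags.",
    model_input="int main() { return 0; }",
)
print(prompt_text)
```

No example explanations are included in the prompt, which is what makes the request zero-shot.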

#### Use case
To demonstrate the code capabilities of models in Amazon Bedrock, let's take the use case of code explanation.

#### Persona
You are Joe, a Java software developer who has been tasked with supporting a legacy C++ application for Vehicle Fleet Management. You need help explaining or interpreting certain complex C++ code snippets as you perform analysis to identify the business logic and potential problems with the code.

#### Implementation
To fulfill this use case, we will show you how you can use the Amazon Bedrock API with LangChain to explain C++ code snippets.


In [1]:
from IPython.display import display_markdown, Markdown
import boto3

# Bedrock runtime client used for model invocation; region and credentials
# come from the environment (AWS config, environment variables, or an attached role).
boto3_bedrock = boto3.client('bedrock-runtime')

## Invoke the Bedrock LLM Model

We'll begin by creating an instance of the `ChatBedrockConverse` class from `langchain_aws`. It expects a `model_id`, which is the ID of a model (or inference profile) available in Amazon Bedrock.

Optionally, you can pass in a previously created boto3 client as well as inference parameters. The cell below sets `temperature` and `max_tokens`; model-native request bodies use names such as `topP`, `maxTokenCount`, or `stopSequences` (the Amazon Bedrock console lists the parameters for each model).

Check the [documentation](https://docs.aws.amazon.com/bedrock/latest/userguide/model-ids-arns.html) for the text generation model IDs available in Amazon Bedrock.

Note that different models support different inference parameters.
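
To make the parameter names concrete, here is a minimal sketch of the request that a Bedrock Converse call takes, built as a plain dictionary. The helper `build_converse_request` is hypothetical and written for this illustration; the keys follow the shape of the boto3 `bedrock-runtime` `converse` operation as I understand it, so check them against the API reference before relying on them.

```python
from typing import Any, Optional


def build_converse_request(
    model_id: str,
    user_text: str,
    temperature: Optional[float] = None,
    max_tokens: Optional[int] = None,
    top_p: Optional[float] = None,
    stop_sequences: Optional[list[str]] = None,
) -> dict[str, Any]:
    """Build keyword arguments for a bedrock-runtime `converse` call (hypothetical helper)."""
    candidate_config = {
        "temperature": temperature,
        "maxTokens": max_tokens,
        "topP": top_p,
        "stopSequences": stop_sequences,
    }
    # Only send parameters the caller explicitly set, so model defaults apply otherwise.
    inference_config = {k: v for k, v in candidate_config.items() if v is not None}
    request: dict[str, Any] = {
        "modelId": model_id,
        "messages": [{"role": "user", "content": [{"text": user_text}]}],
    }
    if inference_config:
        request["inferenceConfig"] = inference_config
    return request


request = build_converse_request(
    "us.amazon.nova-lite-v1:0", "Explain this code: int x = 0;", temperature=0.5
)
print(request["inferenceConfig"])  # {'temperature': 0.5}

# With a client such as boto3_bedrock from the first cell (requires AWS access):
# response = boto3_bedrock.converse(**request)
```

Leaving unset parameters out of the request, rather than sending `None`, lets each model fall back to its own defaults.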

In [2]:
from langchain_aws import ChatBedrockConverse


textgen_llm = ChatBedrockConverse(
    model_id="us.amazon.nova-lite-v1:0",
    client=boto3_bedrock,
    max_tokens=None,
    temperature=0.5
)

## Create a LangChain custom prompt template

By creating a template for the prompt, we can pass it different input variables on every run. This is useful when you have to generate content from varying inputs, for example values fetched from a database.
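
As a rough illustration of the idea, the sketch below fills one template with several sets of variables using only the Python standard library. It is not LangChain's implementation; the helpers `template_variables` and `fill_template` and the sample rows are assumptions made for this example, although the `{variable}` placeholder style matches the template used later in this notebook.

```python
from string import Formatter


def template_variables(template: str) -> set[str]:
    """Return the named placeholders that appear in a str.format-style template."""
    return {name for _, name, _, _ in Formatter().parse(template) if name}


def fill_template(template: str, **values: str) -> str:
    """Fill the template, failing early if a variable is missing."""
    missing = template_variables(template) - set(values)
    if missing:
        raise KeyError(f"missing template variables: {sorted(missing)}")
    return template.format(**values)


template = "Explain the following {programmingLanguage} code:\n{code}"

# Hypothetical rows standing in for records fetched from a database.
rows = [
    {"programmingLanguage": "C++", "code": "int x = 0;"},
    {"programmingLanguage": "Python", "code": "x = 0"},
]
prompts = [fill_template(template, **row) for row in rows]
print(prompts[1])
```

Checking for missing variables before formatting gives a clearer error than a failed substitution deep inside a batch run.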

In [3]:
# Vehicle Fleet Management Code written in C++
sample_code = """
#include <iostream>
#include <string>
#include <vector>

class Vehicle {
protected:
    std::string registrationNumber;
    int milesTraveled;
    int lastMaintenanceMile;

public:
    Vehicle(std::string regNum) : registrationNumber(regNum), milesTraveled(0), lastMaintenanceMile(0) {}

    virtual void addMiles(int miles) {
        milesTraveled += miles;
    }

    virtual void performMaintenance() {
        lastMaintenanceMile = milesTraveled;
        std::cout << "Maintenance performed for vehicle: " << registrationNumber << std::endl;
    }

    virtual void checkMaintenanceDue() {
        if ((milesTraveled - lastMaintenanceMile) > 10000) {
            std::cout << "Vehicle: " << registrationNumber << " needs maintenance!" << std::endl;
        } else {
            std::cout << "No maintenance required for vehicle: " << registrationNumber << std::endl;
        }
    }

    virtual void displayDetails() = 0;

    ~Vehicle() {
        std::cout << "Destructor for Vehicle" << std::endl;
    }
};

class Truck : public Vehicle {
    int capacityInTons;

public:
    Truck(std::string regNum, int capacity) : Vehicle(regNum), capacityInTons(capacity) {}

    void displayDetails() override {
        std::cout << "Truck with Registration Number: " << registrationNumber << ", Capacity: " << capacityInTons << " tons." << std::endl;
    }
};

class Car : public Vehicle {
    std::string model;

public:
    Car(std::string regNum, std::string carModel) : Vehicle(regNum), model(carModel) {}

    void displayDetails() override {
        std::cout << "Car with Registration Number: " << registrationNumber << ", Model: " << model << "." << std::endl;
    }
};

int main() {
    std::vector<Vehicle*> fleet;

    fleet.push_back(new Truck("XYZ1234", 20));
    fleet.push_back(new Car("ABC9876", "Sedan"));

    for (auto vehicle : fleet) {
        vehicle->displayDetails();
        vehicle->addMiles(10500);
        vehicle->checkMaintenanceDue();
        vehicle->performMaintenance();
        vehicle->checkMaintenanceDue();
    }

    for (auto vehicle : fleet) {
        delete vehicle; 
    }

    return 0;
}
"""

In [4]:
from langchain.prompts import PromptTemplate

# Create a prompt template that has multiple input variables
multi_var_prompt = PromptTemplate(
    input_variables=["code", "programmingLanguage"], 
    template="""

Human: You will be acting as an expert software developer in {programmingLanguage}. 
You will explain the below code and highlight if there are any red flags or where best practices are not being followed.
<code>
{code}
</code>

Assistant:"""
)

# Pass in values to the input variables
prompt = multi_var_prompt.format(code=sample_code, programmingLanguage="C++")


### Explain C++ Code for Vehicle Fleet management using Amazon Bedrock and LangChain

In [5]:
response = textgen_llm.invoke(prompt)

code_explanation = response.content

display_markdown(Markdown(code_explanation))

Certainly! Let's go through the provided C++ code and highlight any potential issues or areas where best practices are not being followed.

### Code Overview

The code defines a class hierarchy for vehicles, with a base class `Vehicle` and two derived classes `Truck` and `Car`. The `Vehicle` class includes methods for adding miles, performing maintenance, checking if maintenance is due, and displaying details. The derived classes override the `displayDetails` method to provide specific details for trucks and cars.

### Analysis and Best Practices

1. **Memory Management**:
    - **Red Flag**: The `main` function dynamically allocates memory for `Truck` and `Car` objects using `new` but does not properly handle the deletion of these objects. This leads to memory leaks.
    - **Best Practice**: Use smart pointers (e.g., `std::unique_ptr` or `std::shared_ptr`) to manage the memory automatically and avoid manual `delete` calls.

2. **Destructor**:
    - **Best Practice**: The destructor of the `Vehicle` class prints a message. While this is fine for debugging, it is generally better to avoid side effects in destructors. Destructors should focus on releasing resources.

3. **Virtual Destructor**:
    - **Red Flag**: The `Vehicle` class does not have a virtual destructor. If you plan to delete derived class objects through a base class pointer, you should declare the destructor as `virtual`.
    - **Best Practice**: Add a virtual destructor to the `Vehicle` class to ensure that the derived class destructors are called correctly.

4. **Output in Methods**:
    - **Best Practice**: Methods such as `checkMaintenanceDue` and `performMaintenance` print messages directly to `std::cout`. It is generally better to separate concerns by having these methods return status information and let the caller decide what to do with it.

5. **Const-Correctness**:
    - **Best Practice**: Methods that do not modify the object's state should be marked as `const`. For example, `checkMaintenanceDue` and `displayDetails` should be `const`.

6. **Encapsulation**:
    - **Best Practice**: The `milesTraveled` and `lastMaintenanceMile` members are public in the `Vehicle` class. It is better to keep them protected or private and provide getter methods if needed.

### Improved Code

Here's a revised version of the code with the above best practices in mind:

```cpp
#include <iostream>
#include <string>
#include <vector>
#include <memory>

class Vehicle {
protected:
    std::string registrationNumber;
    int milesTraveled;
    int lastMaintenanceMile;

public:
    Vehicle(std::string regNum) : registrationNumber(std::move(regNum)), milesTraveled(0), lastMaintenanceMile(0) {}

    void addMiles(int miles) {
        milesTraveled += miles;
    }

    void performMaintenance() {
        lastMaintenanceMile = milesTraveled;
    }

    bool checkMaintenanceDue() const {
        return (milesTraveled - lastMaintenanceMile) > 10000;
    }

    virtual void displayDetails() const = 0;

    virtual ~Vehicle() = default; // Virtual destructor
};

class Truck : public Vehicle {
    int capacityInTons;

public:
    Truck(std::string regNum, int capacity) : Vehicle(std::move(regNum)), capacityInTons(capacity) {}

    void displayDetails() const override {
        std::cout << "Truck with Registration Number: " << registrationNumber << ", Capacity: " << capacityInTons << " tons." << std::endl;
    }
};

class Car : public Vehicle {
    std::string model;

public:
    Car(std::string regNum, std::string carModel) : Vehicle(std::move(regNum)), model(std::move(carModel)) {}

    void displayDetails() const override {
        std::cout << "Car with Registration Number: " << registrationNumber << ", Model: " << model << "." << std::endl;
    }
};

int main() {
    std::vector<std::unique_ptr<Vehicle>> fleet;

    fleet.push_back(std::make_unique<Truck>("XYZ1234", 20));
    fleet.push_back(std::make_unique<Car>("ABC9876", "Sedan"));

    for (auto& vehicle : fleet) {
        vehicle->displayDetails();
        vehicle->addMiles(10500);
        if (vehicle->checkMaintenanceDue()) {
            std::cout << "Vehicle: " << vehicle->registrationNumber << " needs maintenance!" << std::endl;
        } else {
            std::cout << "No maintenance required for vehicle: " << vehicle->registrationNumber << std::endl;
        }
        vehicle->performMaintenance();
        if (vehicle->checkMaintenanceDue()) {
            std::cout << "Vehicle: " << vehicle->registrationNumber << " needs maintenance!" << std::endl;
        } else {
            std::cout << "No maintenance required for vehicle: " << vehicle->registrationNumber << std::endl;
        }
    }

    return 0;
}
```

### Summary of Changes

1. **Memory Management**: Replaced manual `new` and `delete` with `std::unique_ptr`.
2. **Virtual Destructor**: Added a virtual destructor to the `Vehicle` class.
3. **Const-Correctness**: Marked methods that do not modify the object as `const`.
4. **Encapsulation**: Kept data members protected.
5. **Output Handling**: Moved output handling to the caller in the `main` function.

## Summary

To conclude, we learnt that invoking the LLM without any context might not yield the desired results. By adding context and using a prompt template to constrain the LLM's output, we were able to get the output we wanted.