# LlamaIndex Platform Demo

## Step 0: Set up environment config for platform

In [1]:
import os

# Placeholder value: replace with your own platform API key
os.environ["LLAMA_CLOUD_API_KEY"] = "your-api-key"
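
Hard-coding the key in a cell makes it easy to leak when the notebook is shared. A minimal sketch of one alternative follows, assuming an interactive session: `ensure_api_key` is a hypothetical helper (not part of LlamaIndex) that reuses a key already present in the environment and prompts for it only when it is missing.

```python
import os
from getpass import getpass


def ensure_api_key(var_name="LLAMA_CLOUD_API_KEY", prompt=getpass):
    """Return the key stored in `var_name`, prompting only when it is unset or empty."""
    key = os.environ.get(var_name)
    if not key:
        # Hypothetical fallback: ask interactively instead of committing the key to the notebook.
        key = prompt(f"Enter {var_name}: ")
        os.environ[var_name] = key
    return key
```

Calling `ensure_api_key()` at the top of the notebook would then replace the direct assignment; the `prompt` parameter exists only so the input source can be swapped out, for example in a non-interactive run.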

## Step 1: Configure ingestion pipeline (data source, transformations)

In [7]:
from llama_index.core.ingestion import IngestionPipeline
from llama_index.core import SimpleDirectoryReader
from llama_index.core.node_parser import SentenceSplitter
from llama_index.embeddings.openai import OpenAIEmbedding

In [8]:
# Load the Uber 2021 PDF into a list of Document objects
reader = SimpleDirectoryReader(input_files=["data_sec/source_files/uber_2021.pdf"])
docs = reader.load_data()
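
If the PDF path is wrong, the failure surfaces only when the reader runs. As a rough sketch, a small pre-check can list missing inputs up front; `find_missing_files` is a hypothetical helper written for this illustration and is not part of LlamaIndex.

```python
from pathlib import Path


def find_missing_files(paths):
    """Return the entries of `paths` that do not point at an existing regular file."""
    return [p for p in paths if not Path(p).is_file()]


# Example: check the notebook's input before constructing the reader.
missing = find_missing_files(["data_sec/source_files/uber_2021.pdf"])
if missing:
    print(f"Missing input files: {missing}")
```

An empty result means every path resolves to a file, so the same list can be passed on as `input_files`.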

In [9]:
sec_pipeline = IngestionPipeline(
    project_name="sec analysis",
    name="uber",
    documents=docs,
    # Transformations run in order: split documents into chunks, then embed each chunk
    transformations=[
        SentenceSplitter(),
        OpenAIEmbedding(),
    ],
)

In [10]:
sec_pipeline_id = sec_pipeline.register()

Pipeline available at: https://cloud.llamaindex.ai/project/a18cb1c9-393e-44bc-af5b-2cbc554c7a3f/playground/ca240136-f463-46d4-a7f9-70f9566c0b31


In [None]:
from llama_index.core.llama_dataset import LabelledRagDataset

# Load the labelled dataset and keep the queries from its first five examples
rag_dataset = LabelledRagDataset.from_json("./data_sec/rag_dataset.json")
questions = [example.query for example in rag_dataset.examples[:5]]

In [14]:
for ind, question in enumerate(questions):
    print(f"{ind + 1}. {question}")

1. According to the context information provided, what is the state of incorporation for Uber Technologies, Inc., and what is the company's IRS Employer Identification Number?
2. Based on the information from the document, which type of annual report did Uber Technologies, Inc. file with the SEC for the fiscal year ended December 31, 2021, and on which stock exchange is Uber's Common Stock registered?
3. According to the context information provided from the "uber_2021.pdf" document, what is the classification of the filer as indicated by the check mark, and what does this classification imply regarding the company's filing requirements?
4. As of June 30, 2021, what was the aggregate market value of the voting and non-voting common equity held by non-affiliates of the registrant, and on which stock exchange was this value based?
5. According to the table of contents in the "UBER TECHNOLOGIES, INC." document, what are the main topics covered under Item 7 in Part II, and on which page do
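
Before uploading, it can help to make sure the question list contains no blank or repeated entries. The following is a minimal sketch of such a check; `clean_questions` is a hypothetical helper for illustration, not a LlamaIndex function, and it only handles exact duplicates after whitespace is stripped.

```python
def clean_questions(questions):
    """Strip whitespace, drop empty strings, and remove exact duplicates, keeping first-seen order."""
    seen = set()
    cleaned = []
    for q in questions:
        q = q.strip()
        if q and q not in seen:
            seen.add(q)
            cleaned.append(q)
    return cleaned
```

Under this assumption, `questions = clean_questions(questions)` would run just before the upload cell.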

In [15]:
from llama_index.core.evaluation.eval_utils import upload_eval_dataset

upload_eval_dataset(
    project_name='sec analysis',
    dataset_name='AI generated - 5 questions',
    questions=questions,
    overwrite=True,
)

Uploaded 5 questions to dataset AI generated - 5 questions


'27d98fff-4dd2-4eb0-8904-24128082eeea'