# **Project Hint - Reading the Data from Database**

In [1]:
import sqlite3
import pandas as pd

## **Step 1 - Reading the Tables from Database file**

In [2]:
# Read the code below and write your observation in the next cell

conn = sqlite3.connect('eng_subtitles_database.db')
cursor = conn.cursor()
cursor.execute("SELECT name FROM sqlite_master WHERE type='table'")
print(cursor.fetchall())

[('zipfiles',)]


**In the above cell, I am able to read the table inside the database. As mentioned earlier, table name is `zipfiles`. We also know from README.txt that this table contains three columns: 'num', 'name' and 'content'.**

## **Step 2 - Reading the columns of Table**

In [3]:
cursor.execute("PRAGMA table_info('zipfiles')")
cols = cursor.fetchall()
for col in cols:
    print(col[1])

num
name
content


**The above code helps in checking the column names in the database table.**

**Let's now use `SELECT * FROM zipfiles` to read all the data into a `df` variable.**

## **Step 3 - Loading the Database Table inside a Pandas DataFrame**

In [4]:
df = pd.read_sql_query("""SELECT * FROM zipfiles""", conn)
df.head()

Unnamed: 0,num,name,content
0,9180533,the.message.(1976).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\x1c\xa9\x...
1,9180583,here.comes.the.grump.s01.e09.joltin.jack.in.bo...,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\x17\xb9\x...
2,9180592,yumis.cells.s02.e13.episode.2.13.(2022).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00L\xb9\x99V...
3,9180594,yumis.cells.s02.e14.episode.2.14.(2022).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00U\xa9\x99V...
4,9180600,broker.(2022).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x001\xa9\x99V...


In [5]:
df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 82498 entries, 0 to 82497
Data columns (total 3 columns):
 #   Column   Non-Null Count  Dtype 
---  ------   --------------  ----- 
 0   num      82498 non-null  int64 
 1   name     82498 non-null  object
 2   content  82498 non-null  object
dtypes: int64(1), object(2)
memory usage: 1.9+ MB


**Looks like the `content` column donot contain the subtitles text. Instead as mentioned in README.txt, it might be latin-1 encoded.**

## **Step 4 - Printing `content` of 0th Row**

In [6]:
b_data = df.iloc[0, 2]

# here 2 represent the index of content column
# 0 represents the row number

In [7]:
print(b_data)

b'PK\x03\x04\x14\x00\x00\x00\x08\x00\x1c\xa9\x99V\x9fx\x96\xf0\x8c\x9e\x00\x00\x86\x9b\x01\x00;\x00\x00\x00The.Message.1976.REMASTERED.1080p.BluRay.x264-PiGNUS.EN.srt\xad\xbdm\x93\xdc\xc6\x91.\xfa\x9d\x11\xfc\x0f-}\xe1=\x11-\x9d\x06P\x85\x17\x9d\x8d\xd5%%[\xa4-Y>&u\x15>\xdf\xd0\xd3\x98\x19x\xfae\x0cts<\xfe\xf57\x9f\'\xb3\n\xd9\xa4\xbc\xbb\xf7\xc6Fl\xacELW\xa2\xaa\x90\x95\x95\xafO\x16/_l6\xdf\xe0\xff\xea\xf5f\xb3Y}\xf5\xd5\xbf\xaf\xf4AQ\xae7Mx\xf9\xe2\xd7\xfe|s\xbf\xea\x8f\xcf\xab\x8f\xe3n8\xadN\xc7\xfdx\x1cVO\xe3\xf9~\xf5\xf3\xe3p\xfc\xea\xfd/o>\xbc\xfb\xf0\xe3\xef\xde\xbf|\xf1\xfbi\x18Vo\xa6\xd3\xd3<L\xab\xe1\x1f\xe7\xe18\x8f\xa7\xe37\xab\xd3\xbc\xdb~-\xc3\x1e\xfe\xa7<|\xf9\xe2\xe5\x8bR_[~S\xd6\xeb\xa2k\xf3k\xe5A\xb7\xeeb\xf5\xf2\xc5\xbb\xe3\xea|?\xac\x8e\xfdaX\x9dnW?\x9cvk>8\x9c\xe6\xf3\xean\xeao\xc6\xd3ev\x8f~\x1a\xa6\x9b\xf1\xf6\xb2\xff\x1a\xe4\xabD\xbe*d\x11\xa5#_U\xeb\xaa\xd9`\xa6\xa7\xc3\xea\xa7\xcb}\x7f8\xf4F\xf9\xa7a\x9e\x87\xe3\x9d\xcc\\\xdf\x07B!\x13\xaa\xd61n<!\xd9\xaf\xd0\

**From the content, it appears to start with the bytes "PK\x03\......", which suggests that it might be a ZIP archive file. How do I know it? Experience! I have worked with something similar earlier.**

## **Step 5 - Unzipping the content of 385th row and decoding using `latin-1`**

In [8]:
import zipfile
import io

# Assuming 'content' is the binary data from your database
binary_data = df.iloc[385, 2]

# Decompress the binary data using the zipfile module
with io.BytesIO(binary_data) as f:
    with zipfile.ZipFile(f, 'r') as zip_file:
        # Reading only one file in the ZIP archive
        subtitle_content = zip_file.read(zip_file.namelist()[0])

# Now 'subtitle_content' should contain the extracted subtitle content
print(subtitle_content.decode('latin-1'))  # Assuming the content is latin-1 encoded text

1
00:00:06,000 --> 00:00:12,074
Watch any video online with Open-SUBTITLES
Free Browser extension: osdb.link/ext

2
00:00:15,370 --> 00:00:16,506
You lose everything, my girl.

3
00:00:16,530 --> 00:00:19,360
So you've said - four times.

4
00:00:20,330 --> 00:00:22,120
I definitely had
it on yesterday.

5
00:00:22,465 --> 00:00:25,785
Your gloves, your keys, that
handkerchief I embroidered for you

6
00:00:25,809 --> 00:00:26,168
Everything!

7
00:00:26,192 --> 00:00:27,280
Five times.

8
00:00:31,610 --> 00:00:32,920
Miss Scarlet?
- Yes.

9
00:00:36,390 --> 00:00:37,390
I'm Miss Scarlet.

10
00:00:37,872 --> 00:00:40,880
May I inquire if
you've lost something?

11
00:00:41,350 --> 00:00:42,530
Some jewellery perhaps?

12
00:00:42,870 --> 00:00:45,130
Yes, my mother's wedding ring.

13
00:00:45,220 --> 00:00:45,840
Have you found it?

14
00:00:45,950 --> 00:00:47,656
Does your ring have
an inscription?

15
00:00:48,650 -->

**Look's like it worked.**

## **Step 6 - Applying the above Function on the Entire Data**

In [9]:
import zipfile
import io

count = 0

def decode_method(binary_data):
    global count
    # Decompress the binary data using the zipfile module
    # print(count, end=" ")
    count += 1
    with io.BytesIO(binary_data) as f:
        with zipfile.ZipFile(f, 'r') as zip_file:
            # Assuming there's only one file in the ZIP archive
            subtitle_content = zip_file.read(zip_file.namelist()[0])
    
    # Now 'subtitle_content' should contain the extracted subtitle content
    return subtitle_content.decode('latin-1')  # Assuming the content is UTF-8 encoded text

In [10]:
df['file_content'] = df['content'].apply(decode_method)

df.head()

Unnamed: 0,num,name,content,file_content
0,9180533,the.message.(1976).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\x1c\xa9\x...,"1\r\n00:00:06,000 --> 00:00:12,074\r\nWatch an..."
1,9180583,here.comes.the.grump.s01.e09.joltin.jack.in.bo...,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\x17\xb9\x...,"1\r\n00:00:29,359 --> 00:00:32,048\r\nAh! Ther..."
2,9180592,yumis.cells.s02.e13.episode.2.13.(2022).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00L\xb9\x99V...,"1\r\n00:00:53,200 --> 00:00:56,030\r\n<i>Yumi'..."
3,9180594,yumis.cells.s02.e14.episode.2.14.(2022).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00U\xa9\x99V...,"1\r\n00:00:06,000 --> 00:00:12,074\r\nWatch an..."
4,9180600,broker.(2022).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x001\xa9\x99V...,"ï»¿1\r\n00:00:06,000 --> 00:00:12,074\r\nWatch..."


In [33]:
df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 82498 entries, 0 to 82497
Data columns (total 4 columns):
 #   Column        Non-Null Count  Dtype 
---  ------        --------------  ----- 
 0   num           82498 non-null  int64 
 1   name          82498 non-null  object
 2   content       82498 non-null  object
 3   file_content  82498 non-null  object
dtypes: int64(1), object(3)
memory usage: 2.5+ MB


In [11]:
df

Unnamed: 0,num,name,content,file_content
0,9180533,the.message.(1976).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\x1c\xa9\x...,"1\r\n00:00:06,000 --> 00:00:12,074\r\nWatch an..."
1,9180583,here.comes.the.grump.s01.e09.joltin.jack.in.bo...,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\x17\xb9\x...,"1\r\n00:00:29,359 --> 00:00:32,048\r\nAh! Ther..."
2,9180592,yumis.cells.s02.e13.episode.2.13.(2022).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00L\xb9\x99V...,"1\r\n00:00:53,200 --> 00:00:56,030\r\n<i>Yumi'..."
3,9180594,yumis.cells.s02.e14.episode.2.14.(2022).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00U\xa9\x99V...,"1\r\n00:00:06,000 --> 00:00:12,074\r\nWatch an..."
4,9180600,broker.(2022).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x001\xa9\x99V...,"ï»¿1\r\n00:00:06,000 --> 00:00:12,074\r\nWatch..."
...,...,...,...,...
82493,9521935,the.prophets.game.(2000).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\xb8\xa6\x...,"ï»¿1\r\n00:01:16,284 --> 00:01:19,537\r\nGod,\..."
82494,9521937,west.beirut.(1998).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\x13\x97\x...,"1\r\n00:00:06,000 --> 00:00:12,074\r\napi.Open..."
82495,9521938,frankenstein.the.true.story.(1973).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00$\x97\x9aV...,"1\r\n00:00:01,001 --> 00:00:04,630\r\n(Dramati..."
82496,9521940,frankenstein.the.true.story.(1973).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\x00\x97\x...,"1\r\n00:00:06,000 --> 00:00:12,074\r\nAdvertis..."


In [20]:
import nltk

# Download the punctuations
nltk.download('punkt')
# Download the stop words corpus
nltk.download('stopwords')
# Downloading wordnet before applying Lemmatizer
nltk.download('wordnet')
nltk.download('omw-1.4')

[nltk_data] Downloading package punkt to C:\Users\Abinay
[nltk_data]     Rachakonda\AppData\Roaming\nltk_data...
[nltk_data]   Package punkt is already up-to-date!
[nltk_data] Downloading package stopwords to C:\Users\Abinay
[nltk_data]     Rachakonda\AppData\Roaming\nltk_data...
[nltk_data]   Package stopwords is already up-to-date!
[nltk_data] Downloading package wordnet to C:\Users\Abinay
[nltk_data]     Rachakonda\AppData\Roaming\nltk_data...
[nltk_data]   Package wordnet is already up-to-date!
[nltk_data] Downloading package omw-1.4 to C:\Users\Abinay
[nltk_data]     Rachakonda\AppData\Roaming\nltk_data...
[nltk_data]   Package omw-1.4 is already up-to-date!


True

In [53]:
import re
import nltk
from nltk.tokenize import sent_tokenize, word_tokenize
from nltk.corpus import stopwords
from nltk.stem.porter import PorterStemmer
from nltk.stem import WordNetLemmatizer

In [54]:
stemmer = PorterStemmer()
## We can also use Lemmatizer instead of Stemmer
lemmatizer = WordNetLemmatizer()

In [51]:
def preprocess(raw_text, flag):
    # Removing special characters and digits
    sentence = re.sub("[^a-zA-Z]", " ", str(raw_text))
    
    # change sentence to lower case
    sentence = sentence.lower()

    # tokenize into words
    tokens = sentence.split()
    
    # remove stop words                
    clean_tokens = [t for t in tokens if not t in stopwords.words("english")]
    
    # Stemming/Lemmatization
    if(flag == 'stem'):
        clean_tokens = [stemmer.stem(word) for word in clean_tokens]
    else:
        clean_tokens = [lemmatizer.lemmatize(word) for word in clean_tokens]
    
    return pd.Series([" ".join(clean_tokens), len(clean_tokens)])

In [12]:
df=df.sample(n=10000)

In [13]:
df

Unnamed: 0,num,name,content,file_content
13279,9236872,love.my.way.s03.e08.and.in.the.end.(2007).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00U\x97\x99V...,"ï»¿1\r\n00:00:00,640 --> 00:00:01,473\r\n- It'..."
44425,9364321,beverly.hills.90210.s10.e22.the.easter.bunny.(...,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\x05\x0c\x...,"ï»¿1\r\n00:00:03,971 --> 00:00:05,482\r\nMAN: ..."
64279,9447124,csi.cyber.s01.e10.click.your.poison.(2015).eng...,b'PK\x03\x04\x14\x00\x00\x00\x08\x00KH\x9aV2\x...,ï»¿[Script Info]\r\nTitle: Default file\r\nScr...
63878,9446076,mom.s07.e02.pop.pop.and.a.puma.(2019).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00JG\x9aV\xe...,ï»¿[Script Info]\r\nTitle: Default file\r\nScr...
39196,9341999,dead.heat.(1988).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\x02\xbc\x...,"ï»¿1\r\n00:00:06,000 --> 00:00:12,074\r\nAdver..."
...,...,...,...,...
73303,9482761,the.brief.s02.e02.lack.of.affect.(2005).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00rv\x9aV\x7...,"ï»¿1\r\n00:00:00,167 --> 00:00:02,636\r\nSubti..."
75262,9491555,the.turnaround.(2017).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00-\x83\x9aV...,"ï»¿1\r\n00:00:43,112 --> 00:00:45,424\r\nAlber..."
78521,9505995,joe.90.s01.e10.big.fish.(1968).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\xbd\x89\x...,"1\r\n00:00:00,400 --> 00:00:03,000\r\n(violin ..."
38163,9337444,revenge.of.others.s01.e04.episode.1.4.(2022).e...,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\xf3\xb9\x...,"ï»¿1\r\n00:00:02,335 --> 00:00:04,814\r\nDon't..."


In [38]:
from tqdm import tqdm, tqdm_notebook

In [39]:
tqdm.pandas()

In [22]:
temp_df1 = df["file_content"].progress_apply(lambda x: preprocess(x, 'lemma'))

100%|██████████████████████████████████████████████████████████████████████████| 10000/10000 [6:47:36<00:00,  2.45s/it]


In [23]:
temp_df1

Unnamed: 0,0,1
21781,film reel clacking robert englund movie always...,3055
75469,got whole world hand got whole world hand got ...,3342
34923,cheering applause hello welcome top gear tonig...,3680
44434,steve course janet said ring yeah get one knee...,2571
25462,watch video online open subtitle free browser ...,5930
...,...,...
56201,previously velma oh dad need believe mom kidna...,1835
4149,previously font color fffc real housewife atla...,3695
39808,watch video online open subtitle free browser ...,755
42199,narrator worldwide six billion camera watching...,3011


In [25]:
temp_df1.columns = ['clean_text_lemma', 'text_length_lemma']

temp_df1.head()

Unnamed: 0,clean_text_lemma,text_length_lemma
21781,film reel clacking robert englund movie always...,3055
75469,got whole world hand got whole world hand got ...,3342
34923,cheering applause hello welcome top gear tonig...,3680
44434,steve course janet said ring yeah get one knee...,2571
25462,watch video online open subtitle free browser ...,5930


In [31]:
temp_df1.to_csv("subtitile.csv",index=False)

In [35]:
from sklearn.feature_extraction.text import TfidfVectorizer
vocab2 = TfidfVectorizer()
subtitiles_tfidf1 = vocab2.fit_transform(temp_df1['clean_text_lemma'])

In [36]:
subtitiles_tfidf1

<10000x172107 sparse matrix of type '<class 'numpy.float64'>'
	with 8651012 stored elements in Compressed Sparse Row format>

In [47]:
user_query = pd.Series([input('Enter the query:')])

user_query.progress_apply(lambda x: preprocess(x,flag='lemma'))

query_vector = vocab2.transform(user_query)

Enter the query:here.comes.the.grump.


100%|████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:00<00:00, 62.48it/s]


In [56]:
from sklearn.metrics.pairwise import cosine_similarity

cosine_similarity =  cosine_similarity(query_vector,subtitiles_tfidf1).flatten()
 
A = 10
    
top_A_indices = cosine_similarity.argsort()[-A:][::-1]
top_A_subtitles = temp_df1.iloc[top_A_indices] [:]    


In [57]:
top_A_indices 

array([2088, 1669, 2095, 8231, 6006, 1777, 9963, 5385, 5682, 7531],
      dtype=int64)

In [58]:
top_A_subtitles

Unnamed: 0,clean_text_lemma,text_length_lemma
3915,api opensubtitles org deprecated please implem...,491
19833,api opensubtitles org deprecated please implem...,621
13808,meet pair friendly snowman help u snowball gru...,494
60548,watch video online open subtitle free browser ...,1981
35254,use free code joinnow www playships eu ten tin...,3376
16308,bird crowing cat meowing mouse squeaking eleph...,802
24712,previously scorpion wanna know fella hell sylv...,2733
81992,jeff stranger begin adventure forever change l...,3119
42846,api opensubtitles org deprecated please implem...,1577
7191,person muffled cannot leave element person per...,1245


In [12]:
import pandas as pd

In [13]:
temp_df1 = pd.read_csv(r"C:\Users\Abinay Rachakonda\Desktop\sreach_Engine\subtitile.csv")

In [14]:
temp_df1.dropna(inplace=True)

In [15]:
temp_df1

Unnamed: 0,clean_text_lemma,text_length_lemma
0,film reel clacking robert englund movie always...,3055.0
1,got whole world hand got whole world hand got ...,3342.0
2,cheering applause hello welcome top gear tonig...,3680.0
3,steve course janet said ring yeah get one knee...,2571.0
5,ive go forward try get depressed try say okay ...,5930.0
...,...,...
10718,previously velma oh dad need believe mom kidna...,1835.0
10719,previously font color fffc real housewife atla...,3695.0
10720,watch video online open subtitle free browser ...,755.0
10721,narrator worldwide six billion camera watching...,3011.0


In [16]:
df = pd.concat([df, temp_df1], axis=1)

df.head()

Unnamed: 0,num,name,content,file_content,clean_text_lemma,text_length_lemma
0,9180533,the.message.(1976).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\x1c\xa9\x...,"1\r\n00:00:06,000 --> 00:00:12,074\r\nWatch an...",film reel clacking robert englund movie always...,3055.0
1,9180583,here.comes.the.grump.s01.e09.joltin.jack.in.bo...,b'PK\x03\x04\x14\x00\x00\x00\x08\x00\x17\xb9\x...,"1\r\n00:00:29,359 --> 00:00:32,048\r\nAh! Ther...",got whole world hand got whole world hand got ...,3342.0
2,9180592,yumis.cells.s02.e13.episode.2.13.(2022).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00L\xb9\x99V...,"1\r\n00:00:53,200 --> 00:00:56,030\r\n<i>Yumi'...",cheering applause hello welcome top gear tonig...,3680.0
3,9180594,yumis.cells.s02.e14.episode.2.14.(2022).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x00U\xa9\x99V...,"1\r\n00:00:06,000 --> 00:00:12,074\r\nWatch an...",steve course janet said ring yeah get one knee...,2571.0
4,9180600,broker.(2022).eng.1cd,b'PK\x03\x04\x14\x00\x00\x00\x08\x001\xa9\x99V...,"ï»¿1\r\n00:00:06,000 --> 00:00:12,074\r\nWatch...",,


In [39]:
df.isnull().sum()

num                      0
name                     0
content                  0
file_content             0
clean_text_lemma     72498
text_length_lemma    72498
dtype: int64

In [17]:
df = df[['num',"name",'clean_text_lemma']]

df.head()

Unnamed: 0,num,name,clean_text_lemma
0,9180533,the.message.(1976).eng.1cd,film reel clacking robert englund movie always...
1,9180583,here.comes.the.grump.s01.e09.joltin.jack.in.bo...,got whole world hand got whole world hand got ...
2,9180592,yumis.cells.s02.e13.episode.2.13.(2022).eng.1cd,cheering applause hello welcome top gear tonig...
3,9180594,yumis.cells.s02.e14.episode.2.14.(2022).eng.1cd,steve course janet said ring yeah get one knee...
4,9180600,broker.(2022).eng.1cd,


In [18]:
df.dropna(inplace=True)

In [19]:
df.shape

(10000, 3)

In [20]:
df

Unnamed: 0,num,name,clean_text_lemma
0,9180533,the.message.(1976).eng.1cd,film reel clacking robert englund movie always...
1,9180583,here.comes.the.grump.s01.e09.joltin.jack.in.bo...,got whole world hand got whole world hand got ...
2,9180592,yumis.cells.s02.e13.episode.2.13.(2022).eng.1cd,cheering applause hello welcome top gear tonig...
3,9180594,yumis.cells.s02.e14.episode.2.14.(2022).eng.1cd,steve course janet said ring yeah get one knee...
5,9180607,the.myth.(2005).eng.1cd,ive go forward try get depressed try say okay ...
...,...,...,...
10718,9226325,ncis.s04.e03.singled.out.(2006).eng.1cd,previously velma oh dad need believe mom kidna...
10719,9226326,ncis.s04.e04.faking.it.(2006).eng.1cd,previously font color fffc real housewife atla...
10720,9226327,ncis.s04.e05.dead.and.unburied.(2006).eng.1cd,watch video online open subtitle free browser ...
10721,9226328,ncis.s04.e06.witch.hunt.(2006).eng.1cd,narrator worldwide six billion camera watching...


In [21]:
def chunk_document(corpous, id_col, chunk_size=300):
    data = []

    for doc,id  in zip(corpous, id_col):
        
        words = doc.split()
        for i in range(0, len(words), chunk_size):
            chunk = ' '.join(words[i:i + chunk_size])


            data.append((id,chunk))

    df = pd.DataFrame(data)

    return df

In [22]:
chuncked_df = chunk_document(df.clean_text_lemma,df.num)

In [23]:
chuncked_df

Unnamed: 0,0,1
0,9180533,film reel clacking robert englund movie always...
1,9180533,reason romero horror film believe popularity c...
2,9180533,character react monstrous threat come together...
3,9180533,america paving way countless others follow rel...
4,9180533,sea still one landmark special effect film eve...
...,...,...
79468,9226329,small town oh mr wood idea small town um help ...
79469,9226329,pray lord bless keep adam wood keep rome man c...
79470,9226329,lynch mob man jean valjean steal loaf bread th...
79471,9226329,people miss ed lawson managed wish could stron...


In [24]:
chuncked_df.columns =['num','file_content_chunks']

In [25]:
chuncked_df

Unnamed: 0,num,file_content_chunks
0,9180533,film reel clacking robert englund movie always...
1,9180533,reason romero horror film believe popularity c...
2,9180533,character react monstrous threat come together...
3,9180533,america paving way countless others follow rel...
4,9180533,sea still one landmark special effect film eve...
...,...,...
79468,9226329,small town oh mr wood idea small town um help ...
79469,9226329,pray lord bless keep adam wood keep rome man c...
79470,9226329,lynch mob man jean valjean steal loaf bread th...
79471,9226329,people miss ed lawson managed wish could stron...


In [26]:
merged_df = chuncked_df.merge(df,left_on="num",right_on="num",how="left")

In [27]:
merged_df

Unnamed: 0,num,file_content_chunks,name,clean_text_lemma
0,9180533,film reel clacking robert englund movie always...,the.message.(1976).eng.1cd,film reel clacking robert englund movie always...
1,9180533,reason romero horror film believe popularity c...,the.message.(1976).eng.1cd,film reel clacking robert englund movie always...
2,9180533,character react monstrous threat come together...,the.message.(1976).eng.1cd,film reel clacking robert englund movie always...
3,9180533,america paving way countless others follow rel...,the.message.(1976).eng.1cd,film reel clacking robert englund movie always...
4,9180533,sea still one landmark special effect film eve...,the.message.(1976).eng.1cd,film reel clacking robert englund movie always...
...,...,...,...,...
79468,9226329,small town oh mr wood idea small town um help ...,ncis.s04.e07.sandblast.(2006).eng.1cd,wind chime chiming matthew yes right goal dog ...
79469,9226329,pray lord bless keep adam wood keep rome man c...,ncis.s04.e07.sandblast.(2006).eng.1cd,wind chime chiming matthew yes right goal dog ...
79470,9226329,lynch mob man jean valjean steal loaf bread th...,ncis.s04.e07.sandblast.(2006).eng.1cd,wind chime chiming matthew yes right goal dog ...
79471,9226329,people miss ed lawson managed wish could stron...,ncis.s04.e07.sandblast.(2006).eng.1cd,wind chime chiming matthew yes right goal dog ...


In [28]:
merged_df= merged_df[['num','name','file_content_chunks']]

In [29]:
merged_df

Unnamed: 0,num,name,file_content_chunks
0,9180533,the.message.(1976).eng.1cd,film reel clacking robert englund movie always...
1,9180533,the.message.(1976).eng.1cd,reason romero horror film believe popularity c...
2,9180533,the.message.(1976).eng.1cd,character react monstrous threat come together...
3,9180533,the.message.(1976).eng.1cd,america paving way countless others follow rel...
4,9180533,the.message.(1976).eng.1cd,sea still one landmark special effect film eve...
...,...,...,...
79468,9226329,ncis.s04.e07.sandblast.(2006).eng.1cd,small town oh mr wood idea small town um help ...
79469,9226329,ncis.s04.e07.sandblast.(2006).eng.1cd,pray lord bless keep adam wood keep rome man c...
79470,9226329,ncis.s04.e07.sandblast.(2006).eng.1cd,lynch mob man jean valjean steal loaf bread th...
79471,9226329,ncis.s04.e07.sandblast.(2006).eng.1cd,people miss ed lawson managed wish could stron...


In [30]:
for id, chunk in enumerate(merged_df):
    print(chunk)

num
name
file_content_chunks


In [60]:
pip install -U sentence-transformers

Collecting sentence-transformersNote: you may need to restart the kernel to use updated packages.





  Downloading sentence_transformers-2.7.0-py3-none-any.whl (171 kB)
     -------------------------------------- 171.5/171.5 kB 2.1 MB/s eta 0:00:00
Collecting transformers<5.0.0,>=4.34.0
  Downloading transformers-4.40.0-py3-none-any.whl (9.0 MB)
     ---------------------------------------- 9.0/9.0 MB 675.2 kB/s eta 0:00:00
Collecting huggingface-hub>=0.15.1
  Downloading huggingface_hub-0.22.2-py3-none-any.whl (388 kB)
     ------------------------------------ 388.9/388.9 kB 504.8 kB/s eta 0:00:00
Collecting fsspec>=2023.5.0
  Downloading fsspec-2024.3.1-py3-none-any.whl (171 kB)
     ------------------------------------ 172.0/172.0 kB 738.6 kB/s eta 0:00:00
Collecting safetensors>=0.4.1
  Downloading safetensors-0.4.3-cp310-none-win_amd64.whl (287 kB)
     ------------------------------------ 287.4/287.4 kB 842.6 kB/s eta 0:00:00
Collecting tokenizers<0.20,>=0.19
  Downloading tokenizers-0.19.1-cp310-none-win_amd64.whl (2.2 MB)
     ---------------------------------------- 2.2/2.2 

In [31]:
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer('all-MiniLM-L6-v2')

In [63]:
pip install  chromadb

Collecting chromadbNote: you may need to restart the kernel to use updated packages.

  Downloading chromadb-0.4.24-py3-none-any.whl (525 kB)
     -------------------------------------- 525.5/525.5 kB 2.4 MB/s eta 0:00:00
Collecting opentelemetry-exporter-otlp-proto-grpc>=1.2.0
  Downloading opentelemetry_exporter_otlp_proto_grpc-1.24.0-py3-none-any.whl (18 kB)
Collecting pulsar-client>=3.1.0
  Downloading pulsar_client-3.5.0-cp310-cp310-win_amd64.whl (3.3 MB)
     ---------------------------------------- 3.3/3.3 MB 6.6 MB/s eta 0:00:00
Collecting pypika>=0.48.9
  Downloading PyPika-0.48.9.tar.gz (67 kB)
     ---------------------------------------- 67.3/67.3 kB 3.6 MB/s eta 0:00:00
  Installing build dependencies: started
  Installing build dependencies: finished with status 'done'
  Getting requirements to build wheel: started
  Getting requirements to build wheel: finished with status 'done'
  Preparing metadata (pyproject.toml): started
  Preparing metadata (pyproject.toml): finish

ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
rasa 3.6.15 requires prompt-toolkit<3.0.29,>=3.0, but you have prompt-toolkit 3.0.41 which is incompatible.


In [32]:
import pandas as pd
from sentence_transformers import SentenceTransformer
from sklearn.metrics.pairwise import cosine_similarity
import chromadb

In [33]:
merged_df

Unnamed: 0,num,name,file_content_chunks
0,9180533,the.message.(1976).eng.1cd,film reel clacking robert englund movie always...
1,9180533,the.message.(1976).eng.1cd,reason romero horror film believe popularity c...
2,9180533,the.message.(1976).eng.1cd,character react monstrous threat come together...
3,9180533,the.message.(1976).eng.1cd,america paving way countless others follow rel...
4,9180533,the.message.(1976).eng.1cd,sea still one landmark special effect film eve...
...,...,...,...
79468,9226329,ncis.s04.e07.sandblast.(2006).eng.1cd,small town oh mr wood idea small town um help ...
79469,9226329,ncis.s04.e07.sandblast.(2006).eng.1cd,pray lord bless keep adam wood keep rome man c...
79470,9226329,ncis.s04.e07.sandblast.(2006).eng.1cd,lynch mob man jean valjean steal loaf bread th...
79471,9226329,ncis.s04.e07.sandblast.(2006).eng.1cd,people miss ed lawson managed wish could stron...


In [34]:
df = merged_df.sample(n=8000)

In [35]:
df

Unnamed: 0,num,name,file_content_chunks
19461,9192535,mommy.issues.(2021).eng.1cd,place start door open good morning easy say sp...
29590,9199194,empire.s03.e04.cupid.kills.(2016).eng.1cd,direct result willful preventable malpractice ...
70465,9221632,this.is.us.s05.e15.jerry.2.0.(2021).eng.1cd,narrator previously beauty beast something gem...
50661,9209861,red.rose.s01.e04.manchester.innit.(2022).eng.1cd,mother hajara married bayo refused osage offer...
50587,9209767,fruits.basket.s03.e11.goodbye.(2021).eng.1cd,dialogue dialogue unknown po head thing right ...
...,...,...,...
50344,9209734,fruits.basket.s02.e03.shall.we.go.and.get.you....,working gotta go spent lot money least open ba...
22868,9194963,american.horror.stories.s02.e03.drive.(2022).e...,deeper ever seen black side town white side to...
36814,9202976,doctor.of.doom.(1963).eng.1cd,tell july revolution remember date french revo...
40276,9204652,the.deuce.s02.e06.were.all.beasts.(2018).eng.1cd,kidding oh oh woe life empty without joy oh go...


In [36]:
df.info()

<class 'pandas.core.frame.DataFrame'>
Int64Index: 8000 entries, 19461 to 22930
Data columns (total 3 columns):
 #   Column               Non-Null Count  Dtype 
---  ------               --------------  ----- 
 0   num                  8000 non-null   int64 
 1   name                 8000 non-null   object
 2   file_content_chunks  8000 non-null   object
dtypes: int64(1), object(2)
memory usage: 250.0+ KB


In [40]:
from tqdm import tqdm, tqdm_notebook

In [41]:
tqdm.pandas()

In [42]:
df['doc_vector_pretrained_bert'] = df.file_content_chunks.progress_apply(model.encode)

df.head()

100%|███████████████████████████████████████████████████████████████████████████| 8000/8000 [15:40:40<00:00,  7.06s/it]


Unnamed: 0,num,name,file_content_chunks,doc_vector_pretrained_bert
19461,9192535,mommy.issues.(2021).eng.1cd,place start door open good morning easy say sp...,"[-0.0783906, -0.11347432, -0.004569102, -0.005..."
29590,9199194,empire.s03.e04.cupid.kills.(2016).eng.1cd,direct result willful preventable malpractice ...,"[-0.08458218, 0.008842241, 0.08269427, 0.00555..."
70465,9221632,this.is.us.s05.e15.jerry.2.0.(2021).eng.1cd,narrator previously beauty beast something gem...,"[-0.07014809, -0.065490745, -0.0038970595, 0.0..."
50661,9209861,red.rose.s01.e04.manchester.innit.(2022).eng.1cd,mother hajara married bayo refused osage offer...,"[-0.01848804, 0.033096064, -0.04405071, -0.016..."
50587,9209767,fruits.basket.s03.e11.goodbye.(2021).eng.1cd,dialogue dialogue unknown po head thing right ...,"[0.028834488, -0.0404045, -0.0050393566, -0.06..."


In [43]:
merged_df

Unnamed: 0,num,name,file_content_chunks
0,9180533,the.message.(1976).eng.1cd,film reel clacking robert englund movie always...
1,9180533,the.message.(1976).eng.1cd,reason romero horror film believe popularity c...
2,9180533,the.message.(1976).eng.1cd,character react monstrous threat come together...
3,9180533,the.message.(1976).eng.1cd,america paving way countless others follow rel...
4,9180533,the.message.(1976).eng.1cd,sea still one landmark special effect film eve...
...,...,...,...
79468,9226329,ncis.s04.e07.sandblast.(2006).eng.1cd,small town oh mr wood idea small town um help ...
79469,9226329,ncis.s04.e07.sandblast.(2006).eng.1cd,pray lord bless keep adam wood keep rome man c...
79470,9226329,ncis.s04.e07.sandblast.(2006).eng.1cd,lynch mob man jean valjean steal loaf bread th...
79471,9226329,ncis.s04.e07.sandblast.(2006).eng.1cd,people miss ed lawson managed wish could stron...


In [46]:
df

Unnamed: 0,num,name,file_content_chunks,doc_vector_pretrained_bert
19461,9192535,mommy.issues.(2021).eng.1cd,place start door open good morning easy say sp...,"[-0.0783906, -0.11347432, -0.004569102, -0.005..."
29590,9199194,empire.s03.e04.cupid.kills.(2016).eng.1cd,direct result willful preventable malpractice ...,"[-0.08458218, 0.008842241, 0.08269427, 0.00555..."
70465,9221632,this.is.us.s05.e15.jerry.2.0.(2021).eng.1cd,narrator previously beauty beast something gem...,"[-0.07014809, -0.065490745, -0.0038970595, 0.0..."
50661,9209861,red.rose.s01.e04.manchester.innit.(2022).eng.1cd,mother hajara married bayo refused osage offer...,"[-0.01848804, 0.033096064, -0.04405071, -0.016..."
50587,9209767,fruits.basket.s03.e11.goodbye.(2021).eng.1cd,dialogue dialogue unknown po head thing right ...,"[0.028834488, -0.0404045, -0.0050393566, -0.06..."
...,...,...,...,...
50344,9209734,fruits.basket.s02.e03.shall.we.go.and.get.you....,working gotta go spent lot money least open ba...,"[-0.10251991, -0.08886092, 0.030862536, -0.005..."
22868,9194963,american.horror.stories.s02.e03.drive.(2022).e...,deeper ever seen black side town white side to...,"[-0.00080629764, -0.040539347, -0.06315144, 0...."
36814,9202976,doctor.of.doom.(1963).eng.1cd,tell july revolution remember date french revo...,"[-0.016995015, -0.07232215, 0.009061228, -0.08..."
40276,9204652,the.deuce.s02.e06.were.all.beasts.(2018).eng.1cd,kidding oh oh woe life empty without joy oh go...,"[-0.025916412, -0.08842337, 0.033268996, -0.00..."


In [44]:
User_query = pd.Series([input("Enter a user Query " )])

Enter a user Query  Query  peter


In [55]:
User_query.progress_apply(lambda x:  preprocess(x, "lemma"))

100%|████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:04<00:00,  4.96s/it]


Unnamed: 0,0,1
0,query peter,2


In [62]:
df.info()

<class 'pandas.core.frame.DataFrame'>
Int64Index: 8000 entries, 19461 to 22930
Data columns (total 4 columns):
 #   Column                      Non-Null Count  Dtype 
---  ------                      --------------  ----- 
 0   num                         8000 non-null   int64 
 1   name                        8000 non-null   object
 2   file_content_chunks         8000 non-null   object
 3   doc_vector_pretrained_bert  8000 non-null   object
dtypes: int64(1), object(3)
memory usage: 570.5+ KB


In [67]:
df['doc_vector_pretrained_bert'] = df['doc_vector_pretrained_bert'].astype(str).str.replace("[","").str.replace("]","")


  df['doc_vector_pretrained_bert'] = df['doc_vector_pretrained_bert'].astype(str).str.replace("[","").str.replace("]","")


In [95]:
df['doc_vector_pretrained_bert'] = df['doc_vector_pretrained_bert'].apply(lambda x: [float(i) for i in x.split()])


In [96]:
a=df['doc_vector_pretrained_bert'][:1]

In [106]:
len(a.loc[19461,])

384

In [100]:
Query = User_query.progress_apply(model.encode)

100%|████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:00<00:00,  7.45it/s]


In [101]:
Query[0].shape

(384,)

In [109]:
import torch

In [116]:
# Convert the DataFrame column into a list of tensors
doc_vectors = [torch.tensor(vec) for vec in df['doc_vector_pretrained_bert']]

# Assuming Query[0] is also a tensor, convert it if necessary
query_tensor = torch.tensor(Query[0])

# Calculate cosine similarity
cos_similarities = [util.cos_sim(query_tensor, vec) for vec in doc_vectors]

In [118]:
cos_similarities

[tensor([[0.1163]]),
 tensor([[0.1591]]),
 tensor([[0.0599]]),
 tensor([[0.1696]]),
 tensor([[0.0965]]),
 tensor([[0.0318]]),
 tensor([[0.0871]]),
 tensor([[0.0605]]),
 tensor([[0.1196]]),
 tensor([[0.0883]]),
 tensor([[0.0688]]),
 tensor([[-0.0204]]),
 tensor([[0.1606]]),
 tensor([[0.1209]]),
 tensor([[0.0925]]),
 tensor([[0.1139]]),
 tensor([[0.0833]]),
 tensor([[0.1320]]),
 tensor([[0.0936]]),
 tensor([[0.1016]]),
 tensor([[0.2035]]),
 tensor([[-0.0028]]),
 tensor([[0.0860]]),
 tensor([[0.1116]]),
 tensor([[0.1195]]),
 tensor([[0.0880]]),
 tensor([[0.0618]]),
 tensor([[0.0491]]),
 tensor([[0.0951]]),
 tensor([[0.1162]]),
 tensor([[0.1488]]),
 tensor([[0.1023]]),
 tensor([[0.0723]]),
 tensor([[0.0497]]),
 tensor([[0.0708]]),
 tensor([[0.1241]]),
 tensor([[0.0711]]),
 tensor([[-0.0625]]),
 tensor([[0.0005]]),
 tensor([[0.0518]]),
 tensor([[0.1344]]),
 tensor([[0.0539]]),
 tensor([[0.0938]]),
 tensor([[0.1045]]),
 tensor([[0.0341]]),
 tensor([[0.0909]]),
 tensor([[0.0682]]),
 tensor([[

In [120]:
import numpy as np

cos_similarities_array = np.array(cos_similarities)
sorted_indices = cos_similarities_array.argsort()[::-1].tolist()


  cos_similarities_array = np.array(cos_similarities)
  cos_similarities_array = np.array(cos_similarities)


In [121]:
sorted_indices

[3734,
 6302,
 7737,
 3216,
 5544,
 7505,
 3204,
 5727,
 5661,
 5000,
 5073,
 4814,
 588,
 2626,
 7183,
 6964,
 5538,
 4418,
 3893,
 4532,
 7006,
 2039,
 5611,
 4282,
 5769,
 4124,
 4982,
 2756,
 4781,
 498,
 6822,
 7362,
 3111,
 4492,
 7325,
 723,
 3062,
 7806,
 7307,
 1798,
 4944,
 1258,
 882,
 3436,
 2759,
 4448,
 2942,
 2999,
 1768,
 4291,
 905,
 3557,
 2082,
 2447,
 3189,
 7686,
 4856,
 1488,
 4267,
 1680,
 1304,
 746,
 5723,
 3483,
 2908,
 6988,
 4225,
 2486,
 6494,
 6693,
 5068,
 1518,
 6825,
 6088,
 7007,
 7688,
 1473,
 5286,
 1067,
 1840,
 3407,
 4053,
 5535,
 6339,
 1532,
 4236,
 4190,
 608,
 261,
 378,
 6614,
 1608,
 6944,
 2446,
 4521,
 6298,
 1242,
 3049,
 6925,
 127,
 7815,
 5108,
 5944,
 5184,
 6595,
 245,
 5935,
 4962,
 59,
 2206,
 5492,
 736,
 7242,
 2767,
 2028,
 5361,
 6315,
 1411,
 3468,
 2275,
 7317,
 2816,
 6634,
 6898,
 4355,
 5572,
 7167,
 6755,
 6849,
 940,
 1109,
 1380,
 727,
 3070,
 5472,
 4875,
 1535,
 2555,
 7583,
 172,
 416,
 7934,
 3559,
 7998,
 7948,
 68

In [141]:
top_sub = merged_df.iloc[sorted_indices[::]]

In [142]:
a=top_sub['file_content_chunks']

In [143]:
a[0]

'film reel clacking robert englund movie always provided safe place face fear dark movie theater deal monster terrorizing u real life seeing defeated big screen horror film like dracula invisible man wolf man provided useful catharsis frightened populace movie gave audience place share collective fear even national trauma brought financial instability world war global tension followed financial fear wartime terror soon followed new threat screaming nation horrified thought nuclear destruction scientist going far wreaking havoc nature cold war soviet american threatened global destruction national fear nuclear annihilation communist infiltration even destructive global conflict world war ii resulted horror film science run amuck alien invasion sky extraterrestrial body snatcher right earth public ever drawn movie help confront anxiety film reel clacking click film reel clacking whirring indistinct conversation man drive movie take care everything courtship babysitting shelter marilyn mo

In [108]:
cos_sim = util.cos_sim(Query[0], df['doc_vector_pretrained_bert'])


ValueError: could not determine the shape of object type 'Series'

In [47]:
merged_df.to_csv("Sub_Titles_with_32.csv",index=False)

In [48]:
df.to_csv("Sub_Titles_BERT.csv",index=False)

In [152]:
doc_vectors[0]

tensor([-7.8391e-02, -1.1347e-01, -4.5691e-03, -5.6716e-03, -2.3426e-02,
         7.3255e-03, -5.6512e-03, -3.8667e-02, -3.6171e-02,  1.5680e-02,
         3.6550e-02, -1.0670e-02, -5.7361e-02, -4.9183e-02, -4.1457e-02,
        -2.9344e-02,  5.0526e-02,  3.5492e-02, -7.6502e-02,  4.5095e-02,
         2.6481e-02, -5.0474e-03,  4.9960e-02, -3.9480e-02, -1.0351e-01,
         4.6609e-02, -1.0493e-01, -6.3279e-04,  2.9298e-03, -3.3578e-02,
        -1.1555e-02,  9.6098e-02,  2.5953e-02,  3.7583e-02,  2.7453e-02,
         2.3105e-02, -1.8266e-02,  1.4465e-01,  3.6982e-02, -2.8531e-02,
        -5.5603e-02, -7.3799e-02, -7.9451e-03,  5.1660e-02, -3.8241e-02,
        -7.6856e-02, -3.3947e-02, -4.9130e-02,  4.0102e-02, -6.1319e-02,
        -3.9372e-02, -1.3360e-02,  3.0897e-02,  3.8811e-02, -3.8338e-02,
        -1.7315e-02,  3.6293e-04, -4.6049e-02, -3.7786e-02, -3.4589e-02,
        -5.0555e-02,  1.7924e-02, -4.3591e-03,  1.4208e-03, -3.1716e-02,
        -2.4761e-02, -1.1563e-02,  2.0041e-02,  4.5

In [148]:
client = chromadb.PersistentClient(path="Search_engine")

In [150]:
client

<chromadb.api.client.Client at 0x1f6f2cc2020>

In [151]:
 collection = client.create_collection(
        name="search",
        metadata={"hnsw:space": "cosine"} # l2 is the default
    )

In [245]:
for i, emb in enumerate(doc_vectors):
    embs=np.array(emb)
    collection.add(ids=f"chunks_{i}",
    embeddings=embs.tolist(),
                   documents=df.loc[i,"file_content_chunks"]
                  )


Add of existing embedding ID: chunks_0
Insert of existing embedding ID: chunks_0
Add of existing embedding ID: chunks_1
Insert of existing embedding ID: chunks_1
Add of existing embedding ID: chunks_2
Insert of existing embedding ID: chunks_2
Add of existing embedding ID: chunks_3
Insert of existing embedding ID: chunks_3
Add of existing embedding ID: chunks_4
Insert of existing embedding ID: chunks_4
Add of existing embedding ID: chunks_5
Insert of existing embedding ID: chunks_5
Add of existing embedding ID: chunks_6
Insert of existing embedding ID: chunks_6
Add of existing embedding ID: chunks_7
Insert of existing embedding ID: chunks_7
Add of existing embedding ID: chunks_8
Insert of existing embedding ID: chunks_8
Add of existing embedding ID: chunks_9
Insert of existing embedding ID: chunks_9
Add of existing embedding ID: chunks_10
Insert of existing embedding ID: chunks_10
Add of existing embedding ID: chunks_11
Insert of existing embedding ID: chunks_11
Add of existing embeddin

Add of existing embedding ID: chunks_99
Insert of existing embedding ID: chunks_99
Add of existing embedding ID: chunks_100
Insert of existing embedding ID: chunks_100
Add of existing embedding ID: chunks_101
Insert of existing embedding ID: chunks_101
Add of existing embedding ID: chunks_102
Insert of existing embedding ID: chunks_102
Add of existing embedding ID: chunks_103
Insert of existing embedding ID: chunks_103
Add of existing embedding ID: chunks_104
Insert of existing embedding ID: chunks_104
Add of existing embedding ID: chunks_105
Insert of existing embedding ID: chunks_105
Add of existing embedding ID: chunks_106
Insert of existing embedding ID: chunks_106
Add of existing embedding ID: chunks_107
Insert of existing embedding ID: chunks_107
Add of existing embedding ID: chunks_108
Insert of existing embedding ID: chunks_108
Add of existing embedding ID: chunks_109
Insert of existing embedding ID: chunks_109
Add of existing embedding ID: chunks_110
Insert of existing embeddi

Insert of existing embedding ID: chunks_195
Add of existing embedding ID: chunks_196
Insert of existing embedding ID: chunks_196
Add of existing embedding ID: chunks_197
Insert of existing embedding ID: chunks_197
Add of existing embedding ID: chunks_198
Insert of existing embedding ID: chunks_198
Add of existing embedding ID: chunks_199
Insert of existing embedding ID: chunks_199
Add of existing embedding ID: chunks_200
Insert of existing embedding ID: chunks_200
Add of existing embedding ID: chunks_201
Insert of existing embedding ID: chunks_201
Add of existing embedding ID: chunks_202
Insert of existing embedding ID: chunks_202
Add of existing embedding ID: chunks_203
Insert of existing embedding ID: chunks_203
Add of existing embedding ID: chunks_204
Insert of existing embedding ID: chunks_204
Add of existing embedding ID: chunks_205
Insert of existing embedding ID: chunks_205
Add of existing embedding ID: chunks_206
Insert of existing embedding ID: chunks_206
Add of existing embed

Add of existing embedding ID: chunks_292
Insert of existing embedding ID: chunks_292
Add of existing embedding ID: chunks_293
Insert of existing embedding ID: chunks_293
Add of existing embedding ID: chunks_294
Insert of existing embedding ID: chunks_294
Add of existing embedding ID: chunks_295
Insert of existing embedding ID: chunks_295
Add of existing embedding ID: chunks_296
Insert of existing embedding ID: chunks_296
Add of existing embedding ID: chunks_297
Insert of existing embedding ID: chunks_297
Add of existing embedding ID: chunks_298
Insert of existing embedding ID: chunks_298
Add of existing embedding ID: chunks_299
Insert of existing embedding ID: chunks_299
Add of existing embedding ID: chunks_300
Insert of existing embedding ID: chunks_300
Add of existing embedding ID: chunks_301
Insert of existing embedding ID: chunks_301
Add of existing embedding ID: chunks_302
Insert of existing embedding ID: chunks_302
Add of existing embedding ID: chunks_303
Insert of existing embed

Insert of existing embedding ID: chunks_388
Add of existing embedding ID: chunks_389
Insert of existing embedding ID: chunks_389
Add of existing embedding ID: chunks_390
Insert of existing embedding ID: chunks_390
Add of existing embedding ID: chunks_391
Insert of existing embedding ID: chunks_391
Add of existing embedding ID: chunks_392
Insert of existing embedding ID: chunks_392
Add of existing embedding ID: chunks_393
Insert of existing embedding ID: chunks_393
Add of existing embedding ID: chunks_394
Insert of existing embedding ID: chunks_394
Add of existing embedding ID: chunks_395
Insert of existing embedding ID: chunks_395
Add of existing embedding ID: chunks_396
Insert of existing embedding ID: chunks_396
Add of existing embedding ID: chunks_397
Insert of existing embedding ID: chunks_397
Add of existing embedding ID: chunks_398
Insert of existing embedding ID: chunks_398
Add of existing embedding ID: chunks_399
Insert of existing embedding ID: chunks_399
Add of existing embed

Add of existing embedding ID: chunks_485
Insert of existing embedding ID: chunks_485
Add of existing embedding ID: chunks_486
Insert of existing embedding ID: chunks_486
Add of existing embedding ID: chunks_487
Insert of existing embedding ID: chunks_487
Add of existing embedding ID: chunks_488
Insert of existing embedding ID: chunks_488
Add of existing embedding ID: chunks_489
Insert of existing embedding ID: chunks_489
Add of existing embedding ID: chunks_490
Insert of existing embedding ID: chunks_490
Add of existing embedding ID: chunks_491
Insert of existing embedding ID: chunks_491
Add of existing embedding ID: chunks_492
Insert of existing embedding ID: chunks_492
Add of existing embedding ID: chunks_493
Insert of existing embedding ID: chunks_493
Add of existing embedding ID: chunks_494
Insert of existing embedding ID: chunks_494
Add of existing embedding ID: chunks_495
Insert of existing embedding ID: chunks_495
Add of existing embedding ID: chunks_496
Insert of existing embed

Insert of existing embedding ID: chunks_581
Add of existing embedding ID: chunks_582
Insert of existing embedding ID: chunks_582
Add of existing embedding ID: chunks_583
Insert of existing embedding ID: chunks_583
Add of existing embedding ID: chunks_584
Insert of existing embedding ID: chunks_584
Add of existing embedding ID: chunks_585
Insert of existing embedding ID: chunks_585
Add of existing embedding ID: chunks_586
Insert of existing embedding ID: chunks_586
Add of existing embedding ID: chunks_587
Insert of existing embedding ID: chunks_587
Add of existing embedding ID: chunks_588
Insert of existing embedding ID: chunks_588
Add of existing embedding ID: chunks_589
Insert of existing embedding ID: chunks_589
Add of existing embedding ID: chunks_590
Insert of existing embedding ID: chunks_590
Add of existing embedding ID: chunks_591
Insert of existing embedding ID: chunks_591
Add of existing embedding ID: chunks_592
Insert of existing embedding ID: chunks_592
Add of existing embed

Add of existing embedding ID: chunks_678
Insert of existing embedding ID: chunks_678
Add of existing embedding ID: chunks_679
Insert of existing embedding ID: chunks_679
Add of existing embedding ID: chunks_680
Insert of existing embedding ID: chunks_680
Add of existing embedding ID: chunks_681
Insert of existing embedding ID: chunks_681
Add of existing embedding ID: chunks_682
Insert of existing embedding ID: chunks_682
Add of existing embedding ID: chunks_683
Insert of existing embedding ID: chunks_683
Add of existing embedding ID: chunks_684
Insert of existing embedding ID: chunks_684
Add of existing embedding ID: chunks_685
Insert of existing embedding ID: chunks_685
Add of existing embedding ID: chunks_686
Insert of existing embedding ID: chunks_686
Add of existing embedding ID: chunks_687
Insert of existing embedding ID: chunks_687
Add of existing embedding ID: chunks_688
Insert of existing embedding ID: chunks_688
Add of existing embedding ID: chunks_689
Insert of existing embed

Insert of existing embedding ID: chunks_774
Add of existing embedding ID: chunks_775
Insert of existing embedding ID: chunks_775
Add of existing embedding ID: chunks_776
Insert of existing embedding ID: chunks_776
Add of existing embedding ID: chunks_777
Insert of existing embedding ID: chunks_777
Add of existing embedding ID: chunks_778
Insert of existing embedding ID: chunks_778
Add of existing embedding ID: chunks_779
Insert of existing embedding ID: chunks_779
Add of existing embedding ID: chunks_780
Insert of existing embedding ID: chunks_780
Add of existing embedding ID: chunks_781
Insert of existing embedding ID: chunks_781
Add of existing embedding ID: chunks_782
Insert of existing embedding ID: chunks_782
Add of existing embedding ID: chunks_783
Insert of existing embedding ID: chunks_783
Add of existing embedding ID: chunks_784
Insert of existing embedding ID: chunks_784
Add of existing embedding ID: chunks_785
Insert of existing embedding ID: chunks_785
Add of existing embed

Add of existing embedding ID: chunks_871
Insert of existing embedding ID: chunks_871
Add of existing embedding ID: chunks_872
Insert of existing embedding ID: chunks_872
Add of existing embedding ID: chunks_873
Insert of existing embedding ID: chunks_873
Add of existing embedding ID: chunks_874
Insert of existing embedding ID: chunks_874
Add of existing embedding ID: chunks_875
Insert of existing embedding ID: chunks_875
Add of existing embedding ID: chunks_876
Insert of existing embedding ID: chunks_876
Add of existing embedding ID: chunks_877
Insert of existing embedding ID: chunks_877
Add of existing embedding ID: chunks_878
Insert of existing embedding ID: chunks_878
Add of existing embedding ID: chunks_879
Insert of existing embedding ID: chunks_879
Add of existing embedding ID: chunks_880
Insert of existing embedding ID: chunks_880
Add of existing embedding ID: chunks_881
Insert of existing embedding ID: chunks_881
Add of existing embedding ID: chunks_882
Insert of existing embed

Insert of existing embedding ID: chunks_967
Add of existing embedding ID: chunks_968
Insert of existing embedding ID: chunks_968
Add of existing embedding ID: chunks_969
Insert of existing embedding ID: chunks_969
Add of existing embedding ID: chunks_970
Insert of existing embedding ID: chunks_970
Add of existing embedding ID: chunks_971
Insert of existing embedding ID: chunks_971
Add of existing embedding ID: chunks_972
Insert of existing embedding ID: chunks_972
Add of existing embedding ID: chunks_973
Insert of existing embedding ID: chunks_973
Add of existing embedding ID: chunks_974
Insert of existing embedding ID: chunks_974
Add of existing embedding ID: chunks_975
Insert of existing embedding ID: chunks_975
Add of existing embedding ID: chunks_976
Insert of existing embedding ID: chunks_976
Add of existing embedding ID: chunks_977
Insert of existing embedding ID: chunks_977
Add of existing embedding ID: chunks_978
Insert of existing embedding ID: chunks_978
Add of existing embed

Insert of existing embedding ID: chunks_1062
Add of existing embedding ID: chunks_1063
Insert of existing embedding ID: chunks_1063
Add of existing embedding ID: chunks_1064
Insert of existing embedding ID: chunks_1064
Add of existing embedding ID: chunks_1065
Insert of existing embedding ID: chunks_1065
Add of existing embedding ID: chunks_1066
Insert of existing embedding ID: chunks_1066
Add of existing embedding ID: chunks_1067
Insert of existing embedding ID: chunks_1067
Add of existing embedding ID: chunks_1068
Insert of existing embedding ID: chunks_1068
Add of existing embedding ID: chunks_1069
Insert of existing embedding ID: chunks_1069
Add of existing embedding ID: chunks_1070
Insert of existing embedding ID: chunks_1070
Add of existing embedding ID: chunks_1071
Insert of existing embedding ID: chunks_1071
Add of existing embedding ID: chunks_1072
Insert of existing embedding ID: chunks_1072
Add of existing embedding ID: chunks_1073
Insert of existing embedding ID: chunks_107

Add of existing embedding ID: chunks_1157
Insert of existing embedding ID: chunks_1157
Add of existing embedding ID: chunks_1158
Insert of existing embedding ID: chunks_1158
Add of existing embedding ID: chunks_1159
Insert of existing embedding ID: chunks_1159
Add of existing embedding ID: chunks_1160
Insert of existing embedding ID: chunks_1160
Add of existing embedding ID: chunks_1161
Insert of existing embedding ID: chunks_1161
Add of existing embedding ID: chunks_1162
Insert of existing embedding ID: chunks_1162
Add of existing embedding ID: chunks_1163
Insert of existing embedding ID: chunks_1163
Add of existing embedding ID: chunks_1164
Insert of existing embedding ID: chunks_1164
Add of existing embedding ID: chunks_1165
Insert of existing embedding ID: chunks_1165
Add of existing embedding ID: chunks_1166
Insert of existing embedding ID: chunks_1166
Add of existing embedding ID: chunks_1167
Insert of existing embedding ID: chunks_1167
Add of existing embedding ID: chunks_1168
I

Insert of existing embedding ID: chunks_1251
Add of existing embedding ID: chunks_1252
Insert of existing embedding ID: chunks_1252
Add of existing embedding ID: chunks_1253
Insert of existing embedding ID: chunks_1253
Add of existing embedding ID: chunks_1254
Insert of existing embedding ID: chunks_1254
Add of existing embedding ID: chunks_1255
Insert of existing embedding ID: chunks_1255
Add of existing embedding ID: chunks_1256
Insert of existing embedding ID: chunks_1256
Add of existing embedding ID: chunks_1257
Insert of existing embedding ID: chunks_1257
Add of existing embedding ID: chunks_1258
Insert of existing embedding ID: chunks_1258
Add of existing embedding ID: chunks_1259
Insert of existing embedding ID: chunks_1259
Add of existing embedding ID: chunks_1260
Insert of existing embedding ID: chunks_1260
Add of existing embedding ID: chunks_1261
Insert of existing embedding ID: chunks_1261
Add of existing embedding ID: chunks_1262
Insert of existing embedding ID: chunks_126

Add of existing embedding ID: chunks_1346
Insert of existing embedding ID: chunks_1346
Add of existing embedding ID: chunks_1347
Insert of existing embedding ID: chunks_1347
Add of existing embedding ID: chunks_1348
Insert of existing embedding ID: chunks_1348
Add of existing embedding ID: chunks_1349
Insert of existing embedding ID: chunks_1349
Add of existing embedding ID: chunks_1350
Insert of existing embedding ID: chunks_1350
Add of existing embedding ID: chunks_1351
Insert of existing embedding ID: chunks_1351
Add of existing embedding ID: chunks_1352
Insert of existing embedding ID: chunks_1352
Add of existing embedding ID: chunks_1353
Insert of existing embedding ID: chunks_1353
Add of existing embedding ID: chunks_1354
Insert of existing embedding ID: chunks_1354
Add of existing embedding ID: chunks_1355
Insert of existing embedding ID: chunks_1355
Add of existing embedding ID: chunks_1356
Insert of existing embedding ID: chunks_1356
Add of existing embedding ID: chunks_1357
I

Insert of existing embedding ID: chunks_1440
Add of existing embedding ID: chunks_1441
Insert of existing embedding ID: chunks_1441
Add of existing embedding ID: chunks_1442
Insert of existing embedding ID: chunks_1442
Add of existing embedding ID: chunks_1443
Insert of existing embedding ID: chunks_1443
Add of existing embedding ID: chunks_1444
Insert of existing embedding ID: chunks_1444
Add of existing embedding ID: chunks_1445
Insert of existing embedding ID: chunks_1445
Add of existing embedding ID: chunks_1446
Insert of existing embedding ID: chunks_1446
Add of existing embedding ID: chunks_1447
Insert of existing embedding ID: chunks_1447
Add of existing embedding ID: chunks_1448
Insert of existing embedding ID: chunks_1448
Add of existing embedding ID: chunks_1449
Insert of existing embedding ID: chunks_1449
Add of existing embedding ID: chunks_1450
Insert of existing embedding ID: chunks_1450
Add of existing embedding ID: chunks_1451
Insert of existing embedding ID: chunks_145

Add of existing embedding ID: chunks_1535
Insert of existing embedding ID: chunks_1535
Add of existing embedding ID: chunks_1536
Insert of existing embedding ID: chunks_1536
Add of existing embedding ID: chunks_1537
Insert of existing embedding ID: chunks_1537
Add of existing embedding ID: chunks_1538
Insert of existing embedding ID: chunks_1538
Add of existing embedding ID: chunks_1539
Insert of existing embedding ID: chunks_1539
Add of existing embedding ID: chunks_1540
Insert of existing embedding ID: chunks_1540
Add of existing embedding ID: chunks_1541
Insert of existing embedding ID: chunks_1541
Add of existing embedding ID: chunks_1542
Insert of existing embedding ID: chunks_1542
Add of existing embedding ID: chunks_1543
Insert of existing embedding ID: chunks_1543
Add of existing embedding ID: chunks_1544
Insert of existing embedding ID: chunks_1544
Add of existing embedding ID: chunks_1545
Insert of existing embedding ID: chunks_1545
Add of existing embedding ID: chunks_1546
I

Insert of existing embedding ID: chunks_1629
Add of existing embedding ID: chunks_1630
Insert of existing embedding ID: chunks_1630
Add of existing embedding ID: chunks_1631
Insert of existing embedding ID: chunks_1631
Add of existing embedding ID: chunks_1632
Insert of existing embedding ID: chunks_1632
Add of existing embedding ID: chunks_1633
Insert of existing embedding ID: chunks_1633
Add of existing embedding ID: chunks_1634
Insert of existing embedding ID: chunks_1634
Add of existing embedding ID: chunks_1635
Insert of existing embedding ID: chunks_1635
Add of existing embedding ID: chunks_1636
Insert of existing embedding ID: chunks_1636
Add of existing embedding ID: chunks_1637
Insert of existing embedding ID: chunks_1637
Add of existing embedding ID: chunks_1638
Insert of existing embedding ID: chunks_1638
Add of existing embedding ID: chunks_1639
Insert of existing embedding ID: chunks_1639
Add of existing embedding ID: chunks_1640
Insert of existing embedding ID: chunks_164

Add of existing embedding ID: chunks_1724
Insert of existing embedding ID: chunks_1724
Add of existing embedding ID: chunks_1725
Insert of existing embedding ID: chunks_1725
Add of existing embedding ID: chunks_1726
Insert of existing embedding ID: chunks_1726
Add of existing embedding ID: chunks_1727
Insert of existing embedding ID: chunks_1727
Add of existing embedding ID: chunks_1728
Insert of existing embedding ID: chunks_1728
Add of existing embedding ID: chunks_1729
Insert of existing embedding ID: chunks_1729
Add of existing embedding ID: chunks_1730
Insert of existing embedding ID: chunks_1730
Add of existing embedding ID: chunks_1731
Insert of existing embedding ID: chunks_1731
Add of existing embedding ID: chunks_1732
Insert of existing embedding ID: chunks_1732
Add of existing embedding ID: chunks_1733
Insert of existing embedding ID: chunks_1733
Add of existing embedding ID: chunks_1734
Insert of existing embedding ID: chunks_1734
Add of existing embedding ID: chunks_1735
I

Insert of existing embedding ID: chunks_1818
Add of existing embedding ID: chunks_1819
Insert of existing embedding ID: chunks_1819
Add of existing embedding ID: chunks_1820
Insert of existing embedding ID: chunks_1820
Add of existing embedding ID: chunks_1821
Insert of existing embedding ID: chunks_1821
Add of existing embedding ID: chunks_1822
Insert of existing embedding ID: chunks_1822
Add of existing embedding ID: chunks_1823
Insert of existing embedding ID: chunks_1823
Add of existing embedding ID: chunks_1824
Insert of existing embedding ID: chunks_1824
Add of existing embedding ID: chunks_1825
Insert of existing embedding ID: chunks_1825
Add of existing embedding ID: chunks_1826
Insert of existing embedding ID: chunks_1826
Add of existing embedding ID: chunks_1827
Insert of existing embedding ID: chunks_1827
Add of existing embedding ID: chunks_1828
Insert of existing embedding ID: chunks_1828
Add of existing embedding ID: chunks_1829
Insert of existing embedding ID: chunks_182

Add of existing embedding ID: chunks_1913
Insert of existing embedding ID: chunks_1913
Add of existing embedding ID: chunks_1914
Insert of existing embedding ID: chunks_1914
Add of existing embedding ID: chunks_1915
Insert of existing embedding ID: chunks_1915
Add of existing embedding ID: chunks_1916
Insert of existing embedding ID: chunks_1916
Add of existing embedding ID: chunks_1917
Insert of existing embedding ID: chunks_1917
Add of existing embedding ID: chunks_1918
Insert of existing embedding ID: chunks_1918
Add of existing embedding ID: chunks_1919
Insert of existing embedding ID: chunks_1919
Add of existing embedding ID: chunks_1920
Insert of existing embedding ID: chunks_1920
Add of existing embedding ID: chunks_1921
Insert of existing embedding ID: chunks_1921
Add of existing embedding ID: chunks_1922
Insert of existing embedding ID: chunks_1922
Add of existing embedding ID: chunks_1923
Insert of existing embedding ID: chunks_1923
Add of existing embedding ID: chunks_1924
I

Insert of existing embedding ID: chunks_2007
Add of existing embedding ID: chunks_2008
Insert of existing embedding ID: chunks_2008
Add of existing embedding ID: chunks_2009
Insert of existing embedding ID: chunks_2009
Add of existing embedding ID: chunks_2010
Insert of existing embedding ID: chunks_2010
Add of existing embedding ID: chunks_2011
Insert of existing embedding ID: chunks_2011
Add of existing embedding ID: chunks_2012
Insert of existing embedding ID: chunks_2012
Add of existing embedding ID: chunks_2013
Insert of existing embedding ID: chunks_2013
Add of existing embedding ID: chunks_2014
Insert of existing embedding ID: chunks_2014
Add of existing embedding ID: chunks_2015
Insert of existing embedding ID: chunks_2015
Add of existing embedding ID: chunks_2016
Insert of existing embedding ID: chunks_2016
Add of existing embedding ID: chunks_2017
Insert of existing embedding ID: chunks_2017
Add of existing embedding ID: chunks_2018
Insert of existing embedding ID: chunks_201

Add of existing embedding ID: chunks_2102
Insert of existing embedding ID: chunks_2102
Add of existing embedding ID: chunks_2103
Insert of existing embedding ID: chunks_2103
Add of existing embedding ID: chunks_2104
Insert of existing embedding ID: chunks_2104
Add of existing embedding ID: chunks_2105
Insert of existing embedding ID: chunks_2105
Add of existing embedding ID: chunks_2106
Insert of existing embedding ID: chunks_2106
Add of existing embedding ID: chunks_2107
Insert of existing embedding ID: chunks_2107
Add of existing embedding ID: chunks_2108
Insert of existing embedding ID: chunks_2108
Add of existing embedding ID: chunks_2109
Insert of existing embedding ID: chunks_2109
Add of existing embedding ID: chunks_2110
Insert of existing embedding ID: chunks_2110
Add of existing embedding ID: chunks_2111
Insert of existing embedding ID: chunks_2111
Add of existing embedding ID: chunks_2112
Insert of existing embedding ID: chunks_2112
Add of existing embedding ID: chunks_2113
I

Insert of existing embedding ID: chunks_2196
Add of existing embedding ID: chunks_2197
Insert of existing embedding ID: chunks_2197
Add of existing embedding ID: chunks_2198
Insert of existing embedding ID: chunks_2198
Add of existing embedding ID: chunks_2199
Insert of existing embedding ID: chunks_2199
Add of existing embedding ID: chunks_2200
Insert of existing embedding ID: chunks_2200
Add of existing embedding ID: chunks_2201
Insert of existing embedding ID: chunks_2201
Add of existing embedding ID: chunks_2202
Insert of existing embedding ID: chunks_2202
Add of existing embedding ID: chunks_2203
Insert of existing embedding ID: chunks_2203
Add of existing embedding ID: chunks_2204
Insert of existing embedding ID: chunks_2204
Add of existing embedding ID: chunks_2205
Insert of existing embedding ID: chunks_2205
Add of existing embedding ID: chunks_2206
Insert of existing embedding ID: chunks_2206
Add of existing embedding ID: chunks_2207
Insert of existing embedding ID: chunks_220

Add of existing embedding ID: chunks_2291
Insert of existing embedding ID: chunks_2291
Add of existing embedding ID: chunks_2292
Insert of existing embedding ID: chunks_2292
Add of existing embedding ID: chunks_2293
Insert of existing embedding ID: chunks_2293
Add of existing embedding ID: chunks_2294
Insert of existing embedding ID: chunks_2294
Add of existing embedding ID: chunks_2295
Insert of existing embedding ID: chunks_2295
Add of existing embedding ID: chunks_2296
Insert of existing embedding ID: chunks_2296
Add of existing embedding ID: chunks_2297
Insert of existing embedding ID: chunks_2297
Add of existing embedding ID: chunks_2298
Insert of existing embedding ID: chunks_2298
Add of existing embedding ID: chunks_2299
Insert of existing embedding ID: chunks_2299
Add of existing embedding ID: chunks_2300
Insert of existing embedding ID: chunks_2300
Add of existing embedding ID: chunks_2301
Insert of existing embedding ID: chunks_2301
Add of existing embedding ID: chunks_2302
I

Insert of existing embedding ID: chunks_2385
Add of existing embedding ID: chunks_2386
Insert of existing embedding ID: chunks_2386
Add of existing embedding ID: chunks_2387
Insert of existing embedding ID: chunks_2387
Add of existing embedding ID: chunks_2388
Insert of existing embedding ID: chunks_2388
Add of existing embedding ID: chunks_2389
Insert of existing embedding ID: chunks_2389
Add of existing embedding ID: chunks_2390
Insert of existing embedding ID: chunks_2390
Add of existing embedding ID: chunks_2391
Insert of existing embedding ID: chunks_2391
Add of existing embedding ID: chunks_2392
Insert of existing embedding ID: chunks_2392
Add of existing embedding ID: chunks_2393
Insert of existing embedding ID: chunks_2393
Add of existing embedding ID: chunks_2394
Insert of existing embedding ID: chunks_2394
Add of existing embedding ID: chunks_2395
Insert of existing embedding ID: chunks_2395
Add of existing embedding ID: chunks_2396
Insert of existing embedding ID: chunks_239

Add of existing embedding ID: chunks_2480
Insert of existing embedding ID: chunks_2480
Add of existing embedding ID: chunks_2481
Insert of existing embedding ID: chunks_2481
Add of existing embedding ID: chunks_2482
Insert of existing embedding ID: chunks_2482
Add of existing embedding ID: chunks_2483
Insert of existing embedding ID: chunks_2483
Add of existing embedding ID: chunks_2484
Insert of existing embedding ID: chunks_2484
Add of existing embedding ID: chunks_2485
Insert of existing embedding ID: chunks_2485
Add of existing embedding ID: chunks_2486
Insert of existing embedding ID: chunks_2486
Add of existing embedding ID: chunks_2487
Insert of existing embedding ID: chunks_2487
Add of existing embedding ID: chunks_2488
Insert of existing embedding ID: chunks_2488
Add of existing embedding ID: chunks_2489
Insert of existing embedding ID: chunks_2489
Add of existing embedding ID: chunks_2490
Insert of existing embedding ID: chunks_2490
Add of existing embedding ID: chunks_2491
I

Insert of existing embedding ID: chunks_2574
Add of existing embedding ID: chunks_2575
Insert of existing embedding ID: chunks_2575
Add of existing embedding ID: chunks_2576
Insert of existing embedding ID: chunks_2576
Add of existing embedding ID: chunks_2577
Insert of existing embedding ID: chunks_2577
Add of existing embedding ID: chunks_2578
Insert of existing embedding ID: chunks_2578
Add of existing embedding ID: chunks_2579
Insert of existing embedding ID: chunks_2579
Add of existing embedding ID: chunks_2580
Insert of existing embedding ID: chunks_2580
Add of existing embedding ID: chunks_2581
Insert of existing embedding ID: chunks_2581
Add of existing embedding ID: chunks_2582
Insert of existing embedding ID: chunks_2582
Add of existing embedding ID: chunks_2583
Insert of existing embedding ID: chunks_2583
Add of existing embedding ID: chunks_2584
Insert of existing embedding ID: chunks_2584
Add of existing embedding ID: chunks_2585
Insert of existing embedding ID: chunks_258

Add of existing embedding ID: chunks_2669
Insert of existing embedding ID: chunks_2669
Add of existing embedding ID: chunks_2670
Insert of existing embedding ID: chunks_2670
Add of existing embedding ID: chunks_2671
Insert of existing embedding ID: chunks_2671
Add of existing embedding ID: chunks_2672
Insert of existing embedding ID: chunks_2672
Add of existing embedding ID: chunks_2673
Insert of existing embedding ID: chunks_2673
Add of existing embedding ID: chunks_2674
Insert of existing embedding ID: chunks_2674
Add of existing embedding ID: chunks_2675
Insert of existing embedding ID: chunks_2675
Add of existing embedding ID: chunks_2676
Insert of existing embedding ID: chunks_2676
Add of existing embedding ID: chunks_2677
Insert of existing embedding ID: chunks_2677
Add of existing embedding ID: chunks_2678
Insert of existing embedding ID: chunks_2678
Add of existing embedding ID: chunks_2679
Insert of existing embedding ID: chunks_2679
Add of existing embedding ID: chunks_2680
I

Insert of existing embedding ID: chunks_2763
Add of existing embedding ID: chunks_2764
Insert of existing embedding ID: chunks_2764
Add of existing embedding ID: chunks_2765
Insert of existing embedding ID: chunks_2765
Add of existing embedding ID: chunks_2766
Insert of existing embedding ID: chunks_2766
Add of existing embedding ID: chunks_2767
Insert of existing embedding ID: chunks_2767
Add of existing embedding ID: chunks_2768
Insert of existing embedding ID: chunks_2768
Add of existing embedding ID: chunks_2769
Insert of existing embedding ID: chunks_2769
Add of existing embedding ID: chunks_2770
Insert of existing embedding ID: chunks_2770
Add of existing embedding ID: chunks_2771
Insert of existing embedding ID: chunks_2771
Add of existing embedding ID: chunks_2772
Insert of existing embedding ID: chunks_2772
Add of existing embedding ID: chunks_2773
Insert of existing embedding ID: chunks_2773
Add of existing embedding ID: chunks_2774
Insert of existing embedding ID: chunks_277

Add of existing embedding ID: chunks_2858
Insert of existing embedding ID: chunks_2858
Add of existing embedding ID: chunks_2859
Insert of existing embedding ID: chunks_2859
Add of existing embedding ID: chunks_2860
Insert of existing embedding ID: chunks_2860
Add of existing embedding ID: chunks_2861
Insert of existing embedding ID: chunks_2861
Add of existing embedding ID: chunks_2862
Insert of existing embedding ID: chunks_2862
Add of existing embedding ID: chunks_2863
Insert of existing embedding ID: chunks_2863
Add of existing embedding ID: chunks_2864
Insert of existing embedding ID: chunks_2864
Add of existing embedding ID: chunks_2865
Insert of existing embedding ID: chunks_2865
Add of existing embedding ID: chunks_2866
Insert of existing embedding ID: chunks_2866
Add of existing embedding ID: chunks_2867
Insert of existing embedding ID: chunks_2867
Add of existing embedding ID: chunks_2868
Insert of existing embedding ID: chunks_2868
Add of existing embedding ID: chunks_2869
I

Insert of existing embedding ID: chunks_2952
Add of existing embedding ID: chunks_2953
Insert of existing embedding ID: chunks_2953
Add of existing embedding ID: chunks_2954
Insert of existing embedding ID: chunks_2954
Add of existing embedding ID: chunks_2955
Insert of existing embedding ID: chunks_2955
Add of existing embedding ID: chunks_2956
Insert of existing embedding ID: chunks_2956
Add of existing embedding ID: chunks_2957
Insert of existing embedding ID: chunks_2957
Add of existing embedding ID: chunks_2958
Insert of existing embedding ID: chunks_2958
Add of existing embedding ID: chunks_2959
Insert of existing embedding ID: chunks_2959
Add of existing embedding ID: chunks_2960
Insert of existing embedding ID: chunks_2960
Add of existing embedding ID: chunks_2961
Insert of existing embedding ID: chunks_2961
Add of existing embedding ID: chunks_2962
Insert of existing embedding ID: chunks_2962
Add of existing embedding ID: chunks_2963
Insert of existing embedding ID: chunks_296

Add of existing embedding ID: chunks_3047
Insert of existing embedding ID: chunks_3047
Add of existing embedding ID: chunks_3048
Insert of existing embedding ID: chunks_3048
Add of existing embedding ID: chunks_3049
Insert of existing embedding ID: chunks_3049
Add of existing embedding ID: chunks_3050
Insert of existing embedding ID: chunks_3050
Add of existing embedding ID: chunks_3051
Insert of existing embedding ID: chunks_3051
Add of existing embedding ID: chunks_3052
Insert of existing embedding ID: chunks_3052
Add of existing embedding ID: chunks_3053
Insert of existing embedding ID: chunks_3053
Add of existing embedding ID: chunks_3054
Insert of existing embedding ID: chunks_3054
Add of existing embedding ID: chunks_3055
Insert of existing embedding ID: chunks_3055
Add of existing embedding ID: chunks_3056
Insert of existing embedding ID: chunks_3056
Add of existing embedding ID: chunks_3057
Insert of existing embedding ID: chunks_3057
Add of existing embedding ID: chunks_3058
I

Insert of existing embedding ID: chunks_3141
Add of existing embedding ID: chunks_3142
Insert of existing embedding ID: chunks_3142
Add of existing embedding ID: chunks_3143
Insert of existing embedding ID: chunks_3143
Add of existing embedding ID: chunks_3144
Insert of existing embedding ID: chunks_3144
Add of existing embedding ID: chunks_3145
Insert of existing embedding ID: chunks_3145
Add of existing embedding ID: chunks_3146
Insert of existing embedding ID: chunks_3146
Add of existing embedding ID: chunks_3147
Insert of existing embedding ID: chunks_3147
Add of existing embedding ID: chunks_3148
Insert of existing embedding ID: chunks_3148
Add of existing embedding ID: chunks_3149
Insert of existing embedding ID: chunks_3149
Add of existing embedding ID: chunks_3150
Insert of existing embedding ID: chunks_3150
Add of existing embedding ID: chunks_3151
Insert of existing embedding ID: chunks_3151
Add of existing embedding ID: chunks_3152
Insert of existing embedding ID: chunks_315

Add of existing embedding ID: chunks_3236
Insert of existing embedding ID: chunks_3236
Add of existing embedding ID: chunks_3237
Insert of existing embedding ID: chunks_3237
Add of existing embedding ID: chunks_3238
Insert of existing embedding ID: chunks_3238
Add of existing embedding ID: chunks_3239
Insert of existing embedding ID: chunks_3239
Add of existing embedding ID: chunks_3240
Insert of existing embedding ID: chunks_3240
Add of existing embedding ID: chunks_3241
Insert of existing embedding ID: chunks_3241
Add of existing embedding ID: chunks_3242
Insert of existing embedding ID: chunks_3242
Add of existing embedding ID: chunks_3243
Insert of existing embedding ID: chunks_3243
Add of existing embedding ID: chunks_3244
Insert of existing embedding ID: chunks_3244
Add of existing embedding ID: chunks_3245
Insert of existing embedding ID: chunks_3245
Add of existing embedding ID: chunks_3246
Insert of existing embedding ID: chunks_3246
Add of existing embedding ID: chunks_3247
I

Insert of existing embedding ID: chunks_3330
Add of existing embedding ID: chunks_3331
Insert of existing embedding ID: chunks_3331
Add of existing embedding ID: chunks_3332
Insert of existing embedding ID: chunks_3332
Add of existing embedding ID: chunks_3333
Insert of existing embedding ID: chunks_3333
Add of existing embedding ID: chunks_3334
Insert of existing embedding ID: chunks_3334
Add of existing embedding ID: chunks_3335
Insert of existing embedding ID: chunks_3335
Add of existing embedding ID: chunks_3336
Insert of existing embedding ID: chunks_3336
Add of existing embedding ID: chunks_3337
Insert of existing embedding ID: chunks_3337
Add of existing embedding ID: chunks_3338
Insert of existing embedding ID: chunks_3338
Add of existing embedding ID: chunks_3339
Insert of existing embedding ID: chunks_3339
Add of existing embedding ID: chunks_3340
Insert of existing embedding ID: chunks_3340
Add of existing embedding ID: chunks_3341
Insert of existing embedding ID: chunks_334

Add of existing embedding ID: chunks_3425
Insert of existing embedding ID: chunks_3425
Add of existing embedding ID: chunks_3426
Insert of existing embedding ID: chunks_3426
Add of existing embedding ID: chunks_3427
Insert of existing embedding ID: chunks_3427
Add of existing embedding ID: chunks_3428
Insert of existing embedding ID: chunks_3428
Add of existing embedding ID: chunks_3429
Insert of existing embedding ID: chunks_3429
Add of existing embedding ID: chunks_3430
Insert of existing embedding ID: chunks_3430
Add of existing embedding ID: chunks_3431
Insert of existing embedding ID: chunks_3431
Add of existing embedding ID: chunks_3432
Insert of existing embedding ID: chunks_3432
Add of existing embedding ID: chunks_3433
Insert of existing embedding ID: chunks_3433
Add of existing embedding ID: chunks_3434
Insert of existing embedding ID: chunks_3434
Add of existing embedding ID: chunks_3435
Insert of existing embedding ID: chunks_3435
Add of existing embedding ID: chunks_3436
I

Insert of existing embedding ID: chunks_3519
Add of existing embedding ID: chunks_3520
Insert of existing embedding ID: chunks_3520
Add of existing embedding ID: chunks_3521
Insert of existing embedding ID: chunks_3521
Add of existing embedding ID: chunks_3522
Insert of existing embedding ID: chunks_3522
Add of existing embedding ID: chunks_3523
Insert of existing embedding ID: chunks_3523
Add of existing embedding ID: chunks_3524
Insert of existing embedding ID: chunks_3524
Add of existing embedding ID: chunks_3525
Insert of existing embedding ID: chunks_3525
Add of existing embedding ID: chunks_3526
Insert of existing embedding ID: chunks_3526
Add of existing embedding ID: chunks_3527
Insert of existing embedding ID: chunks_3527
Add of existing embedding ID: chunks_3528
Insert of existing embedding ID: chunks_3528
Add of existing embedding ID: chunks_3529
Insert of existing embedding ID: chunks_3529
Add of existing embedding ID: chunks_3530
Insert of existing embedding ID: chunks_353

Add of existing embedding ID: chunks_3614
Insert of existing embedding ID: chunks_3614
Add of existing embedding ID: chunks_3615
Insert of existing embedding ID: chunks_3615
Add of existing embedding ID: chunks_3616
Insert of existing embedding ID: chunks_3616
Add of existing embedding ID: chunks_3617
Insert of existing embedding ID: chunks_3617
Add of existing embedding ID: chunks_3618
Insert of existing embedding ID: chunks_3618
Add of existing embedding ID: chunks_3619
Insert of existing embedding ID: chunks_3619
Add of existing embedding ID: chunks_3620
Insert of existing embedding ID: chunks_3620
Add of existing embedding ID: chunks_3621
Insert of existing embedding ID: chunks_3621
Add of existing embedding ID: chunks_3622
Insert of existing embedding ID: chunks_3622
Add of existing embedding ID: chunks_3623
Insert of existing embedding ID: chunks_3623
Add of existing embedding ID: chunks_3624
Insert of existing embedding ID: chunks_3624
Add of existing embedding ID: chunks_3625
I

Insert of existing embedding ID: chunks_3708
Add of existing embedding ID: chunks_3709
Insert of existing embedding ID: chunks_3709
Add of existing embedding ID: chunks_3710
Insert of existing embedding ID: chunks_3710
Add of existing embedding ID: chunks_3711
Insert of existing embedding ID: chunks_3711
Add of existing embedding ID: chunks_3712
Insert of existing embedding ID: chunks_3712
Add of existing embedding ID: chunks_3713
Insert of existing embedding ID: chunks_3713
Add of existing embedding ID: chunks_3714
Insert of existing embedding ID: chunks_3714
Add of existing embedding ID: chunks_3715
Insert of existing embedding ID: chunks_3715
Add of existing embedding ID: chunks_3716
Insert of existing embedding ID: chunks_3716
Add of existing embedding ID: chunks_3717
Insert of existing embedding ID: chunks_3717
Add of existing embedding ID: chunks_3718
Insert of existing embedding ID: chunks_3718
Add of existing embedding ID: chunks_3719
Insert of existing embedding ID: chunks_371

Add of existing embedding ID: chunks_3803
Insert of existing embedding ID: chunks_3803
Add of existing embedding ID: chunks_3804
Insert of existing embedding ID: chunks_3804
Add of existing embedding ID: chunks_3805
Insert of existing embedding ID: chunks_3805
Add of existing embedding ID: chunks_3806
Insert of existing embedding ID: chunks_3806
Add of existing embedding ID: chunks_3807
Insert of existing embedding ID: chunks_3807
Add of existing embedding ID: chunks_3808
Insert of existing embedding ID: chunks_3808
Add of existing embedding ID: chunks_3809
Insert of existing embedding ID: chunks_3809
Add of existing embedding ID: chunks_3810
Insert of existing embedding ID: chunks_3810
Add of existing embedding ID: chunks_3811
Insert of existing embedding ID: chunks_3811
Add of existing embedding ID: chunks_3812
Insert of existing embedding ID: chunks_3812
Add of existing embedding ID: chunks_3813
Insert of existing embedding ID: chunks_3813
Add of existing embedding ID: chunks_3814
I

Insert of existing embedding ID: chunks_3897
Add of existing embedding ID: chunks_3898
Insert of existing embedding ID: chunks_3898
Add of existing embedding ID: chunks_3899
Insert of existing embedding ID: chunks_3899
Add of existing embedding ID: chunks_3900
Insert of existing embedding ID: chunks_3900
Add of existing embedding ID: chunks_3901
Insert of existing embedding ID: chunks_3901
Add of existing embedding ID: chunks_3902
Insert of existing embedding ID: chunks_3902
Add of existing embedding ID: chunks_3903
Insert of existing embedding ID: chunks_3903
Add of existing embedding ID: chunks_3904
Insert of existing embedding ID: chunks_3904
Add of existing embedding ID: chunks_3905
Insert of existing embedding ID: chunks_3905
Add of existing embedding ID: chunks_3906
Insert of existing embedding ID: chunks_3906
Add of existing embedding ID: chunks_3907
Insert of existing embedding ID: chunks_3907
Add of existing embedding ID: chunks_3908
Insert of existing embedding ID: chunks_390

Add of existing embedding ID: chunks_3992
Insert of existing embedding ID: chunks_3992
Add of existing embedding ID: chunks_3993
Insert of existing embedding ID: chunks_3993
Add of existing embedding ID: chunks_3994
Insert of existing embedding ID: chunks_3994
Add of existing embedding ID: chunks_3995
Insert of existing embedding ID: chunks_3995
Add of existing embedding ID: chunks_3996
Insert of existing embedding ID: chunks_3996
Add of existing embedding ID: chunks_3997
Insert of existing embedding ID: chunks_3997
Add of existing embedding ID: chunks_3998
Insert of existing embedding ID: chunks_3998
Add of existing embedding ID: chunks_3999
Insert of existing embedding ID: chunks_3999
Add of existing embedding ID: chunks_4000
Insert of existing embedding ID: chunks_4000
Add of existing embedding ID: chunks_4001
Insert of existing embedding ID: chunks_4001
Add of existing embedding ID: chunks_4002
Insert of existing embedding ID: chunks_4002
Add of existing embedding ID: chunks_4003
I

Insert of existing embedding ID: chunks_4086
Add of existing embedding ID: chunks_4087
Insert of existing embedding ID: chunks_4087
Add of existing embedding ID: chunks_4088
Insert of existing embedding ID: chunks_4088
Add of existing embedding ID: chunks_4089
Insert of existing embedding ID: chunks_4089
Add of existing embedding ID: chunks_4090
Insert of existing embedding ID: chunks_4090
Add of existing embedding ID: chunks_4091
Insert of existing embedding ID: chunks_4091
Add of existing embedding ID: chunks_4092
Insert of existing embedding ID: chunks_4092
Add of existing embedding ID: chunks_4093
Insert of existing embedding ID: chunks_4093
Add of existing embedding ID: chunks_4094
Insert of existing embedding ID: chunks_4094
Add of existing embedding ID: chunks_4095
Insert of existing embedding ID: chunks_4095
Add of existing embedding ID: chunks_4096
Insert of existing embedding ID: chunks_4096
Add of existing embedding ID: chunks_4097
Insert of existing embedding ID: chunks_409

Add of existing embedding ID: chunks_4181
Insert of existing embedding ID: chunks_4181
Add of existing embedding ID: chunks_4182
Insert of existing embedding ID: chunks_4182
Add of existing embedding ID: chunks_4183
Insert of existing embedding ID: chunks_4183
Add of existing embedding ID: chunks_4184
Insert of existing embedding ID: chunks_4184
Add of existing embedding ID: chunks_4185
Insert of existing embedding ID: chunks_4185
Add of existing embedding ID: chunks_4186
Insert of existing embedding ID: chunks_4186
Add of existing embedding ID: chunks_4187
Insert of existing embedding ID: chunks_4187
Add of existing embedding ID: chunks_4188
Insert of existing embedding ID: chunks_4188
Add of existing embedding ID: chunks_4189
Insert of existing embedding ID: chunks_4189
Add of existing embedding ID: chunks_4190
Insert of existing embedding ID: chunks_4190
Add of existing embedding ID: chunks_4191
Insert of existing embedding ID: chunks_4191
Add of existing embedding ID: chunks_4192
I

Insert of existing embedding ID: chunks_4275
Add of existing embedding ID: chunks_4276
Insert of existing embedding ID: chunks_4276
Add of existing embedding ID: chunks_4277
Insert of existing embedding ID: chunks_4277
Add of existing embedding ID: chunks_4278
Insert of existing embedding ID: chunks_4278
Add of existing embedding ID: chunks_4279
Insert of existing embedding ID: chunks_4279
Add of existing embedding ID: chunks_4280
Insert of existing embedding ID: chunks_4280
Add of existing embedding ID: chunks_4281
Insert of existing embedding ID: chunks_4281
Add of existing embedding ID: chunks_4282
Insert of existing embedding ID: chunks_4282
Add of existing embedding ID: chunks_4283
Insert of existing embedding ID: chunks_4283
Add of existing embedding ID: chunks_4284
Insert of existing embedding ID: chunks_4284
Add of existing embedding ID: chunks_4285
Insert of existing embedding ID: chunks_4285
Add of existing embedding ID: chunks_4286
Insert of existing embedding ID: chunks_428

Add of existing embedding ID: chunks_4370
Insert of existing embedding ID: chunks_4370
Add of existing embedding ID: chunks_4371
Insert of existing embedding ID: chunks_4371
Add of existing embedding ID: chunks_4372
Insert of existing embedding ID: chunks_4372
Add of existing embedding ID: chunks_4373
Insert of existing embedding ID: chunks_4373
Add of existing embedding ID: chunks_4374
Insert of existing embedding ID: chunks_4374
Add of existing embedding ID: chunks_4375
Insert of existing embedding ID: chunks_4375
Add of existing embedding ID: chunks_4376
Insert of existing embedding ID: chunks_4376
Add of existing embedding ID: chunks_4377
Insert of existing embedding ID: chunks_4377
Add of existing embedding ID: chunks_4378
Insert of existing embedding ID: chunks_4378
Add of existing embedding ID: chunks_4379
Insert of existing embedding ID: chunks_4379
Add of existing embedding ID: chunks_4380
Insert of existing embedding ID: chunks_4380
Add of existing embedding ID: chunks_4381
I

Insert of existing embedding ID: chunks_4464
Add of existing embedding ID: chunks_4465
Insert of existing embedding ID: chunks_4465
Add of existing embedding ID: chunks_4466
Insert of existing embedding ID: chunks_4466
Add of existing embedding ID: chunks_4467
Insert of existing embedding ID: chunks_4467
Add of existing embedding ID: chunks_4468
Insert of existing embedding ID: chunks_4468
Add of existing embedding ID: chunks_4469
Insert of existing embedding ID: chunks_4469
Add of existing embedding ID: chunks_4470
Insert of existing embedding ID: chunks_4470
Add of existing embedding ID: chunks_4471
Insert of existing embedding ID: chunks_4471
Add of existing embedding ID: chunks_4472
Insert of existing embedding ID: chunks_4472
Add of existing embedding ID: chunks_4473
Insert of existing embedding ID: chunks_4473
Add of existing embedding ID: chunks_4474
Insert of existing embedding ID: chunks_4474
Add of existing embedding ID: chunks_4475
Insert of existing embedding ID: chunks_447

Add of existing embedding ID: chunks_4559
Insert of existing embedding ID: chunks_4559
Add of existing embedding ID: chunks_4560
Insert of existing embedding ID: chunks_4560
Add of existing embedding ID: chunks_4561
Insert of existing embedding ID: chunks_4561
Add of existing embedding ID: chunks_4562
Insert of existing embedding ID: chunks_4562
Add of existing embedding ID: chunks_4563
Insert of existing embedding ID: chunks_4563
Add of existing embedding ID: chunks_4564
Insert of existing embedding ID: chunks_4564
Add of existing embedding ID: chunks_4565
Insert of existing embedding ID: chunks_4565
Add of existing embedding ID: chunks_4566
Insert of existing embedding ID: chunks_4566
Add of existing embedding ID: chunks_4567
Insert of existing embedding ID: chunks_4567
Add of existing embedding ID: chunks_4568
Insert of existing embedding ID: chunks_4568
Add of existing embedding ID: chunks_4569
Insert of existing embedding ID: chunks_4569
Add of existing embedding ID: chunks_4570
I

Insert of existing embedding ID: chunks_4653
Add of existing embedding ID: chunks_4654
Insert of existing embedding ID: chunks_4654
Add of existing embedding ID: chunks_4655
Insert of existing embedding ID: chunks_4655
Add of existing embedding ID: chunks_4656
Insert of existing embedding ID: chunks_4656
Add of existing embedding ID: chunks_4657
Insert of existing embedding ID: chunks_4657
Add of existing embedding ID: chunks_4658
Insert of existing embedding ID: chunks_4658
Add of existing embedding ID: chunks_4659
Insert of existing embedding ID: chunks_4659
Add of existing embedding ID: chunks_4660
Insert of existing embedding ID: chunks_4660
Add of existing embedding ID: chunks_4661
Insert of existing embedding ID: chunks_4661
Add of existing embedding ID: chunks_4662
Insert of existing embedding ID: chunks_4662
Add of existing embedding ID: chunks_4663
Insert of existing embedding ID: chunks_4663
Add of existing embedding ID: chunks_4664
Insert of existing embedding ID: chunks_466

Add of existing embedding ID: chunks_4748
Insert of existing embedding ID: chunks_4748
Add of existing embedding ID: chunks_4749
Insert of existing embedding ID: chunks_4749
Add of existing embedding ID: chunks_4750
Insert of existing embedding ID: chunks_4750
Add of existing embedding ID: chunks_4751
Insert of existing embedding ID: chunks_4751
Add of existing embedding ID: chunks_4752
Insert of existing embedding ID: chunks_4752
Add of existing embedding ID: chunks_4753
Insert of existing embedding ID: chunks_4753
Add of existing embedding ID: chunks_4754
Insert of existing embedding ID: chunks_4754
Add of existing embedding ID: chunks_4755
Insert of existing embedding ID: chunks_4755
Add of existing embedding ID: chunks_4756
Insert of existing embedding ID: chunks_4756
Add of existing embedding ID: chunks_4757
Insert of existing embedding ID: chunks_4757
Add of existing embedding ID: chunks_4758
Insert of existing embedding ID: chunks_4758
Add of existing embedding ID: chunks_4759
I

Insert of existing embedding ID: chunks_4842
Add of existing embedding ID: chunks_4843
Insert of existing embedding ID: chunks_4843
Add of existing embedding ID: chunks_4844
Insert of existing embedding ID: chunks_4844
Add of existing embedding ID: chunks_4845
Insert of existing embedding ID: chunks_4845
Add of existing embedding ID: chunks_4846
Insert of existing embedding ID: chunks_4846
Add of existing embedding ID: chunks_4847
Insert of existing embedding ID: chunks_4847
Add of existing embedding ID: chunks_4848
Insert of existing embedding ID: chunks_4848
Add of existing embedding ID: chunks_4849
Insert of existing embedding ID: chunks_4849
Add of existing embedding ID: chunks_4850
Insert of existing embedding ID: chunks_4850
Add of existing embedding ID: chunks_4851
Insert of existing embedding ID: chunks_4851
Add of existing embedding ID: chunks_4852
Insert of existing embedding ID: chunks_4852
Add of existing embedding ID: chunks_4853
Insert of existing embedding ID: chunks_485

Add of existing embedding ID: chunks_4937
Insert of existing embedding ID: chunks_4937
Add of existing embedding ID: chunks_4938
Insert of existing embedding ID: chunks_4938
Add of existing embedding ID: chunks_4939
Insert of existing embedding ID: chunks_4939
Add of existing embedding ID: chunks_4940
Insert of existing embedding ID: chunks_4940
Add of existing embedding ID: chunks_4941
Insert of existing embedding ID: chunks_4941
Add of existing embedding ID: chunks_4942
Insert of existing embedding ID: chunks_4942
Add of existing embedding ID: chunks_4943
Insert of existing embedding ID: chunks_4943
Add of existing embedding ID: chunks_4944
Insert of existing embedding ID: chunks_4944
Add of existing embedding ID: chunks_4945
Insert of existing embedding ID: chunks_4945
Add of existing embedding ID: chunks_4946
Insert of existing embedding ID: chunks_4946
Add of existing embedding ID: chunks_4947
Insert of existing embedding ID: chunks_4947
Add of existing embedding ID: chunks_4948
I

Insert of existing embedding ID: chunks_5031
Add of existing embedding ID: chunks_5032
Insert of existing embedding ID: chunks_5032
Add of existing embedding ID: chunks_5033
Insert of existing embedding ID: chunks_5033
Add of existing embedding ID: chunks_5034
Insert of existing embedding ID: chunks_5034
Add of existing embedding ID: chunks_5035
Insert of existing embedding ID: chunks_5035
Add of existing embedding ID: chunks_5036
Insert of existing embedding ID: chunks_5036
Add of existing embedding ID: chunks_5037
Insert of existing embedding ID: chunks_5037
Add of existing embedding ID: chunks_5038
Insert of existing embedding ID: chunks_5038
Add of existing embedding ID: chunks_5039
Insert of existing embedding ID: chunks_5039
Add of existing embedding ID: chunks_5040
Insert of existing embedding ID: chunks_5040
Add of existing embedding ID: chunks_5041
Insert of existing embedding ID: chunks_5041
Add of existing embedding ID: chunks_5042
Insert of existing embedding ID: chunks_504

Add of existing embedding ID: chunks_5126
Insert of existing embedding ID: chunks_5126
Add of existing embedding ID: chunks_5127
Insert of existing embedding ID: chunks_5127
Add of existing embedding ID: chunks_5128
Insert of existing embedding ID: chunks_5128
Add of existing embedding ID: chunks_5129
Insert of existing embedding ID: chunks_5129
Add of existing embedding ID: chunks_5130
Insert of existing embedding ID: chunks_5130
Add of existing embedding ID: chunks_5131
Insert of existing embedding ID: chunks_5131
Add of existing embedding ID: chunks_5132
Insert of existing embedding ID: chunks_5132
Add of existing embedding ID: chunks_5133
Insert of existing embedding ID: chunks_5133
Add of existing embedding ID: chunks_5134
Insert of existing embedding ID: chunks_5134
Add of existing embedding ID: chunks_5135
Insert of existing embedding ID: chunks_5135
Add of existing embedding ID: chunks_5136
Insert of existing embedding ID: chunks_5136
Add of existing embedding ID: chunks_5137
I

Insert of existing embedding ID: chunks_5220
Add of existing embedding ID: chunks_5221
Insert of existing embedding ID: chunks_5221
Add of existing embedding ID: chunks_5222
Insert of existing embedding ID: chunks_5222
Add of existing embedding ID: chunks_5223
Insert of existing embedding ID: chunks_5223
Add of existing embedding ID: chunks_5224
Insert of existing embedding ID: chunks_5224
Add of existing embedding ID: chunks_5225
Insert of existing embedding ID: chunks_5225
Add of existing embedding ID: chunks_5226
Insert of existing embedding ID: chunks_5226
Add of existing embedding ID: chunks_5227
Insert of existing embedding ID: chunks_5227
Add of existing embedding ID: chunks_5228
Insert of existing embedding ID: chunks_5228
Add of existing embedding ID: chunks_5229
Insert of existing embedding ID: chunks_5229
Add of existing embedding ID: chunks_5230
Insert of existing embedding ID: chunks_5230
Add of existing embedding ID: chunks_5231
Insert of existing embedding ID: chunks_523

Add of existing embedding ID: chunks_5315
Insert of existing embedding ID: chunks_5315
Add of existing embedding ID: chunks_5316
Insert of existing embedding ID: chunks_5316
Add of existing embedding ID: chunks_5317
Insert of existing embedding ID: chunks_5317
Add of existing embedding ID: chunks_5318
Insert of existing embedding ID: chunks_5318
Add of existing embedding ID: chunks_5319
Insert of existing embedding ID: chunks_5319
Add of existing embedding ID: chunks_5320
Insert of existing embedding ID: chunks_5320
Add of existing embedding ID: chunks_5321
Insert of existing embedding ID: chunks_5321
Add of existing embedding ID: chunks_5322
Insert of existing embedding ID: chunks_5322
Add of existing embedding ID: chunks_5323
Insert of existing embedding ID: chunks_5323
Add of existing embedding ID: chunks_5324
Insert of existing embedding ID: chunks_5324
Add of existing embedding ID: chunks_5325
Insert of existing embedding ID: chunks_5325
Add of existing embedding ID: chunks_5326
I

Insert of existing embedding ID: chunks_5409
Add of existing embedding ID: chunks_5410
Insert of existing embedding ID: chunks_5410
Add of existing embedding ID: chunks_5411
Insert of existing embedding ID: chunks_5411
Add of existing embedding ID: chunks_5412
Insert of existing embedding ID: chunks_5412
Add of existing embedding ID: chunks_5413
Insert of existing embedding ID: chunks_5413
Add of existing embedding ID: chunks_5414
Insert of existing embedding ID: chunks_5414
Add of existing embedding ID: chunks_5415
Insert of existing embedding ID: chunks_5415
Add of existing embedding ID: chunks_5416
Insert of existing embedding ID: chunks_5416
Add of existing embedding ID: chunks_5417
Insert of existing embedding ID: chunks_5417
Add of existing embedding ID: chunks_5418
Insert of existing embedding ID: chunks_5418
Add of existing embedding ID: chunks_5419
Insert of existing embedding ID: chunks_5419
Add of existing embedding ID: chunks_5420
Insert of existing embedding ID: chunks_542

Add of existing embedding ID: chunks_5504
Insert of existing embedding ID: chunks_5504
Add of existing embedding ID: chunks_5505
Insert of existing embedding ID: chunks_5505
Add of existing embedding ID: chunks_5506
Insert of existing embedding ID: chunks_5506
Add of existing embedding ID: chunks_5507
Insert of existing embedding ID: chunks_5507
Add of existing embedding ID: chunks_5508
Insert of existing embedding ID: chunks_5508
Add of existing embedding ID: chunks_5509
Insert of existing embedding ID: chunks_5509
Add of existing embedding ID: chunks_5510
Insert of existing embedding ID: chunks_5510
Add of existing embedding ID: chunks_5511
Insert of existing embedding ID: chunks_5511
Add of existing embedding ID: chunks_5512
Insert of existing embedding ID: chunks_5512
Add of existing embedding ID: chunks_5513
Insert of existing embedding ID: chunks_5513
Add of existing embedding ID: chunks_5514
Insert of existing embedding ID: chunks_5514
Add of existing embedding ID: chunks_5515
I

Insert of existing embedding ID: chunks_5598
Add of existing embedding ID: chunks_5599
Insert of existing embedding ID: chunks_5599
Add of existing embedding ID: chunks_5600
Insert of existing embedding ID: chunks_5600
Add of existing embedding ID: chunks_5601
Insert of existing embedding ID: chunks_5601
Add of existing embedding ID: chunks_5602
Insert of existing embedding ID: chunks_5602
Add of existing embedding ID: chunks_5603
Insert of existing embedding ID: chunks_5603
Add of existing embedding ID: chunks_5604
Insert of existing embedding ID: chunks_5604
Add of existing embedding ID: chunks_5605
Insert of existing embedding ID: chunks_5605
Add of existing embedding ID: chunks_5606
Insert of existing embedding ID: chunks_5606
Add of existing embedding ID: chunks_5607
Insert of existing embedding ID: chunks_5607
Add of existing embedding ID: chunks_5608
Insert of existing embedding ID: chunks_5608
Add of existing embedding ID: chunks_5609
Insert of existing embedding ID: chunks_560

Add of existing embedding ID: chunks_5693
Insert of existing embedding ID: chunks_5693
Add of existing embedding ID: chunks_5694
Insert of existing embedding ID: chunks_5694
Add of existing embedding ID: chunks_5695
Insert of existing embedding ID: chunks_5695
Add of existing embedding ID: chunks_5696
Insert of existing embedding ID: chunks_5696
Add of existing embedding ID: chunks_5697
Insert of existing embedding ID: chunks_5697
Add of existing embedding ID: chunks_5698
Insert of existing embedding ID: chunks_5698
Add of existing embedding ID: chunks_5699
Insert of existing embedding ID: chunks_5699
Add of existing embedding ID: chunks_5700
Insert of existing embedding ID: chunks_5700
Add of existing embedding ID: chunks_5701
Insert of existing embedding ID: chunks_5701
Add of existing embedding ID: chunks_5702
Insert of existing embedding ID: chunks_5702
Add of existing embedding ID: chunks_5703
Insert of existing embedding ID: chunks_5703
Add of existing embedding ID: chunks_5704
I

Insert of existing embedding ID: chunks_5787
Add of existing embedding ID: chunks_5788
Insert of existing embedding ID: chunks_5788
Add of existing embedding ID: chunks_5789
Insert of existing embedding ID: chunks_5789
Add of existing embedding ID: chunks_5790
Insert of existing embedding ID: chunks_5790
Add of existing embedding ID: chunks_5791
Insert of existing embedding ID: chunks_5791
Add of existing embedding ID: chunks_5792
Insert of existing embedding ID: chunks_5792
Add of existing embedding ID: chunks_5793
Insert of existing embedding ID: chunks_5793
Add of existing embedding ID: chunks_5794
Insert of existing embedding ID: chunks_5794
Add of existing embedding ID: chunks_5795
Insert of existing embedding ID: chunks_5795
Add of existing embedding ID: chunks_5796
Insert of existing embedding ID: chunks_5796
Add of existing embedding ID: chunks_5797
Insert of existing embedding ID: chunks_5797
Add of existing embedding ID: chunks_5798
Insert of existing embedding ID: chunks_579

Add of existing embedding ID: chunks_5882
Insert of existing embedding ID: chunks_5882
Add of existing embedding ID: chunks_5883
Insert of existing embedding ID: chunks_5883
Add of existing embedding ID: chunks_5884
Insert of existing embedding ID: chunks_5884
Add of existing embedding ID: chunks_5885
Insert of existing embedding ID: chunks_5885
Add of existing embedding ID: chunks_5886
Insert of existing embedding ID: chunks_5886
Add of existing embedding ID: chunks_5887
Insert of existing embedding ID: chunks_5887
Add of existing embedding ID: chunks_5888
Insert of existing embedding ID: chunks_5888
Add of existing embedding ID: chunks_5889
Insert of existing embedding ID: chunks_5889
Add of existing embedding ID: chunks_5890
Insert of existing embedding ID: chunks_5890
Add of existing embedding ID: chunks_5891
Insert of existing embedding ID: chunks_5891
Add of existing embedding ID: chunks_5892
Insert of existing embedding ID: chunks_5892
Add of existing embedding ID: chunks_5893
I

Insert of existing embedding ID: chunks_5976
Add of existing embedding ID: chunks_5977
Insert of existing embedding ID: chunks_5977
Add of existing embedding ID: chunks_5978
Insert of existing embedding ID: chunks_5978
Add of existing embedding ID: chunks_5979
Insert of existing embedding ID: chunks_5979
Add of existing embedding ID: chunks_5980
Insert of existing embedding ID: chunks_5980
Add of existing embedding ID: chunks_5981
Insert of existing embedding ID: chunks_5981
Add of existing embedding ID: chunks_5982
Insert of existing embedding ID: chunks_5982
Add of existing embedding ID: chunks_5983
Insert of existing embedding ID: chunks_5983
Add of existing embedding ID: chunks_5984
Insert of existing embedding ID: chunks_5984
Add of existing embedding ID: chunks_5985
Insert of existing embedding ID: chunks_5985
Add of existing embedding ID: chunks_5986
Insert of existing embedding ID: chunks_5986
Add of existing embedding ID: chunks_5987
Insert of existing embedding ID: chunks_598

Add of existing embedding ID: chunks_6071
Insert of existing embedding ID: chunks_6071
Add of existing embedding ID: chunks_6072
Insert of existing embedding ID: chunks_6072
Add of existing embedding ID: chunks_6073
Insert of existing embedding ID: chunks_6073
Add of existing embedding ID: chunks_6074
Insert of existing embedding ID: chunks_6074
Add of existing embedding ID: chunks_6075
Insert of existing embedding ID: chunks_6075
Add of existing embedding ID: chunks_6076
Insert of existing embedding ID: chunks_6076
Add of existing embedding ID: chunks_6077
Insert of existing embedding ID: chunks_6077
Add of existing embedding ID: chunks_6078
Insert of existing embedding ID: chunks_6078
Add of existing embedding ID: chunks_6079
Insert of existing embedding ID: chunks_6079
Add of existing embedding ID: chunks_6080
Insert of existing embedding ID: chunks_6080
Add of existing embedding ID: chunks_6081
Insert of existing embedding ID: chunks_6081
Add of existing embedding ID: chunks_6082
I

Insert of existing embedding ID: chunks_6165
Add of existing embedding ID: chunks_6166
Insert of existing embedding ID: chunks_6166
Add of existing embedding ID: chunks_6167
Insert of existing embedding ID: chunks_6167
Add of existing embedding ID: chunks_6168
Insert of existing embedding ID: chunks_6168
Add of existing embedding ID: chunks_6169
Insert of existing embedding ID: chunks_6169
Add of existing embedding ID: chunks_6170
Insert of existing embedding ID: chunks_6170
Add of existing embedding ID: chunks_6171
Insert of existing embedding ID: chunks_6171
Add of existing embedding ID: chunks_6172
Insert of existing embedding ID: chunks_6172
Add of existing embedding ID: chunks_6173
Insert of existing embedding ID: chunks_6173
Add of existing embedding ID: chunks_6174
Insert of existing embedding ID: chunks_6174
Add of existing embedding ID: chunks_6175
Insert of existing embedding ID: chunks_6175
Add of existing embedding ID: chunks_6176
Insert of existing embedding ID: chunks_617

Add of existing embedding ID: chunks_6260
Insert of existing embedding ID: chunks_6260
Add of existing embedding ID: chunks_6261
Insert of existing embedding ID: chunks_6261
Add of existing embedding ID: chunks_6262
Insert of existing embedding ID: chunks_6262
Add of existing embedding ID: chunks_6263
Insert of existing embedding ID: chunks_6263
Add of existing embedding ID: chunks_6264
Insert of existing embedding ID: chunks_6264
Add of existing embedding ID: chunks_6265
Insert of existing embedding ID: chunks_6265
Add of existing embedding ID: chunks_6266
Insert of existing embedding ID: chunks_6266
Add of existing embedding ID: chunks_6267
Insert of existing embedding ID: chunks_6267
Add of existing embedding ID: chunks_6268
Insert of existing embedding ID: chunks_6268
Add of existing embedding ID: chunks_6269
Insert of existing embedding ID: chunks_6269
Add of existing embedding ID: chunks_6270
Insert of existing embedding ID: chunks_6270
Add of existing embedding ID: chunks_6271
I

Insert of existing embedding ID: chunks_6354
Add of existing embedding ID: chunks_6355
Insert of existing embedding ID: chunks_6355
Add of existing embedding ID: chunks_6356
Insert of existing embedding ID: chunks_6356
Add of existing embedding ID: chunks_6357
Insert of existing embedding ID: chunks_6357
Add of existing embedding ID: chunks_6358
Insert of existing embedding ID: chunks_6358
Add of existing embedding ID: chunks_6359
Insert of existing embedding ID: chunks_6359
Add of existing embedding ID: chunks_6360
Insert of existing embedding ID: chunks_6360
Add of existing embedding ID: chunks_6361
Insert of existing embedding ID: chunks_6361
Add of existing embedding ID: chunks_6362
Insert of existing embedding ID: chunks_6362
Add of existing embedding ID: chunks_6363
Insert of existing embedding ID: chunks_6363
Add of existing embedding ID: chunks_6364
Insert of existing embedding ID: chunks_6364
Add of existing embedding ID: chunks_6365
Insert of existing embedding ID: chunks_636

Add of existing embedding ID: chunks_6449
Insert of existing embedding ID: chunks_6449
Add of existing embedding ID: chunks_6450
Insert of existing embedding ID: chunks_6450
Add of existing embedding ID: chunks_6451
Insert of existing embedding ID: chunks_6451
Add of existing embedding ID: chunks_6452
Insert of existing embedding ID: chunks_6452
Add of existing embedding ID: chunks_6453
Insert of existing embedding ID: chunks_6453
Add of existing embedding ID: chunks_6454
Insert of existing embedding ID: chunks_6454
Add of existing embedding ID: chunks_6455
Insert of existing embedding ID: chunks_6455
Add of existing embedding ID: chunks_6456
Insert of existing embedding ID: chunks_6456
Add of existing embedding ID: chunks_6457
Insert of existing embedding ID: chunks_6457
Add of existing embedding ID: chunks_6458
Insert of existing embedding ID: chunks_6458
Add of existing embedding ID: chunks_6459
Insert of existing embedding ID: chunks_6459
Add of existing embedding ID: chunks_6460
I

Insert of existing embedding ID: chunks_6543
Add of existing embedding ID: chunks_6544
Insert of existing embedding ID: chunks_6544
Add of existing embedding ID: chunks_6545
Insert of existing embedding ID: chunks_6545
Add of existing embedding ID: chunks_6546
Insert of existing embedding ID: chunks_6546
Add of existing embedding ID: chunks_6547
Insert of existing embedding ID: chunks_6547
Add of existing embedding ID: chunks_6548
Insert of existing embedding ID: chunks_6548
Add of existing embedding ID: chunks_6549
Insert of existing embedding ID: chunks_6549
Add of existing embedding ID: chunks_6550
Insert of existing embedding ID: chunks_6550
Add of existing embedding ID: chunks_6551
Insert of existing embedding ID: chunks_6551
Add of existing embedding ID: chunks_6552
Insert of existing embedding ID: chunks_6552
Add of existing embedding ID: chunks_6553
Insert of existing embedding ID: chunks_6553
Add of existing embedding ID: chunks_6554
Insert of existing embedding ID: chunks_655

Add of existing embedding ID: chunks_6638
Insert of existing embedding ID: chunks_6638
Add of existing embedding ID: chunks_6639
Insert of existing embedding ID: chunks_6639
Add of existing embedding ID: chunks_6640
Insert of existing embedding ID: chunks_6640
Add of existing embedding ID: chunks_6641
Insert of existing embedding ID: chunks_6641
Add of existing embedding ID: chunks_6642
Insert of existing embedding ID: chunks_6642
Add of existing embedding ID: chunks_6643
Insert of existing embedding ID: chunks_6643
Add of existing embedding ID: chunks_6644
Insert of existing embedding ID: chunks_6644
Add of existing embedding ID: chunks_6645
Insert of existing embedding ID: chunks_6645
Add of existing embedding ID: chunks_6646
Insert of existing embedding ID: chunks_6646
Add of existing embedding ID: chunks_6647
Insert of existing embedding ID: chunks_6647
Add of existing embedding ID: chunks_6648
Insert of existing embedding ID: chunks_6648
Add of existing embedding ID: chunks_6649
I

Insert of existing embedding ID: chunks_6732
Add of existing embedding ID: chunks_6733
Insert of existing embedding ID: chunks_6733
Add of existing embedding ID: chunks_6734
Insert of existing embedding ID: chunks_6734
Add of existing embedding ID: chunks_6735
Insert of existing embedding ID: chunks_6735
Add of existing embedding ID: chunks_6736
Insert of existing embedding ID: chunks_6736
Add of existing embedding ID: chunks_6737
Insert of existing embedding ID: chunks_6737
Add of existing embedding ID: chunks_6738
Insert of existing embedding ID: chunks_6738
Add of existing embedding ID: chunks_6739
Insert of existing embedding ID: chunks_6739
Add of existing embedding ID: chunks_6740
Insert of existing embedding ID: chunks_6740
Add of existing embedding ID: chunks_6741
Insert of existing embedding ID: chunks_6741
Add of existing embedding ID: chunks_6742
Insert of existing embedding ID: chunks_6742
Add of existing embedding ID: chunks_6743
Insert of existing embedding ID: chunks_674

Add of existing embedding ID: chunks_6827
Insert of existing embedding ID: chunks_6827
Add of existing embedding ID: chunks_6828
Insert of existing embedding ID: chunks_6828
Add of existing embedding ID: chunks_6829
Insert of existing embedding ID: chunks_6829
Add of existing embedding ID: chunks_6830
Insert of existing embedding ID: chunks_6830
Add of existing embedding ID: chunks_6831
Insert of existing embedding ID: chunks_6831
Add of existing embedding ID: chunks_6832
Insert of existing embedding ID: chunks_6832
Add of existing embedding ID: chunks_6833
Insert of existing embedding ID: chunks_6833
Add of existing embedding ID: chunks_6834
Insert of existing embedding ID: chunks_6834
Add of existing embedding ID: chunks_6835
Insert of existing embedding ID: chunks_6835
Add of existing embedding ID: chunks_6836
Insert of existing embedding ID: chunks_6836
Add of existing embedding ID: chunks_6837
Insert of existing embedding ID: chunks_6837
Add of existing embedding ID: chunks_6838
I

Insert of existing embedding ID: chunks_6921
Add of existing embedding ID: chunks_6922
Insert of existing embedding ID: chunks_6922
Add of existing embedding ID: chunks_6923
Insert of existing embedding ID: chunks_6923
Add of existing embedding ID: chunks_6924
Insert of existing embedding ID: chunks_6924
Add of existing embedding ID: chunks_6925
Insert of existing embedding ID: chunks_6925
Add of existing embedding ID: chunks_6926
Insert of existing embedding ID: chunks_6926
Add of existing embedding ID: chunks_6927
Insert of existing embedding ID: chunks_6927
Add of existing embedding ID: chunks_6928
Insert of existing embedding ID: chunks_6928
Add of existing embedding ID: chunks_6929
Insert of existing embedding ID: chunks_6929
Add of existing embedding ID: chunks_6930
Insert of existing embedding ID: chunks_6930
Add of existing embedding ID: chunks_6931
Insert of existing embedding ID: chunks_6931
Add of existing embedding ID: chunks_6932
Insert of existing embedding ID: chunks_693

Add of existing embedding ID: chunks_7016
Insert of existing embedding ID: chunks_7016
Add of existing embedding ID: chunks_7017
Insert of existing embedding ID: chunks_7017
Add of existing embedding ID: chunks_7018
Insert of existing embedding ID: chunks_7018
Add of existing embedding ID: chunks_7019
Insert of existing embedding ID: chunks_7019
Add of existing embedding ID: chunks_7020
Insert of existing embedding ID: chunks_7020
Add of existing embedding ID: chunks_7021
Insert of existing embedding ID: chunks_7021
Add of existing embedding ID: chunks_7022
Insert of existing embedding ID: chunks_7022
Add of existing embedding ID: chunks_7023
Insert of existing embedding ID: chunks_7023
Add of existing embedding ID: chunks_7024
Insert of existing embedding ID: chunks_7024
Add of existing embedding ID: chunks_7025
Insert of existing embedding ID: chunks_7025
Add of existing embedding ID: chunks_7026
Insert of existing embedding ID: chunks_7026
Add of existing embedding ID: chunks_7027
I

Insert of existing embedding ID: chunks_7110
Add of existing embedding ID: chunks_7111
Insert of existing embedding ID: chunks_7111
Add of existing embedding ID: chunks_7112
Insert of existing embedding ID: chunks_7112
Add of existing embedding ID: chunks_7113
Insert of existing embedding ID: chunks_7113
Add of existing embedding ID: chunks_7114
Insert of existing embedding ID: chunks_7114
Add of existing embedding ID: chunks_7115
Insert of existing embedding ID: chunks_7115
Add of existing embedding ID: chunks_7116
Insert of existing embedding ID: chunks_7116
Add of existing embedding ID: chunks_7117
Insert of existing embedding ID: chunks_7117
Add of existing embedding ID: chunks_7118
Insert of existing embedding ID: chunks_7118
Add of existing embedding ID: chunks_7119
Insert of existing embedding ID: chunks_7119
Add of existing embedding ID: chunks_7120
Insert of existing embedding ID: chunks_7120
Add of existing embedding ID: chunks_7121
Insert of existing embedding ID: chunks_712

Add of existing embedding ID: chunks_7205
Insert of existing embedding ID: chunks_7205
Add of existing embedding ID: chunks_7206
Insert of existing embedding ID: chunks_7206
Add of existing embedding ID: chunks_7207
Insert of existing embedding ID: chunks_7207
Add of existing embedding ID: chunks_7208
Insert of existing embedding ID: chunks_7208
Add of existing embedding ID: chunks_7209
Insert of existing embedding ID: chunks_7209
Add of existing embedding ID: chunks_7210
Insert of existing embedding ID: chunks_7210
Add of existing embedding ID: chunks_7211
Insert of existing embedding ID: chunks_7211
Add of existing embedding ID: chunks_7212
Insert of existing embedding ID: chunks_7212
Add of existing embedding ID: chunks_7213
Insert of existing embedding ID: chunks_7213
Add of existing embedding ID: chunks_7214
Insert of existing embedding ID: chunks_7214
Add of existing embedding ID: chunks_7215
Insert of existing embedding ID: chunks_7215
Add of existing embedding ID: chunks_7216
I

Insert of existing embedding ID: chunks_7299
Add of existing embedding ID: chunks_7300
Insert of existing embedding ID: chunks_7300
Add of existing embedding ID: chunks_7301
Insert of existing embedding ID: chunks_7301
Add of existing embedding ID: chunks_7302
Insert of existing embedding ID: chunks_7302
Add of existing embedding ID: chunks_7303
Insert of existing embedding ID: chunks_7303
Add of existing embedding ID: chunks_7304
Insert of existing embedding ID: chunks_7304
Add of existing embedding ID: chunks_7305
Insert of existing embedding ID: chunks_7305
Add of existing embedding ID: chunks_7306
Insert of existing embedding ID: chunks_7306
Add of existing embedding ID: chunks_7307
Insert of existing embedding ID: chunks_7307
Add of existing embedding ID: chunks_7308
Insert of existing embedding ID: chunks_7308
Add of existing embedding ID: chunks_7309
Insert of existing embedding ID: chunks_7309
Add of existing embedding ID: chunks_7310
Insert of existing embedding ID: chunks_731

Add of existing embedding ID: chunks_7394
Insert of existing embedding ID: chunks_7394
Add of existing embedding ID: chunks_7395
Insert of existing embedding ID: chunks_7395
Add of existing embedding ID: chunks_7396
Insert of existing embedding ID: chunks_7396
Add of existing embedding ID: chunks_7397
Insert of existing embedding ID: chunks_7397
Add of existing embedding ID: chunks_7398
Insert of existing embedding ID: chunks_7398
Add of existing embedding ID: chunks_7399
Insert of existing embedding ID: chunks_7399
Add of existing embedding ID: chunks_7400
Insert of existing embedding ID: chunks_7400
Add of existing embedding ID: chunks_7401
Insert of existing embedding ID: chunks_7401
Add of existing embedding ID: chunks_7402
Insert of existing embedding ID: chunks_7402
Add of existing embedding ID: chunks_7403
Insert of existing embedding ID: chunks_7403
Add of existing embedding ID: chunks_7404
Insert of existing embedding ID: chunks_7404
Add of existing embedding ID: chunks_7405
I

Insert of existing embedding ID: chunks_7488
Add of existing embedding ID: chunks_7489
Insert of existing embedding ID: chunks_7489
Add of existing embedding ID: chunks_7490
Insert of existing embedding ID: chunks_7490
Add of existing embedding ID: chunks_7491
Insert of existing embedding ID: chunks_7491
Add of existing embedding ID: chunks_7492
Insert of existing embedding ID: chunks_7492
Add of existing embedding ID: chunks_7493
Insert of existing embedding ID: chunks_7493
Add of existing embedding ID: chunks_7494
Insert of existing embedding ID: chunks_7494
Add of existing embedding ID: chunks_7495
Insert of existing embedding ID: chunks_7495
Add of existing embedding ID: chunks_7496
Insert of existing embedding ID: chunks_7496
Add of existing embedding ID: chunks_7497
Insert of existing embedding ID: chunks_7497
Add of existing embedding ID: chunks_7498
Insert of existing embedding ID: chunks_7498
Add of existing embedding ID: chunks_7499
Insert of existing embedding ID: chunks_749

Add of existing embedding ID: chunks_7583
Insert of existing embedding ID: chunks_7583
Add of existing embedding ID: chunks_7584
Insert of existing embedding ID: chunks_7584
Add of existing embedding ID: chunks_7585
Insert of existing embedding ID: chunks_7585
Add of existing embedding ID: chunks_7586
Insert of existing embedding ID: chunks_7586
Add of existing embedding ID: chunks_7587
Insert of existing embedding ID: chunks_7587
Add of existing embedding ID: chunks_7588
Insert of existing embedding ID: chunks_7588
Add of existing embedding ID: chunks_7589
Insert of existing embedding ID: chunks_7589
Add of existing embedding ID: chunks_7590
Insert of existing embedding ID: chunks_7590
Add of existing embedding ID: chunks_7591
Insert of existing embedding ID: chunks_7591
Add of existing embedding ID: chunks_7592
Insert of existing embedding ID: chunks_7592
Add of existing embedding ID: chunks_7593
Insert of existing embedding ID: chunks_7593
Add of existing embedding ID: chunks_7594
I

Insert of existing embedding ID: chunks_7677
Add of existing embedding ID: chunks_7678
Insert of existing embedding ID: chunks_7678
Add of existing embedding ID: chunks_7679
Insert of existing embedding ID: chunks_7679
Add of existing embedding ID: chunks_7680
Insert of existing embedding ID: chunks_7680
Add of existing embedding ID: chunks_7681
Insert of existing embedding ID: chunks_7681
Add of existing embedding ID: chunks_7682
Insert of existing embedding ID: chunks_7682
Add of existing embedding ID: chunks_7683
Insert of existing embedding ID: chunks_7683
Add of existing embedding ID: chunks_7684
Insert of existing embedding ID: chunks_7684
Add of existing embedding ID: chunks_7685
Insert of existing embedding ID: chunks_7685
Add of existing embedding ID: chunks_7686
Insert of existing embedding ID: chunks_7686
Add of existing embedding ID: chunks_7687
Insert of existing embedding ID: chunks_7687
Add of existing embedding ID: chunks_7688
Insert of existing embedding ID: chunks_768

Add of existing embedding ID: chunks_7772
Insert of existing embedding ID: chunks_7772
Add of existing embedding ID: chunks_7773
Insert of existing embedding ID: chunks_7773
Add of existing embedding ID: chunks_7774
Insert of existing embedding ID: chunks_7774
Add of existing embedding ID: chunks_7775
Insert of existing embedding ID: chunks_7775
Add of existing embedding ID: chunks_7776
Insert of existing embedding ID: chunks_7776
Add of existing embedding ID: chunks_7777
Insert of existing embedding ID: chunks_7777
Add of existing embedding ID: chunks_7778
Insert of existing embedding ID: chunks_7778
Add of existing embedding ID: chunks_7779
Insert of existing embedding ID: chunks_7779
Add of existing embedding ID: chunks_7780
Insert of existing embedding ID: chunks_7780
Add of existing embedding ID: chunks_7781
Insert of existing embedding ID: chunks_7781
Add of existing embedding ID: chunks_7782
Insert of existing embedding ID: chunks_7782
Add of existing embedding ID: chunks_7783
I

Insert of existing embedding ID: chunks_7866
Add of existing embedding ID: chunks_7867
Insert of existing embedding ID: chunks_7867
Add of existing embedding ID: chunks_7868
Insert of existing embedding ID: chunks_7868
Add of existing embedding ID: chunks_7869
Insert of existing embedding ID: chunks_7869
Add of existing embedding ID: chunks_7870
Insert of existing embedding ID: chunks_7870
Add of existing embedding ID: chunks_7871
Insert of existing embedding ID: chunks_7871
Add of existing embedding ID: chunks_7872
Insert of existing embedding ID: chunks_7872
Add of existing embedding ID: chunks_7873
Insert of existing embedding ID: chunks_7873
Add of existing embedding ID: chunks_7874
Insert of existing embedding ID: chunks_7874
Add of existing embedding ID: chunks_7875
Insert of existing embedding ID: chunks_7875
Add of existing embedding ID: chunks_7876
Insert of existing embedding ID: chunks_7876
Add of existing embedding ID: chunks_7877
Insert of existing embedding ID: chunks_787

Add of existing embedding ID: chunks_7961
Insert of existing embedding ID: chunks_7961
Add of existing embedding ID: chunks_7962
Insert of existing embedding ID: chunks_7962
Add of existing embedding ID: chunks_7963
Insert of existing embedding ID: chunks_7963
Add of existing embedding ID: chunks_7964
Insert of existing embedding ID: chunks_7964
Add of existing embedding ID: chunks_7965
Insert of existing embedding ID: chunks_7965
Add of existing embedding ID: chunks_7966
Insert of existing embedding ID: chunks_7966
Add of existing embedding ID: chunks_7967
Insert of existing embedding ID: chunks_7967
Add of existing embedding ID: chunks_7968
Insert of existing embedding ID: chunks_7968
Add of existing embedding ID: chunks_7969
Insert of existing embedding ID: chunks_7969
Add of existing embedding ID: chunks_7970
Insert of existing embedding ID: chunks_7970
Add of existing embedding ID: chunks_7971
Insert of existing embedding ID: chunks_7971
Add of existing embedding ID: chunks_7972
I

In [243]:
collection.get(include=["documents"])

{'ids': ['chunk_0',
  'chunk_1',
  'chunk_10',
  'chunk_100',
  'chunk_1000',
  'chunk_1001',
  'chunk_1002',
  'chunk_1003',
  'chunk_1004',
  'chunk_1005',
  'chunk_1006',
  'chunk_1007',
  'chunk_1008',
  'chunk_1009',
  'chunk_101',
  'chunk_1010',
  'chunk_1011',
  'chunk_1012',
  'chunk_1013',
  'chunk_1014',
  'chunk_1015',
  'chunk_1016',
  'chunk_1017',
  'chunk_1018',
  'chunk_1019',
  'chunk_102',
  'chunk_1020',
  'chunk_1021',
  'chunk_1022',
  'chunk_1023',
  'chunk_1024',
  'chunk_1025',
  'chunk_1026',
  'chunk_1027',
  'chunk_1028',
  'chunk_1029',
  'chunk_103',
  'chunk_1030',
  'chunk_1031',
  'chunk_1032',
  'chunk_1033',
  'chunk_1034',
  'chunk_1035',
  'chunk_1036',
  'chunk_1037',
  'chunk_1038',
  'chunk_1039',
  'chunk_104',
  'chunk_1040',
  'chunk_1041',
  'chunk_1042',
  'chunk_1043',
  'chunk_1044',
  'chunk_1045',
  'chunk_1046',
  'chunk_1047',
  'chunk_1048',
  'chunk_1049',
  'chunk_105',
  'chunk_1050',
  'chunk_1051',
  'chunk_1052',
  'chunk_1053',

In [184]:
emb

tensor([-5.2582e-02, -1.0668e-03,  4.8248e-02,  2.6223e-02, -7.6096e-02,
        -1.6457e-02,  2.4056e-02, -2.4001e-02, -9.2438e-02, -4.0610e-02,
         1.8642e-02, -4.5642e-02,  3.7613e-02, -8.2195e-02, -9.2998e-03,
         5.8245e-03,  9.1322e-02,  2.4334e-02, -9.3703e-02, -6.6399e-02,
        -5.2487e-02, -1.3367e-02, -7.5282e-02, -7.6823e-03, -5.9076e-02,
         1.9567e-02, -8.2752e-02, -6.7609e-02,  7.0743e-03, -1.5550e-02,
         4.5352e-02,  8.9172e-02,  3.7166e-02,  1.9877e-02, -5.1579e-02,
         4.7629e-02,  1.4343e-02,  1.3528e-01,  2.1686e-02, -9.4448e-02,
        -5.3571e-02, -9.2216e-02, -7.1292e-02, -1.0110e-01,  3.8090e-02,
        -1.1791e-01,  1.2045e-02, -6.6349e-02,  1.6309e-02, -1.1391e-01,
        -6.0563e-02, -5.2546e-02,  1.2145e-01, -3.8253e-03, -5.1457e-02,
        -4.2768e-02,  2.5652e-02,  1.9279e-02, -1.8188e-02, -1.3724e-02,
        -9.3102e-02,  9.4775e-03,  2.8720e-03, -5.5901e-03,  3.2523e-02,
         4.4477e-03, -2.6833e-02,  1.0527e-02, -3.2

In [178]:
a=doc_vectors

In [183]:
emb = [doc_vec for i.toli]

[-0.07839059829711914,
 -0.11347431689500809,
 -0.004569102078676224,
 -0.005671562626957893,
 -0.023426087573170662,
 0.007325473241508007,
 -0.005651227664202452,
 -0.038666628301143646,
 -0.036171428859233856,
 0.015680037438869476,
 0.0365501306951046,
 -0.010670101270079613,
 -0.05736149847507477,
 -0.049183186143636703,
 -0.04145707190036774,
 -0.029344050213694572,
 0.05052560567855835,
 0.03549210727214813,
 -0.07650180160999298,
 0.045095499604940414,
 0.02648083120584488,
 -0.005047412123531103,
 0.04996003210544586,
 -0.03948022425174713,
 -0.10350827127695084,
 0.046609066426754,
 -0.10492847859859467,
 -0.0006327894516289234,
 0.0029297659639269114,
 -0.033578258007764816,
 -0.011554839089512825,
 0.09609846770763397,
 0.02595331147313118,
 0.03758307173848152,
 0.027452649548649788,
 0.02310495637357235,
 -0.01826608180999756,
 0.1446506381034851,
 0.03698216751217842,
 -0.02853136695921421,
 -0.0556027926504612,
 -0.07379885762929916,
 -0.007945134304463863,
 0.051659770

In [237]:
df = df.reset_index()

In [238]:
df

Unnamed: 0,index,num,name,file_content_chunks,doc_vector_pretrained_bert
0,19461,9192535,mommy.issues.(2021).eng.1cd,place start door open good morning easy say sp...,"[-0.0783905983, -0.113474317, -0.00456910208, ..."
1,29590,9199194,empire.s03.e04.cupid.kills.(2016).eng.1cd,direct result willful preventable malpractice ...,"[-0.0845821798, 0.00884224102, 0.0826942697, 0..."
2,70465,9221632,this.is.us.s05.e15.jerry.2.0.(2021).eng.1cd,narrator previously beauty beast something gem...,"[-0.070148088, -0.065490745, -0.00389705948, 0..."
3,50661,9209861,red.rose.s01.e04.manchester.innit.(2022).eng.1cd,mother hajara married bayo refused osage offer...,"[-0.0184880402, 0.0330960639, -0.0440507084, -..."
4,50587,9209767,fruits.basket.s03.e11.goodbye.(2021).eng.1cd,dialogue dialogue unknown po head thing right ...,"[0.0288344882, -0.0404044986, -0.00503935665, ..."
...,...,...,...,...,...
7995,50344,9209734,fruits.basket.s02.e03.shall.we.go.and.get.you....,working gotta go spent lot money least open ba...,"[-0.102519907, -0.0888609216, 0.0308625363, -0..."
7996,22868,9194963,american.horror.stories.s02.e03.drive.(2022).e...,deeper ever seen black side town white side to...,"[-0.000806297641, -0.0405393466, -0.0631514415..."
7997,36814,9202976,doctor.of.doom.(1963).eng.1cd,tell july revolution remember date french revo...,"[-0.0169950146, -0.0723221526, 0.00906122755, ..."
7998,40276,9204652,the.deuce.s02.e06.were.all.beasts.(2018).eng.1cd,kidding oh oh woe life empty without joy oh go...,"[-0.0259164125, -0.0884233713, 0.0332689956, -..."
