
c-btm inference #50

Merged: 6 commits into huu4ontocord:main on Nov 9, 2023
Conversation

@NourFahmy (Collaborator) commented May 10, 2023

Inference code for c-BTM, replicating formulas 2 & 3 from the c-BTM paper; tested locally.

Kindly let me know if anything else is needed!

link to #40
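
For reference, the two formulas being replicated are, as I read the c-BTM paper, roughly the following (a paraphrase in my own notation, not a quote; the exact form, e.g. whether the distance is squared, should be checked against the paper):

```
p(x_t | x_<t)    =  Σ_j  p_j(x_t | x_<t) · P(D = j | x_<t)        (cf. eq. 2)
P(D = j | x_<t)  ∝  exp( −dist(emb(x_<t), c_j) / T )              (cf. eq. 3)
```

where p_j is expert j's next-token distribution, c_j is the cluster center of domain j, emb(·) embeds the context, T is a temperature, and only the top-k weights are kept and renormalized.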

Commit: inference code for c-btm
@mrcabbage972 (Collaborator) commented May 12, 2023

Hi @NourFahmy,
The goal is to have a script which the user can call with the arguments:

  • The names of the models
  • The input data file path
  • The output file path

The script should load the models, run inference on the input data and save the results. This would allow us to evaluate the performance of the method using perplexity and also on downstream tasks.

It would be very helpful if the PR solved this end to end. It's possible to break this up into a few PRs, if you prefer.
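
For concreteness, a minimal sketch of such a CLI (flag names are placeholders, not a spec, and `cbtm_generate` is a stub standing in for the ensemble inference routine):

```python
import argparse

def cbtm_generate(prompt, model_names):
    # Stand-in for the actual c-BTM ensemble inference routine.
    raise NotImplementedError

def parse_args():
    parser = argparse.ArgumentParser(description="c-BTM ensemble inference")
    parser.add_argument("--model-names", nargs="+", required=True,
                        help="HF hub names of the expert models")
    parser.add_argument("--input-file", required=True,
                        help="path to the input data file (one prompt per line)")
    parser.add_argument("--output-file", required=True,
                        help="path to write the generated results")
    return parser.parse_args()

def main():
    args = parse_args()
    with open(args.input_file) as fin, open(args.output_file, "w") as fout:
        for prompt in fin:
            fout.write(cbtm_generate(prompt.strip(), args.model_names) + "\n")

if __name__ == "__main__":
    main()
```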

@NourFahmy (Collaborator, Author) commented May 14, 2023

Hi @mrcabbage972 - thank you for your feedback! Will update accordingly by Wednesday.

@NourFahmy changed the title from "Add files via upload" to "c-btm inference" on May 19, 2023
@NourFahmy (Collaborator, Author) commented May 19, 2023

Kindly note: this still needs to be tested!
The latest commit allows the user to call the script with the input and output file paths and the names of the models.

As I understand it, the sequence of tasks that need to be implemented for c-BTM inference is (a sketch of this loop follows below):

  1. calculate the distance between the cluster center of each domain and the embedded prompt
  2. evaluate all the distances, retain the top k, and normalize them to sum to 1; these determine the most relevant domains given the context
  3. feed the prompt to the most relevant domain experts from step 2 to generate a single new token
  4. weight each expert's token probabilities by the normalized weights from step 2
  5. choose the next token in the sequence as the one with the highest combined probability
  6. append the token to the sequence, and continue until we reach an end token or the maximum sequence length

cc: @mrcabbage972
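
For reference, a minimal sketch of that loop. Everything here is illustrative: `embed`, `cluster_centers`, the tempered-softmax weighting, and the greedy decoding are assumptions layered on the steps above, and a single tokenizer shared across experts is assumed.

```python
import numpy as np
import torch

def cbtm_generate(prompt, models, tokenizer, embed, cluster_centers,
                  top_k=2, temperature=0.1, max_len=128):
    ids = tokenizer(prompt, return_tensors="pt").input_ids
    while ids.shape[-1] < max_len:
        # Step 1: distance from each domain's cluster center to the context.
        context_emb = embed(tokenizer.decode(ids[0]))
        dists = np.array([np.linalg.norm(context_emb - c) for c in cluster_centers])
        # Step 2: keep the top-k closest domains, normalize weights to sum to 1.
        weights = np.exp(-dists / temperature)
        keep = np.argsort(-weights)[:top_k]
        norm_w = weights[keep] / weights[keep].sum()
        # Steps 3-4: each relevant expert proposes one next-token distribution,
        # weighted by its normalized closeness from step 2.
        mixed = None
        for wi, j in zip(norm_w, keep):
            with torch.no_grad():
                logits = models[j](ids).logits[:, -1, :]
            contrib = torch.softmax(logits, dim=-1) * float(wi)
            mixed = contrib if mixed is None else mixed + contrib
        # Step 5: greedily take the highest-probability token.
        next_id = mixed.argmax(dim=-1, keepdim=True)
        # Step 6: append and continue until EOS or the length cap.
        ids = torch.cat([ids, next_id], dim=-1)
        if next_id.item() == tokenizer.eos_token_id:
            break
    return tokenizer.decode(ids[0], skip_special_tokens=True)
```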

@kenhktsui (Collaborator) commented

@NourFahmy @mrcabbage972
I will be finishing PR #61 by this weekend.
Since we do not split the dataset in an unsupervised manner, I trained a few classifiers which give us the weight of each dataset (and therefore of each expert trained on it), instead of clustering as in the c-BTM paper.
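
For what it's worth, a minimal sketch of that classifier-based weighting (the checkpoint name and the one-to-one class-to-expert mapping are assumptions):

```python
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Hypothetical classifier checkpoint; its classes are assumed to map
# one-to-one onto the expert models.
clf_tokenizer = AutoTokenizer.from_pretrained("org/dataset-classifier")
clf_model = AutoModelForSequenceClassification.from_pretrained("org/dataset-classifier")

def classifier_expert_weights(prompt, top_k=2):
    inputs = clf_tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        probs = torch.softmax(clf_model(**inputs).logits[0], dim=-1)
    top = torch.topk(probs, top_k)
    # Renormalize the retained weights to sum to 1, mirroring the
    # top-k step of the cluster-based variant.
    return top.indices, top.values / top.values.sum()
```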

@huu4ontocord (Owner) commented

Where are we on this, @NourFahmy @kenhktsui @mrcabbage972?

pass in embedder for prompt

tokenizers = []

for model_name in model_names:
    model = AutoModelForCausalLM.from_pretrained(model_name)
@NourFahmy (Collaborator, Author) commented on this diff:

Some issues with loading the models and maintaining HF credentials -- I had to load the models and tokenizers outside of the function.

@huu4ontocord (Owner) replied:

OK, good to know. Strange that you can't load them. I've made all the models public now.
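
For reference, a sketch of the workaround described above: load everything once at module scope (the model names are placeholders, and the auth argument name has changed across transformers versions, so treat it as an assumption):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAMES = ["org/expert-1", "org/expert-2"]  # placeholders

# Loaded once at import time, outside any function, so HF credentials are
# only needed here. For private repos, pass token=... (use_auth_token=...
# on older transformers versions) to both from_pretrained calls.
models = [AutoModelForCausalLM.from_pretrained(n) for n in MODEL_NAMES]
tokenizers = [AutoTokenizer.from_pretrained(n) for n in MODEL_NAMES]
```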

inputs = tokenizer(prompt)
print(inputs['input_ids'])
sizeOfInputs = len(inputs['input_ids'])
outputs = model(**inputs, max_new_tokens=1,
@NourFahmy (Collaborator, Author) commented on this diff:

max_new_tokens is not a parameter of geoptx -- how can I limit the number of tokens?

@huu4ontocord (Owner) replied:

Can't you do max_length?
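
For reference, two standard ways to cap this at a single token (ordinary transformers usage, not code from this PR):

```python
import torch

inputs = tokenizer(prompt, return_tensors="pt")

# Option 1: a plain forward pass never generates more than one step;
# just take the distribution at the last position.
with torch.no_grad():
    logits = model(**inputs).logits
next_token_probs = torch.softmax(logits[:, -1, :], dim=-1)

# Option 2: generate() accepts max_new_tokens (and max_length) in
# recent transformers versions.
out = model.generate(**inputs, max_new_tokens=1)
```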

@huu4ontocord merged commit c43a93a into huu4ontocord:main on Nov 9, 2023 (1 check failed)
@huu4ontocord (Owner) commented

I merged it. You can keep adding to it in another PR.
