
How to handle the newly added item? #7

Open
hgzjy25 opened this issue Mar 22, 2024 · 1 comment

Comments

hgzjy25 commented Mar 22, 2024

You have been experimenting with academic datasets, where the entire item set serves as the candidate set. The question arises, though: how do you handle newly added items? RQ-VAE is known to be able to handle zero-shot or newly added items given their embeddings. But when a new item is assigned an ID by RQ-VAE that the LLM has never seen before, how does the LLM generate those new IDs and retrieve the new item?

zhengbw0324 (Collaborator)

@hgzjy25
Hello, thank you for your interest in our work!

Indeed, as you said, RQ-VAE can assign indices to zero-shot items when their embeddings are provided. Moreover, we do not restrict generation to seen items during inference; we only restrict which index level may be generated at each step. So, in theory, the LLM can directly generate the indices of unseen items.

In practice, however, a fully trained LLM tends to generate seen items (unseen items are effectively "illegal" index combinations), so it may be difficult to expect the LLM to generate unseen items directly. You can try adjusting the get_prefix_allowed_tokens_fn function to constrain what the LLM generates, or use some policy rules to increase the proportion of unseen items.
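To make the constrained-decoding idea concrete, here is a minimal, hedged sketch (not the repo's actual code) of a prefix-constraint function in the style of Hugging Face's `prefix_allowed_tokens_fn`. It builds a trie over the RQ-VAE index sequences of seen items: when the current prefix matches a known item, only its continuations are allowed; otherwise it falls back to all tokens valid at the current index level, which is what permits unseen (zero-shot) items. All names and the toy token IDs are illustrative assumptions.

```python
# Hypothetical sketch: constraining LLM decoding over RQ-VAE semantic IDs.
# Each item is identified by a short tuple of codebook tokens (one per level).

def build_prefix_trie(index_sequences):
    """Map each prefix of a seen item's index sequence to its allowed next tokens."""
    trie = {}
    for seq in index_sequences:
        for i in range(len(seq)):
            trie.setdefault(tuple(seq[:i]), set()).add(seq[i])
    return trie

def make_prefix_allowed_tokens_fn(trie, level_tokens):
    """Return a function shaped like HF's prefix_allowed_tokens_fn.

    trie         -- prefix -> allowed next tokens for *seen* items
    level_tokens -- level index -> all codebook tokens valid at that level
                    (the per-step level restriction the maintainers describe)
    """
    def allowed(batch_id, generated_ids):
        prefix = tuple(generated_ids)
        if prefix in trie:
            # Strict mode for this prefix: only continuations of seen items.
            return sorted(trie[prefix])
        # Fallback: any token of the current level, so unseen items stay legal.
        level = len(prefix)
        return sorted(level_tokens.get(level, set()))
    return allowed
```

For example, with seen items `(1, 5)`, `(1, 7)`, `(2, 3)` and level-1 tokens `{3, 5, 7, 9}`, the prefix `[1]` permits only `[5, 7]`, while the unseen prefix `[3]` falls back to the full level-1 vocabulary; relaxing or tightening that fallback is one way to trade off seen versus unseen items.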
