Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Simplify zero-shot topic modeling to fix issues with downstream tasks #2

Open
ianrandman opened this issue May 22, 2024 · 0 comments
Open
Assignees

Comments

@ianrandman
Copy link
Collaborator

MaartenGr#1967

@ianrandman ianrandman self-assigned this May 22, 2024
ianrandman added a commit that referenced this issue May 28, 2024
- zero-shot topic modeling is now only the equivalent of a clustering step
  - removed implementation where this functionality is done through merging two models
  - all documents are used at once when calculating representations
  - probability comes from cosine similarity when zeroshot topics are used
- validate `nr_topics` with respect to how many zero-shot topics matched
- track `self._outliers` and `self.topic_labels_` using `@property`, as they are derivatives of other attributes
- validate existence of outliers before outlier reduction
ianrandman added a commit that referenced this issue Jun 18, 2024
…en reducing topics, keep only one zero-shot label, if any (#1, #2)
ianrandman added a commit that referenced this issue Jun 18, 2024
ianrandman added a commit that referenced this issue Jun 18, 2024
ianrandman added a commit that referenced this issue Jun 18, 2024
ianrandman added a commit that referenced this issue Jun 18, 2024
ianrandman added a commit that referenced this issue Jun 21, 2024
ianrandman added a commit that referenced this issue Jun 21, 2024
ianrandman added a commit that referenced this issue Jun 23, 2024
ianrandman added a commit that referenced this issue Jun 23, 2024
…strings, lower threshold zeroshot test, fix outliers for probabilities during zeroshot (#2)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

When branches are created from issues, their pull requests are automatically linked.

1 participant