Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

0.5.4 #112

Merged
merged 54 commits into from
Jun 15, 2023
Merged

0.5.4 #112

merged 54 commits into from
Jun 15, 2023

Conversation

yiwen-h
Copy link
Member

@yiwen-h yiwen-h commented Jun 15, 2023

New version has:

  • Improved test coverage - up to 47%
  • Attempted 2 layer sklearn model (targeting major categories before the minor categories), performed poorly
  • Improved train_test_splitting process so that all model types use the same split for the same random_seed for better comparability
  • Fixed how distilbert model handles categorical features using CategoryEncoder layer
  • Added analysis to bert pipeline so Excel performance and label files are also outputted
  • Added sentiment async endpoint to API
  • Updated params and dataset to v6 of framework

yiwen-h and others added 30 commits June 1, 2023 15:14
same output as predict_multilabel_sklearn - aim is to be able to use both for API. Also added tests
…d in preprocessing due to missing values in q_type
yiwen-h and others added 24 commits June 13, 2023 10:14
still basic - not using async
@yiwen-h yiwen-h requested a review from ChrisBeeley June 15, 2023 11:32
@yiwen-h yiwen-h merged commit 0b65e33 into main Jun 15, 2023
@yiwen-h yiwen-h deleted the 0.5.4 branch June 15, 2023 12:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant