Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

automatically detect labels and bins #61

Open
felixbur opened this issue Aug 30, 2023 · 1 comment
Open

automatically detect labels and bins #61

felixbur opened this issue Aug 30, 2023 · 1 comment
Assignees

Comments

@felixbur
Copy link
Owner

I guess it would be better if the labels would not need to be given explicitly but read from the datafile automatically.
I meant that the labels [anger, disgust, happy...] are already in the data. Currently you have to tell nkululeko which labels to use, but if you want all, that shouldn't be necessary

For regression I would define default binning
e.g. automatically assign three bins: (low, medium, high), and use the borders so they are equally distributed

@felixbur felixbur self-assigned this Aug 30, 2023
@bagustris
Copy link
Collaborator

I meant that the labels [anger, disgust, happy...] are already in the data.

Yes, this should be the default if no labels is given in [DATA] section in the INI file. If there is the option of labels in the DATA section, the labels should use the defined labels.

For regression, we should actually treat it as real regression, i.e., predicting continuous score. Let's use examples from iemocap, msp-improv and msp-podcasts datasets. the format of data usually is "file, valence, arousal, dominance, naturalness," where the last four columns from valence to naturalness are continuous scores. The output should be continuos score. In this case label is required (name of header to predict).

Binning can be added too to map between regression to classification and provide further analysis.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants