Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Tabular Data Classification #589

Closed
2 tasks done
knackerbrot opened this issue Apr 21, 2024 · 3 comments
Closed
2 tasks done

[BUG] Tabular Data Classification #589

knackerbrot opened this issue Apr 21, 2024 · 3 comments
Labels
bug Something isn't working

Comments

@knackerbrot
Copy link

Prerequisites

  • I have read the documentation.
  • I have checked other issues for similar problems.

Backend

Local

Interface Used

UI

CLI Command

No response

UI Screenshots & Parameters

image

Column mapping expanded: { "id": "id", "features": ["amount", "description", "day", "month", "year", "account"], "label": "category" }

csv looks like this
amount,description,day,month,year,account,category,id
-95,random description ,6,4,2023,15,48,1
-160,random description ,6,4,2023,15,72,2
-4.05,random description ,6,4,2023,15,28,3
-20,random description ,5,4,2023,3,44,4
-16,random description ,5,4,2023,3,16,5
-30,random description ,5,4,2023,15,29,6
-480,random description ,5,4,2023,15,28,7

Error Logs

INFO | 2024-04-21 18:30:41 | autotrain.app:handle_form:539 - Column mapping: {'id': 'id', 'features': ['amount', 'description', 'day', 'month', 'year', 'account'], 'label': 'category'}
ERROR: Exception in ASGI application
ValueError: c not in train data

Additional Information

I haven't been able to find how to properly develop column mapping and parameters for Tabular Data Classification

@knackerbrot knackerbrot added the bug Something isn't working label Apr 21, 2024
@abhishekkrthakur
Copy link
Member

all you need to do is specify the id and target column(s). you cannot use free text columns in tabular tasks. the data format is available here: https://huggingface.co/docs/autotrain/tabular

@knackerbrot
Copy link
Author

Ok thanks - I'm suprised a that autotrain is less capable and functional than it was 2 years ago

https://www.youtube.com/watch?v=OH_e0wOkpZc

@abhishekkrthakur
Copy link
Member

Its the same backend. you can have text but those will be treated as categories. There is no change in how we handle columns 🙂

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants