huggingface Dataset

Thank you for this repo which saved me time to do a quick analysis on CVPR24.
I would like to give back to the community and so I made a first draft of what paperlists could produce in Huggingface datasets format : https://huggingface.co/datasets/hunoutl/paperlists

It is a raw dataset. I kept most of the keys that I applied to all of the papers. Do you have an idea of ​​what would be possible to standardize everything?

For now I have a simple code for merging, I will try to find time to make it cleaner and share it.
I had made synthetic data for CVPR with the use of LLM to complete information and add new ones (country of belonging). I'm thinking of going over all the papers in the future.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

huggingface Dataset #2

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

huggingface Dataset #2

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions