Skip to content

huggingface Dataset #2

@hunoutl

Description

@hunoutl

Thank you for this repo which saved me time to do a quick analysis on CVPR24.
I would like to give back to the community and so I made a first draft of what paperlists could produce in Huggingface datasets format : https://huggingface.co/datasets/hunoutl/paperlists

It is a raw dataset. I kept most of the keys that I applied to all of the papers. Do you have an idea of ​​what would be possible to standardize everything?

For now I have a simple code for merging, I will try to find time to make it cleaner and share it.
I had made synthetic data for CVPR with the use of LLM to complete information and add new ones (country of belonging). I'm thinking of going over all the papers in the future.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions