Can the paper_id in MULTICITE be linked to the corresponding paper in S2ORC? #1

Open
HongJinTsai opened this issue Jul 13, 2021 · 7 comments

@HongJinTsai

Hi! Thanks for sharing such interesting work ;)
I wonder whether we can use the paper_id in this dataset to find the corresponding paper in S2ORC.
The full text or other information about the cited paper may be helpful for my work, so it would be great if I could use both datasets together.
Thanks :)

@jacklxc

jacklxc commented Jul 21, 2021

It seems that the paper IDs and sentence IDs are derived from the _pdf_hash attribute in the pdf_parse records of the S2ORC dataset. However, the exact rule is not easy to infer.

@kyleclo Can you also provide the mapping between the intent_id and the actual intent? Thank you.
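
For anyone who wants to explore the `_pdf_hash` observation above while waiting for an official mapping, here is a minimal sketch of building a lookup from `_pdf_hash` to the S2ORC `paper_id`. It assumes the pdf_parse data is stored as JSON Lines with `paper_id` and `_pdf_hash` fields on each record. The function name `index_pdf_hashes` and the sample records are made up for illustration, and the sketch does not show how MULTICITE IDs are derived from the hash, since that rule is not known in this thread.

```python
import json
from typing import Dict, Iterable


def index_pdf_hashes(jsonl_lines: Iterable[str]) -> Dict[str, str]:
    """Map each record's `_pdf_hash` to its S2ORC `paper_id`.

    Assumes one JSON object per line with `paper_id` and `_pdf_hash` keys.
    Records with a missing or null hash are skipped.
    """
    index: Dict[str, str] = {}
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        record = json.loads(line)
        pdf_hash = record.get("_pdf_hash")
        if pdf_hash:
            index[pdf_hash] = str(record["paper_id"])
    return index


# Made-up records for illustration only; real S2ORC records carry more fields.
sample_lines = [
    '{"paper_id": "111", "_pdf_hash": "abc123"}',
    '{"paper_id": "222", "_pdf_hash": null}',
]
index = index_pdf_hashes(sample_lines)
print(index)
```

With a real shard, the same function could be fed an open file handle (for example one returned by `gzip.open(path, "rt")`), and the resulting dictionary could then be compared against the MULTICITE IDs to test any hypothesis about how they were derived.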

@kyleclo
Collaborator

kyleclo commented Jul 21, 2021

Yes, sorry for the delay. I'm uploading a revised version of this shortly w/ the proper IDs / mappings. Thanks for catching this

@jacklxc

jacklxc commented Jul 30, 2021

@kyleclo This is a reminder to upload the dataset with the proper IDs. Thanks!

@afei8178

@kyleclo This is a reminder to upload the dataset with the proper IDs. Thanks!

@afei8178

@kyleclo I've been waiting. Thanks!

@pcchen-ntunlp

@kyleclo
How should the paper_id in full-v20210918.json be used?
Is there a mapping table now?

@ManasiPat

@kyleclo Could you please provide the mapping from the paper IDs in this dataset to the paper IDs in the S2ORC dataset?
