Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement classification for papers #12

Open
the-superpirate opened this issue Sep 25, 2023 · 0 comments
Open

Implement classification for papers #12

the-superpirate opened this issue Sep 25, 2023 · 0 comments
Labels
enhancement New feature or request good first issue Good for newcomers

Comments

@the-superpirate
Copy link
Contributor

the-superpirate commented Sep 25, 2023

Motivation

Classification of papers is essential task, it solves two tasks: creation of navigational menu in bot and web and also allows to cherry-pick papers on specific topic for mass processing.

The task suggests creation of classifier that takes publication metadata and derives a list of highly likely classes for the record.

Classification approach

https://www.frontiersin.org/articles/10.3389/frma.2023.1149834/full

This approach is described in the paper but have no any sources. One way can be reaching authors and requesting sources for kick-starting implementation

Technical description

What is needed: library that accepts paper description by the dict of the following format

authors: List[{first: str, given: str, name: str}]}
abstract?: str
content?: str
id: {dois: List[str]}
issued_at?: int
languages: List[str]
metadata?: {container_title?: str, publisher?: str}
tags?: List[str]
title: str

and returns SciNobo class for the paper. Fields are more precisely described in the schema. Consider all fields except title and abstract as absent most of times.

How to Start

pip install stc-geck
geck - documents

You will receive a stream of documents that is a subject of the task.

@the-superpirate the-superpirate added enhancement New feature or request good first issue Good for newcomers labels Sep 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request good first issue Good for newcomers
Projects
None yet
Development

No branches or pull requests

1 participant