Extend database schemas #90

jpwahle · 2022-08-18T07:25:03Z

Is your feature request related to a problem? Please describe.
Currently, everything is stored in the paper collection while the other schemas that were introduced in 2c59cba have not been used.
Because especially aggregate and group are expensive we want to avoid these steps by using the separate collections now.

Describe the solution you'd like
Each dashboard that requires aggregation, grouping, etc. should have a separate collection (e.g., authors, venues).
Also MongoDB should write data to the unused collections and map back to the paper objects.
For fast filtering, each collection should have the key filter elements (e.g., year, inCitationsCount, ...)
The solution should be backward compatible, so the paper collection should remain to be the same.

The text was updated successfully, but these errors were encountered:

jpwahle · 2022-11-03T09:21:54Z

One suggestion here is to switch to a MySQL / PostgreSQL database.

Pros:

Potentially much faster
Can be hosted by GWDG

Cons:

We have to touch all schemas
Normalizing data

jpwahle · 2022-12-01T10:59:04Z

We should also think about adding more data from FatCat and Internet Archive Scholar which export everything in PostgreSQL

jpwahle added the refactoring Pull Request: Refactoring code without logic change label Aug 18, 2022

jpwahle added this to Additional features in cs-insights via automation Aug 18, 2022

jpwahle mentioned this issue Aug 18, 2022

Improve performance of database queries #76

Open

jpwahle moved this from Additional features to Todo in cs-insights Aug 18, 2022

This was referenced Aug 18, 2022

Add caching to preRead, preUpdate, preCreate middleware. #91

Closed

Re-route requests of /fe/{route} to /{route} where possible jpwahle/cs-insights-frontend#92

Closed

Optimize queries #25

Closed

Precompute values #49

Closed

jpwahle moved this from Todo to Near Future in cs-insights Aug 22, 2022

jpwahle removed this from Near Future in cs-insights Aug 23, 2022

jpwahle added this to Backlog in cs-insights Sep 26, 2022

jpwahle removed this from Backlog in cs-insights Oct 12, 2022

jpwahle assigned muhammadtalha242 and jpwahle Sep 18, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend database schemas #90

Extend database schemas #90

jpwahle commented Aug 18, 2022

jpwahle commented Nov 3, 2022

jpwahle commented Dec 1, 2022

Extend database schemas #90

Extend database schemas #90

Comments

jpwahle commented Aug 18, 2022

jpwahle commented Nov 3, 2022

jpwahle commented Dec 1, 2022