Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Standardize events data #68

Open
lukaspetrasek opened this issue Apr 12, 2024 · 9 comments
Open

Standardize events data #68

lukaspetrasek opened this issue Apr 12, 2024 · 9 comments
Assignees

Comments

@lukaspetrasek
Copy link
Collaborator

DeRisk currently works with event data in a raw format where the relevant data (like the user, token, amount, etc.) is saved in a dictionary or list-like structure in one of the columns. To better extract information from the database, it would be helpful to have a unified data structure where all relevant pieces of information would be saved in a separate column. Thus, we could, e.g., easily query all events of the given type (deposit, withdrawal, liquidation) for the given user, lending protocol, etc.

The task is to take a sample of events (which will be provided on demand) and convert it to a standardized format that can be used to store information about any type of event and any lending protocol.

@lukaspetrasek lukaspetrasek added the enhancement New feature or request label Apr 12, 2024
@lukaspetrasek lukaspetrasek changed the title Events data standardization Standardize events data Apr 12, 2024
@fishonamos
Copy link

I will love to work on this @lukaspetrasek

@khayss
Copy link

khayss commented May 22, 2024

I'll love to handle this issue

@lukaspetrasek
Copy link
Collaborator Author

Hi, can you guys please tell me something about you, what skills/experience do you have and how do you plan to tackle this issue? This task is not simple, so I have to learn more information before I assign anyone 🙏🏼

@godekina
Copy link
Contributor

Hello I will love to work on this issue, @lukaspetrasek
I am a python developer with experience as a data scientist, I can create a function to organize the data in a csv file, then write code for retrieval

@lukaspetrasek
Copy link
Collaborator Author

Okay, assigning you @godekina 👍🏼

@godekina Let me know if everything is clear. If you have any questions, please ask here. What is you TG handler please? 🙏🏼

Consider joining our TG group.
See also our contributor guidelines.

@godekina
Copy link
Contributor

godekina commented May 29, 2024

Thanks @lukaspetrasek I'd like to have the sample data so I can work on writing code to standardize them.
my telegram handle is: @godekina
I have joined the Telegram group.

@lukaspetrasek
Copy link
Collaborator Author

Hi @godekina , you can find the sample e.g. for zkLend here: https://storage.googleapis.com/derisk-persistent-state/zklend_data/events_sample.parquet.

To obtain the sample for other protocols, just change the folder in the address, e.g. https://storage.googleapis.com/derisk-persistent-state/nostra_alpha_data/events_sample.parquet for Nostra Alpha.

@godekina
Copy link
Contributor

godekina commented Jun 1, 2024

@lukaspetrasek the pull request #123 for this feature is ready for review.

@lukaspetrasek
Copy link
Collaborator Author

Hi @godekina , 🔙

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants