Skip to content

ycbelgium/eth-data-science-course

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Learning Data Science on the Ethereum Blockchain with the Omniacs

Description

We want to share our joy of data science with the Ethereum eco-system through an informative set of online courses and case studies that merge data analysis, blockchain analytics and statistical programming.

We aspire to create a modular online course to help new blockchain developers understand the principles and best practices of data science. Using open data sources from across the Ethereum landscape (think StateoftheDApps, UniSwap, & Etherscan), the course will teach the topics of data munging, data visualization, exploratory data analysis, machine learning, and dabble in a bit of deep learning and artificial intelligence. As the course grows, we intend to create case studies around how to apply data science to specific DApps (think machine learning techniques for predicting markets with Numer.ai data or analyzing traits in CryptoKitties).

Our ultimate vision is to spur and inspire the next generation of developers through interesting applications of data science to emerging blockchain technologies.

Gitcoin Grant Link: https://gitcoin.co/grants/562/learning-data-science-on-the-ethereum-blockchain-

Module 1 - Basic Data Structures and Munging

Data Sources: CryptoPunks, Crypto Art Pulse

Module 2 - Statistical Graphics and Visualization

Data Sources: Omni Analytics Group, CryptoPunks

Module 3 - Supervised and Unsupervised Machine Learning

  • The ML workflow
  • Supervised learning for classification
  • Unsupervised learning for grouping
  • Forecasting what's next
  • Deep learning for sequences

Data Sources: Numerai Tournament Data, UniSwap

Module 4 - Case Studies

Data Sources: Beaconscan, Numerai Tournament Data, Haskmasks, Crypto Punks

Updates

Gitcoin Grant Round 8

Like everything else in the world, 2020 flipped our course development plans upside down. Instead of building the course from the bottom up, we chose to repurpose and refactor our Medalla research into motivating case studies on how to perform data analysis on Ethereum 2.0 blockchain data. The case studies walk through, in detail, how we performed the analysis that ultimately netted us a bronze prize. We also included a tutorial on how to use R Shiny as an exploration tool for understanding the Numer.ai dApp's tournament data.

Change log

  • Restructured the repository for clarity
  • Added 3 case studies

Gitcoin Grant Round 9

For Round 9 we've doubled down on our "case study first" approach to teaching data science using projects on the Ethereum blockchain as examples. This update includes a look at NFTs, stable coins, market capitalization estimation, and an introduction to GraphQL. We expanded two of our original case studies to include Python version, so if you are interested in learning more about that language you can check those out here and here. This update also includes our first attempt at creating video lectures for the eager learners who would like to dive deeper into the concepts. We intend to use funding from this round to further expand the set of case studies we produce and improve the quality of our video content.

Change log

  • Major update to the course aesthetics
  • Module 1 updated
  • 5 New case studies (with 2 more to be published during the active grant round)
  • First video lecture published to Youtube

Gitcoin Grant Round 10

This round update had us working with our first outside contributor. @Amelia188 gave our course a proper copy edit by fixing tons of typos, correcting grammar errors and improving the overall readability of the material. We look forward to her continued contributions and encourage others to reach out to us about opportunities to collaborate. To further expand out the base content for the course, we completed the material for the second module that focuses on statistical graphics. Other updates for this round include two NFT related case studies and a host of new video lectures. Your support for this round will help us expand our contributor pool and further improve on the quality of our content.

Change log

  • Tons of copy edits
  • 4 Video lectures for Module 1 have been published
  • Slides for Module 2 - Statistical Graphics and Visualization have been completed
  • 2 NFT related case studies were created and videos produced

Gitcoin Grant Round 11

We've been busy! Over the last quarter we've been working with DAOs to help them understand their data and it has been this work that inspired us to create two new case studies all about blockchain jobs using data from Cryptojobs. In addition to these two case studies, we've included another one on Yearn.Finance created by our newest collaborator @vintro. This update also includes 3 new videos to supplement the case studies. As usual, we really appreciate the support and contributions this round will help us find and compensate additional course contributors.

Change log

  • 3 Video lectures
  • 3 New case studies
  • Various copy edits

FAQ

Do you all have experience in this stuff?
Why yes, we do! Omni Analytics Group is a team of PhD level statistical consultants that have been teaching and solving difficult data science problems for nearly a decade. We are passionate about data science and blockchain technologies. Just check out our twitter.

Do I need any prior experience before taking this course?
Our intention is to start from the beginning and build up not only your data chops, but your statistical intuition and programming knowledge. At the end of these courses, you should be able to match a statistical technique to a blockchain data problem, write a basic script to analyze it and confidently search online for more advanced knowledge.

What programming languages will the course focus on?
We'll initially focus on the statistical language R, but then expand to Python. As the course grows, we hope to include examples with contracts written in Solidity.

Can I request a topic?
Sure! Once we flesh out the initial course material. If funding persists, we'd be more than happy to take suggestions on case studies or topics.

Testimonials

"This course is like a rain following a drought. It kindly walks you through the process starting from the use of R to introduction of graphs and machine learning concepts with interesting case studies. I strongly recommend it not only to researchers interested in Ethereum blockchain but also to any students or professionals that have interest in learning data analysis and science." - Will Shin (Principal Economist at Klaytn)

Featured Projects

About

Learning Data Science on the Ethereum Blockchain with @omnianalytics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 63.9%
  • Jupyter Notebook 35.2%
  • Other 0.9%