
In this project we scrape data on the 50 richest athletes using the R language.


MunimAhmed/Web-Scrapping-using-R


Web-Scrapping-using-R

In this project we scrape data on the 50 richest athletes using the R language.

Overview

We use R to scrape data on the 50 richest athletes from this website. We first install the following libraries:

  • rvest - helps to scrape (or harvest) data from web pages.
  • stringr - provides a cohesive set of functions designed to make working with strings as easy as possible.
  • tibble - helps to create data frames.
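
The setup step can be sketched as follows. This is a minimal sketch, not the project's actual script: the variable names `pkgs` and `to_install` are illustrative, and installing any missing package assumes network access to a CRAN mirror.

```r
# Packages named in the list above.
pkgs <- c("rvest", "stringr", "tibble")

# Find the packages that are not installed yet (illustrative helper variable).
to_install <- pkgs[!vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)]

# Install only what is missing; this needs a CRAN mirror and network access.
if (length(to_install) > 0) install.packages(to_install)

# Attach the packages for the rest of the script.
invisible(lapply(pkgs, library, character.only = TRUE))
```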

Then we loop over the multiple pages, parse each page's HTML into an XML document with the read_html() function, and store the results in a data frame.
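
A minimal sketch of such a loop is shown here. The README does not list the page URLs, so the `pages` vector holds literal HTML strings as stand-ins (read_html() accepts literal markup as well as URLs). The `name` class and the variable names are assumptions for illustration, not the real site's markup.

```r
library(rvest)

# Stand-ins for the real page URLs, which are not given in this README.
# In a real run each element would be the URL of one results page.
pages <- c(
  "<html><body><h2 class='name'>Athlete A</h2></body></html>",
  "<html><body><h2 class='name'>Athlete B</h2></body></html>"
)

# Parse every page and keep the resulting XML documents in a list.
docs <- vector("list", length(pages))
for (i in seq_along(pages)) {
  docs[[i]] <- read_html(pages[[i]])
}
```

Each element of `docs` can then be queried with rvest selectors in the extraction steps.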


Extracting names
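
The original code for this step was only published as a screenshot, so the following is a hedged sketch rather than the author's code. It assumes each name sits in an element with a hypothetical `name` class and uses a small inline page built with minimal_html() so it runs offline.

```r
library(rvest)

# Tiny offline page with made-up entries; the 'name' class is an assumption.
page <- minimal_html("
  <div class='athlete'><h2 class='name'>Athlete A</h2></div>
  <div class='athlete'><h2 class='name'>Athlete B</h2></div>
")

# Select every matching element and pull out its text.
athlete_names <- page %>%
  html_elements(".name") %>%
  html_text2()
```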


Extracting net worth
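
One possible way to turn net worth text into numbers is sketched here. The text format ("Net Worth: $2.1 Billion"), the `net-worth` class, and the figures themselves are made-up assumptions; the real page may format this field differently.

```r
library(rvest)
library(stringr)

# Offline sample with made-up figures; the 'net-worth' class is an assumption.
page <- minimal_html("
  <p class='net-worth'>Net Worth: $2.1 Billion</p>
  <p class='net-worth'>Net Worth: $800 Million</p>
")

raw_worth <- page %>%
  html_elements(".net-worth") %>%
  html_text2()

# Pull out the numeric part and the unit, then convert to US dollars.
amount <- as.numeric(str_extract(raw_worth, "[0-9]+(\\.[0-9]+)?"))
unit <- str_extract(raw_worth, "Billion|Million")
net_worth_usd <- amount * ifelse(unit == "Billion", 1e9, 1e6)
```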


Extracting Age
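
As a sketch of this step, assuming the age appears as text such as "Age: 61" inside an element with a hypothetical `age` class, the digits can be extracted with stringr and converted to integers. The sample values are made up.

```r
library(rvest)
library(stringr)

# Offline sample with made-up ages; the 'age' class is an assumption.
page <- minimal_html("
  <p class='age'>Age: 61</p>
  <p class='age'>Age: 37</p>
")

raw_age <- page %>%
  html_elements(".age") %>%
  html_text2()

# Keep only the first run of digits and convert it to an integer.
age <- as.integer(str_extract(raw_age, "[0-9]+"))
```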


Extracting Country
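
A comparable sketch for the country field, assuming text such as "Country: United States" inside an element with a hypothetical `country` class. The label is stripped and surrounding whitespace trimmed; the sample values are made up.

```r
library(rvest)
library(stringr)

# Offline sample with made-up values; the 'country' class is an assumption.
page <- minimal_html("
  <p class='country'>Country: United States</p>
  <p class='country'>Country:  Portugal </p>
")

raw_country <- page %>%
  html_elements(".country") %>%
  html_text2()

# Drop the leading label, then trim leftover whitespace.
country <- str_trim(str_remove(raw_country, "^Country:"))
```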


Combining all the lists into a data frame and saving it to a CSV file
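
A minimal sketch of this final step, using short made-up vectors in place of the scraped ones. The column names and the output file name `top_athletes.csv` are illustrative, and the file is written to a temporary directory so the example has no side effects.

```r
library(tibble)

# Made-up stand-ins for the vectors produced by the extraction steps.
athlete_names <- c("Athlete A", "Athlete B")
net_worth_usd <- c(2.1e9, 8e8)
age <- c(61L, 37L)
country <- c("United States", "Portugal")

# Combine the equal-length vectors into one data frame.
athletes <- tibble(
  name = athlete_names,
  net_worth_usd = net_worth_usd,
  age = age,
  country = country
)

# Save the data frame as CSV (illustrative path in the session temp directory).
out_file <- file.path(tempdir(), "top_athletes.csv")
write.csv(athletes, out_file, row.names = FALSE)
```

Because tibble() only recycles length-one inputs, a mismatch in vector lengths raises an error here, which is a useful check that every field was extracted for every athlete.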


Conclusion

After following all of these steps, the scraped data is available in this CSV file - top athletes
