In this project we have scrapped the top 50 richest athletes data using R language.
We have used R language to do web scraping for top richest 50 athletes. We scrap the data from this website. We have installed some libraries first. These are:
- rvest - helps to scrape (or harvest) data from web pages.
- stringr - provides a cohesive set of functions designed to make working with strings as easy as possible
- tibble - helps to create create data frames
Then we create a loop to scrap multiple pages and convert the html format into xml using read_html() function and put them into a dataframe
After all the steps followed we finally able to find the scrapped data into this csv file - top athletes