Scrapping NASA website with BeautifulSoup , Data wrangling with pandas , Data storage and App creation with MongoDB and Flask API
Description : This project aims at building a web application that scrapes various websites for data related to The Mission to Mars and displays the information in a single HTML page.
- Scrape the Mars News Site and collect the latest News Title and Paragraph Text. Assign the text to variables that you can reference later.
- Using splinter navigate through JPL Mars Space Images - Featured Image and find the image url for the current Featured Mars Image.
- Visit the Mars Facts webpage and use Pandas to scrape the table containing facts about the planet including Diameter, Mass, etc. and use Pandas to convert the data to a HTML table string.
- Visit the astrology site to obtain high resolution images for each of Mar's hemispheres and create a dictionary.
- Use MongoDB with Flask templating to create a new HTML page that displays all of the information that was scraped from the URLs above.
- convert your Jupyter notebook into a Python script with a function called scrape that returns one Python dictionary containing all of the scraped data.
- create a route called /scrape that will import your scrape_mars.py script and call your scrape function.
- Store the return value in Mongo as a Python dictionary.
- Create a root route / that will query your Mongo database and pass the mars data into an HTML template to display the data.
- Create a template HTML file called index.html that will take the mars data dictionary and display all of the data in the appropriate HTML elements.
- Jupyter Notebook
- Visual Studio code editor
- pandas
- flask/jinja
- Web Scraping libraries
- Splinter
- Requests
- BeautifulSoup4
- webdriver_manager
- Git clone this repository
- Open Flask app.py in Visual code editor
- Execute the code to launch chrome browser containing scraped data from NASA's Mars sites.
Project Complete
- UTSA BootCamp