# Module 12 Challenge
## Deliverable 1: Scrape Titles and Preview Text from Mars News

In [1]:
# Import Splinter and BeautifulSoup
from splinter import Browser
from bs4 import BeautifulSoup as soup
from webdriver_manager.chrome import ChromeDriverManager
import pandas as pd
import pprint

In [2]:
executable_path = {'executable_path': ChromeDriverManager().install()}
browser = Browser('chrome', **executable_path, headless=False)

### Step 1: Visit the Website

1. Use automated browsing to visit the [Mars NASA news site](https://redplanetscience.com). Inspect the page to identify which elements to scrape.

      > **Hint** To identify which elements to scrape, you might want to inspect the page by using Chrome DevTools.

In [3]:
# Visit the Mars NASA news site: https://redplanetscience.com
url = 'https://redplanetscience.com'

browser.visit(url)

### Step 2: Scrape the Website

Create a Beautiful Soup object and use it to extract text elements from the website.
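
As a minimal sketch of this step, the snippet below parses a small hand-written HTML string instead of a live page. The string `sample_html` and its placeholder titles are assumptions that stand in for `browser.html`; only the class names (`list_text`, `content_title`, `article_teaser_body`) come from the site's markup.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in for browser.html; only the class names mirror the real page
sample_html = """
<div class="list_text">
  <div class="content_title">Example title A</div>
  <div class="article_teaser_body">Example preview A.</div>
</div>
<div class="list_text">
  <div class="content_title">Example title B</div>
  <div class="article_teaser_body">Example preview B.</div>
</div>
"""

# Parse the HTML with Python's built-in parser
sample_soup = BeautifulSoup(sample_html, 'html.parser')

# find_all returns every matching element; find returns only the first match
articles = sample_soup.find_all('div', class_='list_text')
first_title = articles[0].find('div', class_='content_title').get_text(strip=True)

print(len(articles), first_title)
```

Searching inside each `list_text` container, rather than across the whole page, keeps each title next to its own preview.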

In [4]:
# Create a Beautiful Soup object
html = browser.html

mars_soup = soup(html, 'html.parser')

In [5]:
# Extract all the text elements
text_elements = mars_soup.find_all('div', class_='list_text')
text_elements

[<div class="list_text">
 <div class="list_date">December 22, 2022</div>
 <div class="content_title">NASA to Reveal Name of Its Next Mars Rover</div>
 <div class="article_teaser_body">After a months-long contest among students to name NASA's newest Mars rover, the agency will reveal the winning name — and the winning student — this Thursday. </div>
 </div>,
 <div class="list_text">
 <div class="list_date">December 21, 2022</div>
 <div class="content_title">Sensors on Mars 2020 Spacecraft Answer Long-Distance Call From Earth</div>
 <div class="article_teaser_body">Instruments tailored to collect data during the descent of NASA's next rover through the Red Planet's atmosphere have been checked in flight.</div>
 </div>,
 <div class="list_text">
 <div class="list_date">December 20, 2022</div>
 <div class="content_title">Screening Soon: 'The Pathfinders' Trains Lens on Mars</div>
 <div class="article_teaser_body">With the Mars 2020 mission ramping up, the documentary — the first of four abo

### Step 3: Store the Results

Extract the titles and preview text of the news articles that you scraped. Store the scraping results in Python data structures as follows:

* Store each title-and-preview pair in a Python dictionary. Give each dictionary two keys: `title` and `preview`. For example:

  ```python
  {'title': "Mars Rover Begins Mission!",
   'preview': "NASA's Mars Rover begins a multiyear mission to collect data about the little-explored planet."}
  ```

* Store all the dictionaries in a Python list.

* Print the list in your notebook.
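
The target data structure can be sketched with plain Python, independent of any scraping. The two input lists below are hypothetical placeholders for scraped strings, and pairing them with `zip` is only one possible approach; it assumes both lists have the same length and order.

```python
# Hypothetical sample data standing in for scraped titles and previews
titles = ['Example title A', 'Example title B']
previews = ['Example preview A.', 'Example preview B.']

# Build one dictionary per article and collect the dictionaries in a list
sample_news = [
    {'title': title, 'preview': preview}
    for title, preview in zip(titles, previews)
]

print(sample_news)
```

Reading both fields from the same article container, as the loop later in this notebook does, avoids relying on two separate lists lining up.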

In [6]:
# Extract and print the title of each article
for title in mars_soup.find_all('div', class_='content_title'):
    print(title.text)


NASA to Reveal Name of Its Next Mars Rover
Sensors on Mars 2020 Spacecraft Answer Long-Distance Call From Earth
Screening Soon: 'The Pathfinders' Trains Lens on Mars
NASA's New Mars Rover Is Ready for Space Lasers
NASA's Treasure Map for Water Ice on Mars
NASA's Perseverance Mars Rover Gets Its Wheels and Air Brakes
NASA's Perseverance Rover Will Peer Beneath Mars' Surface 
NASA Establishes Board to Initially Review Mars Sample Return Plans
NASA's Curiosity Keeps Rolling As Team Operates Rover From Home
NASA's Mars 2020 Rover Completes Its First Drive
10.9 Million Names Now Aboard NASA's Perseverance Mars Rover
NASA-JPL Names 'Rolling Stones Rock' on Mars
Heat and Dust Help Launch Martian Water Into Space, Scientists Find
Join NASA for the Launch of the Mars 2020 Perseverance Rover
NASA's Perseverance Rover Will Look at Mars Through These 'Eyes'


In [7]:
# Extract and print the preview text of each article
for preview in mars_soup.find_all('div', class_='article_teaser_body'):
    print(preview.text)

After a months-long contest among students to name NASA's newest Mars rover, the agency will reveal the winning name — and the winning student — this Thursday. 
Instruments tailored to collect data during the descent of NASA's next rover through the Red Planet's atmosphere have been checked in flight.
With the Mars 2020 mission ramping up, the documentary — the first of four about past JPL missions to the Red Planet to be shown at Caltech — tells a gripping backstory.
Perseverance is one of a few Mars spacecraft carrying laser retroreflectors. The devices could provide new science and safer Mars landings in the future.
A new study identifies frozen water just below the Martian surface, where astronauts could easily dig it up.
After the rover was shipped from JPL to Kennedy Space Center, the team is getting closer to finalizing the spacecraft for launch later this summer.
The agency's newest rover will use the first ground-penetrating radar instrument on the Martian surface to help sear

In [12]:
# Create an empty list to store the dictionaries
mars_news = []

result = mars_soup.find_all('div', class_='list_text')

In [13]:
# Loop through the text elements
# Extract the title and preview text from the elements
# Store each title and preview pair in a dictionary
# Add the dictionary to the list

for article in result:
    title = article.find('div', class_='content_title').text
    preview = article.find('div', class_='article_teaser_body').text
    # Store the pair under the two keys the assignment specifies
    mars_news.append({'title': title, 'preview': preview})


In [14]:
# Print the list to confirm success
print(mars_news)

[{'title': 'NASA to Reveal Name of Its Next Mars Rover', 'preview': "After a months-long contest among students to name NASA's newest Mars rover, the agency will reveal the winning name — and the winning student — this Thursday. "}, {'title': 'Sensors on Mars 2020 Spacecraft Answer Long-Distance Call From Earth', 'preview': "Instruments tailored to collect data during the descent of NASA's next rover through the Red Planet's atmosphere have been checked in flight."}, {'title': "Screening Soon: 'The Pathfinders' Trains Lens on Mars", 'preview': 'With the Mars 2020 mission ramping up, the documentary — the first of four about past JPL missions to the Red Planet to be shown at Caltech — tells a gripping backstory.'}, {'title': "NASA's New Mars Rover Is Ready for Space Lasers", 'preview': 'Perseverance is one of a few Mars spacecraft carrying laser retroreflectors. The devices could provide new science and safer Mars landings in the future.'}, {'title': "NASA's Treasure Map for Water Ice o

In [16]:
browser.quit()

### (Optional) Step 4: Export the Data

Optionally, store the scraped data in a file or database (to ease sharing the data with others). To do so, export the scraped data to either a JSON file or a MongoDB database.
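
A JSON export could look like the sketch below. The sample list, the file name `mars_news.json`, and the temporary directory are assumptions for illustration; in the notebook, the list to export would be `mars_news`.

```python
import json
import os
import tempfile

# Hypothetical sample standing in for the mars_news list
sample_news = [{'title': 'Example title', 'preview': 'Example preview.'}]

# Write to a temporary directory; the file name is an assumption
output_path = os.path.join(tempfile.mkdtemp(), 'mars_news.json')
with open(output_path, 'w', encoding='utf-8') as f:
    # ensure_ascii=False keeps non-ASCII characters readable in the file
    json.dump(sample_news, f, indent=4, ensure_ascii=False)

# Read the file back to confirm the round trip
with open(output_path, encoding='utf-8') as f:
    loaded = json.load(f)

print(loaded == sample_news)
```

Note that the standard-library module name is lowercase `json`; Python module names are case-sensitive.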

In [None]:
# Export data to JSON
import json


In [None]:
# Export data to MongoDB
