boyerjason700/Mission_to_Mars

Mission_to_Mars

Overview

Automate a browser to scrape multiple sites for data and images (Beautiful Soup, Splinter), store the extracted information in a NoSQL database (MongoDB), and use Flask to build a web application that displays the data and images.

Languages and tools used:

HTML

Python

Bootstrap

MongoDB

Flask

Visual Studio Code


  • Pandas
  • Beautiful Soup
  • Splinter
  • PyMongo

Sites used:

NASA Mars News

Space Images

Mars Facts

Mars Hemisphere

Process

  1. Scraping.py
    • Beautiful Soup and Splinter: drill down into the HTML tags to extract the title and summary of the most recent news article from the NASA Mars News site
    • Beautiful Soup and Splinter: drill down into the HTML tags to extract the most recent image from the Space Images site
    • Pandas: read the Mars/Earth table from the Mars Facts site into a DataFrame, then convert it to HTML
    • Beautiful Soup and Splinter: drill down into the HTML tags to extract all four full-size hemisphere images from the Mars Hemisphere site
    • Store the scraped data in a MongoDB database
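The news-scraping and storage steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the `div` class names, the sample HTML, and the function names `latest_news` and `save_mars_data` are all assumptions made for the example. In the real script the HTML would come from the Splinter browser's page source rather than a hard-coded string.

```python
from bs4 import BeautifulSoup

# Stand-in for the page source that Splinter would return.
# The tag and class names are assumptions, not taken from the real site.
SAMPLE_HTML = """
<div class="list_text">
  <div class="content_title">Newest headline</div>
  <div class="article_teaser_body">Newest summary.</div>
</div>
<div class="list_text">
  <div class="content_title">Older headline</div>
  <div class="article_teaser_body">Older summary.</div>
</div>
"""


def latest_news(html):
    """Return (title, summary) for the first article on the page, or (None, None)."""
    soup = BeautifulSoup(html, "html.parser")
    # The first matching container is assumed to be the most recent article.
    article = soup.find("div", class_="list_text")
    if article is None:
        return None, None
    title = article.find("div", class_="content_title")
    summary = article.find("div", class_="article_teaser_body")
    if title is None or summary is None:
        return None, None
    return title.get_text(strip=True), summary.get_text(strip=True)


def save_mars_data(collection, data):
    """Upsert the scraped dict so the collection keeps a single current document."""
    # `collection` is expected to be a PyMongo collection object.
    collection.update_one({}, {"$set": data}, upsert=True)


title, summary = latest_news(SAMPLE_HTML)
```

Returning `(None, None)` when the expected tags are missing lets the caller decide what to display if a site changes its markup, instead of failing mid-scrape.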

  2. App.py
    • Create a Flask app that connects to MongoDB and serves the web page
  3. Index.html
    • Create containers to display the news article, images, and table
    • Create a 'Scrape New Data' button that runs the scraping function
    • Refactor the layout so the displayed data is responsive across device sizes
    • Bootstrap: customize the web page's appearance (background color, jumbotron image fill, button color, text color)

Summary

Our Flask app displays the most recent article scraped from the NASA Mars News site, the latest image from the Space Images site, and a table from the Mars Facts site. The four hemisphere images scraped from the Mars Hemisphere site are displayed at the bottom of the page. The layout adapts to desktops, tablets, and phones.

[Screenshot: the web application running at 127.0.0.1:5000]

Contact

E-mail: boyerjason700@gmail.com
