gitchaell/computer-scrapping

Computer Scrapping

Tool that extracts data from the pages of companies that sell computers in the city of Trujillo - Peru, exports them in an XLSX file according to a relational data model, and displays them on a Power BI dashboard.

Objectives

  • Apply web scraping techniques to the selected websites to obtain the data (at least 3 websites and 2 scraping techniques).
  • Build a Power BI dashboard that presents a dynamic analysis of the data.

Steps

1. Search for companies that sell computers in Trujillo

2. Design the data model

[Figure: data model diagram]

The data model is also available as a downloadable PDF file.
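
To illustrate what exporting "according to a relational data model" can mean in practice, the sketch below splits flat scraped records into separate tables linked by numeric ids. The actual entities of the project's data model are not listed here, so the `stores`, `brands`, and `products` tables, their fields, and the sample records are all hypothetical.

```javascript
// Hypothetical flat records, as a scraper might emit them (one object per listing).
const scraped = [
  { store: 'Store A', name: 'Laptop X', brand: 'Lenovo', price: 2499 },
  { store: 'Store A', name: 'Laptop Y', brand: 'HP', price: 3199 },
  { store: 'Store B', name: 'Laptop X', brand: 'Lenovo', price: 2550 },
];

// Return the id of `value` in a lookup table, adding a new row when it is missing.
function upsert(table, index, value) {
  if (!index.has(value)) {
    const id = table.length + 1;
    table.push({ id, name: value });
    index.set(value, id);
  }
  return index.get(value);
}

// Split flat records into three tables linked by foreign keys (assumed schema).
function normalize(records) {
  const stores = [];
  const brands = [];
  const products = [];
  const storeIndex = new Map();
  const brandIndex = new Map();

  for (const record of records) {
    products.push({
      id: products.length + 1,
      storeId: upsert(stores, storeIndex, record.store),
      brandId: upsert(brands, brandIndex, record.brand),
      name: record.name,
      price: record.price,
    });
  }
  return { stores, brands, products };
}

const tables = normalize(scraped);
console.log(tables.stores.length, tables.brands.length, tables.products.length);
```

Each table could then be written to its own worksheet, so that Power BI can relate them through the `storeId` and `brandId` columns.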

3. Search for tools for data extraction

  • Node.js - JavaScript runtime
  • Puppeteer - Browser automation tool for loading and manipulating web pages
  • Cheerio - jQuery-like tool for querying HTML
  • ExcelJS - Tool for exporting the extracted data to an XLSX file
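
The tools listed above could fit together roughly as follows. This is a minimal sketch, not the project's actual code. It assumes the `puppeteer`, `cheerio`, and `exceljs` packages are installed, and the CSS selectors (`.product`, `.product-name`, `.product-price`), the column layout, and the output file name are hypothetical. The libraries are required inside each function so that the pure helper `parsePrice` can be loaded without them.

```javascript
// Convert a price label such as "S/ 2,499.00" to a number (null if no digits).
function parsePrice(text) {
  const match = String(text).match(/\d[\d,]*(?:\.\d+)?/);
  return match ? Number.parseFloat(match[0].replace(/,/g, '')) : null;
}

// Technique 1: query HTML that is already in hand with Cheerio.
// The selectors are hypothetical; each site needs its own.
function parseProducts(html) {
  const cheerio = require('cheerio');
  const $ = cheerio.load(html);
  return $('.product')
    .map((_, el) => ({
      name: $(el).find('.product-name').text().trim(),
      price: parsePrice($(el).find('.product-price').text()),
    }))
    .get();
}

// Technique 2: let Puppeteer render a JavaScript-driven page first,
// then return the resulting HTML.
async function fetchRenderedHtml(url) {
  const puppeteer = require('puppeteer');
  const browser = await puppeteer.launch();
  try {
    const page = await browser.newPage();
    await page.goto(url, { waitUntil: 'networkidle2' });
    return await page.content();
  } finally {
    await browser.close();
  }
}

// Write the extracted rows to a worksheet with ExcelJS.
async function exportToXlsx(products, filePath) {
  const ExcelJS = require('exceljs');
  const workbook = new ExcelJS.Workbook();
  const sheet = workbook.addWorksheet('Products');
  sheet.columns = [
    { header: 'Name', key: 'name', width: 40 },
    { header: 'Price', key: 'price', width: 12 },
  ];
  sheet.addRows(products);
  await workbook.xlsx.writeFile(filePath);
}

// Example pipeline; call main() with a real listing URL to run it.
async function main(url) {
  const html = await fetchRenderedHtml(url);
  await exportToXlsx(parseProducts(html), 'products.xlsx');
}
```

For pages that serve complete HTML without client-side rendering, `parseProducts` can be applied directly to the downloaded markup, which avoids starting a browser.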
