Skip to content

A data ingestion pipeline comprising etl and analyses

Notifications You must be signed in to change notification settings

evans-nyang/jumia-scrapy

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

94 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Overview

A data ingestion pipeline incorporating Extract Load Transform connecting data producers and consumers.

Prerequisites

  • Python >= 3.6
  • Docker >= 19.03.0
  • Docker Compose >= 1.25.0
  • Terraform >= 1.2.0
  • AWS account credentials configured

Project Structure

To view project structure in terminal, run the following command:

cat structure.ini

Getting Started

To set up the project, clone the repository:

git clone https://github.com/evans-nyang/jumia-scrapy.git

Change directory to jumiascraper

cd jumiascraper

Contributing

Contributions are welcome! If you find any issues or have suggestions for improvements, please open an issue or submit a pull request. Feel free to update the content based on your project's specifics, including adding information about the website you are scraping, additional features, or any other relevant details.

License

This project is licensed under the Apache License 2.0.

Please note that this configuration assumes you have AWS account credentials properly configured and have the necessary permissions to create and manage AWS resources.

For more information on Docker, Dbt , Terraform and AWS, refer to their official documentation:

Releases

No releases published

Packages

No packages published

Languages