Skip to content

BU-CS-CE-528-2017/Data-Science-in-the-Cloud

Repository files navigation

Data Science in the Cloud

This project was conceived by a Northeastern University Data Science Professor and developed by four Boston University students for the class EC528 Cloud Computing. Typically when one works on a data science project, the data is given and the researcher is tasked with building the best models to address the problem. The final outcome can be evaluated through an evaluation critereon, but the methods, tools, packages, and libraries can be varied. This is especially challenging for Professors who must grade their students on their methods. To address this problem, this project utilizes Docker to allow the sharing of the environments used to build these data science models.

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes.

Prerequisites

Python 2.7, Django 1.10.5, Docker

apt-get install python2.7
apt-get install python-pip
pip install django==1.10.5

To install Docker we refer you to their documentation: Get Docker

Installing

A step by step series of examples that tell you have to get a development env running

Pull the repository to your local machine

git pull https://github.com/BU-CS-CE-528-2017/Data-Science-in-the-Cloud

To run the website, navigate to the webApp folder and run manage.py

cd ~/your/path/Data-Science-in-the-Cloud/webApp
python manage.py runserver 8000

This will run the website on the localhost, which you can navigate to by going to this url in your web browser

http://localhost:8000/

Deployment

This can be deployed on an Apache server utilizing the mod_wsgi module. This is how we did our installation, specific to Apache 2.4.7 and mod_wsgi 4.5.15 and under the assumption there are no other websites to be hosted.

First, install Apache 2.4.7 and other needed packages:

apt-get install apache2
apt-get install libapache2-mod-python
apt-get install apache2-dev

Once doing so, load the environmental variables so Apache can run doing source /etc/apache2/envvars.

Then follow the instructions to install mod_wsgi.

If it doesn't exist already, on the server create a folder /var/www/html/webApp where webApp contains all of the files that are in /webApp of this repository.

Then do:

cd /var/www/html/webApp
mv 000-default.conf /etc/apache2/sites-available

sudo service apache2 restart

The site should now be running on port 80 and thus be accessible from a browser by inputting the public floating IP.

If you ever change the code and want the site to update, you can do so from touching the wsgi.py file.

touch /var/www/html/webApp/webApp/wsgi.py

User Portal GUI

To update image, either have the .tar or the image loaded. In the User Portal folder, place any files you may want to add to the image (and the tar itself if you don't add it). Then run python addFiles.py to launch the GUI.

Built With

  • Django - The web framework used
  • Bootstrap - The front-end framework used
  • Docker - Primary technology used

Contributing

Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.

Authors

See also the list of contributors who participated in this project.

License

Acknowledgments

  • A big thank you to our mentor Srikanth Krishnamurthy - sriboston
  • Another equally big thank you to our instructors:
    • Orran Krieger - okrieg
    • Ata Turk - ataturk
    • Peter Desnoyers
    • Michael Daitzman
  • Thank you to our classmates who provided us with feedback and kept us on our toes
  • Thank you to PurpleBooth for this README template