This repository has been archived by the owner on Jul 11, 2018. It is now read-only.
Group repo for the project Large datasets for scientific applications. Spring 2016

danieleliassen/LDSA2016

LDSA2016

Group project for the course Large Datasets for Scientific Applications, Spring 2016.

Prerequisites

  • 2+ instances on an OpenStack-based cloud system (preferably running Ubuntu 14.X)
  • pysam
  • Apache Spark
  • python-swiftclient
  • python-keystoneclient
  • Swift username, password, and api_url exported as environment variables
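
As a minimal sketch of how the exported Swift credentials might be picked up from Python: the variable names `SWIFT_USERNAME`, `SWIFT_PASSWORD`, and `SWIFT_API_URL` and the helper `load_swift_credentials` are assumptions made for illustration, since this README does not say what the exported variables are called or how `main.py` reads them.

```python
import os

# Hypothetical variable names: the README only says that a username,
# password and api_url are exported, not what the variables are called.
REQUIRED_VARS = ("SWIFT_USERNAME", "SWIFT_PASSWORD", "SWIFT_API_URL")


def load_swift_credentials(env=None):
    """Return the Swift credentials found in the environment as a dict.

    Raises KeyError listing every variable that is missing or empty,
    so a misconfigured node fails before any job is submitted.
    """
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise KeyError("missing exported variables: " + ", ".join(missing))
    return {
        "user": env["SWIFT_USERNAME"],
        "key": env["SWIFT_PASSWORD"],
        "authurl": env["SWIFT_API_URL"],
    }
```

The dictionary keys mirror the `authurl`, `user`, and `key` keyword arguments of `swiftclient.client.Connection`, so the result could be passed on as `Connection(**creds)`. Depending on the Keystone setup, extra options such as `auth_version` or a tenant name may also be needed.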

Running it

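The commands below assume that you have already set up a Spark cluster and installed the prerequisites. The master URL (spark://pmo:7077) and the memory settings appear to reflect the authors' own cluster, so adjust them to match yours.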

git clone https://github.com/adamruul/LDSA2016.git
sudo ./spark-submit --master spark://pmo:7077 --driver-memory 6g --executor-memory 2g ~/LDSA2016/main.py
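
Before submitting the job, it can help to confirm that the Python prerequisites are importable on each node. The following is a small sketch, not part of the repository: the helper name `missing_modules` is hypothetical, and the module names are the usual import names for the packages listed under Prerequisites.

```python
import importlib.util

# Import names for the packages listed under Prerequisites
# (python-swiftclient and python-keystoneclient install as
# "swiftclient" and "keystoneclient").
PREREQUISITES = ["pysam", "pyspark", "swiftclient", "keystoneclient"]


def missing_modules(names):
    """Return the subset of top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(PREREQUISITES)
    if missing:
        print("Missing prerequisites: " + ", ".join(missing))
    else:
        print("All prerequisites found.")
```

Note that `pyspark` is only importable if Spark's Python directory is on the Python path. When running through `spark-submit` this is handled for you, so a missing `pyspark` in a plain interpreter is not necessarily a problem.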

Authors

  • Daniel Eliassen
  • Octave Mariotti
  • Adam Ruul
  • Marcus Windmark
