Spark Notebook was originally forked from the amazing scala-notebook and has since been almost entirely refactored for massive dataset analysis using Apache Spark.
The tool allows performing reproducible analysis with Scala, Apache Spark and more.
This is achieved through an interactive web-based editor that can combine Scala code, SQL queries, Markup or even JavaScript in a collaborative manner.
Spark is available out of the box and is accessed through the variable sparkContext.
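As a minimal sketch of what this looks like in practice, the cell below assumes it is run inside a Spark Notebook session, where sparkContext is already defined. The data and the value names (numbers, evenSquares, total) are made up for illustration and are not part of the project.

```scala
// Assumption: this runs in a Spark Notebook cell, so `sparkContext` already exists.

// Distribute the integers 1 to 100 as an RDD.
val numbers = sparkContext.parallelize(1 to 100)

// Keep the even numbers and square them. Transformations are lazy,
// so nothing is computed yet.
val evenSquares = numbers.filter(_ % 2 == 0).map(n => n.toLong * n)

// `reduce` is an action: it triggers the computation and returns a Long.
val total = evenSquares.reduce(_ + _)

println(s"Sum of even squares from 1 to 100: $total")
```

Outside a notebook the same code would need a SparkContext created by hand, which is the setup step the notebook takes care of.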
Want to try out Spark Notebook? Follow these steps:
- Go to spark-notebook.io.
- Download one of the builds (master is the latest, but unstable).
- Extract the file somewhere convenient.
- Open a terminal/command window.
- Change to the root directory of the expanded distribution.
- Execute the command bin/spark-notebook (*NIX) or bin\spark-notebook (Windows).
- Open your browser to localhost:9000.
For details and configuration options, see Launch.
Run straight from sources (for geeks)
Check out the sources and run:
$ sbt run
Come on over to Gitter to discuss things, get some help, or start contributing!
The documentation is being rewritten. Meanwhile, read the slightly outdated docs on the advanced features and configuration, the different cluster deployment options (Amazon EMR, Mesos, YARN), and the answers to FAQs.
Skymind - The Deeplearning4j
Spark Notebook gives us a clean, useful way to mix code and prose when we demo and explain our tech to customers. The Spark ecosystem needed this.
It allows our analysts and developers (15+ users) to run ad-hoc queries, to perform complex data analysis and data visualisations, prototype machine learning pipelines. In addition, we use it to power our BI dashboards.
Name | URL | Description |
---|---|---|
Data Fellas | website | Mad Data Science and Scalable Computing |
Agile Lab | website | The only Italian Spark Certified systems integrator |
CloudPhysics | website | DATA-DRIVEN INSIGHTS FOR SMARTER IT |
Aliyun | product | Spark runtime environment on ECS and management tool for Spark clusters running on Aliyun ECS |
EMBL European Bioinformatics Institute | website | EMBL-EBI provides freely available data from life science experiments, performs basic research in computational biology and offers an extensive user training programme, supporting researchers in academia and industry. |
Metail | website | The best body shape and garment fit company in the world. To create and empower everyone’s online body identity. |
kt NexR | website | kt NexR has been one of the leading Big Data companies in Korea since 2007. |
Skymind | website | At Skymind, we’re tackling some of the most advanced problems in data analysis and machine intelligence. We offer state-of-the-art, flexible, scalable deep learning for industry. |
Amino | website | A new way to get the facts about your health care choices. |
Vinted | website | Online marketplace and a social network focused on young women’s lifestyle. |
Vingle | website | Vingle is the community where you can meet someone like you. |
47 Degrees | website | 47 Degrees is a global consulting firm and certified Typesafe & Databricks Partner specializing in Scala & Spark. |
Barclays | website | Barclays is a British multinational banking and financial services company headquartered in London. |
Swisscom | website | Swisscom is the leading mobile service provider in Switzerland. |
Knoldus | website | Knoldus is a global consulting firm and certified "Select" Lightbend & Databricks Partner specializing in the Scala & Spark ecosystem. |