- Java JDK 1.8: Link
- Apache Hadoop 2.7.7: Link
- Apache Spark 2.4.4: Link
- Apache Maven 3.6.0: Link
- Apache Kafka 2.3.0: Link
- PostgreSQL: Link
- Docker: Link
-
Java JDK 1.8
- Descargar e Instalar. NOTA: Instalar en una ruta sin espacios.
- Definir variable de entorno
%JAVA_HOME%
-
hadoop-2.7.7.tar.gz
- Descargar y Descomprimir.
- Definir variable de entorno
%HADOOP_HOME%
- Para sistemas windows, hay que descargar el ejecutable winutils.exe, y ubicarlo en el %HADOOP_HOME%\bin
-
spark-2.4.4-bin-hadoop2.7.tgz
- Descargar y Descomprimir.
- Definir variable de entorno
%SPARK_HOME%
-
Apache Maven 3.6 (Opcional - Para compilar los ejemplos y ejercicios del curso)
- Descargar y Descomprimir.
- Definir variable de entorno
%MAVEN_HOME%
-
Configurar las variables de entorno en el PATH del sistema:
PATH=%JAVA_HOME%\bin;%HADOOP_HOME%\bin;%SPARK_HOME%\bin;%PATH%
Directory where the scripts are found to execute the different exercises of the course.
Description of the executions of which the course is composed:
List of exercises and examples of Apache Spark Core:
- Exercise 1 (spark-core-exercise-1): Using context.
- Exercise 2 (spark-core-exercise-2): Reading a plain text file.
- Exercise 3 (spark-core-exercise-3): Reading a plain compressed text file.
List of exercises and examples of Apache Spark SQL:
- Exercise 1 (spark-sql-exercise-1):
List of exercises and examples of Apache Spark Streaming:
- Exercise 1 (spark-streaming-exercise-1):
List of exercises and examples of the use of spark in a cluster:
- Exercise 1 (spark-cluster-exercise-1):