Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
-
Updated
Sep 19, 2024 - Go
Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.
Run in all nodes of your cluster before the cluster starts - lets you customize your cluster
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.
Streaming JSON data to Spark or Google Cloud Dataproc.
[DEPRECATED] Kubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
Sua missão será criar um ecossistema de Big Data usando o Google Cloud Platform (GCP). Para isso, o expert te ensinará a configurar o Google Cloud Dataproc, um Hadoop totalmente gerenciado, usando seus créditos gratuitos da GCP.
Projeto do Curso "Criando um Ecossistema Hadoop Totalmente Gerenciado com Google Cloud Dataproc" do Bootcamp Data Engineer da Digital Innovation One
A sample demo to check latest spark, big query connector and scala 2.12
Add a description, image, and links to the google-cloud-dataproc topic page so that developers can more easily learn about it.
To associate your repository with the google-cloud-dataproc topic, visit your repo's landing page and select "manage topics."