This repo contains guidelines and steps for setting up in-house production infrastructure and cloud services of open-source technologies from scratch.
-
CDC Pipelines
- High-performance coordination service for distributed applications:
- Distributed event streaming platform:
- Distributed framework to stream data into and out of Apache kafka:
- Distributed registry to store kafka-payload's schemas:
-
Databases
- SQL/RDBMS:
- NoSQL
- Document:
- Key-value:
- Graph:
- Time Series:
- Prometheus (NoSQL)
- Timescale (SQL)
-
Distributed Workflow Management
-
Big Data
- Distributed SQL Query Engine on any data storage:
- Distributed & Resilient Data Processing framework:
- SQL on HDFS:
-
Search Engines
-
Centralized Logging
- Elastic Stack
- Filebeat
- Elasticsearch-Ingest-Pipeline
- Kibana
- Elastic Stack
-
Business Intelligence
-
Container Orchestration
-
Service Discovery, Health Checking & Configuration
-
Service Monitoring
-
SSL Certs
-
Load balancing & Reverse Proxying
-
VPN
-
Linux
...