HoloInsight is a cloud-native observability platform with a special focus on real-time log analysis and AI integration.
Updated Jul 10, 2025 - Java
InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning, machine learning, and related models.
An autonomous SRE agent that monitors cloud logs across multiple platforms, leveraging AI models from various providers to detect anomalies, perform root cause analysis, and automate remediation by creating GitHub Pull Requests.
Open source code for AIOpsServing
🚀 Enhance Google Cloud operations with the Gemini SRE Agent, automating log monitoring and incident response for smarter site reliability.
AI-powered alert automation for n8n — unify alerts from monitoring systems, analyze via LLM, and auto-notify DevOps teams on Telegram.
ReliaKit TL-15 is an open-source, planet-grade resilience framework for distributed infrastructure. It integrates automated DDoS protection, geo-aware routing, chaos engineering, and symbolic AI hooks to achieve fault tolerance beyond traditional benchmarks.
The collection of charms used to integrate Juju deployed Slurm clusters with Vantage.
An AI Agent IaC tool that aims to make developing and deploying AI Agents easier.
DevOps, SRE & AI portfolio: multi-cloud (AWS/Azure/GCP/OCI), MLOps & automation.
An AI-powered DevOps tool that analyzes Linux server logs to detect anomalies and predict failures. It integrates ML models, automated fixes via Ansible, containerization with Docker, and orchestration using Kubernetes, providing a full-stack solution for predictive maintenance.
🛠️ AI Ops Assistant — A Go-based backend system for automated log summarization, ticket triage, and changelog generation. Built with GraphQL, JWT auth, PostgreSQL, and a worker queue architecture for scalable operations.
Composable, explainable agent governance layer for AI-native platforms
🚦 Streamline alert management by using AI to triage infrastructure alerts, ensuring operators focus only on critical issues with Uptime Kuma integration.
🤖 Simplify IT operations with Wuhr AI Ops, an AI-driven platform for real-time monitoring, log analysis, and seamless CI/CD management.
Open R&D notes demo site for lumencore: grid-aware, resource-efficient compute, and telecom practices aimed at reducing operational GHG intensity
🚀 Automate self-healing and root cause analysis for financial services with this Kubernetes-native operator designed to enhance system reliability and resilience.
End-to-end design and implementation of a multi-intent AI chatbot architecture. Includes intent detection, dynamic routing, document retrieval, SQL query generation, observability, guardrails, and CI/CD automation for enterprise-scale deployment.