Semantic caching demo with real-time streaming and a cost & sizing calculator, powered by Azure Managed Redis and Azure OpenAI.
Updated Nov 12, 2025 (Python)
Reproducible microbenchmark for modeling domain-crossing energy in heterogeneous compute systems.
A full-stack GPU profiling and simulation framework that bridges high-level Python ML code with low-level hardware metrics (SM Banks, Tensor Cores) for precise performance analysis.
Distributed engineering cost modeling and team topology pricing platform for CTO decision making.