Skip to content

Commit

Permalink
update 12.15
Browse files Browse the repository at this point in the history
  • Loading branch information
yuxiaoba committed Dec 15, 2024
1 parent 91747e7 commit c917649
Show file tree
Hide file tree
Showing 4 changed files with 14 additions and 7 deletions.
6 changes: 3 additions & 3 deletions Config_optimization/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,9 +3,9 @@

## Deep Learning

- 19_ICSE_DeepPerf: Performance Prediction for Configurable Software with Deep Sparse Neural Network [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8811988) [[code]](https://github.com/DeepPerf/DeepPerf)
- 19_ICSE_DeepPerf: Performance Prediction for Configurable Software with Deep Sparse Neural Network [[paper]](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8811988) [[code]](https://github.com/DeepPerf/DeepPerf)


## Bayesian Optimization

## Online Learning
- 24_NSDI_OPPerTune: Post-Deployment Configuration Tuning of Services Made Easy [[paper]](https://www.usenix.org/conference/nsdi24/presentation/somashekar) [[ppt]](https://www.usenix.org/system/files/nsdi24_slides-somashekarpdf.pdf)
- 22_SIGMOD_Towards Dynamic and Safe Configuration Tuning for Cloud Databases [[paper]](https://arxiv.org/pdf/2203.14473.pdf)
6 changes: 3 additions & 3 deletions Fault_tolerance/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -71,12 +71,12 @@
- 15_SIGMOD_Lineage-driven Fault Injection [[paper]](https://dl.acm.org/doi/10.1145/2723372.2723711)

### Fault Recovery

-24_Eurosys_Atlas: Hybrid Cloud Migration Advisor for Interactive Microservices [[paper]](https://arxiv.org/pdf/2311.06962.pdf)

- 24_SoCC_Deoxys: A Causal Inference Engine for Unhealthy Node Mitigation in Large-scale Cloud Infrastructure [[paper]](https://arxiv.org/abs/2410.17709)
- 24_Eurosys_Atlas: Hybrid Cloud Migration Advisor for Interactive Microservices [[paper]](https://arxiv.org/pdf/2311.06962.pdf)
- 22_KDD_NENYA: Cascade Reinforcement Learning for Cost-Aware Failure Mitigation at Microsoft 365 [[paper]](https://dl.acm.org/doi/pdf/10.1145/3534678.3539127)
- 22_SoCC_Method Overloading the Circuit [[paper]](https://dl.acm.org/doi/abs/10.1145/3542929.3563466) [[video]](https://www.youtube.com/watch?v=A3FWuvDEZJI)
- 21_DSN_FIRestarter: Practical Software Crash Recovery with Targeted Library-level Fault Injection [[paper]](https://download.vusec.net/papers/firestarter_dsn21.pdf) [[code]](https://github.com/vusec/firestarter)
- 20_OSDI_Narya: Predictive and Adaptive Failure Mitigation to Avert Production Cloud VM Interruptions [[paper]](https://www.usenix.org/conference/osdi20/presentation/levy)



Expand Down
6 changes: 5 additions & 1 deletion LLM4Ops/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,6 +8,7 @@
- [Anomaly Detection](#anomaly-detection)
- [Root Cause Analysis and Mitigation](#root-cause-analysis-and-mitigation)
- [Config](#config)
- [Rag](#rag)

## Survey
- 24_A Survey of AIOps for Failure Management in the Era of Large Language Models [[paper]](https://arxiv.org/pdf/2406.11213)
Expand Down Expand Up @@ -42,4 +43,7 @@
- 24_GenKubeSec: LLM-Based Kubernetes Misconfiguration Detection, Localization, Reasoning, and Remediation [[paper]](https://arxiv.org/pdf/2405.19954)
- 24_Identifying Performance-Sensitive Configurations in Software Systems through Code Analysis with LLM Agents [[paper]](https://arxiv.org/pdf/2406.12806)
- 24_ICSE-SEIP_MissConf: LLM-Enhanced Reproduction of Configuration-Triggered Bugs [[paper]](https://dl.acm.org/doi/pdf/10.1145/3639478.3647635)
- 24_ISSTA_Config_Face It Yourselves: An LLM-Based Two-Stage Strategy to Localize Configuration Errors via Logs [[paper]](https://arxiv.org/abs/2404.00640)
- 24_ISSTA_Config_Face It Yourselves: An LLM-Based Two-Stage Strategy to Localize Configuration Errors via Logs [[paper]](https://arxiv.org/abs/2404.00640)

## Rag
- 24_DECO: Life-Cycle Management of Enterprise-Grade Chatbots [[paper]](https://arxiv.org/abs/2412.06099)
3 changes: 3 additions & 0 deletions Resource_schedule/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,6 +12,7 @@
- 22_NSDI_Orca: A Distributed Serving System for Transformer-Based Generative Models [[paper]](https://www.usenix.org/conference/osdi22/presentation/yu)

## Resource Allocation
- 24_Eurosys_Atlas: Hybrid Cloud Migration Advisor for Interactive Microservices [[paper]](https://dl.acm.org/doi/pdf/10.1145/3627703.3629587) [[code]](https://github.com/IBM/DeepAtlas)
- 22_SoCC_SIMPPO: A Scalable and Incremental Online Learning Framework for Serverless Resource Management [[paper]](https://haoran-qiu.com/pdf/socc22.pdf)
- 22_Asplos_SOL: Safe On-Node Learning in Cloud Platforms [[paper]](https://dl.acm.org/doi/pdf/10.1145/3503222.3507704)
- 21_ICML_Resource Allocation in Multi-armed Bandit Exploration: Overcoming Sublinear Scaling with Adaptive Parallelism [[paper]](http://proceedings.mlr.press/v139/thananjeyan21a/thananjeyan21a.pdf)
Expand All @@ -20,6 +21,7 @@


## Autoscale
- 24_Eurosys_Erlang: Application-Aware Autoscaling for Cloud Microservices [[paper]](https://dl.acm.org/doi/10.1145/3627703.3650084) [[code]](https://github.com/vigsachi/ erlang)
- 24_SC_Fast and Efficient Scaling for Microservices with SurgeGuard [[paper]](https://dl.acm.org/doi/10.1109/SC41406.2024.00103)
- 24_NSDI_Autothrottle: A Practical Bi-Level Approach to Resource Management for SLO-Targeted Microservices [[paper]](https://www.usenix.org/system/files/nsdi24-wang-zibo.pdf) [[code]](https://github.com/microsoft/autothrottle)
- 23_ATC_AWARE: Automate Workload Autoscaling with Reinforcement Learning in Production Cloud Systems [[paper]](https://www.usenix.org/system/files/atc23-qiu-haoran.pdf) [[Code]](https://gitlab.engr.illinois.edu/DEPEND/aware)
Expand All @@ -28,6 +30,7 @@
- 22_SoCC_DeepScaling: Microservices AutoScaling for Stable CPU Utilization in Large Scale Cloud Systems [[paper]](https://dl.acm.org/doi/pdf/10.1145/3542929.3563469)
- 22_SoCC_The Power of Prediction: Microservice Auto Scaling via Workload Learning [[paper]](https://dl.acm.org/doi/abs/10.1145/3542929.3563477)
- 22_TCC_Microscaler: Cost-effective scaling for microservice applications in the cloud with an online learning approach [[paper]](https://ieeexplore.ieee.org/abstract/document/9057418/)
- 22_HPDC_Practical Efficient Microservice Autoscaling with QoS Assurance [[paper]](https://dl.acm.org/doi/pdf/10.1145/3502181.3531460)
- 22_ICWS_HRA: An Intelligent Holistic Resource Autoscaling Framework for Multi-service Applications [[paper]](https://ieeexplore.ieee.org/document/9885736)
- 21_SoCC_SHOWAR: Right-Sizing And Efficient Scheduling of Microservices [[paper]](https://dl.acm.org/doi/10.1145/3472883.3486999)
- 20_OSDI_FIRM: An intelligent fine-grained resource management framework for slo-oriented microservices [[paper]](https://www.usenix.org/conference/osdi20/presentation/qiu)
Expand Down

0 comments on commit c917649

Please sign in to comment.