Machine Learning models have been successfully applied to predict Energy Utilization in different areas. In Data centres, this can be applied to reduce power utilization by dynamically controlling servers (power off and on). However, this has been limited in data centres as if any wrong prediction may result in latency and may not meet SLA.
In this project, the main aim is to predict the load for next 5 seconds as output considering past 30 seconds as input. These predictions can help to dynamically turn on servers if there is more load. The load on servers depends on make and model of servers. Random Forest Regressor and LSTM Model have been applied to predict the load.
The dataset used has 19 features. The statistics are collected at a frequency of 1 second. Considered “group [0].csv” for training the model.
- Time (sec)
- Group Power Factor (%)
- Load of Online Machines(%)
- Average CPU Core Temp. (°c)
- Lowest Load(%)
- Hottest CPU Core Temp. (°c)
- Highest Load(%)
- WC Average CPU Core Temp (°c)
- Number of Requests
- WC Hottest CPU Core Temp (°c)
- Powered BED Count 12.Capacity Used(%)
- Group Power(w)
- I/O Load of Online Machines(%)
- Active BED Count
- I/O Capacity Used(%)
- S3 Powered Count
- Number of Batch Jobs
- Online BED Count