This project aims to predict energy consumption in 5G base stations using Supervised Learning Regression techniques. The goal is to model and estimate the energy consumed by different 5G base stations based on various features such as load, transmitting power, and energy-saving methods. This is particularly relevant given the increasing cost of energy consumption in telecom operations. The dataset and challenge were provided by the International Telecommunication Union (ITU) as part of a global competition in 2023. This project uses a subset of the dataset for learning purposes.
Type of Problem: Supervised Regression
Network operational expenditure (OPEX) accounts for around 25% of a telecom operator's costs, with 90% of it being energy bills. A significant portion of this energy is consumed by the Radio Access Network (RAN), particularly by base stations (BSs). The goal is to build a machine learning model that can estimate energy consumption based on various network and traffic parameters.
- Source: ITU Challenge Dataset
- Size: The dataset includes traffic statistics of 4G/5G sites collected over different days.
- Key Features:
Time
: Date and time of data collection.BS
: Base station identifier.Energy
: Energy consumed (target variable).Load
: Total load on the base station.ESMODE
: Energy-saving method in use.TXpower
: Transmitting power of the base station.
The following regression models were implemented and compared:
- Linear Regression
- Decision Tree Regression
This project uses the following Python libraries:
- Pandas: Data manipulation.
- NumPy: Numerical computations.
- Matplotlib/Seaborn: Data visualization.
- Scikit-learn: Machine learning algorithms and metrics.
- SciPy: Statistical operations (outlier handling).
-
Data Preprocessing:
- Handled missing values.
- Converted time column to
datetime
. - Detected and removed outliers using the Z-score method for the
ESMODE
column.
-
Exploratory Data Analysis (EDA):
- Visualized data distributions and relationships between features.
- Handled categorical features using label encoding.
-
Model Selection & Training:
- Used Linear Regression and Decision Tree Regressor.
- Split the data into 80% training and 20% test sets.
-
Model Evaluation:
- Models were evaluated using the Mean Squared Error (MSE) and R² Score.
-
Results:
- Linear Regression:
- MSE: 1240.52
- R² Score: 0.812
- Decision Tree Regressor:
- MSE: 810.65
- R² Score: 0.872
- Linear Regression:
-
Conclusion & Discussion:
- The Decision Tree Regressor performed better with a lower MSE and higher R² score.
- Further improvements could include the use of more advanced models (e.g., Random Forest, Gradient Boosting), hyperparameter tuning, or feature engineering.
- Clone the repository:
git clone https://github.com/yourusername/5G-Energy-Consumption.git
- Install necessary libraries:
pip install -r requirements.txt
- Run the script:
python 5G_energy_consumption.py
Outliers Handling (ESMODE Column)
Predicting 5G base station energy consumption using supervised machine learning has real-world applications in addressing critical challenges in the telecommunications industry. Here are several real-world problems that my project is either directly solving or can help solve:
- Problem: Telecom companies spend a significant portion of their operational expenses (OPEX) on energy costs—90% of OPEX is estimated to go toward energy bills, with over 70% consumed by 5G base stations.
- Solution: By accurately predicting the energy consumption of 5G base stations based on traffic conditions, configurations, and energy-saving methods, this project enables telecom operators to better manage energy usage. This leads to:
- Cost reduction: Operators can optimize energy use, reducing energy bills and improving profitability.
- Energy efficiency: Predicting high-energy consumption periods helps in scheduling power-saving modes or adjusting station configurations for efficiency.
- Problem: Telecom infrastructure contributes significantly to global energy consumption and carbon emissions. With the rapid expansion of 5G networks, energy demand will continue to increase.
- Solution: The model can forecast energy consumption and help network operators implement energy-saving techniques like turning off unnecessary base stations during low-traffic periods. This helps in:
- Reducing carbon footprint by optimizing energy usage.
- Promoting sustainable business practices in the telecom industry.
- Problem: Network operators must allocate resources efficiently to maintain the quality of service while keeping operational costs low. This involves managing traffic demand, energy usage, and infrastructure configuration.
- Solution: By predicting energy usage under different traffic conditions and configurations, this project can aid in:
- Dynamic scaling of energy resources: Allocating power efficiently during peak and off-peak hours.
- Smart network planning: Operators can use this model to simulate various configurations and conditions to determine the most energy-efficient design for their network infrastructure.
- Problem: As cities become "smarter" with the rise of IoT (Internet of Things), 5G infrastructure will need to handle the increased demand for data traffic while minimizing energy consumption.
- Solution: This model can be used in smart city applications to predict and optimize the energy consumption of 5G base stations. By doing so, it supports smart grids, efficient energy distribution, and more sustainable infrastructure.
- Problem: Many telecom operators are adopting green energy initiatives, including the integration of renewable energy sources like solar or wind power.
- Solution: This model can assist in decision-making by identifying where energy consumption is highest and where renewable energy could be integrated to achieve the greatest cost savings and environmental benefit.
- Problem: With the rollout of 5G, the energy demand of networks is rapidly increasing, but accurately forecasting future energy needs can be challenging.
- Solution: This project helps predict future energy consumption trends based on real-time and historical data. This allows operators to anticipate energy needs and make proactive adjustments, leading to better capacity planning and energy management.
- Reduce energy costs for telecom operators.
- Enhance sustainability by lowering the carbon footprint of 5G networks.
- Optimize resource allocation for better network performance.
- Support green telecom initiatives and decision-making.
- Enable smart city infrastructure by managing energy efficiently.
These applications align with current industry trends towards energy-efficient technologies and sustainable growth. This project has the potential to offer value to telecom companies, environmental agencies, and organizations focused on energy management in large-scale networks.
This project demonstrates the effectiveness of supervised regression techniques in predicting energy consumption of 5G base stations. The Decision Tree Regressor model outperformed the Linear Regression model in terms of accuracy.
Experiment with more advanced models like Random Forest, XGBoost, and Gradient Boosting. Perform hyperparameter tuning using GridSearchCV. Experiment with feature engineering and new datasets. Deploy the model using Flask or Streamlit for real-time predictions.
We welcome contributions to improve the models above. To contribute:
- Fork the repository.
- Create a feature branch (
git checkout -b feature-name
). - Commit your changes (
git commit -m 'Add some feature'
). - Push to the branch (
git push origin feature-name
). - Open a pull request.
This project is licensed under the MIT License. See the LICENSE file for details.
For any inquiries or support related to 5G ENERGY ONSUMPTON PREDICTION, please contact:
Clifford Nwanna
Email: nwannachumaclifford@gmail.com