We propose a novel machine learning-based framework to address the challenge of 5G network optimal network slice orchestration and a novel ML-based model pipeline to address resource allocation. This approach utilises a synergistic combination of supervised learning techniques and reinforcement learning (RL) to dynamically sub-slice and optimally orchestrate 5G network slices. The approach we propose optimises network resource allocation in real time as demands dynamically changes and while proactively scale network resources to meet predicted future network slice demands. This proposed framework enables a new proposed vertical 5G Enterprise Network As A Service which is tagged Company Network As A Service (CNaaS). The CNaaS does not only automates the optimal orchestration of diverse 5G network slices end-to-end to meet the varying needs of diverse application across an entire company’s network but also transforms the way 5G mobile network slice providers collaborate.
Currently, the Virtual Network Operator (VNO) has to wait for the Application Provider (AP) to send the user equipment slice requirement and then form the slice template; the VNO will then send to the Infrastructure Provider (IP) who is waiting on VNO for this information in order to allocate the required physical resource. The IP then sends the required orchestrated 5G networks physical resources to VNO for the slice orchestration who then maps this to demanding user equipment. The proposed CNaaS streamlines this process with an included message bus deployed on the mobile edge, which mobile network providers can subscribe to in real time. The proposed Enhanced Sub-Slice module (eSS) pushes the real time user equipment demands and the predicted slice demands unto the message bus. The network providers such as the AP, VNO and IP can subscribe to this in real time and proactively identify network resource needs thereby efficiently orchestrating network slice and allocating resources optimally. Recent works which highlights the potential of such hybrid approaches of using multiple machine learning algorithms for network optimisation has been the motivation behind the leveraging of multiple ML paradigms [
15,
16]. Their studies emphasises the advantage of using hybrid machine learning algorithm for network slice orchestration for prediction, monitoring and allocation of resources. The authors highlight the potential of zero touch network management with opportunities of hybrid ML for joint optimisation of network resource allocation. Key challenges such as data management, model integration and online learning were highlighted in their study. Our proposed ML-based framework aims to improve the model integration challenge by implementing a series of ML algorithms pipeline as a component in the framework, a typical enterprise network environment is shown in
Figure 2. This shows varying departments within an organisation with diverse needs. For example, the organisation or company marketing department might demand an eMBB slice for to remote sales while the same company with vehicles carrying Bio reactive materials might require a low latency network slice. With the proposed ML-Based framework, the 5G network slice requirements of live applications over the company network will be monitored, analyses and optimally orchestrated in real time.
A. System Architecture
To address the challenges of optimal 5G network slice orchestration and resource management, we propose a novel machine learning-driven framework that leverages the strengths of different ML techniques. Our approach utilises a synergistic combination of Lazypredict or AutoML, supervised learning, and time-series forecasting techniques to dynamically sub-slice and orchestrate 5G network slices. The framework optimises 5G network resource allocation in real time and proactively scales 5G network resources to meet anticipated future demands.
Figure 3, shows the architecture overview of the proposed framework. While Figure 4. Shows the relationship between the eSS module pipeline and the proposed new vertical CNaaS framework with eSS being implemented within the CNaaS.
Figure 3. highlights the components from data ingestion to network topology. The data which is live, historical, generative AI or an combination of any of these is fed into eSS model pipeline which pre-processes the data, analyses it, and identifies the requirements for the network slice template description. The allocation of resources by the eSS and orchestration of the 5G slice goes through sub-slice creation and modification to determine the best network functions and service functions to be allocated thereby efficiently managing the available resources. The performance of the sub-slice is monitored and optimised. Within the framework, the security and compliance function is applied based on the enterprise or company’s policies. The proposed framework is fully customisable and adaptable. Access control policies and other company’s specific policies are applied. This then goes through the service function chain policy engine which describes the service function templates and them configures the service function configuration. The best fit or alternative best fit is the allocated and the ability to auto scale is also implemented in the framework. The resource allocator uses the cloud-computing concept by auto scaling the virtual network function as required based on defined policy and budget while ensuring all applications diverse requirements are met in the dynamic environment.
One of the major challenges of the use of ML approach is the availability of 5G dataset. The more data is available, the better the model is trained. This research work uses augmented data of real world historical 5G network dataset and generative AI. This makes it possible for any new network to start using this framework and start learning with the available dataset with the limited available dataset from the start. As the historical dataset grows over the network, the proposed model keeps training and adapting to the network needs. We evaluated and compared the result of using just the historical dataset against the proposed augmented data and the result shows better accuracy with throughput, jitter, bandwidth and packet loss in augmented data compared to the historical dataset only. This shows the more available data helped with the model training and its accuracy.
The proposed machine learning-based framework consists of three main components, as illustrated in
Figure 5:
1) a Traffic Prediction Module which primarily leverages reinforcement learning technique for anticipated 5G network slice traffic demand prediction and sending the traffic predictions to a slice orchestrator;
2) a Slice Orchestrator which utilises supervised learning models like Random Forest (RF) and SVM for resource allocation and then sends the resource allocation decisions to resource allocator;
3) a Resource Allocator which receives the resource allocation decision from the slice orchestrator and implements it.
In the Traffic Prediction Module, Online learning Automated Machine Learning (AutoML) is further leveraged on to enhance the RL’s agent performance and adaptability. Similar designs which were proposed for hierarchical network resource orchestration in sliced 5G networks inspired this modular architecture. [
15,
17]. From their study, the hierarchical orchestration framework enabled dynamic end-to-end 5G network slicing by supporting inter-domain network resource optimal allocation. The hierarchical orchestration framework has three layers: the service management, which handles the end-to-end network slice lifecycle management, the network slice orchestration which handles the orchestration of 5G network slice across multiple domains, and the resource orchestration which handles and manages the allocation of physical/virtual 5G network resource within a domain in order to meet the slice requirements. This supports flexible sharing of network resource across layers, within and across domains.
With the proposed Enhanced Sub-Slice (eSS) model pipeline, network functions can be optimally shared among isolated network sub-slices thereby efficiently managing, orchestrating and allocating the available resource. The eSS allocates only the required network functions to be chained to make a 5G network slice thereby reducing the risk of over provisioning and reducing cost of allocating standard slices. The eSS model also orchestrates the best alternative fit network slice to reduce the risk of denial of slice and improve Quality of Experience (QoE). Based on the available physical resources, the eSS will analyse the best-fit slice for all live applications and predicted live applications to come online and ensure the available resources are optimally allocated to meet the needs of live applications. It proposes the alternative network slice with lesser network resource requirements but yet still meeting the needs of the application to enhance QoS and QoE.
The Traffic Prediction Module:
The Traffic Prediction Module utilises a two stage learning process:
Reinforcement learning which is an offline learning that learn historical data and predict traffic trends and anticipated demands of 5G network slice. This component is responsible for forecasting future network traffic patterns for each network slice.
AutoML, which is an online learning that continually adapts to the dynamic environment conditions and evolving user requirements. The RL agent adaptively updates its policy based on the live feedback, refine its predictions and enables its ability to dynamically respond to the evolving traffic pattern.
The RL agent of the Traffic Prediction Module learns to predict the future 5G network slice requirements through trial-and-error interactions with its environment by observing the network's state and receiving rewards or penalties based on the accuracy of its predictions. The RL agent's state space is made up of various network parameters while the action space is made up of the possible slice configurations. Current slice utilisation, application-specific Quality of Service (QoS) requirements and user mobility patterns are some of the network parameters. Factors like bandwidth allocation, resource prioritisation, and latency targets, are some possible slice configurations. A crucial aspect of the Traffic Prediction Module is the exploration-exploitation trade-offs. The module exploits its current knowledge base to make informed and accurate predictions. The module also explores new slice configurations to uncover potential improvements.
The AuoML helps automate and optimise RL agent’s training by automating hyper-parameter tuning as the performance of RL algorithm often depends on the right selection of hyper-parameters, enhanced feature engineering to automatically identify the relevant features from the raw network data and traffic patterns, model selection by automatically selecting the right model or using ensembled techniques to automatically combine multiple models, transfer learning to automate the transfer of knowledge from pre-trained models, and continuous adaptation which continuously monitors the RL agents performance and automatically adjusts its parameters or the underlying RL algorithm. This Traffic Prediction Module enables a more robust 5G network slice demand predictions in dynamic 5G network environments.
The Slice Orchestrator:
The core component is the Slice Orchestrator and forms the core of the eSS module pipeline. It manages the overall 5G network slice orchestration process. The Traffic Prediction Module sends traffic predictions to the slice orchestrator and the slice orchestrator interacts with the Resource Allocator to optimise the allocation of network slice resource. The slice orchestrator consist of two subcomponents, which are:
The Network Slice Requirements Analyser, eSS with an inbuilt metrics KPI estimator and which uses lazypredict, a python module to analyse the dataset, automatically select best fit machine learning algorithm for the given network dataset and then train the best supervised machine learning algorithm (e.g., Random Forests, SVM, Gradient Boosting, Neural Networks). This analyses the network traffic data to identify the 5G network slice requirement for the given network pattern. It identifies the best fit supervised learning classifier (e.g., Decision Trees, Support Vector Machines) for training, feature engineering, hyperparameter tuning and selecting the model with optimal performance considering the key metrics such as latency, throughput, bandwidth, jitter and packet loss rate. Although lazypredict is configured by default due to its lightweight capabilities, AutoML can also be configured where computational storage is not a challenge to take advantage of its online learning and automating features.
Network Sub-Slice Mapper, part of the eSS which uses bespoke inbuilt python function to select the best-fit or alternative best fir network functions configurations to be chained based on available network resources and predicted network resource requirements for future slices. The most suitable network sub-slice is mapped to the applications for optimal performance considering the key metrics such as latency, throughput, bandwidth, jitter and packet loss rate.
A similar approach which was employed in recent studies on RL-based resource allocation was a key motivator to employing a supervised learning technique to learn an optimal policy for resource allocation through interactions with the environment (i.e., the 5G network slices) [
12,
15]. The problem was formulated as a Markov Decision Process in their study and Deep Q-network (DQN) algorithm was used to handle the high-dimensional state and action spaces.
Physical resources such as compute, network, and storage are allocated and managed by the Resource Allocator across the network slices. Based on the learned policy, the Resource Allocator exposes an interface for the Slice Orchestrator to request resource deallocations or allocations. This Resource Allocator component enhances and utilises existing frameworks for resource management in virtualised network environments [
21,
22]. Their study delved into the Network Function Virtualisation (NFV), its infrastructure and components. The VNF orchestration and deployment, service function chaining, performance optimisation and resource utilisation plays a critical role in optimally allocating shared network resources.