1. Introduction
In recent years, the development of the Internet of Things (IoT), complemented by advancements in communication technologies, has enabled a vast array of mobile devices (MDs)—including cameras, sensors, and wearable devices—to collect and exchange data [
1]. This proliferation has given rise to numerous application scenarios that leverage wireless devices, such as autonomous driving, face recognition, Virtual Reality (VR), and e-health [
2]. However, these applications often demand substantial computational power and low latency, presenting challenges for MDs, which typically have limited computing capabilities and finite battery life. Mobile Edge Computing (MEC) has risen as a promising solution to these challenges. By offloading computationally intensive tasks to MEC servers, MDs with constrained resources can markedly enhance their computational capacity and reduce latency.
However, wireless devices often have limited battery capacity, which cannot sustain prolonged operation. Consequently, the frequent replacement of MDs’ batteries presents a significant challenge. Wireless Power Transfer (WPT) has emerged as an effective solution to this challenge [
3,
4]. WPT utilizes a Hybrid Access Point (HAP) to broadcast Radio Frequency (RF) energy that can be harvested by wireless devices. By integrating Energy Harvesting (EH) technology, these devices can convert the captured RF signals into usable electrical energy, thereby recharging their batteries [
5]. This harvested energy enables the devices to perform computational tasks either locally or by offloading them to MEC servers. The integration of wireless power and edge computing technologies in Wireless Powered Mobile Edge Computing (WPMEC) networks significantly extends the battery lifespan of wireless devices and markedly enhances their computational capabilities.
In addition to battery limitations, the double-near-far effect can significantly impact network performance, with devices far from the HAP experiencing poor channel conditions [
6]. To counteract this issue, a user cooperation (UC) mechanism has been implemented. In this mechanism, devices that are in close proximity to the HAP act as relays, forwarding signals for those located at a greater distance. This strategy not only mitigates the inefficiency of remote nodes offloading tasks directly to the Access Point (AP) but also optimizes the utilization of idle computational resources within the network, thereby enhancing the overall computational efficiency of the system. For example, the work in [
7] demonstrates how user cooperation can boost the computational efficiency of a WPMEC system under dynamic channel conditions and varying task arrivals. Additionally, other studies, such as [
8,
9], have shown that user cooperation can effectively reduce the impact of the double-near-far effect. However, the aforementioned studies have not yet explored the potential of Backscatter technology to improve energy efficiency further.
Backscatter communication (BackCom) has garnered significant attention in recent years due to its innovative approach to wireless communication [
10]. In BackCom systems, the transmitter operates in full-duplex mode, functioning in a passive mode. It modulates and reflects the incident signal to the receiver, eliminating the need to generate a carrier frequency, while simultaneously harvesting energy to support its circuitry consumption [
11,
12]. This approach contrasts with traditional active communication (AC), where the transmitter first harvests energy and then uses this harvested energy to transmit data, adhering to the harvest-then-transmit (HTT) protocol. Generally, AC consumes more energy than BackCom, although it can achieve a higher data transfer rate. The trade-offs between EH and data transfer are inherent in both BackCom and AC.
The motivation behind this study stems from the recognition that while integrating BackCom and AC paradigms has shown promise in enhancing the energy efficiency (EE) of WPMEC systems [
8,
13], current research is predominantly confined to static, single-time slot scenarios. These studies often assume constant channel conditions and user data arrivals, which contrasts sharply with the dynamic, stochastic nature of real-world MEC networks where data arrivals and channel states are subject to continuous variation. This volatility complicates the prediction and management of network operations, thereby necessitating the development of robust algorithms capable of optimizing long-term energy utilization efficiency and maintaining system queue stability. Addressing these challenges is not only of theoretical interest but also of paramount practical importance, as it directly impacts the sustainability and reliability of MEC services in fluctuating operational environments. Our study, therefore, aims to bridge this gap by proposing an algorithmic framework that can adeptly navigate the complexities of volatile network conditions, ensuring optimal energy efficiency and queue stability in WPMEC networks.
In this paper, we tackle the long-term EE maximization for a Backscatter assisted WPMEC network with user cooperation by jointly optimizing the wireless powered time fraction, BackCom offloading time fraction, AC offloading time fraction, offloading data size and transfer power of MDs. The problem presents significant challenges in two aspects: (1) The randomness of task arrivals and fluctuating wireless channel states impose challenges to achieving optimal EE while ensuring the stability of queue system; (2) The integration of BackCom and AC brings a strong coupling of energy harvest time and task offloading. To address these challenges, we formulate a stochastic optimization problem and propose an efficient, low-complexity algorithm by leveraging techniques such as the Dinkelbach method and the Lyapunov optimization framework. We first transform the sequential decision problem into a deterministic problem for each time slot by leveraging the drift-plus-penalty technique, obtaining a non-convex optimization sub-problem. Then, we convert the non-convex problem into a convex optimization problem by using variable substitution, which allows for efficient solution. We propose a low-complexity dynamic EE maximization algorithm that operates online without requiring prior system information.
The primary contributions of this paper are listed as follows:
We introduce an innovative dynamic task offloading model to optimize EE for a WPMEC network with integration of BackCom and AC communication under user cooperation, taking into account the randomness of task arrival and time-varying wireless channels. Our model effectively balances the trade-off between energy efficiency and system queues stability, while mitigating the double-near-far effect. Additionally, we explore the use of variable data weighting to motivate proximal users to relay data for distant users, enhancing overall network efficiency.
We propose an online control algorithm to maximize the EE metric of WPMEC network by determining the time fraction allocation, data offloading, transmission power and the Backscatter reflection coefficients at each time slot. To address the complex coupling of user cooperation and control decisions over time, we employ Dinkelbach’s method and the Lyapunov optimization theory to decouple the stochastic fractional optimization problem into deterministic sub-problems for each time slot, and transforms it into a convex problem, ensuring an efficient and optimal solution.
We present a rigorous mathematical analysis to demonstrate the performance of our proposed algorithm, that achieves a balanced trade-off between energy efficiency and queue stability within the bounds of . Extensive simulation experiments are conducted to verify the algorithm’s effectiveness and practical applicability. We have systematically evaluated the impact of key control parameters, including variable V, bandwidth, communication gap, and task arrival rates, on the algorithm’s performance.
The remainder of this paper is organized as follows: Section II presents the details of model for Backscatter-assisted WPMEC system. In Section III, we formulate a stochastic programming optimization problem aiming to maximize energy efficiency. Section IV details the application of the Dinkelbach’s method and Lyapunov optimization techniques to simplify the problem, including the algorithm design and theoretical performance analysis. Section V presents an extensive simulation-based evaluation of the proposed algorithm’s performance. Finally, we conclude the paper and suggest directions for future research in Section VI.
1.1. Related Work
The combination of WPT with MEC networks, as an efficient solution for wireless devices to augment their energy and computational capacities, has been extensively studied by recent researches [
14,
15,
16,
17]. Ernest and Madhukumar [
18] proposed an energy efficiency maximization algorithm based on multi-agent deep reinforcement learning for a MEC-supported vehicular network, with jointly considering transmission and computation latencies outperforming existing strategies. Zhang et al. [
19] proposed an algorithm optimizing charging time and data offloading rates for WPT-MEC IoT sensor networks to improve computational rates in different scenarios. Li et al. [
5] studied the system latency minimization problem for an Intelligent Reflecting Surfaces (IRS)-assisted multi-ID MEC system, and presented a hybrid multiple access scheme and optimization framework combined with Frequency Division Multiple Access (FDMA) and Non-Orthogonal Multiple Access (NOMA) technologies. Additionally, in [
20], the authors introduced a deep reinforcement learning-based approach for WPT-aided mobile edge computing to dynamically adapt to real-time changes, make swift decisions, and optimize both task offloading and energy resource allocation. Our previous research [
7] introduced an online control algorithm for dynamic task offloading in WPMEC networks under dynamic network conditions, designed to maximize long-term system energy efficiency. However, the aforementioned studies did not take into account the use of Backscatter technique to further enhance the energy utilization efficiency of wireless power transfer.
To mitigate the double-near-far effect and fully utilize available resources, many researchers employ a user cooperation mechanism [
7,
9,
21,
22,
23]. He et al. [
22] presented a user cooperation scheme, aiming to maximize the network’s total throughput by jointly optimizing the local computing frequency, transmit power, task distribution, and time allocation. Wang et al. [
21] introduced a user cooperation mechanism for a NOMA assisted WPT-MEC network, designed a iterative-based optimal algorithm to minimize overall system energy consumption by leveraging Lagrangian method. Zhang et al. [
24] presents a hierarchical reinforcement learning-based approach for joint caching and resource allocation in a cooperative mobile-edge computing system, aiming to optimize resource utilization and balance server loads through service caching and workload offloading decisions. Sun et al. [
25] proposed an iterative optimization algorithm for minimizing end-to-end latency in an MEC network supporting IoT applications, by jointly optimizing user association and resource allocation in a three-phase operation protocol. Su et al. [
9] explored optimizing the energy beamforming and resource allocation to enhance computation efficiency for WPMEC system with the integration of user cooperation and NOMA, taking into account non-linear energy harvesting model.
In recent years, the integration of BackCom and active communication(AC) has emerged as an effective approach to enhance network energy efficiency, leveraging the unique characteristics of Backscatter technology to balance transmission rates and energy consumption, thereby significantly improving the system’s overall performance [
8,
26,
27,
28,
29]. Lyu et al. [
30] introduces a hybrid HTT and BackCom framework for cognitive wireless powered IoT networks, optimizing time allocation and mode combination to maximize system throughput. Ye et al. [
27] introduced a bisection-based iterative algorithm for minimizing data offloading and computing delays in a WPMEC network with hybrid BackCom and AC for IoT networks. Shi et al. [
31] proposed a scheme for maximizing the weighted sum of computation bits in a Backscatter-assisted WPMEC network, considering a practical non-linear EH model with hybrid HTT and Backscatter communications. Wu and He [
26] proposed an efficient iterative algorithm for EE maximization in a multi-access WPMEC system with the help of a relay. Lin et al. [
32] presents an optimization framework for a BackCom NOMA system, aiming to maximize the sum uplink rate by optimizing reflection coefficients and establishing association policies between base stations and backscatter devices. Fu et al. [
33] addressed the energy efficiency fairness among IoT nodes in a UAV-enabled WPMEC network with integrated BackCom and AC, proposed an optimization framework that maximizes the worst-case IoT node’s energy efficiency by jointly optimizing UAV transmit power and trajectory, IoT nodes’ BackCom and AC parameters, and local computing configurations. However, the aforementioned studies primarily focus on optimizing a single time slot and do not account for the dynamic fluctuations inherent in MEC network environments.
Different from the above research, this paper addresses the problem of hybrid communication modes (e.g., BackCom and AC) and user cooperation for EE maximization in the volatile WPMEC network, without any information about the future. Compared to [
8], our approach considers both nodes capable of processing their own incoming data simultaneously, and we introduce weighted incentives to motivate proximal nodes to assist distant nodes in offloading computational tasks. We account for dynamic factors, including random task arrivals and fluctuating wireless channel states, the prior knowledge of which is difficult to pinpoint accurately, making task offloading and resource allocation significantly challenging. Moreover, the coupling between dynamic battery levels and wireless charging time further complicates the problem.
3. Problem Formulation
In this paper, we aim to design a dynamic offloading algorithm to maximize the
subject to constraints of the system queue stability, by making decisions on time allocation
, power allocation
, reflection coefficients
and the amount of offloaded tasks
at each time slot
t. Simultaneously, our algorithm should ensure the stability of the system network when faced with randomly arriving task loads and dynamically changing wireless channel conditions. By denoting
,
,
, and
the maximization of
for a Backscatter-assisted WPMEC with user cooperation can be formulated as the following problem (P0):
where constraint (20b) ensures that the total offloading time in each slot does not exceed the available time. Constraint (20c) maintains the battery levels within the allowable range for both mobile devices. Constraints (20d) guarantee the stability of data queues. Constraint (20e) indicates that the amount of processed task in the current time slot must not exceed the current queue length. Constraint (20f) guarantees that tasks offloaded by
to
can be processed within the same slot. Constraints in (20g) denote the maximum offloading data depending on the channel condition. The problem is a fractional stochastic programming issue, which presents significant challenges due to several aspects: (1) The randomness of task arrivals, the fluctuating wireless channel state, and the dynamic battery level introduce stochastic factors to the optimization challenge; (2) The temporal coupling in the time fraction and energy consumption exhibited by BackCom and AC poses a considerable challenge in determining the allocation of offloading time.
Due to the fractional nature of the objective function of P0, traditional optimization techniques aren’t directly applicable, such that we leverage Dinkelbach’s method [
39] to transform the problem into a more tractable one. Let
denote the optimal value of
, we derive the following Theorem 1.
Theorem 1.
The optimal is achieved if and only if
Proof. For brevity, here we omit the proof details. See Theorem 1 in [
38]. □
Since
is unknown during the solution process, (20) is still infeasible to tackle. In accordance with the methodology employed in [
38], we introduce a new parameter
and define it as
We set
at the beginning of the problem. Replacing
in (20), the problem P1 can be transformed into
where
is a given parameter that should be updated through the resolution process. It should be noted that
obtained by (22) will get closer to
as time goes by [
38]. Therefore, this transformation is reasonable and has the same optimal solution as P0. While problem P1 is more manageable than problem P0, it still faces several challenges The constraints (20c) and (20d), along with the equation (
4), result in an interdependence of battery levels across various time slots throughout the period, which means that the current energy consumption affects future battery levels. Moreover, the unpredictability of the stochastic task arrivals and the fluctuating channel states add another complexity to the problem. The difficulty in accurately forecasting these elements leads to an inherent temporal coupling in the decision-making process.
5. Simulation Results
In this section, we evaluate the performance of our proposed algorithm for a Backscatter-assisted WPMEC system with user cooperation through extensive numerical simulation. The experiments are conducted on a high-performance platform equipped with a 2.10 GHz Intel(R) Xeon(R) Silver 4116 CPU and four GeForce RTX 2080 Ti GPUs, ensuring efficient simulation execution. We employed the free-space path loss model to simulate signal propagation, where the average channel gain
is calculated by the following formula [
42]:
where
(antenna gain),
MHz (carrier frequency),
(path loss exponent), and
represents the distance between nodes (in meters).
The dynamic channel gains for WPT and task offloading, following the Rayleigh fading model, are represented by the vector . In the model, the channel fading factors all follow an exponential distribution with an expectation of 1, simulating the natural variability of wireless channels. For the sake of model simplification, we assume that the vector of fading factors remains constant at within each time slot, thereby considering the channel gain to be static during that slot.
At each time slot
t, the expected arrival rates of computational tasks
satisfy
. It is assumed that
follow an exponential distribution, where the rate parameters
and
correspond to 1.2 and 1.5, respectively. The main source code is available online at
https://github.com/Toxic-Gulu/bac-with-ac. Other simulation parameters are detailed in
Table 2.
To comprehensively evaluate the performance of our algorithm, we conducted comparative simulations with three representative benchmarks as follows:
(1) UC with the AC scheme (UAC) [
43]: In the WPMEC system, task offloading is facilitated through user collaboration, where
and
communicate exclusively via the AC mode, without the integration of Backscatter modules. This approach only leverages the AC communication model for data transmission.
(2) UC with the BackCom scheme (UBC) [
8]: The WPMEC network employs a collaborative approach among users, with inter-user communication solely relying on the Backscatter technique. Specifically, the HAP continuously broadcasts RF energy to the users throughout each time slot, resulting in the highest energy expenditure for the HAP due to its non-stop energy emission.
(3) Without UC With Integrated BackCom and AC scheme (BC+AC) [
44]: The WPMEC network forgoes user collaboration, with
and
establishing direct communication links with the HAP. In this configuration,
and
are autonomous, eschewing collaborative interactions. Furthermore, each device is integrated with a Backscatter module, enabling the utilization of both BackCom and AC for their communication needs.
(4) Random Offloading Scheme (ROS) : The WPMEC network employs a typical task offloading approach where and independently offload a random subset of their tasks to the HAP. There is no collaboration between the two nodes in terms of communication. Importantly, both nodes are not integrated with Backscatter modules, and they exclusively utilize AC communication.
To ensure a fair comparison, all baseline algorithms are implemented utilizing the Lyapunov optimization framework, which is designed to maintain system stability.
Figure 3 demonstrates the performance comparison of EE under different schemes, with parameters set as
,
dB, and
MHz. It can be observed that our proposed algorithm performs the best in terms of EE, followed by the UBC, then the UAC, the BC+AC ranking fourth, and the ROS method performing the worst. Compared to the other four schemes, our proposed algorithm has improved the EE by 23%, 38%, 48% and 64%, respectively. This superior performance highlights the advantage of integrating both BackCom and AC. The scheme of UBC, suffers from limited transmission capability when
is small, leading to poor performance. Although the UAC can transmit sufficient data, its high circuit power consumption results in it ranking third. Due to the poor channel conditions of remote users, the BC+AC, as well as the ROS method, also have lower energy efficiency. This indicates that even with the integration of BackCom and AC, poor channel conditions between
and HAP may still limit energy harvesting and task offloading. This further emphasizes the importance of user cooperation in enhancing the performance of remote users.
Figure 4 illustrates the impact of controlling the performance gap
on EE. The EE for the other four schemes improve with an increase in
, contrasting with the UAC and ROS scheme’s stable EE. Our proposed scheme consistently delivers the best system performance and the ROS method performing the worst. With
-17dB, the UBC scheme outperforms the UAC scheme in EE, prompting the system to favor the BackCom mode. Upon reaching a higher threshold, the UAC scheme’s EE matches that of our proposed scheme, leading to its predominance use by users. Conversely, when
-21dB, our scheme defaults to the AC mode. The BC+AC and the ROS scheme experience the least performance gains due to suboptimal channel conditions. Overall, the proposed scheme outshines others in flexibility and adaptability, adeptly tuning to varying
levels for optimal EE.
Figure 5 demonstrates the impact of network bandwidth on the performance under different schemes, with the time slot set to 2000. As shown in
Figure 5, within the bandwidth range of [1.00, 1.45] ×
Hz, the EE of all schemes increases with the expansion of bandwidth. This is attributed to the increased bandwidth allows more tasks to be transmitted to the edge server for processing using edge computing resources. Our proposed scheme consistently outperforms other schemes across the different bandwidth scenarios, especially as the bandwidth approaches 1.45MHz. At this point, the advantage in energy efficiency becomes markedly evident, substantially outperforming other baseline algorithms. This not only showcases the high adaptability of our scheme in managing bandwidth but also underscores its effectiveness in utilizing network resources in high-bandwidth scenarios, thereby maximizing energy efficiency.
In
Figure 6, we evaluate the system performance of various algorithms under different weight configurations, where the weight
varies in the range [0, 3], with
and
. As
increases, the EE of all schemes generally declines because a higher
implies that tasks offloading from
receive more resources at the expense of overall EE. Our algorithm consistently performs the best across all weight settings and the ROS algorithm exhibits the poorest performance among the evaluated approaches. Specifically, at
, our algorithm achieves an EE improvement of 23%, 38%, 46% and 55% over other algorithms. This demonstrates that our algorithm can more effectively leverage the advantages of the UC scheme, combined with BackCom and AC. Additionally,
Figure 6 indicates that excessively high weights for edge devices can rapidly degrade network performance, underscoring the importance of proper weight distribution. In
Figure 6, we observe an inversion of the purple curve, which occurs due to the system’s inclination to allocate more resources to support the task offloading of
as the weight system increases. Schemes without user cooperation, constrained by the absence of collaborative efforts and poor channel conditions, experience a sharp decline. Consequently, at a weight coefficient of 0.6, a crossover of the curves is observed, where the EE represented by the purple curve falls below that of any other scheme with user cooperation.
Figure 7 illustrates the impact of the control parameter
V on EE and the average stable queue length, with parameter settings of
dB,
MHz, a distance of 120 meters between nodes
and
, and a task arrival rate at
of
Mbps. As
V increases, EE is enhanced, and the stable queue threshold also increases. This indicates that with the increase of
V, EE becomes the primary concern, while the current queue length plays a relatively minor role in the objective function. However, when
V reaches a certain threshold, the gain in energy efficiency becomes saturated, and further increasing
V will no longer have a significant impact on system performance and queue length. This trend can be interpreted as follows: a larger
V allows the system to buffer more data, which is consistent with the previous theoretical analysis. Thus, the problem is transformed into seeking to maximize EE while to some extent disregarding the current queue length.
Figure 8 illustrates the optimal time allocation of
versus the performance gap
. Initially, when
is low, despite the lower energy consumption of the BackCom mode, achieving a larger number of computational bits at the same energy consumption is challenging. Consequently, MDs prefer the AC mode and allocate more time
for energy collection to meet data processing needs. As
increases,
gradually decreases, and when
exceeds -20 dB,
. This is because, as
increases, the BackCom mode can not only achieve more computational bits at the same energy consumption but also collect sufficient energy to support the circuit consumption under the AC mode, significantly improving energy efficiency. Therefore, as
increases, users are more inclined to choose the BackCom mode, using its reflection capability to perform task transmission and energy collection to meet the circuit consumption [
45]. This scheme ensures that under different
conditions, the system can operate at the highest efficiency, optimizing energy usage.
Figure 9 illustrates that the EE under varying distances between
and
, with
. The distance between the remote MD and the helper device ranges from 80 to 180 meters. It is observed that EE decreases as the distance increases. This is because an increase in distance leads to a reduction in channel gain, necessitating more time and higher power to transmit data to maintain a shorter data queue, thereby increasing energy consumption. This indicates that in practical deployment, the distance between edge node devices and helper devices should be kept within a reasonable range to avoid a rapid decline in network performance. This assessment not only quantifies the specific impact of distance on energy efficiency but also emphasizes the importance of considering the distance factor in the design of efficient networks.
In
Figure 10, we evaluate the EE and the average task queue length as the task arrival rate at
varies. The distance between
and
is set to 120 meters, and control parameter
V is set to 40. As the computation task data arrival rate at
increases, the EE decreases. The rise in data rate causes the local data queue to expand, necessitating a higher data transmission rate from the system. This not only entails processing a larger volume of data but also results in greater energy expenditure to maintain shorter data queues, thereby increasing overall energy consumption. Despite this trade-off in energy efficiency, our algorithm optimizes energy usage, ensuring that the system maintains efficient data processing and rapid response capabilities even as data rates increase.