Preprint
Article

This version is not peer-reviewed.

User-Cooperative Dynamic Resource Allocation for Backscatter-Aided Wireless-Powered MEC Network

A peer-reviewed article of this preprint also exists.

Submitted:

15 August 2024

Posted:

21 August 2024

Read the latest preprint version here

Abstract
Backscatter communication, which transmits information by passively reflecting radio 1 frequency (RF) signals, has become a focal point of interest due to its potential to significantly enhance 2 the energy efficiency of Wireless Power Mobile Edge Computing (WPMEC) networks and extend the 3 operational lifespan of terminal devices. However, there is little research on the integration of user 4 cooperation in WPMEC scenarios within volatile network environments. In this paper, we propose 5 a dynamic task offloading scheme for a Backscatter-assisted WPMEC system, which involves two 6 mobile devices (MDs) and a Hybrid Access Point (HAP) with user cooperation. We formulate the 7 energy efficiency (EE) maximization problem as a stochastic programming problem, considering 8 the randomness of task arrivals and time-varying wireless channels. By leveraging Dinkelbach’s 9 method and stochastic network optimization technique, we transform the problem into a series of 10 deterministic sub-problems for each time slot, and convert the non-convex sub-problem into convex 11 ones. We propose a low-complex EE maximization algorithm to solve the convex problems efficiently. 12 Extensive simulations are conducted to validate the performance of our algorithm under various 13 system parameter settings. Experiment results demonstrate that our algorithm not only outperforms 14 the benchmark algorithms by approximately 23%, but also stabilize all queues within the MEC 15 system.
Keywords: 
;  ;  ;  ;  

1. Introduction

In recent years, the development of the Internet of Things (IoT), complemented by advancements in communication technologies, has enabled a vast array of mobile devices (MDs)—including cameras, sensors, and wearable devices—to collect and exchange data [1,2]. This proliferation has given rise to numerous application scenarios that leverage wireless devices, such as autonomous driving, face recognition, Virtual Reality (VR), and e-health [3,4]. However, these applications often demand substantial computational power and low latency, presenting challenges for MDs, which typically have limited computing capabilities and finite battery life. Mobile Edge Computing (MEC) has risen as a promising solution to these challenges. By offloading computationally intensive tasks to MEC servers, MDs with constrained resources can markedly enhance their computational capacity and reduce latency.
However, wireless devices often have limited battery capacity, which cannot sustain prolonged operation. Consequently, the frequent replacement of MDs’ batteries presents a significant challenge. Wireless Power Transfer (WPT) has emerged as an effective solution to this challenge [5,6]. WPT utilizes a Hybrid Access Point (HAP) to broadcast Radio Frequency (RF) energy that can be harvested by wireless devices. By integrating Energy Harvesting (EH) technology, these devices can convert the captured RF signals into usable electrical energy, thereby recharging their batteries [7]. This harvested energy enables the devices to perform computational tasks either locally or by offloading them to MEC servers. The integration of wireless power and edge computing technologies in Wireless Powered Mobile Edge Computing (WPMEC) networks significantly extends the battery lifespan of wireless devices and markedly enhances their computational capabilities.
In addition to battery limitations, the double-near-far effect can significantly impact network performance, with devices far from the HAP experiencing poor channel conditions [8]. To counteract this issue, a User Cooperation (UC) mechanism has been implemented. In this mechanism, devices that are in close proximity to the HAP act as relays, forwarding signals for those located at a greater distance. This strategy not only mitigates the inefficiency of remote nodes offloading tasks directly to the Access Point (AP) but also optimizes the utilization of idle computational resources within the network, thereby enhancing the overall computational efficiency of the system. For example, the work in [9] demonstrates how user cooperation can boost the computational efficiency of a WPMEC system under dynamic channel conditions and varying task arrivals. Additionally, other studies, such as [10,11], have shown that user cooperation can effectively reduce the impact of the double-near-far effect. However, the aforementioned studies has not yet explored the potential of Backscatter technology to improve energy efficiency further.
Backscatter Communication (BackCom) has garnered significant attention in recent years due to its innovative approach to wireless communication [12]. In BackCom systems, the transmitter operates in full-duplex mode, functioning in a passive mode. It modulates and reflects the incident signal to the receiver, eliminating the need to generate a carrier frequency, while simultaneously harvesting energy to support its circuitry consumption [13,14]. This approach contrasts with traditional Active Communication (AC), where the transmitter first harvests energy and then uses this harvested energy to transmit data, adhering to the harvest-then-transmit (HTT) protocol. Generally, AC consumes more energy than BackCom, although it can achieve a higher data transfer rate. The trade-offs between EH and data transfer are inherent in both BackCom and AC.
A number of researchers have advanced the concept of integrating BackCom and AC paradigms to improve the energy efficiency(EE) of WPMEC systems [10,15,16]. However, existing studies frequently focus on scenarios limited to a single time slot, positing that channel conditions and user data arrivals remain constant. In real MEC networks, the environment is often volatile, with user data arrivals and wireless channel states exhibiting inherent stochastic and fluctuating characteristics, which precludes the precise acquisition of prior knowledge about these dynamic elements. Therefore, the development of efficient algorithms aimed at optimizing long-term energy utilization efficiency and ensuring system queue stability introduces a more substantial challenge and holds greater practical significance.
In this paper, we tackle the long-term EE maximization for a Backscatter assisted WPMEC network with user cooperation by jointly optimizing the wireless powered time fraction, Backcom offloading time fraction, AC offloading time fraction, offloading data size and transfer power of MDs. The problem presents significant challenges in two aspects: (1) The randomness of task arrivals and fluctuating wireless channel states impose challenges to achieving optimal EE while ensuring the stability of queue system; (2) The integration of BackCom and AC brings a strong coupling of energy harvest time and task offloading. To address these challenges, we formulate a stochastic optimization problem and propose an efficient, low-complexity algorithm by leveraging techniques such as the Dinkelbach method and the Lyapunov optimization framework. We first transform the sequential decision problem into a deterministic problem for each time slot by leveraging the drift-plus-penalty technique, obtaining a non-convex optimization sub-problem. Then, we convert the non-convex problem into a convex optimization problem by using variable substitution, which allow for efficient solution. We propose a low-complexity dynamic EE maximization algorithm that operates online without requiring prior system information.
The primary contributions of this paper are listed as follows:
  • We introduce a dynamic task offloading model to optimize EE for a WPMEC network with integration of BackCom and AC communication under user cooperation, taking into account the randomness of task arrival and time-varying wireless channels. Our model can balance the trade-off between energy efficiency and system queues stability, and mitigate the impact of double-near-far effect. Additionally, we have investigated using variable data weighting to motivate proximal users to relay data for distant users.
  • We propose an online control algorithm to maximize the EE metric of WPMEC network by determining the time fraction allocation, data offloading, and transmission power at each time slot. To address the coupling of user cooperation and control decision over time period, we leverage Dinkelbach’s method and the Lyapunov optimization theory to decouple the original problem into a deterministic sub-problem, then convert the sub-problem into a convex one for efficient solution.
  • We conduct extensive simulation experiments to verify the effectiveness and practicality of our algorithm, accessing the impact of the control parameter V, bandwidth, gap, task arrival rate. The results demonstrate that our proposed algorithm outperforms the benchmark algorithms by about 23%, and achieve a theoretical energy efficiency-stability trade-off of [ O ( 1 / V ) , O ( V ) ] .
The remainder of this paper is organized as follows: Section II presents the details of model of Backscatter-assisted WPMEC system. In Section III, we formulate a stochastic programming optimization problem aimed at maximizing energy efficiency. Section IV details the application of the Dinkelbach’s method and Lyapunov optimization techniques to simplify the problem, including the algorithm design and theoretical performance analysis. Section V presents an extensive simulation-based evaluation of the proposed algorithm’s performance. Finally, we conclude the paper and suggest directions for future research in Section VI.

1.1. Related Work

The combination of WPT with MEC networks, as an efficient solution for wireless devices to augment their energy and computational capacities, has been extensively studied by recent researches [17]. Ernest and Madhukumar [18] proposed an energy efficiency maximization algorithm based on multi-agent deep reinforcement learning to achieve maximum energy efficiency in MEC-supported vehicular networks, with jointly considering transmission and computation latencies outperforming existing strategies. Zhang et al. [19] proposed an algorithm optimizing charging time and data offloading rates for WPT-MEC IoT sensor networks to improve computational rates in scenarios. Li et al. [7] studied the system latency minimization problem for an Intelligent Reflecting Surfaces(IRS)-assisted multi-ID MEC system, and presented a hybrid multiple access scheme and optimization framework combined with FDMA and NOMA technologies. Additionally, in [20], the authors introduce a deep reinforcement learning-based approach for WPT-aided mobile edge computing to dynamically adapt to real-time changes, make swift decisions, and optimize both task offloading and energy resource allocation. Our previous research [21] introduced an online control algorithm for dynamic task offloading in WPMEC networks under dynamic network conditions, designed to maximize long-term system energy efficiency. However, the aforementioned work did not take into account the use of Backscatter technique to further enhance the energy utilization efficiency of wireless power transfer.
To mitigate the double-near-far effect and fully utilize available resources, many researchers employ user cooperation mechanism [11,21,22,23]. For instance, He et al. [23] presented a user cooperation scheme, aiming to maximize the network’s total throughput by jointly optimizing the local computing frequency, transmit power, task distribution, and time allocation. Similarly, Wang et al. [22] introduced a user cooperation mechanism for a NOMA assisted WPT-MEC network, designed to minimize overall system energy consumption. They employed the Lagrangian method to transform the inherently non-convex optimization challenge into a tractable convex problem. Sun et al. [24] proposed an iterative optimization algorithm for minimizing end-to-end latency in an MEC network supporting IoT applications, by jointly optimizing user association and resource allocation in a three-phase operation protocol. Su et al. [11] explored optimizing the energy beamforming and resource allocation to enhance computation efficiency for WPMEC system with the integration of user cooperation and non-orthogonal multiple access(NOMA), taking into account non-linear energy harvesting model. He et al. [25] proposed a dynamic control algorithm based on Lyapunov optimization framework to maximize the computation rate for a sustainable WPMEC under power constraints.
In recent years, BackCom has become an effective method for enhancing network energy efficiency and is widely used by researchers [10,16,26]. Ye et al. [26] introduced a bisection-based iterative algorithm for minimizing data offloading and computing delays in a WPMEC network with hybrid BackCom and active transmissions (AT) for IoT networks. Shi et al. [27] proposed a scheme for maximizing the weighted sum of computation bits in a Backscatter-assisted WPMEC network, considering a practical non-linear EH model with hybrid HTT and Backscatter communications. Lyu et al. [28] considered user-cooperation schemes for two types of wireless devices which can support different communication modes, i.e., Backscatter and HTT, enhancing overall communication and energy efficiency through joint optimization of time scheduling, power allocation, and energy beamforming. Wu and He [16] proposed an efficient iterative algorithm for EE maximization in a multi-access WPMEC system with the help of a relay. However, none of these works did not considered the user cooperation and the volatile MEC network environment.
Different from the above research, this paper addresses the problem of hybrid communication modes (e.g., BackCom and AC) and user cooperation for EE maximization in a volatile WPMEC network, without any information about the future. Compared to [10], our approach considers both nodes capable of processing their own incoming data simultaneously, and we introduce weighted incentives to motivate proximal nodes to assist distant nodes in offloading computational tasks. We account for dynamic factors, including random task arrivals and fluctuating wireless channel states, the prior knowledge of which is difficult to pinpoint accurately, making task offloading and resource allocation significantly challenging. Moreover, the coupling of dynamic battery levels and wireless charging time further complicates the problem.

2. System Model

We consider a typical WPTMEC network comprises two MDs and a HAP, depicted as Figure 1. The HAP directly connect with an MEC server to provide computation task offloading services and equipped with an RF energy transmitter to support wireless powered to MDs. MD 1 , one of the MDs, is located at a significant distance from the HAP. In contrast, MD 2 , the second MD, is in a more advantageous position due to its proximity to the HAP and acts as an intermediary. Both MD 1 and MD 2 are equipped with both a BackCom circuit and an AC circuit, enabling them to select between backscatter and active communication modes.
The WPMEC system employs a Time Division Duplex (TDD) method to alternate between communication and energy harvesting phases. Time Division Multiple Access (TDMA) technique is utilized to avoid signal interference [29]. The WPMEC system operates in a discrete time-slot mode over a time horizon period, with each time slot set to last T seconds. As depicted in Figure 2, each time slot is further divided into five time fractions dedicated to energy harvesting and task processing for the different MDs. At the start of each time slot, both MDs initiate the capture of RF signals transmitted by the HAP for the purpose of energy harvesting. A partial offloading strategy is adopted, which permits the flexible offloading of a portion or the complete set of computational data to a remote device. Owing to the terrible channel conditions between the MD 1 and the HAP, along with the compounded near-far effect, directly offloading tasks to HAP is infeasible for the MD 1 , such that the MD 1 offloads tasks to the MD 2 , and the MD 2 then forwards them to HAP.
Both MDs are equipped with BackCom and AC circuits for task offloading during the communication process. At each time slot, the BackCom mode is firstly used for tasks offloading. During intervals ε 1 t and ε 2 t , computation tasks date from the remote device MD1 is offloaded to MD2, which then offloads these task data to the HAP. Similarly, during intervals ε 3 t and ε 4 t , the offloading process is carried out using the AC mode. In the final time allocation ε s t , because the computation result data is typically small, this time slot can be considered negligible and ignored.
The primary symbols and definitions used are listed in Table 1.

2.1. Energy Harvesting Model

The HAP, equipped with a reliable power source, is responsible for the transmission of RF energy to the array of MDs dispersed within its service area. In the initial phase, the HAP broadcasts RF signals to all MDs for a duration of ε 0 t . Subsequently, MD1 offloads tasks to MD2 during the time fraction ε 1 t using BackCom, while MD2 can simultaneously harvest energy. In a similar fashion, MD2 offloads tasks to the HAP during the time fraction ε 2 t using BackCom, and MD1 harvests energy. Let E 1 , wp t and E 2 , wp t denote the harvested energy of MD 1 and MD 2 in the first phase, respectively. Thus, we have [10]
E 1 , wp t = μ h 1 t P 0 ε 0 t + ε 2 t
and
E 2 , wp t = μ h 2 t P 0 ε 0 t + ε 1 t
where 0 < μ < 1 represents the energy conversion efficiency, P 0 denotes the RF transmit power of the HAP. h 1 t and h 2 t represent the channel gain from the HAP to MD 1 and MD 2 respectively, which remain constant within the same time slot and vary across different time slots.

2.2. Dynamic Queues Model

To simulate the dynamic changes in user task data arrival and node battery levels, we have introduced the dynamic queuing model. Both MD 1 and MD 2 maintain a buffer queue for caching incoming task data, which is processed on a first-in, first-out (FIFO) principle. Let Q i t , i { 1 , 2 } denotes the queue lengths of MD 1 and MD 2 at slot t, respectively, which can be observed at the beginning of the time slot. The backlog of task queue Q i t update as follows:
Q i t + 1 = max Q i t d i , loc t + d i , off t , 0 + A i t , i { 1 , 2 }
where A i t represents the raw task data arriving at the MD i data queues during time slot t. We assume that the task arrival is an arbitrary process over time, the is a upper-bound by A m a x . d i , loc t , d i , off t represent the offloading task data and the local processing data at MD i , respectively.
Similarly, we assume that MD nodes are equipped with batteries and maintain a battery energy level queue. The energy captured through wireless charging is first stored in the batteries, and the battery power is consumed for local computing and task offloading. Concurrently, the battery energy level has an upper limit B max and a lower limit B min . The B min level is essential to sustain the basic operations of the MD IoT system [29]. Therefore, the battery energy of the MD nodes dynamically changes according to the following equation
B i t + 1 = min max B i ( t ) H i , l o c t H i , o f f t , B min + E i t , B max
where H i , l o c t and H i , o f f t represent the energy consumption for local computing and task offloading, respectively. The total energy harvested by MD i at slot t is given by E i t = E i , wp t + E i , ba t , where E i , wp t is as previously defined, and E i , ba t denotes the energy harvested during BackCom data transmission.

2.3. Local Computing Model

Upon task arrival at a node, local processing is prioritized; task offloading is considered only when local processing is not feasible. Since each MD is equipped with a battery, the maximum duration for local computation is denoted by T. Let f 1 and f 2 denote the local CPU frequencies of MD 1 and MD 2 , ϕ 1 and ϕ 2 denote the CPU cycles required to process one bit of task at the MD 1 and MD 2 , respectively. Furthermore, the maximum amount of local computation data at MD i cannot exceed the current backlog of Q i ( t ) . Thus, the amount of locally computed task data can be expressed as
d i , loc t = min Q i t , f i T ϕ i , i { 1 , 2 }
and the corresponding energy consumption is [10]
H i , loc t = min κ i f i 2 Q i t ϕ i , κ i f i 3 T , i { 1 , 2 }
where κ i denotes the computing energy efficiency parameter of M D i  [10]. The expression κ i f i 2 Q i t ϕ i represents the power consumption used for processing all current task data in queue Q ( i ) .

2.4. Task Offloading Model

2.4.1. Backcom Data Transmission

During ε 1 t , MD 1 offload tasks to MD 2 by Backcom. Let 0 β 1 1 denote the reflection coefficient of MD 1 , where β 1 represents the fraction of the received signal that is utilized as a carrier for data transfer, and the remaining fraction ( 1 β 1 ) is allocated for energy harvesting [30]. Let g 12 t represent the channel gain between MD 1 and MD 2 . According to Shannon’s theorem [31], the amount of tasks offloaded from MD 1 to MD 2 satisfies [10] Preprints 115335 i002 where ζ represents the performance gap reflecting real modulation [13], σ 2 is the noise power. The corresponding energy consumption by circuit is
H 1 , ba t = P 1 ba ε 1 t
where P 1 ba is the circuit power consumption of MD 1 by BackCom, which is a constant value depending on the circuit structure. Due to utilizing BackCom technology, data transmission and energy harvesting can be performed simultaneously. Thus, the energy harvested for MD 1 during ε 1 t is as [10]
E 1 , ba t = μ h 1 t P 0 ε 1 t 1 β 1 t
After receiving the tasks offloaded from MD 1 , MD 2 will relay a portion of these tasks to the HAP as a relay. Note that MD 2 will also offload its own tasks to the HAP. At time ε 2 t , the task data transmitted by MD 2 through BackCom is constrained as follows Preprints 115335 i003 where β 2 t is the reflection coefficient of the BackCom at MD 2 . h 2 t represents the channel gain from the HAP to the MD 2 at time slot t, and g 2 a t denotes the channel gain from the MD 2 to the HAP at time slot t. The corresponding energy consumption is
H 2 , ba t = P 2 ba ε 2 t
where P 2 ba is the circuit power consumption of MD 2 by BackCom. The energy harvested by Backcom during ε 2 t is
E 2 , ba t = μ h 2 t P 0 ε 2 t 1 β 2 t

2.4.2. AC Data Transmission

After the BackCom offloading is completed, the AC mode offloading is initiated, which proceeds through an offloading process similar to that of BackCom. The AC mode can achieve a higher data transmission rate, but it is incapable of energy harvesting during the offloading phase. During the time fraction ε 3 t , the upper limit of the amount of task from MD 1 offloading to MD 2 can be expressed as Preprints 115335 i004 where 0 P 1 t P max represents the transmit power allocated to AC at MD 1  [10], σ 2 is the noise power and g 12 represents the channel gain from MD 1 and MD 2 . The corresponding AC offloading energy consumption is
H 1 , ac t = P 1 t + P 1 a c ε 3 t
where P 1 a c denotes the circuit power of MD 1 through AC which is a constant value [10].
The upper limit of the amount task offloading from MD 2 to HAP by AC during ε 4 t is given by Preprints 115335 i005 where 0 P 2 t P max denotes the transmit power of the MD 2 through AC. Let P 2 a c denotes the circuit power of MD 2 through AC. The energy consumed for task offloading AC at MD 2 in slot t is
H 2 , ac t = P 2 t + P 2 a c ε 4 t

2.5. Network Stability and Utility

For a dynamically changing WPMEC network system, maintaining system stability is crucial due to the stochastic arrival of tasks and the dynamic changes in the channel environment [32]. Therefore, we first provide the definition of system queue stability as follows.
Definition 1. 
(Queue Stability): A discrete task data queue Q i ( t ) is strong stable [32] if it satisfies
lim K 1 K t = 1 K E Q i t < , i { 1 , 2 }
The energy efficiency of WPMEC system is defined as the ratio of the sum of weighted processed tasks to the total energy consumption. Specifically, the total accomplished tasks D tot ( t ) and the total energy consumption E tot ( t ) of WPMEC network at slot t can be represented as follows, respectively
D tot ( t ) = ω 1 d 1 , loc t + d 1 , off t + ω 2 d 2 , loc t + d 2 , off t
and
H tot ( t ) = H 1 , loc t + H 1 , off t + H 2 , loc t + H 2 , off t
where ω 1 and ω 2 represents the task weights of MD 1 and MD 2 respectively. d i , off t = d i , ba t + d i , ac t represent the total offloading task data at MD i , and H i , off t = H i , ba t + H i , ac t represent the total energy consumption of offloading task at MD i .
Definition 2. 
(Utility-Energy Efficiency ): The network utility ρ E E is defined as the time-averaged amount of computation data achieved per unit of energy consumed [33]. This is expressed as the ratio of the long-term processed data to the total energy consumption, as follows:
ρ E E = lim K 1 K t = 0 K 1 E D tot ( t ) lim K 1 K t = 0 K 1 E E tot ( t ) = D ( t ) ¯ E ( t ) ¯

3. Probelm Formulation

In this paper, we aim to design a dynamic offloading algorithm to maximize the ρ E E with constraint of the system queue stability, by making decisions on time allocation ε t = ε 0 t , ε 1 t , ε 2 t , ε 3 t , ε 4 t , power allocation P t = P 1 t , P 2 t , reflection coefficients β t = β 1 t , β 2 t and the amount of offloaded tasks d t = d 1 , ba t , d 1 , ac t , d 2 , ba t , d 2 , ac t at each time slot t. Simultaneously, our algorithm should ensures the stability of the system network when faced with randomly arriving task loads and dynamically changing wireless channel conditions. By denoting ε = { ε t } t = 1 K , P = { P t } t = 1 K , β = { β t } t = 1 K , and d = { d t } t = 1 K the maximization of ρ E E for a Backscatter-assisted WPMEC with user cooperation can be formulated as the following problem (P0):
P 0 : max ε , P , β , d ρ E E
s . t . ε 0 t + ε 1 t + ε 2 t + ε 3 t + ε 4 t T
B min B i t H i , loc t H i , off t + E i t B max , i { 1 , 2 }
lim K 1 K t = 1 K E Q i t < , i { 1 , 2 }
d i , loc t + d i , off t Q i ( t ) , i { 1 , 2 }
d 1 , off t d 2 , off t
( 7 ) , ( 10 ) , ( 13 ) , ( 15 )
ε 0 t , ε 1 t , ε 2 t , ε 3 t , ε 4 t 0 ,
0 P 1 t , P 2 t P max
where constraint (20b) ensures the total time allocation does not exceed the time slot. Constraint (20c) ensures that the energy consumed is less than the energy in the battery. Constraints (20d) guarantee the stability of data queues. Constraint (20e) indicates that the amount of processed task in the current time slot must not exceed the current queue length. denotes the maximum data processing limits for the MD 1 and MD 2 within time slot t. Constraint (20f) ensures that the data offloaded from the MD 1 can be processed within the same time slot. Constraints (20g) denotes the maximum offloading data depending on the channel condition. The problem is a fractional stochastic programming issue, which presents significant challenges due to several aspects: (1) The randomness of task arrivals, the fluctuating wireless channel state, and the dynamic battery level introduce stochastic factors to the optimization challenge; (2) The temporal coupling in the time fraction and energy consumption exhibited by BackCom and AC poses a considerable challenge in determining the allocation of offloading time.
To simplify problem P0, we employ the Dinkelbach‘s method [34] to transform the original problem into a more tractable one. Let ρ E E * denote the optimal value of ρ E E , we derive the following Theorem 1.
Theorem 1. 
The optimal ρ E E * is achieved if and only if
max ε , P , β , d D ( t ) ¯ ρ E E * E ( t ) ¯ = 0
Proof. 
For brevity, here we omit the proof details. See Theorem 1 in [33].    □
Since ρ E E * is unknown during the solution process, (Section 3) is still infeasible to tackle. In accordance with the methodology employed in [33], we introduce a new parameter u ( t ) and define it as
u ( t ) = 1 K t = 1 K D tot ( t ) E tot ( t )
We set u ( 0 ) = 0 at the beginning of the problem. Replacing ρ E E ( t ) in (Section 3), the problem P1 can be transformed into
P 1 : max ε , P , β , d D ( t ) ¯ u ( t ) E ( t ) ¯
s . t . ( 20b ) ( 20i )
where u ( t ) is a given parameter that should be updated through the resolution process. It should be noted that u ( t ) obtained by (22) will get closer to ρ E E * as time goes by [33]. Therefore, this transformation is reasonable and has the same optimal solution with P0. While Problem P1 is more manageable than Problem P0, it still faces several challenges The constraints (20c) and (20d), along with the equation (4), result in an interdependence of battery levels across various time slots throughout the period, which means that the current energy consumption affects future battery levels. Moreover, The unpredictability of the stochastic task arrivals and the fluctuating channel states add another complexity to the problem. The difficulty in accurately forecasting these elements leads to an inherent temporal coupling in the decision-making process.

4. Algorithm Design

First, to decouple the battery energy level across time slots and ensure the stability of the task queue, we leverage the Lyapunov network optimization technique to eliminate the long-term average constraints.

4.1. Lyapunov Optimization Formulation

To simply the battery energy queue at MD 1 and MD 2 , we introduce two virtual queues for battery level: B ^ i ( t ) = B i ( t ) B max , i { 1 , 2 } . In order to optimize the task queue and the energy queue simultaneously, we define a combined queue vector Θ t Q i t , B ^ i ( t ) , i { 1 , 2 } . Following Lyapunov optimization framework, we obtain the quadratic Lyapunov function and the Lyapunov drift [32] as
L Θ t 1 2 i { 1 , 2 } Q i 2 t + B ^ i 2 ( t )
and
Δ Θ t E L t + 1 L t Θ t
According to the Lyapunov optimization theory [32], the one slot drift-plus-penalty expression can be expressed as
Δ V Θ t = Δ Θ t V E D tot ( t ) u ( t ) E tot t Θ t
where control parameter V is a positive value, used to balance the trade-off between network EE and network stability. Actually, V acts as a weighting factor of the cost optimality in the drift-plus-penalty expression. Increasing the value of V causes the algorithm to focus more on network EE, which may also result in a larger backlog of the task queue Q. We derive an upper bound of Δ V Θ t as Lemma 1.
Lemma 1. 
For any control strategy ε , P , β , d at each time slot t, the one slot Lyapunov drift-plus-penalty Δ V Θ t is bounded by the following inequality:
Δ V Θ t B V E D tot ( t ) u ( t ) E tot ( t ) Θ t + i { 1 , 2 } Q i ( t ) E A i ( t ) d i , loc t d i , off t Θ ( t ) + B ^ i ( t ) E E i t H i , loc t H i , off t Θ ( t )
where B is a constant value, which satisfies t :
B 1 2 i { 1 , 2 } E d i , loc t + d i , off t 2 + A i 2 ( t ) Θ ( t ) + E E i t 2 + H i , loc t + H i , off t 2 Θ ( t )
Proof. 
By using the inequality m a x [ a b , 0 ] + c 2 a 2 + b 2 + c 2 + 2 a c b , a , b , c 0 and combining the definition of task queues and battery energy queues Eq. (3)-(4), we have
Δ Q i ( t ) = 1 2 Q i ( t + 1 ) 2 Q i ( t ) 2 d i , loc t + d i , off t 2 + A i 2 ( t ) 2 + Q i t A i ( t ) d i , loc t d i , off t , i { 1 , 2 }
Δ B ^ i ( t ) = 1 2 B ^ i ( t + 1 ) 2 B ^ i ( t ) 2 1 2 E i t 2 + H i , loc t + H i , off t 2 + B ^ i ( t ) E i t H i , loc t H i , off t , i { 1 , 2 }
Combining the above inequalities (29) and (30), we obtain the upper bound of the Lyapunov drift-plus-penalty.    □
According to the drift-plus-penalty technique in Lyapunov optimization theory, we seek to greedily minimize the upper bound of Δ V Θ t at each time slot t. Notably the value of A ( t ) , Q i ( t ) and B ^ i ( t ) can be observed at the beginning of each time slot t. By eliminating the constant term in the RHS (right hand side) of (27), the problem P1 can be transformed into a one time slot problem as
P 2.1 : min ε , P , β , d Q 1 ( t ) + ω 1 V d 1 , loc t + d 1 , off t Q 2 ( t ) + ω 2 V d 2 , loc t + d 2 , off t + V u ( t ) B ^ 1 ( t ) H 1 , loc t + H 1 , off t + V u ( t ) B ^ 2 ( t ) H 2 , loc t + H 2 , off t
+ B ^ 1 ( t ) E 1 t + B ^ 2 ( t ) E 2 t
s . t . ( 20b ) ( 20c ) , ( 20e ) ( 20i )
Due to the coupling between the parameters ε , P and β , the proposed problem P2.1 is still a non-convex problem. To decouple these parameters, we introduce auxiliary variables φ 1 t = P 1 t ε 3 t , φ 2 t = P 2 t ε 4 t , φ 3 t = ε 1 t β 1 , φ 4 t = ε 2 t β 2 . Then the problem P2.1 can be simplified as:
P 2.2 : min ε , φ , d i { 1 , 2 } C i d i , loc t + d i , off t + C 3 H 1 , loc t + C 3 P 1 ba + C 5 + C 6 ε 1 t + C 3 φ 1 t + C 4 H 2 , loc t + C 4 P 2 ba + C 5 + C 6 ε 2 t + C 4 φ 2 t
+ C 5 + C 6 ε 0 t C 5 φ 3 t C 6 φ 4 t + C 7 ε 3 t + C 8 ε 4 t
s . t . ε 0 t + ε 1 t + ε 2 t + ε 3 t + ε 4 t T
B min B i t H i , loc t H i , off t + E i t B max , i { 1 , 2 }
d i , loc t + d i , off t Q i ( t ) , i { 1 , 2 }
d 1 , ba t + d 1 , ac t d 2 , ba t + d 2 , ac t
d 1 , ba t ε 1 t W log 2 1 + ζ P 0 h 1 t g 12 t φ 3 t ε 1 t σ 2
d 2 , ba t ε 2 t W log 2 1 + ζ P 0 h 2 t g 2 a t φ 4 t ε 2 t σ 2
d 1 , ac t ε 3 t W log 2 1 + g 12 t φ 1 t ε 3 t σ 2
d 2 , ac t ε 4 t W log 2 1 + g 2 a t φ 2 t ε 4 t σ 2
ε 0 t , ε 1 t , ε 2 t , ε 3 t , ε 4 t 0 ,
0 φ 1 t , φ 2 t P max T
where C 1 = Q 1 ( t ) + ω 1 V , C 2 = Q 2 ( t ) + ω 2 V , C 3 = V u ( t ) B ^ 1 ( t ) , C 4 = V u ( t ) B ^ 2 ( t ) , C 5 = μ h 1 t P 0 B ^ 1 ( t ) , C 6 = μ h 2 t P 0 B ^ 2 ( t ) , C 7 = C 3 P 1 ac , C 8 = C 4 P 2 ac .
Lemma 2. 
P2.2 is a convex optimization problem, which can be efficiently solved by convex optimization tools, such as CVX [35].
Proof. 
In problem (P2.2), the objective function (32a) is linear with respect to all variables. Constraints (32b)-(32e) are all linear inequality constraints. What’s more, for constraint (32f), log 2 1 + ζ P 0 h 1 t g 12 t φ 4 t ε 1 t σ 2 is the perspective of log 2 1 + ζ P 0 h 1 t g 12 t β 1 σ 2 which is concave with respect to φ 4 . Since the perspective operation preserves convexity [36], ε 1 t log 2 1 + ζ P 0 h 1 t g 12 t φ 4 t ε 1 t σ 2 is concave with φ 4 and ε 1 t and (32f) is a convex constraint. For the same reason, (32g)-(32i) are all convex constraints. Thus P2.2 is a convex problem.    □
According to Lemma 2, at each time slot, we only need to solve a convex problem P2.2, which contains a small number of variables. By doing so, we can achieve optimal long-term average EE, even without knowledge of future system information. Our proposed algorithm, the Dynamic Offloading for Backscatter-Assisted WPMEC Algorithm (DOBAM), is summarized as Algorithm 1.
Algorithm 1: Dynamic Offloading for Backscatter-Assisted WPMEC Algorithm (DOBAM)
Preprints 115335 i001

4.2. Algorithm Performance Analysis

At each time slot, our algorithm engages in the resolution of a straightforward convex optimization problem, denoted as P2.2, encompassing a manageable set of 13 decision variables. This problem is adeptly addressed through the application of well-established optimization techniques, such as the interior point method. Characterized by a computational complexity of approximately O ( n 3.5 log ( 1 / ϵ ) ) , where n is the number of decision variables, this approach ensures efficiency even for problems of considerable size.
In the following part, we analyze that the proposed scheme can achieve the optimal long-term time-average solution. First, we give some assumptions as follows:
lim K 1 K t = 0 K 1 D tot ( t ) = d ave
lim K 1 K t = 0 K 1 E tot ( t ) = e ave
lim K u ( t ) = 1 K t = 0 K 1 D tot ( t ) 1 K t = 0 K 1 E tot ( t ) = u ave
then, we can obtain that the expectation can also converge the same solutions
lim K 1 K t = 0 K 1 E D tot ( t ) = d ave
lim K 1 K t = 0 K 1 E E tot ( t ) = e ave
lim K E u ( t ) = u ave
Lemma 3. 
Based on (36)-(38), we have
lim K 1 K t = 0 K 1 E u ( t ) E tot ( t ) = D ( t ) ¯
lim K 1 K t = 0 K 1 E u ( t ) u
To start with, we give the existence of the optimal solution based on the current queue status.
Lemma 4. 
If the problem (P1) is feasible, there exists a policy ε , P , β , d * that satisfies the following conditions t , ε > 0 :
E D tot * ( u * ε ) E E tot *
E H i , loc t * + H i , off t , * E i t , * ε , i { 1 , 2 }
E A i ( t ) d i , loc t , * d i , off t , * ε , i { 1 , 2 }
where * represents the value under optimal solution.
Proof. 
See the parts 4 and prat 5 of [32].
Theorem 2. 
The optimal long-average utility function obtained by P1 is limited by a lower bound that is independent with the time space. The following solutions can be achieved by the algorithm,
  • u ( t ) u * B / ( V e ave ) ,
  • All queues Q ( t ) , B ^ m ( t ) , B ^ h ( t ) are mean rate stable, and thus the constraints are satisfied.
Proof. 
For any ε > 0 , let us consider the policy and queue state in (41)–(43). Since the result values are independent of queue status Θ ( t ) , we have
E D t o t * Θ ( t ) ( u * ε ) E E t o t * Θ ( t )
E H i , loc t * + H i , off t , * E i t , * Θ ( t ) ε , i { 1 , 2 }
E A i ( t ) d i , loc t , * d i , off t , * Θ ( t ) ε , i { 1 , 2 }
By integrating these results to (27) and making ε 0 , we have
Δ V Θ t B V E D t o t * ( t ) u * E t o t * ( t ) + Q i ( t ) t = 1 , 2 E A i ( t ) d i , loc t , * d i , off t , * + B ^ i ( t ) t = 1 , 2 E E i t , * H i , loc t * H i , off t , * B V E D t o t * ( t ) + V u * E E t o t * ( t )
Note that u ( t ) is a constant value, which is independent of the current queue status Θ ( t ) . Utilizing the iterated expectation and obtaining the sum of the above inequality over time t { 0 , 1 , . . . , K 1 } , we have
E { L Θ ( K ) } E { L Θ ( 0 ) } V t = 0 K 1 E D t o t ( t ) + V t = 0 K 1 E u ( t ) E t o t ( t ) K B V u * E E t o t * ( t ) + V t = 0 K 1 E u * E t o t * ( t )
Dividing both sides of (44) by V K , utilizing the Jensens inequality and the fact that
1 K t = 0 K 1 E D t o t ( t ) + 1 K t = 0 K 1 E u E t o t ( t ) B V u * E E t o t * ( t ) + 1 K t = 0 K 1 E u ( t ) E t o t * ( t )
Furthermore, letting K , we have
lim K 1 K t = 0 K 1 E D t o t ( t ) + lim K 1 K t = 0 K 1 E u ( t ) E t o t ( t ) B V u * E E t o t * ( t ) + lim K 1 K t = 0 K 1 E u ( t ) E t o t * ( t )
considering lim K 1 K t = 0 K 1 E u ( t ) u , we have
0 B V u * E E t o t * ( t ) + u E E t o t * ( t ) )
Furthermore we obtain
u u * B V d a v e
Theorem 3. 
Let e 1 upper be the upper bound of e 1 ( t ) , the time-average sum rate of queue length is bounded by
Q 1 + Q 2 B ε + u e 1 u p p e r ε
Proof. 
By taking iterated expectation and using telescoping sums over t { 0 , 1 , . . . , K 1 } , we have
E L [ Θ ( K ) ] E L [ Θ ( 0 ) ] V E D t o t ( t ) u ( t ) E t o t ( t ) Θ ( t ) K B ε t = 0 K 1 E Q 1 ( t ) + Q 2 ( t ) + B ^ 1 ( t ) + B ^ 2 ( t ) + V E u ( t ) E t o t ( t )
Dividing both sides of (54) by K ε , taking K , rearranging terms yield
B ε lim K 1 K t = 0 K 1 E Q 1 ( t ) + Q 2 ( t ) + B ^ 1 ( t ) + B ^ 2 ( t ) + V u E t o t u p p e r 0
lim K 1 K t = 0 K 1 E Q 1 ( t ) + Q 2 ( t ) + B ^ 1 ( t ) + B ^ 2 ( t ) B ε + V u E t o t u p p e r ε
Theorems 2 and 3 provide a rigorous mathematical performance analysis for our proposed algorithm. They demonstrate that the time-average η E E increases at a rate of O ( 1 / V ) , while the queue length increases at a rate of O ( V ) . The WPMEC system EE η E E can be improved by adjusting the value of V. However, the time-average task queue Q wilL increase with V. Therefore, we can tune V to achieve a [ O ( 1 / V ) , O ( V ) ] trade-off between η E E network EE and task queue length. According to Little’s low [32], the latency is proportional to the time-average task queue length. This also implies that our proposed algorithm can achieve a EE-latency trade-off. This balance is critical in many real-world applications where both efficiency and response time are important.

5. Simulation Results

In this section, we evaluate the performance of our propose algorithm for a Backscatter-assisted WPMEC system with user cooperation through extensive numerical simulation. The experiments are conducted on a high-performance platform equipped with a 2.10 GHz Intel(R) Xeon(R) Silver 4116 CPU and four GeForce RTX 2080 Ti GPUs, ensuring efficient simulation execution. We employed the free-space path loss model to simulate signal propagation, where the average channel gain h ¯ is calculated by the following formula [4]:
h ¯ = A d ( 3 × 10 8 4 π f c d i ) d e ,
where A d = 3 (antenna gain), f c = 915 MHz (carrier frequency), d e = 3 (path loss exponent), and d i represents the distance between nodes (in meters). The dynamic channel gains for WPT and task offloading, following the Rayleigh fading model, are represented by the vector h t = a 1 t h 1 t , a 2 t h 2 t , a 3 t g 12 t , a 4 t g 2 a t . In the model, the channel fading factors a 1 t , a 2 t , a 3 t , a 4 t all follow an exponential distribution with an expectation of 1, simulating the natural variability of wireless channels. For the sake of model simplification, we assume that the vector of fading factors remains constant at 1.0 , 1.0 , 1.0 , 1.0 within each time slot, thereby considering the channel gain to be static during that slot.
At each time slot t, the expected arrival rates of computational tasks A i t satisfy E A i t = λ i , i { 1 , 2 } . It is assumed that A i t follow an exponential distribution, where the rate parameters λ 1 and λ 2 correspond to 1.2 and 1.5, respectively. Other simulation parameters are detailed in Table 2.
To comprehensively evaluate the performance of our algorithm, we conducted comparative simulations with three representative benchmarks as follows:
(1) UC With the AC scheme: All task offloading for MD 1 and MD 2 are restricted to the use of the AC mode exclusively [9], with the BackCom mode prohibited. This is equivalent to setting the time offsets ε 1 t and ε 2 t to zero in problem P 0 .
(2) UC with the BackCom scheme: Users opt to complete computational tasks exclusively via the BackCom mode. Specifically, the HAP continuously broadcasts RF energy to the users throughout the time slot. This scheme can be implemented in scenario P 0 by setting ϵ 3 = 0 and ϵ 4 = 0 .
(3) Without UC With Integrated BackCom and AC scheme: MD 1 and MD 2 communicate directly with the HAP, offloading tasks via integrated BackCom and AC [37]. Due to poor channel conditions between MD 1 and HAP, it may harvest less energy and offload a smaller portion of tasks. Additionally, devices can complete their computational tasks through local computation throughout the entire time slot.
To ensure fairness, it is crucial to preserve the stability of network queues in all schemes. Thus, the three baseline methods previously discussed are executed within the framework of Lyapunov optimization.
Figure 3 demonstrates the performance comparison of EE under different schemes, with parameters set as V = 40 , ζ = 16 d B , and W = 1.2 M H z . It can be observed that our proposed algorithm achieves the best performance of EE, followed by UC with the BackCom, with UC with the AC ranking third. The method without user cooperation that integrates BackCom and AC performs the worst. Compared to the other three schemes, our proposed algorithm has improved the EE by 23%, 38%, and 48%, respectively. This superior performance highlights the advantage of integrating both BackCom and AC. The scheme of UC with the BackCom, suffers from limited transmission capability when ζ is small, leading to poor performance. Although UC with AC can transmit sufficient data, its high circuit power consumption results in it ranking third. Moreover, the EE of the non-cooperative scheme with integrated BackCom and AC is also low. This indicates that even with the integration of BackCom and AC, poor channel conditions between MD 1 and HAP can still limit energy harvesting and task offloading. This further emphasizes the importance of user cooperation in enhancing the performance of remote users.
Figure 4 illustrates the impact of controlling the performance gap ζ on EE. The EE for the other three schemes improve with an increase in ζ , contrasting with the AC scheme’s stable EE. Our proposed scheme consistently delivers the best system performance. With ζ exceeds -17dB, the BackCom scheme outperforms the AC scheme in EE, prompting the system to favor the BackCom mode. Upon reaching a higher threshold, the BackCom scheme’s EE matches that of our proposed scheme, leading to its predominant use by users. Conversely, when ζ below -21dB, our scheme defaults to the AC mode. The non-cooperative integrated BackCom and AC scheme sees minimal performance gains due to suboptimal channel conditions. Overall, the proposed scheme outshines others in flexibility and adaptability, adeptly tuning to varying ζ levels for optimal EE.
In Figure 5, the maximum time slot is set to 2000, the maximum time slot is set to 2000, achieving a stable solution for the algorithm within this time period. We demonstrate that adjustments of network bandwidth significantly impact the performance of various algorithms. As shown in Figure 5, within the bandwidth range of [1.00, 1.45] × 10 6 Hz, the EE of all schemes increases with the expansion of bandwidth. This is attributed to the increased bandwidth allows more tasks to be transmitted to the edge server for processing using edge computing resources. Our proposed scheme exhibits superior performance across different bandwidth conditions, particularly as the bandwidth approaches 1.45MHz, where its energy efficiency advantage becomes especially pronounced, significantly surpassing other baseline algorithms. This not only demonstrates the high adaptability of our proposed scheme in bandwidth control but also highlights its efficient utilization of network resources in high-bandwidth environments, achieving maximum energy efficiency.
In Figure 6, we evaluated the system performance of various algorithms under different weight configurations, where the weight ω varies in the range [0, 3], with ω 1 = ω / ( ω + 1 ) and ω 2 = 1 / ( ω + 1 ) . As ω increases, the EE of all schemes generally declines. This is because an increase in weight ω implies a higher priority assigned to the tasks of MD 1 , leading the system to allocate more resources to MD 1 tasks, thereby reducing overall energy efficiency. Our algorithm demonstrates optimal EE across all weight settings. Specifically, when ω = 3 , it shows an energy efficiency improvement of 23%, 38%, and 46% compared to other algorithms, respectively. This indicates that our algorithm can more effectively leverage the advantages of the UC scheme integrated with BackCom and AC. Figure 6 also reveals that in practical applications, excessively high weights for edge node devices may rapidly degrade network performance, thereby emphasizing the importance of proper weight distribution.
Figure 7 illustrates the impact of the control parameter V on EE and the average stable queue length, with parameter settings of ζ = 18 d B , W = 1.2 M H z , a distance of 120 meters between nodes MD 1 and MD 2 , and a task arrival rate at MD 2 of λ 2 = 1.5 M b p s . As V increases, EE is enhanced, and the stable queue threshold also increases. This indicates that with the increase of V, EE becomes the primary concern, while the current queue length plays a relatively minor role in the objective function. However, when V reaches a certain threshold, the gain in energy efficiency becomes saturated, and further increasing V will no longer have a significant impact on system performance and queue length. This trend can be interpreted as follows: a larger V allows the system to buffer more data, which is consistent with the previous theoretical analysis. Thus, the problem is transformed into seeking to maximize EE while to some extent disregarding the current queue length.
Figure 8 illustrates the optimal time allocation of ε versus the performance gap ζ . Initially, when ζ is low, despite the lower energy consumption of the BackCom mode, achieving a larger number of computational bits at the same energy consumption is challenging. Consequently, MDs prefer the AC mode and allocate more time ε 0 * for energy collection to meet data processing needs. As ζ increases, ε 0 * gradually decreases, and when ζ exceeds -20 dB, ε 0 * 0 . This is because, as ζ increases, the BackCom mode can not only achieve more computational bits at the same energy consumption but also collect sufficient energy to support the circuit consumption under the AC mode, significantly improving energy efficiency. Therefore, as ζ increases, users are more inclined to choose the BackCom mode, using its reflective event signal function to perform task transmission and energy collection to meet the circuit consumption [38]. This scheme ensures that under different ζ conditions, the system can operate at the highest efficiency, optimizing energy usage.
Figure 9 illustrates that the EE under varying distances between MD 1 and MD 2 , with V = 40 . The distance between the remote MD and the helper device ranges from 80 to 180 meters. It is observed that EE decreases as the distance increases. This is because an increase in distance leads to a reduction in channel gain, necessitating more time and higher power to transmit data to maintain a shorter data queue, thereby increasing energy consumption. This indicates that in practical deployment, the distance between edge node devices and helper devices should be kept within a reasonable range to avoid a rapid decline in network performance. This assessment not only quantifies the specific impact of distance on energy efficiency but also emphasizes the importance of considering the distance factor in the design of efficient networks.
In Figure 10, we evaluate the EE and the average task queue length as the task arrival rate at MD 2 varies. The distance between MD 1 and MD 2 is set to 120 meters, and control parameter V is set to 40. As the computation task data arrival rate at MD 2 increases, the EE decreases. The rise in data rate causes the local data queue to expand, necessitating a higher data transmission rate from the system. his not only entails processing a larger volume of data but also results in greater energy expenditure to maintain shorter data queues, thereby increasing overall energy consumption. Despite this trade-off in energy efficiency, our algorithm optimizes energy usage, ensuring that the system maintains efficient data processing and rapid response capabilities even as data rates increase.

6. Conclusions

In WPMEC systems, the volatility of the network environment and the time coupling of battery charging and discharging present significant challenges for the joint optimization of computation offloading and resource allocation. The integration of BackCom and AC is crucial for enhancing the performance of WPMEC systems across various user scenarios, especially with user collaboration, which can significantly improve the performance of edge node devices with poor channel conditions. In this study, we design a user collaboration-assisted WPMEC system that integrates BackCom and AC, aiming to maximize EE by jointly optimizing the backscatter reflection coefficient, AC transmission power, time, and task allocation under dynamic loads and time-varying channel conditions. To address the complexity brought about by the coupling of multiple parameters, we propose a dynamic control algorithm, which employs variable substitution, fractional programming, Lyapunov optimization, and convex optimization techniques to simplify the problem into a convex optimization form and decouple the sequential decision-making problem into independent optimization tasks for each time slot. Simulation results show that our scheme effectively enhances EE while ensuring system stability, surpassing existing technologies. Furthermore, future research directions include exploring scenarios with multiple edge nodes and a few helper nodes, which is expected to further improve the system’s practicality and adaptability in real-world applications.

Author Contributions

Methodology, H.H.; Validation, Z.C. and Y.Y.; Formal analysis, H.H.; Investigation, H.H. and H.F.; Resources, H.H.; Data curation, H.H. and S.H.; Writing—original draft, H.H. and H.F.; Writing—review and editing, H.H.; Supervision, L.S. and S.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Science and Technology Planning Project of Guangdong Province, China (No.2021A0101180005), Science and Technology Development Fund of Macao (FDCT), Macao, China (#0033/2023/RIA1).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Acknowledgments

We thank all of the reviewers for their valuable comments.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Zhao, R.; Zhu, F.; Tang, M.; He, L. Profit maximization in cache-aided intelligent computing networks. Physical Communication 2023, 58, 102065. [Google Scholar] [CrossRef]
  2. He, H.; Shen, H.; Hao, Q.; Tian, H. Online delay-guaranteed workload scheduling to minimize power cost in cloud data centers using renewable energy. Journal of Parallel and Distributed Computing 2022, 159, 51–64. [Google Scholar] [CrossRef]
  3. Wang, X.; Li, J.; Ning, Z.; Song, Q.; Guo, L.; Guo, S.; Obaidat, M.S. Wireless powered mobile edge computing networks: A survey. ACM Computing Surveys 2023, 55, 1–37. [Google Scholar] [CrossRef]
  4. Wu, T.; He, H.; Shen, H.; Tian, H. Energy-Efficiency Maximization for Relay-Aided Wireless-Powered Mobile Edge Computing. IEEE Internet of Things Journal 2024, 11, 18534–18548. [Google Scholar] [CrossRef]
  5. Wang, F.; Xu, J.; Cui, S. Optimal energy allocation and task offloading policy for wireless powered mobile edge computing systems. IEEE Transactions on Wireless Communications 2020, 19, 2443–2459. [Google Scholar] [CrossRef]
  6. Wei, Z.; Zhang, B.; Lin, S.; Wang, C. A self-oscillation WPT system with high misalignment tolerance. IEEE Transactions on Power Electronics 2023. [Google Scholar] [CrossRef]
  7. Li, G.; Zeng, M.; Mishra, D.; Hao, L.; Ma, Z.; Dobre, O.A. Latency minimization for IRS-aided NOMA MEC systems with WPT-enabled IoT devices. IEEE Internet of Things Journal 2023, 10, 12156–12168. [Google Scholar] [CrossRef]
  8. Ju, H.; Zhang, R. Throughput maximization in wireless powered communication networks. IEEE Transactions on Wireless Communications 2013, 13, 418–428. [Google Scholar] [CrossRef]
  9. He, H.; Zhou, C.; Huang, F.; Shen, H.; Li, S. Energy-Efficient Task Offloading in Wireless-Powered MEC: A Dynamic and Cooperative Approach 2024.
  10. He, Y.; Wu, X.; He, Z.; Guizani, M. Energy efficiency maximization of backscatter-assisted wireless-powered MEC with user cooperation. IEEE Transactions on Mobile Computing 2023, 23, 1878–1887. [Google Scholar] [CrossRef]
  11. Su, B.; Ni, Q.; Yu, W.; Pervaiz, H. Optimizing computation efficiency for NOMA-assisted mobile edge computing with user cooperation. IEEE Transactions on Green Communications and Networking 2021, 5, 858–867. [Google Scholar] [CrossRef]
  12. Hoang, D.T.; Niyato, D.; Wang, P.; Kim, D.I.; Han, Z. Ambient backscatter: A new approach to improve network performance for RF-powered cognitive radio networks. IEEE Transactions on Communications 2017, 65, 3659–3674. [Google Scholar] [CrossRef]
  13. Shi, L.; Ye, Y.; Chu, X.; Sun, S.; Lu, G. Energy-efficient resource allocation for backscatter-assisted wireless powered MEC. IEEE Transactions on Vehicular Technology 2023, 72, 9591–9596. [Google Scholar] [CrossRef]
  14. Ye, Y.; Shi, L.; Chu, X.; Lu, G. Throughput fairness guarantee in wireless powered backscatter communications with HTT. IEEE Wireless Communications Letters 2020, 10, 449–453. [Google Scholar] [CrossRef]
  15. Ye, Y.; Shi, L.; Chu, X.; Hu, R.Q.; Lu, G. Resource allocation in backscatter-assisted wireless powered MEC networks with limited MEC computation capacity. IEEE Transactions on Wireless Communications 2022, 21, 10678–10694. [Google Scholar] [CrossRef]
  16. Wu, T.; He, H. An Efficient Energy Efficiency Maximization Algorithm for Backscatter-Assisted WP-MEC Network with Relay. In Proceedings of the Proceedings of the 2024 16th International Conference on Machine Learning and Computing, New York, NY, USA, 2024; ICMLC ’24, p. 720–727. [CrossRef]
  17. Ji, L.; Guo, S. Energy-efficient cooperative resource allocation in wireless powered mobile edge computing. IEEE Internet of Things Journal 2018, 6, 4744–4754. [Google Scholar] [CrossRef]
  18. Ernest, T.Z.H.; Madhukumar, A. Computation offloading in MEC-enabled IoV networks: Average energy efficiency analysis and learning-based maximization. IEEE Transactions on Mobile Computing 2023. [Google Scholar] [CrossRef]
  19. Zhang, S.; Bao, S.; Chi, K.; Yu, K.; Mumtaz, S. DRL-based computation rate maximization for wireless powered multi-AP edge computing. IEEE Transactions on Communications 2023. [Google Scholar] [CrossRef]
  20. Wu, X.; Yan, X.; Yuan, S.; Li, C. Deep Reinforcement Learning-Based Adaptive Offloading Algorithm for Wireless Power Transfer-Aided Mobile Edge Computing. In Proceedings of the 2024 IEEE Wireless Communications and Networking Conference (WCNC). IEEE, 2024, pp. 1–6.
  21. He, H.; Zhou, C.; Huang, F.; Shen, H.; Li, S. Energy-Efficient Task Offloading in Wireless-Powered MEC: A Dynamic and Cooperative Approach. Mathematics 2024, 12. [Google Scholar] [CrossRef]
  22. Wang, R.; Chen, J.; He, B.; Lv, L.; Zhou, Y.; Yang, L. Energy consumption minimization for wireless powered NOMA-MEC with user cooperation. In Proceedings of the 2021 13th International Conference on Wireless Communications and Signal Processing (WCSP). IEEE, 2021, pp. 1–5.
  23. He, B.; Bi, S.; Xing, H.; Lin, X. Collaborative computation offloading in wireless powered mobile-edge computing systems. In Proceedings of the 2019 IEEE Globecom Workshops (GC Wkshps). IEEE, 2019, pp. 1–7.
  24. Sun, Y.; Xu, J.; Cui, S. User association and resource allocation for MEC-enabled IoT networks. IEEE Transactions on Wireless Communications 2022, 21, 8051–8062. [Google Scholar] [CrossRef]
  25. He, H.; Huang, F.; Zhou, C.; Shen, H.; Yang, Y. Maximizing Computation Rate for Sustainable Wireless-Powered MEC Network: An Efficient Dynamic Task Offloading Algorithm with User Assistance. Mathematics 2024, 12. [Google Scholar] [CrossRef]
  26. Ye, Y.; Shi, L.; Chu, X.; Li, D.; Lu, G. Delay minimization in wireless powered mobile edge computing with hybrid BackCom and AT. IEEE Wireless Communications Letters 2021, 10, 1532–1536. [Google Scholar] [CrossRef]
  27. Shi, L.; Ye, Y.; Chu, X.; Lu, G. Computation bits maximization in a backscatter assisted wirelessly powered MEC network. IEEE Communications Letters 2020, 25, 528–532. [Google Scholar] [CrossRef]
  28. Lyu, B.; Hoang, D.T.; Yang, Z. User cooperation in wireless-powered backscatter communication networks. IEEE Wireless Communications Letters 2019, 8, 632–635. [Google Scholar] [CrossRef]
  29. Bi, S.; Zhang, Y.J. Computation rate maximization for wireless powered mobile-edge computing with binary computation offloading. IEEE Transactions on Wireless Communications 2018, 17, 4177–4190. [Google Scholar] [CrossRef]
  30. Xu, Y.; Gu, B.; Hu, R.Q.; Li, D.; Zhang, H. Joint computation offloading and radio resource allocation in MEC-based wireless-powered backscatter communication networks. IEEE Transactions on Vehicular Technology 2021, 70, 6200–6205. [Google Scholar] [CrossRef]
  31. Verdu, S. Fifty years of Shannon theory. IEEE Transactions on information theory 1998, 44, 2057–2078. [Google Scholar] [CrossRef]
  32. Neely, M. Stochastic network optimization with application to communication and queueing systems; Springer Nature, 2022.
  33. Sun, M.; Xu, X.; Huang, Y.; Wu, Q.; Tao, X.; Zhang, P. Resource management for computation offloading in D2D-aided wireless powered mobile-edge computing networks. IEEE Internet of Things Journal 2020, 8, 8005–8020. [Google Scholar] [CrossRef]
  34. Dinkelbach, W. On nonlinear fractional programming. Management science 1967, 13, 492–498. [Google Scholar] [CrossRef]
  35. Diamond, S.; Boyd, S. CVXPY: A Python-embedded modeling language for convex optimization. Journal of Machine Learning Research 2016, 17, 1–5. [Google Scholar]
  36. Zawawi, Z.B.; Huang, Y.; Clerckx, B. Multiuser wirelessly powered backscatter communications: Nonlinearity, waveform design, and SINR-energy tradeoff. IEEE Transactions on Wireless Communications 2018, 18, 241–253. [Google Scholar] [CrossRef]
  37. Shi, L.; Ye, Y.; Chu, X.; Lu, G. Computation Bits Maximization in a Backscatter Assisted Wirelessly Powered MEC Network. IEEE Communications Letters 2021, 25, 528–532. [Google Scholar] [CrossRef]
  38. Xu, Y.; Gui, G. Optimal Resource Allocation for Wireless Powered Multi-Carrier Backscatter Communication Networks. IEEE Wireless Communications Letters 2020, 9, 1191–1195. [Google Scholar] [CrossRef]
Figure 1. System model of WPMEC network with user-assisted
Figure 1. System model of WPMEC network with user-assisted
Preprints 115335 g001
Figure 2. Time division structure
Figure 2. Time division structure
Preprints 115335 g002
Figure 3. Energy efficiency EE in different schemes over time slots.
Figure 3. Energy efficiency EE in different schemes over time slots.
Preprints 115335 g003
Figure 4. Energy efficiency EE in different schemes versus the performance gap.
Figure 4. Energy efficiency EE in different schemes versus the performance gap.
Preprints 115335 g004
Figure 5. Energy efficiency EE in different schemes versus bandwidth W.
Figure 5. Energy efficiency EE in different schemes versus bandwidth W.
Preprints 115335 g005
Figure 6. Energy efficiency EE in different schemes versus wight ω .
Figure 6. Energy efficiency EE in different schemes versus wight ω .
Preprints 115335 g006
Figure 7. Convergence performance of energy efficiency EE versus parameter V .
Figure 7. Convergence performance of energy efficiency EE versus parameter V .
Preprints 115335 g007
Figure 8. The optimal time allocation versus the performance gap.
Figure 8. The optimal time allocation versus the performance gap.
Preprints 115335 g008
Figure 9. Energy efficiency EE versus different distances between MD 1 and MD 2 .
Figure 9. Energy efficiency EE versus different distances between MD 1 and MD 2 .
Preprints 115335 g009
Figure 10. Energy efficiency versus different task arrival rate MD 2
Figure 10. Energy efficiency versus different task arrival rate MD 2
Preprints 115335 g010
Table 1. KEY NOTATIONS AND DEFINITIONS.
Table 1. KEY NOTATIONS AND DEFINITIONS.
Notation Definition
T The time block
ε 0 t The time for WPT at slot t
ε 1 t , ε 2 t The time for offloading by Backcom of MD 1 and MD 2 at slot t
ε 3 t , ε 4 t The time for offloading by AC of MD 1 and ND 2 at slot t
E i t The energy harvested by MD i at slot t
h i t The WPT channel gain between MD i and HAP at slot t
g 12 t , g 2 a t The offloading channel gain between MD 1 and MD 2 , MD 2 and HAP at slot t
P 0 t , P 1 t , P 2 t The transmit power by AC at HAP, MD 1 , MD 2 at slot t
P i ba , P i ac The circuit power by Backcom and AC at MD i
d i , loc t The amount of tasks processed locally at MD i at slot t
d i , ba t The amount of tasks offloaded by Backcom at MD i at slot t
d i , ac t The amount of tasks offloaded by AC at MD i at slot t
H i loc , t The energy consumed by processing tasks locally at MD i at slot t
H i , ba t The energy consumed by offloading tasks by Backcom at MD i at slot t
H i , ac t The energy consumed by processing tasks at helper at slot t
E i , wp t The energy harvested by WPT at MD i at slot t
E i , ba t The energy harvested by Bakcom at MD i at slot t
d i , off t The amount of tasks of MD i processed by offloading at slot t
f i The local CPU frequency at MD i
ϕ i The CPU cycles required to compute one bit task at MD i
β i t The reflection coefficient of MD i at slot t
μ The energy conversion efficiency
κ The computing energy efficiency
W The channel bandwidth
σ 2 The additive white Gaussian noise
Table 2. SIMULATION PARAMETERS.
Table 2. SIMULATION PARAMETERS.
Symbol Value
Time slot length 1 s
Maximum battery capacity B max 50 J
Minimum battery capacity B min 0 J
Transmit power of the AP P 0 5 W
Noise power σ 2 10 3 W
The circuit consumption of the Backcom p 1 b a and p 2 b a 0.1 W
The circuit consumption of the AC p 1 a c and p 2 a c 0.7 W
CPU frequency of M D 1   f 1 500 MHz
CPU cycles to compute 1 bit task of M D 1   ϕ 1 490 cycles/bit
CPU frequency of M D 2   f 2 480 MHz
CPU cycles to compute 1 bit task of M D 2   ϕ 2 470 cycles/bit
Equal computing efficiency parameter of M D 1   κ 1 10 8
Equal computing efficiency parameter of M D 2   κ 2 10 8
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated