In this section, we cover the research contribution in the area of resource allocation for IoV, UAV as well as UAV based IoV networks using AI.
4.1. AI for Resource Allocation in IoV Network
Different objectives have been considered by the authors for AI based resource allocation ranging from load balancing, better QoS or QoE to energy and latency minimization. In this section we focus on reviewing the contributions and research progress in AI based resource allocation for IoV networks.
In [
52], the authors combine the network function virtualization (NFV) and QoS guaranteed network slicing (NS) with RL in beyond 5G (B5G) architecture. The slice acceptance advantage of 5% is achieved as compared to the best existing work. One impediment of the research is that the authors assumed the RRB resource allocation in the RAN over direct data rate in RAN. Secondly, there is room for improving the slice allocation capability of the proposed algorithm.
In [
53], the authors devised a method for multiple V2V links to reuse the V2I spectrum by using the multi-agent RL. In the training stage proposed model is centralized and in implementation stage it is decentralized. The MARL and SARL algorithms are used for the comparison purpose. The proposed model considerably improves the overall system level performance.
In [
54], the authors proposed an AI based task offloading and resource management model. There are two offloading layers proposed. First layer selects between cloud computing (CC) and MEC server using random forest (RF) for task offloading and computing platform is selected in second layer by using DDPG algorithm. The proposed model is compared with KNN, MLP and SVM. T-Drive trajectory dataset, from Microsoft website, is used for the model training and testing. This dataset includes the GPS trajectories of 10,357 taxis during one week in Beijing. The RF model achieves the highest classification accuracy score of 99.83% for task offloading decision as compared to the KNN, MLP and SVM achieved 98%, 94.81% and 90.94%, respectively. Moreover, it is reported that the DDPG significantly minimized the latency cost by 85%.
In [
55], the authors proposed the algorithm to build an energy efficient communication system using capabilities of AI and intelligent reflecting surface (IRS). The IRS reflecting signal propagation is used to produce efficient beam-forming for directional transmission in vehicles. Then, DRL model is used for network resource control and allocation. An energy efficiency-maximizing model is formulated and joint optimization of the settings of all vehicles is done for effective and latency-efficient communication. The proposed model is compared with model without IRS and random resource allocation model and the proposed model outperforms both the models.
In [
56], a model named ARTNet is proposed to make an AI enabled V2X framework for dividing resources effectively and improvised communication. To achieve their target the authors implemented the proposed algorithm in the software defined vehicular based FC (SDV-F) architecture. In ARTNet, the controllers are responsible to achieve optimal resource utilization at the fog layer, and maximum reduction in the average delay of time critical IoV applications. The proposed model is compared with NRPO and AEC in terms of energy consumption, average latency and energy shortfall.
In [
57], an integrated fuzzy based approach is taken to coordinate and manage the abundant resources available in VANETs. In this approach, the system decides the resources that vehicles should use when set under different circumstances including the condition of the network created among vehicles, which is represented by the QoS in the network, its time duration, its size, and the available resource, together with the application requirements. The proposed method achieves higher QoS.
In [
58], to tackle the problem of scarcity of computational resources, a selection criteria is proposed to select volunteers’ vehicles capable of executing the computationally intensive task. For the volunteer vehicle identification, the authors used various machine learning based regression techniques including LR, SVR, KNN, DT, RF, GB, XGBoosting, AdaBoost, and ridge regression. For the training and testing of the models vehicular onboard unit computing capability dataset is collected. It contains three different datasets. All three datasets have seven features but different number of samples or sizes. The dataset1 has 17800 samples, dataset2 is made up of 4600 samples and finally dataset3 has only 250 samples. Among all the models implemented, SVR and ridge regression have lowest error value and maximum accuracy.
In [
59], the authors predicted leading vehicle trajectory with the speed and acceleration information of the vehicle at front using the proposed method based on joint time-series modeling approach. Moreover, before making any prediction the gaussian mixture model (GMM) is applied on the leading vehicle to predict the driving styles. After style prediction, a LSTM and RNN based proposed joint time series modeling (JTSM) method is implemented to predict the front vehicle trajectories. The Next Generation Simulation (NGSIM) dataset on the US101, and I-80 is used to train and test the proposed model. The proposed model is compared with constant kalman filter (CKF), LSTM, multiple LSTM (MLLSTM). The proposed model shows significant improvement in terms of RMSE.
In [
60], authors predict vehicles’ trajectory by using LSTM algorithm. Then, the predicted value is provided to the QL algorithm to figure out the optimal resource allocation policy for the nodes. The real world vehicle trajectory data is used in this research provided by Didi Chuxing, a ride sharing company. The dataset contains 600,000 traffic trajectory samples. The ultimate goal is to enhance the QoS for non-safety related services in MEC-based vehicular networks and the proposed model outperforms the other models.
In [
61], the authors implemented RNN to predict resource availability pattern based on the position of vehicle and RSU at the time of prediction. The RSU burden is shared by the parked and slow vehicles. The performance of the proposed method is better than the heuristic and other ML based resource allocation algorithms.
In [
62], for improved communication and network security, the authors proposed a cluster-enabled cooperative scheduling based on RL (CCSRL). The experimental results show that the CCSRL provide better PDR and PLR as compared to the Wu’s scheme and mobile service amount based link scheduling (MSA-LS). MATLAB is used as simulation software, Veins software is used to generate the vehicle distribution, and SUMO generates the road topology.
In [
63], the authors divide the coverage area into clusters and then federated DRL obtains the global model for each cluster. The global model is implemented every time there is a new vehicle added in the cluster. The performance of the proposed algorithm is better than the other decentralized learning scheme.
In [
64], the researchers introduced a wireless full-duplex (FD) technique to enhance spectrum efficiency and optimize system throughput. This approach employs V2V links within an FD cellular network. Managing resource allocation and power control becomes challenging due to the interference between self-interference (SI) at FD base stations (BS) and V2I links. To address this issue, they proposed a Dual Graph Coloring-based Interference Management (DGCIM) scheme. The two graph coloring schemes employed are the Grouping Graph Coloring with Recursive Largest First (GPGC-RLF) and Greedy Graph Coloring (GYGC) algorithms, respectively. Simulation results demonstrate that the proposed model satisfies high reliability and low latency constraints for V2V links.
In [
65] a VNF-RACAG scheme is proposed to derive the optimal number of clusters. It reduced and minimizes the end-to-end delay in edge networks as well as allocated physical resources to users and minimize the number of transfers between clusters. The VNF-RACAG scheme uses a NFV stochastic model as well as geographic contexts and transfer history of the users in the chaining time optimization process. The proposed scheme is compared with the WiNE and PSwH schemes. The simulation are done in MATLAB and the results showed that end-to-end delay is minimized by using the proposed VNF-RACAG algorithm. The VNF-RACAG scheme implemented gradient descent to calculate the number of clusters to minimize the end-to-end delay. After this, a graph partitioning algorithm is performed to minimize the movement between the clusters.
In [
66], the authors performed the resource assignment and video selection for vehicular devices by using the stochastic optimization. The aim is to achieve maximum QoE and secure a threshold duration of video at each receiver. A highway environment is simulated with different vehicular densities. The proposed method achieves better results with compared with two different baseline methods.
In [
67], Two-Arch2 algorithm is adopted by introducing a hierarchical clustering idea to optimize 5G based IoV architecture. The proposed algorithm is compared with NSGA-III, Two-Arch2, CA-MOEA and MOEA/D. The experimental results show that the improved algorithm can obtain the better resource allocation scheme than the other algorithms in terms of stability, energy consumption and load balancing.
In [
68], a multi-platform intelligent offloading and resource allocation algorithm is proposed to dynamically organize the computing resources. KNN algorithm selects the best option available out of cloud computing, mobile edge computing, or local computing platforms. In addition, when the task is offloaded to a desired server, RL is implemented to solve the resource allocation strategy. The state is defined as the MEC computing capacity, actions are offloading decision and computation resource allocation and the reward is the minimum total cost. The proposed joint optimisation is compared with full MEC and Full local techniques and it is concluded that the proposed scheme reduced the total system cost and optimize the overall system performance.
In [
69], a centralized multi-agent learning is adopted for joint communication, computing and caching resource allocation problem. The RSUs are controlled by central the controller to assign resources in their coverage areas. The soft actor critic (SAC) algorithm updates policy in the central controller and share with edge agents to regulate their resource allocation strategies. The proposed algorithm increases the system’s performance as compared to other schemes.
In [
70], the authors optimized the resource allocation and computation offloading by using MDP. The aim is to maximize the network operators’ revenue and for this DDPG algorithm is used to optimize the the wireless and computing resources dynamically. Compared with average and random algorithms methods, the suggested method is more effectual with little resources. This paper ignores the dynamics of task offloading as optimization.
In [
71], for V2V communication, the authors proposed a RL based decentralized resource allocation mechanism. It is applied to both unicast and broadcast scenarios. V2V links are agents and based on the minimum interference select their spectrum and transmitted power for V2I and V2V links. The transmitted power is divided into three levels and the agents select them based on their state information.
In [
72], the authors perform queue-length resource allocation. At controller level, the network safety flows are managed. The safety flows get more priority ratio based on the criticality, and the non-safety flows get less priority. The bandwidth allocation is the main fairness allocation criterion to obtain maximum rate for different application. The simulation environment use mininet-wifi for multiple RSUs and vehicles to communicate in V2v and V2I scenarios. The authors implemented LSTM, CNN, and DNN and compare their results. LSTM outperforms all models in terms of accuracy.
In [
73], the authors used RNN to automate the resource management for the IoV network. The dataset used is called GWAT-13 Materna dataset with 12 attributes. available on Materna 13, an open source directory. It has three traces expanded over three months period with each trace having 850 VMs data on average. The prediction results of ML models are used by ASR-CUMS to decide the resource requirements and update the physical resources.
In [
74], the authors proposed a DL based CSI estimation technique is proposed. It is assumed that the environment remains the same so the DNN model can learn the non-linear CSI relations with insignificant overhead. Moreover, a resource allocation based on dynamic eMMB and URLLC network slicing is implemented for vehicles. The proposed scheme attains 50% reduction in overhead and 12% higher threshold violations compared to an ideal case with perfect CSI knowledge.
In [
75] the authors maximize the overall system throughput by implementing the weighted minimum mean square error (WMMSE) algorithm for power allocation. Simulation results show that the DNN algorithm improves the approximation of the iterative WMMSE algorithm achieves the computational overhead reduction.
ML techniques have proven to be very effective in VANET environment yet face some issues. For instance, the SVM faces issues such as the choice of the optimized kernel and models complexity, for KNN, the optimum value of k changes from one dataset to another which makes it a time-consuming and complex method to find the optimum value of k, RF focuses on the construction of several DTs which is time consuming and in vehicular network for time-sensitive applications this method is not recommended. The RL is the most commonly applied technique in IoV. However, it is noted that in vehicular networks it suffers from the issues of dimensionality makes it harder to implement RL on physical systems.
4.2. AI for Resource Allocation in UAV Network
The mobility as well as LOS links provided by UAVs make them a promising alternate to the fixed BS for the wireless communication networks. Similarly, AI has gained a lot of interest in wireless communication as it has the ability to learn from data and also learn from the environment and take decision herself. That is why, the research community is working on integrating intelligence in UAVs networks using different AI algorithms. In this section, we cover the potential applications of AI in UAV based wireless networks. These networks when extended, can work as a basic but fundamental ground for UAV based vehicular networks.
In [
76], the authors handle the radio resource management in cellular based UAVs by combining resource block (RB) allocation and beam-forming design. To realize this, at terrestrial BSs an outer MDP characterizes the dynamic RB possession environment while the inner MDP is used to track the time-varying feature of B2D small-scale fading. A DRL-aided hybrid D3QN-TD3 algorithm is proposed to maximize the reduction of the UAV user equipment’s (DUE’s) ergodic outage duration(EOD) for outer and inner MDPs. In discrete domain, RB coordination is achieved using the deep double duelling Q network (D3QN). The beam forming in continuous time is achieved using twin delayed deep deterministic policy gradient (TD3).
In [
77], the authors combined the features of FL and MARL, for location deployment and resource allocation and proposed a multi-agent collaborative environment learning (MACEL) algorithm to optimize the network utility by ensuring the user coverage. The UAVs act as agents which are trained using the Deep-Q-Network (DQN) algorithm. The FL model does the aggregation of local gradients and weights at cloud server to make a global model. The final results demonstrate the superiority of the proposed model in adapting to the ever changing environment.
In [
78], to deploy UAVs-BS, the authors proposed regression neural networks (GRNN) and RF based models. For this purpose, the system implements the load prediction algorithm (LPA) to predict maco-cell congestion based on load history generated by the mobile network. Then, to calculate the accurate number of UAV-BS required, the UAV-BSs clustering and positioning algorithm (UCPA), is implemented. The results showed that the RF model achieved the accuracy of over 85%.
In [
79] the authors integrate DNN in UAV at MEC for communication resource allocation, model optimization and UAV trajectory control to ensure the service latency minimization while ensuring the learning accuracy and energy consumption requirements.
In [
80], the authors proposed the optimal UAV-RIS deployment mechanism based on dueling DQN to facilitate multiple DL users with the help of the MC-PD-NOMA scheme. The total transmit power is minimized. Moreover, UAVs’s trajectories are optimized jointly along with sub-carrier allocations, phase shifts at each RIS. The results show that more total transmit power is achieved using the proposed model when compared with other baseline methods.
In [
81], the authors maximize the sum rate of the UAV-enabled multi-cast network by jointly designing the UAV movement, re-configurable intelligent surface (RIS) reflection matrix, and beam-forming design from the UAV to users based on a multi-pass deep Q Network (BT-MP-DQN). In the proposed model, the UAV is the agent and beam-forming control and trajectory design are considered system actions. The movement of the UAV is discrete action whereas the beam-forming design is continuous action. It is shown that the achievable rate is maximized as well as the minimum rate of multi-cast group users is achieved.
In [
82], the multi agent DRL model is applied for resource management on cellular and IoT networks by integrating UAV in the system. First, the resource management related challenges in UAVs-BS based IoT networks are pointed out. The K-means algorithm and round robin scheduling algorithms are used for clustering and service request queue respectively. The accuracy, RMSE and testing time(s), are used as metrics to compare the proposed method with previous works. The proposed system achieves 94% resource management accuracy, RMSE score of 2.40% and testing times(s) levels of 1.05s. In terms of precision and recall estimation (as 88.05%, 93.04% and 92%, 90.01% for two of the clusters), the proposed model performs better than the Q-learning approach.
In [
83], the paper deploys mobile UAVs as BSs. The aim of this research is to achieve data collection and energy transmission maximization as well as minimum UAV energy consumption. This could be achieved by UAV transmission power, hovering time slots, and flight decisions optimization. This multi-objective optimization problem is solved by implementing the MJDDPG dynamic resource allocation algorithm. In this study, the experimental results of the proposed model are not compared with any other models to proof the authenticity of the model.
In [
84],the authors proposed a dynamic information exchange management schemes in a UAV network. First scheme shares the channel exchange information. Then, the dynamic time slot model is implemented in second scheme where priority based time slot sharing is performed. For both the schemes, to solve the problem of dynamic management, a DQN network coupled with LSTM is proposed. The LSTM network ensure the fast convergence of the DQN model.
In [
85], the authors proposed the MDP and then an Actor-Critic based RL technique (A2C) solution in UAVs to offload the computational tasks of the ground users and achieve the minimum mission time. This scheme achieves the terrestrial users offloading through UAVs and in turn the offloading of UAVs through MEC server.
In [
86], the authors probe the offloading of the task in UAV via MEC servers to minimize latency and the energy of the UAVs. Each UAV is associated with its corresponding task by keeping check of the available energy along with the optimal MEC server selection two Q-learning models are proposed. In [
87], the Multi Agent RL (MARL) model is proposed for resource allocation for multiple UAVs network. The authors perform the sub-channel and user selection and allocate power to each user by considering parameters such as the signal-to-noise ratio, LOS and Non-LOS conditions. The proposed MARL algorithm achieves a good trade-off between the information exchange overhead and the system performance.
In [
88], the authors consider a swarm of HAPS for communication and aim at comparing the RL and SI algorithms. The SI algorithm performs better as compared to RL as it converges quickly by maintaining the stable user coverage profile. On the other hand, the RL algorithm shows some convergence dips. However, the overall coverage rates are higher for RL.
In [
89], the authors proposed FL based resource allocation method for joint optimization of the UAV position and local accuracy of FL model and user computation and communication resource. After formulating the joint problem, these three problems are solved by divided them into separate sub problems. The proposed algorithm is compared with the fixed altitude UAV-assisted FL ratio and performs better learning and reduces the system overall energy consumption.
In [
90], authors perform joint power allocation and scheduling for a UAV swarm network. In the network one drone is selected as a leader and all other drones are make a group of drones following the leader. Every group transmits the update of its local FL model to the leader drone so it could combine all the local parameters for global parameter update to the global model. While the drones exchange updates, the wireless transmissions gets affected by many internal and external losses and interference. The impact of these factors on FL model is also considered. Moreover, experimental results approve the FL convergence analysis and the combined design strategy minimize the convergence time by 35% in comparison to the other state of the art designs.
In [
91], the authors perform task offloading to manage the resources by ensuring the energy and latency minimization for high-altitude balloon networks. The user association is decided using SVM-based FL model. The FL provided the privacy as the data is not being shared across the network.
To summarize, the ML methods have been recently (the last two years) widely exploited in dynamic resource allocation in UAV and fixed BS communication. The supervised learning, RL and FL are studied for fast making decision and response, resource allocation, task offloading, joint power allocation in high-dynamic UAV networks. However, the implementation of AI models is limited to the basic and much simpler state of the art models. The researchers ignored the limitation of these models while implementing them to solve the problem in hand.
4.3. AI for Resource Allocation in UAV-IoV Network
The heterogeneity of vehicular networks, their high dynamic nature with fast moving have made the vehicular network more complex and demand new requirements for networking algorithms that could meet the stringent network control and resource allocation demands such as efficient spectrum sharing, transmission power maximization and computational resource management to minimize the energy requirements. UAV-IoV networks are three dimensional and contrast to terrestrial networks, the (UAV) base station itself is moving. Therefore,traditional optimization techniques are unable to capture complex patterns. In this section we review AI/ML based resource allocation in UAV-IoV networks. A brief summary of these papers is provided in
Table 3.
Resource management in UAV-IoV is divided into radio resource allocation and computation resources management. The radio resource allocation is further divided into spectrum and channel access optimization. The main goal of radio resource management is to limit channel interference, power usage and network congestion. The computation resource management includes service, task and traffic offloading in MEC where the edge cloud nodes located in BSs and/or UAVs. This decentralization of the system generates faster response times as compared to the central deployments.
In [
92], the authors focus on bandwidth allocation, location control deployment and trajectory of UAVs for communication capacity maximization to enable the UAV to process more data at edge computing. They propose an actor-critic mixing network (AC-Mix) and multi-attentive DDPG (MA2DDPG) network. The AC-Mix is the combination of Qmix (relies on Q function and don not deal with continuous values) mixed with the actor–critic framework. The critic in AC-Mix has a individual value network and a mixing network maintained by a virtual agent. Each agent inputs a state–action pair to GRU network which in turn outputs the global action value. TensorFlow 1.15.0 is used to design all algorithms. For comparison, user fairness, and load balance metrics are presented. MA2DDPG convergence velocity is 30.0% and 63.3%, better than the MADDPG algorithm.
In [
93], the authors proposed a mechanism for energy harvesting by UAV from BS and vehicles using wireless power transfer (WPT) and simultaneous wireless information and power transfer (SWIPT) techniques, respectively. Maximum data offloading to the UAV is the main goal of this research. For optimized resource management and UAV velocity a DRL-based resource allocation and speed optimization (DRL-RASO) model is adopted. The proposed model achieves much higher offloading rates data offloaded to the UAV than the DRL-FTPA and Dueling-DQN. The authors named a simple DRL based DDPG algorithm DRL-RASO and no modification is suggested in the basic algorithm. The reported difference between DRL-RASO and DRL-FTPA is 5.79%.
In [
94], an UAV based vehicular network is built to deal with caching and computing problem in addition to BS. The energy minimization is achieved by combining the cache refreshing optimization, computation unloading and status age updates. The online decision making is performed using DDPG. The BS decides if the cache needs to be refreshed, task has to be executed and what should be the bandwidth distribution. The total energy consumption is the reward function. The learning performance of proposed model is compared with traditional DDPG algorithm in terms of convergence rate. Then, the energy consumption for four bench marks namely random refreshing, random offloading, popular refreshing and equal bandwidth is calculated. It is claimed that proposed model outperforms DDPG in terms of system energy consumption and computational capabilities of UAV MEC server. But the authors do not report any results obtained using DDPG model.
In [
95], authors formulated a combined auction-coalition building method to assign UAV coalitions to various IoV groups. The coalition formation game maximizes the combined profits of all UAVs. The auction-coalition creation method is suggested to achieve the UAV coalition stability. The reported simulation results show that the FL communication latency is decreased.
In [
96], the authors proposed a FL based approach for the development of IoV based applications. The authors used the Gale-Shapley algorithm to match the lowest cost UAV to each sub-region. The simulation results show that the lowest marginal cost of node coverage for a UAV is assigned to each sub-region for task completion. The UAV energy harvesting is suggested as a future work for UAVs to continue their flight without any need of going back to their charging stations.
The authors in [
97], proposed a secure bandwidth allocation scheme based on the game theory on the IoV assisted by UAVs. Also, for delay reduction and increased data privacy the proposed blockchain-based system introduces an emerging consensus mechanism. Furthermore, to allocate the limited safe bandwidth, based on the real-time feedback of each UAV an optimal decision search algorithm based on gradient descent to achieve Stackelberg equilibrium is proposed. The performance of the proposed scheme is compared with the many-to-one scheme, the maximum signal-strength-indicator (max-RSSI) scheme, the maximum signal-to-interference-plus-noise-ratio (max-SINR) scheme, the Auction-based UAV Swarm Many-to-Many scheme (AMMA) and UE-Optimal Many-to-One Matching scheme (UMOA). The proposed scheme achieves better throughput of about 95% as compared to other models but the authors do not provide any data to strengthen their claim about privacy and secured bandwidth allocation.
In [
98], the vehicular task offloading optimization problem is dealt by jointly considering the resource allocation, and the security assurance. Then, this problem is divided into two separate problems and finally iterative algorithm called LBTO is proposed. LBTO decides if a certain MEC is selected depending on the load of the MECs and uses the Lagrangian dual decomposition for optimized offloading ratio and the computation resource. Task to be processed is selected based on the size of the task, computing resources required to execute a task, task’s allowed latency and ratio of the offloaded task at UAV MEC or locally to total task. For security assurance wiretap coding is used to protect the vehicle’s MEC. The proposed algorithm provides better task offloading ratio and delay than the other algorithms. However, this research considers that the UAVs are fixed and that is why completely ignores their energy consumption during mobility in the objective function.
In [
99], the authors proposed a system wise computation capacity maximization of UAV based MEC for a group of vehicles equipped with WPT. They proposed a second-order convex approximation based method to solve a sequence of sub-problems. The platooning vehicles and UAVs are coupled.The drawback of this technique is that it does not use the AI/ML to solve the non-convex problem to give a comparative analysis.
In [
100] the authors proposed a MDP based model for UAVs to take trajectory decisions to maximize the communication coverage area. The Actor-Critic algorithm learns the environment. The problem under consideration is mixed integer non-linear and non-convex problem. The DRL model learns the underlying non-linearity and non-convexity optimally. The model inputs are residual energy of each UAV, number and position of vehicles, positions of UAVs with respect to ground level etc. The UAV travelling distance is taken as the action. The penalty on the network incurred if UAV does not provide coverage to a vehicle, the deployment of a new UAV, the remaining energy of each UAV and the UAV goes outside the designated path. The proposed model is claimed to be compared with random dispatch, fixed dispatch and hovering models for metrics such as energy consumption, time versus maximum performance and vehicular density versus average convergence. But, there is no comparison of the proposed model provided with other state of the art models. Overall 40% of improvement is reported in converge area of the UAVs using the proposed model.
In [
101], the authors combined auction-integration (AI) formations to integrate UAV into the groups of IoV elements with target of achieving the total revenue maximization of a single UAV. The proposed method is compared with the merge-and-split algorithm to find the UAV coalitions formation to the IoV groups, and random UAV coalition partitioning with second-price auction.
In [
102], the authors proposed a FL-based technique to make sure security for IoV applications. The accurate reporting of UAV types is achieved by applying MD contracts with its identity qualities and using many sources of variability. The least costly UAV for each sub-region is found using the Gale-Shapley method. The results show the success of the proposed matching scheme.
In [
103], the authors proposed a model free Q network to select the best UAV advice with least stalling time. The results show that the proposed system provide a high-quality user experience. This paper is limited in its scope as the limitation of UAV server capacity, speed and vehicle speed are not considered.
The research in the area of UAV based IoV networks is in its infancy. Many issues in the research are not addressed and ignored. The main focus of research is on system energy minimization, energy harvesting, caching and bandwidth allocation. Although in achieving these goals the researches ignore the latency requirements, changes in UAV height, the variations in vehicular node density. To manage resources, reinforcement learning is the most rampantly used AI technique. However, the studies used the state of the art RL models to minimize the system energy and bandwidth allocation etc. The lack of dataset availability in UAV based IoV networks makes it very hard to use ML and DL models for resource management and to explore the proven potential of these AI models. Furthermore, DL techniques have not been investigated because of the restricted power and processing resources available in UAV. In addition to this, the most important issue of security and data privacy are not considered in the research. The unencrypted and unauthenticated channels are used for UAV based communication which make them vulnerable to cyber attacks. Federated Learning can play a pivotal role in providing the security and privacy by training the ML model on data without transferring it to cloud server.