1. Introduction
LiDAR and specifically Airborne Laser Scanning (ALS) has emerged as the technique of choice for a diversity of mapping and change detection tasks in various fields of application. Amongst others, its provision of accurate 3D measurements with high spatial resolution, its minimal requirements concerning accessibility of a scene and its penetration capabilities through vegetation are widely utilized in earth sciences [
1], natural hazard management such as flood applications [
2], natural resource management [
3], object change detection in urban environments ([
4,
5]) and animal species diversity assessment [
6].
Change detection with LiDAR relies on repeated surveys in suitably spaced time epochs ([
1,
4,
7,
8]). Numerous exemplary studies applying such multi-temporal ALS data can be found in
Table 1.
While the sensor properties and surrounding conditions of each data acquisition epoch are extensively discussed, the data orientation and registration procedure is often depicted as an independent epoch-wise preprocessing task or not even mentioned. Most of the studies in
Table 1 trust that data providers and standard processing workflows sufficiently account for potential systematic discrepancies between separate epochs. Notably, the significance of comparisons between different epochs crucially depends on whether the respective datasets are consistent in terms of sharing a common coordinate frame. Even if the individual accuracy and precision of the datasets are clearly sufficient for a given task, change detection may be hindered by systematic discrepancies between the datasets ([
1,
23]).
In this paper, we investigate the impact of temporal decorrelation, varying sensor characteristics and varying environmental conditions on the orientation of multi-temporal ALS data based on a state-of-the-art strip adjustment procedure [
40]. Our aim is to optimize the workflow with respect to these additional challenges and answer the following questions:
What are the most important issues for the conceptualization, acquisition and orientation of multi-temporal ALS datasets?
Which criteria define an optimal workflow and how can these be achieved in practice? The presumably most rigorous method for the improvement of multi-temporal ALS data would be one simultaneous strip adjustment of all epochs combined. However, certain drawbacks can be associated with this approach. The number of observations and unknowns increases with each epoch leading to a high demand of computational resources. An integration of new epochs is elaborate and implicates datum changes for all old epochs. The central question thus is: Do the adjustment results justify these drawbacks or can they be matched by a more efficient approach?
What accuracy can be realistically achieved with the proposed methods?
The remainder of this article is structured as follows. In
Section 2 we discuss related contributions from the literature.
Section 3 introduces the study area and gives an overview of all relevant datasets. In
Section 4, the methodology is presented including the basic strip adjustment approach, the proposed workflows for strip adjustment of multi-temporal datasets and the criteria for quality assessment. The results are presented and discussed in
Section 5. The article is concluded in
Section 6 by summarizing our findings and deducing recommendations for the design of future LiDAR time series.
2. Related Work
The questions we are addressing in this paper are approached from different directions in the literature.
The aspect of ALS data registration with strip adjustment is for instance treated in [
41] and [
42]. Recently, many publications specifically adapt the workflow to UAV-borne sensors, such as [
40,
43,
44,
45] and [
46]. Due to its kinematic acquisition characteristics, the georeferencing of ALS data crucially depends on trajectory estimation [
47] in order to achieve optimal accuracy. This can either be solved in two separate steps (1. trajectory estimation - 2. strip adjustment with additional trajectory correction) or by directly integrating trajectory estimation and strip adjustment [
48]. An extensive review of integrated georeferencing approaches is provided in [
49] while [
50] propose an alternative approach in case of missing or unusable trajectory data. In the cited articles, evaluation is usually carried out by applying the proposed methods to datasets with rather homogeneous characteristics compared to the multi-temporal case. ALS data and reference data are typically acquired within a short time period resulting in comparable field conditions. Even then, some precautions have to be made in order to eliminate the influence of highly dynamic surfaces such as water bodies and vegetation canopy. However, it is not clear whether slow decorrelation of wider areas over several years or in the seasonal cycle can be handled. Another major difference to multi-temporal datasets is the fact that typically data from one specific sensor constellation are adjusted. Several data characteristics may still be fairly dissimilar, especially when combining the adjustment of ALS data with the adjustment of (aerial) images, as in [
51,
52] and [
53]. But also in this case, the respective configuration is explicitly taken into account by using different suitable functional models for LiDAR and image data, respectively. Compared to that, issues may arise if a dataset acquired over many years with various LiDAR sensors is treated in a uniform manner despite divergent properties such as point density, beam divergence, measurement accuracy, wavelength, pulse length, scan pattern etc.
Contrariwise, there is the application-driven view on multi-temporal Lidar data which has already been outlined in
Section 1. In an abundant number of studies, heterogeneous datasets are analyzed in detail and all sorts of conclusions are drawn. However, the question about orientation and geometric accuracy over different epochs is rarely discussed in depth. Publications such as [
1] and [
7] point out concerns about heterogeneous multi-temporal data in their respective fields of application. [
11] take into account the aspect of similar survey configurations during selection of suitable epochs. Furthermore, they provide an explicit workflow to ensure vertical co-registration of epochs and an uncertainty estimation for the resulting application, in this case tree growth assessment. [
26] conduct thorough uncertainty analysis for glacier extent analysis with DTMs interpolated from multi-temporal ALS data. The DTMs are registered using known stable areas in vicinity of the glacier. A similar approach is developed by [
54]. After epoch-wise strip adjustment of multi-temporal ALS data, a rigid body transformation is estimated for each epoch in order to share the datum of one reference block. The significance of remaining DTM differences is assessed with means of error propagation in order to separate actual changes from measurement noise. More recently, [
23] discuss the reliability of change detection using multi-temporal LiDAR and point out the importance of the registration procedure.
We summarize that none of the reviewed approaches performs an integrated strip adjustment for an extensive multi-temporal ALS dataset. If discovered, potential datum discrepancies are usually compensated in a separate step with significantly less rigorous approaches.
5. Results and Discussion
Coming to the adjustment results, we first investigate the individual block-wise adjustment with control patches (CPC,
Section 4.2.1), which is a typical solution in practice. The blockpair-difference matrices for the CPC approach (
Figure 4) clearly show the disadvantages. Many of the explainable differences in matrix A also translate into discrepancies in stable areas (matrix B) - for example epoch 2016-06-16. Additionally, even blocks separated only by days (e.g. 2014-10-14 and 2014-10-16) show high differences which can not be explained with actual changes on the ground.
The small values in the diagonal compared to the rest of the matrix indicate a strong bias of the CPC approach towards minimizing relative differences in individual blocks. The absolute datum definition with sparse control patches is rather weak leading to poor consistency across epochs as well as little robustness with respect to decorrelation.
Similar results were obtained for the expanded control areas (CPC+,
Section 4.2.2,
Figure 5). In this study area, the coverage with stable areas is apparently not sufficient to support consistent datum definition, especially if their wide distribution encourages the use of more flexible correction models (c.f.
Section 4.1).
The stable areas (
Figure 5 B) show slightly smaller differences than in the CPC approach (
Figure 4 B). This is not surprising as CPC+ is the only approach explicitly using these stable areas for strip adjustment. In contrast, the full blockpair difference matrix (
Figure 5 A) indicates higher discrepancies than CPC and all other approaches (
Figure 4 A and
Figure 6 A) which suggests overfitting towards the sparse control areas.
Compared to these single-block approaches, massive improvements can be observed for the bi-temporal adjustment (REF,
Section 4.2.3,
Figure 6). There are still striking discrepancies in the area-wide matrix (
Figure 6 A), mainly for the blocks under leaf-on conditions (May-October). But as opposed to
Figure 4 and
Figure 5, most of these blocks are hardly recognizable in the stable-area differences (
Figure 6 B). This indicates high robustness in terms of decorrelated areas not significantly influencing the orientation of unchanged surfaces nearby. The diagonal representing the
for each individual block is comparable between CPC and REF, however, the deviations from the rest of the matrix are much smaller in the latter approach. Notably, the reference block (2014-10-14) appears similar to several other blocks in
Figure 6 B. This suggests that the good overall datum does not come at the cost of strong overfitting.
When it comes to the stable blockpair-difference matrix resulting from the FULL approach (
Figure 7 A, c.f. Section
Figure 7), the values have a similar order of magnitude as for the REF approach. However, their distribution is somewhat different.
Figure 7 B provides a direct comparison of discrepancies in stable areas between REF and FULL. The following observations can be made:
The REF approach tends to deal better with single blocks differing from others in key properties (e.g. phenology). This is especially obvious for block 2016-06-16, but also for 2014-10-16 and 2013-10-26.
In the FULL approach, accumulations of blocks with similar properties seem to produce datum clusters in the orientation procedure. As an example, the three blocks from February 2014 act as a huge aggregated reference block which pulls other blocks towards its datum, resulting in consistently green rows and columns. Another cluster seems to be formed by the first six blocks (2013-04-15 to 2014-02-21) all acquired with the same scanner.
The row/column of the reference block (2014-10-14) predominantly is white or light green in
Figure 7 B, i.e. similar or even slightly better discrepancies in the FULL approach. This encourages the assumption that the global datum is not exceedingly biased towards the explicitly defined reference block in the REF approach. Note: As each bi-temporal adjustment leads to a (slightly) different variant of the the reference block datum, its CPC result is used for this comparison.
Later blocks, i.e. 2016-11-04 to 2021-03-09, seem to be more indifferent concerning the choice between FULL and REF as they show similar difference patterns between each other in both approaches. One possible explanation is the state-of-the-art equipment and the higher point density used for these blocks. But also their sparser distribution over time could play a role as it avoids any major cluster aggregation in the FULL approach. Thus, they share comparably weak influence on the global datum in both variants and are fit to an externally defined datum in a similar manner.
The diagonal is predominantly close to zero with only few exceptions. This suggests that the relative orientation within single epochs is not significantly influenced by the choice of orientation procedure.
A synopsis of these points suggests clear advantages of the REF approach compared to FULL. However, the mean of all values in
Figure 7 B is -0.7 mm, i.e. slightly in favor of FULL. In order to clarify this apparent contradiction, several strip difference mosaics were analyzed in depth. As an example,
Figure 8 compares the two approaches by means of the full strip difference mosaic for blocks 2013-05-24 and 2014-02-13.
Differences are in general comparably high for this block pair, which is not surprising due to the late May block 2013-05-24 effectively having full leave-on conditions in contrast to most other blocks. Apart from that, the distribution of differences strongly varies between the two approaches. The FULL approach (
Figure 8 A) shows several areas with distinct systematic datum discrepancies. These are situated around agricultural areas in the block center where many blocks overlap. Notably, surrounding stable areas (e.g. roads, building roofs in the close-up map) are equally affected. Only the town center which predominantly consists of stable areas comes close to the results of the REF approach (
Figure 8 B). In comparison, the results from the REF approach seem clearly better for most of the block. However, the quality significantly drops in vicinity of the reference block boundary where the sparse control patches remain the only link to ensure a consistent datum definition.
Several other block pairs show a similar pattern leading to the assumption that the blockpair difference matrix in
Figure 6 does not reflect the full potential of the REF approach. It is disadvantageously biased by the limited coverage of the reference block, which is significantly smaller than the total area covered by all blocks combined (
Figure 1).
As a result of this analysis,
Table 3 rates the different approaches with respect to the quality criteria formulated earlier.
The overall view of the various criteria clearly favors the REF approach for the given dataset. This will be further supported by quantitative summaries of the blockpair-difference matrices (
Figure 4 to
Figure 7), an analysis of DSMs computed for each block and a comparison to check points in
Table 4.
Using a DSM allows to show the full potential of the resulting data. A DSM was calculated for the REF results of each epoch following the methodology of [
60]. In comparison to the models of individual LiDAR strips and their differences, the DSMs filter random noise to a higher extent.
Figure 9 visualizes the differences between epoch-wise DSMs in the style of blockpair difference matrices. The full differences (
Figure 9 A) hereby also include forests as the DSM calculation contains no roughness masking. Thus, the difference patterns are massively influenced by phenology and vegetation growth. More importantly, the stable DSM differences (
Figure 9 B) proof a consistent common datum definition with discrepancies of less than 2 - 3 cm (
) for all DSM pairs without notable outliers.
Independent validation was carried out using the point measurements from 02.02.2023 (
Section 3.2). Therefore, the height measured in these points was compared to the DSM height for every ALS epoch.
Figure 10 illustrates the spatial distribution of differences (mean and standard deviation) for each check point. After an analysis of these differences at the individual check points, we will look at the distribution of these differences w.r.t. each epoch.
The consistently small standard deviation (2 cm and smaller) for all check points in
Figure 10 confirms the high relative accordance of all epochs. Apart from that, the exclusively positive mean differences between 0 and 8 cm are remarkable. One possible explanation is a combination of (i) a biased global datum of ALS data or check points and (ii) local datum deformations of the ALS data leading to the varying order of magnitude. A more likely reason is the difference in data acquisition: For the check points, single terrestrial measurements were carried out, mostly on manhole covers. The DSM is a result of interpolating an ALS point cloud. It is thus also influenced by a certain neighbourhood which predominantly contains slightly (e.g. road surface) or significantly (e.g. curbstone, low vegetation) higher points.
Table 4 provides a statistical comparison of all major results discussed in this section by listing quantitative measures for each epoch. The first columns show the mean
of blockpair-wise strip discrepancies for each epoch relative to all other epochs. These strip difference measures underline the assumption illustrated in
Figure 4,
Figure 5,
Figure 6 and
Figure 7 that the approaches CPC and CPC+ fall short of REF and FULL in terms of across-epoch accuracy. The overall average differences are slightly smaller for the FULL approach (4.47 cm) in comparison to REF (4.54 cm). When comparing DSM based differences to individual strip based differences (again each epoch is compared to all others), it is notable that the average over all
differences reduces from 4.5 cm to 1.7 cm. Furthermore, the range of values is 2.1 cm to only 1.2 cm (Column "DSM diff." in
Table 4). This relative accuracy demonstrates the high precision and consistency airborne LiDAR measurements can provide. The differences between each DSM and independent check points confirm an absolute accuracy in the range of few centimeters while also showing the positive bias of mean differences already discussed in the context of
Figure 10.
As a specific example,
Figure 11 provides a DSM difference model between two blocks separated by nearly seven years. Despite the long time span and differing phenology, roads and building roofs are consistently white, i.e. less than 2 cm difference. Notably, this is also the case in direct vicinity of agricultural land and vegetation. Agricultural land is mostly negative (red), meaning 2014-06-10 is higher than 2021-03-09 due to leaf-on conditions. Positive differences (blue) show new objects such as photovoltaic panels (west of A) or new buildings (north-east of B). Vegetation gives a mixed image, depending on whether growth/plantation (blue) or phenology/clearance (red) is the dominant effect over the given time span. The grassland around (C) had been mowed in 2014-06-10 and not yet overgrown in 2021-03-09 resulting in effectively no height difference except for the hay deposited to dry in 2014. Due to the datum accordance of the two epochs, also more subtle changes such as slight degradation of the soccer field north-east of (D) can be observed. Only the differences in the river channel (south to north through D) are not meaningful as refraction corrections for the bathymetric data have not been applied.
6. Conclusions
In this paper, we compared various strip adjustment approaches to achieve a common datum for 21 LiDAR blocks distributed over a time span of approximately 8 years. The results proved that the seemingly most rigorous approach of combining all epochs into one large adjustment is not necessarily the optimal solution. Besides the high computational costs, the resulting common datum is potentially compromised by clusters of similar blocks.
Compared to that, the much more flexible pairwise adjustment of each epoch with one reference block (
Section 4.2.3) showed exceptionally consistent results within the area covered by the reference block. Relative discrepancies of the derived height models were below 2-3 cm for all involved epochs with a mean of 1.7 cm. In this bi-temporal adjustment of two independent epochs, the utilized adjustment concept (
Section 4.1) demonstrated high robustness with respect to short- or long-term surface changes such as vegetation or construction sites. Without the explicit application of masks, the automated matching and rejection of correspondences efficiently avoided impact on stable areas nearby.
For setting up a new time series following this approach, especially the design of the reference block plays a key role:
Chronology: Due to practical reasons, the reference block is presumably acquired in the very beginning of a time series since all earlier epochs would have a preliminary datum prior to its availability. Notably, our results suggest no decrease in accuracy related to an increasing temporal offset from the reference block.
Coverage: In order to ensure ideal datum consistency, the reference block should cover the entire designated study area. All datasets protruding beyond this area have to be supported by additional control data.
Point density: It is not necessary that the reference block has the highest point density of the time series. At minimum, it needs to be high enough to allow the reliable computation of local surface normal vectors for relevant surface patches (e.g. small roofs) during strip adjustment.
Accuracy: While we found no extreme overfitting to the reference block in our work, its quality decisively determines the achievable accuracy for the whole time series. In order to give stability to other more volatile blocks, the original trajectory estimation for the reference block has to be of high quality so that rigid trajectory correction models are sufficient for the reference block in the adjustment process. The interior and especially the borders of the reference block have to be covered with control patches varying in slope and exposition. In order to support more blocks in the time series, permanent surfaces (e.g. building roofs) for these control patches are preferred. If this is not possible, temporary targets should be placed in some distance from the permanent surface (e.g. ground) nearby. This avoids the establishment of erroneous correspondences for blocks acquired after the removal of these targets.
All other blocks of the time series need to cover an area within the reference block which is interspersed with stable patches as extensively as possible. In terms of decorrelation, vegetation is less problematic due to the robustness of the adjustment procedure. However, large areas with slow changes over time (e.g. sliding slopes) have the risk of systematically compromising datum estimation and are ideally eliminated using explicit masking.
For future work, a meaningful next step is to test this approach under changed circumstances. This includes a stronger variation of flying heights, the combination of airborne and UAV-borne sensors, flatter or more mountainous terrain characteristics etc. Furthermore, a close look into the effect of different scanner systems is worthwhile, especially when differing in scanner wavelengths and beam characteristics (c.f. [
56]). Finally, working with classified data could bring benefits in terms of a broader quality control, e.g. using terrain points only, but may also help improve the adjustment procedure in challenging areas such as large forests.
Author Contributions
Conceptualization, M.W.; methodology, M.W.; software, M.W.; validation, M.W. and G.M.; formal analysis, M.W.; investigation, M.W.; resources, N.P.; data curation, G.M. and M.W.; writing—original draft preparation, M.W.; writing—review and editing, M.W., N.P., C.R., and G.M.; visualization, M.W.; supervision, N.P.; project administration, N.P. and G.M.; funding acquisition, N.P. All authors have read and agreed to the published version of the manuscript.