Comparison of photogrammetric canopy models from archived and made-to-order aerial imagery in forest inventory

In remote sensing-based forest inventories 3D point cloud data, such as acquired from airborne laser scanning, are well suited for estimating the volume of growing stock and stand height, but tree species recognition often requires additional optical imagery. A combination of 3D data and optical imagery can be acquired based on aerial imaging only, by using stereo photogrammetric 3D canopy modeling. The use of aerial imagery is well suited for large-area forest inventories, due to low costs, good area coverage and temporally rapid cycle of data acquisition. Stereophotogrammetric canopy modeling can also be applied to previously acquired imagery, such as for aerial ortho-mosaic production, assuming that the imagery has sufficient stereo overlap. In this study we compared two stereo-photogrammetric canopy models combined with contemporary satellite imagery in forest inventory. One canopy model was based on standard archived imagery acquired primarily for ortho-mosaic production, and another was based on aerial imagery whose acquisition parameters were better oriented for stereo-photogrammetric canopy modeling, including higher imaging resolution and greater stereo-coverage. Aerial and satellite data were tested in the estimation of growing stock volume, volumes of main tree species, basal area and diameter and height. Despite the better quality of the latter canopy model, the difference of the accuracy of the forest estimates based on the two different data sets was relatively small for most variables (differences in RMSEs were 0–20%, depending on variable). However, the estimates based on stereo-photogrammetrically oriented aerial data retained better the original variation of the forest variables present in the study area.


Introduction
Multi-source national forest inventory (MSNFI) technique has been used operationally for producing remote sensing-based information of forest resources in the form of thematic maps and forest statistics for various areas since the 1990s in Finland (Tomppo et al. 2008;Mäkisara et al. 2019). The method is based on combining information from satellite imagery, digital map data and field measurements of national forest inventory (NFI) plots (Tomppo et al. 2008). Since the introduction of the method, the availability of satellite imagery has been improved due to new optical earth observation satellites and open access policies adopted by the satellite operators (Barrett et al. 2016;Saarinen et al. 2018). Despite the new open access satellite data from missions such as the Copernicus Sentinel 2, the main drawback of the MSNFI has remained until today: the accuracy of satellite image-based forest estimates is not sufficient at individual stand level (Tomppo et al. 2008;Tuominen et al. 2017a). For this reason, parallel forest inventory systems are maintained for fulfilling the information requirements of forest management (Tuominen et al. 2014;Kangas et al. 2018).
In variables such as the quantity of aboveground forest biomass or (stem) volume of growing stock, it is difficult to achieve any major improvement in the accuracy of remote sensing-based forest estimates using traditional 2-dimensional (2D) image analysis, due to the low correlation between the forest variables and spectral or textural image features (Tuominen et al. 2017a). A significant improvement in the accuracy can be achieved by using 3-dimensional (3D) remote sensing data, e.g., in the form of LiDAR or photogrammetric points clouds.
Although very well suited for estimating local (stand or pixel level) forest variables, the acquisition cycle of airborne laser scanning (ALS) data is not well suited for MSNFI; due to the relatively long acquisition cycle of LiDAR scanning the temporal resolution is low. Moreover, the continuous areas covered by LiDAR scanning each year are not very large, leading to low numbers of NFI plots in each scanning area . Regarding the acquisition cycle and area coverage, aerial imaging is better suited for MSNFI; according to the national aerial imaging program in Finland, each part of the country should be covered with intervals of 5 years, providing sufficient temporal resolution for MSNFI. (Maanmittauslaitos 2019a) On the other hand, cloudy weather may prevent obtaining the data within the planned cycle, which is a risk regarding the suitability of photogrammetric data (Kangas et al. 2019). There are also plans for shortening the ALS rotation from 10 to 6 years (Maanmittauslaitos 2019b), which may improve the suitability of ALS data for MSNFI purposes in the future.
The aerial imagery acquired in the national aerial imaging program is mainly aimed at producing RGB and color-infrared ortho-photo mosaics for mapping, survey, forestry and agricultural purposes. Typically, the imaging parameters such as stereo-coverage and spatial resolution have not been adjusted for stereo-photogrammetric canopy modeling. Nevertheless, the stereo coverage of the imagery is sufficient for deriving stereo-photogrammetric 3D point clouds on the basis of the imagery, including archived old images. Although the canopy height models (CHM) based on such imagery appear somewhat crude and contain fewer features when compared visually e.g. to CHM based on ALS, they still have high potential in predicting forest variables such as stem volume and aboveground biomass (Baltsavias et al. 2008;Järnstedt et al. 2012;Pitt et al. 2014). Although the higher precision and detail of the ALS based CHM leads to better accuracy in the estimated of forest variables (Järnstedt et al. 2012), the difference in the accuracy of predicted forest variables between photogrammetric and ALS based CHMs has in many cases been fairly low (Tuominen et al. 2017a;Kangas et al. 2019). On the other hand, in some studies the differences have been remarkable, for instance Kukkonen et al. (2017) reported a difference in RMSE of total volume to be 10.53 percentage units.
Number of different algorithms have been used for generating stereo-photogrammetric point clouds for estimating forest characteristics (Baltsavias et al. 2008;Haala 2014;Stepper et al. 2017;Goodbody et al. 2018;Ullah et al. 2017;Melin et al. 2017). Regardless of the algorithm used, the image resolution, factors related to viewing geometry and stereo-matching parameters have a significant impact on the quality of image-based point clouds (Goodbody et al. 2019).
While there are no universally applicable criteria for an appropriate image resolution for deriving photogrammetric point clouds, improving the image resolution will produce crisper transitions at geometric shapes such as tree canopies, and supposedly improved 3D data product (Leberl et al. 2010).
Increasing the stereo overlap of the imagery provides another means for potential improvement of the 3D data product. From overlapping images, an object can be visible on multiple image pairs, allowing for multi-view matching, which reduces the opportunity for occlusions, and improves geometric accuracy (Leberl et al. 2010;White et al. 2013). The high overlaps also increase the chances for successful matching of tie-points for avoiding gross errors. The high overlaps also reduce the need for dense ground control points and the deformation caused by errors (Kraus 2004;Leberl et al. 2010;Goodbody et al. 2019).
CHMs based on aerial imagery with centimeter class spatial resolution and stereo overlaps of 60-80% would allow high estimation accuracy for forest variables related to the amount of stem growing stock and the size of individual trees, but such data is not economically or operationally feasible in any large area forest inventory. Evidently, it is possible to adjust the parameters of conventional large area aerial imaging towards better suited for applying stereo-photogrammetric 3D modeling, but this will happen at the expense of cost per ha and area coverage. There is no clear-cut solution, what is a feasible trade-off between the cost of aerial imaging when adjusting resolution and stereo-coverage more suited for stereo-photogrammetric 3D modeling and the improvement of forest estimates potentially achieved.
The objective of this study was to examine the performance of two aerial datasets acquired using different imaging resolutions and stereo overlaps in forest inventory. Specifically, the aim was to find out whether the aerial dataset, whose acquisition was based on imaging parameters that were more purpose built for stereo-photogrammetric canopy modelling would result in significant improvement compared to archived aerial imagery in the accuracy of the estimates of growing stock volume, tree species specific volumes and mean stand height and diameter.

Study area and field data
The study area is located in central Finland in the municipalities of Virrat and Keuruu. The area covers approximately 75 000 ha of a larger test area used in study by (Tuominen et al. 2017a). The geographical, topographical and ecological characteristics of the study are described in detail in (Tuominen et al. 2017a). Map of the study area is presented in Fig. 1.
The reference data for this study was allocated in the study area using systematic cluster sampling. The field sample plots were established originally as a part of a larger sample plot network (Tomppo et al. 2016). The field data set consisted of a total of 384 sample plots, of which 312 were located on forestry land. In each of the full clusters, there were eight plots with 250 m spacing in two orthogonal rows, and clusters were placed 4.3 km apart from each other both horizontally and vertically (Tomppo et al. 2016).
The first round of field measurements was carried out in 2013 applying circular 9 m fixedradius sample plots, from which all trees with diameter (at breast height, DBH) was 4.5 cm or more. Tree species, DBH, tree class, distance and compass bearing from the center point were recorded from all tally trees. Additionally, the mean height and age were measured from the tree of median basal area of each tree species. Additionally, basic stand-level characteristics were measured from each stand intersecting a plot (Tomppo et al. 2016). The plots were re-measured in 2017, excluding the stand-level variables using the same measurement protocols as in 2013. The average values of the field variables measured in 2013 and 2017 were close to each other, but there were differences in the distribution of the field variables measured with 4 year interval, partly due to forest increment and removals, but also due to somewhat differing inclusion of tally trees. The main statistics of field data from years 2013 and 2017 are presented in Table 1. The tree species in the

Remote sensing data
Two sets of remote sensing data corresponding to the dates of the two field measurements were tested in this study. Data set corresponding to year 2013 consisted of aerial imagery acquired in July and August of that year originally with the purpose of aerial ortho-mosaic production for forest management. Imagery was acquired using a Microsoft UltraCam Eagle camera, which is a frame camera. The imaging altitude was 4700 m, and the original imagery had stereo overlap (forward) of 80% and (lateral) 35%. Images contained red (R), green (G), blue (B) and near-infrared (NIR) spectral bands, and they were ortho-rectified to a ground resolution of 30 cm per pixel. Based on the photogrammetric measurements of stereo pair images, a 3D point cloud (canopy surface model, CSM) representing the uppermost canopy layer was calculated using Trimble MATCH-T software. This point cloud covers the study area in the form of an evenly spaced grid with 0.75 m intervals between points (point density approx. 1.8 points m -2 ). The features extracted from this data are hereafter referred as standard (aerial) data set. Additionally, Landsat 8 satellite images were used to complement the aerial imagery, since it has been noted that the higher spectral resolution of multispectral satellite imagery provides information not contained by typical color-infrared aerial imagery, and their combination usually lead in to more accurate forest estimates than each of the image data individually. As no suitable Landsat 8 materials were available from summer 2013, a scene acquired on July 23rd 2014 was used for covering the main part of the study area. As that image was partly covered by clouds, another Landsat 8 scene acquired on July 3rd 2015 was used to cover the patches affected by clouds and their shadows. From available Landsat products, atmospherically corrected surface reflectance products were downloaded from USGS EarthExplorer data access portal. The reflectance histograms of the 2015 image bands (bands 1-7) were matched to those of the 2014 image, and the clouded patches of 2014 were filled with the data of 2015.
Aerial imaging-based remote sensing data set corresponding to year 2017 was acquired using Leica DMC III aerial camera which has a push broom sensor. The imaging altitude was same as in 2013 (4700 m), and the stereo overlap was somewhat higher compared to that of 2013: 80% (forward) and 60% (lateral). The average ground sampling distance was 25 cm. Stereophotogrammetric point cloud was derived from the imagery (using Trimble MATCH-T software) with 25 cm distances between points (point density approx. 16 points m -2 ). The features extracted from this data are hereafter referred as stereo-oriented (aerial) data set.
Examples of the stereo-photogrammetric point clouds are illustrated in Fig. 3. As in standard aerial data set, a contemporary satellite image was used to complement aerial imagery and photogrammetric point cloud. Sentinel 2 image from 30th June 2017 was used to cover most of the test area, but as in Landsat images from 2013the temporally most fitting image scene had cloud gaps, which were covered using another Sentinel image from 17th June 2017.

Remote sensing features
The stereo-photogrammetric point cloud CSMs derived from the aerial image datasets were converted to canopy height models (CHM) by normalizing the elevation values (Z-coordinates) of the points to terrain level using digital terrain model by National Land Survey of Finland based on airborne laser scanning (Maanmittauslaitos 2020). The canopy height values of the photogrammetric point cloud were also interpolated into a raster format CHMs at spatial resolution and grid spacing similar to the digital aerial images.
Remote sensing features from raster format aerial data were extracted based on 16 × 16 m square windows, whose centers coincided with the centers of the sample plots. The following features were extracted: The textural features based on co-occurrence matrices of pixel values were extracted as average values for features calculated in four directions in the extraction window: horizontally (0° angle), vertically (90°), and diagonally (45° and 135°). Lags of 2.7 and 5.7 and 2.5 and 5.0 m were used for standard and stereo-oriented data sets respectively.
From the Landsat and Sentinel satellite imagery, the image features were extracted using single pixels on which the sample plots were located. The following features were extracted: 3. Percentage of canopy points having H ≥ than corresponding Hp, p is one of 20, 40, 60, 80, 95). 4. Ratio of canopy points to all points. 5. Percentage of canopy points above height limits Hmin + s/10 × (Hmax − Hmin), where s is one of 1-9.* 6. Canopy relief ratio. 7. Skewness and Kurtosis. 8. L-moments (L1, L2, L3, L4) and L-moment skewness and kurtosis. 9. Quadratic mean and cubic mean. 10. Average Absolute Deviation. 11. Median of the absolute deviations from the overall median. 12. Inner Volume and outer Volume.

Estimation and testing of forest variables
The k-nearest neighbor (k-nn) method was used for the estimation of the forest variables (Kilkki and Päivinen 1987;Muinonen and Tokola 1990;Tomppo 1991). The estimated stand variables were mean DBH and mean height, basal area, total volume of growing stock (volume), the volumes of Scots pine, Norway spruce and deciduous species.
The remote sensing datasets encompassed a total of 84 aerial photograph features, 32 features from rasterized canopy height data and 49 3D point features. Additionally, a total of 30 and 69 satellite image features were extracted from Landsat and Sentinel imagery respectively (the numbers differ due to the different number of bands in Landsat 8 and Sentinel-2 images). As the extracted feature set was large, presenting a high-dimensional feature space for the estimation, the dimensionality of the data was reduced by selecting a subset of features with the aim of good discrimination ability. The selection of the features was performed with a genetic algorithm (GA) -based approach, implemented in the R language by means of the Genalg package (Willighagen and Ballings 2015;R Development Core Team 2016). The objective of the genetic algorithm was to minimize the RMSEs of the applied forest variable estimates for each variable in leave-one-out cross-validation.
This approach searches for the subset of predictor variables based on criteria defined by the user. Although there is no guarantee of finding the optimal predictor variable subset (Garey and Johnson 1979), and the algorithm does not go through all possible combinations, solutions close to optimal can usually be found in a feasible computational time.
The general GA procedure begins by generating an initial population of strings (chromosomes or genomes), which consist of a random combination of predictor variables (genes). Each chromosome is considered a binary string having values 1 or 0 indicating that certain variable is either `selected' in the subset or `not selected'. The strings evolve over a user-defined number of iterations (generations). This evolution includes the following operations: selecting strings for mating by applying a user-defined objective criterion (the more copies in the mating pool, the better), allowing the strings in the mating pool to swap parts (cross over), causing random noise (mutations) in the offspring (children), and passing the resulting strings to the next generation. The process is repeated until a pre-defined criterion is fulfilled or a pre-determined number of iterations have been completed (Broadhurst et al. 1997;Tuominen and Haapanen 2013;Moser et al. 2017).
Different values of k were tested in the estimation procedure. In the k-nn estimation, the Euclidean distances between the sample plots were calculated in the feature space defined by the applied remote sensing features. The stand variable estimates for each sample plots were calculated as weighted averages of the stand variables of the k-nearest neighbors (Eq. 1).

where:
= estimate for variable y, k = number of nearest neighbors, y i = measured value of variable y in the ith nearest neighbor plot.
Weighting by inverse squared Euclidean distance in the feature space was applied (Eq. 2) for diminishing the bias of the estimates (Altman 1992). Without weighting, the k-nn method often causes undesirable averaging in the estimates, leading to over-or underestimation in the peripheral areas of forest variable distribution, especially when limited number of reference plots are available.
where: d i = Euclidean distance (in the feature space) to the i th nearest neighbor plot, g = parameter for adjusting the progression of weight with increasing distance.
The accuracy of the estimates was calculated via leave-one-out cross-validation by comparing the estimated forest variable values with the measured values (ground truth) of the field plots. In the cross-validation all circular plots within same experimental plot were excluded from the nearest neighbors. The accuracy of the estimates was measured in terms of the relative root mean square error (RMSE) (Eq. 3), relative bias (Eq. 4) and coefficient of determination (R 2 ) between estimated and measured forest variables. It is also of interest to assess, how well the distribution of predicted values matches with the distribution of the true values. Especially the predictions in the high and low tails of the distribution are of interest. In this study, the distributions were assessed using an error index defined as: where p is the number of classes of the distribution considered, v k is the true mean volume in the k th class, and v k was the mean of predicted volumes in the k th class. This index was calculated for the distribution of total volume and the species-level volume distributions and both datasets.

Results
When comparing the height where 80% of the CHM points of a sample plot were accumulated (H80) to the mean stand height measured from the plot, which typically have high correlation with each other in point cloud data sets, there are practically no differences in the correlation between H80 and mean height in the standard and stereo-oriented data (Fig. 4) the second decimal. In the point cloud data sets from both years there seem to be clear geometric errors in certain sample plots, where the point cloud indicated very low stand height compared to field measurement. There was little difference in the general estimation accuracies between the standard and stereo-oriented data sets, which can be noted in the scatterplots (Fig. 5) representing the observed values of total volume of growing stock, mean DBH, mean height and basal area and the respective estimated values based on standard and stereo-oriented aerial data. Similar results can be seen from the RMSE values of the estimates, as presented in Table 2 as well. Except for pine and spruce volumes, the estimation accuracy of all forest variables was lower when using stereo-oriented aerial data, which was contrary to expectations. The estimation accuracies for those variables (as measured by RMSE values) were 3-6% lower with stereo-oriented data compared to estimates based on standard data set. However, the accuracy of estimates of the spruce volume were 17% higher and estimates of pine volume 2% higher with stereo-oriented data set.
When examining the coefficients of determination (R 2 ) between the estimates based on the standard and stereo-oriented aerial data sets and the respective ground truth values, the R 2 were generally slightly higher for variables based on stereo-oriented data set, except for spruce volume, where R 2 markedly higher and for mean height estimates where standard data set resulted in slightly higher R 2 ( Table 2).
When aerial data sets were combined with contemporary satellite imagery, the standard data set combined with Landsat data again resulted in, contrary to expectations, better estimation accuracy for total volume, deciduous volume and basal area compared to stereo-oriented data set (including Sentinel image). However, estimation accuracies for pine and spruce volumes as well as for DBH and height were better when using stereo-oriented aerial data combined with Sentinel data. As with results without satellite data, the only variable showing clear difference in favour of the stereo-oriented data was the volume of spruce, where the estimation accuracy was 20.9% higher than with standard data, as measured by the RMSE. For the volume of pine, the estimation accuracy was 3.3% higher, whereas for the volume of deciduous trees the accuracy was 1.2% lower. The estimation accuracies of DBH and height were 0.4% and 3.3% higher, respectively, with stereo-oriented aerial and Sentinel data. The RMSEs of the estimates based on the combination of aerial and satellite data are presented in Table 3.
A more distinct difference between the combinations of aerial and satellite data can be noted when examining the R 2 values of the estimates. Again, there was little difference between the two data sets, when estimating total volume, mean height and basal area. However, when estimating species-specific volumes of the main tree species, as well as mean DBH, the coefficient of determination was markedly better when using stereo-oriented aerial and Sentinel data set (see Table 3).
When comparing the distributions of the forest variable estimates based on standard and stereo-oriented aerial data sets with the corresponding field measurements from the sample plots, similar averaging trend was noted for all forest variables. As often typical for k-nn estimates, the classes in the middle of distribution are overestimated, and the classes in the low and especially in the high end of the distribution were underestimated. In the distribution of total growing stock ( Fig. 6) volume this trend was less pronounced than in volumes per tree species, and the under-  representation is most significant in the highest volume classes of volume estimates based on standard data set. In the estimates based on stereo-oriented data set, the highest volume classes were better represented in relation to the field data. In the total volume distribution, using the same classes as in Fig. 6, the R index was 30.3 for the standard data set and 31.5 for the stereo-oriented data set. Thus, in the total volume the difference was small but to the benefit of standard data set, in the same way as in the RMSE results.
In the distributions of the volumes per tree species (pine, spruce and deciduous) the differences of the measured and estimated (standard vs. stereo-oriented aerial data sets) variables became obvious also in the low end of distribution due to the difficulties in the discrimination of the proportions of tree species with the applied remote sensing data. However, for the volumes of individual tree species, the distributions of the estimated and measured volumes were markedly closer to each other in the stereo-oriented data set for almost all volume classes compared to standard data set. This is confirmed with the R indices, with values 36.45 and 31.47 for pine, 37.47 and 30.01 for spruce and 33.14 and 26.02 for birch in standard and stereo-oriented data sets, respectively. This means that the distributions of the species level results could be improved, even though the distribution of the total volume could not.
In the estimation of stand height (Fig. 7), neither the standard nor the stereo-oriented data set proved clearly better than the other in replicating the height distribution of the measured data from the corresponding year. Fig. 6. Distribution of estimated (est) total and species-specific stand volumes based on standard and stereo-oriented data sets and the respective field measurements.

Discussion
The results indicate that customizing the aerial imaging parameters for stereo photogrammetric aerial imaging, while simultaneously aiming at retaining the capability for covering large areas did not bring any marked improvement in the accuracy of the predictions of the forest variables. When examining the accuracy of the total growing stock volume estimates and stand height estimates, whose accuracy is typically highly dependent on the 3D point cloud features, there was no significant difference between the two CHMs that would justify the higher acquisition cost of stereo-oriented imagery. Similarly, when examining the distributions of the total volume and height estimates in comparison with the distributions of field observations, neither the standard nor the stereo-oriented data set demonstrated clear superiority in that regard. When considering the raw point cloud features, such as the height where 80% of points have accumulated (H80) which typically has the highest correlation with stand height, the performance of those point cloud features was virtually similar, as shown by the R 2 values presented in Fig. 4, despite the higher point density and geometric quality of CHM based on stereo-oriented data set (as illustrated in Fig. 3).
In the estimation of species specific (pine, spruce and deciduous) growing stock volumes there was some improvement in the plot level accuracies when using the stereo-oriented data, especially in distinguishing the proportions of coniferous species. More significant improvement could be noted, when examining the distributions of species-specific volume estimates, where the stereo-oriented data set seemed to retain better the original variation of the field data in the remote sensing-based estimates. The error indices describing the species-level volume distributions also showed improvement.
When using a combination of aerial and satellite data, which would be a feasible option in practical forest inventory, the inclusion of satellite data generally brought moderate improvement to the estimates based on of both standard and stereo-oriented data sets. The inclusion of satellite data seemed to emphasize the difference of the data sets when estimating species-specific volumes. Here the improvement is mainly due to the better spectral and radiometric resolution of the Sentinel 2 satellite imagery compared to the Landsat 8 imagery. This conclusion can be drawn, when examining the list of RS features selected by GA from the standard and stereo-oriented data sets. Among the features extracted from the stereo-oriented data set, the Sentinel 2 satellite features have higher importance than the Landsat features used with the standard data set. On the other hand, Fig. 7. Distribution of the stand mean height estimates based on standard and stereo-oriented data sets and the respective field measurements. the number of selected CHM and aerial image features in the stereo-oriented data set was notably lower than in the standard data set. However, although lower number of features may indicate their lesser weight in the estimation, sometimes it may also indicate better correlation of individual RS features and field measurements, resulting in fewer features needed. A complete list of features selected from the standard and stereo-oriented data sets is presented in the Supplementary file S1, available at https://doi.org/10.14214/sf.10291.
Earlier studies (Järnstedt et al. 2012;Gobakken et al. 2015;Tuominen et al. 2017a) have shown that even the much higher geometric detail provided by ALS based CHMs do not result in significantly better forest estimates than photogrammetric CHM with the applied point densities. In that respect the results of this study are highly consistent with other studies comparing different point cloud data sources. However, also studies with a clear difference between photogrammetric data and ALS exist . The reasons for this kind of variation of the results may depend e.g. on the algorithms used for obtaining the photogrammetric data, and the quality of the aerial photos (for example the shadows of the clouds in the photos). It is likely, that the algorithms can be improved in the future, but the problems due to photo quality are not easily diminished.
When comparing photogrammetric data with varying overlap, Puliti et al. (2017) found 80% overlap slightly outperforms 60%, but Bohlin et al. (2012) reported to have detected no improvement from higher overlap. Our results are also well in line with these studies. It thus seems that utilizing the aerial photos taken for normal forest management are good enough, besides being markedly cheaper.Considerably higher plot level estimation accuracies for similar forest variables as tested in this study have been achieved by using considerably lower level aerial imaging altitudes and UAV-based imaging sensors, which allows significantly higher spatial resolution with similar stereo overlaps as applied in this study (i.e. stereo-oriented data) (e.g. Tuominen et al. 2017b). However, in the present technological situation in UAV-technology, covering large forest areas in the context of NFI, such inventory would not be feasible economically or operationally. In the long run, it is however likely that the development in UAV-technology, as well as increased use of automatization and robotics in their operation will allow covering large forest areas cost-efficiently by UAV-based aerial imaging, while simultaneously enabling the acquisition of geometrically highly accurate CHMs based on spatially high imaging accuracy.

Conclusions
The results showed no clear difference between the standard and stereo-oriented aerial data sets in the estimation of forest variables. Thus, there was no conclusive evidence of the advantages brought by the adjustment of the aerial imaging parameters for stereo-photogrammetric 3D modelling instead of using standard aerial imaging parameters intended for ortho-photo production. On the other hand, the results demonstrate the potential benefit of standard aerial imagery which could be put into good use by the application of stereo-photogrammetric 3D modelling. Furthermore, using satellite imagery in combination with aerial data improves the estimation accuracy, especially for the proportions of tree species.