Articles | Volume 29, issue 13
https://doi.org/10.5194/hess-29-2837-2025
https://doi.org/10.5194/hess-29-2837-2025
Research article
 | 
07 Jul 2025
Research article |  | 07 Jul 2025

Quantitative soil characterization using the frequency domain electromagnetic induction method in heterogeneous fields

Gaston Mendoza, Guillaume Blanchy, Ellen Van De Vijver, Jeroen Verhegge, Wim Cornelis, and Philippe De Smedt
Abstract

The frequency domain electromagnetic induction (FDEM) method is a widely used tool for geophysical soil exploration. Field surveys using FDEM provide apparent electrical conductivity (ECa), which is typically used for qualitative interpretations. Quantitative estimations of soil properties remain challenging, especially in heterogeneous fields. Quantitative approaches are based on either deterministic or empirical modeling. While the deterministic approach faces limitations related to instrumental drift, data calibration, inversion, and pedophysical modeling, the empirical approach requires developing a local model, which involves extensive field sampling.

This study aims to evaluate the effectiveness of FDEM modeling based on either a deterministic or empirical approach, identify its limitations, and search for optimal field protocols. We provide practical guidelines for end users to quantitatively predict soil water content, bulk density, clay content, cation exchange capacity, and water EC in heterogeneous fields. Two field surveys were conducted in Belgium, where FDEM data were collected using Dualem-421S and Dualem-21HS sensors, along with data taken from electrical resistivity tomography (ERT) measurements and an impedance moisture probe and soil sampling. A comprehensive sensitivity analysis revealed that deterministic modeling procedures could not predict water content more accurately than a mean value approximation (negative R2). This analysis also highlighted the sensitivity of the minimization method used in FDEM data inversion and the applied pedophysical model. Empirical modeling, which does not require FDEM data calibration or inversion, outperformed the deterministic approach. However, its prediction accuracy is limited, particularly if soil sample depth is not considered.

Share
1 Introduction

Frequency domain electromagnetic induction (FDEM) tools are widely applied in geophysical soil surveys (Boaga, 2017). These instruments often serve to qualitatively determine spatiotemporal changes in the apparent electrical conductivity (ECa), reflecting the influence of soil characteristics within the measured soil volume (Doolittle and Brevik, 2014). As the relationship between electrical conductivity (EC) and several of such soil attributes has been investigated extensively, FDEM is also capable of their quantitative assessment. Specifically, soil water content is a preferred target because of its central role in soil–plant interaction, groundwater assessment, soil ecological functioning, and climate regulation.

Despite these applications, a broader practical implementation of FDEM remains mainly limited to academic settings (Altdorff et al., 2017; Huang et al., 2007). Two major challenges hinder wider adoption. Firstly, the FDEM methodology itself faces issues such as instrumental drift, approximations to translate raw FDEM data to ECa, calibration difficulties (Hanssens et al., 2020; Minsley et al., 2012), and the necessity for data inversion of ECa to true EC before a non-local quantitative assessment can be made. This reality persists even though – particularly for research purposes – adaptive correction procedures and open-source inversion codes have become available. Secondly, a significant obstacle in translating soil EC data into a target soil property lies in the current limitations of pedophysical models. These models are non-site-specific deterministic and link geophysical variables with soil properties (see, e.g., Friedman, 2005; Glover, 2015; Romero-Ruiz et al., 2018) but often suffer from a lack of precision and are difficult to generalize. This is exacerbated by the variability in soil types, spatial heterogeneity, temperature conditions, and the electromagnetic frequency of measurements (Jougnot et al., 2018; Moghadas and Badorreck, 2019). Importantly, the difference between the laboratory-analyzed and FDEM-measured soil volumes can cause scale inconsistencies (Lück et al., 2022) that affect the validity of these relationships. As an alternative to pedophysical models, field-specific empirical relationships can be composed at the cost of obtaining significant amounts of calibration data (Corwin and Lesch, 2003). Despite empirical modeling being inherently limited to the conditions represented by the dataset the model has been trained for, exhaustive assessments of this method demonstrated useful predictions of various soil properties across agricultural fields (Boaga, 2017; Rentschler et al., 2020).

Here, we evaluate how FDEM data can serve to quantitatively predict spatial variations in volumetric soil water content (θ), bulk density (ρb), clay content, cation exchange capacity (CEC), and water EC (ECw) in a practical, straightforward manner on two heterogeneous test sites. In search for optimal field protocols, we evaluate to which extent considering instrumental limitations and different procedures of FDEM data correction and processing influences the accuracy of the predicted soil attributes and what the trade-off between deploying a physics-driven deterministic versus field-specific data-driven model implies. Finally, we propose field and modeling strategies for optimal soil characterization with FDEM surveys.

2 Methodology

2.1 FDEM functioning

FDEM devices function by passing an alternating current through a transmitter coil, creating a oscillating primary electromagnetic field that varies over time. This primary field (HP) interacts with the subsurface, inducing eddy currents which subsequently produce a secondary electromagnetic field (HS). The HS/HP ratio is detected by the receiver coil and is quantified as a complex number, consisting of an in-phase component (IP = Re(HS/HP)) and a quadrature component (QP = Im(HS/HP)). Both IP and QP are typically measured, reflecting the device setup and the conditions of the subsurface.

The QP expressed in parts per million (ppm) can be converted to the actual ECa using the linear model developed by McNeill (1980) assuming a homogeneous subsurface electrical conductivity. This model assumes a uniform subsurface EC and is known as the low induction number (LIN) approximation. It is valid when the induction number (β) is low (β≪1). The LIN approximation proposed by McNeill (1980) is given by

(1) EC a = QP 4 μ 0 ω s 2 when β = s µ 0 ω EC 2 1 ,

where ω is the angular frequency, μ0 is the magnetic permeability of free space (1.257×10-8 H m−1), and s is the coil separation. It can be seen from this equation that large frequencies and higher EC soils will violate the β≪1 specification. It is important to note that the LIN approximation also assumes that the FDEM device is operated at ground level above a homogeneous, poorly conductive subsurface (Callegary et al., 2007; McNeill, 1980).

2.2 Data collection

Two heterogeneous agricultural fields were examined in this study. Site 1, located in Middelkerke, Belgium, is shown in Fig. 1a. Belgian soil map data (Van Ranst and Sys, 2000) indicate that the field is affected by saline groundwater and exhibits a soil texture varying from loam (26 % clay, 34 % sand) to silt loam (10 % clay, 40 % sand) (USDA textures), with clay layers starting at depths greater than 0.50 m. In contrast, Site 2, located in Bottelare, Belgium (Fig. 1b), is characterized by fresh groundwater. The soil texture at this location ranges from sandy loam (13 % clay, 76 % sand) to clay (64 % clay, 5 % sand). Both sites have a plow horizon, are actively used for agriculture, and had no standing crops at the time of the survey.

https://hess.copernicus.org/articles/29/2837/2025/hess-29-2837-2025-f01

Figure 1Site 1 in Middelkerke (a) and Site 2 in Bottelare (b) (Belgium), with the position of the ERT transects and 15 soil sampling locations per site, selected via conditioned Latin Hypercube Sampling based on previously obtained FDEM LIN ECa data. The mapped data are ERT-calibrated robust ECa HCP1.0.

Field surveys at both sites involved collecting FDEM data using different sensors, all operating at 9 kHz: the Dualem-421S at Site 1 with a 3 m crossline sampling density and the Dualem-21HS at Site 2 with a 1 m crossline sampling density, both with a constant distance above ground of 0.165 m using a sled. The instrument was pulled with a quad across the field with a driving speed of approximately 10 km h−1 and a measurement sampling rate of 10 Hz. The crossline sampling density was decided based on the time to survey each field, Site 1 being bigger. The surveys at both sites provided in-phase (IP) and quadrature-phase (QP) data with an in-line sampling density of approximately 0.3 m in horizontal co-planar (HCP) and perpendicular (PRP) configuration. For both sites, transmitter–receiver separations of 1.0 m (HCP1.0), 1.1 m (PRP1.1), 2.0 m (HCP2.0), and 2.1 m (PRP2.1) were used. Additionally, 4.0 m HCP (HCP4.0) and 4.1 m PRP data (PRP4.1) were collected at Site 1 and 0.5 m HCP (HCP0.5) and 0.6 m PRP data (PRP0.6) at Site 2. Electrical resistivity tomography (ERT; Syscal Pro, Iris Instruments) was performed with 0.5 m electrode spacing, during the FDEM surveys (Fig. 1). The ERT transects were located based on previous surveys to include the largest ECa range across the field.

In addition to geophysical surveys, soil sampling was carried out at 15 predetermined locations at each site. These locations were strategically selected using the Latin hypercube sampling method (Minasny and McBratney, 2006) and were based on insights from previously collected FDEM data. Undisturbed soil samples were extracted from two depths, 0.10 m (topsoil) and 0.50 m (subsoil) below the surface, in stainless steel 100 cm3 cores using an auger. In total, 30 samples per site were analyzed in the laboratory to obtain soil texture (after sieving at 2 mm), CEC (CoHex method, Ciesielski et al., 1997a, b), θ, and ρb (gravimetric method with convective oven drying at 105 °C).

To accurately determine in situ EC, ECw, and temperature within the soil sampling volume (100 cm3), measurements were taken at each sampling location using a HydraProbe soil probe (HydraProbe, Stevens Water Monitoring Systems, 2008). The correction proposed by Logsdon et al. (2010) was applied to improve the quality of these EC readings.

2.3 Data processing

The general processing workflow of the FDEM survey follows Hanssens et al. (2020) and is described in Fig. 2. The methodology aims at processing the FDEM data to obtain reliable EC data at sampling locations and then predict soil properties. For this process, a meaningful physical modeling sequence was followed. For instance, no inversion was implemented on uncalibrated FDEM data. This involved four key steps: ERT inversion, FDEM data calibration, FDEM data inversion, and pedophysical modeling (Hanssens et al., 2020). All computer code used is open-source, and default parameters were prioritized, ensuring reproducibility of methodology and results. All developed codes for this section are available in Mendoza Veirana (2024b) and collected data in Mendoza Veirana (2024a).

https://hess.copernicus.org/articles/29/2837/2025/hess-29-2837-2025-f02

Figure 2Workflow of geophysical data process including prediction of soil properties. Ellipses represent observations that exist independently of all data processes. Rectangles represent data over all the field and/or soil sampling locations. Parallelograms represent data over the ERT profiles. The square represents an external model. Colors represent modeling processes: light blue for ERT inversion (Jupyter Notebook “00_inv-ERT”), green for FDEM data calibration (Jupyter Notebook “01_QP_ cal”), orange for FDEM data inversion (Jupyter Notebook `02_EC_invert'), and red for soil properties modeling (Jupyter Notebook “03_Soil_properties_modelling”).

Download

2.3.1 ERT inversion

The measured ERT data were inverted using the ResIPy (v3.5.4) open software (Blanchy et al., 2020) which is based on the R2 codes (Binley and Kemna, 2005) (see Fig. 2 light-blue box; see full code in the Jupyter Notebook “00_inv-ERT”). The method for inversion follows a least-squares minimization between the measured ERT data and modeled subsurface geoelectrical parameters. The inversion process incorporates regularization techniques that balance data fidelity and smoothness of the subsurface model to address the inherent non-uniqueness and instability in ERT data inversion (Binley, 2015). A standard inversion using a triangular mesh was implemented, converging after three iterations. After inversion, extraction of ERT profiles was done by averaging the EC in a neighborhood of 0.5 m around each electrode. Alternatively, to obtain smoother profiles, an extraction window of 2.5 m was also used.

2.3.2 FDEM data calibration

Calibrating raw FDEM data is required for obtaining reliable EC data at sampling locations, and such calibration was done by combining ERT and FDEM data (see Fig. 2, green boxes) (Lavoué et al., 2010; van der Kruk et al., 2018). On the one hand, the raw uncalibrated FDEM data (in ppm) were transformed to ECa data following the LIN approximation. On the other hand, the inverted ERT EC data were firstly grouped by profile, and any profiles at the beginning or end of each transect that did not reach a minimum depth of 4 m were removed due to the lower sensitivity in those peripheral zones. After this, 100 profiles remained for Site 1 and 40 profiles for Site 2. Subsequently, the inverted ERT profiles were forward-modeled to the theoretical FDEM LIN ECa measured by the Dualem instrument over the ERT transect (Lavoué et al., 2010). The forward model implements a 1D full solution of Maxwell's equations considering an electromagnetic field, which, after FDEM instrument reading, is composed by IP and QP signals.

After calculating both FDEM LIN ECa data over the ERT profiles, these data were matched by spatial proximity with the closest FDEM data point. A linear regression was then fitted for the six coil configurations (see Fig. 3). This linear regression was applied to the entire FDEM survey dataset, effectively performing a linear calibration.

https://hess.copernicus.org/articles/29/2837/2025/hess-29-2837-2025-f03

Figure 3Calibration of FDEM QP data. A linear regression is fit between the FDEM LIN ECa data collected on the field (X axis) over the ERT transect and the FDEM response that was forward-modeled from the inverted ERT data (Y axis). This is shown for the three different QP coil configurations across the three subplot columns and for both sites displayed in the top and bottom rows.

Download

Lastly, the calibrated FDEM QP data were transformed to robust ECa (rECa) values to provide reliability beyond LIN constraints (Hanssens et al., 2019), such as high salinity and clay content levels. A visual comparison of the forwarded FDEM ECa and uncalibrated and calibrated LIN and rECa FDEM data over the ERT transect is shown in Fig. 4. The rECa consistently exceeds the LIN ECa, reflecting the known limitations of the LIN approach and aligning with the tests reported by Hanssens et al. (2019).

https://hess.copernicus.org/articles/29/2837/2025/hess-29-2837-2025-f04

Figure 4Comparison of HCP1.0 QP ECa data along the ERT transect for Site 1 (a) and Site 2 (b). This includes calibrated and uncalibrated FDEM (LIN) ECa and calibrated robust (rECa) data. Also, the forward-modeled ERT (LIN) ECa data are shown.

Download

2.3.3 FDEM data inversion

To assess the impact of the different modeling steps, we provide the parameters used and alternatives for comparison. To obtain top- and subsoil EC, 1D data inversion was performed with EMagPy (v1.2.2) (McLachlan et al., 2021) using the full Maxwell-based forward model (Wait, 1982). For both sites and based on borehole observations, a five-layer subsurface discretization was maintained, with fixed interfaces at 0.3, 0.6, 1.0, and 2.0 m. Another option for layer interfaces definition consists in using a logarithmic scale from 0.15 to 2.0 m. The closest FDEM observation to each sampling location was selected as the reference, in contrast to averaging FDEM observations within a radius (2 m for Site 1 and 1 m for Site 2) around the sampling location. Additional parameters of the inversion problem include an optimization method (Gauss–Newton) (Virtanen et al., 2020) or Robust Parameter Estimation (ROPE) (Bárdossy and Singh, 2008), a vertical smoothing parameter (α, default = 0.07), and L2 norm objective function. Moreover, inversion data were composed of all the coil configurations. Removing HCP2.0 and PRP2.1 for Site 1 and HCP0.5 and PRP0.6 for Site 2 could lead to lower inversion errors. The starting model for inversion was set to the average of the ERT profiles using the given subsurface layers; alternatively, one particular reference ERT profile can be used. Inversions with a negative R2 error were discarded and not analyzed further. Finally, EC limits (constraints) were applied to rECa FDEM data at sampling locations during its inversion process (just for ROPE solver). These were defined as the minimum and maximum EC values of the inverted ERT profiles.

Once the EC data were obtained by inversion of the rECa FDEM data for each sample location, they were used to calculate the soil properties of interest.

2.3.4 Soil property calculation

Linking EC data to soil properties at sampling locations can follow two basic modeling strategies: local empirical and universal deterministic modeling.

Local empirical modeling enables us to empirically predict several soil properties across the surveyed field, at the expense of collecting and analyzing soil samples to build a training dataset. This modeling consists of fitting functions to the training dataset and predicting at targeted locations. Traditionally, polynomial functions have been used for this task (Rentschler et al., 2020), but in recent years machine learning algorithms (such as artificial neural networks and random forest) have performed better (Moghadas and Badorreck, 2019; Rentschler et al., 2020; Terry et al., 2023). However, using machine learning requires a large number of training data that may not be obtainable for practical FDEM applications. Thus, we stick to polynomial functions for empirical modeling.

In our case, for both sites the original soil analysis dataset (n=30) was randomly split into a training dataset (n=20), while the remaining was used as test dataset (n=10); this process was repeated 100 times. The optimal polynomial degree was chosen as the one that maximizes the median R2 errors on all the 100 test sets.

Three distinct approaches to polynomial development were utilized. A first approach, named “layers together” (ST-LT), consisted of combining data from different soil depths so that no differentiation was made between top- and subsoil samples for model development. Secondly, these sample sets were considered separately in an approach whereby different polynomials were developed for each soil layer (“layers separate”(ST-LS)). In this modeling approach, the same polynomial degree was maintained for both top- and subsoil data. Finally, the ST-LS2 approach was like ST-LS but permitted different polynomial degrees for the models of each layer.

Model training features included calibrated (LIN and robust) and uncalibrated (LIN) ECa data from the six FDEM coils and the inverted EC data at soil sampling locations, while independent targeted soil properties were θ, cation exchange capacity (CEC), clay content, ρb, and ECw.

Deterministic modeling uses general pedophysical ECθ relationships based on physical principles that have been validated across a wide range of soil conditions (e.g., models presented by Rhoades et al., 1976). Such modeling does not require calibration data, avoiding the cost of field sampling and laboratory analysis. However, such pedophysical models may fall short in representing extreme scenarios outside the tested soil characteristics ranges. Additionally, soil data (such as porosity, ECw, and texture) must be available to adequately populate the model and predict the target property. Lastly, soil data also require corrections of temperature and electromagnetic frequency (Moghadas and Badorreck, 2019). Because the relationship of EC with soil properties is most straightforward for θ, predicting other targets, such as soil texture or salinity, is generally not feasible under deterministic modeling.

To compare performances between deterministic and empirical modeling strategies, we tested the pedophysical models on the same test datasets used for empirical modeling. Three different approaches were employed to populate the pedophysical model. The deterministic approach for layers together (DT-LT) consisted of averaging soil property data from all samples regardless of their depth. The layers separate approach (DT-LS) utilized averaged soil property data from samples at the same layers. The last approach, termed the “ideal” (DT-ID) scenario, used the actual soil property data from each specific location. Hereby, ideal EC refers to the EC at each sampling location that would result in a perfect θ prediction after pedophysical modeling.

Predicting θ via pedophysical modeling followed three steps. First the inverted EC data at 9 kHz were transformed to direct current (DC) EC using the model proposed by Longmire and Smith (1975), which was further validated by Cavka et al. (2014). Then, the resultant DC EC was temperature-corrected using the model proposed by Sheets and Hendrickx (1995). Lastly, the EC data were converted to θ based on Fu et al. (2021):

(2) EC = EC w θ 2 + θ 0.654 clay 100 - clay + 0.018 + 1 - EC s ,

with the solid-phase conductivity ECs (considered negligible) and porosity =1-ρb/ρp, where ρp is the soil particle density (=2.65g cm−3). All steps were implemented automatically using Pedophysics open-source software (Mendoza Veirana and De Smedt, 2024). The pedophysical model of Eq. (2) has been validated for samples with 0 % to 33 % clay content, ρb from 1.05 to 1.83 g cm−3, ECw from 0.03 to 5.6 S m−1, and θ up to 50 %.

Evaluating the deterministic modeling goodness in comparison with previous studies is not possible because the performance of the FDEM technique is site dependent (Boaga, 2017). Therefore, error indicators (R2 and RMSE) are compared between deterministic and empirical modeling approaches. Additionally, to assess the limitations of the deterministic modeling, the inverted FDEM EC and ideal EC data are compared to the in-site EC measured with the impedance probe, along with their associated water content.

2.4 Sensitivity analysis

In order to develop practical recommendations for FDEM end users and understand the impact of a given parameter (Pannell, 1997), we performed a sensitivity analysis for the most relevant parameters described above. This analysis aimed at finding the impact of alternative choices made during the whole FDEM data processing workflow for deterministic estimation of water content in the soil samples. The one-at-a-time method, which is the most widely used sensitivity analysis in environmental sciences (Saltelli and Annoni, 2010), was employed. It consists of altering one parameter in a stepwise manner and calculating the outcome while fixing other influencing parameters to a predefined origin. Although the one-at-a-time method is practical and easy to implement, it does not give clear information about the effect of all parameters (Saltelli and Annoni, 2010), as the combined effect of two or more parameters is not evaluated. This was solved by deploying the elementary effects method (Saltelli and Annoni, 2010), which consists of changing one parameter at a time but without returning to an origin. Then, using elementary effects, all combinations of parameters' values were evaluated.

In this study, we defined the origin (X0; see Table 1) as the standard set of parameters used for the whole data process (F) that correspond to a standard inversion and subsequent solution (Y0), which is the standard solution for volumetric water content (θ0):

(3) F observed data , X 0 = Y 0 = θ 0 .

Table 1List of model parameters used across all the data workflows. Standard values for each parameter are presented in the second column (X0) and alternatives to these values in the third column.

Download Print Version | Download XLSX

3 Results and discussion

3.1 Comparing EC data

A comparison of EC data obtained by the soil probe observations, standard FDEM inversion (using X0 parameter values), and ideal EC for both sites is shown in Fig. 5 alongside the water and clay content of associated samples. The observed water content has mean values of 0.34 and 0.29 and variance of 0.003 and 0.008 for Site 1 and Site 2, respectively.

https://hess.copernicus.org/articles/29/2837/2025/hess-29-2837-2025-f05

Figure 5Comparison between EC obtained with the soil probe (a, d), FDEM standard inversion (b, e), and ideal EC (c, f) versus water content and clay content as an additional dimension. All the EC data are corrected for electromagnetic frequency and temperature (direct current EC at 25 °C).

Download

Considering the EC measured by the soil probe as the reference for actual data, the inverted EC significantly deviates from this reality. Furthermore, while the ideal EC and soil probe EC display a similar trend, this trend is noticeably stretched (compare first and third column in Fig. 5). It is also noteworthy that as the difference between ideal and soil probe EC (for both sites) increases, so does the clay content, with a Pearson correlation of 0.83 (p<0.005), not shown in Fig. 5. This disparity becomes even more pronounced for clay contents exceeding ∼30 %, which is in accordance with the validity range of Eq. (2) (clay contents up to 33 %).

https://hess.copernicus.org/articles/29/2837/2025/hess-29-2837-2025-f06

Figure 6Bar plots showing median results of empirical modeling of soil properties. The median R2 values are obtained after testing such models in test datasets randomly generated as 30 % of the original dataset and iterating 100 times to ensure good data distribution.

Download

3.2 Empirical modeling results

The performance of empirical models for predicting observed soil properties is presented in Fig. 6. Poor predictions (negative median R2 over test datasets) were obtained when considering topsoil and subsoil data jointly (LT) for model development over training datasets (i.e., not considering sample depth). This may be due to an oversimplistic modeling that does not consider sample depths. Negative R2 values occur when the model fits the data worse than a simple horizontal line at the mean of the observed values. Approaches LS and LS2, which use different fitted functions per soil layer, resulted in better results as expected. No significant differences were observed in R2 values for features ECa uncalibrated and calibrated LIN or rECa. This may be due to the linear relationship between uncalibrated and calibrated ECa and quasi-linear relationship with calibrated rECa, which does not add information to such variables as polynomial features (Lavoué et al., 2010). However, the FDEM-inverted EC data generally underperformed compared to the rest of features, with the only exception of the LS2 approach for θ prediction at Site 1, with a R2=0.19 and a RMSE = 0.047, while for Site 2 the maximum R2 is 0.31, with a RMSE = 0.066.

Comparing performances for predicting different soil properties, ECw was shown to be an easier target than any other soil property. ECw prediction was generally better for Site 2 that does not have the influence of saline groundwater. Predicting CEC, clay content, and ρb, on the other hand, seemed to be highly site dependent.

An important consideration when interpreting the empirical approach is the disparity in scale between the soil volumes measured by FDEM (m3) and the smaller volumes analyzed in the laboratory (100 cm3). While some studies have addressed this issue, their conclusions are often site specific and inconsistent (see, e.g., Cong-Thi et al., 2024; Dimech et al., 2023). Despite these discrepancies, it is worth noting that non-inverted EC generally outperforms inverted EC for soil properties prediction, even though inverted EC represents a smaller soil volume (cubic decimeters). This performance gap may be attributed to errors in the EC inversion process rather than the difference in spatial scales.

When the best-performing models for θ prediction are implemented using the entire dataset (Fig. 7) – using both training and test data – this outperforms the modeling presented in Fig. 6, where only test data are incorporated into error assessment. While this is a common approach, we want to highlight this is improper practice to critically evaluate model performance as the inclusion of training data in error estimation results in an overestimation of model performance (Altdorff et al., 2017; Lipinski et al., 2008; Tibshirani et al., 2001). In other words, implemented model errors should not be confused with actual expected accuracy of target property predictions.

To evaluate the influence of other soil properties in θ prediction, the residuals of the implemented empirical models were correlated with other soil properties, but these were not significant.

https://hess.copernicus.org/articles/29/2837/2025/hess-29-2837-2025-f07

Figure 7Empirical model implementation. Best-performant empirical models for θ prediction (based on Fig. 6 results) were implemented using the 30 samples per site. Panel (a) shows the data for Site 1, and panel (b) shows data for Site 2.

Download

3.3 Sensitivity analysis

The result of the sensitivity analysis is presented in Fig. 8 for Site 1 (upper subplot) and 2 (lower subplot). Generally, no possible combination of parameter values yielded an RMSE lower than 11 % (or 0.11 cm3 cm−3) for θ predictions, which corresponds to a negative R2 value; that is, they performed worse than a single mean solution. Additionally, the predictions for θ were worse for Site 2 than for Site 1, presumably due to the larger variance in θ data at Site 2. The standard solution θ0 obtained using the X0 parameter values was poorly performant too (see red lines in both subplots of Fig. 8). From Fig. 8, the boxes which differ the most from the rest represent the most sensitive parameters. For both sites, the most sensitive parameters are the minimization method used and the pedophysical model approach.

https://hess.copernicus.org/articles/29/2837/2025/hess-29-2837-2025-f08

Figure 8Box-and-whisker plot results of uncertainty analysis. The figure shows the error outcomes of an elementary effect sensitivity analysis using parameters involved in processing all FDEM data. The top row displays results for Site 1 and the bottom row for Site 2. The plotted data are the median of RMSE. The error associated with the θ0 solution is highlighted in red. Each box represents 50 % of the data (i.e., the error associated with a specific parameter value), with a horizontal line indicating the median, while the whiskers represent 25 % of the data at each end.

Download

Using the minimization method ROPE leads in general to better θ predictions, despite its average inversion error (R2=0.64 for Site 1 and R2=0.19 for Site 2) being higher than for Gauss–Newton (R2=0.75 for Site 1 and R2=0.94 for Site 2). Also, about 75 % of ROPE inversions for both sites did not converge or reached a negative R2 error, while for Gauss–Newton most of the inversions converged with a positive R2.

The optimal approach in deterministic modeling is not the same at both sites. While the ID approach was the best at Site 1, the best at Site 2 was LT. This could be because ID uses actual soil properties to populate the pedophysical model (focusing on variance of the error), and the LT approach uses average soil properties (attacking the bias error), resulting in an unclear benefit because of the general poor performance.

4 Limitations

Although the presented research focuses on comparing different choices made along modeling steps, it is important to highlight its site-specific nature (Boaga, 2017). Therefore, because both sites were selected based on their heterogeneous nature, the challenge that they represent is not necessarily representative of most common fields where the FDEM technique is applied, where collected EC FDEM data normally have a narrower range (Minsley et al., 2012; van der Kruk et al., 2018). Furthermore, the effects of scale disparities among soil sampling, ERT, FDEM, and probe measurements were not examined, even though they can introduce additional uncertainties when interpreting subsurface properties.

While several modeling parameters were tested, the data acquisition strategy remained unchanged. New insights might be gained by, for instance, using a different algorithm to select sampling locations (Brus, 2019) or reducing the FDEM crossline sampling density to better match FDEM data with ERT data at sampling locations. The FDEM data calibration strategy used in this study was indeed unique, and it assumes that the inverted ERT data accurately represent the subsurface. This assumption poses a limitation, as inverted ERT data can, in practice, contain acquisition and inversion errors, which ultimately impact FDEM data calibration (Minsley et al., 2012). Additionally, the evaluation of different parameters in the sensitivity analysis was not exhaustive, with its results being relative to the parameters chosen.

For instance, using different optimization methods would improve the FDEM inversion error and offer more flexibility in the inversion, such as allowing variable layer thicknesses. However, not all optimization methods, such as the Gauss–Newton method, support variability in subsoil layer depths. Additionally, only 1D forward and inversion models using FDEM methods were employed, without considering lateral smoothing through 2D or 3D inversions.

Furthermore, three different deterministic modeling approaches were tested using Eq. (2), but other pedophysical models were not considered. The difficulty in obtaining the ECs parameter of Eq. (2) led to its exclusion, which might have compromised the model's effectiveness.

Lastly, the study was limited to univariable empirical modeling. Multivariable regression incorporating more than one feature (such as using inverted EC and uncalibrated EC), as well as other machine learning methods, was not explored.

5 Conclusions and suggestions

Absolute soil property quantification using the FDEM method in heterogeneous fields is far from being accurate and methodologically solved.

The classical field-specific empirical modeling of soil properties, though limited, still offers the most straightforward solution. Based on our cases, uncalibrated ECa data can be used without compromising the effectiveness of such an approach. This bypasses the issues of physics-driven deterministic modeling, such as data calibration, robust EC estimation, geophysical inversion, and pedophysical modeling. Such empirical models should consider vertical soil variability; otherwise large mispredictions are expected. However, this is at the cost of building a dataset by sampling and analyzing the soil target properties at the desired exploration depth. In samples not used for training, water content predictions achieved a poor best estimation with an R2 value of 0.31 (RMSE = 6.6 %), which may be inadequate depending on the application, whereas ECw was better estimated. Scale disparity between FDEM field measurements and soil samples is a less significant issue than using inverted EC data for empirical estimation of soil properties. In the case that sampling is not an option, a universal deterministic approach can be followed at the expense of FDEM calibration data, e.g., through ERT. A comprehensive sensitivity analysis of this approach shows that no possible combination of modeling parameters could currently lead to reasonable predictions of water content for the studied sites. Particularly, the pedophysical model of Eq. (2) should be reworked and validated for soil samples above 30 % of clay content, and a pedotransfer function for ECs would help to ease its implementation. Additionally, the minimization method implemented in geophysical inversion turned out to be of key importance. Thus, further work is required to improve the deterministic modeling predictions.

Code availability

All computer code used for the data processing and modeling in this study is openly available at the GitHub repository (https://github.com/SENSE-UGent/FDEM_quantitative_soil) and archived with a DOI at Zenodo (https://doi.org/10.5281/zenodo.13385389, Mendoza Veirana, 2024b).

Data availability

The dataset supporting the findings of this study, including all field measurements and laboratory analyses, is publicly available and archived at Zenodo under the DOI https://doi.org/10.5281/zenodo.13465721 (Mendoza Veirana, 2024a). This dataset is our own original data collected and processed as part of this research.

Interactive computing environment (ICE)

All data processing, analysis, and modeling were performed using Jupyter Notebooks running in a Python 3 environment. The open-source software and scripts are compatible with standard scientific Python libraries (NumPy, SciPy, pandas, scikit-learn, matplotlib), ensuring full reproducibility. The notebooks are available in the project's GitHub repository at https://github.com/SENSE-UGent/FDEM_quantitative_soil (SENSE-UGent, 2024).

Author contributions

PS, MV, and GB designed the surveys and carried them out. MV, GB, and PS developed the model code and performed the simulations. EV contributed to statistical formal analysis. JV and WC gave writing advice. MV prepared the manuscript with contributions from all co-authors.

Competing interests

The contact author has declared that none of the authors has any competing interests

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

Text readability and code performance for an earlier draft were boosted using generative AI (Chat-GPT, OpenAI).

Financial support

This research has been supported by the Bijzonder Onderzoeksfonds UGent (grant no. 01IO4120).

Review statement

This paper was edited by Nunzio Romano and reviewed by Damien Jougnot and two anonymous referees.

References

Altdorff, D., von Hebel, C., Borchard, N., van der Kruk, J., Bogena, H. R., Vereecken, H., and Huisman, J. A.: Potential of catchment-wide soil water content prediction using electromagnetic induction in a forest ecosystem, Environ. Earth Sci., 76, 111, https://doi.org/10.1007/s12665-016-6361-3, 2017. 

Bárdossy, A. and Singh, S. K.: Robust estimation of hydrological model parameters, Hydrol. Earth Syst. Sci., 12, 1273–1283, https://doi.org/10.5194/hess-12-1273-2008, 2008. 

Binley, A.: Tools and Techniques: Electrical Methods, in: Treatise on Geophysics, 233–259, Elsevier, https://doi.org/10.1016/B978-0-444-53802-4.00192-5, 2015. 

Binley, A. and Kemna, A.: DC Resistivity and Induced Polarization Methods, edited by: Rubin, Y. and Hubbard, S. S., Hydrogeop., 50, 129–156, Springer Netherlands, https://doi.org/10.1007/1-4020-3102-5_5, 2005 

Blanchy, G., Saneiyan, S., Boyd, J., McLachlan, P., and Binley, A.: ResIPy, an intuitive open source software for complex geoelectrical inversion/modeling, Comput. Geosci., 137, 104423, https://doi.org/10.1016/j.cageo.2020.104423, 2020. 

Boaga, J.: The use of FDEM in hydrogeophysics: A review, J. Appl. Geophys., 139, 36–46, https://doi.org/10.1016/j.jappgeo.2017.02.011, 2017 

Brus, D. J.: Sampling for digital soil mapping: A tutorial supported by R scripts, Geoderma, 338, 464–480, https://doi.org/10.1016/j.geoderma.2018.07.036, 2019. 

Callegary, J. B., Ferre, T. P. A., and Groom, R. W.: Vertical spatial sensitivity and exploration depth of low-induction-number 35 electromagnetic-induction instruments, Vadose Zone J., 6, 158–167, https://doi.org/10.2136/vzj2006.0120, 2007. 

Cavka, D., Mora, N., and Rachidi, F.: A Comparison of Frequency-Dependent Soil Models: Application to the Analysis of Grounding Systems, IEEE T. Electromagn. C., 56, 177–187, https://doi.org/10.1109/TEMC.2013.2271913, 2014. 

Ciesielski, H., Sterckeman, T., Santerne, M., and Willery, J. P.: A comparison between three methods for the determination of cation exchange capacity and exchangeable cations in soils, Agronomie, 17, 9–16, https://doi.org/10.1051/agro:19970102, 1997a. 

Ciesielski, H., Sterckeman, T., Santerne, M., and Willery, J. P.: Determination of cation exchange capacity and exchangeable cations in soils by means of cobalt hexamine trichloride. Effects of experimental conditions, Agronomie, 17, 1–7, https://doi.org/10.1051/agro:19970101, 1997b. 

Cong-Thi, D., Dieu, L. P., Caterina, D., De Pauw, X., Thi, H. D., Ho, H. H., Nguyen, F., and Hermans, T.: Quantifying salinity in heterogeneous coastal aquifers through ERT and IP: Insights from laboratory and field investigations, J. Contam. Hydrol., 262, 104322, https://doi.org/10.1016/j.jconhyd.2024.104322, 2024. 

Corwin, D. L. and Lesch, S. M.: Application of soil electrical conductivity to precision agriculture: Theory, principles, and guidelines, Agron J., 95, 455–471, https://doi.org/10.2134/agronj2003.4550, 2003. 

Dimech, A., Isabelle, A., Sylvain, K., Liu, C., Cheng, L., Bussière, B., Chouteau, M., Fabien-Ouellet, G., Bérubé, C., Wilkinson, P., Meldrum, P., and Chambers, J.: A multiscale accuracy assessment of moisture content predictions using time-lapse electrical resistivity tomography in mine tailings, Sci. Rep.-UK, 13, 20922, https://doi.org/10.1038/s41598-023-48100-w, 2023. 

Doolittle, J. A. and Brevik, E. C.: The use of electromagnetic induction techniques in soils studies, Geoderma, 223–225, 33–45, https://doi.org/10.1016/j.geoderma.2014.01.027, 2014. 

Friedman, S. P.: Soil properties influencing apparent electrical conductivity: A review, Comput. Electron. Agr., 46, 45–70, https://doi.org/10.1016/j.compag.2004.11.001, 2005. 

Fu, Y., Horton, R., Ren, T., and Heitman, J. L.: A general form of Archie's model for estimating bulk soil electrical conductivity, J. Hydrol., 597, 126160, https://doi.org/10.1016/j.jhydrol.2021.126160, 2021. 

Glover, P. W. J.: Geophysical Properties of the Near Surface Earth: Electrical Properties, Treatise on Geophysics, 89–137, Elsevier, https://doi.org/10.1016/B978-0-444-53802-4.00189-5, 2015. 

Hanssens, D., Delefortrie, S., Bobe, C., Hermans, T., and De Smedt, P.: Improving the reliability of soil EC-mapping: Robust apparent electrical conductivity (rECa) estimation in ground-based frequency domain electromagnetics, Geoderma, 337, 1155–1163, https://doi.org/10.1016/j.geoderma.2018.11.030, 2019. 

Hanssens, D., Waegeman, W., Declercq, Y., Dierckx, H., Verschelde, H., and Smedt, P. D.: High-resolution surveying with small-loop requency-domain electromagnetic systems: Efficient survey design and adaptive processing, IEEE Geosci. Remote. S., 9, 167–183, https://doi.org/10.1109/MGRS.2020.2997911, 2020. 

Huang, H., SanFilipo, B., Oren, A., and Won, I. J.: Coaxial coil towed EMI sensor array for UXO detection and characterization, J. Appl. Geophys., 61, 217–226, https://doi.org/10.1016/j.jappgeo.2006.06.005, 2007. 

Jougnot, D., Jiménez-Martínez, J., Legendre, R., Le Borgne, T., Méheust, Y., and Linde, N.: Impact of small-scale saline tracer heterogeneity on electrical resistivity monitoring in fully and partially saturated porous media: Insights from geoelectrical milli-fluidic experiments, Adv. Water Resour., 113, 295–309, https://doi.org/10.1016/j.advwatres.2018.01.014, 2018. 

Lavoué, F., van der Kruk, J., Rings, J., André, F., Moghadas, D., Huisman, J. A., Lambot, S., Weihermuller, L., Vanderborght, J., and Vereecken, H.: Electromagnetic induction calibration using apparent electrical conductivity modelling based on electrical resistivity tomography, Near Surf. Geophys., 8, 553–561, https://doi.org/10.3997/1873-0604.2010037 2010. 

Lipinski, B. A., Sams, J. I., Smith, B. D., and Harbert, W.: Using HEM surveys to evaluate disposal of by-product water from CBNG development in the Powder River Basin, Wyoming, Geophysics, 73, B77–B84, https://doi.org/10.1190/1.2901200, 2008. 

Logsdon, S. D., Green, T. R., Seyfried, M., Evett, S. R., and Bonta, J.: Hydra Probe and Twelve-Wire Probe Comparisons in Fluids and Soil Cores, Soil Sci. Soc. Am. J., 74, 5–12, https://doi.org/10.2136/sssaj2009.0189, 2010. 

Longmire, C. and Smith, K.: A Universal Impedance for Soils, Mission Research Corporation, Report DNA 3788T, 1975. 

Lück, E., Guillemoteau, J., Tronicke, J., Rummel, U., and Hierold, W.: From point to field scale – Indirect monitoring of soil moisture variations at the DWD test site in Falkenberg, Geoderma, 427, 116134, https://doi.org/10.1016/j.geoderma.2022.116134, 2022. 

McLachlan, P., Blanchy, G., and Binley, A.: EMagPy: Open-source standalone software for processing, forward modeling and inversion of electromagnetic induction data, Comput. Geosci., 146, 104561, https://doi.org/10.1016/j.cageo.2020.104561, 2021. 

McNeill, J. D.: Electromagnetic terrain conductivity measurement at low induction numbers, Technical Note 6, p. 13, Geonics Limited, 1980. 

Mendoza Veirana, G. M.: Data set. Quantitative soil characterization using frequency domain electromagnetic induction method in heterogeneous fields, Zenodo [data set], https://doi.org/10.5281/zenodo.13465721, 2024a. 

Mendoza Veirana, G. M.: orbit-ugent/FDEM_quantitative_soil: 0.1 (Version 0.1), Zenodo [code], https://doi.org/10.5281/zenodo.13385389, 2024b. 

Mendoza Veirana, G. M. and De Smedt, P.: orbit-ugent/Pedophysics: First release 0.1 (Version 0.1), Zenodo [code], https://doi.org/10.5281/ZENODO.13465700, 2024. 

Minasny, B. and McBratney, A. B.: A conditioned Latin hypercube method for sampling in the presence of ancillary information, Comput. Geosci., 32, 1378–1388, 2006. 

Minsley, B. J., Smith, B. D., Hammack, R., Sams, J. I., and Veloski, G.: Calibration and filtering strategies for frequency domain electromagnetic data, J. Appl. Geophys., 80, 56–66, https://doi.org/10.1016/j.jappgeo.2012.01.008, 2012. 

Moghadas, D. and Badorreck, A.: Machine learning to estimate soil moisture from geophysical measurements of electrical conductivity, Near Surf. Geophys., 17, 181–195, https://doi.org/10.1002/nsg.12036, 2019. 

Pannell, D.: Sensitivity analysis of normative economic models: Theoretical framework and practical strategies, Agr. Econ., 16, 139–152, https://doi.org/10.1016/S0169-5150(96)01217-0, 1997. 

Rentschler, T., Werban, U., Ahner, M., Behrens, T., Gries, P., Scholten, T., Teuber, S., and Schmidt, K.: 3D mapping of soil organic carbon content and soil moisture with multiple geophysical sensors and machine learning, Vadose Zone J., 19, e20062, https://doi.org/10.1002/vzj2.20062, 2020. 

Rhoades, J. D., Raats, P. A. C., and Prather, R. J.: Effects of Liquid-phase Electrical Conductivity, Water Content, and Surface Conductivity on Bulk Soil Electrical Conductivity, Soil Sci. Soc. Am. J., 40, 651–655, https://doi.org/10.2136/sssaj1976.03615995004000050017x, 1976. 

Romero-Ruiz, A., Linde, N., Keller, T., and Or, D.: A Review of Geophysical Methods for Soil Structure Characterization: Geophysics and soil structure, Rev. Geophys., 56, 672–697, https://doi.org/10.1029/2018RG000611, 2018. 

Saltelli, A. and Annoni, P.: How to avoid a perfunctory sensitivity analysis, Environ. Modell. Softw., 25, 1508–1517, https://doi.org/10.1016/j.envsoft.2010.04.012, 2010. 

SENSE-UGent: FDEM_quantitative_soil, GitHub, https://github.com/SENSE-UGent/FDEM_quantitative_soil (last access: 3 July 2025), 2024. 

Sheets, K. R. and Hendrickx, J. M. H.: Noninvasive Soil Water Content Measurement Using Electromagnetic Induction, Water Resour. Res., 31, 2401–2409, https://doi.org/10.1029/95wr01949, 1995. 

Stevens Water Monitoring Systems.: The Hydra Probe® soil sensor: Comprehensive Stevens Hydra Probe user's manual, instrument manual, Stevens Water Monitoring Systems, Beaverton, OR, 2008. 

Terry, N., Day-Lewis, F. D., Lane Jr., J. W., Johnson, C. D., and Werkema, D.: Field evaluation of semi-automated moisture estimation from geophysics using machine learning, Vadose Zone J., 22, e20246, https://doi.org/10.1002/vzj2.20246, 2023. 

Tibshirani, R., Friedman, J., and Hastie, T.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer, 2001. 

van der Kruk, J., von Hebel, C., Brogi, C., Sarah Kaufmann, M., Tan, X., Weihermüller, L., Huisman, J. A., Vereecken, H., and Mester, A.: Calibration, inversion, and applications of multiconfiguration electromagnetic induction for agricultural top- and subsoil characterization, SEG Technical Program Expanded Abstracts 2018, Soc. Expl. GEOP., 2546–2550, https://doi.org/10.1190/segam2018-2965257.1, 2018. 

Van Ranst, E. and Sys, C.: Eenduidige legende voor de digitale bodemkaart vanVlaanderen (Schaal 1:20 000), Laboratorium voor Bodemkunde Krijgslaan 281/S8 9000 Gent, University of Ghent, https://ecopedia.s3.eu-central-1.amazonaws.com/pdfs/550.pdf (last access: 6 June 2025), 2000.  

Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S. J., Brett, M., Wilson, J., Millman, K. J., Mayorov, N., Nelson, A. R. J., Jones, E., Kern, R., Larson, E., Carey, C., Polat, I., Feng, Y., Moore, E., and VanderPlas, J.: SciPy 1.0: Fundamental algorithms for scientific computing in Python, Nat. Methods, 17, 261–272, https://doi.org/10.1038/s41592-019-0686-2, 2020. 

Wait, J. R.: Geo-Electromagnetism, Elsevier, https://doi.org/10.1016/B978-0-12-730880-7.X5001-7, 1982. 

Download
Short summary
This study explores two methods for predicting soil properties using the frequency domain electromagnetic induction technique in Belgium. We compared deterministic models, which often require extensive data adjustments, to empirical models. Our findings suggest that empirical models are generally more effective for soil analysis, although each method has its limitations. This research helps improve soil property prediction, crucial for agriculture and environmental management.
Share