Forecasting the flow of groundwater requires a hydrostratigraphic model, which describes the architecture of the subsurface. State-of-the-art multiple-point statistical (MPS) tools are readily available for creating models depicting subsurface geology. We present a study of the impact of key parameters related to stochastic MPS simulation of a real-world hydrogeophysical dataset from Kasted, Denmark, using the snesim algorithm. The goal is to study how changes to the underlying datasets propagate into the hydrostratigraphic realizations when using MPS for stochastic modeling. This study focuses on the sensitivity of the MPS realizations to the geophysical soft data, borehole lithology logs, and the training image (TI). The modeling approach used in this paper utilizes a cognitive geological model as a TI to simulate ensemble hydrostratigraphic models. The target model contains three overall hydrostratigraphic categories, and the MPS realizations are compared visually as well as quantitatively using mathematical measures of similarity. The quantitative similarity analysis is carried out exhaustively, and realizations are compared with each other as well as with the cognitive geological model.

The results underline the importance of geophysical data for constraining MPS simulations. Relying only on borehole data and the conceptual geology, or TI, results in a significant increase in realization uncertainty. The airborne transient electromagnetic SkyTEM data used in this study cover a large portion of the Kasted model area and are essential to the hydrostratigraphic architecture. On the other hand, the borehole lithology logs are sparser, and 410 boreholes were present in this study. The borehole lithology logs infer local changes in the immediate vicinity of the boreholes, thus, in areas with a high degree of geological heterogeneity, boreholes only provide limited large-scale structural information. Lithological information is, however, important for the interpretation of the geophysical responses. The importance of the TI was also studied. An example was presented where an alternative geological model from a neighboring area was used to simulate hydrostratigraphic models. It was shown that as long as the geological settings are similar in nature, the realizations, although different, still reflect the hydrostratigraphic architecture. If a TI containing a biased geological conceptualization is used, the resulting realizations will resemble the TI and contain less structure in particular areas, where the soft data show almost even probability to two or all three of the hydrostratigraphic units.

Geological models are important from both a societal and economic perspective, since they are used to locate essential natural resources, such as freshwater, oil, metals, and rare earth minerals. Additionally, they are used in risk assessment related to natural hazards, such as earthquakes, sinkholes, volcanic eruptions, and landslides. Building 3-D models depicting real-world subsurface geology is no trivial task. Information from multiple sources is required, i.e., conceptual geological understanding, geological information, lithology logs, and geophysical data. Such data are sparse, uncertain, and redundant. Dataset gaps force geoscientists to make uncertain predictions or estimates, which carries over into the resulting geological model. During the modeling procedure, such problems are dealt with as best as possible. Gaps in knowledge will render the resulting model uncertain, and quantifying such uncertainty is essential to making better use of the models and to making better predictions.

A common approach for building geological models is cognitive modeling (e.g., Jørgensen et al., 2013; Royse, 2010). Here, the datasets containing borehole lithology logs and geophysical models are co-interpreted by a professional with experience in the fields of geoscience, geophysics, and geological modeling, with a relevant regional conceptual model in mind. This modeling approach is deterministic and results in a single model realization. These specialists are trained in assessing the uncertainty of the underlying structures, and qualitative uncertainty estimates are often made on the structural model (e.g., indicating different levels of uncertainty in different subparts of the model domain). However, qualitative uncertainty estimates are difficult to carry over into the subsequent analysis, and the effect of the uncertainty of the geological model can therefore not be quantified in the resulting forecasts. If the forecasts are based on a single geologic model, the prediction does not encase the full complexity of the problem. Alternatively, if the model uncertainty can be quantified, it enables the option to include it in the forecast. However, quantifying the uncertainty in a cognitive modeling approach is difficult and tedious (Seifert et al., 2012). Another approach is stochastic modeling using multiple-point statistics (MPS) methodologies, where ensembles of models are produced, e.g., Comunian et al. (2012), Ferré (2017), He et al. (2016), Okabe and Blunt (2005), and Pirot (2017). MPS provides a framework which can integrate geophysical and borehole information, as well as conceptual geological information via a so-called training image (TI). Multiple model realizations are created from the dataset, and the resulting model ensemble reflects the uncertainty related to the underlying datasets and overall modeling procedure.

Studying model uncertainty using MPS results in numerous sets of model
ensembles. Visual comparison of such a large number of realization ensembles
is tedious and subjective but offers an overall understanding of the
geological realism of the models (Barfod et al., 2018). Therefore,
quantitative measures of similarity are desirable. Barfod et al. (2018)
present a comparison of 3-D hydrostratigraphic models using the modified
Hausdorff distance (

Some types of forecasts are related to groundwater flow and accurate geological models are crucial to accurate predictions of hydraulic flow, since subsurface hydraulic flow is largely controlled by geological heterogeneity (e.g., Feyen and Caers, 2006; Fleckenstein et al., 2006; Fogg et al., 1998; Gelhar, 1984; LaBolle and Fogg, 2001; Zhao and Illman, 2017). Geological units, however, contain additional complexities not related to hydrologic units; therefore, the concept of hydrostratigraphic units is very useful. A detailed definition of hydrostratigraphic classification is given by Maxey (1964).

Stochastic geological modeling procedures like the MPS methodology apply two overall types of data, i.e., geophysical data and borehole data. Associated with these data types are the definitions of hard and soft data. Typically, hard data are considered certain information without an associated uncertainty, while soft data are uncertain information, which can be associated with an uncertainty. Geophysical data are typically considered soft data (Strebelle, 2002). Geophysical data are spatially dense and provide a smeared image of the overall subsurface geology. Resolution decreases with depth, especially beyond a specific depth, which is dependent on the geophysical method. Geophysical instruments portray bulk physical properties of the subsurface. Although geophysical data provide spatially dense information, it is not possible to exhaustively sample the subsurface. The density of the geophysical data will affect the final uncertainty. The raw geophysical data goes through a processing and modeling step, where the raw data are translated into geophysical models. During this step incorrect measurements, due to instrument error or interference, are identified and removed, further decreasing the geophysical information density. Such incomplete data can either be reconstructed or used as is during modeling. Another consideration in regards to modeling of geophysical data is the choice of inversion scheme and thereby the choice of a priori information (e.g., Ellis and Oldenburg, 1994; Tarantola and Valette, 1982). Here, several approaches can be taken, which yield different geophysical models. A common inversion scheme for airborne electromagnetic (AEM) data, such as SkyTEM data, is the spatially constrained inversion (SCI) (Viezzoli et al., 2008). However, this inversion approach does not represent the subsurface properly, e.g., layer boundaries are smeared and extreme values are not represented properly. A so-called sharp inversion scheme, suggested by Vignoli et al. (2015), tackles such issues. Therefore, the choice of inversion scheme influences the hydrostratigraphic model and should be considered as an integral step in the hydrostratigraphic modeling process.

The other overall type of data source is borehole lithology logs, which are commonly considered to be ground truth or hard data (e.g., Gunnink and Siemon, 2015; Tahmasebi et al., 2012). However, lithology logs are also uncertain. Generally, the uncertainty of borehole lithology logs relates to a number of parameters, such as drilling methods, the frequency with which sediment samples are collected precision with which the location is measured, the purpose of the borehole, and the choice of drilling contractor – see Barfod et al. (2016) and He et al. (2014) for more details. The resolution of borehole lithology logs is especially dependent on the sampling method. If a core is extracted for the entirety of the borehole, the resolution is, in principal, unlimited. However, this is expensive. It is more common to use either an auger drill, a rotary drill, or a cable tool, which yields a relatively limited resolution, compared to core drilling, depending on how samples are collected and handled.

We present a study of the uncertainty related to stochastic MPS modeling of hydrogeophysical data. The goal is (i) to understand the consequences of modifying the MPS setup to reflect some of the biases related to real-world hydrogeophysical datasets and (ii) to study the propagation of uncertainty in the resulting ensembles of stochastic hydrostratigraphic models. An essential choice during MPS modeling is, for example, the choice of TI, or how to incorporate borehole and/or geophysical data. A hydrogeophysical dataset, consisting of an airborne transient electromagnetic survey, numerous lithological borehole logs, and 3-D categorical TIs, is used to assess how different choices during MPS modeling influence the resulting uncertainty of the hydrostratigraphic model ensemble.

The Kasted area is located northwest of Aarhus, Denmark (Fig. 1a), and has also been presented by Barfod et al. (2018), Marker et al. (2017), and Høyer et al. (2015). The regional geology of the Kasted area is dominated by a Quaternary buried valley complex with complex abutting relationships between the individual valleys. The buried valleys are infilled with a combination of till and glacial meltwater deposits. The valleys are incised into the substratum, which consists of hemipelagic clay. The regional geology has been described in detail by Høyer et al. (2015), who created a detailed cognitive geological model of the area.

An important geological feature of the Danish subsurface is the buried tunnel valleys (e.g., Jørgensen and Sandersen, 2006; Sandersen et al., 2009). The geological heterogeneity varies considerably across Denmark and can in some places be quite complicated, such as in the Egebjerg area (Jørgensen et al., 2010). In the Kasted area, the main buried valleys are clearly outlined in the geophysical dataset thanks to the significant resistivity contrasts between the infill of the buried valleys and the underlying Paleogene clay (Høyer et al., 2015). Therefore, the Kasted survey is ideal for studying the uncertainty related to stochastic hydrostratigraphic modeling using MPS methods.

The survey covers an area of 45 km

The Kasted survey area and
resistivity–hydrostratigraphic
relationship histograms.

The multiple-point statistics framework stems from the general geostatistics framework. Here, multiple-point (MP) information from a training image is used to condition simulations to probable geological patterns (Journel and Zhang, 2007). The TI thus provides a conceptual geological understanding of a given area and can be viewed as a database containing probable geological patterns, which are used to condition the MPS simulation. The choice of TI is an important step in any MPS setup and influences the realization results, as will be illustrated. The TI does not need to carry locally accurate information – i.e., the TI does not need to spatially or geographically overlap with real-world geological units – and can be purely conceptual in nature. Together with the TI, it is also possible to use geophysical datasets for constraining MPS simulations, resulting in realizations that reflect real-world regional geology. Today, MPS is a widely used tool, which is used in a variety of geoscience fields, including, but not limited to, reservoir modeling (e.g., Okabe and Blunt, 2004; Strebelle and Journel, 2001), hydrology (e.g., Le Coz et al., 2011; Hermans et al., 2015; Høyer et al., 2017), and geological modeling (e.g., de Iaco and Maggio, 2011).

The MPS method used in this paper is known as the single normal equation simulation (snesim) framework (Strebelle, 2002), and is implemented in the Stanford Geostatistical Modeling Software, or SGeMS. The snesim framework allows for simulating a real-world categorical geological model using a TI, constrained using soft geophysical data and hard borehole data. The snesim algorithm scans the entire TI, ahead of simulation, and stores the MP information contained in the TI in a search-tree database. The MP information can then be retrieved from the database during simulation. The integration of soft geophysical data for constraining the simulations is achieved by utilizing the tau model, which will be described in detail in Sect. 3.2 (Journel, 2002; Krishnan, 2004). Here, the continuous soft data variable needs to be translated into a probability grid, describing the probability of finding a given geological unit based on the geophysical data; see Barfod et al. (2018). In order to guarantee the reproduction of geological patterns at all scales, snesim uses the multiple-grid formulation, presented by Tran (1994).

In the field of geoscience, we are always dealing with incomplete datasets, since we cannot sample the subsurface exhaustively. Several approaches exist for dealing with incomplete datasets, of which two general approaches can be defined. A common approach is to reconstruct incomplete datasets using geostatistical tools (e.g., Goovaerts, 1997; Mariethoz and Renard, 2010), which means that during the hydrostratigraphic modeling process no information is present in the dataset gaps. However, it is important to emphasize that the reconstructed information is not as valuable as the actual measured geophysical information. The other common approach is to just use the incomplete dataset as is. This means that no information is present in the dataset gaps during the hydrostratigraphic modeling process, which, depending on the modeling method, might result in large uncertainties.

In this study the MPS method called direct sampling (DS) is used for stochastic reconstruction of incomplete datasets (Mariethoz and Renard, 2010). The DS method uses the dataset we wish to reconstruct both as a simulation grid and a TI. This means that the patterns that are present in the incomplete dataset are inserted into the simulation grid before reconstruction. It is, according to Mariethoz and Renard (2010), important that the patterns we wish to reconstruct are actually present in the incomplete dataset, since we are borrowing the patterns from the TI, or incomplete dataset, to stochastically reconstruct the dataset. If the patterns are not present in the incomplete dataset they will, simply put, not be inferred in the reconstructed dataset. Provided that enough information on the overall patterns is available in the incomplete dataset, the DS method is a straightforward approach for reconstructing incomplete datasets.

Combining information from different sources is a frequent challenge in
subsurface modeling. A fundamental challenge of the research conducted in
this paper was to combine conditional probabilities from different sources.
In this paper we used the common tau model approach (Journel, 2002). The tau
model generally combines the probability values from different sources using
Bayes' theorem and a set of

We will now briefly introduce the tau model; for more details see
Journel (2002). Firstly, we define a data event,

Then the following distance ratio is computed using the tau model
expression:

Comparing a large set of extensive 3-D models is a common problem encountered in stochastic MPS modeling. A common approach is visual comparison, which is not an objective or quantitative comparison method. Each equiprobable hydrostratigraphic model in this study contains 1 187 823 cells. Furthermore, a total of 400 MPS realizations were computed, Table 1, which makes it difficult to visually compare modeling results. This, along with advances in stochastic modeling tools such as MPS, motivated Tan et al. (2014) to develop a framework in which multiple 2-D or 3-D realizations can be compared quantitatively. The idea is to use a distance measure, which measures the distance between two realizations. Realizations which are geometrically similar have small distance values, while dissimilar realizations have a large distance value. The comparison techniques in this study are based on the principles presented by Tan et al. (2014). In this study the distances between individual realizations are based on the Euclidean distance transforms (Maurer et al., 2003). The usage of EDT as a measure for similarity will be described in more detail below. A full distance matrix is computed containing distances between each individual realization for all the different cases. The resulting 400 by 400 distance matrix is then interpreted by itself.

A 2-D example of the Euclidean distance transforms (EDT) as a
measure for the similarity between categorical MPS realizations. In this
example a set of 50 realizations, from the basic modeling setup, are compared
based on the differences in EDT for sand and gravel units.

The visual comparison can be helped by creating so-called ensemble mode ratio
maps, or EMR maps. The idea is to create a summary map portraying the mode
ratio of a given ensemble of models, ranging between

The EMR values are then computed for each grid cell using Eqs. (6) and (7),
which, simply put, is the occurrence ratio of the mode category of a given
ensemble containing a given number of realizations,

The hydrostratigraphic realizations are categorical and contain three
hydrostratigraphic units. Comparing two realization grids, they first need to
be transformed from a categorical grid into continuous Euclidean distance
grids by using EDT (Maurer et al., 2003). The two 3-D EDT grids are then
compared by calculating the average difference in the respective grids.
Similar images have a small average EDT distance, and dissimilar images have
a large average EDT distance. The EDT computes the Euclidean distances for
all locations of a binary grid, i.e., a grid containing only two states (codes
0 and 1). The EDT map is simply the Euclidean distance of the medium depicted
by the state code 1, i.e., for grid cell at location

A schematic diagram presenting the conversion of the lithological logs into probability logs for the three hydrostratigraphic units: sand and gravel, glacial clay, and hemipelagic clay. Step 1: the lithology log is translated into a hydrostratigraphic log. Step 2: the hydrostratigraphic logs are resampled according to the vertical modeling grid intervals and an interval probability is calculated for each of the hydrostratigraphic units.

The

To illustrate the

From this point forward we leave the 2-D example behind and will from here
on only consider

The average Euclidean distance difference,

The Kasted dataset used in this study is comprised of a dense geophysical dataset acquired using the SkyTEM system (Sørensen and Auken, 2004), borehole lithology logs and a cognitive geological model (Høyer et al., 2015). We show how uncertainty related to resistivity data, measured with the SkyTEM system, and borehole lithology logs influences the hydrostratigraphic modeling realizations. Two readily available MPS tools are showcased. The first tool is the direct sampling method for reconstruction of incomplete datasets (Mariethoz and Renard, 2010). The other MPS tool is the single normal equation simulation, which is used for stochastic hydrostratigraphic modeling (Strebelle, 2002). The MPS modeling setup is similar to the one presented by Barfod et al. (2018). However, the goal of this study is different. The study is divided into a total of six cases, or eight sub-cases, which are designed to study how perturbations of the underlying MPS setup affect the hydrostratigraphic realizations using snesim and study the propagation of uncertainties into the hydrostratigraphic models. First the basic case (case 0), from which the other cases are perturbed, uses a hydrostratigraphic simplification of the 3-D cognitive model of the Kasted area as a TI, the gap-filled SkyTEM data present as smooth inversion models as soft data, and the borehole lithological logs as hard data. Then, in case 1, the TI is substituted by two other TIs. The cases 2 and 3 are related to the SkyTEM data, where the incomplete SkyTEM data and sSCI inversion models are used, respectively. The influence of the boreholes is studied in case 4 either leaving out boreholes as hard data or changing them into soft data. Finally, in case 5 the SkyTEM data are not used. In other words, one of the overall goals of this study is to improve the Kasted model by using stochastic ensemble modeling to quantify the uncertainty of the model, such as suggested by Ferré (2017) and Pirot (2017). Further details on the individual cases follow in the coming sections and Table 1 summarizes each case. First of all, the model discretization and parameterization as well as construction of hard and soft data grids are described.

The Kasted model covers an area of 12 km by 7 km, discretized on a modeling grid with 229 by 133 by 39 cells, containing a total of 1 187 823 cells. Each cell has a size of 50 m by 50 m by 5 m. It is parameterized into three hydrostratigraphic units, listed as follows.

Sand and gravel: a combination of coarse lithological units, including sand till, meltwater sand, gravel and pebbles of glacial origin, late glacial freshwater sand, and postglacial freshwater sand.

Glacial clay: this category contains silty and sandy clays, including clay till and meltwater clay of glacial origin.

Hemipelagic clay: a combination of fine grained conductive clays, containing the extensive and homogeneous hemipelagic Paleogene and Oligocene clays found in Denmark.

These three categories serve the purpose of simplifying the geology of the Kasted area. The Kasted survey lithology logs reveal a combination of 59 geological categories, which are translated into a set of hydrostratigraphic logs (Step 1, Fig. 3) using these hydrostratigraphic categories. Similarly, the 42 geological units in the cognitive geological model are divided into the three abovementioned categories (Fig. 4a). The vertical proportions of the three-category hydrostratigraphic Kasted model can be viewed in Fig. 5a.

An overview of the training images (TIs) which are used during MPS simulation.
A horizontal slice and vertical cross section is presented for each TI, portraying
the hydrostratigraphic architecture;

The borehole lithology logs need to be assigned to a 3-D grid, which is carried out in three overall steps. The first step is to translate the borehole lithology logs into hydrostratigraphic logs using the above mentioned three categories; Step 1, Fig. 3. The second step is then to divide the hydrostratigraphic logs into intervals identical to the vertical intervals of the model grid. At each resampled interval a probability value is directly calculated for each hydrostratigraphic unit; Step 2, Fig. 3. The probability is simply the percentage of the given unit, which is present within the interval. Finally, the last step is to assign the hydrostratigraphic probabilities to a grid. The probability values are assigned to the grid cell in which the given hydrostratigraphic log is present. On the rare occasion that multiple logs are present within a given cell, the probabilities are combined accordingly to one representative probability value. The end result is a grid containing the borehole probability values of each hydrostratigraphic unit: sand and gravel, glacial clay, and hemipelagic clay. It is common to view borehole lithology logs as hard information, or ground truth. The borehole probability grid can therefore be translated into a hard data grid, by assigning the most probable hydrostratigraphic unit in each grid cell.

The 1-D SkyTEM resistivity models are assigned to a 3-D grid, identical to the modeling grid. The first step is to fill all grid cells containing a resistivity model. This is carried out using block kriging and results in an incomplete resistivity grid of block average resistivities (Fig. 6a). The second and final step is to stochastically reconstruct the incomplete resistivity grid using DS stochastic reconstruction (Mariethoz and Renard, 2010) (Fig. 7). The reconstruction procedure was originally presented by Barfod et al. (2018); however, we have made some improvements for this study. Originally, a simple kriging estimation approach was used to assign the resistivity models to a 3-D modeling grid. This resulted in an incomplete resistivity grid, which contained resistivity information not only pertaining to grid cells containing a SkyTEM sounding; i.e., the resistivity grid had already been partly reconstructed in the proximity of the geophysical soundings. To avoid this, block kriging estimation was used instead. The block kriging method is also a variogram-based estimation method, which estimates the average value of a rectangular block (Goovaerts, 1997). For more details on reconstructing incomplete resistivity grids see Barfod et al. (2018).

The SGeMS snesim framework utilizes the tau model for soft data conditioning (Journel, 2002), which requires the translation of resistivity grids into probability grids. This requires information on the regional resistivity–hydrostratigraphic relationship. Such knowledge is not always available, but if enough boreholes and electromagnetic geophysical data are available, the framework for studying the resistivity–hydrostratigraphic relationship, presented by Barfod et al. (2016), can be used to create a set of histograms. The resistivity–hydrostratigraphic histograms, Fig. 1b, are compiled from available hydrostratigraphic logs and SkyTEM resistivity models and are presented in more detail in Barfod et al. (2016, 2018). The estimated histograms (Fig. 1b) are then used to directly translate each resistivity value, in a given resistivity grid, into three probabilities, one for each hydrostratigraphic unit.

The general MPS workflow can be summarized in seven overall steps as follows:

Using block kriging, the SkyTEM resistivity models are assigned to a 3-D grid identical to the Kasted model grid.

The incomplete resistivity grids (Fig. 6a) are stochastically reconstructed using direct sampling (Fig. 7a). The result is an ensemble of 50 equiprobable reconstructed resistivity grids.

The reconstructed resistivity grids are translated into probability grids using the resistivity–hydrostratigraphic relationship histograms (Fig. 1b).

The borehole lithology logs are translated into hydrostratigraphic logs; Step 1, Fig. 3.

The hydrostratigraphic logs are resampled and three probability values, one for each hydrostratigraphic unit, are directly computed at each resampled interval; Step 2, Fig. 3.

The borehole probabilities are assigned to a grid identical to the cognitive Kasted model grid.

The borehole probability grid is translated into a hard data grid, by assigning the most likely hydrostratigraphic unit to each grid cell.

A total of 400 realizations are created, with 50 realizations per sub-case (Table 1). In snesim a random number seed needs to be manually selected for each realization to initialize the random number generator and in particular define a random path through the modeling grid. The random seed convention chosen in this paper was to apply the same random seed vector to each sub-case. The vector contains 50 linearly increasing random seed numbers, ensuring consistency when comparing realizations from the individual sub-cases.

An overview table showing information on the MPS cases along with information on the number of realizations for each case or sub-case, and a brief description of each case.

The basic modeling setup is designed to act as the base from which all other cases are built. The different sub-cases are simply modified versions of the basic modeling setup, each designed to study how modification to the base setup relates to hydrostratigraphic MPS modeling. The basic modeling setup uses the borehole data as hard information, SCI models with smooth inversion constraints as soft data, and the cognitive hydrostratigraphic Kasted model as a TI (Fig. 4a), for which the global proportions are listed in Table 2 and vertical proportions are displayed in Fig. 5a.

The basic modeling setup uses the actual cognitive geological model of the Kasted survey area as a TI (Høyer et al., 2015). In Denmark, it is common practice to build 3-D cognitive geological models of the near-subsurface. Many cognitive models exist and are publicly available. Such models can easily be adapted and used as 3-D TIs to simulate new survey areas, provided the geological settings are similar. Case 1 is divided into two sub-cases. The first sub-case, case 1a, uses the basic setup, but, in place of the cognitive Kasted model, the cognitive geological model of the Egebjerg area (Fig. 1a) is used as a TI (Fig. 4c). The geologic setting in Egebjerg is relevant since it is partly dominated by a buried valley complex (Jørgensen et al., 2010).

The Egebjerg model consists of a total of 72 geological units which are
categorized accordingly to reflect the three hydrostratigraphic units of the
Kasted hydrostratigraphic model. Egebjerg additionally contains undesired
features, such as local Miocene complexes. Two such local geological
environments, which do not reflect the geological setting of the Kasted area,
are present. One is found south of the buried valley complex and the other
to the west. By cropping the model and rotating it 90

The global proportions related to each of the three TIs presented in Fig. 4.

The second sub-case, case 1b, utilizes a purely conceptual TI. The conceptual TI is created by using a set of hyperbolic secant functions to populate a 3-D matrix and is purely mathematical in nature. The conceptual TI can be seen in Fig. 4b and is designed to have three overall buried valleys eroded into a hemipelagic clay substratum. There are two narrow and shallow glacial clay valleys, and a broad and deep sand and gravel valley. One of the glacial clay valleys is a younger valley that is eroded into the older sand and gravel valley and they run roughly parallel to each other. The last glacial clay valley is almost orthogonal to the other valleys, and also erodes into the sand and gravel valley. The upper part of the TI contains a cover layer of glacial clay (Fig. 5b). The simple conceptual TI is designed to contain the main geological architecture of the Kasted area, namely the buried valley complexes. The sand and gravel valley, trending west–northwest to east–southeast, was chosen on purpose to study what happens when oversimplified and smooth MP information is added to a TI. The global proportions of the conceptual TI are consistent with the other TIs, while the vertical proportions for sand and gravel and glacial clay units show a significantly different pattern (Fig. 5b).

The vertical proportions of the three training images for each of
the hydrostratigraphic categories, where

During reconstruction of the resistivity grid, it is assumed that the patterns in the incomplete dataset contain information regarding the content of the dataset gaps. This is true only when the incomplete grid contains a sufficient amount of data. Sufficient, in this case, means that the parameter space is sampled densely enough to reflect the patterns we wish to reconstruct (Mariethoz and Renard, 2010). If the grid is too sparse, then limited or no information which can help reconstruct missing patterns is present. Signs of mediocre data density are seen in the incomplete grids (Fig. 6a). Artifacts from the DS reconstruction are present in the completed resistivity grids. The resistive valley to the west in the horizontal slices and vertical cross sections in Fig. 7a and b reveal a striated pattern. An alternative to reconstructing the resistivity grid beforehand is to use the incomplete resistivity grids for simulation, meaning no information is present in the resistivity dataset gaps. Grid cells containing a resistivity model are translated into three probability values using the resistivity–hydrostratigraphic relationship histograms (Fig. 1b, 6b–d). Areas without soft resistivity data rely on the TI during simulation, emphasizing the fact that no actual information is present between soundings. The overall setup is identical to the basic setup; the only difference is the reconstructed soft data grids are interchanged for the incomplete soft data grid (Fig. 6b–d).

A presentation of the incomplete resistivity grid. Each
grid is portrayed as a horizontal slice at 20 m b.s.l. and a vertical
cross section intersecting at YUTM 6 230 100 m.

The choice of inversion algorithm results in different SkyTEM resistivity models. The purpose of this case is to study how using sSCI (Vignoli et al., 2015) models influences the modeling results. A common inversion approach is SCI, where a smooth regularization is used (Constable et al., 1987). Such resistivity models have a smooth transition from resistive to conductive features, and vice versa. Geological layer boundaries are rarely smooth in nature, meaning such soft transitions in resistivities seldom reflect reality. Furthermore, extreme resistivity values are not presented correctly in the smooth model inversions. Vignoli et al. (2015) propose an alternative SCI approach, employing a minimum gradient support regularization term instead. Such sSCI models produce resistivity models with sharp layer boundaries and a better representation of extreme values. The setup in case 3 is identical to the basic setup, except that the SCI models are interchanged for sSCI models. The DS grid reconstruction is then conducted on the sSCI models, which are then translated into probability grids. Finally, these grids are used as soft data for simulation using the snesim method.

The sharp resistivity models are different from the smooth models, but no particularly sharp layer boundaries are reflected in the reconstructed resistivity grid (Fig. 7b).

An overview of the key differences between reconstructing the
resistivity grid using smooth and sharp inversion resistivity models. Each
grid is portrayed as a horizontal slice at 20 m b.s.l. and a vertical
cross section intersecting at YUTM 6 230 100 m.

One of the obvious differences is found in the resistivity patterns of the sand and gravel valley to the far west of the survey area. The valley itself is not significantly different; however, the small resistive patch, west of the large valley, is more pronounced in the sharp model and has an overall more pronounced fingerprint (Fig. 7b). The sharp resistivity models better estimate the true bulk resistivity values of specific geological units, such as the resistive patch accentuated here. The ensemble standard deviation grid, Fig. 7c and d, shows a general reduction in the ambiguity of the reconstructed sharp resistivity models. This is clear from the reduction areas with large standard deviation, shown in red colors, which are overall reduced in size.

This case is dedicated towards how the borehole data are handled and how they influence the hydrostratigraphic modeling results. The hard borehole data are normally sparse, relative to geophysical data. Boreholes are commonly considered ground truth since they directly sample the subsurface sediments or petrological units. This case is divided into two sub-cases. The first sub-case, case 4a, portrays what happens when hard data are not included in the snesim simulation. The model setup is therefore identical to the basic MPS setup, but without including the borehole data.

The second sub-case, case 4b, incorporates the borehole lithology logs as
soft data. The certainty of a lithological log varies depending on a range of
factors, e.g., drilling method, the purpose of the borehole, and sampling
frequency (e.g., Barfod et al., 2016; He et al., 2014). The hydrostratigraphic
probability logs, introduced in the basic modeling setup (Step 2, Fig. 3),
are utilized in place of the hard borehole grid. The boreholes are assigned a
lateral footprint, so the information is not only found at the borehole
locations. The borehole footprint is assigned by creating a grid where the
borehole probability values have been estimated in a radius of 200 m around
each borehole using simple kriging with a search radius of 200 m and a mean
of

A visual representation of the tau model procedure for combining the
soft resistivity and borehole grids. Each grid is portrayed as a horizontal
slice at 20 m b.s.l. and a vertical cross section intersecting at YUTM
6 230 100 m.

The final case, case 5, illustrates the consequences of not including the soft SkyTEM resistivity information in the MPS simulation routine. The basic setup is simply run without the inclusion of soft data; i.e., the setup only uses the cognitive Kasted TI and hard borehole information.

For each of the presented cases two hydrostratigraphic realizations are
presented (Fig. 9), along with an EMR map (Fig. 10). The EMR maps show the
occurrence ratio of the most likely simulated category for each grid cell
based on 50 realizations. The two realizations and EMR map of the basic
modeling setup, Figs. 9a and 10a, reveal the same overall trends as the
cognitive geological model, Fig. 4a: namely the western sand and gravel
valley striking

The case 1a realizations (Fig. 9b, 10b), which use the Egebjerg TI (Fig. 4c), show the same overall trends as in the basic modeling setup. The subset of buried valleys mentioned above is present; however, an obvious difference is the coarse and block-like appearance of the case 1a realization ensemble. This appearance is similar to the block-like appearance of the Egebjerg TI (Fig. 4c). Furthermore, the horizontal slice of the realizations and EMR map reveals that the glacial-clay-dominated area to the east has a generally larger occurrence ratio and is thus more certain. The realizations are clearly influenced by the choice of TI, especially when case 1b is also considered (Fig. 9c). The hydrostratigraphic realizations of case 1b (Fig. 9c, 10c) clearly depict the same overall buried valley trends, but the valleys in the central part of the model are largely filled with the opposite of the valley filling hydrostratigraphic units. Furthermore, the occurrence ratio seems quite low in certain areas, such as to the south of the model, which means the ambiguity has increased. Finally, the realizations also reveal an absence of small-scale patterns, which corresponds to the conceptual TI that only contains homogenous hydrostratigraphic units.

The importance of reconstructing the incomplete resistivity grid is seen in case 2 (Fig. 9d, 10d). The two realizations in Fig. 9d show the main buried valley features, e.g., the western sand and gravel valley. However, the hydrostratigraphic units are sporadic, especially in areas with no data. Patches of sand and gravel and glacial clay are randomly spread throughout the presented horizontal slice and vertical cross section (Fig. 9d). The EMR map also reveals an increase in low occurrence ratios in areas without soft data (Fig. 10d).

The uncertainty related to the choice of geophysical modeling procedure is portrayed by case 3. Here, snesim realizations are constrained to sharp resistivity models. Generally, the realizations (Fig. 9e) are quite similar to the basic modeling setup realizations (Fig. 9a). However, a key difference is the significant reduction or absence of patches of glacial clay in the homogeneous hemipelagic clay. In fact only one patch is found in the first realization (Fig. 9e) in the southwest corner, while it is not present in the second realization, and the EMR map further reveals a reduction of the occurrence ratios generally, especially along the southern margin of the realizations (Fig. 10e).

Case 4 shows the influence that the hard data have on the hydrostratigraphic
realizations in two sub-cases: case 4a, where snesim simulations are
run without hard data; and case 4b, where the borehole data are treated as
soft information. Fig. 9f and g shows two hydrostratigraphic realizations
without hard data and with soft borehole data, respectively. These
realizations do not differ significantly from the basic modeling setup
realizations and in fact are quite similar. One key difference is the central
glacial clay valley striking

The final case, case 5, illustrates the importance of the SkyTEM soft data. The snesim simulations are run using only hard data and the cognitive geological model as a TI. The output realizations (Fig. 9h) portray smooth and large-scale hydrostratigraphic units. The hydrostratigraphic architecture of the buried valleys is not reflected. However, the sand and gravel valley, to the west, does seem to protrude slightly in the realizations (Fig. 9h) and the EMR map reveals a significant decrease in the occurrence ratio and thus an increase in the ambiguity of the model ensemble (Fig. 10h).

Each case is displayed by two realizations: realization
no. 1 of 50 and realization no. 30 of 50. Each realization is portrayed as a
horizontal slice at 20 m b.s.l. and a vertical cross section intersecting at
YUTM 6230100 m.

A presentation of the ensemble mode ratio (EMR) maps, computed
for the different case ensembles of hydrostratigraphic models. Each EMR map
is presented as a horizontal slice centered on 20 m b.s.l. and a vertical
cross section intersecting at YUTM 6 230 150 m.

The distances between each of the 400 realizations have been computed using Eqs. (8) and (10). The full distance matrix is presented in Fig. 11a. The distances between each realization and the cognitive geological models have also been computed and plotted in Fig. 11b. To aid the interpretation of the distance matrix and distances to the cognitive model a summary table, Table 3, has been compiled.

The basic modeling setup constitutes a common snesim setup, with the geophysical data as soft data, boreholes as hard data, and a 3-D geological conceptualization encased in a TI. The ensemble average variability is computed according to the equations presented by Barfod et al. (2018), and the resulting ensemble average variability is 10.1 m, with an average distance to the cognitive model of 24.3 m. This means that the Euclidean distance mismatch between the individual realizations related to basic modeling setup is 10.1 m, and the average difference in Euclidean distance to the nearest active cell between the realizations and the cognitive model was 24.3 m.

The 3-D geological conceptualization contained in the TI influences the final hydrostratigraphic realizations as illustrated in case 1, which is divided into two sub-cases: case 1a and case 1b. In case 1a, using a 3-D cognitive geological model from the Egebjerg area as a TI for hydrostratigraphic simulation increases the average distance to the cognitive model to 24.9 m (Fig. 11a, Table 3). Furthermore, the average variability has increased to 13.6 m (Fig. 11b, Table 3). The other sub-case revolves around using an entirely conceptual geological model as a TI. The conceptual TI was designed to reflect the overall geology, yet still contains some bias. The results reflect the bias, with increased distances to the cognitive geological model, which are now centered on 25.6 m (Fig. 11a, Table 3). The ensemble variability has increased to 14.8 m (Fig. 11b, Table 3).

The importance of proper reconstruction of the incomplete resistivity grid is illustrated in case 2, where the incomplete resistivity grid was used for simulation. The resulting realization ensemble has a large ensemble variability centered on 24.1 m (Fig. 11a, Table 3). The distance to the cognitive geological model is also large, with an average value of 33.1 m (Fig. 11b, Table 3).

In case 3 the sharp SCI models were used for simulation in place of the smooth SCI models. The realizations related to case 3 were the closest to the cognitive model with an average value of 21 m (Fig. 11b, Table 3). The variability of the case 3 realization ensemble, i.e., the distances between the realizations pertaining to case 3, is small, with an average value of 9.4 m – see Table 3. Recalling the raw hydrostratigraphic realizations (Fig. 9e) and the EMR map (Fig. 10e), the large reduction in distances could partly be related to the removal of non-hemipelagic clay units along the southern border of the model and an overall increase in confidence along the southern and southeastern border of the model.

The influence of the borehole lithology logs on the hydrostratigraphic realizations is reflected in case 4, which is divided into two sub-cases. In the first sub-case, case 4a, the borehole information is not used as hard data, and the realizations are created only using soft geophysical data and the Kasted TI. However, the borehole data are still used for creating the resistivity–hydrostratigraphic histograms (Fig. 1b), which are used for creating the probability grids. The ensemble average variability is 10.7 m (Fig. 11a, Table 3) and the average distance to the cognitive model is 24.3 m (Fig. 11b, Table 3). In the second sub-case, case 4b, the boreholes are used as soft information to reflect the uncertainty of the borehole information. The ensemble average variability is 10.9 m, and the average distance to the cognitive model is 24.3 m. This illustrates how the snesim realizations are not particularly sensitive towards the sparse borehole hard data.

Not including the geophysical soft data in the snesim simulations, case 5, resulted in the largest ensemble average variability of 40.0 m (Fig. 11a, Table 3). The average distance between case 5 realizations and the cognitive model was 59.3 m. This means that the realizations of case 5 are the most different from the rest of the realizations. The snesim realizations are sensitive towards not including the geophysical data, or using the incomplete resistivity grid. This underlines the importance of the geophysical soft data in relation to hydrostratigraphic modeling using the snesim methodology.

A presentation of the average Euclidean distance calculations.

A summary table showing the average distance value for each
50 by 50 square representing a given case in the distance matrix
(Fig. 11a). The final column, labeled
Distance

The cognitive geological model was created based on smooth SkyTEM resistivity models and lithological logs (Høyer et al., 2015) as well as the conceptual geological understanding of the area. The model was simplified from a full 3-D geological model containing a total of 42 unique geological units to a hydrostratigraphic model containing only 3 hydrostratigraphic units. The cognitive geological model, although detailed and extensive, is not the true geological model. The ensemble realizations should not directly reflect the cognitive model, yet the cognitive model can be thought of as a reference point in modeling space, which we would prefer our models to resemble.

The results revealed the importance of the SkyTEM dataset. Not including the resistivity models in the MPS simulations, case 5, yielded realizations which were both the least similar to the cognitive geological model and had the largest variability between the individual realizations. Including the incomplete resistivity grid, case 2, improved the realization results compared to not including them at all. Yet, the ensemble variability was large and resulting realizations were ranked second least similar to the cognitive geological model. The realization ensemble which was closest to the cognitive geological model belongs to case 3. Here, the resistivity grid was reconstructed from the sharp SCI models that, in this case, increased the fingerprint of resistive extreme values, which in turn results in less ambiguous reconstructed resistivity grids; compare Fig. 7c and d. It should be noted that the usage of block kriging for assigning the sharp resistivity models to the modeling grid resulted in smoothing of sharp vertical boundaries otherwise found in sSCI models. These three cases together reveal the importance of the geophysical soft data when using the snesim setup presented in this study.

In relation to case 5, it can be argued that, even though the SkyTEM resistivity models are not used as soft data, they are still included indirectly since the TI, or cognitive geological model, was created using smooth SkyTEM resistivity models. However, the realizations related to case 5 revealed an ensemble of realizations, which did not replicate the overall geological architecture, implying the importance of using the SkyTEM models as soft data.

On the other hand, the cases related to studying the sensitivity towards borehole information, case 4a and case 4b, revealed that the large-scale hydrostratigraphic architecture was not changed significantly. The distance measure used in this study observes similarities or dissimilarities of large-scale hydrostratigraphic architecture and is not sensitive towards local changes in small-scale patterns. The amount of geophysical information is relatively large, meaning the relative influence of (few) borehole data becomes less significant. This does not mean that the borehole data are not important; they both contain locally accurate information and are used to estimate the regional resistivity–hydrostratigraphic relationship (Fig. 1b). In other surveys, where the contrast between geophysical and lithological information is smaller, the importance of the borehole data will likely increase. In relation to this study, such small-scale changes are insignificant. Yet if the realizations are to be used for flow simulations or predictions on a smaller scale, such smaller scales might suddenly have an important impact on prediction accuracy. Additionally, if such small-scale patterns are important, the size of the model grid cells should be smaller to accommodate simulations of these variations. Discretizing hydrostratigraphic and groundwater models with relatively small grid cells can be CPU demanding, depending on the total number of grid cells.

In case 4b, the borehole data were used as soft information as in the study by Høyer et al. (2017). This was done since boreholes are associated with uncertainty related to a number of factors, as described above. Therefore, the soft borehole probability values derived during the assigning of the boreholes to the modeling grid are combined with the SkyTEM-based probability grids using the tau model. This approach enables the borehole probability to alter the final probability grid, while still conditioning the SkyTEM data. Combining the information rather than letting the borehole data count as ground truth, i.e., hard data, allows the borehole data to influence the realizations, especially if the soft borehole information disagrees with the soft geophysical data (Fig. 8).

The conceptual geological understanding has always been considered an integral part of geological modeling. In this case the conceptual geological understanding is implemented via the TI, which makes it easy to change the underlying conceptual geological understanding of a given model. A total of three different TIs were used for simulation in this study: the Kasted, Egebjerg, and conceptual TIs. The results showed that models simulated using the Egebjerg TI, case 1a, portrayed the same overall hydrostratigraphic architecture. This opens for the possibility of using 3-D cognitive geological models as TIs for new survey areas, as long as the geological settings are similar. One key difference between the models, however, was the more block-like and coarse nature of the realizations using the Egebjerg TI, due to the coarseness of the Egebjerg TI. An important observation is that, when a spatially dense and extensive geophysical dataset, such as SkyTEM, is present, the snesim realizations are not as sensitive towards the choice of TI when the TI is relatively similar to the expected scenario. However, as illustrated by case 1b, a TI which has significantly differing vertical proportions in comparison with the actual model, which in this case is assumed to be the cognitive model (Fig. 5a, b), causes a conflict between the TI and the soft data. In such a case the TI dominates the realizations and simulates improbable hydrostratigraphic units in places where the soft data reveal low probability for those specific units. In Figs. 9c and 10c, it can be seen that the glacial clay valley, both present in the soft data variable (Fig. 8b) and the cognitive Kasted geological model (Fig. 4a), is represented as sand and gravel. This leads to the conclusion that one needs to pay attention to the construction of the TI, as also witnessed in the study by Høyer et al. (2017). Furthermore, the large-scale and homogenous nature of the hydrostratigraphic architecture in the conceptual TI results in realizations which reflects the homogeneity. In comparison with the realizations based on the TIs derived from cognitive models, the realizations contain much fewer small-scale patterns.

Small-scale patterns are present in the realizations although they are not at the same degree present in the TIs. This can be explained by the fact that the simulations are probabilistic in nature and are based on random processes. At the beginning of the simulation a random path is drawn so that the simulation grid is filled by visiting each grid cell only once. The small-scale patterns are partly due to the hard data that are inserted into the simulation grid before the simulation commences, which is excluded from the simulation path. As the grid is filled out, the hard borehole data might suggest a certain category but the soft data suggest another category. As the grid is filled out the overall category from the soft data dominates and, if the random path visits the grid cells near the hard data point towards the end of the random path, then we are left with a small intercalation (Hansen et al., 2018). The intercalations are also inherent in the simulations without hard data, which is mainly due to process randomness related to how the snesim algorithm draws from a cumulative density function (Strebelle, 2002).

The Kasted model and TI are influenced by non-stationarity, which has not been dealt with in the MPS setup. Even though the models are influenced by non-stationarity, the simulations result in models which overall resemble the cognitive model, e.g., cases 1–4. However, once the geophysical data are removed in case 5, the resulting MPS models are increasingly random and are heavily influenced by non-stationarity. It is important to note that the increasing amounts of soft geophysical data generally decrease the effects of non-stationarity, due to the increased conditioning of the soft data.

The reconstruction of the resistivity grid is an important step of the MPS setup presented in this study. This was illustrated in case 2, where the incomplete resistivity grid was used instead of the reconstructed resistivity grid, resulting in larger realization variability and distance to the cognitive geological model. These realizations could have been improved by increasing the prior knowledge provided to snesim before simulation. One such option is to provide so-called vertical proportions, in place of solely the target global proportions. The global proportions simply give a percentage fraction of the different hydrostratigraphic units in the outcome realizations. The vertical proportions are defined for each simulation grid layer and determine the proportions as a function of depth. This makes sense if the different units in the realizations are clearly linked to geological units, which in turn have clear stratigraphic layering. In our case, this would have impacted the realizations by not allowing the presence of hemipelagic clay at the top of the model. Furthermore, sand and gravel and glacial clay would not be allowed at the bottom of the model. However, vertical proportions were not used in any other cases and were therefore not used in case 2. The usage of vertical proportions for conditioning could also improve the results of case 5.

Part of the considerations of this study was to utilize the DeeSse
code for direct sampling simulation. Whereas Chugunova and Hu (2008)
present an MPS method for constraining a categorical simulation using
continuous auxiliary data, DS is much more flexible and allows multivariate
simulations reproducing spatial statistics within and between variables,
which can be categorical or continuous. DS requires the construction of a
multivariate TI, and it is important that every variable reflects the spatial
relationship to be modeled. In our context, one could envision creating a
bivariate TI consisting of a hydrostratigraphic model and a continuous
auxiliary variable reflecting an AEM dataset. It is no trivial task, and, to
our knowledge, no studies have been presented where an auxiliary variable is
created for a 3-D AEM dataset. However, Lochbühler et al. (2014)
presented a generalized example on creating auxiliary variables for
tomographic images, i.e., 2-D images. Generally, the requirements for the
geophysical modeling procedure are twofold. Firstly, the categorical TI needs
to be populated with resistivity values, e.g., as in Christensen et al. (2017)
where a Bayesian Markov chain Monte Carlo (MCMC) algorithm
is used to create 1-D resistivity models drawn from a posterior probability distribution. This is no straightforward
task. Secondly, the populated resistivity model then ideally needs to be
forward modeled using full 3-D forward modeling code, which is
computationally expensive. Alternatively, approximate 1-D forward modeling is
also an option. The correct system parameters of the AEM instrument and data
processing parameters have to be taken into account. Thirdly, the synthetic
data obtained by forward modeling must be inverted using the same procedure
as the field dataset. To our knowledge, such usage of an auxiliary variable
for constraining soft geophysical models is not widespread within the domain
of AEM geophysical methods. In this study, DS was only used to reconstruct
the incomplete AEM dataset (a univariate case with the dataset as a TI, and
hard data; see Sect. 3.2) and the snesim method was used for
hydrostratigraphic modeling, due to its usage of the

The study presented by Barfod et al. (2018) used the alternative modified Hausdorff distance measure for comparing realizations. Due to the computational burden of the method, it was difficult to create exhaustive distance computations, i.e., where all information from individual realizations is used. The usage of differences in EDT of binary translations of the categorical realizations for comparing the individual realizations proved to be a more computationally feasible approach. In this paper an efficient algorithm for computing the EDT was used (Maurer et al., 2003). This computationally advantageous approach for computing the distance between two realizations allows for a full analysis of the realizations. Each realization is then compared based on each of the hydrostratigraphic categories and on the entire 3-D objects, resulting in a detailed comparison. The resulting distance matrix (Fig. 11a) was able to differentiate between the realizations pertaining to the different cases. The random number seed between cases was chosen so the first realization of each case has the same random seed and the second realization has the same seed. This can be seen in the distance matrix (Fig. 11a), where off-diagonal cases have smaller distance values along the diagonal within the given 50 by 50 sub-matrix. An example is the 50 by 50 sub-matrix between the basic setup and case 1a, where the diagonal is clearly marked by lower distances relative to the remaining sub-matrix.

A hydrogeophysical dataset from Kasted in Denmark was used for stochastic hydrostratigraphic simulation using the snesim algorithm. The main goal of this study was to improve our understanding of ensemble hydrostratigraphic modeling variability related to stochastic MPS modeling. The study was divided into eight sub-cases designed to reflect the impact related to key components of the hydrostratigraphic modeling setup, i.e., the TI, borehole lithology logs, and SkyTEM resistivity models. The results revealed that the hydrostratigraphic realizations were sensitive first and foremost to the geophysical dataset due to its extensive nature. Not including the geophysical data in the realizations resulted in an average Euclidean distance variability of 40 m and a distance to the cognitive model of 59 m, which was, by far, the largest distance of all realizations. Furthermore, the geophysical modeling procedure influences the resulting realizations. It was shown that choosing so-called sharp inversion models (sSCI), in place of smooth inversion models (SCI), resulted in a realization ensemble which had similar distance-based variabilities, 9.4 and 10.1 m, respectively. However, using sSCI models decreased the distance to the cognitive geological model from 24.3 to 21.6 m. The choice of a TI containing a relevant geological conceptualization is important. The cognitive Egebjerg model was used as a TI to simulate the hydrostratigraphic Kasted model, which yielded similar realizations to the case where the cognitive Kasted model was used as a TI. The Egebjerg TI contained relevant geological architecture, but if a conceptual TI is introduced containing significantly different vertical proportions, the resulting realizations will reflect these differing vertical proportions. Finally, it was seen that the borehole lithology logs did not significantly influence the realizations. The lithology logs only carry information in the immediate vicinity of the borehole and are sparse in comparison to the resistivity data. The boreholes therefore only have a minor influence on the realizations. The comparison measures used here mainly compare the overall large-scale architecture components of the realizations and do not reflect small-scale changes. In relation to this study the usage of the lithology logs as hard data does not show a significant impact on the MPS realizations. However, if the hydrostratigraphic models are used for predicting groundwater flow, the boreholes might be important. However, it should be mentioned that the resistivity–hydrostratigraphic histograms, which are used extensively in this research, are created from the borehole information.

Both the geophysical models and the borehole data used in
this research paper are publicly available. The borehole data can be found in
the national borehole database of Denmark
(

We would like to thank Senior Research Engineer Celine Scheidt of Stanford University for pointing us in the direction of using Euclidean distance transforms for comparing MPS realizations. Three anonymous reviewers and the editor are thanked for valuable comments that improved the clarity of the manuscript. This study is supported by HyGEM (Integrating geophysics, geology, and hydrology for improved groundwater and environmental management, project no. 11-116763). The funding for HyGEM is provided by the Danish Council for Strategic Research. Edited by: Jesús Carrera Reviewed by: three anonymous referees