Warning

You're reading an old version (v5.0) of this documentation. If you want up-to-date information, please have a look at master.

Data Uncertainties

The data uncertainties, often referred to as the error bars, are critical in EM inversion since the data misfit and inversion search directions are both scaled by the data uncertainties. The error bars are just as important as the data! While most EM data and uncertainties are estimated as complex numbers in data processing codes, the complex values are often transformed into other forms for inversion (e.g., real, imaginary, amplitude, phase, log10 amplitude, apparent resistivity etc). Here we provide a table showing how to correctly scale the data uncertainties for each data type supported by MARE2DEM. For the interested reader, we also provide a brief review of the uncertainty propagations for the various transforms. For more details and discussion of some of the advantages and disadvantages of the various data scalings, see [WCK15].

Data Type

Data

Standard Error

complex

z=x+iy=aeiϕ

δz=2σ

real or imaginary

x,y

δx=δy=σ

amplitude

a=|z|

δa=σ

log10(amplitude)

l=log10|z|

δl=1ln(10)σa

apparent resistivity

ρa=1ωμ|z|2

δρa=2ρa|z|σ

log10(app.resist.)

log10ρa

δlog10ρa=1ln(10)δρaρa

phase

ϕ=arctan(y/x)

δϕ=σa=12δρaρa

Note that MARE2DEM inverts phase with units degrees, so multiply the phase and its uncertainty defined above by 180/π.

Complex Data

The complex data z=x+iy can be related to amplitude a and phase ϕ (in radians) using Euler’s formula:

z=x+iy=aeiϕ=a(cosϕ+isinϕ),

where

x=acosϕ,y=asinϕ.

Likewise, the amplitude and phase can be found from the complex components using

a=x2+y2,tanϕ=yx,ϕ=arctanyx.

When computing ϕ, you should use the atan2(y,x) function since it returns the four-quadrant phase whereas the atan(y/x) function is restricted to πϕπ.

Also, when dealing with real data, we generally convert phase into degrees:

ϕ(degrees)=180πϕ(radians)

Real and Imaginary

For complex data z=x+iy where x is the real component and y is the imaginary component, the standard error σ is generally assumed to be isotropic so that σ=δx=δy. Isotropic error means that z has a circle of uncertainty around it in the complex plane as shown in Fig. 43.

_images/complex_errors.png

Fig. 43 Isotropic uncertainty in the complex plane

For CSEM data, z is the complex electric or magnetic field at a given frequency. For MT data, z is a component of the MT impedance tensor (or a component of the tipper vector) at a given frequency. In both cases, complex data z is what is output from the EM response estimation code that process the time series data.

MARE2DEM allows us to invert the data as a complex quantity, but often we find it more intuitive to look at the data in amplitude and phase form, or log(amplitude) and phase. In the case of MT, the apparent resistivity form is much easier to comprehend visually than the complex data z. When inverting CSEM and MT responses with a large dynamic range (i.e. spanning multiple orders of magnitude), there are some stability advantages to treating the data as log10(amplitude); see [WCK15].

Given a data scaling transform, we have to know the associated data uncertainty transforms. Suppose q is the transformed version of the original complex data z. Thus qq(x,y). For example, q could be the amplitude, phase, log(amplitude), apparent resistivity, a polarization ellipses parameter, etc. For computing misfits during modeling and inversion, we need to know the uncertainty δq. This can be found using the standard method for linear propagation of errors, which uses a first-order Taylor’s series expansion. Assuming the variables x and y are independent with standard error σ, the first-order variance propagation formula is

(4)δq2=(qx)2σ2+(qy)2σ2.

The sections below show how to apply this for the various data types supported in MARE2DEM.

Amplitude

The amplitude a of complex variable z is:

a=|z|=x2+y2,

Thus we have

ax=xa,ay=ya,

Using eq. (4), we find

δa2=(xa)2σ2+(ya)2σ2,=σ2.

The standard error is therefore

δa=σ.

In Fig. 43 you can see that this result makes sense since δa=σ=δx=δy.

log10 Amplitude

Warning

The standard error of log scaled data is not the log of the linear standard error!

Denote the log of the amplitude as variable l:

l=log10|z|=log10(a)=log10(x2+y2).

Using the chain rule we have

lx=laax=1ln(10)axa=xln(10)a2,

with a similar expression for the y derivative.

Using eq. (4), the variance of l is then:

δl2=(xln(10)a2)2σ2+(yln(10)a2)2σ2,=σ2ln(10)2a2.

The standard error of l is then:

δl=1ln(10)σa=0.4343δaa.

This shows that the standard error for the log10 scaled amplitude is just a scaled version of the relative amplitude error δaa. So if we say the data has 1% error in amplitude, the standard error of the log10 amplitude is then 0.004343.

It can also be helpful to think of visualizing the error bars on a log10 scaled plot of the amplitudes. The formula above shows that if the data have a fixed relative error, the error bars on the plot will all have the same vertical length, regardless of the data values.

Apparent Resistivity

The MT apparent resistivity ρa is found from the impedance Z=x+iy as

ρa=1ωμ|z|2=1ωμ(x2+y2)

Using the chain rule we have

ρax=2xωμ

and similarly for the y derivative. The standard error of ρa is then:

δρa=2σ|Z|ρa

and the relative error in apparent resistivity is

δρaρa=2σ|Z|

which is twice the relative error in the impedance, where the factor of two is due to the apparent resistivity being proportional to the square of the impedance magnitude.

Phase

The phase ϕ of z in degrees is:

ϕ=180πarctan(z)=180πarctan(yx),

with partial derivatives

ϕx=180π11+(yx)2yx2,=180πyx2+y2.

and

ϕy=180πxx2+y2.

Using eq. (4), we find

δϕ2=(180πyx2+y2)2σ2+(180πxx2+y2)2σ2,=(180π)2σ2a2

The standard error of ϕ is then:

δϕ=180πσa=180πδaa=180π12δρaρa

So for 1% relative error in amplitude (δaa), the corresponding phase error δϕ is then 0.573º. Note also that the last term on the right shows the phase error for MT data will be half the relative error in apparent resistivity.

In Fig. 43 you can see that this result makes sense, since small changes in ϕ will scale with δaa.

A word of warning

The uncertainty propagation analyses above relied on a first-order Taylor’s series expansion that implicitly assumes σ<<|z|, in other words it assumes that the uncertainty is much smaller than the data amplitude. The formulas for transformed uncertainties break down when the uncertainty grows too large. See [WCK15] for an in-depth analysis of the break-down. Hence, data with large errors (say greater than 50% or so) should generally be omitted from the inverted data set given issues with propagating the error; furthermore very noisy data probably doesn’t help the inversion resolve conductivity so it can have little useful value, that is unless the noisy data are all you have to work with.