.. _sect_scratch_folder:

Scratch Folder for CSEM Inversions
------------------------------------

A scratch folder (or directory) for temporary file storage needs to be specified when running CSEM inversions. This can be done in one of two ways: with the command line argument ``-scratch`` (see :ref:`sect_command_line_args`) or by specification in the ``mare2dem.settings`` file (see :ref:`sect_settings_file`). A scratch folder specified in the settings file takes precedence over one given on the command line. If neither is specified, MARE2DEM uses the default scratch folder ``/tmp``. For example, starting MARE2DEM with:

``mpirun -n 12 MARE2DEM -scratch /local Demo.0.resistivity``

will use ``/local`` for temporary storage of the scratch files.

.. Note::

    Since huge volumes of scratch data can be written, it is recommended to use a scratch folder on the local hard drive of each compute node in the cluster, since that gives the fastest speed. If a network-mounted directory (such as your home directory) is used instead, all of that data must be passed across the cluster network, which can result in much slower file I/O and can potentially overwhelm the cluster network. Most clusters have the directories ``/tmp`` or ``/local`` available for local file storage. When in doubt, check the user documentation for your cluster or ask someone from the IT team.

Scratch File Rationale
~~~~~~~~~~~~~~~~~~~~~~

When inverting CSEM data (i.e., data from dipole sources), MARE2DEM writes scratch files to disk to temporarily store the large arrays formed by the wavenumber domain sensitivity functions:

.. math::

    \frac{\partial \hat F_i(k_x)}{\partial \sigma(y,z)}

where :math:`{\hat F}_i` contains the wavenumber domain electric or magnetic field for the *i*\ th requested data response at wavenumber :math:`{k}_x`. These are computed serially over a discrete set of wavenumbers and files are written for each wavenumber. Once all wavenumbers have been computed, the sensitivities are Fourier transformed to the *x* spatial domain.

For a given modeling task running on a single processing core, the total number of CSEM sensitivity function elements is

.. math::

    n_s = {2 n_{kx} n_r n_p}

where :math:`{n_{kx}}` is the number of wavenumbers (MARE2DEM uses 30 by default), :math:`n_r` is the number of responses, :math:`{n_{p}}` is the number of free parameters, and the factor 2 arises from the negative wavenumbers also being computed by MARE2DEM. :math:`n_r` depends on the number of receivers :math:`{n_{rx}}` and transmitters :math:`{n_{tx}}` used in the parallel data decomposition, as well as on the particular requested responses and the pairs of transmitters and receivers given in the input data file's data table. For example, if the data table has all receiver-transmitter pairs and all *x,y,z* components of the electric and magnetic fields, then :math:`n_r` will be

.. math::

    n_r = { 6 n_{rx} n_{tx}}

For most real CSEM data sets, :math:`n_r` can be much smaller than this since often not all receiver-transmitter pairs have data, for example at long offsets where the transmitter is too far away from the receiver. Further, usually only the inline horizontal electric field and possibly the crossline magnetic and vertical electric fields are inverted (since the other components are nominally nil).
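To make this counting concrete, here is a minimal sketch in Python (purely illustrative and not part of MARE2DEM; the data-table layout, field-component label, and offset cutoff are made up for the example) that tallies :math:`n_r` when only the inline electric field is kept and long offsets are dropped:

.. code-block:: python

    import itertools

    def count_responses(data_table):
        """Each (transmitter, receiver, component) entry counts as one response."""
        return len(data_table)

    # 20 transmitters and 20 receivers spaced 1 km apart along the line.
    tx_positions = [1000.0 * i for i in range(20)]
    rx_positions = [1000.0 * i for i in range(20)]

    # Keep only the inline electric field, and drop pairs beyond an 8 km
    # offset (an arbitrary cutoff chosen for this illustration).
    data_table = [
        (tx, rx, "inline_E")
        for tx, rx in itertools.product(tx_positions, rx_positions)
        if abs(tx - rx) <= 8000.0
    ]

    n_r = count_responses(data_table)
    print(n_r)  # 268 here, far below the all-pairs, all-components bound 6*20*20 = 2400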
To give an idea of the total size of the sensitivity functions above, suppose a given problem inverts only the inline electric field and there are 100,000 free parameters (:math:`{n_{p}=10^5}`). Further, suppose :math:`{n_{rx}=20,n_{tx}=20}` and all receiver-transmitter pairs have data. Combined with MARE2DEM's default wavenumber sampling (:math:`{n_{kx}=30}`), this gives :math:`{n_r = 400}` and thus

.. math::

    n_s = {2\times 30\times 400 \times 100000 = 2.4 \times 10^9}

Since the sensitivities use complex double precision values requiring 16 bytes of storage per value, this example requires 36 GB of storage; if kept in memory rather than in files, this could exceed the available memory of a cluster node. While most modern cluster nodes have at least 128 GB of memory, consider a 24-core node running 24 independent modeling tasks as part of the parallel data decomposition approach. When all 24 cores are running, a total of 864 GB of memory would be needed, which likely exceeds the available memory on many readily accessible cluster nodes (other than the most state-of-the-art systems). Hence MARE2DEM writes the wavenumber sensitivity functions to scratch space on the fly so that memory is conserved. Further, the sensitivity data are written to separate files containing no more than 10 responses per wavenumber. Thus, once all the wavenumbers have been computed and written to scratch files, the Fourier transformation to the *x* domain can proceed serially over these file subsets, requiring only a modest amount of memory. Another subtle aspect is that MARE2DEM's memory usage during CSEM inversion depends on :math:`{n_r}`, and so the parallel data decomposition settings for :math:`{n_{rx}}` and :math:`{n_{tx}}` can be adjusted to reduce memory usage as needed.
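The arithmetic above is easy to reproduce. The short Python sketch below (again purely illustrative and not part of MARE2DEM; it reports GiB, which rounds to roughly the GB figures quoted in the text) evaluates :math:`n_s` and the resulting storage for the worked example:

.. code-block:: python

    def sensitivity_storage_gib(n_kx, n_r, n_p):
        """Return the element count n_s and its storage footprint in GiB."""
        n_s = 2 * n_kx * n_r * n_p   # factor of 2 for the negative wavenumbers
        gib = 16.0 * n_s / 2.0**30   # complex double precision = 16 bytes/value
        return n_s, gib

    # Worked example from the text: inline E field only, all 20 x 20 pairs have data.
    n_kx = 30        # MARE2DEM's default number of wavenumbers
    n_r = 20 * 20    # one field component for every receiver-transmitter pair
    n_p = 100_000    # free parameters

    n_s, gib = sensitivity_storage_gib(n_kx, n_r, n_p)
    print(f"{n_s:.1e} elements, about {gib:.0f} GiB per modeling task")
    print(f"about {24 * gib:.0f} GiB if 24 tasks share one compute node")

Run with the counts for a real data set, the same kind of estimate can help decide whether the parallel data decomposition settings for :math:`{n_{rx}}` and :math:`{n_{tx}}` should be reduced to lower memory usage.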