equinor · StephanDeHoop · Jan 16, 2025 · Jan 21, 2025 · Jan 23, 2025 · Jan 23, 2025
diff --git a/docs/everest/development.rst b/docs/everest/development.rst
@@ -18,6 +18,8 @@ client component.
     :width: 700px
     :alt: Everest architecture
 
+    Everest architecture
+
 Every time an optimization instance is ran by a user, the client component of the
 application spawns an instance of the server component, which is started either on a
 cluster node using LSF (when the `queue_system` is defined to be *lsf*) or on the
@@ -52,3 +54,54 @@ long as the optimization process is running.
    * - POST
      - '/stop'
      - Signal everest optimization run termination. It will be called by the client when the optimization needs to be terminated in the middle of the run
+
+
+EVEREST vs. ERT data models
+===========================
+EVEREST uses ERT for running an experiment, but instead of submitting an `ensemble` (ERT) to the queue we submit
+a `batch` in EVEREST. `Batches` are in principle very similar to `ensembles`, ERT queue system doesn't treat them differently,
+but they have some hierarchical differences in terms of the meaning behind the data.
+ERT history matches `realizations` (i.e., `model parameters`) to data, hence an `ensemble` contains a number of `realizations`.
+EVEREST optimizes a set of `controls` and assumes static (i.e., unchanging) `realizations`.
+In terms of collecting the results of forward model runs, there is a distinction between `unperturbed controls`
+(i.e., current `objective function` value) and `perturbed controls` (i.e., required to calculate the `gradient`).
+Furthermore, when performing robust optimization (i.e., multiple static `realizations`) a `batch` contains a
+certain number of `realizations` (denoted by `<GEO_ID>`) and each `realization` contains a number of `simulations`
+(i.e., forward model runs). These `simulations` are forward model runs for either `unperturbed controls` and/or
+`perturbed controls`. This is the key differences between the hierarchical data model of EVEREST and ERT (Fig 3).
+
+.. figure:: images/Everest_vs_Ert_01.png
+    :align: center
+    :width: 700px
+    :alt: EVEREST vs. ERT data models
+
+    Difference between `ensemble` in ERT and `batch` in EVEREST.
+
+.. figure:: images/Everest_vs_Ert_02.png
+    :align: center
+    :width: 700px
+    :alt: Additional explanation of Fig 3
+
+    Different meaning of `realization` and `simulation`.
+
+As is evident from the image above, in terms of execution in the queue `realization` (ERT) and `simulation` (EVEREST) are synonymous.
+This means that ERT queue system is agnostic about the meaning of each run only when the data is collected back in EVEREST (`GEN_DATA`) is meaning
+of each run attributed.
+The mapping from data models in EVEREST and ERT is done in the `ropt` library, it maps from `realization` (ERT) to `<GEO_ID>` and `pertubation` (EVEREST) and vice versa.
+`Batches` in EVEREST can contain several different configurations depending on the algorithm used. Gradient-based algorithms can have a single function
+evaluation (`unperturbed controls`) per `<GEO_ID>`, a set of `perturbed controls` per `<GEO_ID>` to evaluate the gradient, or both.
+Derivative-free methods can have several function evaluations per `<GEO_ID>` and no `perturbed controls`.
+**NOTE:** the optimizer may decide that some `<GEO_ID>` are not needed, these are then skipped and the mapping from `ropt`
+should reflect this (i.e., less `<GEO_ID>` in the batch results than expected).
+
+Another thing to note is that continuity for `realizations` between `ensemble` exists; however, this is not the case for `simulations` in `batches`.
+A `batch` can contain several different configurations (Fig 5) and `simulation 0` for `<GEO_ID> = 0` can be either `unperturbed`
+or `perturbed controls`. `<GEO_ID>` is continuous from one `batch` to the next since they are not changing at all over the course of the optimization.
+
+.. figure:: images/Everest_vs_Ert_03.png
+    :align: center
+    :width: 700px
+    :alt: Other `batch` configurations EVEREST
+
+    Three other possible configurations of EVEREST `batches` in the context of gradient-based (i.e., `optpp_q_newton`)
+    and gradient-free (i.e., **WHICH ONE DO WE SUPPORT?**) optimization algorithms.
diff --git a/docs/everest/images/Everest_vs_Ert_01.png b/docs/everest/images/Everest_vs_Ert_01.png
diff --git a/docs/everest/images/Everest_vs_Ert_02.png b/docs/everest/images/Everest_vs_Ert_02.png
diff --git a/docs/everest/images/Everest_vs_Ert_03.png b/docs/everest/images/Everest_vs_Ert_03.png