Timestep Synchronization¶

Many models are time dependent with variables being updated as the time is incremented. When integrating two (or more) models that have timesteps, care must be taken to ensure that variables are correctly synchonized between the models at each timestep. To aid in this process yggdrasil provides a special timestep synchronization interface and model driver. Synchronizations proceeds as follows:

A client model requests synchronization at a given timestep by calling the synchronization interface with the time of the desired timestep (with units) and any local state variables to the synchronization driver.
The synchronization driver converts synonymous state variables from all models to a base set of state variables.
The synchronization driver interpolates state variables from all models to get values at the time received from the client.
The synchronization driver aggregates state variables across models for the requested time (including interpolated values).
The synchronization driver converts the aggregated base set of state variables to the synonymous state variables used by the client model.
The synchronization driver responds to the client model that issued the request with the resulting state variables.

Example Using Defaults¶

In the example below, two model are initialized from the same source code. Both models have two state variables (x and y) that are calculated as a sine and cosine with periods of 10 days and 5 days respectively; the two models differ only in the size of their timesteps and the units that they use to represent time (the timestep and units for each model are set by input arguments to the model as passed in the yaml).

In the yaml file below, the models are defined as usual, but they also have the timesync parameter set to True. Setting the timesync parameter tells yggdrasil that the model has time dependent variables that need to be synchonized. The timesync parameter can also be set to a string that will be used to group models that should be synchronized; in this way, sets of models can be independently synchronized (e.g. if there are unrelated processes that don’t need to be synced). For each unique value of the timesync parameter in the provided yamls, yggdrasil sets up a specialized model with that name to handle synchronization between the models with the same timesync parameter value (setting timesync to True creates a specialized timestep synchronization model with the name timesync).

Model YAML:

---

models:
  - name: modelA
    language: python
    args:
      - ./src/timesync.py
      - 7  # Pass the timestep in hours
      - hr
    timesync: True
    outputs:
      name: output
      default_file:
        name: modelA_output.txt
        in_temp: True
        filetype: table
  - name: modelB
    language: python
    args:
      - ./src/timesync.py
      - 1  # Pass the timestep in days
      - day
    timesync: True
    outputs:
      name: output
      default_file:
        name: modelB_output.txt
        in_temp: True
        filetype: table

(Example in other languages)

In addition to the yaml parameter, models performing timestep synchronization will need to make use of the timestep synchronization interface. In Python, this is YggTimesync. At each timestep (including the initial time), the model executes the call method for the timestep synchronization interface. The output variable (the variable being sent as a request by the call method) is excepted to be the time of the timestep and a mapping type between state varaible names and their values at the timestep. The return variable (the variable received in response by the call method) will be a mapping type between state variable names and their values that have been updated with information from the other models.

Model Code:

import sys
import numpy as np
from yggdrasil import units
from yggdrasil.interface.YggInterface import (
    YggTimesync, YggOutput)


def timestep_calc(t):
    r"""Updates the state based on the time where x is a sine wave
    with period of 10 days and y is a cosine wave with a period of 5 days.

    Args:
        t (float): Current time.

    Returns:
        dict: Map of state parameters.

    """
    state = {
        'x': np.sin(2.0 * np.pi * t / units.add_units(10, 'day')),
        'y': np.cos(2.0 * np.pi * t / units.add_units(5, 'day'))}
    return state


def main(t_step, t_units):
    r"""Function to execute integration.

    Args:
        t_step (float): The time step that should be used.
        t_units (str): Units of the time step.

    """
    print('Hello from Python timesync: timestep = %s %s' % (t_step, t_units))
    t_step = units.add_units(t_step, t_units)
    t_start = units.add_units(0.0, t_units)
    t_end = units.add_units(5.0, 'day')
    state = timestep_calc(t_start)

    # Set up connections matching yaml
    # Timestep synchonization connection will default to 'timesync'
    timesync = YggTimesync('timesync')
    out = YggOutput('output')

    # Initialize state and synchronize with other models
    t = t_start
    ret, state = timesync.call(t, state)
    if not ret:
        raise RuntimeError("timesync(Python): Initial sync failed.")
    print('timesync(Python): t = % 8s, x = %+ 5.2f, y = %+ 5.2f' % (
        t, state['x'], state['y']))

    # Send initial state to output
    flag = out.send(dict(state, time=t))
    if not flag:
        raise RuntimeError("timesync(Python): Failed to send "
                           "initial output for t=%s." % t)
    
    # Iterate until end
    while t < t_end:

        # Perform calculations to update the state
        t = t + t_step
        state = timestep_calc(t)

        # Synchronize the state
        ret, state = timesync.call(t, state)
        if not ret:
            raise RuntimeError("timesync(Python): sync for t=%f failed." % t)
        print('timesync(Python): t = % 8s, x = %+ 5.2f, y = %+ 5.2f' % (
            t, state['x'], state['y']))

        # Send output
        flag = out.send(dict(state, time=t))
        if not flag:
            raise RuntimeError("timesync(Python): Failed to send output for t=%s." % t)

    print('Goodbye from Python timesync')


if __name__ == '__main__':
    # Take time step from the first argument
    main(float(sys.argv[1]), sys.argv[2])

(Example in other languages)

Below is the result of the synchronization between the two models. The states variable x and y are shown in blue and red respectively, solid lines show results for model A, dashed lines show results for model B, and the True values for x and y are plotted in black.

Model B has a larger timestep than model A so values for model B are interpolated to get values at the timesteps for model A. The default interpolation method assumes a linear relationship between values and the default aggregation method averages across all models; as a result, the values for the model with the smaller timestep (model A) end up being pulled away from the “True” value when aggregated with the interpolated with the values from the model with the larger timestep (model B). There are ways to control how timesteps are synchronized (as discussed below), but in practice, integrating two models of the same process with the same degree of accuracy that use different timesteps is unlikely to be useful.

Controlling Synchonization¶

There are several ways to customize how timesteps are merged between models. In order to set these options, an explicit entry must be added to the yaml for the timestep synchronization model. At a minimum the timestep synchronization model yaml entry should have language: timesync and it’s name should be the same as the timesync parameter value for the models it will synchronize ('timesync' if the models have timesync: True). Additional parameters in the timestep synchronization model yaml entry will be used to control how timesteps are synchronized.

In the example below, these parameters are used to modify how the state variables are synchronized. Models A and B are identical except:

Model A has state variables x and y while Model B has state variables xvar and yvar (set in the yaml)
xvar in Model B is equal to half of x in Model A
Model A has state variables z1 and z2 which can be used to calculate z, a state variable calculated directly by Model B.
Model A alone calculates state variable a, while model B alone calculates state variable b. The models need to exchange these variables.

Model YAML:

---

models:
  - name: statesync
    language: timesync
    synonyms:
      modelB:
        x:
          alt: xvar
          alt2base: ./src/timesync.py:xvar2x
          base2alt: ./src/timesync.py:x2xvar
        y: yvar
      modelA:
        z:
          alt: [z1, z2]
          alt2base: ./src/timesync.py:merge_z
          base2alt: ./src/timesync.py:split_z
    interpolation:
      modelA: krogh
      modelB:
        method: spline
        order: 5
    aggregation:
      x: ./src/timesync.py:xagg
      y: sum
    additional_variables:
      modelA: [b]
      modelB: [a]
  - name: modelA
    language: python
    args:
      - ./src/timesync.py
      - 7  # Pass the timestep in hours
      - hr
      - A
    timesync: statesync
    outputs:
      name: output
      default_file:
        name: modelA_output.txt
        in_temp: True
        filetype: table
  - name: modelB
    language: python
    args:
      - ./src/timesync.py
      - 1  # Pass the timestep in days
      - day
      - B
    timesync: statesync
    outputs:
      name: output
      default_file:
        name: modelB_output.txt
        in_temp: True
        filetype: table

(Example in other languages)

(The source code associated with this model is very similar to the souce code above, but can be found here for reference.)

Synonyms (Conversion)¶

It is unlikely that variables will match perfectly between models, beit in name or definition. For example, in the model defined above, model A and B use different names to describe the same variable (y and yvar respectively). Similarly the variables x and xvar used by models A and B respectively are analagous, but xvar is defined slightly differently (namely xvar=x/2).

To handle reconcilation between analagous variables, yggdrasil allows users to define relationships between variables in different models using the synonyms parameter. The value for the synonyms parameter should be a mapping between model names and mapping from base variable names (these should be variables named by one or more models) and the analogous variable that the model uses. If the model just uses a different name for the same concept as the base, this can be just a string (e.g. the synonyms entry for y in modelB above). If additional calculations are required to convert between the variables used in the models, a mapping can be provided (e.g. the synonyms entry for x in modelB). In addition, calculation mapping from one model to another can also involve more than one variable (e.g. the synonyms entry for z in modelA). The keys in the entry should be:

alt: One or more state variables used by the model to calculate the base state variable.
alt2base: Function for converting from the state variable(s) used by the model to the base variable.
base2alt: Function for converting from the base variable to the state variable(s) used by the model.

Note

Units conversions will be handled by yggdrasil and do not need to be addressed by the synonyms parameters so long as the state variables have units when passed to the timestep synchronization interface call method.

Additional Variables¶

Models can also request state variables from other models that they do not calculate themselves via the additional_variables parameter in the yaml. The value for this parameter should be a mapping from model name to a list of external variables that should be returned to the model. For example, in the above yaml, model A requests state variable b from model B and model B requests state variable a from model A.

Interpolation¶

The interpolation parameter controls how data is obtained for timesteps that are not sampled by the model. This is particularly important for models that have very different timesteps. Values for the interpolation parameter can be strings specifying the method that should be used to interpolate or mapping of keyword arguments to pandas.DataFrame.interpolate. A single interpolation parameter can be used for all models or specialed interpolation parameters can be specified a model-by-model basis using a mapping as in the example above. Interpolation is done on the data from each model independently after applying the transformations described by the synonyms parameter.

Aggregation¶

After the model data is interpolated to get missing timesteps, the data samples at each timestep are aggregated. By default, the samples from each model are averaged, but there are many ways to aggregate data. In the example above, the data for the x variable are aggregated using a function xagg from the ./src/timesync.py file which always returns the largest absolute value. The data for the y variable are aggregated by summing as would be the case when two models represent separate processes that modify a cumulative variable (e.g. mass added/subtracted by different processes).

Additional Tips¶

In addition to dictionaries mapping from variable to method, a single value can be provided for the aggregation and interpolation parameter; the same method will the be used for all of the variables e.g.:

- name: statesync
  language: timesync
  synonyms:
    modelB:
      x:
        alt: xvar
        alt2base: ./src/timesync.py:xvar2x
        base2alt: ./src/timesync.py:x2xvar
      y: yvar
  interpolation: nearest
  aggregation: min

Note

Since each synchronization call invokes overhead, it is not advised that the call method be executed inside integration methods. Instead, synchonization call methods should be executed at larger timesteps.