
lstm's People

Contributors

aaraney, frsalas-noaa, hellkite500, jmframe, madmatchstick, peckhams, robertbartel

lstm's Issues

Add example using all static attributes and forcings

We need an example that uses all the static attributes and forcings, so that the BMI functionality can be tested on all of those items. For instance, to test set_value for the two wind speed components, we need an example that takes those as forcing data.

Compatibility with heat model example from CSDMS


Current behavior

The current implementation of the LSTM BMI does not strictly follow the template from the CSDMS heat model example (https://github.com/csdms/bmi-example-python/tree/master/heat), or some of the other BMI Python models. For instance, in the snow model example by @SnowHydrology, the model input values are attributes of the model itself (https://github.com/SnowHydrology/snowBMI/blob/0886ac4140c4ede8300f92e42b039267ef2eaa40/snow/snow.py#L136), whereas in the LSTM model, the model inputs are passed in as arguments to the model's forward function:

def forward(self, input_layer, h_t, c_t):
    ...

Expected behavior

Is it expected that the LSTM model conform to the examples by CSDMS? This would require significant modification to the LSTM model implementation itself. There was, at some point, an expectation that BMI would not require modifications to the specific model implementation, and that the BMI wrapper could be developed to require minimal, or no, changes to the model itself.

Steps to replicate behavior (include URLs)

  1. To conform to the snow BMI example, the LSTM model would add an _input_layer attribute to the LSTM class, such as self._input_layer = torch.tensor([0,0,...,0]). Then, during the initialization of the LSTM BMI and during the set_value function, there would be an in-place operation to override the input layer. This would allow the reference val = self.get_value_ptr(var_name), which could then allow for the in-place operation val[:] = src.reshape(val.shape). See the sketch below.
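
A minimal sketch of that pattern, assuming a model that stores its input layer as a tensor attribute (the class and variable names are illustrative, not the repository's actual code):

import numpy as np
import torch

class LstmModel:
    """Hypothetical model that holds its input layer as an attribute."""
    def __init__(self, n_inputs):
        self._input_layer = torch.zeros(n_inputs)

class LstmBmi:
    """Hypothetical BMI wrapper exposing the model's input layer by reference."""
    def __init__(self, n_inputs):
        self._model = LstmModel(n_inputs)
        # torch.Tensor.numpy() returns a view that shares memory with the
        # tensor, so in-place writes through BMI reach the model directly.
        self._values = {'input_layer': self._model._input_layer.numpy()}

    def get_value_ptr(self, var_name):
        return self._values[var_name]

    def set_value(self, var_name, src):
        val = self.get_value_ptr(var_name)
        val[:] = np.asarray(src).reshape(val.shape)  # in-place override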

Questions for OWP

  1. Is there a list of requirements for the BMI implementations of model formulations?
  2. Specifically, is it required that BMI implementations follow the templates from the examples set by CSDMS?
  3. Is there a hard requirement of code parsimony?
  4. Is there a continuing philosophy of domain experts recognizing the original model code?
  5. Are model formulation codes required to conform to the BMI templates, or is there room to allow BMI to conform to model formulation codes, provided that the BMI functions perform as intended?

Broken link to example configuration file

This link is broken. It looks like the link just needs to be updated to ../trained_neuralhydrology_models/hourly_all_attributes_and_forcings/config.yml.

- `train_cfg_file: ./trained_neuralhydrology_models/hourly_all_attributes_and_forcings/config.yml` found [here]( ./trained_neuralhydrology_models/hourly_all_attributes_and_forcings/config.yml). This is a very important part of the LSTM model. This is a configuration file used when training the model. It has critical information on the LSTM architecture and should not be altered.

For longevity, it might be best to use a permalink though:

https://github.com/NOAA-OWP/lstm/blob/63116cc6a6bbdb5537868f20ff55cc326795b570/trained_neuralhydrology_models/hourly_all_attributes_and_forcings/config.yml

I'm happy to open a PR to fix this; just let me know which one you all prefer.

Bound the runoff to zero


Current behavior

The model can predict runoff values less than zero.

Expected behavior

The model should never predict runoff less than zero; see the sketch below.
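
One possible fix, sketched here assuming the prediction is a NumPy array (this is not the repository's code):

import numpy as np

def bound_runoff(runoff):
    # Clamp any negative predictions to zero (hypothetical helper).
    return np.maximum(runoff, 0.0)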

Simulated output in jupyter notebooks

Both notebooks contain plots comparing observed and simulated outputs. The LSTM-BMI outputs are clearly incorrect. Let's check (hardcoded) variable units, timestep size/units, etc. See section [4] in run-lstm-with-bmi.ipynb.


Initialize all the possible forcings, and don't hard code any specific forcings.

An LSTM can be trained with any combination of forcings, so the BMI should not hard code them. Let the BMI determine which forcings to include from the training configuration. We can still initialize every possible forcing and simply leave the unused ones out of the input tensor.

Current behavior

def initialize_forcings(self):
    #------------------------------------------------------------ 
    if 'total_precipitation' in self.cfg_train['dynamic_inputs']:
        self.total_precipitation = 0
    #------------------------------------------------------------ 
    if 'temperature' in self.cfg_train['dynamic_inputs']:
        self.temperature = 0

Expected behavior

def initialize_forcings(self):
    # Initialize every forcing named in the training configuration.
    for forcing_name in self.cfg_train['dynamic_inputs']:
        setattr(self, forcing_name, 0)

Don't hard code the attribute names when initializing

The training configuration lists the specific attributes used during training, and these need to be included in forward mode. If a model is trained with a specific set of attributes as inputs, the BMI should be able to provide those, as indicated by the training configuration file.

Current behavior

def set_static_attributes(self):
    #------------------------------------------------------------ 
    if 'elev_mean' in self.cfg_train['static_attributes']:
        self.elev_mean = self.cfg_bmi['elev_mean']
        self.all_lstm_input_values_dict['elev_mean'] = self.cfg_bmi['elev_mean']
    #------------------------------------------------------------ 
    if 'slope_mean' in self.cfg_train['static_attributes']:
        self.slope_mean = self.cfg_bmi['slope_mean']
        self.all_lstm_input_values_dict['slope_mean'] = self.cfg_bmi['slope_mean']

Expected behavior

def set_static_attributes(self):
    """ Get the static attributes from the configuration file
    """
    for attribute in self._static_attributes_list:
        if attribute in self.cfg_train['static_attributes']:

            # This is probably the right way to do it,
            setattr(self, attribute, self.cfg_bmi[attribute])

            # and this is just in case.
            self.all_lstm_input_values_dict[attribute] = self.cfg_bmi[attribute]

We need an example that includes all the LSTM inputs


Current behavior

The examples only include four LSTM inputs:
dynamic_inputs:

  • total_precipitation
  • temperature

static_attributes:

  • elev_mean
  • slope_mean

Expected behavior

An example that includes all currently possible inputs:
dynamic_inputs:

  • total_precipitation
  • longwave_radiation
  • shortwave_radiation
  • pressure
  • specific_humidity
  • temperature
  • wind_u
  • wind_v

static_attributes:

  • elev_mean
  • slope_mean
  • area_gages2
  • frac_forest
  • lai_max
  • lai_diff
  • gvf_max
  • gvf_diff
  • soil_depth_pelletier
  • soil_depth_statsgo
  • soil_porosity
  • soil_conductivity
  • max_water_content
  • sand_frac
  • silt_frac
  • clay_frac
  • carbonate_rocks_frac
  • geol_permeability
  • p_mean
  • pet_mean
  • aridity
  • frac_snow
  • high_prec_freq
  • high_prec_dur
  • low_prec_freq
  • low_prec_dur

Temperature should actually be in kelvin


Current behavior

The BMI claims it wants temperature in Kelvin, but the model actually wants Celsius.

Expected behavior

The temperature the model receives should be in the units it was trained on (Celsius), regardless of the unit BMI advertises.

Steps to replicate behavior (include URLs)

  1. Either add in a conversion (see the sketch below),
  2. or change the BMI unit.
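
A minimal sketch of option 1, converting incoming BMI values before they reach the model (the helper name is illustrative, not the repository's code):

def kelvin_to_celsius(t_kelvin):
    # BMI advertises Kelvin; the model was trained on Celsius.
    return t_kelvin - 273.15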

Get rid of the second basin area variable


Current behavior

There are two values for basin area

Expected behavior

There should only be one value for basin area.

Using newly trained lstm models

A scaler file is required in order to use an LSTM model other than the ones saved in the repository. @jmframe, where do these get generated? It wasn't clear from NeuralHydrology whether it produces this file. Can you provide any guidance on the steps to bring in a new LSTM model for inclusion?

csdms standard name for precip

Reminder to confirm the appropriate CSDMS standard name for the precipitation input variable: atmosphere_water__time_integral_of_precipitation_mass_flux vs. atmosphere_water__liquid_equivalent_precipitation_rate, as mentioned/adjusted in PR #34.

put serialization example in a .py file


Current behavior

The serialization example lives in a Jupyter notebook and can't be run from a terminal.

Expected behavior

Provide the example in a .py file that can be run from a terminal.

Duplicating values as individual attributes and as part of a dictionary (_values). Choose one.


Current behavior

Values are stored both as individual attributes of the BMI model:
self.total_precipitation or self.atmosphere_water__liquid_equivalent_precipitation_rate
and as part of a dictionary:
self.all_lstm_input_values_dict['atmosphere_water__liquid_equivalent_precipitation_rate']

Expected behavior

Store each value in one place or the other.
Also, we don't need the all_lstm_input_values_dict dictionary; we'll just include everything in _values.

Option to run as sequence-to-one and sequence-to-sequence


Current behavior

The model state space persists throughout the simulation time.

Expected behavior

NeuralHydrology trains the LSTM to reset the state space and run a full sequence length of simulation at each time step.

Steps to replicate behavior (include URLs)

Either

  1. Have an option to pass the full sequence to the LSTM at each time step, which means also resetting the state space at each time step. This option would increase run time by a factor equal to the sequence length; for example, if the training sequence length is 336 timesteps, this would require 336 calls to the model for each time step of prediction (see the sketch after this list).
  2. Train the NeuralHydrology CustomLSTM and eliminate the state resetting.
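
A minimal sketch of option 1, assuming the model's forward(input_layer, h_t, c_t) signature and that it returns (output, h_t, c_t); the shapes and names are illustrative:

import torch

def predict_seq_to_one(model, forcing_window, hidden_size):
    # Reset the state space, then run the full training sequence length
    # to produce a single prediction; forcing_window holds one row of
    # scaled inputs per timestep (e.g., 336 rows).
    h_t = torch.zeros(1, hidden_size)
    c_t = torch.zeros(1, hidden_size)
    for t in range(forcing_window.shape[0]):
        output, h_t, c_t = model.forward(forcing_window[t], h_t, c_t)
    return output  # prediction for the final timestep only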


Decide on an output unit, and make output available in get_value

We need one output for this model. We can provide it in units of 'mm', 'ft3 s-1', or 'm3 s-1'. So decide on that, and then make it available through get_value. That requires associating the chosen output unit with 'land_surface_water__runoff_volume_flux'.

Current behavior

Nothing is currently associated with 'land_surface_water__runoff_volume_flux'.

Expected behavior

_var_name_units_map = {'land_surface_water__runoff_volume_flux': ['streamflow_cfs', 'ft3 s-1'], ...}

def scale_output(self):
    ...
    self._values['land_surface_water__runoff_volume_flux'] = self.streamflow_cms * (1/35.314)
    # ...OR...
    self.land_surface_water__runoff_volume_flux = self.streamflow_cms * (1/35.314)

etc.
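
For reference, a small sketch of the unit conversion involved (the conversion factor is standard; the names are illustrative):

CFS_PER_CMS = 35.3147  # 1 m3 s-1 is approximately 35.3147 ft3 s-1

def cfs_to_cms(q_cfs):
    # Convert cubic feet per second to cubic meters per second.
    return q_cfs / CFS_PER_CMS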

Ensure correct implementations for getter and setter functions

There are some issues with the current implementations of the getter and setter functions. These are things that don't properly adhere to the BMI specification.

get_value

First, the signature of this function isn't correct. Right now it is:

def get_value(self, var_name):

But in the specification it is defined as:

def get_value(self, name: str, dest: np.ndarray) -> np.ndarray:

This directly prevents it from behaving correctly. It is expected to copy the values of the desired variable into the (already appropriately initialized) dest NumPy array.

Also related, the function right now is returning the backing NumPy array from get_value_ptr. It probably needs to return dest once that's added. Regardless, it shouldn't return a direct reference to the backing variable.
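
A minimal sketch of a spec-compliant get_value, assuming get_value_ptr returns the backing array:

def get_value(self, name, dest):
    # Copy the values into the caller-supplied buffer instead of
    # returning a reference to the backing array, per the BMI spec.
    dest[:] = self.get_value_ptr(name)
    return dest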

get_value_at_indices

This will need to be modified after changes to get_value. I think it's also making an unnecessary copy of the entire backing NumPy array, before then copying the right individual values into the supplied dest array.

set_value

This function has its required second parameter for containing the values to set, but appears to make invalid assumptions based on something beyond the BMI specification: e.g., that it is a singleton array. That in particular also seems to be contradicted by there being valid get_value_at_indices and set_value_at_indices functions (i.e., why not raise an exception to indicate there is only index 0, and thus the at_indices functions are unnecessary and inappropriate?).

set_value_at_indices

Again, this seems to try to account for the possibility of singleton arrays in ways that contradict needing at_indices functions, and that aren't strictly consistent with BMI.
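
Minimal sketches of spec-compliant setters, again assuming get_value_ptr returns the backing array:

import numpy as np

def set_value(self, name, src):
    # Copy incoming values over the backing array in place, without
    # assuming the variable is a singleton.
    val = self.get_value_ptr(name)
    val[:] = np.asarray(src).reshape(val.shape)

def set_value_at_indices(self, name, inds, src):
    # Write only the requested flat indices.
    self.get_value_ptr(name)[inds] = src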

Set_value functionality is mostly commented out.


Current behavior

The set_value function, starting on line 792, copies the input to an internal array, down to line 805. So far, so good. But the rest of the code that would actually set the value is commented out, so the current code cannot ingest input values.

Expected behavior

There should be some mechanism to get the input value set into the model; this is in the commented-out code from lines 807-830. It looks like there has been some discussion on the correct mechanism for setting the input value, but any consensus may be obscured by the commented-out code.

Steps to replicate behavior (include URLs)

  1. Uncomment lines 807-830.
  2. Replace the item "value" (which doesn't exist) from lines 812-830 with the "internal_array" variable.

Screenshots

(screenshot: LSTM_Set_value_issue_20240413)

Config file path should be type string

See the bmi.initialize() definition:

def initialize( self, bmi_cfg_file=None ):
    #NJF ensure this is a Path type so the follow open works as expected
    #When used with NGen, the bmi_cfg_file is just a string...
    bmi_cfg_file = Path(bmi_cfg_file)

Per the NJF comment, we should maintain bmi_cfg_file as a string for ngen compliance.

Current behavior

bmi_cfg_file is converted to a Path.

Expected behavior

bmi_cfg_file is handled as a string.
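
A minimal sketch of initialize keeping the config path as the string ngen passes in (illustrative; open() accepts a string path directly):

def initialize(self, bmi_cfg_file=None):
    # No Path conversion needed; a plain string works with open().
    with open(bmi_cfg_file, 'r') as fp:
        cfg = fp.read()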

set_values_dictionary is redundant, get rid of it and organize input values from _values

We don't need a separate dictionary distinguishing the values that will be included in the input from all possible input values. When we create the input tensor, we can organize the inputs from the full dictionary (_values), or, if we decide to store the input values as individual attributes, we can reference them from there.

Current behavior

def set_values_dictionary(self):
    self._values = {self._var_name_map_short_first[x]: self.all_lstm_input_values_dict[x] for x in self.all_lstm_inputs}

...

def create_scaled_input_tensor(self):
    self.set_values_dictionary()

Expected behavior

def create_scaled_input_tensor(self):
    self.input_array = np.array([getattr(self, self._var_name_map_short_first[x]) for x in self.all_lstm_inputs])

    # ...or...
    self.input_array = np.array([self._values[self._var_name_map_short_first[x]] for x in self.all_lstm_inputs])
