Code Monkey home page Code Monkey logo

djgagne / hagelslag Goto Github PK

View Code? Open in Web Editor NEW
67.0 67.0 27.0 93.46 MB

Hagelslag supports segmentation and tracking of weather fields and scalable verification, including performance diagrams and reliability diagrams.

License: MIT License

Python 12.35% Jupyter Notebook 87.57% Shell 0.07%
geojson hail hrrr machine-learning mrms netcdf performance performance-diagram python reliability segmentation storms tracking verification weather zarr

hagelslag's People

Contributors

ahijevyc avatar alburke avatar charlie-becker avatar djgagne avatar lmadaus avatar mariajmolina avatar thomasmgeo avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

hagelslag's Issues

STObject time tracking

STObject was built under the assumption of hourly integer time steps originally, and some of the derived functions have not been updated to enable arbitrary time differencing when tracking or calculating trajectories and centroids. This functionality needs to be fixed so that arbitrary times can be used rather than integer times with a step of 1.

different centroid in track_step csv and patches nc

The centroid in the csv track_step file is different than the centroid in the patches netCDF file.
When I say different, I don't mean more than 0.2 degrees different, but definitely more than machine roundoff error.

For example:
object: d01_REFL_1KM_AGL_20110427-0000_24_25_622_01
csv longitude: -87.70007
patches longitude: -87.73282

Data format

Good day!

I have been reviewing the all the codes from hagelslag and I would like to ask the data input format that it needs. Thank you.

Patch Centroid lon / lat

The centroid lon / lat data in the netCDF patch data represents the the longitude / latitude values at the center of the patch, not the center of the object, and thus there are mis-matches between centroid lon / lat in the CSV files and netCDF files for the same storm objects.

hagelslag/bin/hsdata

Lines 642 to 645 in 57d1051

for c_var in ["lon", "lat"]:
out_file.variables["centroid_" + c_var][:] = np.concatenate([np.array(f_track.attributes[c_var])[:,
patch_radius, patch_radius]
for f_track in forecast_tracks])

Is there any reason to keep the patch-center derived lons / lats? If not, the values could be replaced using the same process to generate the centroids for the CSV file.

hagelslag/bin/hsdata

Lines 454 to 455 in 57d1051

centroid_x, centroid_y = forecast_track.center_of_mass(step)
centroid_lon, centroid_lat = proj(centroid_x, centroid_y, inverse=True)

Issue with Probability Evaluation Function

The ampersand (&) symbol in the /hagelslag/evaluation/ProbabilityMetrics.py module's DictributedROC class update() function did not produce expected behavior. My interpretation is that np.count_nonzero((forecasts >= threshold) & (observations >= self.obs_threshold)) should yield the frequency of values where boolean values are true and intersect, but instead this yields for me the frequency of times either boolean array is true. I added a np.logical_and and it fixed this issue for me. Please let me know if you can replicate this issue, and if so, I can submit a pull request of my fix.

    def update(self, forecasts, observations):
        """
        Update the ROC curve with a set of forecasts and observations
        Args:
            forecasts: 1D array of forecast values
            observations: 1D array of observation values.
        """
        for t, threshold in enumerate(self.thresholds):
            tp = np.count_nonzero((forecasts >= threshold) & (observations >= self.obs_threshold))
            fp = np.count_nonzero((forecasts >= threshold) &
                                  (observations < self.obs_threshold))
            fn = np.count_nonzero((forecasts < threshold) &
                                  (observations >= self.obs_threshold))
            tn = np.count_nonzero((forecasts < threshold) &
                                  (observations < self.obs_threshold))
            self.contingency_tables.iloc[t] += [tp, fp, fn, tn]

skimage regionprops depreciating xy coordinates

Eventually we need to accept the objects' region properties in row-column coordinates, as opposed to xy.

skimage.measure regionprops function will stop using xy coordinates and use row-column instead.
STObject.py uses it for shape properties.

We could specify regionprops(..., coordinates='xy') for now to avoid the warning, but this will break.

See https://scikit-image.org/docs/0.14.x/release_notes_and_installation.html#deprecations for details on how to avoid this message.
warn(XY_TO_RC_DEPRECATION_MESSAGE)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.