Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Find peaks #3729

Merged
merged 7 commits into from
Jan 9, 2025
Merged

Find peaks #3729

merged 7 commits into from
Jan 9, 2025

Conversation

dopplershift
Copy link
Member

@dopplershift dopplershift commented Jan 4, 2025

Description Of Changes

This adds find_peaks (and persistence) as API's for finding peaks, as well as makes scattertext() a public API for plotting this information. This is an alternative to #3110 that makes use of persistent homology as the mathematical underpinning for peak-finding in 2D. This runs in 30s on my mac for a 1377x2145 HRRR grid, which isn't too bad. It's much better for smaller grids.

TODO:

  • Improve example
  • Clean up scattertext() a bit to handle formatting and broadcasting values

In the meanwhile I wanted to get this up if anyone had API thoughts. Here's a quick example:

import cartopy.crs as ccrs
import cartopy.feature as cfeature
import matplotlib.pyplot as plt
import metpy.calc as mpcalc
from metpy.cbook import get_test_data
import metpy.plots as mpplots
from metpy.units import units
import numpy as np
import xarray as xr

data = xr.open_dataset(get_test_data('GFS_test.nc', as_file_obj=False)).metpy.parse_cf()
mslp = data.Geopotential_height_isobaric.metpy.sel(vertical=850*units.hPa).squeeze()
dataproj = mslp.metpy.cartopy_crs

h_y, h_x = mpcalc.find_peaks(mslp.values)
l_y, l_x = mpcalc.find_peaks(mslp.values, maxima=False)

fig = plt.figure(figsize=(8., 8.))
ax = fig.add_subplot(1, 1, 1, projection=ccrs.LambertConformal(central_longitude=-95))

clevmslp = np.arange(0., 2000., 30)
ax.contour(mslp.metpy.x, mslp.metpy.y, mslp,
           clevmslp, colors='k', linewidths=1.25, linestyles='solid', transform=dataproj)

mpplots.scattertext(ax, mslp.metpy.x[h_x], mslp.metpy.y[h_y], ['H']*2, size=20, color='red',
                    fontweight='bold', transform=dataproj)
mpplots.scattertext(ax, mslp.metpy.x[l_x], mslp.metpy.y[l_y], ['L']*4, size=20, color='blue',
                    fontweight='bold', transform=dataproj)
mpplots.scattertext(ax, mslp.metpy.x[l_x], mslp.metpy.y[l_y], [format(f, '.0f') for f in mslp.values[l_y, l_x]],
                    size=12, color='blue', fontweight='bold', loc=(0, -15), transform=dataproj)

ax.add_feature(cfeature.OCEAN)
ax.add_feature(cfeature.LAND)
ax.add_feature(cfeature.COASTLINE)

image

Checklist

@dopplershift dopplershift requested review from kgoebber and a team as code owners January 4, 2025 00:48
@dopplershift dopplershift requested review from dcamron and removed request for a team January 4, 2025 00:48
@@ -13,6 +11,7 @@
from .patheffects import * # noqa: F403
from .skewt import * # noqa: F403
from .station_plot import * # noqa: F403
from .text import * # noqa: F403

Check notice

Code scanning / CodeQL

'import *' may pollute namespace Note

Import pollutes the enclosing namespace, as the imported module
metpy.plots.text
does not define '__all__'.

def test_find_peaks(peak_data):
"""Test find_peaks correctly identifies peaks."""
data = xr.open_dataset(get_test_data('GFS_test.nc', as_file_obj=False))

Check warning

Code scanning / CodeQL

File is not always closed Warning test

File is opened but is not closed.

def test_find_peaks_minima(peak_data):
"""Test find_peaks correctly identifies peaks."""
data = xr.open_dataset(get_test_data('GFS_test.nc', as_file_obj=False))

Check warning

Code scanning / CodeQL

File is not always closed Warning test

File is opened but is not closed.
@dcamron dcamron added this to the 1.7.0 milestone Jan 6, 2025
@kgoebber
Copy link
Collaborator

kgoebber commented Jan 7, 2025

Generally, I think I like the user experience of the new function. The only thing I'm wondering is whether it would be useful to include the number of peaks in the output of the function. I only say this because when you go to plot with the now public scattertext, you need to know the number of peaks for when you need to plot a given number of H or L "symbols". Clearly this could be gotten after the fact from a simple len() call, but don't know if it would be more convenient to have that number be a third output from the function.

@dopplershift
Copy link
Member Author

@kgoebber My plan is to tweak scattertext() so that it broadcasts arguments correctly and you don't need to supply that for plotting. Given that, I don't think there's a need to directly return it and any other usage can use len().

@kgoebber
Copy link
Collaborator

kgoebber commented Jan 7, 2025

@dopplershift That was another thought I had too, which I think is an even better solution! And if I could actually read, I would have seen that it was you put on the to do list. D'oh!

@dopplershift
Copy link
Member Author

Ok, now the example (which is included) reads as:

import cartopy.crs as ccrs
import cartopy.feature as cfeature
import matplotlib.pyplot as plt
import metpy.calc as mpcalc
from metpy.cbook import get_test_data
import metpy.plots as mpplots
from metpy.units import units
import numpy as np
import xarray as xr

data = xr.open_dataset(get_test_data('GFS_test.nc', as_file_obj=False)).metpy.parse_cf()
mslp = data.Geopotential_height_isobaric.metpy.sel(vertical=850*units.hPa).squeeze()
dataproj = mslp.metpy.cartopy_crs

h_y, h_x = mpcalc.find_peaks(mslp.values)
l_y, l_x = mpcalc.find_peaks(mslp.values, maxima=False)

fig = plt.figure(figsize=(8., 8.))
ax = fig.add_subplot(1, 1, 1, projection=ccrs.LambertConformal(central_longitude=-95))

clevmslp = np.arange(0., 2000., 30)
ax.contour(mslp.metpy.x, mslp.metpy.y, mslp,
           clevmslp, colors='k', linewidths=1.25, linestyles='solid', transform=dataproj)

mpplots.scattertext(ax, mslp.metpy.x[h_x], mslp.metpy.y[h_y], 'H', size=20, color='red',
                    fontweight='bold', transform=dataproj)
mpplots.scattertext(ax, mslp.metpy.x[l_x], mslp.metpy.y[l_y], 'L', size=20, color='blue',
                    fontweight='bold', transform=dataproj)
mpplots.scattertext(ax, mslp.metpy.x[l_x], mslp.metpy.y[l_y], mslp.values[l_y, l_x],
                    formatter='.0f',
                    size=12, color='blue', fontweight='bold', loc=(0, -15), transform=dataproj)

ax.add_feature(cfeature.OCEAN)
ax.add_feature(cfeature.LAND)
ax.add_feature(cfeature.COASTLINE)

@dopplershift dopplershift force-pushed the find-peaks branch 2 times, most recently from 2288631 to 9bb59e3 Compare January 8, 2025 03:03
@kgoebber
Copy link
Collaborator

kgoebber commented Jan 8, 2025

Looking good. Should we update the GEMPAK comparison guide in this PR or do that with a DOC update? I don't know that the table gets used all that much at this point, but would be good to update at some point once this PR is merged. There would be a HIGH and LOW function to update within the table.

@dopplershift
Copy link
Member Author

Good point about the GEMPAK table. I went ahead and added that.

kgoebber
kgoebber previously approved these changes Jan 8, 2025
Copy link
Collaborator

@kgoebber kgoebber left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This all looks good to me at this point.

src/metpy/calc/tools.py Outdated Show resolved Hide resolved
This finds local maxima, or minima, for a 2D array.
This will make it easier to use for various use cases.
This allows e.g. `scattertext(ax, x, y, 'H')`.
This allows scattertext to handle converting an array of values to a
sequence of strings.
Replaces less intuitive use of StationPlot.
This uses our new find_peaks implementation.
Copy link
Member

@dcamron dcamron left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm happy with the implementation after playing around with it a bit locally. Let's get it in

@dcamron dcamron merged commit d9bcf95 into Unidata:main Jan 9, 2025
35 checks passed
@dopplershift dopplershift deleted the find-peaks branch January 9, 2025 23:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Clean up High/Low Plotting Function A Bit
3 participants