Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Climate change and health - precipitation and healthcare provision #1485

Draft
wants to merge 212 commits into
base: master
Choose a base branch
from

Conversation

RachelMurray-Watson
Copy link
Collaborator

@RachelMurray-Watson RachelMurray-Watson commented Oct 9, 2024

Branch that investigates the effect of changes in precipitation patterns - caused by climate change - on the provision of healthcare services.

Steps:

  • Using SHAP files, grid Malawi into 1degree-by-1degree squares

  • Associate facilities reported in ResourceFile_Master_Facilities_List with a grid (which will be used to provide location semi-specific climate projections)

  • Download projected weather patterns from each available model for 2015 - 2100 (daily: historical data can be monthly). Initial plan: three SSPs (SSP1 - 1.9, SSP2-4.5 (current path), and SSP5 - 8.5) https://cds.climate.copernicus.eu/datasets/projections-cmip6?tab=download

        - [X] ssp1-1.9
        - [X] ssp2-4.5
        - [ ] ssp5-8.5
    
  • Aggregate daily precipitation across all models (median is usual metric)

  • Download historical monthly precipitation data for Malawi from https://cds.climate.copernicus.eu/datasets/reanalysis-era5-single-levels-monthly-means?tab=overview (probably 2011 - 2024, as this period has good reporting from healthcare facilities)

  • Download historical reporting patterns of facilities from DHIS2 (initially just central facilities)

  • Associate reporting facilities with grid on map

  • Association between historical precipitation events and reporting of facilities

Issues so far...

  1. No specific data for location of facilities.
    - Could use the largest intersecting area, but for some the intersections could be evenly divided between >2 grid squares.
    - Could randomly assign a grid square that is known to overlap with the region. WILL DO THIS but then leads to issue of small overlaps are as likely to lead to grid square being chosen as larger ones. Now DO have locations for most facilities, but may still need random assignation if no direct match can be made.

Very weak association between historical precipitation and reporting... Could improve by curtailing data but loose many data points.

HOWEVER, have not accounted for the fact that there can be serious knock-on events. Precipitation may temporarily remove access, but also destroy buildings (see https://www.manaonline.gov.mw/index.php/the-star/item/4099-phalombe-health-centre-re-opened-for-operation). In DHIS2, these are returning "0" for % reporting, so even though there may be no rain, they are still not operating as usual.
#################
So, instead of looking at total monthly rainfall, could instead look at daily maximums... ERA5 reanalysis data is now available (published 14/10/2024).

  • Download daily maximum precipitation from 2014 - 2024 - in progress, some issues with server
  • Re-examine relationship between historical reporting and rainfall
  • Download smaller-level facility data
    • have done (clinics, district/rural/central hospitals, health centres). 90 have no expectation of reporting
  • Associate smaller facilities with grid squares based on new file

Issues part 2:

  1. Can get reporting data for 760 facilities, but only location <400 on a grid. Still may be fine, that's a lot of data

#################
Reporting metric is slightly obtuse - don't have a good sense of e.g. what a drop in reporting means for a facility level. Excellent suggestion to look at ANC data, as is more interpretable and perhaps more stable than other metrics.

  • Download HMIS Total Antenatal Visits from DHIS2

  • Run regression again with ANC and daily maximum (and/or monthly data)

  • Start projecting forward - perhaps days with projected disruptions?

################## 21.11.24

  • Based on feedback from Sally, re-evaluating ERA5 data for historical trend. Was in part based on this paper, which showed good agreement for the Mozambique/Malawi data and more "observational" datasets (occasionally agreeing better than with the observational with themselves). However, should do own more specific checks.
  • Build ML model of association
  • Download DHIS2 data for other service provisions (e.g. HIV care?)
  • KD-Tree and Ball algorithm - potential means of avoiding regridding etc. CMIP6 data. Instead of assigning each facility to a grid point, go directly to projections and make "individual ensembles" for each facility?
  • Mask COVID19 months
  • Add distance to closest clinic

Downloaded CHIRPS and TAMSAT data. There is no very strong agreement between the three (some outliers for all), but ERA5 and CHIRPS have better agreement than TAMSAT and the others.

################## December 2024

  • Download downscaled CMIP6 data from https://planetarycomputer.microsoft.com/dataset/cil-gdpcir-cc0#Ensemble-example
  • Find min, median, and max model for all time points across all grid points
  • Calculate indices of interest (5 day monthly max and monthly cumulative)
  • Associate facilities with gridsquares using KDBall algorithm
  • Use these as new bases for CMIP6 projections
  • Build demographic model in Excel? Or way of using TLO model - e.g. could get an estimate of the births from the Longterm projections model?
  • Estimate number of Antenatal disruptions from the models (how does this translate? Is one entry one appointment? How does ANC4 relate?)

…then gridded over this graph in (1x1 degrees), which will be used as a reference for climate data retrieval. Have additionally loaded in facilities from the ResourceFile_Master_Facilities_List, and mapped them onto specific districts (from the ResourceFile_mwi_admbnda_adm2_nso_20181016 shap file)
… to folder.

NB - can only work for ssp1_1_9, as there are different models available for each ssp so need to collate.

Also produces zipped files
…etCDF files

NB - can only work for ssp1_1_9 at the moment
extracted into areas of interest. from 2000-2024, but only really reliable after 2011
…here are many "duplicates"

# in facilities_with_districts_shap_files (as each facility is paired with any matching grid)
# removing the duplicates PENDING a better system (e.g. assigning based on size)
…care facilities, as a different gridding system is used. Though I requested the netCDF the max/min x/y coordinates for Malawi, I suppose they were rounded up/down to (e.g., from the NetCDF file, the lowest latitude is -16 ish, but from the Shap file it should be -17).

SO have regrided Malawi/reassign facilities based on the NetCDF file.
However, am getting extremely high numbers of distruption...
…e differences are being used. (Effect was diminished by fact that +ve values were included.)
…e differences are being used. (Effect was diminished by fact that +ve values were included.)
also reduced negative sum to single number
also, saved the dataframe of historical estimated disruptions
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant