데이터셋 상세
미국
Daily mean air temperature data for the North American Great Lakes based on coastal weather stations; 1897-2023 (NCEI Accession 0291722)
This dataset contains a record of daily mean air temperature for each of the U.S. Great Lakes from January 1, 1897 to October 22, 2023. These temperatures were derived using the following method. Daily maximum and minimum air temperature data were obtained from the Global Historical Climatology Network-Daily (GHCNd, Menne, et al. 2012) and the Great Lakes Air Temperature/Degree Day Climatology, 1897-1983 (Assel et al. 1995). Daily air temperature was calculated by taking a simple average of daily maximum and minimum air temperature. Following Cohn et al. (2021), a total of 24 coastal locations along the Great Lakes were selected. These 24 locations had relatively consistent station data records since the 1890s. Each of the selected locations had multiple weather stations in their proximity covering the historical period from 1890s to 2023, representing the weather conditions around the location. For most of the locations, datasets from multiple stations in the proximity of each location were combined to create a continuous data record from the 1890s to 2023. When doing so, data consistency was verified by comparing the data during the period when station datasets overlap. This procedure resulted in almost continuous timeseries, except for a few locations that still had temporal gaps of one to several days. Any temporal data gap less than 10 days in the combined timeseries were filled based on the linear interpolation. This resulted in completely continuous timeseries for all the locations. Average daily air temperature was calculated from by simply making an average of timeseries data from corresponding locations around each lake. This resulted in daily air temperature records for all five Great Lakes (Lake Superior, Lake Huron, Lake Michigan, Lake Erie, and Lake Ontario).
연관 데이터
GLERL Great Lakes Air Temperature/Degree Day Climatology, 1897-1983, Version 1
공공데이터포털
Daily maximum and minimum temperatures for 25 stations around the Great Lakes, 1897 to 1983, were given to NSIDC by the NOAA Great Lakes Environmental Research Laboratory (GLERL), Ann Arbor, MI. Daily data can be used to produce daily maximum, minimum, and mean temperatures, and seasonal accumulations of freezing and thawing degree days. The statistical data are archived in ASCII text files transcribed from 25 reels of 35 mm microfilm, one roll per station. Microfilm rolls are the appendices to Assel (1980), with updates covering 1978 to 1983 spliced to the end of each microfilm roll. Data sources include U.S. Department of Commerce summaries of meteorological data for Minnesota, Michigan, Wisconsin, Illinois, Pennsylvania, and New York, and the monthly meteorological observations published by Environment Canada.
Daily surface temperature predictions for 185,549 U.S. lakes with associated observations and meteorological conditions (1980-2020)
공공데이터포털
Daily lake surface temperatures estimates for 185,549 lakes across the contiguous United States from 1980 to 2020 generated using an entity-aware long short-term memory deep learning model. In-situ measurements used for model training and evaluation are from 12,227 lakes and are included as well as daily meteorological conditions and lake properties. Median per-lake estimated error found through cross validation on lakes with in-situ surface temperature observations was 1.24 °C. The generated dataset will be beneficial for a wide range of applications including estimations of thermal habitats and the impacts of climate change on inland lakes.
Daily surface temperature predictions for 185,549 U.S. lakes with associated observations and meteorological conditions (1980-2020)
공공데이터포털
Daily lake surface temperatures estimates for 185,549 lakes across the contiguous United States from 1980 to 2020 generated using an entity-aware long short-term memory deep learning model. In-situ measurements used for model training and evaluation are from 12,227 lakes and are included as well as daily meteorological conditions and lake properties. Median per-lake estimated error found through cross validation on lakes with in-situ surface temperature observations was 1.24 °C. The generated dataset will be beneficial for a wide range of applications including estimations of thermal habitats and the impacts of climate change on inland lakes.
GLERL Great Lakes Air Temperature/Degree Day Climatology, 1897-1983, Version 1
공공데이터포털
Daily maximum and minimum temperatures for 25 stations around the Great Lakes, 1897 to 1983, were given to NSIDC by the NOAA Great Lakes Environmental Research Laboratory (GLERL), Ann Arbor, MI. Daily data can be used to produce daily maximum, minimum, and mean temperatures, and seasonal accumulations of freezing and thawing degree days. The statistical data are archived in ASCII text files transcribed from 25 reels of 35 mm microfilm, one roll per station. Microfilm rolls are the appendices to Assel (1980), with updates covering 1978 to 1983 spliced to the end of each microfilm roll. Data sources include U.S. Department of Commerce summaries of meteorological data for Minnesota, Michigan, Wisconsin, Illinois, Pennsylvania, and New York, and the monthly meteorological observations published by Environment Canada.
Oceanographic and surface meteorological data collected from station GB17 by University of Wisconsin-Milwaukee and assembled by Great Lakes Observing System (GLOS) in the Great Lakes region from 2014-07-01 to 2020-10-05 (NCEI Accession 0123640)
공공데이터포털
This dataset contains oceanographic and surface meteorological data in netCDF formatted files, which follow the Climate and Forecast metadata convention (CF) and the Attribute Convention for Data Discovery (ACDD). University of Wisconsin-Milwaukee collected the data from station GB17, an in-situ moored station, in the Great Lakes. GLOS, which assembles data from University of Wisconsin-Milwaukee and other sub-regional coastal and ocean observing systems of the Great Lakes region of the United States, submitted the data to NCEI as part of the Integrated Ocean Observing System Data Assembly Centers (IOOS DACs) Data Stewardship Program. Each month, NCEI adds to this dataset the data collected during the previous month.
Oceanographic and surface meteorological water parameter data collected from moored Realtime Coastal Observation Network, ReCON, Muskegon M20 Buoy (NDBC station 45161), Lake Michigan, in the Great Lakes region by NOAA Great Lakes Environmental Research Laboratory from 2020-07-30 to 2020-10-27 (NCEI Accession 0243922)
공공데이터포털
NOAA Great Lakes Environmental Research Laboratory collected the data from moored Realtime Coastal Observation Network, ReCON, Muskegon M20 Buoy (NDBC station 45161), Lake Michigan, an in-situ moored station, in the Great Lakes. Observations have been collected at this location since 2009, this record contains the 2020 observations. Note, the short deployment of this buoy in 2020 is due to COVID-19 and a reduced field work season. This station is also known as NOAA National Data Buoy Center (NDBC) station Muskegon Buoy, MI (45161). A temporal subset of these data are available from NDBC and the Great Lakes Observing System (GLOS) since 2012, this data accession contains the complete record of observations. The ReCON buoy provides continuous, real-time observations facilitates modification of sampling parameters in anticipation of episodic events, facilitates the collection of field samples in response to episodic events, supports long term research, and contributes to sensor and system development. Parameters collected include currents and water temperature. The block of text at the beginning of each file contains information about the location and sensor used to collect data and the data headers followed by the observed data. Column 1 of the data is the timestamp, column 2 is the observed data, and column 3, where applicable, the QARTOD flag. Five QARTOD tests were run including gross range, climatological, spike, rate of change, and flat line tests. The highest value from the five tests were included under the “Qartod” column. If data were known to be invalid, that line of data was removed from the dataset.
Oceanographic and surface meteorological water parameter data collected from moored Realtime Coastal Observation Network, ReCON, Muskegon M45 Buoy, Lake Michigan, in the Great Lakes region by NOAA Great Lakes Environmental Research Laboratory from 2020-07-30 to 2020-10-26 (NCEI Accession 0243994)
공공데이터포털
NOAA Great Lakes Environmental Research Laboratory collected the data from moored Realtime Coastal Observation Network, ReCON, Muskegon M45 Buoy, Lake Michigan, an in-situ moored station, in the Great Lakes. Observations have been collected at this location since 2016, this record contains the 2020 observations. Note, the short deployment of this buoy in 2020 is due to COVID-19 and a reduced field work season. The ReCON buoy provides continuous, real-time observations facilitates modification of sampling parameters in anticipation of episodic events, facilitates collection of field samples in response to episodic events, supports long term research, and contributes to sensor and system development. Parameters collected include currents and water temperature. The block of text at the beginning of each file contains information about the location and sensor used to collect data and the data headers followed by the observed data. Column 1 of the data is the timestamp, column 2 is the observed data, and column 3, where applicable, the QARTOD flag. Five QARTOD tests were run including gross range, climatological, spike, rate of change, and flat line tests. The highest value from the five tests were included under the “Qartod” column. If data were known to be invalid, that line of data was removed from the dataset.
Oceanographic and surface meteorological data collected from MTU1 Buoy by Michigan Technological University and assembled by Great Lakes Observing System (GLOS) in the Great Lakes region from 2014-07-01 to 2020-10-05 (NCEI Accession 0123646)
공공데이터포털
This dataset contains oceanographic and surface meteorological data in netCDF formatted files, which follow the Climate and Forecast metadata convention (CF) and the Attribute Convention for Data Discovery (ACDD). Michigan Technological University collected the data from MTU1 Buoy, an in-situ moored station, in the Great Lakes. GLOS, which assembles data from Michigan Technological University and other sub-regional coastal and ocean observing systems of the Great Lakes region of the United States, submitted the data to NCEI as part of the Integrated Ocean Observing System Data Assembly Centers (IOOS DACs) Data Stewardship Program. Each month, NCEI adds to this dataset the data collected during the previous month.
Daily water column temperature predictions for thousands of Midwest U.S. lakes between 1979-2022 and under future climate scenarios
공공데이터포털

Lake temperature is an important environmental metric for understanding habitat suitability for many freshwater species and is especially useful when temperatures are predicted throughout the water column (known as temperature profiles). In this data release, multiple modeling approaches were used to generate predictions of daily temperature profiles for thousands of lakes in the Midwest.

Predictions were generated using two modeling frameworks: a machine learning model (specifically an entity-aware long short-term memory or EA-LSTM model; Kratzert et al., 2019) and a process-based model (specifically the General Lake Model or GLM; Hipsey et al., 2019). Both the EA-LSTM and GLM frameworks were used to generate lake temperature predictions in the contemporary period (1979-04-12 to 2022-04-11 for EA-LSTM and 1980-01-01 to 2021-12-31 for GLM; times differ due to modeling spin-up/spin-down configurations) using the North American Land Data Assimilation System [NLDAS; Mitchell et al., 2004] as meteorological drivers. In addition, GLM was used to generate lake temperature predictions under future climate scenarios (covering 1981-2000, 2040-2059, and 2080-2099) using six dynamically downscaled Global Climate Models (GCM; Notaro et al., 2018) as meteorological drivers. Appropriate application of the six GCMs is dependent on the use-case and will be up to the user to determine. For an example of a similar analysis in the Midwest and Great Lakes region using 31 GCMs, see Byun and Hamlet, 2018.

The modeling frameworks and driver datasets have slightly different footprints and input data requirements. This means that some of the lakes do not meet the criteria to be included in all three modeling approaches, which results in different numbers of lakes in the output (noted in the file descriptions below). The input data requirements for lakes to be included in the EA-LSTM predictions are lake latitude, longitude, elevation, and surface area, plus NLDAS drivers at the lake's location. All 62,966 lakes included this data release met these requirements. The input data requirements for lakes to be included in the contemporary GLM NLDAS-driven predictions are lake location (within one of the following 11 states: North Dakota, South Dakota, Iowa, Michigan, Indiana, Illinois, Wisconsin, Minnesota, Missouri, Arkansas, and Ohio), latitude, longitude, maximum depth (though more detailed hypsography was used where available), surface area, and a clarity esitmate, plus NLDAS drivers at the lake's location. 12,688 lakes included this data release met these requirements. The input data requirements for lakes to be included in the future climate scenario GCM-driven predictions were the same as for the contemporary GLM predictions, except GCM drivers at the lake's location were required in place of NLDAS drivers. 11,715 lakes included this data release met these requirements.

This data release includes the following files:

  1. lake_locations.zip: shapefiles with the centroid of each lake (62,966 lakes)
  2. lake_metadata.csv: metadata for each lake with predictions available (62,966 lakes)
  3. lake_id_crosswalk.csv: mapping between the identifications for lakes used in this data release to state and other organization systems
  4. lake_hypsography.csv: lake-specific area-depth relationships (13,785 lakes)
  5. lake_temperature_observations.zip: temperature observational data used in training and/or evaluation (8,760 lakes)
  6. meteorological_inputs_GCM.zip: meteorological input data for future climate scenarios, zipped NetCDF files. One NetCDF file per climate model (see the "lake_metadata.csv" file for how to map the lakes to the cells in these NetCDF files).
  7. meteorological_inputs_NLDAS_{GROUP}.zip: meteorological input data for the contemporary period organized into grids, groups of zipped CSV files (see the "lake_metadata.csv"
Oceanographic and surface meteorological water parameter data collected from moored Realtime Coastal Observation Network, ReCON, Alpena Buoy (NDBC station 45162), Lake Huron, in the Great Lakes region by NOAA Great Lakes Environmental Research Laboratory from 2020-08-07 to 2020-10-20 (NCEI Accession 0244738)
공공데이터포털
NOAA Great Lakes Environmental Research Laboratory collected the data from moored Realtime Coastal Observation Network, ReCON, Alpena Buoy (NDBC station 45162), Lake Huron, an in-situ moored station, in the Great Lakes. Observations have been collected at this location since 2005, this record contains the 2020 observations. Note, the short deployment of this buoy in 2020 is due to COVID-19 and a reduced field work season. This station is also known as NOAA National Data Buoy Center (NDBC) station Thunder Bay Buoy, Alpena, MI (45162). A temporal subset of these data are available from NDBC and the Great Lakes Observing System (GLOS) since 2012, this data accession contains the complete record of observations. The ReCON buoy provides continuous, real-time observations facilitates modification of sampling parameters in anticipation of episodic events, facilitates collection of field samples in response to episodic events, supports long term research, and contributes to sensor and system development. Parameters collected include currents and water temperature. The block of text at the beginning of each file contains information about the location and sensor used to collect data and the data headers followed by the observed data. Column 1 of the data is the timestamp, column 2 is the observed data, and column 3, where applicable, the QARTOD flag. Five QARTOD tests were run including gross range, climatological, spike, rate of change, and flat line tests. The highest value from the five tests were included under the “Qartod” column. If data were known to be invalid, that line of data was removed from the dataset.