데이터셋 상세
미국
Datasets for Integrating stream gage data and Landsat imagery to complete time-series of surface water extents in Central Valley, California
This data release comprises the data files and code necessary to perform all analyses presented in the associated publication. The *.csv data files are aggregations of water extent on the basis of the European Commission's Joint Research Centre (JRC) Monthly Water History database (v1.0) and the Dynamic Surface Water Extent (DSWE) algorithm. The shapefile dataset contains the study area 8-digit hydrologic unit code (HUC) regions used as the basis for analysis. Html files provide an overview of the study workflow and integrated R notebooks (in .Rmd format) for recreating all project results and plots. The R notebook ingest the necessary data files from their online locations. These data support the following publication: Walker JJ, Soulard CE, Petrakis RE. In press. Integrating stream gage data and Landsat imagery to complete time-series of surface water extents in Central Valley, California. International Journal of Applied Earth Observation and Geoinformation, http://dx.doi.org/xx.xxxxx/
데이터 정보
연관 데이터
Datasets for Integrating stream gage data and Landsat imagery to complete time-series of surface water extents in Central Valley, California
공공데이터포털
This data release comprises the data files and code necessary to perform all analyses presented in the associated publication. The *.csv data files are aggregations of water extent on the basis of the European Commission's Joint Research Centre (JRC) Monthly Water History database (v1.0) and the Dynamic Surface Water Extent (DSWE) algorithm. The shapefile dataset contains the study area 8-digit hydrologic unit code (HUC) regions used as the basis for analysis. Html files provide an overview of the study workflow and integrated R notebooks (in .Rmd format) for recreating all project results and plots. The R notebook ingest the necessary data files from their online locations. These data support the following publication: Walker JJ, Soulard CE, Petrakis RE. In press. Integrating stream gage data and Landsat imagery to complete time-series of surface water extents in Central Valley, California. International Journal of Applied Earth Observation and Geoinformation, http://dx.doi.org/xx.xxxxx/
Example Groundwater-Level Datasets and Benchmarking Results for the Automated Regional Correlation Analysis for Hydrologic Record Imputation (ARCHI) Software Package
공공데이터포털
This data release provides two example groundwater-level datasets used to benchmark the Automated Regional Correlation Analysis for Hydrologic Record Imputation (ARCHI) software package (Levy and others, 2024). The first dataset contains groundwater-level records and site metadata for wells located on Long Island, New York (NY) and some surrounding mainland sites in New York and Connecticut. The second dataset contains groundwater-level records and site metadata for wells located in the southeastern San Joaquin Valley of the Central Valley, California (CA). For ease of exposition these are referred to as NY and CA datasets, respectively. Both datasets are formatted with column headers that can be read by the ARCHI software package within the R computing environment. These datasets were used to benchmark the imputation accuracy of three ARCHI model settings (OLS, ridge, and MOVE.1) against the widely used imputation program missForest (Stekhoven and Bühlmann, 2012). The ARCHI program was used to process the NY and CA datasets on monthly and annual timesteps, respectively, filter out sites with insufficient data for imputation, and create 200 test datasets from each of the example datasets with 5 percent of observations removed at random (herein, referred to as "holdouts"). Imputation accuracy for test datasets was assessed using normalized root mean square error (NRMSE), which is the root mean square error divided by the standard deviation of the observed holdout values. ARCHI produces prediction intervals (PIs) using a non-parametric bootstrapping routine, which were assessed by computing a coverage rate (CR) defined as the proportion of holdout observations falling within the estimated PI. The multiple regression models included with the ARCHI package (OLS and ridge) were further tested on all test datasets at eleven different levels of the p_per_n input parameter, which limits the maximum ratio of regression model predictors (p) per observations (n) as a decimal fraction greater than zero and less than or equal to one. This data release contains ten tables formatted as tab-delimited text files. The “CA_data.txt” and “NY_data.txt” tables contain 243,094 and 89,997 depth-to-groundwater measurement values (value, in feet below land surface) indexed by site identifier (site_no) and measurement date (date) for CA and NY datasets, respectively. The “CA_sites.txt” and “NY_sites.txt” tables contain site metadata for the 4,380 and 476 unique sites included in the CA and NY datasets, respectively. The “CA_NRMSE.txt” and “NY_NRMSE.txt” tables contain NRMSE values computed by imputing 200 test datasets with 5 percent random holdouts to assess imputation accuracy for three different ARCHI model settings and missForest using CA and NY datasets, respectively. The “CA_CR.txt” and “NY_CR.txt” tables contain CR values used to evaluate non-parametric PIs generated by bootstrapping regressions with three different ARCHI model settings using the CA and NY test datasets, respectively. The “CA_p_per_n.txt” and “NY_p_per_n.txt” tables contain mean NRMSE values computed for 200 test datasets with 5 percent random holdouts at 11 different levels of p_per_n for OLS and ridge models compared to training error for the same models on the entire CA and NY datasets, respectively. References Cited Levy, Z.F., Stagnitta, T.J., and Glas, R.L., 2024, ARCHI: Automated Regional Correlation Analysis for Hydrologic Record Imputation, v1.0.0: U.S. Geological Survey software release, https://doi.org/10.5066/P1VVHWKE. Stekhoven, D.J., and Bühlmann, P., 2012, MissForest—non-parametric missing value imputation for mixed-type data: Bioinformatics 28(1), 112-118. https://doi.org/10.1093/bioinformatics/btr597.
National Hydrography Data - NHD and 3DHP
공공데이터포털
The USGS National Hydrography Dataset (NHD) Downloadable Data Collection from The National Map (TNM) is a comprehensive set of digital spatial data that encodes information about naturally occurring and constructed bodies of surface water (lakes, ponds, and reservoirs), paths through which water flows (canals, ditches, streams, and rivers), and related entities such as point features (springs, wells, stream gages, and dams). The information encoded about these features includes classification and other characteristics, delineation, geographic name, position and related measures, a "reach code" through which other information can be related to the NHD, and the direction of water flow. The network of reach codes delineating water and transported material flow allows users to trace movement in upstream and downstream directions. In addition to this geographic information, the dataset contains metadata that supports the exchange of future updates and improvements to the data. The NHD supports many applications, such as making maps, geocoding observations, flow modeling, data maintenance, and stewardship. For additional information on NHD, go to https://www.usgs.gov/core-science-systems/ngp/national-hydrography. DWR was the steward for NHD and Watershed Boundary Dataset (WBD) in California. We worked with other organizations to edit and improve NHD and WBD, using the business rules for California. California's NHD improvements were sent to USGS for incorporation into the national database. The most up-to-date products are accessible from the USGS website. Please note that the California portion of the National Hydrography Dataset is appropriate for use at the 1:24,000 scale. For additional derivative products and resources, including the major features in geopackage format, please go to this page: https://data.cnra.ca.gov/dataset/nhd-major-features Archives of previous statewide extracts of the NHD going back to 2018 may be found at https://data.cnra.ca.gov/dataset/nhd-archive. In September 2022, USGS officially notified DWR that the NHD would become static as USGS resources will be devoted to the transition to the new 3D Hydrography Program (3DHP). 3DHP will consist of LiDAR-derived hydrography at a higher resolution than NHD. Upon completion, 3DHP data will be easier to maintain, based on a modern data model and architecture, and better meet the requirements of users that were documented in the Hydrography Requirements and Benefits Study (2016). The initial releases of 3DHP will be the NHD data cross-walked into the 3DHP data model. It will take several years for the 3DHP to be built out for California. Please refer to the resources on this page for more information. The FINAL,STATIC version of the National Hydrography Dataset for California was published for download by USGS on December 27, 2023. This dataset can no longer be edited by the state stewards. The first public release of the 3D Hydrography Program map service may be accessed at https://hydro.nationalmap.gov/arcgis/rest/services/3DHP_all/MapServer. Questions about the California stewardship of these datasets may be directed to nhd_stewardship@water.ca.gov.
Land Cover Maps for the Scotts Creek Watershed, Lake County, California for 2018, 2020, and 2022
공공데이터포털
The USGS, in cooperation with the U.S. Bureau of Land Management (BLM), created a series of geospatial products of the Scotts Creek Watershed in Lake County, California, using National Agriculture Imagery Program (NAIP) imagery from 2018, 2020 and 2022. The imagery was downloaded from United States Department of Agriculture (USDA) - Natural Resources Conservation Service (NRCS) Geospatial Data Gateway (https://datagateway.nrcs.usda.gov). The NAIP imagery from 2018, 2020 and 2022 was classified using Random Forest Modeling to produce land cover maps with three main classifications – bare, vegetation, and shadows. A total of 600 independent reference points were used in the accuracy assessment. The overall accuracy for all classes for each dataset is 98%. See attached ScottsCreek_20XX_AccuracyAssessment.csv files (contained within each LandCoverMap_associated_files_20XX.zip for each year respectively) for details. A preview image of the land cover map for 2018 is attached to this data release as an example (see LandCoverMap_RF_ScottsCreekWatershed_USGS2022_CC0.png). The percentage of bare, vegetation and shadow pixels were calculated for the complete watershed and each individual NHDPlus2.1 catchment basins (slightly modified to support hydrological modeling). These metrics can be used to quantify bare and vegetated areas and detect and quantify vegetation changes over time. Users should be aware of the inherent errors in remote sensing products.
Land Cover Maps for the Scotts Creek Watershed, Lake County, California for 2018, 2020, and 2022
공공데이터포털
The USGS, in cooperation with the U.S. Bureau of Land Management (BLM), created a series of geospatial products of the Scotts Creek Watershed in Lake County, California, using National Agriculture Imagery Program (NAIP) imagery from 2018, 2020 and 2022. The imagery was downloaded from United States Department of Agriculture (USDA) - Natural Resources Conservation Service (NRCS) Geospatial Data Gateway (https://datagateway.nrcs.usda.gov). The NAIP imagery from 2018, 2020 and 2022 was classified using Random Forest Modeling to produce land cover maps with three main classifications – bare, vegetation, and shadows. A total of 600 independent reference points were used in the accuracy assessment. The overall accuracy for all classes for each dataset is 98%. See attached ScottsCreek_20XX_AccuracyAssessment.csv files (contained within each LandCoverMap_associated_files_20XX.zip for each year respectively) for details. A preview image of the land cover map for 2018 is attached to this data release as an example (see LandCoverMap_RF_ScottsCreekWatershed_USGS2022_CC0.png). The percentage of bare, vegetation and shadow pixels were calculated for the complete watershed and each individual NHDPlus2.1 catchment basins (slightly modified to support hydrological modeling). These metrics can be used to quantify bare and vegetated areas and detect and quantify vegetation changes over time. Users should be aware of the inherent errors in remote sensing products.
Streamflow-gain- and streamflow-loss data for streamgages in the Central Valley Hydrologic Model
공공데이터포털
This digital dataset contains 61 sets of annual streamflow gains and losses between 1961 and 1977 along Central Valley surface-water network for the Central Valley Hydrologic Model (CVHM). The Central Valley encompasses an approximate 50,000 square-kilometer region of California. The complex hydrologic system of the Central Valley is simulated using the USGS's numerical modeling code MODFLOW-FMP (Schmid and others, 2006). This simulation is referred to here as the CVHM (Faunt, 2009). Utilizing MODFLOW-FMP, the CVHM simulates groundwater and surface-water flow, irrigated agriculture, land subsidence, and other key processes in the Central Valley on a monthly basis from 1961-2003. The total active modeled area is 20,334 square-miles. The CVHM includes complex surface-water management processes. The hydrology of the present-day Central Valley and the CVHM model are driven by surface-water deliveries and associated groundwater pumpage. The Streamflow Routing Package (SFR1) is linked to MODFLOW-FMP to facilitate the simulated conveyance of surface-water deliveries. If surface-water deliveries do not meet the farm-delivery requirement, the FMP invokes simulated groundwater pumping to meet the demand. The surface-water network represents a subset of the entire stream network in the valley. Quantitative observations of streamflow gains and losses were available for 57 reaches of 20 major stream systems in the Central Valley for water years 1961-77 (Mullen and Nady, 1985). These observations were included in parameter estimation process and in the model-fit statistics. The CVHM is the most recent regional-scale model of the Central Valley developed by the U.S. Geological Survey (USGS). The CVHM was developed as part of the USGS Groundwater Resources Program (see "Foreword", Chapter A, page iii, for details).
Streamflow-gain- and streamflow-loss data for streamgages in the Central Valley Hydrologic Model
공공데이터포털
This digital dataset contains 61 sets of annual streamflow gains and losses between 1961 and 1977 along Central Valley surface-water network for the Central Valley Hydrologic Model (CVHM). The Central Valley encompasses an approximate 50,000 square-kilometer region of California. The complex hydrologic system of the Central Valley is simulated using the USGS's numerical modeling code MODFLOW-FMP (Schmid and others, 2006). This simulation is referred to here as the CVHM (Faunt, 2009). Utilizing MODFLOW-FMP, the CVHM simulates groundwater and surface-water flow, irrigated agriculture, land subsidence, and other key processes in the Central Valley on a monthly basis from 1961-2003. The total active modeled area is 20,334 square-miles. The CVHM includes complex surface-water management processes. The hydrology of the present-day Central Valley and the CVHM model are driven by surface-water deliveries and associated groundwater pumpage. The Streamflow Routing Package (SFR1) is linked to MODFLOW-FMP to facilitate the simulated conveyance of surface-water deliveries. If surface-water deliveries do not meet the farm-delivery requirement, the FMP invokes simulated groundwater pumping to meet the demand. The surface-water network represents a subset of the entire stream network in the valley. Quantitative observations of streamflow gains and losses were available for 57 reaches of 20 major stream systems in the Central Valley for water years 1961-77 (Mullen and Nady, 1985). These observations were included in parameter estimation process and in the model-fit statistics. The CVHM is the most recent regional-scale model of the Central Valley developed by the U.S. Geological Survey (USGS). The CVHM was developed as part of the USGS Groundwater Resources Program (see "Foreword", Chapter A, page iii, for details).
Relative distance of California's Central Valley from trough to valley edge and supporting data
공공데이터포털
California's Central Valley ranges from the mountain fronts toward a central trough, mainly defined by the San Joaquin and Sacramento Rivers, and the relative distance from trough to valley edges is of interest. This data release provides supplemental data for the USGS Professional Paper 1766, titled Groundwater Availability of the Central Valley Aquifer, California and provides geographic information systems (GIS) datasets containing this relative distance grid and supporting data. Included in this data release are shapefiles used to define the Central Valley study area, the Central Valley trough, and a relative distance grid that may be used to spatially define other GIS data into zones between the edge of the Central Valley and the trough. These relative distances were calculated as part of groundwater availability study documented in the Professional Paper, for a 30 x 30-meter cell size grid for the Central Valley. The edge of the valley was represented by the boundary of the valley fill deposits and was assigned an arbitrary value of 1000. The valley trough was represented by the division of California's Department of Water Resource's groundwater subbasins from west to east, from the intersection of Enterprise, Anderson, and Millville subbasins in the north to the Westside and Kings subbasins in the south with an extended line through historic lakes Tulare, Buena Vista, and Kern. This valley trough was assigned a value of 0 which included the boundaries of the historic lakes.
Relative distance of California's Central Valley from trough to valley edge and supporting data
공공데이터포털
California's Central Valley ranges from the mountain fronts toward a central trough, mainly defined by the San Joaquin and Sacramento Rivers, and the relative distance from trough to valley edges is of interest. This data release provides supplemental data for the USGS Professional Paper 1766, titled Groundwater Availability of the Central Valley Aquifer, California and provides geographic information systems (GIS) datasets containing this relative distance grid and supporting data. Included in this data release are shapefiles used to define the Central Valley study area, the Central Valley trough, and a relative distance grid that may be used to spatially define other GIS data into zones between the edge of the Central Valley and the trough. These relative distances were calculated as part of groundwater availability study documented in the Professional Paper, for a 30 x 30-meter cell size grid for the Central Valley. The edge of the valley was represented by the boundary of the valley fill deposits and was assigned an arbitrary value of 1000. The valley trough was represented by the division of California's Department of Water Resource's groundwater subbasins from west to east, from the intersection of Enterprise, Anderson, and Millville subbasins in the north to the Westside and Kings subbasins in the south with an extended line through historic lakes Tulare, Buena Vista, and Kern. This valley trough was assigned a value of 0 which included the boundaries of the historic lakes.
Geospatial data, flood-frequency analysis, and surface-water model archive for streamflow extents and hydraulic characteristics of Meadow Valley Wash at Stuart Ranch, near Rox, Nevada
공공데이터포털
This U.S. Geological Survey data release consists of multiple datasets used to simulate the streamflow extents and hydraulic characteristics of Meadow Valley Wash at Stuart Ranch near Rox, Nevada. Streamflow extents equal the maximum area of flow inundation and were estimated using a two-dimensional (2D) hydraulic model. The modeled extents represent baseflow and six annual exceedance probabilities simulated for the current (2021) topography and modified topography associated with possible restoration of Stuart Ranch along Meadow Valley Wash. The data release includes: 1) a zip file containing a shapefile of a polygon dataset of the streamflow-inundation extents (Stuart_streamflow_extent.shp); 2) a zip file containing all raster datasets of streamflow depth (Stuart_Depth.zip); 3) a zip file containing all raster datasets of streamflow velocity (Stuart_Velocity.zip); 4) a zip file containing all relevant files to document and run the PeakFQ flood-frequency analysis used as input into the hydraulic model (09418700_Stuart_Flood_Frequency_Archive.zip); 5) a zip file containing all relevant files to document and run the 2D Hydrological Engineering Center-River Analysis System (HEC-RAS) hydraulic model used to generate a polygon dataset of streamflow-inundation extents (Stuart_SWmodel_Archive.zip); 6) a zip file containing a raster dataset of a digital elevation model (Stuart_DEM.tif) derived from survey points; 7) a zip file containing two polygon datasets of the restoration modifications used in conjunction with the Stuart_DEM to model possible restoration conditions (Beaver_Dam_Modification.shp and SideChannels_IrrigationDig_Modifications.shp); 8) a zip file containing a point dataset of survey points (SurveyData.shp) collected from December 14, 2020 to January 11, 2024, using real-time kinematic global navigation satellite system (GNSS), static GNSS, total station (TS), manual entries, and filtered ground observations from TLS surveys; and 9) a point dataset of lidar points (Stuart_Raw_TLS.las) collected at 44 scan locations by TLS surveys.