Dataset details
United States
Meta data and supporting documentation
We include a description of the data sets in the metadata as well as sample code and results from a simulated data set.
This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed.
It can be accessed through the following means: The R code is available online here: https://github.com/warrenjl/SpGPCW.
Format:
Abstract
The data used in the application section of the manuscript consist of geocoded birth records from the North Carolina State Center for Health Statistics, 2005-2008. In the simulation study section of the manuscript, we simulate synthetic data that closely match some of the key features of the birth certificate data while maintaining confidentiality of any actual pregnant women.
Availability
Due to the highly sensitive and identifying information contained in the birth certificate data (including latitude/longitude and address of residence at delivery), we are unable to make the data from the application section publicly available. However, we will make one of the simulated datasets available for any reader interested in applying the method to realistic simulated birth records data. This will also allow the user to become familiar with the required inputs of the model, how the data should be structured, and what type of output is obtained. While we cannot provide the application data here, access to the North Carolina birth records can be requested through the North Carolina State Center for Health Statistics and requires an appropriate data use agreement.
Description
Permissions: These are simulated data without any identifying information or informative birth-level covariates. We also standardize the pollution exposures for each week by subtracting the median exposure for that week and dividing by the interquartile range (IQR), as in the actual application to the true NC birth records data. The dataset that we provide includes weekly average pregnancy exposures that have already been standardized in this way, while the medians and IQRs are not given. This further protects against identification of the spatial locations used in the analysis.
File format: R workspace file.
Metadata (including data dictionary)
• y: Vector of binary responses (1: preterm birth, 0: control)
• x: Matrix of covariates; one row for each simulated individual
• z: Matrix of standardized pollution exposures
• n: Number of simulated individuals
• m: Number of exposure time periods (e.g., weeks of pregnancy)
• p: Number of columns in the covariate design matrix
• alpha_true: Vector of “true” critical window locations/magnitudes (i.e., the ground truth that we want to estimate)
This dataset is associated with the following publication: Warren, J., W. Kong, T. Luben, and H. Chang. Critical Window Variable Selection: Estimating the Impact of Air Pollution on Very Preterm Birth. Biostatistics. Oxford University Press, OXFORD, UK, 1-30, (2019).
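The weekly standardization and the data dictionary described above can be illustrated with a minimal Python sketch. This is not the authors' R code from the repository; the function names `standardize_by_week` and `check_dimensions` are hypothetical, and the quartile convention (`method="inclusive"`) is an assumption, since the description does not say how the IQR was computed.

```python
from statistics import median, quantiles


def standardize_by_week(z):
    """Standardize each column (week) of an n-by-m exposure matrix.

    For every week, subtract that week's median and divide by that
    week's interquartile range (IQR). The "inclusive" quartile rule is
    an assumption; the source does not specify one.
    """
    n_weeks = len(z[0])
    if any(len(row) != n_weeks for row in z):
        raise ValueError("every row must have one value per week")
    out = [[0.0] * n_weeks for _ in z]
    for j in range(n_weeks):
        column = [row[j] for row in z]
        q1, _, q3 = quantiles(column, n=4, method="inclusive")
        iqr = q3 - q1
        if iqr == 0:
            raise ValueError(f"week {j} has zero IQR")
        med = median(column)
        for i, value in enumerate(column):
            out[i][j] = (value - med) / iqr
    return out


def check_dimensions(y, x, z):
    """Return (n, m, p) after checking that y, x, and z agree on n."""
    n = len(y)
    if len(x) != n or len(z) != n:
        raise ValueError("y, x, and z must describe the same n individuals")
    if any(v not in (0, 1) for v in y):
        raise ValueError("y must be binary (1: preterm birth, 0: control)")
    return n, len(z[0]), len(x[0])
```

Because the provided simulated dataset is already standardized, a function like `standardize_by_week` would only be needed when preparing new, unstandardized exposures in the same layout. `check_dimensions` simply confirms that the inputs match the dictionary entries n, m, and p.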
Data information
Related data
Data associated with Wallis et al. 2024
Public Data Portal
Metadata supporting Wallis et al. 2024 in Environment International. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Data from the National Children's Study must be accessed through the National Institutes of Health, National Institute of Child Health and Human Development's Data and Specimen Hub (DASH) at https://dash.nichd.nih.gov/. Format: Participant demographic, lifestyle, residence, occupational, and other types of data from questionnaire and observational survey instruments are in .csv and .xlsx files. PFAS measurements in serum and house dust in .csv files. This dataset is associated with the following publication: Wallis, D., K. Miller, N. Deluca, K. Thomas, C. Fuller, J. McCord, E. Cohen-Hubal, and J. Minucci. Understanding prenatal household exposures to per- and polyfluorylalkyl substances using paired Biological and dust measurements with sociodemographic and housing variables. ENVIRONMENT INTERNATIONAL. Elsevier B.V., Amsterdam, NETHERLANDS, 194(December): 109157, (2024).
Dataset for non-targeted urinary biomarkers
Public Data Portal
This dataset contains a summary of compounds found in human urine samples. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: The original dataset contains identification information for the sample subjects and all of their descriptors including age, gender, race, and medical screening information. The analyzed data cannot be made publicly available. Format: This dataset contains a summary of compounds found in human urine samples. This dataset is associated with the following publication: O’Lenick, C., J. Pleil, M. Stiegel, J. Sobus, and A. Wallace. Detection and analysis of endogenous polar volatile organic compounds (PVOCs) in urine for human exposome research. BIOMARKERS. Taylor & Francis, Inc., Philadelphia, PA, USA, 24(3): 240-248, (2019).
Data for this project include human subjects PII and cannot be shared.
Public Data Portal
Data on approximately 2 million births occurring in NJ, OH, and PA from 2000 - 2005. Linked to PM2.5 and ozone concentration estimates from EPA CMAQ fused model. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Birth data can be acquired through application to the state health statistics departments of NJ, OH, and PA. Contact author for code. rappazzo.kristen@epa.gov. Format: No data included. This dataset is associated with the following publication: Rappazzo, K., D. Lobdell, L. Messer, C. Poole, and J. Daniels. Comparison of gestational dating methods and implications for exposure-outcome associations: an example with PM2.5 and preterm birth. JOURNAL OF OCCUPATIONAL AND ENVIRONMENTAL MEDICINE. Lippincott Williams & Wilkins, Philadelphia, PA, USA, 74(2): 138-143, (2017).
Temp air preterm NC JUN2024
Public Data Portal
Dataset is birth registry data from NC for 2006 - 2015, linked to air pollution and temperature exposures. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Health data can be requested through the North Carolina Department of Health and Human Services. Air pollution data can be accessed through the EPA RSIG portal. Temperature data can be requested from the CDC. Format: SAS, R, and csv files. This dataset is associated with the following publication: Krajewski, A., B. Alman, A. Vaidyanathan, T. Luben, and K. Rappazzo. Effect of extreme heat exposure on the associations between weekly gestational exposure to fine particulate matter and preterm birth in a North Carolina birth cohort. Presented at Society of Epidemiologic Research (SER) and Society of Pediatric and Perinatal Epidemiologic Research (SPER), Austin, TX, USA, 06/16/2024 - 06/21/2024.
CARES COPD casecrossover
Public Data Portal
Information on hospitalizations of COPD patients from electronic health records linked to air pollution concentrations for the study period. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: The data can be requested through NC TraCS: https://tracs.unc.edu/index.php/services/comparative-effectiveness-research/data-linkage. Format: Data used in this analysis include electronic health records from the UNC healthcare system. This dataset is associated with the following publication: Cowan, K., L. Wyatt, T. Luben, J. Sacks, C. Ward-Caviness, and K. Rappazzo. Effect measure modification of the association between short-term exposures to PM2.5 and hospitalizations by longs-term PM2.5 exposure among a cohort of people with Chronic Obstructive Pulmonary Disease (COPD) in North Carolina, 2002–2015. ENVIRONMENTAL HEALTH. Academic Press Incorporated, Orlando, FL, USA, 22: 49, (2023).
Ewha Womans University Industry-Academic Cooperation Foundation - Preterm birth risk index due to air pollution
Public Data Portal
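■ Product description and features
- Personally identifiable information of the recruited participants has been de-identified in this data.
- All data have been approved by an Institutional Review Board (IRB).
- To support infectious disease management for pregnant women, this data covers pregnant women who visited 13 university hospitals nationwide, including both those diagnosed with COVID-19 and those not diagnosed.
■ Data coverage
- Data from 2020 to 2021 are provided and are updated every 3 months.
■ Notes
(1) Coding
- Condition present: 1
- Condition absent: 0
(2) Gestational age by trimester
- First trimester: up to 14 weeks 0 days
- Second trimester: 14 weeks 1 day to 28 weeks 0 days
- Third trimester: 28 weeks 1 day onward
- Fine dust (particulate matter) unit: µg/m³
■ Data product summary
- Air pollution index to which pregnant women are estimated to have been exposed, occurrence of preterm birth and pregnancy complications, and fine dust concentration data
■ Usage example
- Risk analysis between air quality data and pregnancy complication data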
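The coding rules in the notes above can be written as a short Python sketch. The function names `trimester` and `encode_condition` are hypothetical and do not come from the dataset. The sketch assumes gestational age is recorded as completed weeks plus 0 to 6 days, and that the third trimester covers everything from 28 weeks 1 day onward.

```python
def trimester(weeks: int, days: int = 0) -> int:
    """Return 1, 2, or 3 for a gestational age of `weeks` weeks + `days` days.

    Cut points follow the notes: up to 14w0d is the first trimester,
    14w1d to 28w0d the second, and 28w1d onward the third (the open
    upper end of the third trimester is an assumption).
    """
    if weeks < 0 or not 0 <= days <= 6:
        raise ValueError("expected weeks >= 0 and 0 <= days <= 6")
    total_days = weeks * 7 + days
    if total_days <= 14 * 7:
        return 1
    if total_days <= 28 * 7:
        return 2
    return 3


def encode_condition(present: bool) -> int:
    """Binary coding used in the data: 1 if the condition is present, else 0."""
    return 1 if present else 0
```

For example, `trimester(14, 0)` falls in the first trimester while `trimester(14, 1)` falls in the second, which matches the boundaries listed in the notes.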
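Ewha Womans University Industry-Academic Cooperation Foundation - Air pollution and COVID-19 infection data for pregnant women in Seoul
Public Data Portal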
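■ Product description and features
- Personally identifiable information of the recruited participants has been de-identified in this data.
- All data have been approved by an Institutional Review Board (IRB).
- To support infectious disease management for pregnant women, this data covers pregnant women who visited 13 university hospitals nationwide, including both those diagnosed with COVID-19 and those not diagnosed.
■ Data coverage
- Data from 2020 to 2021 are provided and are updated every 3 months.
■ Notes
(1) Coding
- Condition present: 1
- Condition absent: 0
(2) Gestational age by trimester
- First trimester: up to 14 weeks 0 days
- Second trimester: 14 weeks 1 day to 28 weeks 0 days
- Third trimester: 28 weeks 1 day onward
- Fine dust (particulate matter) unit: µg/m³
■ Data product summary
- Clinical information on pregnant women in the Seoul area, outdoor environmental data, and data on COVID-19 infection
■ Usage example
- Correlation analysis between air pollution concentrations in Seoul and COVID-19 infection rates, analysis of the effects of air quality improvement measures, and health risk mapping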
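Ministry of Gender Equality and Family - Information service on designation criteria for child care service providers
Public Data Portal
Provides information on the designation criteria for child care service providers.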
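Jeju Data Hub - Postpartum care business information
Public Data Portal
- Provides information on facilities that manage the health and hygiene of mothers and newborns after childbirth. - The coordinates in the original file use the central-origin TM (EPSG:2097) coordinate system. Because they could not be converted accurately to latitude/longitude (WGS84), coordinate information has been excluded from the file. The original coordinates are available on the data provider's site. - Data provider: LOCALDATA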