Dataset Details
United States
LOINC
Logical Observation Identifiers Names and Codes (LOINC) is a database and universal standard for identifying medical laboratory observations. This dataset is used as an ontology for phenotyping with natural language processing (NLP).
Data Information
Related Data
RxNorm
Public Data Portal
Ontology for use in Phenotyping Natural Language Processing (NLP)
Code used to produce terms list in the work "NLP-Driven Electron Microscopy Ontology Development"
Public Data Portal
This is a collection of code written by Maurice Curran that was used to process the Microscopy and Microanalysis conference proceedings corpus into the word products described in the publication "NLP-Driven Electron Microscopy Ontology Development". The scripts are written in Python and are to be used in the following order: 1. SettingUpTextFiles.py and CopyingText.py to get the raw text files; 2. SentenceConversion.py; 3. reference_remover.py; 4. testing.py and testingavg.py; 5. SentenceCreator.py; 6. matscholar_model.py to get matscholar tags; 7. training_model_gensim.py to get the gensim model; 8. word2vecscript.py and gensim_visual.py.
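The run order listed above could be captured in a small driver script. The following is only a minimal sketch, not part of the published code: it assumes every script sits in one directory and runs without command-line arguments, and the names `PIPELINE_STEPS`, `ordered_scripts`, and `run_pipeline` are hypothetical helpers introduced here for illustration.

```python
import subprocess
import sys
from pathlib import Path

# Script names and their order are taken from the dataset description.
# Each inner list is one numbered step; scripts within a step run in listed order.
PIPELINE_STEPS = [
    ["SettingUpTextFiles.py", "CopyingText.py"],   # 1. get the raw text files
    ["SentenceConversion.py"],                     # 2.
    ["reference_remover.py"],                      # 3.
    ["testing.py", "testingavg.py"],               # 4.
    ["SentenceCreator.py"],                        # 5.
    ["matscholar_model.py"],                       # 6. matscholar tags
    ["training_model_gensim.py"],                  # 7. gensim model
    ["word2vecscript.py", "gensim_visual.py"],     # 8.
]


def ordered_scripts(steps=PIPELINE_STEPS):
    """Flatten the numbered steps into a single ordered list of script names."""
    return [script for step in steps for script in step]


def run_pipeline(script_dir, dry_run=True):
    """Run (or, with dry_run=True, only print) each script in order.

    Assumption: the scripts take no arguments. Stops at the first failure
    because subprocess.run is called with check=True.
    """
    script_dir = Path(script_dir)
    commands = [[sys.executable, str(script_dir / name)] for name in ordered_scripts()]
    for command in commands:
        if dry_run:
            print(" ".join(command))
        else:
            subprocess.run(command, check=True)
    return commands
```

Calling `run_pipeline("path/to/scripts")` prints the commands without executing anything; passing `dry_run=False` would execute them one after another.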
NHRIC (National Health Related Items Code)
Public Data Portal
The National Health Related Items Code (NHRIC) is a system for identification and numbering of marketed device packages that is compatible with other numbering systems such as the National Drug Code (NDC) or Universal Product Code (UPC). Those manufacturers who desire to use the NHRIC number for unique product identification may apply to FDA for a labeler code. This database contains NHRIC data retrieved from records that date back 20 years.
RxNorm
Public Data Portal
RxNorm is a normalized naming system for generic and branded drugs, developed by the U.S. National Library of Medicine to support semantic interoperability between drug terminologies and pharmacy systems.
LymeDisease 9211 county
Public Data Portal
To facilitate the public health and research community's access to NNDSS data on Lyme disease, CDC has developed a public use dataset. Based on reports submitted to CDC, this dataset provides the number of confirmed cases by county for the years 1992–2011, in four 5-year intervals. County tabulation is by American National Standards Institute (ANSI) [formerly Federal Information Processing Standard (FIPS)] codes. County codes of "0" represent "unknown" county of residence within each state. More recent county-level case counts are not publicly available at this time.
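When working with county-coded rows like these, the "0" county code needs separate handling so that unknown-residence counts are not joined to a real county. The sketch below is illustrative only: the field names `state_code`, `county_code`, and `cases` are hypothetical (the actual column names in the CDC file may differ), and the sample rows are made-up placeholders, not real case counts. It relies on the standard layout of a county code as a 2-digit state code followed by a 3-digit county code.

```python
def county_fips(state_code, county_code):
    """Return the 5-digit state+county code, or None when the county is unknown.

    A county code of 0 marks an unknown county of residence within the state.
    """
    county = int(county_code)
    if county == 0:
        return None
    return f"{int(state_code):02d}{county:03d}"


def split_known_unknown(rows):
    """Separate rows with a real county from rows coded as unknown (county 0)."""
    known, unknown = [], []
    for row in rows:
        code = county_fips(row["state_code"], row["county_code"])
        target = known if code is not None else unknown
        target.append({**row, "fips": code})
    return known, unknown


# Made-up placeholder rows for illustration; not actual CDC data.
sample_rows = [
    {"state_code": "9", "county_code": "1", "cases": 10},
    {"state_code": "9", "county_code": "0", "cases": 3},
]
known, unknown = split_known_unknown(sample_rows)
```

Keeping the unknown-county rows in their own list lets state-level totals still include them while county-level joins use only `known`.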
NLP-Driven Microscopy Ontology Development - Raw data DOIs
Public Data Portal
This dataset contains the DOIs of the corpus used for the natural language processing analysis described in the article of the same title. The DOIs all point to articles published in the Microscopy and Microanalysis conference proceedings, spanning 2002 through 2019.
Value Set Authority Center
Public Data Portal
The VSAC is a repository and authoring tool for public value sets created by external programs. Value sets are lists of codes and corresponding terms, from NLM-hosted standard clinical vocabularies (such as SNOMED CT®, RxNorm, LOINC® and others), that define clinical concepts to support effective and interoperable health information exchange. The VSAC does not create value set content. The VSAC also provides downloadable access to all official versions of value sets specified by the Centers for Medicare & Medicaid Services (CMS) electronic Clinical Quality Measures (eCQMs). For information on CMS eCQMs, visit the eCQI Resource Center. The VSAC is provided by the National Library of Medicine (NLM), in collaboration with the Office of the National Coordinator for Health Information Technology (ONC) and CMS.
Clinical Investigator Inspection List (CLIIL)
Public Data Portal
The Clinical Investigator Inspection List (CLIIL) contains names, addresses, and other pertinent information gathered from inspections of clinical investigators who have performed studies with investigational new drugs. The list contains information on inspections that have been closed since July 1977.