교육데이터 활용•지원 서비스

로그인

데이터셋 상세

미국

Vector Alignment Search Tool (VAST)

A computer algorithm that identifies similar protein 3-dimensional structures. Structure neighbors for every structure in MMDB are pre-computed and accessible via links on the MMDB Structure Summary pages.

데이터 정보

데이터 포털
미국
META URL
https://catalog.data.gov/dataset/vector-alignment-search-tool-vast
라이선스
other-license-specified
비용
제공기관
U.S. Department of Health & Human Services
관리부서
데이터
- Vector Alignment Search Tool (VAST)
- 랜딩 페이지

연관 데이터

Structure - Molecular Modeling Database (MMDB)

공공데이터포털

Three dimensional structures provide a wealth of information on the biological function and the evolutionary history of macromolecules. They can be used to examine sequence-structure-function relationships, interactions, active sites, and more.

Multiple Sequence Alignment (MSA) Viewer

공공데이터포털

An interactive Web application that enables users to visualize multiple alignments created by database search results or other software applications. The MSA Viewer allows users to upload an alignment and set a master sequence and to explore the data using features such as zooming and changing of coloration.

공공데이터포털

The Protein database is a collection of sequences from several sources, including translations from annotated coding regions in GenBank, RefSeq and TPA, as well as records from SwissProt, PIR, PRF, and PDB. Protein sequences are the fundamental determinants of biological structure and function.

공공데이터포털

CDTree is a stand-alone application for classifying protein sequences and investigating their evolutionary relationships. CDTree can import, analyze and update existing Conserved Domain (CDD) records and hierarchies, and also allows users to create their own.

Demonstration of the Sequence Alignment to Predict Across Species Susceptibility Tool for Rapid Assessment of Protein Conservation

공공데이터포털

Data file for "Vliet SMF, Hazemi M, Blatz D, Jensen M, Mayasich S, Transue TR, Simmons C, Wilkinson A, LaLone CA. Demonstration of the Sequence Alignment to Predict Across Species Susceptibility Tool for Rapid Assessment of Protein Conservation. J Vis Exp. 2023 Feb 10;(192). doi: 10.3791/63970. PMID: 36847398.". This dataset is associated with the following publication: Vliet, S., M. Hazemi, D. Blatz, M. Jensen, S. Mayasich, T. Transue, C. Simmons, A. Wilkinson, and C. Lalone. Demonstration of the Sequence Alignment to Predict Across Species Susceptibility Tool for Rapid Assessment of Protein Conservation. Journal of Visualized Experiments. JoVE, Somerville, MA, USA, 192, (2023).

공공데이터포털

Compute cDNA-to-Genomic sequence alignments.

Open Reading Frame Finder (ORF Finder)

공공데이터포털

A graphical analysis tool that finds all open reading frames in a user's sequence or in a sequence already in the database.

Highly Scalable Matching Pursuit Signal Decomposition Algorithm

공공데이터포털

In this research, we propose a variant of the classical Matching Pursuit Decomposition (MPD) algorithm with significantly improved scalability and computational performance. MPD is a powerful iterative algorithm that decomposes a signal into linear combinations of its dictionary elements or “atoms”. A best fit atom from an arbitrarily defined dictionary is determined through cross-correlation. The selected atom is subtracted from the signal and this procedure is repeated on the residual in the subsequent iterations until a stopping criteria is met. A sufficiently large dictionary is required for an accurate reconstruction; this in return increases the computational burden of the algorithm, thus limiting its applicability and level of adoption. Our main contribution lies in improving the computational efficiency of the algorithm to allow faster decomposition while maintaining a similar level of accuracy. The Correlation Thresholding and Multiple Atom Extractions techniques were proposed to decrease the computational burden of the algorithm. Correlation thresholds prune insignificant atoms from the dictionary. The ability to extract multiple atoms within a single iteration enhanced the effectiveness and efficiency of each iteration. The proposed algorithm, entitled MPD++, was demonstrated using real world data set.

목록