Enables rapid bioinformatics queries and sequence analysis across 20+ genomic databases directly from your terminal or Python scripts.
gget is a command-line and Python toolkit for researchers and computational biologists that streamlines genomic data retrieval and sequence analysis workflows. It provides a unified interface for querying gene metadata from Ensembl and UniProt, running BLAST and BLAT searches, predicting protein structures with AlphaFold, and accessing large-scale expression data. By consolidating separate bioinformatics databases into a single tool, gget removes the need for manual data scraping and custom API clients, which makes it useful for fast data integration within a development environment.
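As a rough illustration of the unified interface described above, the sketch below wraps a gene search in a small helper. It assumes the gget Python package is installed (pip install gget), that gget.search accepts search words and a species, and that its result exposes an "ensembl_id" column. The helper name find_gene_ids and the injectable search_fn parameter are hypothetical, added only so the logic can be exercised without network access.

```python
def find_gene_ids(searchwords, species="homo_sapiens", search_fn=None):
    """Return the Ensembl IDs whose annotations match the search words.

    search_fn is a hypothetical hook for offline testing; when omitted,
    gget.search is used and a network connection is required.
    """
    if search_fn is None:
        import gget  # imported lazily so the helper also works with a stub
        search_fn = gget.search
    hits = search_fn(searchwords, species)
    # Assumption: the result (a pandas DataFrame for gget.search) has an
    # "ensembl_id" column holding one identifier per hit.
    return list(hits["ensembl_id"])
```

With gget installed, a call such as find_gene_ids(["gaba"], "homo_sapiens") would query Ensembl for matching genes; the returned IDs can then be passed to other gget modules such as gget.info or gget.seq.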
Key Features
01 Support for multiple sequence alignment (Muscle5) and local alignment (DIAMOND)
02 Automated protein structure prediction using AlphaFold2 and PDB queries
03 Unified access to 20+ genomic databases including Ensembl, UniProt, and NCBI
04 Reference genome retrieval and gene metadata searching across multiple species
05 High-speed BLAST and BLAT sequence alignment and search capabilities
Use Cases
01 Querying tissue-specific gene expression correlations and single-cell RNA-seq data
02 Performing rapid protein structure predictions and eukaryotic linear motif analysis
03 Retrieving gene sequences and functional annotations for specific species directly in the CLI
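The sequence-retrieval use case can be sketched from Python as well. This example assumes that gget.seq accepts Ensembl IDs plus a translate flag and returns FASTA-style lines, with headers starting with ">" followed by sequence lines. The helper fetch_sequences and its seq_fn parameter are hypothetical names introduced for illustration and offline testing.

```python
def fetch_sequences(ensembl_ids, translate=False, seq_fn=None):
    """Map each FASTA record ID to its sequence string.

    seq_fn is a hypothetical hook for offline testing; when omitted,
    gget.seq is used and a network connection is required.
    """
    if seq_fn is None:
        import gget  # imported lazily so the helper also works with a stub
        seq_fn = gget.seq
    # Assumption: the result is a list of FASTA lines (header, then sequence).
    fasta_lines = seq_fn(ensembl_ids, translate=translate)

    records = {}
    current_id = None
    for line in fasta_lines:
        if line.startswith(">"):
            # The record ID is the first whitespace-delimited header token.
            current_id = line[1:].split()[0]
            records[current_id] = ""
        elif current_id is not None:
            # Join wrapped sequence lines into one string per record.
            records[current_id] += line.strip()
    return records
```

Setting translate=True requests amino acid sequences instead of nucleotide sequences, under the same assumptions about the returned format.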