Applies unsupervised machine learning models to genomic interval data for embedding, clustering, and bioinformatics analysis.
Geniml is a specialized skill for analyzing and modeling genomic interval data (BED files) using machine learning. It enables Claude to implement advanced workflows including Region2Vec for region embeddings, scEmbed for single-cell ATAC-seq analysis, and BEDspace for joint region-metadata embeddings. This skill is essential for bioinformaticians building searchable genomic databases, standardizing consensus peak universes, or performing large-scale genomic feature learning within their development environment.
主な機能
01Automate BED file tokenization and embedding evaluation metrics
02Create joint embeddings for regions and metadata via BEDspace
03Analyze single-cell ATAC-seq data with scEmbed and scanpy integration