Introduction
Geniml is a toolkit for converting genomic interval data stored in BED files into machine-learning-ready representations. It supports three main workflows: unsupervised region embeddings with Region2Vec, joint analysis of regions and their metadata with BEDspace, and single-cell chromatin accessibility analysis with scEmbed. By standardizing how genomic "universes" (reference region sets) are built and how regions are tokenized against them, Geniml lets researchers run similarity searches, clustering, and other downstream analyses on large-scale chromatin accessibility datasets on a consistent, statistically grounded footing.
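To make the idea of tokenizing against a universe concrete, here is a minimal sketch in plain Python. It does not use Geniml's API: the helper names `parse_bed` and `tokenize` and the toy coordinates are hypothetical, chosen only for illustration, and the simple overlap rule is an assumption about how a region could be mapped to universe entries.

```python
# Illustrative only: plain-Python stand-in, not Geniml's actual API.

def parse_bed(text):
    """Parse BED-formatted text into (chrom, start, end) tuples."""
    regions = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and common BED header/comment lines.
        if not line or line.startswith(("#", "track", "browser")):
            continue
        chrom, start, end = line.split()[:3]
        regions.append((chrom, int(start), int(end)))
    return regions


def tokenize(regions, universe):
    """Map each region to the indices of the universe regions it overlaps."""
    tokens = []
    for chrom, start, end in regions:
        for idx, (u_chrom, u_start, u_end) in enumerate(universe):
            # BED intervals are half-open: [start, end).
            if chrom == u_chrom and start < u_end and u_start < end:
                tokens.append(idx)
    return tokens


# Hypothetical toy data: a three-region universe and a two-region sample.
UNIVERSE_BED = "chr1\t100\t200\nchr1\t300\t400\nchr2\t50\t150\n"
SAMPLE_BED = "chr1\t150\t320\nchr2\t200\t300\n"

universe = parse_bed(UNIVERSE_BED)
tokens = tokenize(parse_bed(SAMPLE_BED), universe)
print(tokens)
```

Each token is an index into the shared universe, so two samples tokenized against the same universe use the same vocabulary. The linear scan is kept for readability; a real pipeline would typically use an interval index for large universes.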