01Multilingual support for 99 languages with automatic language identification.
02Flexible model scaling from 39M to 1550M parameters to balance speed and accuracy.
03Robust performance against background noise and various accents through large-scale training.
04English translation task for converting foreign speech directly into English text.
05Generates detailed timestamps and word-level alignment for subtitle creation.
063,983 GitHub stars