Multi-lingual preprocessing with spaCy NER (Japanese, English, Chinese)
Automatic key discovery from unstructured text
Robust processing of noisy and complex inputs
Type-safe output via Pydantic validation
Multiple output formats: JSON, YAML, and TOML
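
As a rough illustration of the Pydantic validation and JSON output items in the list above, here is a minimal sketch assuming Pydantic v2. The `ExtractedRecord` schema, its field names, and the `validate_record` helper are hypothetical and are not taken from the project. They only show how loosely typed extracted values might be checked before serialization.

```python
import json
from typing import Optional

from pydantic import BaseModel, ValidationError


class ExtractedRecord(BaseModel):
    # Hypothetical schema: the project's real field names are not given here.
    name: str
    language: str
    age: Optional[int] = None


def validate_record(raw: dict) -> Optional[ExtractedRecord]:
    """Return a validated record, or None if the raw data does not fit the schema."""
    try:
        return ExtractedRecord(**raw)
    except ValidationError:
        return None


# A well-formed record: the numeric string "42" is coerced to the int 42.
good = validate_record({"name": "Tanaka", "language": "ja", "age": "42"})

# A malformed record: "language" is missing and "age" is not a number.
bad = validate_record({"name": "Tanaka", "age": "not a number"})

if good is not None:
    # model_dump() returns a plain dict, which json.dumps can serialize.
    print(json.dumps(good.model_dump(), ensure_ascii=False))
```

The same dictionary returned by `model_dump()` could be handed to a YAML or TOML writer instead of `json.dumps`. Those writers need third-party packages, so they are left out of this sketch.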