SpeechBrain is an open-source toolkit for speech and audio processing. It supports speech recognition, speech enhancement, audio separation, text-to-speech, speaker recognition, speech-to-speech translation, and spoken language understanding.
The toolkit also covers vocoding, audio augmentation, feature extraction, sound event detection, beamforming, and other multi-microphone signal processing.
SpeechBrain also offers tools for training Language Models, ranging from basic n-gram LMs to modern Large Language Models, seamlessly integrated into speech processing workflows.
This toolkit facilitates research and development in Conversational AI, providing pre-built configurations for popular datasets, comprehensive documentation, tutorials, and easy-to-use interfaces for pre-trained models.
It is designed to be adaptable, flexible, and transparent, and to be easy to install, use, and customize.
Pros:
- Open-source toolkit
- Cutting-edge technologies
- Supports speech recognition

Cons:
- No offline capability
- No support across multiple platforms
- No version control