AI Transcription
Overview
Relay.app leverages OpenAI Whisper to provide a simple interface for transcribing audio files into text. Users can upload audio files, and the system processes them using Whisper's state-of-the-art transcription technology, delivering accurate text results.
Supported File Types
Our transcription steps support the following file types, based on what OpenAI Whisper supports: MP3, WAV, M4A, OGG, MPEG, MP4, MPGA
Note: Whisper's transcription quality may vary based on the clarity and quality of the input audio.
File Size and Duration Guidelines
File Size Limit
Maximum size: 25MB per audio file.
Approximate Audio Length
The duration of audio that can be processed depends on the format and compression quality:
High-compression formats (e.g., MP3, OGG): Up to 30 minutes of audio.
Low-compression formats (e.g., WAV): Shorter durations, typically 10–20 minutes, due to larger file sizes.
Best Practices
Ensure the audio is clear and free from excessive background noise for optimal transcription quality.
Use shorter clips for faster processing.
Prefer MP3 or M4A formats for longer durations within the file size limit.
For longer audio transcription needs, we recommend leveraging Assembly AI for transcription, which is offered as a separate integration in Relay.app
Troubleshooting
Common Issues:
File exceeds 25MB: Compress or trim the audio file using an audio editing tool before uploading.
Unsupported format: Convert the file to one of the supported formats using free tools like Audacity or online converters.
Last updated