Audio Transcription

Overview

Relay.app leverages OpenAI Whisper to provide a simple interface for transcribing audio files into text. Users can upload audio files, and the system processes them using Whisper's state-of-the-art transcription technology, delivering accurate text results.

Supported File Types

Our transcription steps support the following file types, based on what OpenAI Whisper supports: MP3, WAV, M4A, OGG, MPEG, MP4, MPGA

Note: Whisper's transcription quality may vary based on the clarity and quality of the input audio.

File Size and Duration Guidelines

File Size Limit

Maximum size: 25MB per audio file.

Approximate Audio Length

The duration of audio that can be processed depends on the format and compression quality:

High-compression formats (e.g., MP3, OGG): Up to 30 minutes of audio.
Low-compression formats (e.g., WAV): Shorter durations, typically 10–20 minutes, due to larger file sizes.

Best Practices

Ensure the audio is clear and free from excessive background noise for optimal transcription quality.
Use shorter clips for faster processing.
Prefer MP3 or M4A formats for longer durations within the file size limit.
For longer audio transcription needs, we recommend leveraging Assembly AI for transcription, which is offered as a separate integration in Relay.app

Troubleshooting

Common Issues:

File exceeds 25MB: Compress or trim the audio file using an audio editing tool before uploading.
Unsupported format: Convert the file to one of the supported formats using free tools like Audacity or online converters.

Last updated 1 year ago

Was this helpful?

hashtagOverview

hashtagSupported File Types

hashtagFile Size and Duration Guidelines

hashtagFile Size Limit

hashtagApproximate Audio Length

hashtagBest Practices

hashtagTroubleshooting

hashtagCommon Issues: