Relay.app Docs
  • Getting Started
    • Introduction
    • Helpful Resources
    • FAQ
  • Triggers
    • Triggers 101
    • Webhook Trigger
    • Mailhook Trigger
    • Manual Trigger
    • Scheduled Trigger
    • Batch Triggers
    • RSS Trigger
  • Actions in Apps
    • App Actions 101
  • Creating Templated Documents
  • DATA
    • Step Outputs
    • Find Steps
    • Lists (Arrays)
    • Inspecting Run Data
  • AI
    • AI Steps
    • Human-in-the-Loop AI reviews
    • AI Credits
    • Agentic Tool Use
    • Knowledge
    • Prompt Templates
    • Prompt Tips
    • Audio Transcription
  • Built-in actions
    • Transform Data
    • Create Constants
    • Search Google
    • Scrape Text from Website
    • Custom HTTP Requests
    • Run Custom Code (JS)
  • Flow Control
    • Paths
    • Iterators
    • Wait steps
    • Sequences
  • Human-in-the-Loop
    • Human-in-the-Loop Steps
    • AI output reviews
    • Roles
  • Workflows
    • Folders and Organization
    • Sharing Workflows
    • Headings
    • Notes
  • TEMPLATES
    • About Workflow Templates
    • Using a Template (Importing)
    • Creating a Template (Exporting)
  • Workspace
    • Step & AI credit usage
    • Billing and Plans
    • Workspace administration
    • (Sharing) App Accounts
  • App-Specific FAQs
    • Airtable
    • Attio
    • Cal.com
    • Coda
    • DeepSeek
    • Discord
    • Fireflies
    • Google AI Studio (Gemini)
    • Gmail
    • Google Docs
    • Google Drive
    • Google Sheets
    • Microsoft Permissions
    • Microsoft Outlook Mail
    • Notion
    • OpenAI
    • OpenPhone
    • Slack
    • X (Twitter)
    • QuickBooks Online
Powered by GitBook
On this page
  • Overview
  • Supported File Types
  • File Size and Duration Guidelines
  • Best Practices
  • Troubleshooting

Was this helpful?

  1. AI

Audio Transcription

Overview

Relay.app leverages OpenAI Whisper to provide a simple interface for transcribing audio files into text. Users can upload audio files, and the system processes them using Whisper's state-of-the-art transcription technology, delivering accurate text results.

Supported File Types

Our transcription steps support the following file types, based on what OpenAI Whisper supports: MP3, WAV, M4A, OGG, MPEG, MP4, MPGA

Note: Whisper's transcription quality may vary based on the clarity and quality of the input audio.


File Size and Duration Guidelines

File Size Limit

  • Maximum size: 25MB per audio file.

Approximate Audio Length

The duration of audio that can be processed depends on the format and compression quality:

  • High-compression formats (e.g., MP3, OGG): Up to 30 minutes of audio.

  • Low-compression formats (e.g., WAV): Shorter durations, typically 10–20 minutes, due to larger file sizes.


Best Practices

  • Ensure the audio is clear and free from excessive background noise for optimal transcription quality.

  • Use shorter clips for faster processing.

  • Prefer MP3 or M4A formats for longer durations within the file size limit.


Troubleshooting

Common Issues:

  • File exceeds 25MB: Compress or trim the audio file using an audio editing tool before uploading.

  • Unsupported format: Convert the file to one of the supported formats using free tools like Audacity or online converters.

Last updated 6 months ago

Was this helpful?

For longer audio transcription needs, we recommend leveraging for transcription, which is offered as a separate integration in Relay.app

Assembly AI