Deepgram

8 tools available

Deepgram provides AI-powered speech recognition and understanding services, offering APIs for real-time and pre-recorded audio transcription, text-to-speech, and audio intelligence.

artificial intelligencetranscription

Tools & actions 8

Get Model by ID

Retrieve metadata for a specific Deepgram model by its UUID. Returns detailed model information including name, architecture, supported languages, version, and capabilities. Works for both STT (speech-to-text) and TTS (text-to-speech) models.

Get Public Models

Retrieve metadata on all public Deepgram models (speech-to-text and text-to-speech). Returns comprehensive model information including supported languages, architectures, versions, and capabilities. Set include_outdated to True to include deprecated versions.

List Deepgram Projects

Tool to list all Deepgram projects. Use after authenticating with your API key.

Get Project Usage Summary

Retrieves aggregated usage statistics for a Deepgram project including total audio duration, billable duration, number of requests, channels processed, and confidence/relevance scores. Returns both overall totals and breakdowns by model/accessor/tag. Use this to analyze API consumption, track costs, or monitor transcription quality metrics over time.

List Project Scopes

Tool to list all scopes for a specified Deepgram project. Use when you need to retrieve all permission scopes for a project.

List Think Models

Tool to list available think models for AI agent processing and voice agent configuration. Use when you need to see which think models are available for voice agents.

Transcribe Pre-recorded Audio

Tool to transcribe pre-recorded audio files into text. Use when converting a publicly accessible audio file URL to text. Primary transcript is at `results.channels[0].alternatives[0].transcript` in the response. Silent audio returns a valid empty transcript, not an error. Verify supported models and language codes via `DEEPGRAM_GET_MODELS` when uncertain.

Text-to-Speech (REST)

Tool to convert text into natural-sounding speech. Use when you need TTS audio from text inputs.

Ready to automate with Deepgram?

Wire it up in minutes. No coding required.

← All integrations