Create Audio Transcription
Tool to transcribe audio into text in the same language as the audio. Use when you need to convert speech to text while preserving the original language. Supports multiple formats including mp3, mp4, wav, and webm.
7 tools available
GroqCloud provides high-performance AI inference services, enabling developers to deploy and manage AI models efficiently.
Tool to transcribe audio into text in the same language as the audio. Use when you need to convert speech to text while preserving the original language. Supports multiple formats including mp3, mp4, wav, and webm.
Tool to create a model response for the given input. Beta endpoint with simplified interface compared to chat completions. Use when you need a streamlined API for generating model responses.
Tool to translate an audio file into English text. Use when you have a non-English recording and need an accurate English transcript. Use after confirming the file path.
Tool to generate a chat-based completion for a conversation. Use when you have a list of prior messages and need the model's next reply. Response completion text is at choices[0].message.content in the returned envelope.
Tool to retrieve detailed information about a specific model. Use after listing models when you need metadata for a chosen model. Returned metadata may change as models update; do not cache.
Tool to list all available models and their metadata. Always call this to retrieve current model IDs rather than using hard-coded or cached identifiers, as deprecated names cause failures in GROQCLOUD_GROQ_RETRIEVE_MODEL and GROQCLOUD_GROQ_CREATE_CHAT_COMPLETION. Returns availability and metadata only — excludes usage stats, latency metrics, and pricing. Response may include many models; filter client-side by provider, family, modality, or context length. Frequent polling combined with high-volume requests risks HTTP 429 rate_limit_exceeded; use backoff and minimize call frequency.
Tool to retrieve available TTS voices for Groq PlayAI models. Use when you need to discover voice options before calling text-to-speech. Note: static list maintained manually; no live endpoint exists.
Wire it up in minutes. No coding required.