# Google Cloud Vision AI integration on Definable

> Google Cloud Vision API enables developers to integrate vision detection features into applications, including image labeling, face and landmark detection, optical character recognition (OCR), and explicit content tagging.

## What this connects

Google Cloud Vision API enables developers to integrate vision detection features into applications, including image labeling, face and landmark detection, optical character recognition (OCR), and explicit content tagging.

Vendor: https://cloud.google.com/vision

## Tools available

**29** tools available. First 12:

- `GOOGLE_CLOUD_VISION_ANNOTATE_FILES` — Annotate Files with Vision API — Tool to perform image detection and annotation for batch files in Google Cloud Vision. Supports PDF, TIFF, and GIF files. Extracts up to 5 frames (GIF) or pages (PDF/TIFF) from each file and performs detection for each image. Use when you need to analyze documents or multi-page images with features like text detection, label detection, face detection, or other Vision API capabilities.
- `GOOGLE_CLOUD_VISION_ANNOTATE_FILES_ASYNC_BATCH` — Async Batch Annotate Files — Tool to run asynchronous image detection and annotation for a list of generic files (PDF, TIFF, GIF). Use when processing multi-page documents that may contain multiple images per page. Results are written to Google Cloud Storage and progress can be tracked via the returned operation name using VisionGetOperation.
- `GOOGLE_CLOUD_VISION_ANNOTATE_IMAGES` — Annotate Images — Run image detection and annotation for a batch of images using Google Cloud Vision API. Performs various types of image analysis including face detection, landmark detection, logo detection, label detection, text detection (OCR), safe search detection, image properties, crop hints, web detection, product search, and object localization. Supports up to 16 images in a single batch request. Each image can have multiple feature types analyzed simultaneously.
- `GOOGLE_CLOUD_VISION_ANNOTATE_IMAGES_ASYNC_BATCH` — Annotate Images Async Batch — Tool to run asynchronous image detection and annotation for a batch of images. Use when processing multiple images or large images that require longer processing time. Results are written to Google Cloud Storage as JSON files.
- `GOOGLE_CLOUD_VISION_ANNOTATE_LOCATION_IMAGES` — Annotate Location Images — Tool to run image detection and annotation for a batch of images scoped to a specific project and location. Performs various types of image analysis including label detection, face detection, landmark detection, logo detection, OCR text detection, safe search detection, image properties, crop hints, web detection, product search, and object localization. Supports processing up to 16 images per request with regional endpoint routing (us, asia, eu). Use this when you need to analyze images with location-specific processing for content extraction, text recognition, object detection, face identification, or landmark/logo recognition.
- `GOOGLE_CLOUD_VISION_CREATE_PRODUCT` — Create Vision Product — Creates a new Product resource in Google Cloud Vision Product Search. A Product represents a physical item that can be visually searched using reference images. After creating a product, you can add reference images to it and include it in product sets for visual search capabilities. Prerequisites: - Vision API must be enabled in your Google Cloud project - Product Search must be enabled - Valid project ID and location (us-west1, us-east1, europe-west1, or asia-east1) Use this action to: - Register a new product for visual search - Set up product metadata (display name, description, category, labels) - Obtain a product resource name for adding reference images
- `GOOGLE_CLOUD_VISION_CREATE_PRODUCT_SET` — Create Product Set — Creates a new ProductSet resource in Google Cloud Vision Product Search. A ProductSet is a container for grouping related products together for visual search. After creating a product set, you can add products to it using the AddProductToProductSet action. Prerequisites: - Vision API must be enabled in your Google Cloud project - Product Search must be enabled - Valid project ID and location (us-west1, us-east1, europe-west1, or asia-east1) Use this action to: - Create a new product set container - Organize products into logical groups - Obtain a product set resource name for adding products
- `GOOGLE_CLOUD_VISION_CREATE_REFERENCE_IMAGE` — Create ReferenceImage — Tool to create a ReferenceImage under a product. Use when adding a new image to a product for detection.
- `GOOGLE_CLOUD_VISION_DELETE_PRODUCT` — Delete Product — Permanently deletes a Product and its associated reference images from Google Cloud Vision API. This is a destructive operation that cannot be undone. The product metadata and all images are deleted immediately, though search queries against ProductSets may temporarily return cached results until caches refresh. Use this tool when you need to remove a product that is no longer needed. Ensure you have the correct product resource name before deletion.
- `GOOGLE_CLOUD_VISION_GET_PRODUCT` — Get Product — Tool to get information associated with a Product. Use when you have the product resource name and need its details.
- `GOOGLE_CLOUD_VISION_GET_PRODUCT_SET` — Get Product Set — Tool to get a ProductSet. Use when you need metadata details of an existing ProductSet by its full resource name. Use after obtaining the resource name.
- `GOOGLE_CLOUD_VISION_IMPORT_PRODUCT_SETS` — Import Product Sets — Asynchronously imports product sets and reference images from a CSV file stored in Google Cloud Storage. This bulk import operation creates ProductSets, Products, and ReferenceImages from a properly formatted CSV file. Returns a long-running Operation that can be polled for completion status. The import automatically creates new ProductSets and Products if they don't exist based on the IDs in the CSV. Use this when you need to: - Bulk import multiple products and images at once - Create product catalogs from existing CSV data - Set up initial product search datasets Note: Requires OAuth authentication with cloud-platform or cloud-vision scope, and read access to the GCS bucket.

## Auth

Auth schemes: `API_KEY`.

## How agents use Google Cloud Vision

Inside a Definable workflow, Google Cloud Vision is one of the tools the **Distributor specialist** can call. Example coordination patterns:

- **Researcher → Google Cloud Vision** — the Researcher (GPT-5.5) pulls context from Google Cloud Vision (records, threads, documents), synthesises findings, and briefs the rest of the team.
- **Writer → Distributor → Google Cloud Vision** — the Writer (Claude Opus 4.7) drafts copy in brand voice, the Verifier passes it, then the Distributor writes the result into Google Cloud Vision (create record, post message, draft email).
- **Designer / Engineer → Distributor → Google Cloud Vision** — the Designer ships an asset or the Engineer ships a code change, the Distributor delivers it via Google Cloud Vision (attach file, open PR comment, post status).

The Verifier checks every Google Cloud Vision call. On rate limit, schema drift, or auth refresh it self-heals and retries — the workflow completes without manual intervention.

## Categories

- artificial intelligence — https://definable.ai/apps/category/artificial-intelligence/

## Related

- HTML page: https://definable.ai/apps/google_cloud_vision/
- Same category (artificial intelligence): https://definable.ai/apps/category/artificial-intelligence/
- All integrations: https://definable.ai/apps/
- Workflow (multi-agent loop): https://definable.ai/workflow/
- Apps llms.txt index: https://definable.ai/llms-apps.txt
