image

Image Generation

OCR Text Extraction

OCR Text Extraction is a image generation capability available through Google Cloud Vision on Aweb. Extract text from images with full document and handwriting support. Access it through a single unified API with automatic failover and intelligent routing.

Try OCR Text ExtractionAPI docs

Best for

Highest quality

Google Cloud Vision

Premium tier

Contract

Max Latency5000ms

Providers (1)

ProviderScoreQualityPricing
Google Cloud VisionDEFAULT
92premiumstandard

Quick start

Call OCR Text Extraction through Alfred — automatic provider selection, failover, and load balancing included.

cURL

curl -X POST https://api.alfred-ai.app/v1/execute \
  -H "Authorization: Bearer $ALFRED_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "capability": "image.ocr",
    "input": { "prompt": "Hello world" }
  }'

TypeScript

import { Alfred } from '@alfred/core';

const alfred = new Alfred({ apiKey: process.env.ALFRED_API_KEY });

// Alfred automatically selects the best provider
const result = await alfred.execute({
  capability: 'image.ocr',
  input: { prompt: 'Hello world' },
});

console.log(result.output);

Orchestration pipeline

import { Alfred } from '@alfred/core';

const alfred = new Alfred({ apiKey: process.env.ALFRED_API_KEY });

// Multi-step pipeline with automatic failover
const result = await alfred.orchestrate({
  steps: [
    { id: 'step1', capability: 'image.ocr', input: { prompt: 'Hello world' } },
    { id: 'step2', capability: 'llm.chat', dependsOn: ['step1'],
      input: { prompt: 'Summarize: $step1.output' } },
  ],
});

Related Image Generation capabilities

Image Generation

image

Image Editing

image

Image Upscaling

image

Computer Vision

image

3D Model Generation

image

Pro Image Generation

image

Getting started →API reference →All providers →All capabilities →