Computer Vision Client

Updeveloper-tools by Microsoft

Bing Computer Vision API analyzes images — OCR, object detection, face detection, scene captioning, content moderation.

Visit site ↗Health checked 14h ago

Use it when

Read API's OCR quality is industry-leading for both English and Chinese

Watch for

Face identify (identification) service was restricted for new customers in 2022 due to policy

First check

Create a Computer Vision resource in Azure for endpoint + key. POST /vision/v3.2/read/analyze to submit an OCR job; GET to fetch results.

Auth

—

CORS

HTTPS

Yes

Signup

Latency

202 ms

Protocol

REST

Pricing

—

Uptime · 30-day window

Probes: 30Uptime: 90%Avg latency: 1067ms

About this API

Computer Vision is Azure Cognitive Services' vision intelligence API, bundling multiple capabilities: Read API (OCR — strong on both handwritten and printed text), Analyze Image (scene captioning, tag generation, brand detection), Detect Objects (bounding boxes), Generate Thumbnail (smart crop), Content Moderation (NSFW, violence detection). Each capability is independent — no model training needed. Compared to Google Cloud Vision and AWS Rekognition, Microsoft has traditional strengths in OCR (especially Chinese handwriting) and scene captioning. Note: Face Identify (identifying specific people in photos) has been restricted to limited domains for new customers since 2022.

What you can build

1Bulk OCR text from images/PDFs
2Auto-moderate UGC image uploads
3Auto-tag e-commerce product images
4Generate alt-text descriptions for visually impaired users

Strengths & limitations

Strengths

Read API's OCR quality is industry-leading for both English and Chinese
Multiple capabilities (detection, captioning, moderation) in one API
Batch async jobs supported

Limitations

Face identify (identification) service was restricted for new customers in 2022 due to policy
Not suitable for diagnostic medical imaging

Getting started

Create a Computer Vision resource in Azure for endpoint + key. POST /vision/v3.2/read/analyze to submit an OCR job; GET to fetch results.

FAQ

Does OCR support Chinese?+

Yes. Read API v3.2+ has high-quality recognition for Simplified/Traditional Chinese including handwriting.

Can it identify specific people?+

Face API detect/group is available; identify (matching to known people) has been restricted to approved use cases since 2022.

What's the max input size?+

Read API: 500MB / 2000 pages (PDF) per file. Sync analysis: 4MB max.

Technical details

CORS: NoHTTPS: YesSignup: ?Open source: No

Auth type: unknown
Pricing: unknown
Protocols: REST
SDKs: python, javascript, csharp, java, go
Response time: 202 ms
Last health check: 6/26/2026, 6:23:46 AM

More from Microsoft

View all from Microsoft →

AutoSuggest Client

Bing Autosuggest API returns search suggestions from a partial query — for search-box type-ahead completion.

Custom Image Search Client

Bing Custom Image Search restricts image search to a custom set of domains — the image counterpart of Bing Custom Search.

Custom Search Client

Bing Custom Search lets you define a custom domain subset and run Bing search against it — for vertical or enterprise search.

Custom Vision Prediction Client

Custom Vision Prediction API runs inference using your trained Custom Vision model — upload an image, get classification labels and confidence scores.

Custom Vision Training Client

Custom Vision Training API programmatically trains image-classification / object-detection models — create projects, upload labeled images, trigger training, publish iterations.

Entity Search Client

Bing Entity Search recognizes queries as named entities (people, places, organizations) and returns structured card info (description, image, related links).

Image Search Client

Bing Image Search runs image search over Bing's web-wide image index — by keyword, similar image, aspect ratio, or size filter.

Local Search Client

Bing Local Business Search finds local businesses (restaurants, hotels, hospitals) by geographic query — returns business info, address, ratings.