CloudPriceCheck

Azure OpenAI Service Pricing (2026)

Updated Feb 20, 2026AzureAI / ML

Compare Azure OpenAI Service prices across all instance types and regions. Starting from $0.0000/hr ($0.01/mo) for Embeddings 3 Small (per 1K tokens).

On-Demand Pricing

InstancevCPUMemoryLinux/hrWindows/hrLinux/mo
GPT-4o-mini (Input, per 1K tokens)01 GB$0.0001-$0.11
GPT-4o-mini (Output, per 1K tokens)01 GB$0.0006-$0.44
GPT-4o (Input, per 1K tokens)010 GB$0.0025-$1.82
GPT-4o (Output, per 1K tokens)010 GB$0.0100-$7.30
GPT-4 Turbo (Input, per 1K tokens)0100 GB$0.0100-$7.30
GPT-4 Turbo (Output, per 1K tokens)0100 GB$0.0300-$21.90
Embeddings 3 Small (per 1K tokens)01 GB$0.0000-$0.01
DALL-E 3 (per image)010 GB$0.0400-$29.20

Pricing by Instance Family

Text Generation

6 instance types available

InstancevCPUMemoryLinux/hrWindows/hrLinux/mo
GPT-4o-mini (Input, per 1K tokens)01 GB$0.0001-$0.11
GPT-4o-mini (Output, per 1K tokens)01 GB$0.0006-$0.44
GPT-4o (Input, per 1K tokens)010 GB$0.0025-$1.82
GPT-4o (Output, per 1K tokens)010 GB$0.0100-$7.30
GPT-4 Turbo (Input, per 1K tokens)0100 GB$0.0100-$7.30
GPT-4 Turbo (Output, per 1K tokens)0100 GB$0.0300-$21.90

Embeddings

1 instance type available

InstancevCPUMemoryLinux/hrWindows/hrLinux/mo
Embeddings 3 Small (per 1K tokens)01 GB$0.0000-$0.01

Image Generation

1 instance type available

InstancevCPUMemoryLinux/hrWindows/hrLinux/mo
DALL-E 3 (per image)010 GB$0.0400-$29.20

Reserved Instance & Savings Plans Pricing

InstancevCPUMemoryLinux/hrWindows/hrLinux/mo1yr RI/hr3yr RI/hr
GPT-4o-mini (Input, per 1K tokens)01 GB$0.0001-$0.11--
GPT-4o-mini (Output, per 1K tokens)01 GB$0.0006-$0.44--
GPT-4o (Input, per 1K tokens)010 GB$0.0025-$1.82--
GPT-4o (Output, per 1K tokens)010 GB$0.0100-$7.30--
GPT-4 Turbo (Input, per 1K tokens)0100 GB$0.0100-$7.30--
GPT-4 Turbo (Output, per 1K tokens)0100 GB$0.0300-$21.90--
Embeddings 3 Small (per 1K tokens)01 GB$0.0000-$0.01--
DALL-E 3 (per image)010 GB$0.0400-$29.20--

How OpenAI Service Pricing Works

On-Demand

Pay per hour with no long-term commitment. Ideal for variable workloads and development environments.

Reserved / Committed Use

Commit to 1 or 3 years for significant discounts.

Spot / Preemptible

Use spare capacity at steep discounts. Best for fault-tolerant, batch, and stateless workloads.

Monthly Cost Examples

Small Workload
Embeddings 3 Small (per 1K tokens)
0 vCPU, 1 GB RAM
$0.01/mo
Medium Workload
GPT-4 Turbo (Input, per 1K tokens)
0 vCPU, 100 GB RAM
$7.30/mo
Large Workload
DALL-E 3 (per image)
0 vCPU, 10 GB RAM
$29.20/mo

Compare with Other Providers

Related Azure Services