Claude Haiku 3.5 – fastest Claude-3.5 model.
Type | Price | Unit | Context Type | Notes |
---|---|---|---|---|
Input | $0.80 | per million tokens | Standard | |
Output | $4.00 | per million tokens | Standard | |
Cache Write (5 min) | $1.00 | per million tokens | Standard | Cache storage for 5 minutes |
Cache Write (1 hour) | $1.60 | per million tokens | Standard | Cache storage for 1 hour |
Cache Hit | $0.08 | per million tokens | Standard | Accessing cached content |
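As a rough illustration of how these per-million-token prices combine, the sketch below estimates the cost of a single workload. The prices are copied from the table above; the function name `estimate_cost`, the dictionary keys, and the token counts are hypothetical and chosen only for the example.

```python
# Prices in USD per million tokens, copied from the pricing table above.
PRICE_PER_MTOK = {
    "input": 0.80,
    "output": 4.00,
    "cache_write_5m": 1.00,
    "cache_write_1h": 1.60,
    "cache_hit": 0.08,
}


def estimate_cost(usage: dict[str, int]) -> float:
    """Return the USD cost for token counts keyed by price type."""
    return sum(
        PRICE_PER_MTOK[kind] * tokens / 1_000_000
        for kind, tokens in usage.items()
    )


# Hypothetical workload: 200k uncached input tokens, 1M tokens read
# from cache, and 50k output tokens.
cost = estimate_cost({"input": 200_000, "cache_hit": 1_000_000, "output": 50_000})
print(f"${cost:.4f}")  # prints $0.4400
```

In this made-up workload the cached reads cost $0.08 where the same tokens billed as regular input would cost $0.80, which is the main reason to route repeated prefixes through the cache.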
Type | Price | Unit | Context Type | Savings |
---|---|---|---|---|
Input | $0.40 | per million tokens | Standard | |
Output | $2.00 | per million tokens | Standard | |
Tier | RPM | TPM | IPM |
---|---|---|---|
tier_1 | 50 | 50,000 | 10,000 |
tier_2 | 1,000 | 100,000 | 20,000 |
tier_3 | 2,000 | 200,000 | 40,000 |
tier_4 | 4,000 | 400,000 | 80,000 |
Batch processing allows for delayed, cost-efficient bulk operations.
Tier | RPM | TPM | Additional Limits |
---|---|---|---|
tier_1 | 50 | - | Queue: 100,000 requests; Per Batch: 100,000 requests |
tier_2 | 1,000 | - | Queue: |
tier_3 | 2,000 | - | Queue: 300,000 requests; Per Batch: 100,000 requests |
tier_4 | 4,000 | - | Queue: 500,000 requests; Per Batch: 100,000 requests |
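Because each batch is capped at 100,000 requests in the rows that list a per-batch limit, a larger job has to be split before submission. The helper below is a minimal sketch of that chunking step. `split_into_batches` is a hypothetical name, the requests are stand-in integers, and the queue limits are not enforced here.

```python
from typing import Iterator, Sequence

# Per-batch request cap listed for tier_1, tier_3 and tier_4 in the table above.
PER_BATCH_LIMIT = 100_000


def split_into_batches(
    requests: Sequence, limit: int = PER_BATCH_LIMIT
) -> Iterator[Sequence]:
    """Yield consecutive slices of at most `limit` requests each."""
    for start in range(0, len(requests), limit):
        yield requests[start:start + limit]


# Stand-in job of 250,000 placeholder requests.
batches = list(split_into_batches(list(range(250_000))))
print([len(b) for b in batches])  # prints [100000, 100000, 50000]
```

The total across pending batches would still need to be checked against the tier's queue limit before submitting them all at once.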