Meta: Llama 3.3 8B Instruct (free)
OpenAI • text • function-calling • json-mode
Provider ID
meta-llama/llama-3.3-8b-instruct:freeA lightweight and ultra-fast variant of Llama 3.3 70B, for use when quick response times are needed most.
Quick Summary
Best For:
High-volume, low-latency tasks where cost efficiency is paramount
Pricing:
$0.00/1M input tokens, $0.00/1M output tokens
Context Window:
128,000 tokens
Key Differentiator:
Balanced performance and cost
Specifications
Context Window
128,000 tokens
Max Output Tokens
4,028 tokens
Streaming
Yes
JSON Mode
Yes
Vision
No
Tier
Free
Capabilities
text
function-calling
json-mode
Tags
free
function-calling