Llama 3.1 405B Instruct
Replicate • text • code
Provider ID
meta/llama-3.1-405b-instructQuick Summary
Best For:
High-volume, low-latency tasks where cost efficiency is paramount
Pricing:
$0.59/1M input tokens, $0.79/1M output tokens
Context Window:
128,000 tokens
Key Differentiator:
Cost-optimized for high-volume usage
Specifications
Context Window
128,000 tokens
Max Output Tokens
8,192 tokens
Streaming
Yes
JSON Mode
Yes
Vision
No
Tier
Affordable
Capabilities
text
code
Tags
replicate
llm