Meta: Llama 3.3 8B Instruct (free)

OpenAI • text • function-calling • json-mode

Provider IDmeta-llama/llama-3.3-8b-instruct:free

A lightweight and ultra-fast variant of Llama 3.3 70B, for use when quick response times are needed most.

Quick Summary

High-volume, low-latency tasks where cost efficiency is paramount

$0.00/1M input tokens, $0.00/1M output tokens

128,000 tokens

Balanced performance and cost

Specifications

Context Window

128,000 tokens

Max Output Tokens

4,028 tokens

Streaming

Yes

JSON Mode

Yes

Vision

Tier

Free

Capabilities

text

function-calling

json-mode