Door background frame
Llama 4 Maverick
50k+ Runs
Llama 4-Maverick visualization
Llama 4 Maverick
50k+ Runs
17B active parameter multimodal Mixture-of-Experts (MoE) model with 128 experts, tuned for complex business tasks requiring speed and logic.

Llama API Usage

POST /v1/chat/completions
import requests
import json

url = "https://api.akashml.com/v1/chat/completions"

payload = {
    "model": "llama-4-maverick",
    "messages": [
        {
            "role": "user",
            "content": "Hello, how are you?"
        }
    ],
    "max_tokens": 150,
    "temperature": 0.7
}

headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer YOUR_API_KEY"
}

response = requests.post(url, json=payload, headers=headers)
print(response.json())
Pricing
Price (per 1M Tokens)
Model
Input
Output
Llama 4 Maverick
NA
NA
Model Details
Model Details
Provider
Meta
Type
Chat
Parameters
685B
Context Length
128k
Don't see the model you need?
Let us know, and we'll add it for you.
AkashML
X (Twitter)Discord
AI Inference Service
Akash Network
Built on Akash Network
Copyright 2025 © akashml.com