Llama 4 Maverick

50k+ Runs

A multimodal Mixture-of-Experts (MoE) model with 17B active parameters and 128 experts, tuned for complex business tasks that require speed and logical reasoning.

Llama API Usage

POST /v1/chat/completions

import requests

url = "https://api.akashml.com/v1/chat/completions"

payload = {
    "model": "llama-4-maverick",
    "messages": [
        {
            "role": "user",
            "content": "Hello, how are you?"
        }
    ],
    "max_tokens": 150,
    "temperature": 0.7
}

headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer YOUR_API_KEY"
}

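# Send the request; the timeout avoids hanging on a stalled connection,
# and raise_for_status() surfaces HTTP errors (e.g. 401 for a bad key).
response = requests.post(url, json=payload, headers=headers, timeout=30)
response.raise_for_status()
print(response.json())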
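
The request above prints the whole JSON body. As a minimal sketch of how the reply text might be pulled out, the helpers below assume the endpoint returns an OpenAI-style chat completion body with a `choices[0].message.content` field; the page does not confirm this, so treat the response shape as an assumption. The names `build_payload` and `extract_reply` are hypothetical and used only for illustration.

```python
from typing import Any, Dict


def build_payload(prompt: str, max_tokens: int = 150, temperature: float = 0.7) -> Dict[str, Any]:
    """Build a request body with the same fields as the example above."""
    return {
        "model": "llama-4-maverick",
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
        "temperature": temperature,
    }


def extract_reply(body: Dict[str, Any]) -> str:
    """Return the first choice's text.

    Assumption: the body follows the OpenAI-style layout
    {"choices": [{"message": {"content": "..."}}]}.
    """
    choices = body.get("choices") or []
    if not choices:
        # Error bodies usually carry no choices; show what came back instead.
        raise ValueError(f"no choices in response: {body!r}")
    return choices[0]["message"]["content"]
```

With these in place, the earlier call could pass `json=build_payload("Hello, how are you?")` and finish with `print(extract_reply(response.json()))`, provided the response really has that shape.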

Pricing

Price (per 1M Tokens)

Model               Input    Output
Llama 4 Maverick

Model Details

Provider