LLaMA 4 Scout from Meta is an efficient 17 billion parameter model with 16 experts, optimized for lightweight chat and text tasks. As the best multimodal model in its class, it processes text, images, and video, supporting multiple languages and a industry-leading 10 million token context window. Highly steerable, it allows easy tailoring of responses through system prompts, unlocking advanced use cases in memory and extended context while remaining powerful and efficient.
curl -X POST https://neuralhubapi.com/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer NEURALHUB_API_KEY" \
-d '{
"model": "meta/llama4-scout-17b",
"messages": [
{ "role": "system", "content": "You are a helpful assistant." },
{ "role": "user", "content": "What is the answer to life, the universe, and everything?" }
],
"temperature": 0.7,
"max_tokens": 500,
"top_p": 0.9
}'
The API returns an OpenAI-compatible response. Example:
{
"id": "chatcmpl-<uuid>",
"object": "chat.completion",
"created": 1756503193,
"model": "meta/llama4-scout-17b",
"choices": [
{
"index": 0,
"message": {
"role": "assistant",
"content": "The answer to life, the universe, and everything is famously **42**, according to Douglas Adams' *The Hitchhiker’s Guide to the Galaxy*..."
},
"finish_reason": "stop"
}
],
"usage": {
"prompt_tokens": 26,
"completion_tokens": 169,
"total_tokens": 195
}
}