Llama 4 Maverick is a high-performance multimodal model built with a mixture-of-experts architecture. It has 17 billion active parameters (with 400 billion total) and is designed for general assistant and chat use cases.
Provider: The company that provides the model
Context window: The number of tokens you can send in a prompt
Max output tokens: The maximum number of tokens a model can generate in one request
Input token cost: The cost of prompt tokens sent to the model
Output token cost: The cost of output tokens generated by the model
Knowledge cut-off: When the model's knowledge ends
Release date: When the model was launched
Tool calling: Capability for the model to use external tools
Vision: Ability to process and analyze visual inputs, like images
Multilingual: Support for multiple languages
Fine-tuning: Whether the model supports fine-tuning on custom datasets
The pricing for Llama 4 Maverick is not yet available.
Llama 4 Maverick supports a context window of up to 1,000,000 tokens.
Llama 4 Maverick can generate up to 8,192 tokens in a single output.
Llama 4 Maverick was released on April 5, 2025.
The knowledge cut-off date for Llama 4 Maverick is not available.
Yes, Llama 4 Maverick supports vision capabilities.
Yes, Llama 4 Maverick supports tool calling (also known as function calling).
Yes, Llama 4 Maverick supports multiple languages.
Yes, Llama 4 Maverick supports fine-tuning on custom datasets.
Official documentation for Llama 4 Maverick is available from Meta, the model's developer.
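As a rough illustration of how the token limits quoted above might be applied before sending a request, the sketch below checks a prompt size and a requested output size against the 1,000,000-token context window and the 8,192-token output cap. The helper name `check_request` is hypothetical and not part of any official SDK. The sketch also assumes the two limits apply independently; some providers count output tokens against the context window, so check your provider's documentation.

```python
from typing import List

# Limits as stated on this page for Llama 4 Maverick.
CONTEXT_WINDOW_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 8_192


def check_request(prompt_tokens: int, requested_output_tokens: int) -> List[str]:
    """Hypothetical helper: return a list of problems, empty if the request fits.

    Assumption: the prompt limit and the output limit are checked separately.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")

    problems = []
    if prompt_tokens > CONTEXT_WINDOW_TOKENS:
        over = prompt_tokens - CONTEXT_WINDOW_TOKENS
        problems.append(f"prompt exceeds the context window by {over} tokens")
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        over = requested_output_tokens - MAX_OUTPUT_TOKENS
        problems.append(f"requested output exceeds the output cap by {over} tokens")
    return problems


if __name__ == "__main__":
    # A request within both limits yields no problems.
    print(check_request(500_000, 4_096))
    # A request over both limits yields two messages.
    print(check_request(1_000_500, 10_000))
```

Token counts depend on the tokenizer, so in practice `prompt_tokens` would come from whatever tokenizer or usage metadata your provider exposes.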