Model Card for Mixtral-8x22B-Instruct-v0.1
========================================================================================

The Mixtral-8x22B-Instruct-v0.1 Large Language Model (LLM) is an instruction fine-tuned version of [Mixtral-8x22B-v0.1](https://huggingface.co/mistralai/Mixtral-8x22B-v0.1).

Run the model
-------------------------------

    import torch
    from transformers import AutoModelForCausalLM
    from mistral_common.protocol.instruct.messages import UserMessage
    from mistral_common.protocol.instruct.request import ChatCompletionRequest
    from mistral_common.protocol.instruct.tool_calls import (
        Tool,
        Function,
    )
    from mistral_common.tokens.tokenizers.mistral import MistralTokenizer

    device = "cuda"  # the device to load the model onto

    tokenizer_v3 = MistralTokenizer.v3()

    # Describe the available tool and the user request.
    mistral_query = ChatCompletionRequest(
        tools=[
            Tool(
                function=Function(
                    name="get_current_weather",
                    description="Get the current weather",
                    parameters={
                        "type": "object",
                        "properties": {
                            "location": {
                                "type": "string",
                                "description": "The city and state, e.g. San Francisco, CA",
                            },
                            "format": {
                                "type": "string",
                                "enum": ["celsius", "fahrenheit"],
                                "description": "The temperature unit to use. Infer this from the users location.",
                            },
                        },
                        "required": ["location", "format"],
                    },
                )
            )
        ],
        messages=[
            UserMessage(content="What's the weather like today in Paris"),
        ],
        model="test",
    )

    # encode_chat_completion returns a plain list of token ids, so wrap it
    # in a batch-of-one tensor before moving it to the device.
    encodeds = tokenizer_v3.encode_chat_completion(mistral_query).tokens
    model_inputs = torch.tensor([encodeds]).to(device)

    model = AutoModelForCausalLM.from_pretrained("mistralai/Mixtral-8x22B-Instruct-v0.1")
    model.to(device)

    generated_ids = model.generate(model_inputs, max_new_tokens=1000, do_sample=True)
    sp_tokenizer = tokenizer_v3.instruct_tokenizer.tokenizer
    # The tokenizer's decode expects a list of ints, not a tensor.
    decoded = sp_tokenizer.decode(generated_ids[0].tolist())
    print(decoded)
    

Alternatively, you can run this example with the Hugging Face tokenizer. To use this example, you'll need transformers version 4.39.0 or higher.

    pip install "transformers>=4.39.0"
    

    from transformers import AutoModelForCausalLM, AutoTokenizer
    
    model_id = "mistralai/Mixtral-8x22B-Instruct-v0.1"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    conversation=[
        {"role": "user", "content": "What's the weather like in Paris?"},
        {
            "role": "tool_calls",
            "content": [
                {
                    "name": "get_current_weather",
                    "arguments": {"location": "Paris, France", "format": "celsius"},
                    
                }
            ]
        },
        {
            "role": "tool_results",
            "content": {"content": 22}
        },
        {"role": "assistant", "content": "The current temperature in Paris, France is 22 degrees Celsius."},
        {"role": "user", "content": "What about San Francisco?"}
    ]
    
    
    tools = [
        {
            "type": "function",
            "function": {
                "name": "get_current_weather",
                "description": "Get the current weather",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "location": {
                            "type": "string",
                            "description": "The city and state, e.g. San Francisco, CA",
                        },
                        "format": {
                            "type": "string",
                            "enum": ["celsius", "fahrenheit"],
                            "description": "The temperature unit to use. Infer this from the users location.",
                        },
                    },
                    "required": ["location", "format"],
                },
            },
        }
    ]
    
    # render the tool use prompt as a string:
    tool_use_prompt = tokenizer.apply_chat_template(
        conversation,
        chat_template="tool_use",
        tools=tools,
        tokenize=False,
        add_generation_prompt=True,
    )
    model = AutoModelForCausalLM.from_pretrained("mistralai/Mixtral-8x22B-Instruct-v0.1")
    
    inputs = tokenizer(tool_use_prompt, return_tensors="pt")
    
    outputs = model.generate(**inputs, max_new_tokens=20)
    print(tokenizer.decode(outputs[0], skip_special_tokens=True))
    

Instruct tokenizer
=========================================

The Hugging Face tokenizer included in this release should match our own. To compare the two, install `mistral-common` (`pip install mistral-common`) and run:

    from mistral_common.protocol.instruct.messages import (
        AssistantMessage,
        UserMessage,
    )
    from mistral_common.tokens.tokenizers.mistral import MistralTokenizer
    from mistral_common.protocol.instruct.request import ChatCompletionRequest
    
    from transformers import AutoTokenizer
    
    tokenizer_v3 = MistralTokenizer.v3()
    
    mistral_query = ChatCompletionRequest(
        messages=[
            UserMessage(content="How many experts ?"),
            AssistantMessage(content="8"),
            UserMessage(content="How big ?"),
            AssistantMessage(content="22B"),
            UserMessage(content="Noice  !"),
        ],
        model="test",
    )
    hf_messages = mistral_query.model_dump()['messages']
    
    tokenized_mistral = tokenizer_v3.encode_chat_completion(mistral_query).tokens
    
    tokenizer_hf = AutoTokenizer.from_pretrained('mistralai/Mixtral-8x22B-Instruct-v0.1')
    tokenized_hf = tokenizer_hf.apply_chat_template(hf_messages, tokenize=True)
    
    assert tokenized_hf == tokenized_mistral
    

Function calling and special tokens
===========================================================================

This tokenizer includes additional special tokens related to function calling:

*   `[TOOL_CALLS]`
*   `[AVAILABLE_TOOLS]`
*   `[/AVAILABLE_TOOLS]`
*   `[TOOL_RESULTS]`
*   `[/TOOL_RESULTS]`

If you want to use this model with function calling, format your prompts the same way our [SentencePieceTokenizerV3](https://github.com/mistralai/mistral-common/blob/main/src/mistral_common/tokens/tokenizers/sentencepiece.py#L299) does.
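Once the model has answered, the caller still has to pull the tool call out of the generated text. The following is a minimal sketch of that step, not the `mistral-common` implementation. It assumes that the decoded output still contains the literal `[TOOL_CALLS]` marker and that the marker is followed by a JSON list of objects with `name` and `arguments` keys; the helper name `parse_tool_calls` and the sample string are hypothetical.

```python
import json

TOOL_CALLS_MARKER = "[TOOL_CALLS]"  # special token named in the list above


def parse_tool_calls(decoded: str) -> list:
    """Return the tool calls found in decoded model output, or [] if none.

    Assumption (not confirmed by this card): the text after the marker
    starts with a JSON list of {"name": ..., "arguments": ...} objects.
    """
    _, marker, tail = decoded.rpartition(TOOL_CALLS_MARKER)
    if not marker:
        return []  # the marker never appeared, so no tool was requested
    # raw_decode parses the first JSON value and ignores trailing text such as "</s>".
    calls, _ = json.JSONDecoder().raw_decode(tail.lstrip())
    return calls if isinstance(calls, list) else [calls]


# Hypothetical decoded output, written by hand for illustration.
sample = (
    '[TOOL_CALLS] [{"name": "get_current_weather", '
    '"arguments": {"location": "Paris, France", "format": "celsius"}}]</s>'
)
calls = parse_tool_calls(sample)
print(calls[0]["name"], calls[0]["arguments"])
```

Decoding with `skip_special_tokens=True` would likely strip the marker, so a parser like this only makes sense when special tokens are kept in the decoded text.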

The Mistral AI Team
===========================================

Albert Jiang, Alexandre Sablayrolles, Alexis Tacnet, Antoine Roux, Arthur Mensch, Audrey Herblin-Stoop, Baptiste Bout, Baudouin de Monicault, Blanche Savary, Bam4d, Caroline Feldman, Devendra Singh Chaplot, Diego de las Casas, Eleonore Arcelin, Emma Bou Hanna, Etienne Metzger, Gianna Lengyel, Guillaume Bour, Guillaume Lample, Harizo Rajaona, Jean-Malo Delignon, Jia Li, Justus Murke, Louis Martin, Louis Ternon, Lucile Saulnier, Lélio Renard Lavaud, Margaret Jennings, Marie Pellat, Marie Torelli, Marie-Anne Lachaux, Nicolas Schuhl, Patrick von Platen, Pierre Stock, Sandeep Subramanian, Sophia Yang, Szymon Antoniak, Teven Le Scao, Thibaut Lavril, Timothée Lacroix, Théophile Gervet, Thomas Wang, Valera Nemychnikova, William El Sayed, William Marshall

## Model overview

`Mixtral-8x22B-Instruct-v0.1` is a Large Language Model (LLM) that has been instruction fine-tuned by the Mistral AI team. It is built on the [Mixtral-8x22B-v0.1](https://huggingface.co/mistralai/Mixtral-8x22B-v0.1) model, a pretrained generative Sparse Mixture of Experts. The instruct model is intended to act as a helpful AI assistant that can engage in dialogue and assist with a variety of tasks.
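To make "Sparse Mixture of Experts" concrete, here is a rough sketch of the routing idea: a router scores every expert for a token, only the best few experts run, and their outputs are combined with normalised weights. This card does not describe the routing itself, so the choice of `k = 2` over 8 experts, the function name `top_k_route`, and the example scores are all assumptions made for illustration.

```python
import math


def top_k_route(router_logits: list, k: int = 2) -> dict:
    """Pick the k highest-scoring experts and softmax-normalise their weights."""
    # Indices of the k largest logits (ties keep the lower index first).
    top = sorted(range(len(router_logits)), key=lambda i: router_logits[i], reverse=True)[:k]
    # Subtract the peak before exponentiating for numerical stability.
    peak = max(router_logits[i] for i in top)
    exps = {i: math.exp(router_logits[i] - peak) for i in top}
    total = sum(exps.values())
    return {i: e / total for i, e in exps.items()}


# Hypothetical router scores for one token over 8 experts.
logits = [0.1, 2.0, -1.0, 0.3, 2.0, 0.0, -0.5, 1.2]
weights = top_k_route(logits)
print(weights)
```

Only the selected experts would be evaluated for that token, which is why a sparse model can hold many more parameters than it uses per token.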

## Model inputs and outputs

The Mixtral-8x22B-Instruct-v0.1 model takes textual prompts as input and generates textual responses. The input prompts should be formatted with `[INST]` and `[/INST]` tokens to indicate the instructional context. The model can then generate responses that are tailored to the specific instruction provided.

### Inputs
- Textual prompts surrounded by `[INST]` and `[/INST]` tokens to indicate the instructional context
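
As an illustration of the `[INST]` / `[/INST]` layout, the sketch below assembles a multi-turn prompt string by hand. The exact whitespace and the `<s>` / `</s>` boundary markers are assumptions, and `build_inst_prompt` is a hypothetical helper. For real use, prefer `apply_chat_template` or `mistral-common`, which produce token ids directly and avoid guessing how special tokens are written as text.

```python
from typing import Optional

BOS, EOS = "<s>", "</s>"  # assumed sequence-boundary markers


def build_inst_prompt(turns: list) -> str:
    """Format (user, assistant) pairs; the final assistant reply may be None."""
    parts = [BOS]
    for user, assistant in turns:
        parts.append(f"[INST] {user.strip()} [/INST]")
        if assistant is not None:
            # A completed assistant turn is closed with the end-of-sequence marker.
            parts.append(f" {assistant.strip()}{EOS}")
    return "".join(parts)


last_reply: Optional[str] = None  # the model has not answered the last turn yet
prompt = build_inst_prompt([
    ("How many experts ?", "8"),
    ("How big ?", last_reply),
])
print(prompt)
```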

### Outputs
- Textual responses generated by the model based on the provided instruction

## Capabilities

The Mixtral-8x22B-Instruct-v0.1 model is capable of engaging in natural language dialogue and assisting with a variety of tasks. It can provide helpful information, answer questions, and generate text in response to specific instructions. The model has been trained on a diverse set of data, allowing it to converse on a wide range of topics.

## What can I use it for?

The Mixtral-8x22B-Instruct-v0.1 model can be used for a variety of applications, such as:

- Building conversational AI assistants
- Generating text content (e.g., articles, stories, scripts)
- Providing task-oriented assistance (e.g., research, analysis, problem-solving)
- Enhancing existing applications with natural language capabilities

The [Mistral-7B-Instruct-v0.2](https://aimodels.fyi/models/huggingFace/mistral-7b-instruct-v02-mistralai) and [Mistral-7B-Instruct-v0.1](https://aimodels.fyi/models/huggingFace/mistral-7b-instruct-v01-mistralai) models from the same maintainer are similar and can also be explored for related use cases.

## Things to try

One interesting aspect of the Mixtral-8x22B-Instruct-v0.1 model is its ability to handle complex instructions and engage in multi-turn dialogues. You could try providing the model with a series of related instructions and see how it responds, maintaining context and coherence throughout the conversation.
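
For multi-turn experiments it helps to keep the history in one place. The sketch below is a small, hypothetical `Conversation` helper that stores messages in the list-of-dicts shape used by `apply_chat_template` and rejects out-of-order turns. It assumes the chat template expects user and assistant messages to alternate strictly, starting with the user.

```python
class Conversation:
    """Keeps an ordered message history with strictly alternating roles."""

    def __init__(self) -> None:
        self.messages = []

    def add(self, role: str, content: str) -> None:
        # Even positions must be user turns, odd positions assistant turns.
        expected = "user" if len(self.messages) % 2 == 0 else "assistant"
        if role != expected:
            raise ValueError(f"expected a {expected!r} message, got {role!r}")
        self.messages.append({"role": role, "content": content})


chat = Conversation()
chat.add("user", "Suggest a name for a bakery.")
chat.add("assistant", "How about 'Crumb & Co.'?")
chat.add("user", "Make it sound more French.")
print(chat.messages)
```

After each generation, the model's reply would be appended with `chat.add("assistant", ...)` before the next user turn is added.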

Another interesting experiment would be to provide the model with specific task-oriented instructions, such as generating a business plan, writing a research paper, or solving a coding problem. Observe how the model's responses adapt to the given task and the level of detail and quality it provides.