E5-mistral-7b-instruct
----------------------

[Improving Text Embeddings with Large Language Models](https://arxiv.org/pdf/2401.00368.pdf). Liang Wang, Nan Yang, Xiaolong Huang, Linjun Yang, Rangan Majumder, Furu Wei, arXiv 2024

This model has 32 layers and the embedding size is 4096.

Usage
-----

Below is an example to encode queries and passages from the MS-MARCO passage ranking dataset.

### Sentence Transformers

    from sentence_transformers import SentenceTransformer
    
    model = SentenceTransformer("intfloat/e5-mistral-7b-instruct")
    # In case you want to reduce the maximum sequence length:
    model.max_seq_length = 4096
    
    queries = [
        "how much protein should a female eat",
        "summit define",
    ]
    documents = [
        "As a general guideline, the CDC's average requirement of protein for women ages 19 to 70 is 46 grams per day. But, as you can see from this chart, you'll need to increase that if you're expecting or training for a marathon. Check out the chart below to see how much protein you should be eating each day.",
        "Definition of summit for English Language Learners. : 1  the highest point of a mountain : the top of a mountain. : 2  the highest level. : 3  a meeting or series of meetings between the leaders of two or more governments."
    ]
    
    query_embeddings = model.encode(queries, prompt_name="web_search_query")
    document_embeddings = model.encode(documents)
    
    scores = (query_embeddings @ document_embeddings.T) * 100
    print(scores.tolist())
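
The `scores` matrix has one row per query and one column per document. As a minimal sketch of how such a matrix is formed and read, the dependency-free example below uses made-up 3-dimensional vectors in place of real 4096-dimensional embeddings. It assumes the embeddings are L2-normalized before the product (the Transformers example in this card does this explicitly), so each score is a cosine similarity scaled by 100. The helper names `l2_normalize`, `score_matrix`, and `best_document` are illustrative and not part of any library.

```python
import math
from typing import List


def l2_normalize(vector: List[float]) -> List[float]:
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector]


def score_matrix(query_vectors: List[List[float]],
                 document_vectors: List[List[float]]) -> List[List[float]]:
    """Cosine similarity scaled by 100: one row per query, one column per document."""
    queries = [l2_normalize(v) for v in query_vectors]
    documents = [l2_normalize(v) for v in document_vectors]
    return [
        [100 * sum(q * d for q, d in zip(qv, dv)) for dv in documents]
        for qv in queries
    ]


def best_document(scores: List[List[float]]) -> List[int]:
    """For each query row, return the column index of the highest score."""
    return [max(range(len(row)), key=row.__getitem__) for row in scores]


# Toy vectors invented for illustration; they are NOT real model outputs.
toy_queries = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
toy_documents = [[2.0, 0.0, 0.0], [0.0, 3.0, 4.0]]

scores = score_matrix(toy_queries, toy_documents)
print(scores)
print(best_document(scores))
```

With real embeddings, the same argmax over each row selects the best-matching document for each query.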
    

Have a look at [config\_sentence\_transformers.json](/intfloat/e5-mistral-7b-instruct/blob/main/config_sentence_transformers.json) for the prompts that are pre-configured, such as `web_search_query`, `sts_query`, and `summarization_query`. Additionally, check out [unilm/e5/utils.py](https://github.com/microsoft/unilm/blob/9c0f1ff7ca53431fe47d2637dfe253643d94185b/e5/utils.py#L106) for prompts we used for evaluation. You can use these via e.g. `model.encode(queries, prompt="Instruct: Given a claim, find documents that refute the claim\nQuery: ")`.
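
The `prompt` string follows the same `Instruct: ...\nQuery: ` layout that `get_detailed_instruct` produces in the Transformers example. As a small sketch of how to build such prompts for several tasks, the snippet below uses the two task descriptions quoted in this card. The dictionary keys and the helper names `build_prompt` and `format_query` are illustrative assumptions, not part of `sentence_transformers`.

```python
# Task descriptions copied from this card; the dictionary keys are made-up labels.
TASKS = {
    "web_search": "Given a web search query, retrieve relevant passages that answer the query",
    "refute_claim": "Given a claim, find documents that refute the claim",
}


def build_prompt(task_description: str) -> str:
    """Return the prefix that is prepended to every query for a task."""
    return f"Instruct: {task_description}\nQuery: "


def format_query(task_description: str, query: str) -> str:
    """Return the full text the model would see for one query."""
    return build_prompt(task_description) + query


prompt = build_prompt(TASKS["refute_claim"])
print(repr(prompt))
print(format_query(TASKS["web_search"], "summit define"))
```

The string returned by `build_prompt` is what would be passed as `prompt=` to `model.encode`; documents are encoded without any prefix.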

### Transformers

    import torch
    import torch.nn.functional as F
    
    from torch import Tensor
    from transformers import AutoTokenizer, AutoModel
    
    
    def last_token_pool(last_hidden_states: Tensor,
                        attention_mask: Tensor) -> Tensor:
        left_padding = (attention_mask[:, -1].sum() == attention_mask.shape[0])
        if left_padding:
            return last_hidden_states[:, -1]
        else:
            sequence_lengths = attention_mask.sum(dim=1) - 1
            batch_size = last_hidden_states.shape[0]
            return last_hidden_states[torch.arange(batch_size, device=last_hidden_states.device), sequence_lengths]
    
    
    def get_detailed_instruct(task_description: str, query: str) -> str:
        return f'Instruct: {task_description}\nQuery: {query}'
    
    
    # Each query must come with a one-sentence instruction that describes the task
    task = 'Given a web search query, retrieve relevant passages that answer the query'
    queries = [
        get_detailed_instruct(task, 'how much protein should a female eat'),
        get_detailed_instruct(task, 'summit define')
    ]
    # No need to add instruction for retrieval documents
    documents = [
        "As a general guideline, the CDC's average requirement of protein for women ages 19 to 70 is 46 grams per day. But, as you can see from this chart, you'll need to increase that if you're expecting or training for a marathon. Check out the chart below to see how much protein you should be eating each day.",
        "Definition of summit for English Language Learners. : 1  the highest point of a mountain : the top of a mountain. : 2  the highest level. : 3  a meeting or series of meetings between the leaders of two or more governments."
    ]
    input_texts = queries + documents
    
    tokenizer = AutoTokenizer.from_pretrained('intfloat/e5-mistral-7b-instruct')
    model = AutoModel.from_pretrained('intfloat/e5-mistral-7b-instruct')
    
    max_length = 4096
    # Tokenize the input texts
    batch_dict = tokenizer(input_texts, max_length=max_length, padding=True, truncation=True, return_tensors='pt')
    
    # Inference only, so skip gradient tracking to save memory
    with torch.no_grad():
        outputs = model(**batch_dict)
    embeddings = last_token_pool(outputs.last_hidden_state, batch_dict['attention_mask'])
    
    # normalize embeddings
    embeddings = F.normalize(embeddings, p=2, dim=1)
    scores = (embeddings[:2] @ embeddings[2:].T) * 100
    print(scores.tolist())
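
The branch inside `last_token_pool` exists because the position of the last real token depends on the padding side. The sketch below restates the same logic on plain Python lists so it can be run without `torch`. It is an illustration rather than a replacement for the function above, and the name `last_token_pool_lists` and the toy values are assumptions made for this example.

```python
from typing import List


def last_token_pool_lists(hidden_states: List[List[List[float]]],
                          attention_mask: List[List[int]]) -> List[List[float]]:
    """Select one hidden vector per sequence: the one at the last real token."""
    # If every row ends with a real token, the batch is left-padded
    # and the last position is the last token for all sequences.
    left_padding = all(row[-1] == 1 for row in attention_mask)
    if left_padding:
        return [sequence[-1] for sequence in hidden_states]
    # Otherwise the batch is right-padded: the last real token sits at
    # (number of real tokens - 1) in each row.
    last_indices = [sum(row) - 1 for row in attention_mask]
    return [sequence[i] for sequence, i in zip(hidden_states, last_indices)]


# Toy hidden states: the vector at sequence i, position t is [i, t],
# so the pooled output shows which position was selected.
toy_hidden = [[[float(i), float(t)] for t in range(4)] for i in range(2)]

right_padded_mask = [[1, 1, 1, 0], [1, 1, 0, 0]]
left_padded_mask = [[0, 1, 1, 1], [0, 0, 1, 1]]

pooled_right = last_token_pool_lists(toy_hidden, right_padded_mask)
pooled_left = last_token_pool_lists(toy_hidden, left_padded_mask)
print(pooled_right)
print(pooled_left)
```

In the right-padded batch the selected positions differ per sequence, while in the left-padded batch every sequence uses the final position.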
    

Supported Languages
-------------------

This model is initialized from [Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) and fine-tuned on a mixture of multilingual datasets. As a result, it has some multilingual capability. However, since Mistral-7B-v0.1 is mainly trained on English data, we recommend using this model for English only. For multilingual use cases, please refer to [multilingual-e5-large](https://huggingface.co/intfloat/multilingual-e5-large).

MTEB Benchmark Evaluation
-------------------------

Check out [unilm/e5](https://github.com/microsoft/unilm/tree/master/e5) to reproduce evaluation results on the [BEIR](https://arxiv.org/abs/2104.08663) and [MTEB](https://arxiv.org/abs/2210.07316) benchmarks.

FAQ
---

**1\. Do I need to add instructions to the query?**

Yes. The model was trained with instructions on the query side, so omitting them degrades performance. The task definition should be a one-sentence instruction that describes the task. This makes it possible to customize text embeddings for different scenarios through natural language instructions.

Please check out [unilm/e5/utils.py](https://github.com/microsoft/unilm/blob/9c0f1ff7ca53431fe47d2637dfe253643d94185b/e5/utils.py#L106) for instructions we used for evaluation.

On the other hand, there is no need to add instructions to the document side.

**2\. Why are my reproduced results slightly different from those reported in the model card?**

Different versions of `transformers` and `pytorch` could cause negligible but non-zero performance differences.

**3\. Where are the LoRA-only weights?**

You can find the LoRA-only weights at [https://huggingface.co/intfloat/e5-mistral-7b-instruct/tree/main/lora](https://huggingface.co/intfloat/e5-mistral-7b-instruct/tree/main/lora).

Citation
--------

If you find our paper or models helpful, please consider citing them as follows:

    @article{wang2023improving,
      title={Improving Text Embeddings with Large Language Models},
      author={Wang, Liang and Yang, Nan and Huang, Xiaolong and Yang, Linjun and Majumder, Rangan and Wei, Furu},
      journal={arXiv preprint arXiv:2401.00368},
      year={2023}
    }
    
    @article{wang2022text,
      title={Text Embeddings by Weakly-Supervised Contrastive Pre-training},
      author={Wang, Liang and Yang, Nan and Huang, Xiaolong and Jiao, Binxing and Yang, Linjun and Jiang, Daxin and Majumder, Rangan and Wei, Furu},
      journal={arXiv preprint arXiv:2212.03533},
      year={2022}
    }
    

Limitations
-----------

Using this model for inputs longer than 4096 tokens is not recommended.

This model's multilingual capability is still inferior to that of [multilingual-e5-large](https://huggingface.co/intfloat/multilingual-e5-large) in some cases.

## Model Overview

The `e5-mistral-7b-instruct` model is a text embedding model published by intfloat. It is initialized from Mistral-7B-v0.1 and fine-tuned as described in [Improving Text Embeddings with Large Language Models](https://arxiv.org/pdf/2401.00368.pdf). Each query is paired with a one-sentence natural language instruction, which lets the same model produce embeddings tailored to different tasks. It is not a text generation model.

This model belongs to the same E5 family of embedding models as [multilingual-e5-large](https://aimodels.fyi/models/huggingFace/multilingual-e5-large-intfloat) and [multilingual-e5-base](https://aimodels.fyi/models/huggingFace/multilingual-e5-base-intfloat), also published by intfloat. Because Mistral-7B-v0.1 is mainly trained on English data, this model is recommended for English only; for multilingual use cases, multilingual-e5-large is the recommended choice.

## Model Inputs and Outputs

The `e5-mistral-7b-instruct` model takes in text and returns one embedding vector per input. It does not generate text.

### Inputs
- **Queries**: Natural language text prefixed with a one-sentence task instruction, in the form `Instruct: {task_description}\nQuery: {query}`.
- **Documents**: Plain text with no instruction added.

Inputs longer than 4096 tokens are not recommended.

### Outputs
- **Embeddings**: One 4096-dimensional vector per input text. The relevance of a document to a query is scored with the dot product of their L2-normalized embeddings.

## Capabilities

The `e5-mistral-7b-instruct` model produces embeddings that can be customized for different scenarios through natural language instructions on the query side. Capabilities reflected in this card include:

- Retrieving passages that answer a web search query, via the pre-configured `web_search_query` prompt
- Semantic textual similarity, via the pre-configured `sts_query` prompt
- Summarization-related matching, via the pre-configured `summarization_query` prompt
- Task-specific retrieval driven by a custom one-sentence instruction, such as finding documents that refute a claim

Evaluation results on the BEIR and MTEB benchmarks can be reproduced with the code in unilm/e5.

## What Can I Use It For?

The `e5-mistral-7b-instruct` model could be used in a variety of projects and applications, such as:

- **Semantic search**: Embed queries and passages, then rank passages by similarity score, as in the MS-MARCO example in this card.

- **Instruction-specific retrieval**: Change the one-sentence instruction to target a different retrieval goal without retraining the model.

- **Semantic similarity**: Compare pairs of texts by the similarity of their embeddings.

- **Research and analysis**: Use the embeddings as input features for tasks covered by MTEB, such as clustering and classification.

To get started, see the example code in the Usage section of this card or on the [intfloat/e5-mistral-7b-instruct](https://aimodels.fyi/creators/huggingFace/intfloat) model page.

## Things to Try

One interesting aspect of the `e5-mistral-7b-instruct` model is that the query instruction changes the resulting embedding. Try encoding the same query with different one-sentence instructions, for example a web search instruction and a claim refutation instruction, and observe how the ranking of documents changes.

Another experiment is to evaluate the model on a specific task, such as passage retrieval or semantic similarity, and compare it with other embedding models such as multilingual-e5-large. This can help you understand the strengths and limitations of the `e5-mistral-7b-instruct` model, including its weaker multilingual capability.

Overall, the `e5-mistral-7b-instruct` model is a text embedding model whose behavior can be steered with natural language instructions. It is recommended for English inputs of up to 4096 tokens.