[](#passage-reranking-multilingual-bert--)Passage Reranking Multilingual BERT  
=======================================================================================

[](#model-description)Model description
---------------------------------------

**Input:** Supports over 100 Languages. See [List of supported languages](https://github.com/google-research/bert/blob/master/multilingual.md#list-of-languages) for all available.

**Purpose:** This module takes a search query \[1\] and a passage \[2\] and calculates if the passage matches the query. It can be used as an improvement for Elasticsearch Results and boosts the relevancy by up to 100%.

**Architecture:** On top of BERT there is a Densly Connected NN which takes the 768 Dimensional \[CLS\] Token as input and provides the output ([Arxiv](https://arxiv.org/abs/1901.04085)).

**Output:** Just a single value between between -10 and 10. Better matching query,passage pairs tend to have a higher a score.

[](#intended-uses--limitations)Intended uses & limitations
----------------------------------------------------------

Both query\[1\] and passage\[2\] have to fit in 512 Tokens. As you normally want to rerank the first dozens of search results keep in mind the inference time of approximately 300 ms/query.

#### [](#how-to-use)How to use

    from transformers import AutoTokenizer, AutoModelForSequenceClassification
    
    tokenizer = AutoTokenizer.from_pretrained("amberoad/bert-multilingual-passage-reranking-msmarco")
    
    model = AutoModelForSequenceClassification.from_pretrained("amberoad/bert-multilingual-passage-reranking-msmarco")
    

This Model can be used as a drop-in replacement in the [Nboost Library](https://github.com/koursaros-ai/nboost) Through this you can directly improve your Elasticsearch Results without any coding.

[](#training-data)Training data
-------------------------------

This model is trained using the [**Microsoft MS Marco Dataset**](https://microsoft.github.io/msmarco/ "Microsoft MS Marco"). This training dataset contains approximately 400M tuples of a query, relevant and non-relevant passages. All datasets used for training and evaluating are listed in this [table](https://github.com/microsoft/MSMARCO-Passage-Ranking#data-information-and-formating). The used dataset for training is called _Train Triples Large_, while the evaluation was made on _Top 1000 Dev_. There are 6,900 queries in total in the development dataset, where each query is mapped to top 1,000 passage retrieved using BM25 from MS MARCO corpus.

[](#training-procedure)Training procedure
-----------------------------------------

The training is performed the same way as stated in this [README](https://github.com/nyu-dl/dl4marco-bert "NYU Github"). See their excellent Paper on [Arxiv](https://arxiv.org/abs/1901.04085).

We changed the BERT Model from an English only to the default BERT Multilingual uncased Model from [Google](https://huggingface.co/bert-base-multilingual-uncased).

Training was done 400 000 Steps. This equaled 12 hours an a TPU V3-8.

[](#eval-results)Eval results
-----------------------------

We see nearly similar performance than the English only Model in the English [Bing Queries Dataset](http://www.msmarco.org/). Although the training data is English only internal Tests on private data showed a far higher accurancy in German than all other available models.

Fine-tuned Models

Dependency

Eval Set

Search Boost[](#benchmarks)

Speed on GPU

**`amberoad/Multilingual-uncased-MSMARCO`** (This Model)

![PyTorch](https://img.shields.io/badge/PyTorch-blue)

[bing queries](http://www.msmarco.org/)

**+61%** (0.29 vs 0.18)

~300 ms/query[](#footnotes)

`nboost/pt-tinybert-msmarco`

![PyTorch](https://img.shields.io/badge/PyTorch-red)

[bing queries](http://www.msmarco.org/)

**+45%** (0.26 vs 0.18)

~50ms/query[](#footnotes)

`nboost/pt-bert-base-uncased-msmarco`

![PyTorch](https://img.shields.io/badge/PyTorch-red)

[bing queries](http://www.msmarco.org/)

**+62%** (0.29 vs 0.18)

~300 ms/query[](#footnotes)

`nboost/pt-bert-large-msmarco`

![PyTorch](https://img.shields.io/badge/PyTorch-red)

[bing queries](http://www.msmarco.org/)

**+77%** (0.32 vs 0.18)

\-

`nboost/pt-biobert-base-msmarco`

![PyTorch](https://img.shields.io/badge/PyTorch-red)

[biomed](https://github.com/naver/biobert-pretrained)

**+66%** (0.17 vs 0.10)

~300 ms/query[](#footnotes)

This table is taken from [nboost](https://github.com/koursaros-ai/nboost) and extended by the first line.

[](#contact-infos)Contact Infos
-------------------------------

[![](https://amberoad.de/images/logo_text.png)](https://amberoad.de/images/logo_text.png)

Amberoad is a company focussing on Search and Business Intelligence. We provide you:

*   Advanced Internal Company Search Engines thorugh NLP
*   External Search Egnines: Find Competitors, Customers, Suppliers

**Get in Contact now to benefit from our Expertise:**

The training and evaluation was performed by [**Philipp Reissel**](https://reissel.eu/) and [**Igli Manaj**](https://github.com/iglimanaj)

[](https://de.linkedin.com/company/amberoad)[![Amberoad](https://i.stack.imgur.com/gVE0j.png)](https://i.stack.imgur.com/gVE0j.png) Linkedin | [Homepage](https://de.linkedin.com/company/amberoad) | [Email](/amberoad/bert-multilingual-passage-reranking-msmarco/blob/main/info@amberoad.de)

## Model overview

The `bert-multilingual-passage-reranking-msmarco` model is a multilingual BERT-based passage reranking model developed by Amberoad. This model can be used to improve the relevance of search results by re-scoring passages based on how well they match a given query. The model is built on top of BERT with a densely connected neural network that takes the 768-dimensional [CLS] token as input and outputs a single value between -10 and 10, indicating how well the passage matches the query.

The model is trained on the [Microsoft MS Marco Dataset](https://microsoft.github.io/msmarco/), which contains approximately 400 million query-passage pairs. This dataset covers over 100 different languages, allowing the model to perform passage reranking across a wide range of languages.

Compared to similar multilingual models like [multilingual-e5-base](https://aimodels.fyi/models/huggingFace/multilingual-e5-base-intfloat) and [multilingual-e5-large](https://aimodels.fyi/models/huggingFace/multilingual-e5-large-intfloat), the `bert-multilingual-passage-reranking-msmarco` model is more specialized for the task of passage reranking. Its architecture is tailored for this specific task, rather than being a more general-purpose text embedding model.

## Model inputs and outputs

### Inputs
- **Search query**: A text query for which the model will re-score and rank relevant passages.
- **Passage**: A text passage that the model will evaluate for relevance to the given query.

### Outputs
- **Relevance score**: A single numerical value between -10 and 10, indicating how well the passage matches the query. Higher scores indicate a better match.

## Capabilities

The `bert-multilingual-passage-reranking-msmarco` model can be used to improve the relevance of search results by re-scoring passages based on their match to the query. This can be particularly useful for applications like web search, enterprise search, or question answering, where retrieving the most relevant information is crucial.

The model's multilingual capabilities allow it to perform this passage reranking task across a wide range of languages, making it a versatile tool for global search and retrieval applications.

## What can I use it for?

The `bert-multilingual-passage-reranking-msmarco` model can be used as a drop-in replacement in the [Nboost Library](https://github.com/koursaros-ai/nboost) to directly improve the results of Elasticsearch searches. By re-ranking the top retrieved passages based on their relevance to the query, the model can boost the overall quality of the search results by up to 100%.

Additionally, the model could be useful in any application where you need to retrieve and rank relevant passages or documents based on a user query, such as:

- **Question answering systems**: Using the model to re-score candidate passages or documents to find the most relevant answers to user questions.
- **Chatbots and virtual assistants**: Leveraging the model to improve the relevance of information retrieved in response to user queries.
- **Academic or enterprise search**: Enhancing the quality of search results for research papers, internal documents, or other knowledge repositories.

## Things to try

One interesting aspect of the `bert-multilingual-passage-reranking-msmarco` model is its ability to perform passage reranking across a wide range of languages. You could experiment with using the model to improve search results for queries in different languages, and analyze how the performance varies across languages.

Additionally, you could explore combining the model's passage reranking capabilities with other search or retrieval techniques, such as using it in conjunction with traditional search engines or other AI-powered text ranking models. This could lead to even more robust and accurate search and retrieval solutions.