[](#model-card-for-gliner-multi)Model Card for GLiNER-multi
===========================================================

GLiNER is a Named Entity Recognition (NER) model capable of identifying any entity type using a bidirectional transformer encoder (BERT-like). It provides a practical alternative to traditional NER models, which are limited to predefined entities, and Large Language Models (LLMs) that, despite their flexibility, are costly and large for resource-constrained scenarios.

This version has been trained on the **Pile-NER** dataset (Research purpose). Commercially permission versions are available (**urchade/gliner\_smallv2**, **urchade/gliner\_mediumv2**, **urchade/gliner\_largev2**)

[](#links)Links
---------------

*   Paper: [https://arxiv.org/abs/2311.08526](https://arxiv.org/abs/2311.08526)
*   Repository: [https://github.com/urchade/GLiNER](https://github.com/urchade/GLiNER)

[](#available-models)Available models
-------------------------------------

Release

Model Name

\# of Parameters

Language

License

v0

[urchade/gliner\_base](https://huggingface.co/urchade/gliner_base)  
[urchade/gliner\_multi](https://huggingface.co/urchade/gliner_multi)

209M  
209M

English  
Multilingual

cc-by-nc-4.0

v1

[urchade/gliner\_small-v1](https://huggingface.co/urchade/gliner_small-v1)  
[urchade/gliner\_medium-v1](https://huggingface.co/urchade/gliner_medium-v1)  
[urchade/gliner\_large-v1](https://huggingface.co/urchade/gliner_large-v1)

166M  
209M  
459M

English  
English  
English

cc-by-nc-4.0

v2

[urchade/gliner\_small-v2](https://huggingface.co/urchade/gliner_small-v2)  
[urchade/gliner\_medium-v2](https://huggingface.co/urchade/gliner_medium-v2)  
[urchade/gliner\_large-v2](https://huggingface.co/urchade/gliner_large-v2)

166M  
209M  
459M

English  
English  
English

apache-2.0

v2.1

[urchade/gliner\_small-v2.1](https://huggingface.co/urchade/gliner_small-v2.1)  
[urchade/gliner\_medium-v2.1](https://huggingface.co/urchade/gliner_medium-v2.1)  
[urchade/gliner\_large-v2.1](https://huggingface.co/urchade/gliner_large-v2.1)  
[urchade/gliner\_multi-v2.1](https://huggingface.co/urchade/gliner_multi-v2.1)

166M  
209M  
459M  
209M

English  
English  
English  
Multilingual

apache-2.0

[](#installation)Installation
-----------------------------

To use this model, you must install the GLiNER Python library:

    !pip install gliner
    

[](#usage)Usage
---------------

Once you've downloaded the GLiNER library, you can import the GLiNER class. You can then load this model using `GLiNER.from_pretrained` and predict entities with `predict_entities`.

    from gliner import GLiNER
    
    model = GLiNER.from_pretrained("urchade/gliner_multi")
    
    text = """
    Cristiano Ronaldo dos Santos Aveiro (Portuguese pronunciation: [kitjnu naldu]; born 5 February 1985) is a Portuguese professional footballer who plays as a forward for and captains both Saudi Pro League club Al Nassr and the Portugal national team. Widely regarded as one of the greatest players of all time, Ronaldo has won five Ballon d'Or awards,[note 3] a record three UEFA Men's Player of the Year Awards, and four European Golden Shoes, the most by a European player. He has won 33 trophies in his career, including seven league titles, five UEFA Champions Leagues, the UEFA European Championship and the UEFA Nations League. Ronaldo holds the records for most appearances (183), goals (140) and assists (42) in the Champions League, goals in the European Championship (14), international goals (128) and international appearances (205). He is one of the few players to have made over 1,200 professional career appearances, the most by an outfield player, and has scored over 850 official senior career goals for club and country, making him the top goalscorer of all time.
    """
    
    labels = ["person", "award", "date", "competitions", "teams"]
    
    entities = model.predict_entities(text, labels)
    
    for entity in entities:
        print(entity["text"], "=>", entity["label"])
    

    Cristiano Ronaldo dos Santos Aveiro => person
    5 February 1985 => date
    Saudi Pro League => competitions
    Al Nassr => teams
    Portugal national team => teams
    Ballon d'Or => award
    UEFA Men's Player of the Year Awards => award
    European Golden Shoes => award
    UEFA Champions Leagues => competitions
    UEFA European Championship => competitions
    UEFA Nations League => competitions
    Champions League => competitions
    European Championship => competitions
    

    from gliner import GLiNER
    
    model = GLiNER.from_pretrained("urchade/gliner_multi")
    
    text = """
     - ,   .
    """
    # Gold:  - Drugname,  - Drugform
    
    labels = ["Drugname", "Drugform"]
    
    entities = model.predict_entities(text, labels)
    
    for entity in entities:
        print(entity["text"], "=>", entity["label"])
    

     => Drugname
     => Drugform
    

[](#named-entity-recognition-benchmark-result)Named Entity Recognition benchmark result
---------------------------------------------------------------------------------------

[![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317233cc92fd6fee317e030/Y5f7tK8lonGqeeO6L6bVI.png)](https://cdn-uploads.huggingface.co/production/uploads/6317233cc92fd6fee317e030/Y5f7tK8lonGqeeO6L6bVI.png)

[](#model-authors)Model Authors
-------------------------------

The model authors are:

*   [Urchade Zaratiana](https://huggingface.co/urchade)
*   Nadi Tomeh
*   Pierre Holat
*   Thierry Charnois

[](#citation)Citation
---------------------

    @misc{zaratiana2023gliner,
          title={GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer}, 
          author={Urchade Zaratiana and Nadi Tomeh and Pierre Holat and Thierry Charnois},
          year={2023},
          eprint={2311.08526},
          archivePrefix={arXiv},
          primaryClass={cs.CL}
    }

## Model overview

The `gliner_multi` model is a Named Entity Recognition (NER) model capable of identifying any entity type, providing a practical alternative to traditional NER models that are limited to predefined entities. Unlike Large Language Models (LLMs) that can be costly and large, this model is designed for resource-constrained scenarios. It uses a bidirectional transformer encoder (BERT-like) architecture and has been trained on the Pile-NER dataset.

Similar models include [mDeBERTa-v3-base-xnli-multilingual-nli-2mil7](https://aimodels.fyi/models/huggingFace/mdeberta-v3-base-xnli-multilingual-nli-2mil7-moritzlaurer), a multilingual model that can perform natural language inference on 100 languages, and [bert-base-NER](https://aimodels.fyi/models/huggingFace/bert-base-ner-dslim) and [bert-large-NER](https://aimodels.fyi/models/huggingFace/bert-large-ner-dslim), which are fine-tuned BERT models for named entity recognition.

## Model inputs and outputs

### Inputs
- **Text**: The `gliner_multi` model takes in arbitrary text as input and can identify entities within that text.

### Outputs
- **Named entities**: The model outputs a list of named entities found in the input text, along with their type (e.g., person, location, organization).

## Capabilities

The `gliner_multi` model is capable of identifying a wide range of entity types, going beyond the predefined categories typical of traditional NER models. This makes it a versatile tool for analyzing and understanding text content. The model's use of a BERT-like architecture also allows it to capture contextual information, improving the accuracy of its entity recognition.

## What can I use it for?

The `gliner_multi` model can be useful in a variety of applications that require understanding and analyzing textual data, such as:

- **Content analysis**: Identifying key entities in news articles, social media posts, or other text-based content to gain insights.
- **Information extraction**: Extracting specific types of entities (e.g., people, organizations, locations) from large corpora of text.
- **Knowledge graph construction**: Building knowledge graphs by connecting entities and their relationships extracted from text.
- **Recommendation systems**: Improving the accuracy of recommendations by understanding the entities mentioned in user-generated content.

## Things to try

One interesting aspect of the `gliner_multi` model is its ability to handle a wide range of entity types, going beyond the traditional categories. Try experimenting with different types of text, such as technical documents, social media posts, or literature, to see how the model performs in identifying less common or domain-specific entities. This can provide insights into the model's versatility and potential applications in various industries and use cases.

[](#about)About
===============

GLiNER is a Named Entity Recognition (NER) model capable of identifying any entity type using a bidirectional transformer encoder (BERT-like). It provides a practical alternative to traditional NER models, which are limited to predefined entities, and Large Language Models (LLMs) that, despite their flexibility, are costly and large for resource-constrained scenarios.

[](#links)Links
---------------

*   Paper: [https://arxiv.org/abs/2311.08526](https://arxiv.org/abs/2311.08526)
*   Repository: [https://github.com/urchade/GLiNER](https://github.com/urchade/GLiNER)

[](#available-models)Available models
-------------------------------------

Release

Model Name

\# of Parameters

Language

License

v0

[urchade/gliner\_base](https://huggingface.co/urchade/gliner_base)  
[urchade/gliner\_multi](https://huggingface.co/urchade/gliner_multi)

209M  
209M

English  
Multilingual

cc-by-nc-4.0

v1

[urchade/gliner\_small-v1](https://huggingface.co/urchade/gliner_small-v1)  
[urchade/gliner\_medium-v1](https://huggingface.co/urchade/gliner_medium-v1)  
[urchade/gliner\_large-v1](https://huggingface.co/urchade/gliner_large-v1)

166M  
209M  
459M

English  
English  
English

cc-by-nc-4.0

v2

[urchade/gliner\_small-v2](https://huggingface.co/urchade/gliner_small-v2)  
[urchade/gliner\_medium-v2](https://huggingface.co/urchade/gliner_medium-v2)  
[urchade/gliner\_large-v2](https://huggingface.co/urchade/gliner_large-v2)

166M  
209M  
459M

English  
English  
English

apache-2.0

v2.1

[urchade/gliner\_small-v2.1](https://huggingface.co/urchade/gliner_small-v2.1)  
[urchade/gliner\_medium-v2.1](https://huggingface.co/urchade/gliner_medium-v2.1)  
[urchade/gliner\_large-v2.1](https://huggingface.co/urchade/gliner_large-v2.1)  
[urchade/gliner\_multi-v2.1](https://huggingface.co/urchade/gliner_multi-v2.1)

166M  
209M  
459M  
209M

English  
English  
English  
Multilingual

apache-2.0

[](#installation)Installation
-----------------------------

To use this model, you must install the GLiNER Python library:

    !pip install gliner
    

[](#usage)Usage
---------------

Once you've downloaded the GLiNER library, you can import the GLiNER class. You can then load this model using `GLiNER.from_pretrained` and predict entities with `predict_entities`.

    from gliner import GLiNER
    
    model = GLiNER.from_pretrained("urchade/gliner_multi-v2.1")
    
    text = """
    Cristiano Ronaldo dos Santos Aveiro (Portuguese pronunciation: [kitjnu naldu]; born 5 February 1985) is a Portuguese professional footballer who plays as a forward for and captains both Saudi Pro League club Al Nassr and the Portugal national team. Widely regarded as one of the greatest players of all time, Ronaldo has won five Ballon d'Or awards,[note 3] a record three UEFA Men's Player of the Year Awards, and four European Golden Shoes, the most by a European player. He has won 33 trophies in his career, including seven league titles, five UEFA Champions Leagues, the UEFA European Championship and the UEFA Nations League. Ronaldo holds the records for most appearances (183), goals (140) and assists (42) in the Champions League, goals in the European Championship (14), international goals (128) and international appearances (205). He is one of the few players to have made over 1,200 professional career appearances, the most by an outfield player, and has scored over 850 official senior career goals for club and country, making him the top goalscorer of all time.
    """
    
    labels = ["person", "award", "date", "competitions", "teams"]
    
    entities = model.predict_entities(text, labels)
    
    for entity in entities:
        print(entity["text"], "=>", entity["label"])
    

    Cristiano Ronaldo dos Santos Aveiro => person
    5 February 1985 => date
    Al Nassr => teams
    Portugal national team => teams
    Ballon d'Or => award
    UEFA Men's Player of the Year Awards => award
    European Golden Shoes => award
    UEFA Champions Leagues => competitions
    UEFA European Championship => competitions
    UEFA Nations League => competitions
    Champions League => competitions
    European Championship => competitions
    

[](#named-entity-recognition-benchmark-result)Named Entity Recognition benchmark result
---------------------------------------------------------------------------------------

[![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317233cc92fd6fee317e030/Y5f7tK8lonGqeeO6L6bVI.png)](https://cdn-uploads.huggingface.co/production/uploads/6317233cc92fd6fee317e030/Y5f7tK8lonGqeeO6L6bVI.png)

[](#model-authors)Model Authors
-------------------------------

The model authors are:

*   [Urchade Zaratiana](https://huggingface.co/urchade)
*   Nadi Tomeh
*   Pierre Holat
*   Thierry Charnois

[](#citation)Citation
---------------------

    @misc{zaratiana2023gliner,
          title={GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer}, 
          author={Urchade Zaratiana and Nadi Tomeh and Pierre Holat and Thierry Charnois},
          year={2023},
          eprint={2311.08526},
          archivePrefix={arXiv},
          primaryClass={cs.CL}
    }

## Model overview

The `gliner_multi-v2.1` model is a Named Entity Recognition (NER) model developed by urchade that can identify any entity type using a bidirectional transformer encoder (BERT-like). It provides a practical alternative to traditional NER models, which are limited to predefined entities, and Large Language Models (LLMs) that are costly and large for resource-constrained scenarios. The model is part of the [GLiNER](https://aimodels.fyi/models/huggingFace/glinermulti-urchade) family of NER models developed by urchade.

The `gliner_multi-v2.1` model is a multilingual version of the GLiNER model, trained on the Pile-NER dataset. Commercially licensed versions are also available, such as [gliner_small-v2.1](https://aimodels.fyi/models/huggingFace/glinersmall-v2-1-urchade), [gliner_medium-v2.1](https://aimodels.fyi/models/huggingFace/glinermedium-v2-1-urchade), and [gliner_large-v2.1](https://aimodels.fyi/models/huggingFace/glinerlarge-v2-1-urchade).

## Model inputs and outputs

### Inputs
- **Text**: The `gliner_multi-v2.1` model takes in text as input and can process multilingual text.

### Outputs
- **Entities**: The model outputs a list of entities identified in the input text, along with their corresponding entity types.

## Capabilities

The `gliner_multi-v2.1` model can identify a wide range of entity types, unlike traditional NER models that are limited to predefined entities. It can handle both English and multilingual text, making it a flexible choice for various natural language processing tasks.

## What can I use it for?

The `gliner_multi-v2.1` model can be used in a variety of applications that require named entity recognition, such as information extraction, content analysis, and knowledge graph construction. Its ability to handle multilingual text makes it particularly useful for global or international use cases.

## Things to try

You can try using the `gliner_multi-v2.1` model to extract entities from text in different languages and compare the results to traditional NER models. You can also experiment with different entity types and see how the model performs on your specific use case.

[](#model-card-for-gliner-base)Model Card for GLiNER-base
=========================================================

GLiNER is a Named Entity Recognition (NER) model capable of identifying any entity type using a bidirectional transformer encoder (BERT-like). It provides a practical alternative to traditional NER models, which are limited to predefined entities, and Large Language Models (LLMs) that, despite their flexibility, are costly and large for resource-constrained scenarios.

[](#links)Links
---------------

*   Paper: [https://arxiv.org/abs/2311.08526](https://arxiv.org/abs/2311.08526)
*   Repository: [https://github.com/urchade/GLiNER](https://github.com/urchade/GLiNER)

[](#available-models)Available models
-------------------------------------

Release

Model Name

\# of Parameters

Language

License

v0

[urchade/gliner\_base](https://huggingface.co/urchade/gliner_base)  
[urchade/gliner\_multi](https://huggingface.co/urchade/gliner_multi)

209M  
209M

English  
Multilingual

cc-by-nc-4.0

v1

[urchade/gliner\_small-v1](https://huggingface.co/urchade/gliner_small-v1)  
[urchade/gliner\_medium-v1](https://huggingface.co/urchade/gliner_medium-v1)  
[urchade/gliner\_large-v1](https://huggingface.co/urchade/gliner_large-v1)

166M  
209M  
459M

English  
English  
English

cc-by-nc-4.0

v2

[urchade/gliner\_small-v2](https://huggingface.co/urchade/gliner_small-v2)  
[urchade/gliner\_medium-v2](https://huggingface.co/urchade/gliner_medium-v2)  
[urchade/gliner\_large-v2](https://huggingface.co/urchade/gliner_large-v2)

166M  
209M  
459M

English  
English  
English

apache-2.0

v2.1

[urchade/gliner\_small-v2.1](https://huggingface.co/urchade/gliner_small-v2.1)  
[urchade/gliner\_medium-v2.1](https://huggingface.co/urchade/gliner_medium-v2.1)  
[urchade/gliner\_large-v2.1](https://huggingface.co/urchade/gliner_large-v2.1)  
[urchade/gliner\_multi-v2.1](https://huggingface.co/urchade/gliner_multi-v2.1)

166M  
209M  
459M  
209M

English  
English  
English  
Multilingual

apache-2.0

[](#installation)Installation
-----------------------------

To use this model, you must install the GLiNER Python library:

    !pip install gliner
    

[](#usage)Usage
---------------

Once you've downloaded the GLiNER library, you can import the GLiNER class. You can then load this model using `GLiNER.from_pretrained` and predict entities with `predict_entities`.

    from gliner import GLiNER
    
    model = GLiNER.from_pretrained("urchade/gliner_base")
    
    text = """
    Cristiano Ronaldo dos Santos Aveiro (Portuguese pronunciation: [kitjnu naldu]; born 5 February 1985) is a Portuguese professional footballer who plays as a forward for and captains both Saudi Pro League club Al Nassr and the Portugal national team. Widely regarded as one of the greatest players of all time, Ronaldo has won five Ballon d'Or awards,[note 3] a record three UEFA Men's Player of the Year Awards, and four European Golden Shoes, the most by a European player. He has won 33 trophies in his career, including seven league titles, five UEFA Champions Leagues, the UEFA European Championship and the UEFA Nations League. Ronaldo holds the records for most appearances (183), goals (140) and assists (42) in the Champions League, goals in the European Championship (14), international goals (128) and international appearances (205). He is one of the few players to have made over 1,200 professional career appearances, the most by an outfield player, and has scored over 850 official senior career goals for club and country, making him the top goalscorer of all time.
    """
    
    labels = ["person", "award", "date", "competitions", "teams"]
    
    entities = model.predict_entities(text, labels)
    
    for entity in entities:
        print(entity["text"], "=>", entity["label"])
    

    Cristiano Ronaldo dos Santos Aveiro => person
    5 February 1985 => date
    Al Nassr => teams
    Portugal national team => teams
    Ballon d'Or => award
    UEFA Men's Player of the Year Awards => award
    European Golden Shoes => award
    UEFA Champions Leagues => competitions
    UEFA European Championship => competitions
    UEFA Nations League => competitions
    Champions League => competitions
    European Championship => competitions
    

[](#named-entity-recognition-benchmark-result)Named Entity Recognition benchmark result
---------------------------------------------------------------------------------------

[![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317233cc92fd6fee317e030/Y5f7tK8lonGqeeO6L6bVI.png)](https://cdn-uploads.huggingface.co/production/uploads/6317233cc92fd6fee317e030/Y5f7tK8lonGqeeO6L6bVI.png)

[](#model-authors)Model Authors
-------------------------------

The model authors are:

*   [Urchade Zaratiana](https://huggingface.co/urchade)
*   Nadi Tomeh
*   Pierre Holat
*   Thierry Charnois

[](#citation)Citation
---------------------

    @misc{zaratiana2023gliner,
          title={GLiNER: Generalist Model for Named Entity Recognition using Bidirectional Transformer}, 
          author={Urchade Zaratiana and Nadi Tomeh and Pierre Holat and Thierry Charnois},
          year={2023},
          eprint={2311.08526},
          archivePrefix={arXiv},
          primaryClass={cs.CL}
    }

## Model Overview

The `gliner_base` model is a Named Entity Recognition (NER) model developed by Urchade Zaratiana. It is capable of identifying any entity type using a bidirectional transformer encoder, providing a practical alternative to traditional NER models with predefined entities or large language models (LLMs) that can be costly and large for resource-constrained scenarios. The [GLiNER-multi](https://aimodels.fyi/models/huggingFace/glinermulti-urchade) model is a similar version trained on the Pile-NER dataset for research purposes, while commercially licensed versions are also available.

The `gliner_base` model was trained on the CoNLL-2003 Named Entity Recognition dataset, which contains 14,987 training examples and distinguishes between the beginning and continuation of entities. It can identify four types of entities: location (LOC), organization (ORG), person (PER), and miscellaneous (MISC). In terms of performance, the model achieves an F1 score of 91.7 on the test set.

## Model Inputs and Outputs

### Inputs
- Plain text to be analyzed for named entities

### Outputs
- A list of identified entities, including the entity text, entity type, and position in the input text

## Capabilities

The `gliner_base` model can be used to perform Named Entity Recognition (NER) on natural language text. It is capable of identifying a wide range of entity types, going beyond the traditional predefined set of entities. This flexibility makes it a practical alternative to traditional NER models or large language models that can be costly and unwieldy.

## What Can I Use It For?

The `gliner_base` model can be useful in a variety of applications that require named entity extraction, such as information extraction, data mining, content analysis, and knowledge graph construction. For example, you could use it to automatically extract entities like people, organizations, locations, and miscellaneous information from text documents, news articles, or social media posts. This information could then be used to power search, recommendation, or analytics systems.

## Things to Try

One interesting thing to try with the `gliner_base` model is to compare its performance on different types of text. Since it was trained on news articles, it may perform better on formal, journalistic text than on more conversational or domain-specific language. You could experiment with applying the model to different genres or domains and analyze the results to better understand its strengths and limitations.

Another idea is to use the model as part of a larger NLP pipeline, combining it with other models or components to tackle more complex text understanding tasks. For example, you could use the `gliner_base` model to extract entities, then use a relation extraction model to identify the relationships between those entities, or a sentiment analysis model to understand the overall sentiment expressed in the text.