ner-english-ontonotes-large

Maintainer: flair

Total Score: 91

Last updated 5/28/2024


  • Model Link: View on HuggingFace
  • API Spec: View on HuggingFace
  • Github Link: No Github link provided
  • Paper Link: No paper link provided


Model overview

The ner-english-ontonotes-large model is a large 18-class Named Entity Recognition (NER) model for English that ships with the Flair NLP library. It is based on document-level XLM-R embeddings and the FLERT approach, and achieves an impressive F1-score of 90.93 on the OntoNotes dataset. This model can recognize 18 different entity types, including cardinal values, dates, events, facilities, geopolitical entities, languages, laws, locations, money, numeric values, organizations, people, products, and more.

The model was developed and trained by the Flair team. It can be used as a drop-in component for a variety of NLP tasks that require named entity recognition, and is a strong alternative to other popular English NER models like bert-large-NER and roberta-large-ner-english.

Model inputs and outputs

Inputs

  • Plain text: The model takes in raw text as input and performs NER on the entire document or sentence.

Outputs

  • Named entity spans: The model outputs a list of entity spans, each with a predicted entity type (e.g. PERSON, ORG, GPE, DATE) and a confidence score.
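
As a concrete sketch of this input/output contract, the snippet below uses the standard Flair API; the model identifier matches the model's HuggingFace hub name, and the example sentence is purely illustrative:

    from flair.data import Sentence
    from flair.models import SequenceTagger

    # load the 18-class OntoNotes tagger from the HuggingFace model hub
    tagger = SequenceTagger.load("flair/ner-english-ontonotes-large")

    # wrap raw text in a Sentence object
    sentence = Sentence("On September 1st George won 1 dollar while watching Game of Thrones.")

    # predict NER tags; entity spans are attached to the sentence in place
    tagger.predict(sentence)

    # each span carries a label (e.g. PERSON, DATE, MONEY) and a confidence score
    for entity in sentence.get_spans("ner"):
        print(entity)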

Capabilities

The ner-english-ontonotes-large model excels at accurately identifying a wide range of entity types in English text. It can be used to extract valuable information from documents, social media posts, news articles, and other textual data. For example, you could use it to build applications that automatically catalog the people, places, and organizations mentioned in a corpus of legal documents, or to power a chatbot that can understand and respond to queries about current events by recognizing the relevant entities.

What can I use it for?

This Flair NER model is a powerful tool for a variety of NLP applications that require named entity extraction, such as:

  • Information Extraction: Automatically identify and extract key entities (people, organizations, locations, etc.) from large text corpora (see the sketch after this list).

  • Question Answering: Use the recognized entities to help answer questions about who, what, where, and when in a given text.

  • Knowledge Graph Construction: Build knowledge graphs by linking the extracted entities and their relationships.

  • Sentiment Analysis: Combine entity recognition with sentiment analysis to understand how different entities are being discussed.

  • Chatbots and Conversational AI: Enable chatbots to understand and respond to user queries that reference specific entities.

The model's broad coverage of entity types and high accuracy make it a versatile tool that can be integrated into a wide range of NLP applications. By leveraging the Flair library, developers can easily incorporate this model into their projects.
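
To make the information-extraction use case concrete, here is a minimal sketch that tallies entity types across a small corpus; the two example documents are stand-ins for your own data:

    from collections import Counter

    from flair.data import Sentence
    from flair.models import SequenceTagger

    tagger = SequenceTagger.load("flair/ner-english-ontonotes-large")

    # hypothetical mini-corpus; substitute your own documents
    corpus = [
        "Apple opened a new office in Berlin in March 2023.",
        "The court cited the Clean Air Act in its ruling on Tuesday.",
    ]

    counts = Counter()
    for text in corpus:
        sentence = Sentence(text)
        tagger.predict(sentence)
        for span in sentence.get_spans("ner"):
            counts[span.get_label("ner").value] += 1

    print(counts)  # e.g. Counter({'DATE': 2, 'ORG': 1, 'GPE': 1, 'LAW': 1})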

Things to try

One interesting aspect of the ner-english-ontonotes-large model is its ability to recognize a diverse set of entity types, beyond just the common ones like people, organizations, and locations. For example, it can identify more specialized entities like laws, works of art, and languages.

You could try experimenting with the model on different types of text data to see how it performs. For instance, you might use it to extract entities from legal documents, news articles, or even fictional stories to explore how its capabilities vary across domains. Additionally, you could investigate how the model handles ambiguous or context-dependent entity references, and whether you need to perform any post-processing of the output to improve its accuracy for your specific use case.
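
For instance, one simple post-processing pass is to drop low-confidence spans. The sketch below assumes a sentence that has already been tagged as in the earlier examples; the 0.9 threshold is purely illustrative and should be tuned for your data:

    # keep only spans whose confidence clears an illustrative threshold
    confident_spans = [
        span for span in sentence.get_spans("ner")
        if span.get_label("ner").score >= 0.9
    ]
    for span in confident_spans:
        print(span.text, span.get_label("ner").value, span.get_label("ner").score)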

Overall, this Flair NER model provides a robust and versatile entity extraction solution that can be a valuable addition to a wide range of NLP projects.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Models

bert-large-NER

Maintainer: dslim

Total Score: 127

bert-large-NER is a fine-tuned BERT model that is ready to use for Named Entity Recognition and achieves state-of-the-art performance for the NER task. It has been trained to recognize four types of entities: location (LOC), organizations (ORG), person (PER) and miscellaneous (MISC). Specifically, this model is a bert-large-cased model that was fine-tuned on the English version of the standard CoNLL-2003 Named Entity Recognition dataset. If you'd like to use a smaller BERT model fine-tuned on the same dataset, a bert-base-NER version is also available from the same maintainer, dslim.

Model inputs and outputs

Inputs

  • A text sequence to analyze for named entities

Outputs

  • A list of recognized entities, their type (LOC, ORG, PER, MISC), and their position in the input text

Capabilities

bert-large-NER can accurately identify and classify named entities in English text, such as people, organizations, locations, and miscellaneous entities. It outperforms previous state-of-the-art models on the CoNLL-2003 NER benchmark.

What can I use it for?

You can use bert-large-NER for a variety of applications that involve named entity recognition, such as:

  • Information extraction from text documents
  • Knowledge base population by identifying key entities
  • Chatbots and virtual assistants to understand user queries
  • Content analysis and categorization

The high performance of this model makes it a great starting point for building NER-based applications.

Things to try

One interesting thing to try with bert-large-NER is analyzing text from different domains beyond news articles, which was the primary focus of the CoNLL-2003 dataset. The model may perform differently on text from social media, scientific publications, or other genres. Experimenting with fine-tuning or ensembling the model for specialized domains could lead to further performance improvements.
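
To see how bert-large-NER is typically invoked, here is a minimal sketch using the Hugging Face transformers token-classification pipeline; the dslim/bert-large-NER model id matches the hub name above, and the example sentence is illustrative:

    from transformers import AutoTokenizer, AutoModelForTokenClassification, pipeline

    tokenizer = AutoTokenizer.from_pretrained("dslim/bert-large-NER")
    model = AutoModelForTokenClassification.from_pretrained("dslim/bert-large-NER")

    # the pipeline returns entity type, confidence score, and character offsets
    nlp = pipeline("ner", model=model, tokenizer=tokenizer)
    print(nlp("My name is Wolfgang and I live in Berlin"))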


roberta-large-ner-english

Maintainer: Jean-Baptiste

Total Score: 66

roberta-large-ner-english is an English named entity recognition (NER) model that was fine-tuned from the RoBERTa large model on the CoNLL-2003 dataset. The model was developed by Jean-Baptiste and is capable of identifying entities such as persons, organizations, locations, and miscellaneous entities. It was validated on emails and chat data, and outperforms other models on this type of data, particularly for entities that do not start with an uppercase letter.

Model inputs and outputs

Inputs

  • Raw text to be processed for named entity recognition

Outputs

  • A list of identified entities, with the entity type (PER, ORG, LOC, MISC), the start and end positions in the input text, the text of the entity, and the confidence score

Capabilities

The roberta-large-ner-english model can accurately identify a variety of named entities in English text, including people, organizations, locations, and miscellaneous entities. It has been shown to perform particularly well on informal text like emails and chat messages, where entities may not always start with an uppercase letter.

What can I use it for?

You can use the roberta-large-ner-english model for a variety of natural language processing tasks that require named entity recognition, such as information extraction, question answering, and content analysis. For example, you could use it to automatically extract the key people, organizations, and locations mentioned in a set of business documents or news articles.

Things to try

One interesting thing to try with the roberta-large-ner-english model is to see how it performs on your own custom text data, especially if it is in a more informal or conversational style. You could also experiment with combining the model's output with other natural language processing techniques, such as relation extraction or sentiment analysis, to gain deeper insights from your text data.
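
Since this model is often run on informal, lowercased text, a sketch of a typical call is shown below; aggregation_strategy="simple" merges sub-word tokens into whole entity spans with offsets and scores, and the example sentence is illustrative:

    from transformers import pipeline

    # merge word pieces into complete entity spans with offsets and scores
    nlp = pipeline(
        "ner",
        model="Jean-Baptiste/roberta-large-ner-english",
        aggregation_strategy="simple",
    )

    # lowercase entity mentions are the kind of input this model was validated on
    print(nlp("apple just opened a new store in paris"))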


bert-base-NER

Maintainer: dslim

Total Score: 415

The bert-base-NER model is a fine-tuned BERT model that is ready to use for Named Entity Recognition (NER) and achieves state-of-the-art performance for the NER task. It has been trained to recognize four types of entities: location (LOC), organizations (ORG), person (PER) and miscellaneous (MISC). Specifically, this model is a bert-base-cased model that was fine-tuned on the English version of the standard CoNLL-2003 Named Entity Recognition dataset. If you'd like to use a larger BERT-large model fine-tuned on the same dataset, a bert-large-NER version is also available. The maintainer, dslim, has also provided several other NER models, including distilbert-NER, bert-large-NER, and both cased and uncased versions of bert-base-NER.

Model inputs and outputs

Inputs

  • Text: The model takes a text sequence as input and predicts the named entities within that text.

Outputs

  • Named entities: The model outputs the recognized named entities, along with their type (LOC, ORG, PER, MISC) and the start/end position within the input text.

Capabilities

The bert-base-NER model is capable of accurately identifying a variety of named entities within text, including locations, organizations, persons, and miscellaneous entities. This can be useful for applications such as information extraction, content analysis, and knowledge graph construction.

What can I use it for?

The bert-base-NER model can be used for a variety of text processing tasks that involve identifying and extracting named entities. For example, you could use it to build a search engine that allows users to find information about specific people, organizations, or locations mentioned in a large corpus of text. You could also use it to automatically extract key entities from customer service logs or social media posts, which could be valuable for market research or customer sentiment analysis.

Things to try

One interesting thing to try with the bert-base-NER model is to experiment with incorporating it into a larger natural language processing pipeline. For example, you could use it to first identify the named entities in a piece of text, and then use a different model to classify the sentiment or topic of the text, focusing on the identified entities. This could lead to more accurate and nuanced text analysis.

Another idea is to fine-tune the model further on a domain-specific dataset, which could help it perform better on specialized text. For instance, if you're working with legal documents, you could fine-tune the model on a corpus of legal text to improve its ability to recognize legal entities and terminology.
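
As a sketch of the pipeline idea described above, the snippet below pairs bert-base-NER with an off-the-shelf sentiment pipeline; the default sentiment model and the naive document-level pairing are illustrative assumptions, not part of this model:

    from transformers import pipeline

    ner = pipeline("ner", model="dslim/bert-base-NER", aggregation_strategy="simple")
    sentiment = pipeline("sentiment-analysis")  # default model, for illustration

    text = "The service at Acme Corp was fantastic, but the Berlin office was slow."
    entities = ner(text)
    doc_sentiment = sentiment(text)[0]

    # naive pairing: attach the document-level sentiment to each extracted entity
    for ent in entities:
        print(ent["word"], ent["entity_group"], doc_sentiment["label"])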


roberta-large

Maintainer: FacebookAI

Total Score: 164

The roberta-large model is a large-sized Transformers model pre-trained by FacebookAI on a large corpus of English data using a masked language modeling (MLM) objective. It is a case-sensitive model, meaning it can distinguish between words like "english" and "English". RoBERTa builds on the BERT architecture with an improved pretraining procedure, and in turn serves as the basis for multilingual variants such as XLM-RoBERTa; it provides strong performance on a variety of natural language processing tasks.

Model inputs and outputs

Inputs

  • Raw text, which the model expects to be preprocessed into a sequence of tokens

Outputs

  • Contextual embeddings for each token in the input sequence
  • Predictions for masked tokens in the input

Capabilities

The roberta-large model excels at tasks that require understanding the overall meaning and context of a piece of text, such as sequence classification, token classification, and question answering. It can capture bidirectional relationships between words, allowing it to make more accurate predictions compared to models that process text sequentially.

What can I use it for?

You can use the roberta-large model to build a wide range of natural language processing applications, such as text classification, named entity recognition, and question-answering systems. The model's strong performance on a variety of benchmarks makes it a great starting point for fine-tuning on domain-specific datasets.

Things to try

One interesting aspect of the roberta-large model is its ability to handle case-sensitivity, which can be useful for tasks that require distinguishing between proper nouns and common nouns. You could experiment with using the model for tasks like named entity recognition or sentiment analysis, where case information can be an important signal.
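
Because roberta-large is a masked language model rather than a task-specific model, the quickest way to probe it is the fill-mask pipeline; a minimal sketch, noting that RoBERTa uses <mask> rather than BERT's [MASK] token:

    from transformers import pipeline

    # RoBERTa's mask token is <mask>, not [MASK]
    unmasker = pipeline("fill-mask", model="roberta-large")
    for prediction in unmasker("The capital of France is <mask>."):
        print(prediction["token_str"], round(prediction["score"], 4))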
