[](#deberta-med-ner-2)deberta-med-ner-2
=======================================

This model is a fine-tuned version of [DeBERTa](https://huggingface.co/microsoft/deberta-v3-base) on the PubMED Dataset.

[](#model-description)Model description
---------------------------------------

Medical NER Model finetuned on BERT to recognize 41 Medical entities.

### [](#training-hyperparameters)Training hyperparameters

The following hyperparameters were used during training:

*   learning\_rate: 2e-05
*   train\_batch\_size: 8
*   eval\_batch\_size: 16
*   seed: 42
*   gradient\_accumulation\_steps: 2
*   total\_train\_batch\_size: 16
*   optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
*   lr\_scheduler\_type: cosine
*   lr\_scheduler\_warmup\_ratio: 0.1
*   num\_epochs: 30
*   mixed\_precision\_training: Native AMP

[](#usage)Usage
---------------

The easiest way is to load the inference api from huggingface and second method is through the pipeline object offered by transformers library.

    # Use a pipeline as a high-level helper
    from transformers import pipeline
    pipe = pipeline("token-classification", model="Clinical-AI-Apollo/Medical-NER", aggregation_strategy='simple')
    result = pipe('45 year old woman diagnosed with CAD')
    
    
    
    # Load model directly
    from transformers import AutoTokenizer, AutoModelForTokenClassification
    
    tokenizer = AutoTokenizer.from_pretrained("Clinical-AI-Apollo/Medical-NER")
    model = AutoModelForTokenClassification.from_pretrained("Clinical-AI-Apollo/Medical-NER")
    

### [](#author)Author

Author: [Saketh Mattupalli](https://huggingface.co/blaze999)

### [](#framework-versions)Framework versions

*   Transformers 4.37.0
*   Pytorch 2.1.2
*   Datasets 2.1.0
*   Tokenizers 0.15.1

## Model overview

`Medical-NER` is a fine-tuned version of the [DeBERTa](https://huggingface.co/microsoft/deberta-v3-base) model developed by the [Clinical-AI-Apollo](https://aimodels.fyi/creators/huggingFace/Clinical-AI-Apollo) team. This model was trained on the PubMed dataset to recognize 41 medical entities, making it a specialized tool for natural language processing tasks in the healthcare and biomedical domains.

## Model inputs and outputs

### Inputs
- Text data, such as clinical notes, research papers, or other biomedical literature

### Outputs
- Identified named entities within the input text, categorized into 41 different medical classes, including diseases, symptoms, medications, and more.

## Capabilities

The `Medical-NER` model excels at extracting relevant medical concepts and entities from unstructured text. This can be particularly useful for tasks like clinical information retrieval, adverse event monitoring, and knowledge extraction from large biomedical corpora. By leveraging the model's specialized training on medical data, users can achieve more accurate and reliable results compared to general-purpose NER models.

## What can I use it for?

The `Medical-NER` model can be utilized in a variety of healthcare and biomedical applications. For example, it could be integrated into clinical decision support systems to automatically identify key medical information from patient records, or used to extract relevant entities from research literature to aid in systematic reviews and meta-analyses. The model's capabilities can also be valuable for pharmaceutical companies monitoring drug safety, or for public health organizations tracking disease outbreaks and trends.

## Things to try

One interesting aspect of the `Medical-NER` model is its ability to recognize a wide range of specialized medical terminology. Users might experiment with feeding the model complex, domain-specific text, such as clinical trial protocols or grant proposals, to see how it performs at identifying relevant concepts and entities. Additionally, the model could be fine-tuned on more targeted datasets or combined with other NLP techniques, such as relation extraction, to unlock even more advanced biomedical text processing capabilities.