![Refuel.ai](https://assets-global.website-files.com/6423879a8f63c1bb18d74bfa/648818d56d04c3bdf36d71ab_Refuel_rev8-01_ts-p-1600.png)

[](#model-details)Model Details
-------------------------------

RefuelLLM-2-small, aka Llama-3-Refueled, is a Llama3-8B base model instruction tuned on a corpus of 2750+ datasets, spanning tasks such as classification, reading comprehension, structured attribute extraction and entity resolution. We're excited to open-source the model for the community to build on top of.

*   More details about [RefuelLLM-2 family of models](https://www.refuel.ai/blog-posts/announcing-refuel-llm-2)
*   You can also try out the models in our [LLM playground](https://labs.refuel.ai/playground)

**Model developers** - Refuel AI

**Input** - Text only.

**Output** - Text only.

**Architecture** - Llama-3-Refueled is built on top of Llama-3-8B-instruct which is an auto-regressive language model that uses an optimized transformer architecture.

**Release Date** - May 8, 2024.

**License** - [CC BY-NC 4.0](https://creativecommons.org/licenses/by-nc/4.0/deed.en)

[](#how-to-use)How to use
-------------------------

This repository contains weights for Llama-3-Refueled that are compatible for use with HuggingFace. See the snippet below for usage with Transformers:

    >>> import torch
    >>> from transformers import AutoModelForCausalLM, AutoTokenizer
    
    >>> model_id = "refuelai/Llama-3-Refueled"
    >>> tokenizer = AutoTokenizer.from_pretrained(model_id)
    >>> model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16, device_map="auto")
    
    >>> messages = [{"role": "user", "content": "Is this comment toxic or non-toxic: RefuelLLM is the new way to label text data!"}]
    
    >>> inputs = tokenizer.apply_chat_template(messages, return_tensors="pt", add_generation_prompt=True).to("cuda")
    
    >>> outputs = model.generate(inputs, max_new_tokens=20)
    >>> print(tokenizer.decode(outputs[0]))
    

[](#training-data)Training Data
-------------------------------

The model was both trained on over 4 Billion tokens, spanning 2750+ NLP tasks. Our training collection consists majorly of:

1.  Human annotated datasets like Flan, Task Source, and the Aya collection
2.  Synthetic datasets like OpenOrca, OpenHermes and WizardLM
3.  Proprietary datasets developed or licensed by Refuel AI

[](#benchmarks)Benchmarks
-------------------------

In this section, we report the results for Refuel models on our benchmark of labeling tasks. For details on the methodology see [here](https://refuel.ai/blog-posts/announcing-refuel-llm-2).

Provider

Model

LLM Output Quality (by task type)

Overall

Classification

Reading Comprehension

Structure Extraction

Entity Matching

Refuel

RefuelLLM-2

83.82%

84.94%

76.03%

88.16%

92.00%

OpenAI

GPT-4-Turbo

80.88%

81.77%

72.08%

84.79%

97.20%

Refuel

RefuelLLM-2-small (Llama-3-Refueled)

79.67%

81.72%

70.04%

84.28%

92.00%

Anthropic

Claude-3-Opus

79.19%

82.49%

67.30%

88.25%

94.96%

Meta

Llama3-70B-Instruct

78.20%

79.38%

66.03%

85.96%

94.13%

Google

Gemini-1.5-Pro

74.59%

73.52%

60.67%

84.27%

98.48%

Mistral

Mixtral-8x7B-Instruct

62.87%

79.11%

45.56%

47.08%

86.52%

Anthropic

Claude-3-Sonnet

70.99%

79.91%

45.44%

78.10%

96.34%

Anthropic

Claude-3-Haiku

69.23%

77.27%

50.19%

84.97%

54.08%

OpenAI

GPT-3.5-Turbo

68.13%

74.39%

53.21%

69.40%

80.41%

Meta

Llama3-8B-Instruct

62.30%

68.52%

49.16%

65.09%

63.61%

[](#limitations)Limitations
---------------------------

The Llama-3-Refueled does not have any moderation mechanisms. We're looking forward to engaging with the community on ways to make the model finely respect guardrails, allowing for deployment in environments requiring moderated outputs.

## Model overview

`Llama-3-Refueled` is an instruction-tuned Llama 3-8B base model developed by [Refuel AI](https://aimodels.fyi/creators/huggingFace/refuelai). The model was trained on over 2,750 datasets spanning tasks such as classification, reading comprehension, structured attribute extraction, and entity resolution. It builds on the Llama 3 family of models, which are a collection of pretrained and instruction-tuned generative text models in 8B and 70B sizes developed by Meta. The Llama 3-Refueled model aims to provide a strong foundation for NLP applications that require robust text generation and understanding capabilities.

## Model inputs and outputs

### Inputs
- **Text only**: The model takes text as input.

### Outputs
- **Text only**: The model generates text as output.

## Capabilities

`Llama-3-Refueled` is a capable text-to-text model that can be used for a variety of natural language processing tasks. It has demonstrated strong performance on benchmarks covering classification, reading comprehension, and structured data extraction. Compared to the base Llama 3-8B model, the Refueled version shows improved performance, particularly on instruction-following tasks.

## What can I use it for?

The `Llama-3-Refueled` model can be a valuable foundation for building NLP applications that require robust language understanding and generation capabilities. Some potential use cases include:

- **Text classification**: Classifying the sentiment, topic, or intent of text input.
- **Question answering**: Answering questions based on given text passages.
- **Named entity recognition**: Identifying and extracting key entities from text.
- **Text summarization**: Generating concise summaries of longer text inputs.

By leveraging the capabilities of the `Llama-3-Refueled` model, developers can accelerate the development of these types of NLP applications and benefit from the model's strong performance on a wide range of tasks.

## Things to try

One interesting aspect of the `Llama-3-Refueled` model is its ability to handle open-ended, freeform instructions. Developers can experiment with prompting the model to perform various tasks, such as generating creative writing, providing step-by-step instructions, or engaging in open-ended dialogue. The model's flexibility and robustness make it a promising foundation for building advanced language-based applications.