[](#dolphin_orca_platypus_llama_70b)Dolphin\_ORCA\_PlatyPus\_LLaMA\_70b
=======================================================================

### [](#dataset)Dataset

Here is the list of datasets used:

*   Dolphin
*   Open-Platypus
*   OpenOrca

**mixed strategy: 100%Open-Platypus + ~1%Dolphin(GPT-4) + ~1%OpenOrca(GPT-4)**  

**Model Finetuned By fangloveskari.**

  

### [](#training-framework-and-parameters)Training FrameWork and Parameters

#### [](#framework)FrameWork

[https://github.com/hiyouga/LLaMA-Efficient-Tuning](https://github.com/hiyouga/LLaMA-Efficient-Tuning) We add flash\_attention\_2 and ORCA dataset support, with some minor modifications.

  

#### [](#parameters)Parameters

We list some training parameters here:

Parameter

Value

Finetune\_Type

QLoRA(NF4)

LoRA\_Rank

16

LoRA\_Alpha

16

Batch\_Size

14

GPUs

8xA100(80G)

LR\_Scheduler

cosine

LR

3e-4

Epoch

1

DeepSpeed

ZERO-2

  

### [](#model-export)Model Export

We tried two methods to fuse the adapter back to the base model:

*   [https://github.com/hiyouga/LLaMA-Efficient-Tuning/blob/main/src/export\_model.py](https://github.com/hiyouga/LLaMA-Efficient-Tuning/blob/main/src/export_model.py)
*   [https://github.com/jondurbin/qlora/blob/main/qmerge.py](https://github.com/jondurbin/qlora/blob/main/qmerge.py)

Generally, the second will get better ARC(+0.15) and Truthful\_QA(+0.3) scores but the other two(MMLU(-0.2) and HelloSwag(-0.2)) seems to degenerate (Just for my model).

  

### [](#evaluation)Evaluation

Metric

Value

ARC (25-shot)

72.27

HellaSwag (10-shot)

87.74

MMLU (5-shot)

70.23

TruthfulQA (0-shot)

63.37

Avg.

73.40

  

### [](#license-disclaimer)license disclaimer:

This model is bound by the license & usage restrictions of the original Llama-2 model. And comes with no warranty or gurantees of any kind.

  

### [](#limitations--biases)Limitations & Biases:

Llama 2 and fine-tuned variants are a new technology that carries risks with use. Testing conducted to date has been in English, and has not covered, nor could it cover all scenarios. For these reasons, as with all LLMs, Llama 2 and any fine-tuned varient's potential outputs cannot be predicted in advance, and the model may in some instances produce inaccurate, biased or other objectionable responses to user prompts. Therefore, before deploying any applications of Llama 2 variants, developers should perform safety testing and tuning tailored to their specific applications of the model.

Please see the Responsible Use Guide available at [https://ai.meta.com/llama/responsible-use-guide/](https://ai.meta.com/llama/responsible-use-guide/)

  

### [](#citiation)Citiation:

Please kindly cite using the following BibTeX:

    @article{platypus2023,
        title={Platypus: Quick, Cheap, and Powerful Refinement of LLMs}, 
        author={Ariel N. Lee and Cole J. Hunter and Nataniel Ruiz},
        booktitle={arXiv preprint arxiv:2308.07317},
        year={2023}
    }
    

    @misc{mukherjee2023orca,
          title={Orca: Progressive Learning from Complex Explanation Traces of GPT-4}, 
          author={Subhabrata Mukherjee and Arindam Mitra and Ganesh Jawahar and Sahaj Agarwal and Hamid Palangi and Ahmed Awadallah},
          year={2023},
          eprint={2306.02707},
          archivePrefix={arXiv},
          primaryClass={cs.CL}
    }
    

    @software{touvron2023llama2,
      title={Llama 2: Open Foundation and Fine-Tuned Chat Models},
      author={Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava,
     Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller,
    Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez Madian Khabsa, Isabel Kloumann,
    Artem Korenev, Punit Singh Koura, Marie-Anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov,
    Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith,
    Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian Xiang Kuan, Puxin Xu , Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan,
    Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom},
      year={2023}
    }

## Model overview

The `ORCA_LLaMA_70B_QLoRA` model is a powerful AI language model created by fangloveskari, a maintainer on Hugging Face. This model is a variant of the LLaMA-70B model, which has been fine-tuned using a Quantized Low-Rank Adaptation (QLoRA) technique. The model was trained on a mixed dataset that combines the [Open-Platypus](https://github.com/garage-bAInd/Open-Platypus) dataset (~1% Dolphin and ~1% OpenOrca) with a significant amount of the Open-Platypus dataset.

The `ORCA_LLaMA_70B_QLoRA` model shares similarities with other LLaMA-based models, such as [orca_mini_13b](https://aimodels.fyi/models/huggingFace/orcamini13b-pankajmathur) and [dolphin-llama-13b](https://aimodels.fyi/models/huggingFace/dolphin-llama-13b-cognitivecomputations), which also leverage the Orca dataset and techniques. However, this model stands out with its larger 70B parameter size and the use of QLoRA fine-tuning, which aims to improve efficiency and performance.

## Model inputs and outputs

### Inputs
- **Text prompts**: The model can accept various text-based prompts, ranging from simple instructions to more complex queries or narratives.

### Outputs
- **Generated text**: The model outputs generated text that responds to the input prompt. The generated text can be used for a variety of tasks, such as answering questions, generating stories, or providing explanations.

## Capabilities

The `ORCA_LLaMA_70B_QLoRA` model is a versatile and powerful language model that can be used for a wide range of text-to-text tasks. Its large size and specialized training on the Orca dataset give it strong capabilities in areas like logical reasoning, task-oriented dialogue, and open-ended question answering. The model has demonstrated impressive performance on benchmark tasks like the AI2 Reasoning Challenge, HellaSwag, and TruthfulQA.

## What can I use it for?

The `ORCA_LLaMA_70B_QLoRA` model can be useful for a variety of applications, including:

- **Question answering**: The model can be used to answer a wide range of questions, from factual queries to more open-ended, exploratory questions.
- **Dialogue and conversational AI**: The model's capabilities in task-oriented dialogue and its ability to engage in natural conversations make it a strong candidate for building conversational AI assistants.
- **Content generation**: The model can be used to generate creative and informative content, such as stories, articles, or reports.
- **Research and analysis**: Researchers and analysts can leverage the model's strong reasoning and inference capabilities to help with tasks like scientific analysis, policy research, or market insights.

## Things to try

One interesting aspect of the `ORCA_LLaMA_70B_QLoRA` model is its potential to serve as a strong foundation for further fine-tuning and customization. Given its large parameter size and specialized training on the Orca dataset, users could explore fine-tuning the model on their own datasets or for their specific use cases, potentially unlocking even more impressive capabilities. Additionally, the use of QLoRA fine-tuning opens up opportunities to explore ways of making the model more efficient and cost-effective to deploy, without sacrificing too much in terms of performance.