Model Card for una-cybertron-7b-v2-bf16 (UNA: Uniform Neural Alignment)
=======================================================================

We strike back, introducing **Cybertron 7B v2**, a 7B MistralAI-based model and the best of its series. It was trained with SFT, DPO and UNA (Uniform Neural Alignment) on multiple datasets. It scores [EXACTLY](https://huggingface.co/datasets/open-llm-leaderboard/details_fblgit__una-cybertron-7b-v2-bf16) **#1** among 7B models with a **69.67**+ score on the HF Open LLM Leaderboard, and **#8** across ALL SIZES.

*   v1 scored **#1** on 2 December 2023 with 69.43. A few models were released since, but only one can survive: CYBERTRON!
*   v2 scored **#1** on 5 December 2023 with 69.67.

| Model | Average | ARC (25-s) | HellaSwag (10-s) | MMLU (5-s) | TruthfulQA (MC) (0-s) | Winogrande (5-s) | GSM8K (5-s) |
| --- | --- | --- | --- | --- | --- | --- | --- |
| [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) | 60.97 | 59.98 | 83.31 | 64.16 | 42.15 | 78.37 | 37.83 |
| [Intel/neural-chat-7b-v3-2](https://huggingface.co/Intel/neural-chat-7b-v3-2) | 68.29 | 67.49 | 83.92 | 63.55 | 59.68 | 79.95 | 55.12 |
| [perlthoughts/Chupacabra-7B-v2](https://huggingface.co/perlthoughts/Chupacabra-7B-v2) | 63.54 | 66.47 | 85.17 | 64.49 | 57.6 | 79.16 | 28.35 |
| [fblgit/una-cybertron-7b-v1-fp16](https://huggingface.co/fblgit/una-cybertron-7b-v1-fp16) | **69.49** | **68.43** | **85.85** | 63.34 | **63.28** | **80.90** | **55.12** |
| [fblgit/una-cybertron-7b-v2-bf16](https://huggingface.co/fblgit/una-cybertron-7b-v2-bf16) | **69.67** | **68.26** | **85.?4** | 63.23 | **64.63** | **81.37** | **55.04** |

The model excels at mathematics, logic and reasoning, and is very smart overall. It can reason deeply over the context and prompt, and gives the impression of not missing any details.

Model Details
-------------

Trained with the UNA (Uniform Neural Alignment) technique (paper coming soon).

*   What is **NOT** UNA? It's not a merged-layers model. It is not SLERP, SLURP or anything similar.
*   What **is** UNA? A formula and a technique to _TAME_ models.
*   When will the code and paper be released? When we have time; contribute and it'll be faster.

### Model Description

*   **Developed by:** [juanako.ai](https://juanako.ai)
*   **Author:** [Xavier M.](mailto:xavi@juanako.ai)
*   **Investors:** [CONTACT HERE](mailto:billing@juanako.ai)
*   **Model type:** MistralAI 7B
*   **Funded by:** Cybertron's H100s, with a few hours of training.

### Prompt

The model is very good and works well with almost any prompt, but the ChatML format and an Alpaca-style system prompt get the best results:

    <|im_start|>system
    - You are a helpful assistant chatbot trained by MosaicML.
    - You answer questions.
    - You are excited to be able to help the user, but will refuse to do anything that could be considered harmful to the user.
    - You are more than just an information source, you are also able to write poetry, short stories, and make jokes.<|im_end|>
    <|im_start|>user
    Explain QKV<|im_end|>
    <|im_start|>assistant
    

    ### Assistant: I am StableVicuna, a large language model created by CarperAI. I am here to chat!
    
    ### Human: Explain QKV
    ### Assistant:
    

    [Round <|round|>]
    Explain QKV
    
    

    [Round <|round|>]
    QuestionExplain QKV
    Answer
    

    QuestionExplain QKV
    Answer
    

When using the Exllamav2\_HF loader, set alpha=2.5 for a 16K context.

**Users also report that the exllamav2\_HF loader with an 8bpw-h8 exl2 quant and the simple-1 preset provides good results.**
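The ChatML layout shown above can also be assembled in code. The following is a minimal sketch, not the author's tooling: the helper name `build_chatml_prompt` and the short system message are hypothetical, and only the `<|im_start|>` / `<|im_end|>` markers and the `Explain QKV` question come from the examples in this card.

```python
def build_chatml_prompt(system: str, user: str) -> str:
    """Assemble a single-turn ChatML prompt that ends with an open assistant turn.

    Hypothetical helper for illustration; it mirrors the ChatML example in this card.
    """
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )


# The system message here is a placeholder, not a recommended prompt.
prompt = build_chatml_prompt("You are a helpful assistant.", "Explain QKV")
print(prompt)
```

The prompt deliberately stops right after the `assistant` marker, so the model's completion becomes the assistant's reply.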

### Framework versions

*   Transformers 4.35.0-UNA
*   Pytorch 2.1.0
*   Datasets 2.14.6
*   Tokenizers 0.14.1

### Citations

If you find Cybertron, Juanako or any of our models useful, especially if you use them for your big brand or you clone/merge my models, please cite:

    @misc{unacybertron7b,
      title={Cybertron: Uniform Neural Alignment}, 
      author={Xavier Murias},
      year={2023},
      publisher = {HuggingFace},
      journal = {HuggingFace repository},
      howpublished = {\url{https://huggingface.co/fblgit/una-cybertron-7b-v2-bf16}},
    }
    

Special thanks to @TheBloke & @bartowski for converting the models and their support to the community. Thank you!

Open LLM Leaderboard Evaluation Results
=======================================

Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_fblgit__una-cybertron-7b-v2-bf16)

| Metric | Value |
| --- | --- |
| Avg. | 69.67 |
| AI2 Reasoning Challenge (25-Shot) | 68.26 |
| HellaSwag (10-Shot) | 85.85 |
| MMLU (5-Shot) | 63.23 |
| TruthfulQA (0-shot) | 64.63 |
| Winogrande (5-shot) | 80.98 |
| GSM8k (5-shot) | 55.04 |
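As a quick sanity check, the `Avg.` value can be reproduced from the six benchmark scores listed above. This sketch assumes the leaderboard average is the unweighted mean of those six scores; the dictionary keys are just labels chosen for this example.

```python
# Benchmark scores copied from the evaluation results in this section.
scores = {
    "ARC (25-shot)": 68.26,
    "HellaSwag (10-shot)": 85.85,
    "MMLU (5-shot)": 63.23,
    "TruthfulQA (0-shot)": 64.63,
    "Winogrande (5-shot)": 80.98,
    "GSM8k (5-shot)": 55.04,
}

# Assumption: the reported average is the plain arithmetic mean of the six scores.
average = sum(scores.values()) / len(scores)
print(f"mean of {len(scores)} benchmarks: {average:.3f}")
```

The computed mean agrees with the reported 69.67 to within rounding.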

## Model overview

The `una-cybertron-7b-v2-bf16` model, developed by [juanako.ai](https://juanako.ai) and maintained by [fblgit](https://aimodels.fyi/creators/huggingFace/fblgit), is a 7-billion-parameter language model based on MistralAI 7B and trained with the UNA (Uniform Neural Alignment) technique. On 5 December 2023 it ranked #1 among 7B models on the Hugging Face Open LLM Leaderboard with an average score of 69.67. Similar models include [Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1), [Intel/neural-chat-7b-v3-2](https://huggingface.co/Intel/neural-chat-7b-v3-2), [perlthoughts/Chupacabra-7B-v2](https://huggingface.co/perlthoughts/Chupacabra-7B-v2), and [fblgit/una-cybertron-7b-v1-fp16](https://huggingface.co/fblgit/una-cybertron-7b-v1-fp16).

## Model inputs and outputs

The `una-cybertron-7b-v2-bf16` model is a text-to-text AI model, meaning it takes text as input and generates text as output. It performs well on a variety of natural language tasks, including question answering, logical reasoning, and open-ended conversation.

### Inputs
- Text prompts in natural language

### Outputs
- Generated text responses in natural language
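
The text-in, text-out flow can be sketched with the Hugging Face `transformers` library. This is an illustrative sketch rather than an official usage recipe from the model authors: the `generate` helper and its `max_new_tokens` default are choices made for this example, `device_map="auto"` assumes the `accelerate` package is installed, and loading the weights requires enough GPU or CPU memory for a 7B model in bf16.

```python
MODEL_ID = "fblgit/una-cybertron-7b-v2-bf16"


def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Return the model's continuation of `prompt` (hypothetical helper)."""
    # Imported lazily so that defining this function does not load the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # matches the bf16 weights named in the model ID
        device_map="auto",           # assumption: `accelerate` is available
    )

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Drop the prompt tokens so only the newly generated text is returned.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example call (downloads the weights on first use):
# print(generate("Explain QKV"))
```

For repeated calls, loading the tokenizer and model once outside the function would avoid reloading the weights each time.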

## Capabilities

The `una-cybertron-7b-v2-bf16` model excels at mathematical and logical reasoning, scoring 55.04 on GSM8k (5-shot) and 68.26 on the AI2 Reasoning Challenge (25-shot) on the Hugging Face Open LLM Leaderboard. It can engage in deep contextual analysis and provide detailed, well-reasoned responses.

## What can I use it for?

The `una-cybertron-7b-v2-bf16` model could be used for a wide range of natural language processing tasks, such as:

- Chatbots and conversational AI assistants
- Question answering and information retrieval
- Content generation for websites, blogs, or social media
- Summarization and text analysis
- Logical and mathematical problem-solving

## Things to try

One interesting aspect of the `una-cybertron-7b-v2-bf16` model is its use of the UNA (Uniform Neural Alignment) technique, which the maintainer claims helps "tame" the model. Experimenting with different prompts and tasks could reveal insights into how this technique affects the model's behavior and capabilities.