YAYI 2
======

[GitHub](https://github.com/wenge-research/YAYI2) | [](https://yayi.wenge.com)

[](#introduction)/Introduction
----------------------------------

YAYI 2  Base  Chat  30BYAYI2-30B  Transformer  2.65  Tokens 

 YAYI2-30B Base  YAYI 2  [GitHub](https://github.com/wenge-research/YAYI2) [YAYI 2: Multilingual Open-Source Large Language Models](https://arxiv.org/abs/2312.14862)

YAYI 2 is a collection of open-source large language models launched by Wenge Technology. YAYI2-30B is a Transformer-based large language model, and has been pretrained for 2.65 trillion tokens of multilingual data with high quality. The base model is aligned with human values through supervised fine-tuning with millions of instructions and reinforcement learning from human feedback (RLHF).

We opensource the pre-trained language model in this release, namely **YAYI2-30B**. For more details about the YAYI 2, please refer to our [GitHub](https://github.com/wenge-research/YAYI2) repository. For more technical details, please read our technical report YAYI 2: Multilingual Open-Source Large Language Models.

[](#model-details)/Model Details
----------------------------------------

Hyperparameter

Value

n\_layers

64

n\_heads

64

hidden\_size

7168

vocab\_size

81920

sequence length

4096

[](#requirements)/Requirements
----------------------------------

*   python 3.8
    
*   pytorch 2.0.1 
    
*    CUDA 11.7 
    
*    BF16  FP16 80GB1xA100
    
*   python 3.8 and above
    
*   pytorch 2.0.1 and above
    
*   CUDA 11.7 and above are recommended
    
*   To run YAYI2-30B in bf16/fp16, at least 80GB GPU memory is required (e.g., 1xA100-80GB)
    

[](#quick-start)/Quick Start
------------------------------------

    >>> from transformers import AutoModelForCausalLM, AutoTokenizer
    >>> tokenizer = AutoTokenizer.from_pretrained("wenge-research/yayi2-30b", trust_remote_code=True)
    >>> model = AutoModelForCausalLM.from_pretrained("wenge-research/yayi2-30b", device_map="auto", trust_remote_code=True)
    >>> inputs = tokenizer('The winter in Beijing is', return_tensors='pt')
    >>> inputs = inputs.to('cuda')
    >>> pred = model.generate(
            **inputs, 
            max_new_tokens=256, 
            eos_token_id=tokenizer.eos_token_id, 
            do_sample=True,
            repetition_penalty=1.2,
            temperature=0.4, 
            top_k=100, 
            top_p=0.8
            )
    >>> print(tokenizer.decode(pred.cpu()[0], skip_special_tokens=True))
    

[](#evaluation)/Evaluation
----------------------------------

 C-EvalMMLU CMMLUAGIEvalGAOKAO-BenchGSM8KMATHBBHHumanEval  MBPPYAYI 2 

We evaluate our model on standard benchmarks, including C-Eval, MMLU, CMMLU, AGIEval, GAOKAO-Bench, GSM8K, MATH, BBH, HumanEval, and MBPP. Our goal is to assess the model's performance in language comprehension, knowledge comprehension, mathematical reasoning, logical reasoning, and code generation. YAYI 2 has demonstrated exceptional performance across models with similar size.

Knowledge

Math

Logic reasonning

Code

Model

C-Eval(val)

MMLU

AGIEval

CMMLU

GAOKAO-Bench

GSM8K

MATH

BBH

HumanEval

MBPP

5-shot

5-shot

3/0-shot

5-shot

0-shot

8/4-shot

4-shot

3-shot

0-shot

3-shot

**MPT-30B**

\-

46.9

33.8

\-

\-

15.2

3.1

38.0

25.0

32.8

**Falcon-40B**

\-

55.4

37.0

\-

\-

19.6

5.5

37.1

0.6

29.8

**LLaMA2-34B**

\-

62.6

43.4

\-

\-

42.2

6.2

44.1

22.6

33.0

**Baichuan2-13B**

59.0

59.5

37.4

61.3

45.6

52.6

10.1

49.0

17.1

30.8

**Qwen-14B**

71.7

67.9

51.9

70.2

62.5

61.6

25.2

53.7

32.3

39.8

**InternLM-20B**

58.8

62.1

44.6

59.0

45.5

52.6

7.9

52.5

25.6

35.6

**Aquila2-34B**

98.5

76.0

43.8

78.5

37.8

50.0

17.8

42.5

0.0

41.0

**Yi-34B**

81.8

76.3

56.5

82.6

68.3

67.6

15.9

66.4

26.2

38.2

**YAYI2-30B**

80.9

**80.5**

**62.0**

**84.0**

64.4

**71.2**

14.8

54.5

**53.1**

**45.8**

 [OpenCompass Github ](https://github.com/open-compass/opencompass)  [OpenCompass](https://opencompass.org.cn)  20231215 [OpenCompass](https://opencompass.org.cn/leaderboard-llm)  MPTFalcon  LLaMa 2 [LLaMA 2](https://arxiv.org/abs/2307.09288) 

We evaluate our model using the source code from the [OpenCompass Github repository](https://github.com/open-compass/opencompass). If available, we report results for comparative models assessed by OpenCompass with the evaluation reference date set to Dec. 15th, 2013. For MPT, Falcon, and Llama, which have not been evaluated by OpenCompass, we use the results reported in the [LLaMA 2](https://arxiv.org/abs/2307.09288) paper.

[](#license)/License
------------------------

 [Apache-2.0](https://github.com/wenge-research/YAYI2/blob/main/LICENSE)  YAYI 2 [ YAYI 2 ](https://github.com/wenge-research/YAYI2/blob/main/COMMUNITY_LICENSE) YAYI 2 [ YAYI 2 ](https://github.com/wenge-research/YAYI2/blob/main/REGISTRATION_INFORMATION) [yayi@wenge.com](mailto:yayi@wenge.com)3[ YAYI 2 ](https://github.com/wenge-research/YAYI2/blob/main/COMMERCIAL_LICENSE)

The code in this project is open-sourced under the [Apache-2.0](https://github.com/wenge-research/YAYI2/blob/main/LICENSE) license. The use of YaYi series model weights and data must adhere to the [YAYI 2 Community License](https://github.com/wenge-research/YAYI2/blob/main/COMMUNITY_LICENSE). If you intend to use the YAYI 2 series models or their derivatives for commercial purposes, please complete the [YAYI 2 Model Commercial Registration Information](https://github.com/wenge-research/YAYI2/blob/main/REGISTRATION_INFORMATION_EN) and send it to [yayi@wenge.com](mailto:yayi@wenge.com). After receiving the email, we will conduct an audit within 3 working days. Once the audit is passed, you will receive a commercial license. Please strictly comply with the relevant content of the [YAYI 2 Model Commercial License Agreement](https://github.com/wenge-research/YAYI2/blob/main/COMMERCIAL_LICENSE) during the use process. Thank you for your cooperation!

[](#citation)/Citation
--------------------------



If you are using the resource for your work, please cite our paper.

    @article{YAYI 2,
      author    = {Yin Luo, Qingchao Kong, Nan Xu, et.al.},
      title     = {YAYI 2: Multilingual Open Source Large Language Models},
      journal   = {arXiv preprint arXiv:2312.14862},
      url       = {https://arxiv.org/abs/2312.14862},
      year      = {2023}
    }

## Model Overview

`yayi2-30b` is a large language model developed by the Wenge Research team. It is a 30 billion parameter Transformer model that has been pretrained on 2.65 trillion tokens of multilingual data. The model has been aligned with human values through supervised fine-tuning with millions of instructions and reinforcement learning from human feedback (RLHF). 

The `yayi2-30b` model is part of the larger YAYI 2 collection of open-source language models released by Wenge Technology. The YAYI 2 models have demonstrated strong performance on a variety of benchmarks, including C-Eval, MMLU, CMMLU, AGIEval, GAOKAO-Bench, GSM8K, MATH, BBH, HumanEval, and MBPP.

Similar large language models include [Nous-Hermes-2-Yi-34B](https://aimodels.fyi/models/huggingFace/nous-hermes-2-yi-34b-nousresearch) from Nous Research, which is a 34 billion parameter model trained on 1 million high-quality GPT-4 generated data, and [Baichuan2-13B-Base](https://aimodels.fyi/models/huggingFace/baichuan2-13b-base-baichuan-inc) from Baichuan Inc, a 13 billion parameter model trained on 2.6 trillion tokens.

## Model Inputs and Outputs

The `yayi2-30b` model is a text-to-text transformer, taking natural language text as input and generating natural language text as output. 

### Inputs
- Natural language text of up to 4096 tokens in length

### Outputs
- Continuation of the input text, generating additional natural language content
- The model can be used for a variety of text generation tasks, such as:
    - Open-ended conversation
    - Question answering 
    - Summarization
    - Creative writing

## Capabilities

The `yayi2-30b` model has demonstrated strong performance across a wide range of benchmarks, showcasing its capabilities in language understanding, knowledge, and generation. For example, the model has achieved high scores on the C-Eval, MMLU, and CMMLU benchmarks, demonstrating its proficiency in areas like general knowledge, logical reasoning, and language comprehension.

In terms of specific capabilities, the `yayi2-30b` model can engage in open-ended conversations, answer questions, and generate fluent and coherent text across a variety of topics and domains. The model's multilingual training allows it to understand and generate content in multiple languages, including Chinese and English.

## What Can I Use it For?

The `yayi2-30b` model can be a powerful tool for a variety of natural language processing applications, such as:

- **Conversational AI assistants**: The model's ability to engage in open-ended dialogue and answer questions makes it well-suited for building conversational AI agents that can assist users with a wide range of tasks.

- **Content generation**: The model's text generation capabilities can be leveraged to create original written content, such as articles, stories, or product descriptions.

- **Summarization**: The model can be used to automatically summarize long-form text, distilling key information and insights.

- **Translation**: The model's multilingual capabilities can be utilized for machine translation between languages.

## Things to Try

One interesting aspect of the `yayi2-30b` model is its strong performance on benchmarks like C-Eval, MMLU, and CMMLU. This suggests the model has a robust understanding of a wide range of knowledge domains, from general trivia to logical reasoning and language comprehension. 

Developers could explore using the `yayi2-30b` model as a foundation for building specialized knowledge-driven applications, such as question-answering systems or educational tools. By fine-tuning the model on domain-specific data, it may be possible to create highly capable and knowledgeable AI assistants that can engage in substantive discussions and provide authoritative answers on complex topics.

Another interesting direction to explore is the model's multilingual capabilities. Given its proficiency in both Chinese and English, the `yayi2-30b` model could be utilized for building cross-lingual applications, such as bilingual chatbots or translation services. Developers could experiment with prompting the model to generate content in one language based on input in another, or to switch seamlessly between languages during a conversation.