[](#openchat-less-is-more-for-open-source-models)OpenChat: Less is More for Open-source Models
==============================================================================================

OpenChat is a series of open-source language models fine-tuned on a diverse and high-quality dataset of multi-round conversations. With only ~6K GPT-4 conversations filtered from the ~90K ShareGPT conversations, OpenChat is designed to achieve high performance with limited data.

**Generic models:**

*   OpenChat: based on LLaMA-13B (2048 context length)
    *   ** 105.7%** of ChatGPT score on Vicuna GPT-4 evaluation
    *   ** 80.9%** Win-rate on AlpacaEval
    *   ** Only used 6K data for finetuning!!!**
*   OpenChat-8192: based on LLaMA-13B (extended to 8192 context length)
    *   **106.6%** of ChatGPT score on Vicuna GPT-4 evaluation
    *   **79.5%** Win-rate on AlpacaEval

**Code models:**

*   OpenCoderPlus: based on StarCoderPlus (native 8192 context length)
    *   **102.5%** of ChatGPT score on Vicuna GPT-4 evaluation
    *   **78.7%** Win-rate on AlpacaEval

_Note:_ Please load the pretrained models using _bfloat16_

[](#code-and-inference-server)Code and Inference Server
-------------------------------------------------------

We provide the full source code, including an inference server compatible with the "ChatCompletions" API, in the [OpenChat](https://github.com/imoneoi/openchat) GitHub repository.

[](#web-ui)Web UI
-----------------

OpenChat also includes a web UI for a better user experience. See the GitHub repository for instructions.

[](#conversation-template)Conversation Template
-----------------------------------------------

The conversation template **involves concatenating tokens**.

Besides base model vocabulary, an end-of-turn token `<|end_of_turn|>` is added, with id `eot_token_id`.

    # OpenChat
    [bos_token_id] + tokenize("Human: ") + tokenize(user_question) + [eot_token_id] + tokenize("Assistant: ")
    # OpenCoder
    tokenize("User:") + tokenize(user_question) + [eot_token_id] + tokenize("Assistant:")
    

_Hint: In BPE, `tokenize(A) + tokenize(B)` does not always equals to `tokenize(A + B)`_

Following is the code for generating the conversation templates:

    @dataclass
    class ModelConfig:
        # Prompt
        system: Optional[str]
    
        role_prefix: dict
        ai_role: str
        eot_token: str
        bos_token: Optional[str] = None
    
        # Get template
        def generate_conversation_template(self, tokenize_fn, tokenize_special_fn, message_list):
            tokens = []
            masks = []
    
            # begin of sentence (bos)
            if self.bos_token:
                t = tokenize_special_fn(self.bos_token)
                tokens.append(t)
                masks.append(False)
    
            # System
            if self.system:
                t = tokenize_fn(self.system) + [tokenize_special_fn(self.eot_token)]
                tokens.extend(t)
                masks.extend([False] * len(t))
    
            # Messages
            for idx, message in enumerate(message_list):
                # Prefix
                t = tokenize_fn(self.role_prefix[message["from"]])
                tokens.extend(t)
                masks.extend([False] * len(t))
    
                # Message
                if "value" in message:
                    t = tokenize_fn(message["value"]) + [tokenize_special_fn(self.eot_token)]
                    tokens.extend(t)
                    masks.extend([message["from"] == self.ai_role] * len(t))
                else:
                    assert idx == len(message_list) - 1, "Empty message for completion must be on the last."
    
            return tokens, masks
    
    
    MODEL_CONFIG_MAP = {
        # OpenChat / OpenChat-8192
        "openchat": ModelConfig(
            # Prompt
            system=None,
    
            role_prefix={
                "human": "Human: ",
                "gpt": "Assistant: "
            },
            ai_role="gpt",
            eot_token="<|end_of_turn|>",
            bos_token="<s>",
        ),
    
        # OpenCoder / OpenCoderPlus
        "opencoder": ModelConfig(
            # Prompt
            system=None,
    
            role_prefix={
                "human": "User:",
                "gpt": "Assistant:"
            },
            ai_role="gpt",
            eot_token="<|end_of_turn|>",
            bos_token=None,
        )
    }
    

[](#license)License
-------------------

Our weight license is subject to their corresponding base model. For example, OpenChat and OpenChat-8192 are the same as the model [License](https://github.com/facebookresearch/llama/blob/main/MODEL_CARD.md) of LLaMA for non-commercial use only, while OpenCoderPlus is under the [License](https://huggingface.co/blog/starcoder) of StarCoder. Furthermore, we should follow the [Privacy Practices](https://chrome.google.com/webstore/detail/sharegpt-share-your-chatg/daiacboceoaocpibfodeljbdfacokfjb) of ShareGPT. The [code](https://github.com/imoneoi/openchat) released on GitHub is under Apache License 2.0.

[](#citation)Citation
---------------------

    @software{openllms23,
      title = {{OpenLLMs: Less is More for Open-source Models}},
      author = {Wang, Guan and Cheng, Sijie and Yu, Qiying and Liu, Changling},
      doi = {10.5281/zenodo.8105775},
      url = {https://github.com/imoneoi/openchat},
      version = {pre-release},
      year = {2023},
      month = {7},
    }

## Model overview

The `openchat` model is a series of open-source language models fine-tuned on a diverse and high-quality dataset of multi-round conversations. According to the maintainer, the [OpenChat models](https://aimodels.fyi/creators/huggingFace/openchat) are designed to achieve high performance with limited data, with only ~6K GPT-4 conversations filtered from the ~90K ShareGPT conversations used for fine-tuning.

The [OpenChat-3.5-0106](https://aimodels.fyi/models/huggingFace/openchat-35-0106-openchat) model in particular is described as the "Overall Best Performing Open Source 7B Model" for coding, generalization, and mathematical reasoning tasks. It outperforms both [ChatGPT (March)](https://aimodels.fyi/models/huggingFace/openchat-35-0106-openchat#comparison-with-xai-grok-models) and the proprietary [Grok-1](https://aimodels.fyi/models/huggingFace/openchat-35-0106-openchat#comparison-with-xai-grok-models) model on various benchmarks.

## Model inputs and outputs

The `openchat` model accepts conversational inputs in a specific format, with an `<|end_of_turn|>` token marking the end of each turn. The model can operate in different modes, including a "Default Mode (GPT4 Correct)" for general tasks and a "Mathematical Reasoning Mode" tailored for solving math problems.

### Inputs
- **Conversational inputs**: The model expects a sequence of conversational turns, with each turn separated by the `<|end_of_turn|>` token.
- **Mode selection**: The model can be instructed to operate in different modes, such as "Default Mode (GPT4 Correct)" or "Mathematical Reasoning Mode", by including a mode identifier in the input.

### Outputs
- **Conversational responses**: The model generates a response to the provided conversational input, which can be used to continue the conversation.
- **Task-specific outputs**: Depending on the mode, the model can produce outputs tailored for tasks like mathematical problem-solving or general language understanding.

## Capabilities

The `openchat-3.5-0106` model excels at a variety of tasks, including summarization, question answering, extraction, and classification. It has demonstrated strong performance on benchmarks like MT-Bench, HumanEval, and GSM8K, often outperforming larger proprietary models.

## What can I use it for?

The `openchat` models are suitable for a wide range of applications, from building open-source chatbots and virtual assistants to integrating language understanding capabilities into educational or creative tools. The maintainers encourage using the models for research purposes, such as probing the limitations and biases of dialogue models or exploring safe deployment strategies.

## Things to try

One interesting aspect of the `openchat` models is their ability to operate in different modes, allowing users to tailor the model's behavior to specific types of tasks. For example, you could experiment with the "Mathematical Reasoning Mode" to see how the model performs on math-focused prompts, or try the "Default Mode (GPT4 Correct)" for more general language understanding and generation tasks.

Another area to explore is the model's few-shot capabilities, as the maintainers note that the model often performs even better with few-shot prompts. This could be a valuable avenue for further research and development.