RakutenAI-7B
Rakuten
The RakutenAI-7B model is a large language model developed by Rakuten that achieves strong performance on Japanese language understanding benchmarks while remaining competitive on English test sets. It uses the Mistral model architecture and was initialized from the Mistral-7B-v0.1 pre-trained checkpoint, an example of successfully retrofitting pre-trained weights to a new language. The model also extends Mistral's vocabulary from 32k to 48k tokens to achieve a better character-per-token rate for Japanese.
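To see what the expanded vocabulary means in practice, the sketch below compares token counts for the same Japanese sentence under both tokenizers. It assumes the tokenizers are published on Hugging Face under Rakuten/RakutenAI-7B and mistralai/Mistral-7B-v0.1; the sample sentence is illustrative.

```python
# Hedged sketch: compare tokenization efficiency of the extended 48k vocabulary
# against the original Mistral 32k vocabulary on Japanese text.
# Repo IDs are assumptions based on the public Hugging Face releases.
from transformers import AutoTokenizer

text = "楽天グループは日本を拠点とする企業です。"  # "Rakuten Group is a company based in Japan."

rakuten_tok = AutoTokenizer.from_pretrained("Rakuten/RakutenAI-7B")
mistral_tok = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")

# Fewer tokens per character means cheaper inference and a longer effective
# context window for Japanese input.
print("RakutenAI-7B tokens:", len(rakuten_tok.tokenize(text)))
print("Mistral-7B tokens:  ", len(mistral_tok.tokenize(text)))
```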
According to the benchmarks reported by Rakuten, RakutenAI-7B outperforms comparable open Japanese models such as OpenCalm, Elyza, Youri, Nekomata, and Swallow on several Japanese language understanding tasks.
Model Inputs and Outputs
Inputs
The model accepts text input in Japanese and English.
Outputs
The model generates human-like text in Japanese and English.
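A minimal generation sketch follows, assuming the base model is available on Hugging Face as Rakuten/RakutenAI-7B and that the standard transformers text-generation API applies; the prompt and sampling settings are illustrative, not recommendations from the release.

```python
# Minimal text-generation sketch; model path and sampling settings are illustrative.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "Rakuten/RakutenAI-7B"  # assumed Hugging Face repo ID
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path, torch_dtype=torch.bfloat16, device_map="auto"
)

prompt = "日本の四季について説明してください。"  # "Please explain Japan's four seasons."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=True, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```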
Capabilities
The RakutenAI-7B model demonstrates strong performance on a variety of Japanese language understanding tasks, including JSNLI, RTE, KUCI, JCS, and JNLI. It also maintains competitive results on English test sets compared to similar models. Rakuten has further fine-tuned the foundation model to create the RakutenAI-7B-instruct and RakutenAI-7B-chat models for specific use cases.
What Can I Use It For?
The RakutenAI-7B model can be used for a variety of natural language processing tasks, such as text generation, language understanding, and translation between Japanese and English. Its strong performance on Japanese benchmarks makes it well-suited for applications targeting the Japanese market, such as customer service chatbots, content generation, and language learning tools.
Rakuten has also made available the RakutenAI-7B-instruct and RakutenAI-7B-chat models, which can be used for instruction-following and open-ended conversational tasks, respectively.
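The sketch below shows one way to prompt the instruct variant. The USER/ASSISTANT template here is an assumption for illustration; the exact prompt format the model was fine-tuned with should be taken from the RakutenAI-7B-instruct model card.

```python
# Hedged sketch for the instruct variant. The USER/ASSISTANT template below is an
# assumption; verify the exact fine-tuning format on the model card before use.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "Rakuten/RakutenAI-7B-instruct"  # assumed Hugging Face repo ID
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path, torch_dtype=torch.bfloat16, device_map="auto"
)

instruction = "楽天ポイントの使い方を3つ挙げてください。"  # "List three ways to use Rakuten points."
prompt = f"USER: {instruction} ASSISTANT:"  # assumed template

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
# Print only the newly generated continuation, not the echoed prompt.
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```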
Things to Try
One interesting aspect of the RakutenAI-7B model is its ability to perform well on both Japanese and English tasks, making it a versatile model for multilingual applications. Developers could explore using the model for tasks that require understanding and generation in both languages, such as translation, cross-lingual information retrieval, or even building language learning tools that can adapt to the user's native language.
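For example, a few-shot prompt can steer the base model toward Japanese-to-English translation. In this hedged sketch, `model` and `tokenizer` are assumed to be loaded as in the earlier generation example, and the example pairs are made up for illustration.

```python
# Hedged few-shot translation sketch; `model` and `tokenizer` are assumed to be
# loaded as in the earlier generation example, and the pairs are illustrative.
prompt = (
    "日本語: おはようございます。\n"
    "English: Good morning.\n"
    "日本語: 東京は今日も晴れです。\n"
    "English: It is sunny in Tokyo again today.\n"
    "日本語: 楽天市場で買い物をしました。\n"  # "I went shopping on Rakuten Ichiba."
    "English:"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32, do_sample=False)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```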
Another area to explore is the model's performance on various Japanese-specific tasks, such as sentiment analysis, text summarization, or question answering on Japanese-language data. Leveraging the model's strong performance on Japanese benchmarks could lead to interesting applications tailored to the Japanese market.
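As a starting point, the same few-shot pattern works for sentiment analysis on Japanese text. The reviews and labels below are invented for illustration, and `model` and `tokenizer` are again assumed to be loaded as above.

```python
# Hedged few-shot sentiment sketch; reviews and labels are invented for
# illustration, and `model`/`tokenizer` are assumed to be loaded as above.
prompt = (
    "レビュー: 配送が早くて商品も完璧でした。\n感情: ポジティブ\n"    # positive review
    "レビュー: 写真と全然違う商品が届きました。\n感情: ネガティブ\n"  # negative review
    "レビュー: 値段の割に品質がとても良いです。\n感情:"               # review to classify
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=5, do_sample=False)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```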