[![](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1ZcLIJuemiojigrfjbsDMBWrX7JqXZX6I?usp=sharing)

ChatYuan: 

PromptCLUE-large

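[PromptCLUE-large](https://www.cluebenchmarks.com/clueai.html) was pre-trained on a Chinese corpus of 100 billion tokens, saw 1.5 trillion Chinese tokens in total during training, and was then trained on prompt-style tasks.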

[Demo](https://www.clueai.cn/chat) | [API (large)](https://www.clueai.cn) | [GitHub](https://github.com/clue-ai/ChatYuan) | [Colab](https://colab.research.google.com/drive/1ZcLIJuemiojigrfjbsDMBWrX7JqXZX6I?usp=sharing#scrollTo=QokO0pdGmAYH) | [WeChat article](https://mp.weixin.qq.com/s/-axa6XcjGl_Koeq_OrDq8w)



![](https://huggingface.co/ClueAI/ChatYuan-large-v1/resolve/main/chatyuan_wechat.jpg)



    # Load the tokenizer and model
    from transformers import T5Tokenizer, T5ForConditionalGeneration
    tokenizer = T5Tokenizer.from_pretrained("ClueAI/ChatYuan-large-v1")
    model = T5ForConditionalGeneration.from_pretrained("ClueAI/ChatYuan-large-v1")
    



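    # Define the inference helpers
    import torch
    
    # Use the GPU when one is available (for example on Colab); otherwise fall back to the CPU
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
    model.to(device)
    
    def preprocess(text):
      # Escape newlines and tabs into the literal sequences "\n" and "\t"
      text = text.replace("\n", "\\n").replace("\t", "\\t")
      return text
    
    def postprocess(text):
      # Restore real newlines and tabs in the generated text
      return text.replace("\\n", "\n").replace("\\t", "\t")
    
    def answer(text, sample=True, top_p=1, temperature=0.7):
      '''sample: whether to sample; set to True for generation tasks.
      top_p: nucleus-sampling threshold between 0 and 1; larger values give more diverse output.'''
      text = preprocess(text)
      # Inputs longer than 768 tokens are truncated
      encoding = tokenizer(text=[text], truncation=True, padding=True, max_length=768, return_tensors="pt").to(device)
      if not sample:
        # Greedy decoding
        out = model.generate(**encoding, return_dict_in_generate=True, output_scores=False, max_new_tokens=512, num_beams=1, length_penalty=0.6)
      else:
        # Sampling with top-p filtering and temperature scaling
        out = model.generate(**encoding, return_dict_in_generate=True, output_scores=False, max_new_tokens=512, do_sample=True, top_p=top_p, temperature=temperature, no_repeat_ngram_size=3)
      out_text = tokenizer.batch_decode(out["sequences"], skip_special_tokens=True)
      return postprocess(out_text[0])
    print("end...")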
    
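
The `preprocess` and `postprocess` helpers above can be checked without loading the model. The following is a minimal sketch that repeats the two functions and runs a round trip on a made-up string; the likely motivation (keeping newlines and tabs visible to the tokenizer) is an assumption, not something the model card states.

```python
def preprocess(text: str) -> str:
    # Escape real newlines and tabs into the two-character sequences "\n" and "\t".
    return text.replace("\n", "\\n").replace("\t", "\\t")


def postprocess(text: str) -> str:
    # Turn the escaped sequences back into real newlines and tabs.
    return text.replace("\\n", "\n").replace("\\t", "\t")


raw = "line one\nline two\tend"   # illustrative input, not from the model card
escaped = preprocess(raw)
print(escaped)                      # a single line containing literal backslashes
print(postprocess(escaped) == raw)  # True
```

The round trip is only exact when the input does not already contain a literal backslash followed by `n` or `t`; such text would be converted to a real newline or tab by `postprocess`.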

Single-turn examples
===========================

    input_text0 = "3"
    input_text1 = ""
    input_text2 = ""
    input_text3 = ""
    input_text4 = ""
    input_text5 = ""
    input_text6 = "800"
    input_text7 = ""
    input_text8 = "GPU"
    input_list = [input_text0, input_text1, input_text2, input_text3, input_text4, input_text5, input_text6, input_text7, input_text8]
    for i, input_text in enumerate(input_list):
      input_text = "" + input_text + "\n"
      print(f"{i}".center(50, "="))
      output_text = answer(input_text)
      print(f"{input_text}{output_text}")
    

\=======================0========================  
3  


!

3

  
!



!

\=======================1========================  
  
!AI  
\=======================2========================  
  
E-mail apologies to your company for not reaching your delivery due to delay in logistics, we can compensate you for all losses.  
\=======================3========================  
  




  
  
\=======================4========================  
  
   
  


  
  
  
  
   
  
  
\=======================5========================  
  


1.  11
    
2.  11
    
3.  221
    

\=======================6========================  
800  


  
  
  
\=======================7========================  


()7100  
\=======================8========================  
GPU  


1 GeForce GTX 880 GDDR5  
2 Geforce RTX 1060 PhysX 3 Ge force MX150 SLI  
4 GetoDirectX 11DX11  
  
1.GDDR4X 256MB  
2.GDDR6X 8GB  
3.GDDR3 120GB  
4.GDDR7 2GB   
1.4 GB/s 2. 5.5 ms 3. 5 ms

Multi-turn dialogue
=============

    input_text = ["","",""]
    answer_text = ["!AI", "", ""]
    context = "\n".join([f"{input_text[i]}\n{answer_text[i]}" for i in range(len(input_text))])
    print(context)
    
    input_text = ""
    print(f"".center(50, "="))
    input_text = context + "\n" + input_text + "\n"
    output_text = answer(input_text)
    print(f"{input_text}{output_text}")
    

\================================================  
  
!AI  
  
  
  
  
  

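The context-building step in the multi-turn example can be wrapped in a small helper. This is a sketch under stated assumptions: `build_prompt`, `user_prefix`, and `bot_prefix` are hypothetical names, and the role markers shown in the usage lines are placeholders rather than the markers the model was trained with.

```python
from typing import List, Tuple


def build_prompt(history: List[Tuple[str, str]], query: str,
                 user_prefix: str = "", bot_prefix: str = "") -> str:
    """Join earlier (question, reply) pairs and the new question into one prompt.

    user_prefix and bot_prefix are hypothetical role markers; they default
    to empty strings.
    """
    lines = []
    for question, reply in history:
        lines.append(f"{user_prefix}{question}")
        lines.append(f"{bot_prefix}{reply}")
    lines.append(f"{user_prefix}{query}")
    # End with the reply marker on its own line so the model continues from there.
    return "\n".join(lines) + "\n" + bot_prefix


history = [("Hello", "Hi, how can I help?"), ("Who are you?", "I am an assistant.")]
prompt = build_prompt(history, "What can you do?", user_prefix="User: ", bot_prefix="Bot: ")
print(prompt)
```

The resulting string can be passed to `answer` in place of the manually concatenated `input_text`.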

### Technical discussion and feedback

[Discord](https://discord.gg/hUVyMRByaE) | [GitHub: discussion and feedback](https://github.com/clue-ai/ChatYuan#%E6%8A%80%E6%9C%AF%E4%BA%A4%E6%B5%81%E5%92%8C%E9%97%AE%E9%A2%98%E5%8F%8D%E9%A6%88%E6%89%AB%E7%A0%81%E5%9C%A8%E7%BA%BF%E4%BD%93%E9%AA%8C%E5%B0%8F%E7%A8%8B%E5%BA%8F%E6%88%96%E5%85%A5%E7%BE%A4)


## Model overview

The `ChatYuan-large-v1` model is a dialogue language model published by ClueAI. It uses the T5 encoder-decoder architecture (the examples load it with `T5ForConditionalGeneration`) and is related to ClueAI's PromptCLUE-large model. It is intended for open-ended conversation, question answering, and text generation from instructions.

Similar chat models include [Qwen-7B-Chat](https://aimodels.fyi/models/huggingFace/qwen-7b-chat-qwen) and [Baichuan2-7B-Chat](https://aimodels.fyi/models/huggingFace/baichuan2-7b-chat-baichuan-inc). Those are decoder-only models with about 7 billion parameters, while `ChatYuan-large-v1` is a much smaller T5-style encoder-decoder, which makes it lighter to run on a single GPU such as the one provided by Colab.

## Model inputs and outputs

### Inputs
- **Text**: A text prompt such as a question, an instruction, or a dialogue history. The example code truncates inputs to 768 tokens (`max_length=768`).

### Outputs
- **Text**: Generated text that answers the prompt or continues the conversation. The example code generates at most 512 new tokens (`max_new_tokens=512`) and samples by default with `temperature=0.7`.
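
The `answer` helper in the usage code exposes `top_p` and `temperature`. To show what those two parameters do, here is a minimal pure-Python sketch of temperature scaling followed by nucleus (top-p) filtering on a toy vocabulary. `sample_top_p` and `toy_logits` are hypothetical names for illustration; this is not the implementation used inside `model.generate`.

```python
import math
import random
from typing import Dict, Optional


def sample_top_p(logits: Dict[str, float], top_p: float = 1.0,
                 temperature: float = 0.7,
                 rng: Optional[random.Random] = None) -> str:
    """Pick one token using temperature scaling and nucleus (top-p) filtering."""
    rng = rng or random.Random()
    # Temperature below 1 sharpens the distribution; above 1 flattens it.
    scaled = {tok: score / temperature for tok, score in logits.items()}
    peak = max(scaled.values())
    weights = {tok: math.exp(s - peak) for tok, s in scaled.items()}
    total = sum(weights.values())
    probs = sorted(((w / total, tok) for tok, w in weights.items()), reverse=True)
    # Keep the smallest set of most likely tokens whose probability mass reaches top_p.
    kept, mass = [], 0.0
    for p, tok in probs:
        kept.append((p, tok))
        mass += p
        if mass >= top_p:
            break
    tokens = [tok for _, tok in kept]
    return rng.choices(tokens, weights=[p for p, _ in kept], k=1)[0]


toy_logits = {"cat": 2.0, "dog": 1.0, "fish": -1.0}
print(sample_top_p(toy_logits, top_p=0.5))  # only "cat" survives the filter
print(sample_top_p(toy_logits, top_p=1.0))  # any of the three tokens can appear
```

With `top_p=1` (the default in `answer`) no token is filtered out, so diversity is controlled by `temperature` alone.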

## Capabilities

The `ChatYuan-large-v1` model handles open-ended conversation, question answering, and content generation. The sample prompts in the model card cover single-turn requests as well as a multi-turn dialogue in which earlier turns are supplied as context.

Tasks that need multi-step reasoning, such as mathematical word problems or code generation, can also be prompted, but the outputs should be checked: the GPU-related sample output above, for example, lists implausible hardware specifications such as "GDDR3 120GB".

## What can I use it for?

The `ChatYuan-large-v1` model has a wide range of potential applications, both for individual users and businesses. Some ideas for using the model include:

- **Conversational AI**: Integrating the model into chatbots or virtual assistants to provide engaging and informative interactions with users.
- **Content Generation**: Leveraging the model's text generation capabilities to create high-quality articles, stories, or marketing materials.
- **Task Automation**: Using the model's reasoning and problem-solving abilities to automate various tasks, such as data analysis, code generation, or report writing.
- **Educational Support**: Employing the model to assist students with learning, tutoring, or homework help across a variety of subjects.

[ClueAI](https://aimodels.fyi/creators/huggingFace/ClueAI) maintains the `ChatYuan-large-v1` model and publishes it openly on Hugging Face, together with a hosted demo, an API, and a Colab notebook, so developers and researchers can try it without a local setup.

## Things to try

One interesting aspect of the `ChatYuan-large-v1` model is multi-turn conversation. The model keeps no state between calls, so context is carried by concatenating the earlier turns ahead of the new question. Try a back-and-forth exchange on a topic of your choice, and see how the model responds to follow-up questions or requests for clarification.
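
Because the example code truncates the input to 768 tokens, a long conversation eventually needs its oldest turns dropped before the prompt is built. The following is a rough sketch of one way to do that; `trim_history` and `count_tokens` are hypothetical names, and the default of counting characters is a stand-in for counting tokens with the model's tokenizer.

```python
from typing import Callable, List, Tuple

Turn = Tuple[str, str]  # (user message, model reply)


def trim_history(history: List[Turn], query: str, budget: int,
                 count_tokens: Callable[[str], int] = len) -> List[Turn]:
    """Drop the oldest turns until the history plus the new query fits in `budget`.

    count_tokens is a stand-in (character count by default); a real setup
    would count tokens with the model's tokenizer.
    """
    kept = list(history)  # copy so the caller's list is not modified

    def cost(turns: List[Turn]) -> int:
        return sum(count_tokens(q) + count_tokens(a) for q, a in turns) + count_tokens(query)

    while kept and cost(kept) > budget:
        kept.pop(0)  # discard the oldest exchange first
    return kept


history = [("a" * 10, "b" * 10), ("c" * 5, "d" * 5), ("e" * 3, "f" * 3)]
print(trim_history(history, "g" * 4, budget=20))
```

Dropping whole turns from the front keeps the most recent question intact, whereas relying on tokenizer truncation alone may cut off the end of the prompt, depending on the tokenizer's truncation side.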

Another experiment is to prompt the model with tasks that require logical reasoning, such as mathematical word problems or short coding tasks, and to check each answer for correctness.

Finally, the model's versatility in content generation makes it a valuable tool for a wide range of applications. Explore using the model to create engaging stories, informative articles, or even marketing materials, and see how its language generation abilities can be leveraged to meet your specific needs.