[](#transformers4302)transformers4.30.2
===================================================================================

*   [https://huggingface.co/spaces/fb700/chatglm-fitness-RLHF](https://huggingface.co/spaces/fb700/chatglm-fitness-RLHF) :test/qwer4321
*   [https://huggingface.co/fb700/Bofan-chatglm-Best-lora/blob/main/modelapplytest.md](https://huggingface.co/fb700/Bofan-chatglm-Best-lora/blob/main/modelapplytest.md)

[](#)
=============

*   GPT3.5

[](#)
=============

*   chatglm2-6b32kcontext4k8K16K......

[](#chatglm-6b-rlhf--lora-model)ChatGLM-6B RLHF & LoRA Model
============================================================

ChatGLM-6B ChatGLM-6B LLM

[](#)
-----------------------

*   40
*   30RM
*   SFT30fitnessRMChatGLM-6B
*   chatglm-6bchatglm2-6blangchain-chatglmchatglm-6bchatglm2-6b-7b
*   fp1620%.fp16int4int8
*   lorachatglm-6bchatglm2-6b
*   tokens
*   
*    Apache-2.0 ChatGLM2-6B  Model License
*   chatglm-6b[https://huggingface.co/THUDM/chatglm-6b](https://huggingface.co/THUDM/chatglm-6b)
*   AI\[\]2023ChatGLM-6b
*    [https://pan.baidu.com/s/1l9q\_7h8nGdelIwYlCbllMg?pwd=klhu](https://pan.baidu.com/s/1l9q_7h8nGdelIwYlCbllMg?pwd=klhu)   
*    [https://pan.quark.cn/s/d947c6dbf592](https://pan.quark.cn/s/d947c6dbf592)
*   [https://huggingface.co/fb700/Bofan-chatglm-Best-lora/blob/main/modelapplytest.md](https://huggingface.co/fb700/Bofan-chatglm-Best-lora/blob/main/modelapplytest.md)
*    [![](/fb700/chatglm-fitness-RLHF/resolve/main/glm_eval.jpg)](/fb700/chatglm-fitness-RLHF/blob/main/glm_eval.jpg)
*    [![](/fb700/chatglm-fitness-RLHF/resolve/main/lora_eva.jpg)](/fb700/chatglm-fitness-RLHF/blob/main/lora_eva.jpg)

[](#usage1-16glorachatglmlora)Usage1 16GloraChatGLMLoRA
---------------------------------------------------------------------------------------------------------------------

16GloraChatGLMLoRA (HuggingFace Transformers) First, you pass your input through the transformer model, then you get the generated sentence. Install package:

    pip install transformers 
    

    
    import sys
    from peft import PeftModel
    from transformers import AutoModel, AutoTokenizer
    sys.path.append('..')
    model = AutoModel.from_pretrained("THUDM/chatglm-6b", device_map='auto')
    model = PeftModel.from_pretrained(model, "model/chatglm_fitness_lora")#"model/chatglm_fitness_lora"lora
    model = model.half().cuda()  # fp16
    tokenizer = AutoTokenizer.from_pretrained("THUDM/chatglm-6b", trust_remote_code=True)
    sents = ['\n']
    for s in sents:
        response = model.chat(tokenizer, s, max_length=128, eos_token_id=tokenizer.eos_token_id)
        print(response)
    

output:

    
     
    
    N95
    
    1
    
    
    
    
    
    
    
    (Systemic Lupus Erythematosus,SLE)SLE
    1. SLE
    2. SLE
    3. SLE
    4. SLE
    
    
    1. SLE
    2. SLE
    3. SLE
    
    SLE
    
    
    
    1. 
    2. 
    3. 
    
    
    



    chatglm_fitness_lora
         adapter_config.json
         adapter_model.bin
    

* * *

[](#usage2-16gfp16int8int4)Usage2 16Gfp16int8int4
---------------------------------------------------------------------------------------

First, you pass your input through the transformer model, then you get the generated sentence.

    pip install transformers 
    

    
    import sys
    from peft import PeftModel
    from transformers import AutoModel, AutoTokenizer
    sys.path.append('..')
    model = AutoModel.from_pretrained("fb700/chatglm-fitness-RLHF",  device_map='auto')#fb700/chatglm-fitness-RLHFhg
    #model = PeftModel.from_pretrained(model, "model/chatglm_fitness_lora") # lora
    model = model.half().quantize(4).cuda()  # int4
    #model = model.half().quantize(8).cuda()  # int8
    #model = model.half().cuda()  # fp16
    tokenizer = AutoTokenizer.from_pretrained("fb700/chatglm-fitness-RLHF", trust_remote_code=True)
    sents = ['\n']
    for s in sents:
        response = model.chat(tokenizer, s, max_length=128, eos_token_id=tokenizer.eos_token_id)
        print(response)
    

output:

    chatglm-6bchatglm2-6b-7b
    
    RNNLSTMattention mechanism
    
    

## Model overview

The `chatglm-fitness-RLHF` is a fine-tuned version of the [ChatGLM-6B](https://huggingface.co/THUDM/chatglm-6b) language model developed by the maintainer [fb700](https://aimodels.fyi/creators/huggingFace/fb700). This model has been trained using Reinforcement Learning from Human Feedback (RLHF) to improve its conversational abilities and task-completion skills. It retains the smooth conversational flow and low deployment threshold of the original ChatGLM-6B, while introducing additional capabilities.

Similar models in the ChatGLM family include the [chatglm2-6b-int4](https://aimodels.fyi/models/huggingFace/chatglm2-6b-int4-thudm), [chatglm3-6b-32k](https://aimodels.fyi/models/huggingFace/chatglm3-6b-32k-thudm), [chatglm2-6b-32k](https://aimodels.fyi/models/huggingFace/chatglm2-6b-32k-thudm), and [chatglm3-6b-128k](https://aimodels.fyi/models/huggingFace/chatglm3-6b-128k-thudm). These models build upon the core ChatGLM architecture with various enhancements, such as improved performance, longer context handling, and more efficient inference.

## Model inputs and outputs

The `chatglm-fitness-RLHF` model is a text-to-text transformer that can generate human-like responses based on the provided input. It takes natural language text as input and produces a corresponding output text.

### Inputs
- Natural language text prompts or questions

### Outputs
- Coherent, contextual responses generated based on the input

## Capabilities

The `chatglm-fitness-RLHF` model has been fine-tuned to excel at open-ended conversation and task completion. It can engage in multi-turn dialogues, answer follow-up questions, and provide helpful information on a wide range of topics. The RLHF training has enabled the model to better understand human preferences and provide more relevant and engaging responses.

## What can I use it for?

The `chatglm-fitness-RLHF` model can be used for a variety of applications, such as building conversational AI assistants, generating helpful content, answering questions, and completing tasks. Its strong language understanding and generation capabilities make it well-suited for use cases like customer support, personal assistants, and interactive educational tools.

## Things to try

One interesting aspect of the `chatglm-fitness-RLHF` model is its ability to engage in open-ended dialogue and adapt to the user's conversational style. You could try initiating a multi-turn conversation on a topic of your choice and observe how the model responds and builds upon the discussion. Additionally, you could provide the model with complex prompts or instructions and see how it handles task completion and problem-solving.