[](#a-state-of-the-art-large-scale-pretrained-response-generation-model-dialogpt)A State-of-the-Art Large-scale Pretrained Response generation model (DialoGPT)
---------------------------------------------------------------------------------------------------------------------------------------------------------------

DialoGPT is a SOTA large-scale pretrained dialogue response generation model for multiturn conversations. The [human evaluation results](https://github.com/dreasysnail/Dialogpt_dev#human-evaluation) indicate that the response generated from DialoGPT is comparable to human response quality under a single-turn conversation Turing test. The model is trained on 147M multi-turn dialogue from Reddit discussion thread.

*   Multi-turn generation examples from an interactive environment:

Role

Response

User

Does money buy happiness?

Bot

Depends how much money you spend on it .

User

What is the best way to buy happiness ?

Bot

You just have to be a millionaire by your early 20s, then you can be happy .

User

This is so difficult !

Bot

You have no idea how hard it is to be a millionaire and happy . There is a reason the rich have a lot of money

Please find the information about preprocessing, training and full details of the DialoGPT in the [original DialoGPT repository](https://github.com/microsoft/DialoGPT)

ArXiv paper: [https://arxiv.org/abs/1911.00536](https://arxiv.org/abs/1911.00536)

### [](#how-to-use)How to use

Now we are ready to try out how the model works as a chatting partner!

    from transformers import AutoModelForCausalLM, AutoTokenizer
    import torch
    
    
    tokenizer = AutoTokenizer.from_pretrained("microsoft/DialoGPT-medium")
    model = AutoModelForCausalLM.from_pretrained("microsoft/DialoGPT-medium")
    
    # Let's chat for 5 lines
    for step in range(5):
        # encode the new user input, add the eos_token and return a tensor in Pytorch
        new_user_input_ids = tokenizer.encode(input(">> User:") + tokenizer.eos_token, return_tensors='pt')
    
        # append the new user input tokens to the chat history
        bot_input_ids = torch.cat([chat_history_ids, new_user_input_ids], dim=-1) if step > 0 else new_user_input_ids
    
        # generated a response while limiting the total chat history to 1000 tokens, 
        chat_history_ids = model.generate(bot_input_ids, max_length=1000, pad_token_id=tokenizer.eos_token_id)
    
        # pretty print last ouput tokens from bot
        print("DialoGPT: {}".format(tokenizer.decode(chat_history_ids[:, bot_input_ids.shape[-1]:][0], skip_special_tokens=True)))

## Model overview

`DialoGPT-medium` is a state-of-the-art large-scale pretrained dialogue response generation model developed by Microsoft. It is trained on 147M multi-turn dialogues from Reddit discussion threads, allowing the model to generate human-like responses in open-ended conversations. According to the [human evaluation results](https://github.com/dreasysnail/Dialogpt_dev#human-evaluation), the quality of the responses generated by `DialoGPT-medium` is comparable to human responses in a single-turn conversation Turing test.

The `DialoGPT-medium` model is part of the DialoGPT model family, which also includes the larger `DialoGPT-large` and the smaller `DialoGPT-small` versions. These models share the same architecture and training data, but differ in their size and performance characteristics.

## Model inputs and outputs

### Inputs
- Text representing the conversation history between the user and the model

### Outputs
- Text representing the model's response to continue the conversation

## Capabilities

`DialoGPT-medium` is capable of generating coherent and contextually-appropriate multi-turn responses in open-ended conversations. The model can engage in a wide range of conversational topics, from discussing the merits of wealth and happiness to providing empathetic responses. The model's ability to generate human-like responses makes it a useful tool for building conversational AI assistants.

## What can I use it for?

`DialoGPT-medium` can be used to build conversational AI assistants for a variety of applications, such as customer service, social chatbots, or virtual companions. The model's pretrained nature allows for efficient fine-tuning on specific tasks or domains, making it a versatile tool for building conversational AI systems.

## Things to try

One interesting aspect of `DialoGPT-medium` is its ability to engage in multi-turn conversations and maintain context over the course of a dialogue. Developers can experiment with using the model to build conversational agents that can remember and reference previous parts of a conversation, allowing for more natural and engaging interactions.

Another area to explore is the model's performance on specific conversational tasks or domains, such as task-oriented dialogues or empathetic responses. Developers can fine-tune the model on relevant data to assess its capabilities in these areas.