Distilgpt2

huggingface

distilgpt2

DistilGPT2 is a smaller and easier-to-run version of the GPT-2 model. It is trained using knowledge distillation and has similar functionality to GPT-2. It was trained on the OpenWebTextCorpus dataset and tokenized using a byte-level version of Byte Pair Encoding. DistilGPT2 performs slightly worse than GPT-2 on the WikiText-103 benchmark. The carbon emissions of training the model were estimated using a machine learning impact calculator.

Use cases

DistilGPT2 has a range of potential use cases for researchers and developers. Its smaller size and ease of use make it suitable for those looking to better understand and study large-scale generative language models. Additionally, DistilGPT2 can be utilized to build applications such as chatbots, virtual assistants, or content generators. Its ability to generate text makes it an invaluable tool for tasks such as language translation, summarization, and text completion. One practical use case for this model is the Write With Transformers web app, developed by the Hugging Face team, which allows users to interact with and generate text using DistilGPT2 directly from their browser. With further exploration and experimentation, this model has the potential to fuel the development of innovative and creative AI applications.

text-generation

Pricing

Cost per run
$-
USD
Avg run time
-
Seconds
Hardware
-
Prediction

Creator Models

ModelCostRuns
Distilbert Base Cased$?558,881
Camembert Base$?1,579,850
Roberta Base$?11,383,487
Bert Base Cased$?5,895,746
Gpt2$?17,806,311

Similar Models

Try it!

You can use this area to play around with demo applications that incorporate the Distilgpt2 model. These demos are maintained and hosted externally by third-party creators. If you see an error, message me on Twitter.

Overview

Summary of this model and related resources.

PropertyValue
Creatorhuggingface
Model NameDistilgpt2
Description

DistilGPT2 (short for Distilled-GPT2) is an English-language model pre-trai...

Read more »
Tagstext-generation
Model LinkView on HuggingFace
API SpecView on HuggingFace
Github LinkNo Github link provided
Paper LinkNo paper link provided

Popularity

How popular is this model, by number of runs? How popular is the creator, by the sum of all their runs?

PropertyValue
Runs1,356,355
Model Rank
Creator Rank

Cost

How much does it cost to run this model? How long, on average, does it take to complete a run?

PropertyValue
Cost per Run$-
Prediction Hardware-
Average Completion Time-