Kssteven
Rank:Average Model Cost: $0.0000
Number of Runs: 11,453
Models by this creator
ibert-roberta-base
ibert-roberta-base
ibert-roberta-base is an integer-only quantized version of the RoBERTa model. It stores all parameters in INT8 representation and performs inference using integer-only arithmetic. This results in up to 4x faster inference compared to the floating-point counterpart. The finetuning procedure for I-BERT consists of three stages: full-precision finetuning, model quantization, and integer-only finetuning. After full-precision finetuning, the model can be quantized by setting the quantize attribute in the config.json file. The quantized model can then be finetuned using integer-only operations. If you use I-BERT, please cite the paper.
$-/run
11.3K
Huggingface
ibert-roberta-large-mnli
$-/run
150
Huggingface
ibert-roberta-large
$-/run
15
Huggingface