Ptaszynski

Rank:

Average Model Cost: $0.0000

Number of Runs: 5,956

Models by this creator

bert-base-polish-cyberbullying

bert-base-polish-cyberbullying

ptaszynski

Polbert-CB - Polish BERT trained for Automatic Cyberbullying Detection This is a Polish version of BERT language model, specifically, Polbert, trained on a re-annotated and improved Dataset for Automatic Cyberbullying Detection in Polish Laguage. Fine-tuning dataset The dataset used for fine-tuning this model was based on the original Dataset for Automatic Cyberbullying Detection in Polish Laguage, which was recently additionally cleaned and re-annotated by experts from Samurai Labs. The improved dataset and will be released separately later. Acknowledgements We would like to express our gratitude to the annotators of this dataset, including original annotators, and more recent expert annotators, for their invaluable time they spent on preparing the dataset. Author Michal Ptaszynski - contact me on: Twitter: @mich_ptaszynski GitHub: ptaszynski LinkedIn: michalptaszynski HuggingFace: ptaszynski Licences The finetuned model with all attached files is licensed under CC BY-SA 4.0, or Creative Commons Attribution-ShareAlike 4.0 International License. Citations Please, cite this model using the following citation. Model: Original dataset: Improved dataset: References https://github.com/google-research/bert https://github.com/ptaszynski/cyberbullying-Polish https://huggingface.co/datasets/poleval2019_cyberbullying

Read more

$-/run

3.0K

Huggingface

yacis-electra-small-japanese-cyberbullying

yacis-electra-small-japanese-cyberbullying

yacis-electra-small-cyberbullying This is an ELECTRA Small model for the Japanese language finetuned for automatic cyberbullying detection. The original foundation model was originally pretrained on 5.6 billion words YACIS blog corpus, and later finetuned on a balanced dataset created by unifying two datasets, namely "Harmful BBS Japanese comments dataset" and "Twitter Japanese cyberbullying dataset". Model architecture The original model was pretrained using ELECTRA Small model settings and can be found here: https://huggingface.co/ptaszynski/yacis-electra-small-japanese Licenses The finetuned model with all attached files is licensed under CC BY-SA 4.0, or Creative Commons Attribution-ShareAlike 4.0 International License. Citations Please, cite this model using the following citation. The two datasets used for finetuning should be cited using the following references. Harmful BBS Japanese comments dataset: Twitter Japanese cyberbullying dataset: The pretraining was done using YACIS corpus, which should be cited using at least one of the following references.

Read more

$-/run

2.9K

Huggingface

Similar creators