Models by this creator
Polbert-CB - Polish BERT trained for Automatic Cyberbullying Detection

This is a Polish version of the BERT language model, specifically Polbert, fine-tuned on a re-annotated and improved Dataset for Automatic Cyberbullying Detection in Polish Language.

Fine-tuning dataset
The dataset used for fine-tuning this model is based on the original Dataset for Automatic Cyberbullying Detection in Polish Language, which was recently further cleaned and re-annotated by experts from Samurai Labs. The improved dataset will be released separately later.

Acknowledgements
We would like to thank the annotators of this dataset, both the original annotators and the more recent expert annotators, for the invaluable time they spent preparing it.

Author
Michal Ptaszynski. Contact:
Twitter: @mich_ptaszynski
GitHub: ptaszynski
LinkedIn: michalptaszynski
HuggingFace: ptaszynski

Licences
The fine-tuned model with all attached files is licensed under CC BY-SA 4.0 (Creative Commons Attribution-ShareAlike 4.0 International License).

Citations
Please cite this model using the following citation.
Model:
Original dataset:
Improved dataset:

References
https://github.com/google-research/bert
https://github.com/ptaszynski/cyberbullying-Polish
https://huggingface.co/datasets/poleval2019_cyberbullying
yacis-electra-small-cyberbullying

This is an ELECTRA Small model for the Japanese language, fine-tuned for automatic cyberbullying detection. The foundation model was pretrained on the 5.6-billion-word YACIS blog corpus and later fine-tuned on a balanced dataset created by unifying two datasets: the "Harmful BBS Japanese comments dataset" and the "Twitter Japanese cyberbullying dataset".

Model architecture
The original model was pretrained using ELECTRA Small model settings and can be found here: https://huggingface.co/ptaszynski/yacis-electra-small-japanese

Licenses
The fine-tuned model with all attached files is licensed under CC BY-SA 4.0 (Creative Commons Attribution-ShareAlike 4.0 International License).

Citations
Please cite this model using the following citation.
The two datasets used for fine-tuning should be cited using the following references.
Harmful BBS Japanese comments dataset:
Twitter Japanese cyberbullying dataset:
The pretraining was done using the YACIS corpus, which should be cited using at least one of the following references.
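
Usage sketch
The card above does not include usage code, so the following is only a minimal sketch of how a fine-tuned sequence-classification model such as this one might be queried with the Hugging Face `transformers` library. The repository ID in `MODEL_ID` is a hypothetical placeholder, not taken from the card, and the helper names `softmax`, `top_label`, and `classify` are introduced here purely for illustration. Label names are read from the model's own configuration, since the card does not list them.

```python
import math
from typing import Dict, List, Sequence

# ASSUMPTION: hypothetical Hub repository ID; replace it with the actual
# name of the fine-tuned model before running classify().
MODEL_ID = "ptaszynski/yacis-electra-small-japanese-cyberbullying"


def softmax(logits: Sequence[float]) -> List[float]:
    """Convert raw logits into probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_label(logits: Sequence[float], id2label: Dict[int, str]) -> Dict[str, object]:
    """Return the most probable label and its probability."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return {"label": id2label[best], "score": probs[best]}


def classify(text: str, model_id: str = MODEL_ID) -> Dict[str, object]:
    """Classify one text with a fine-tuned sequence-classification model.

    Requires `torch` and `transformers`; they are imported lazily so the
    helpers above can be used without them.
    """
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    model.eval()

    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits[0].tolist()

    # Label names come from the model's config, not from this sketch.
    return top_label(logits, model.config.id2label)
```

Calling `classify("...")` with a Japanese sentence would return a dictionary holding the predicted label and its probability. Depending on the tokenizer configuration shipped with the model, additional Japanese tokenization packages may need to be installed.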