Maintainer: nbroad

Total Score


Last updated 5/28/2024

Model LinkView on HuggingFace
API SpecView on HuggingFace
Github LinkNo Github link provided
Paper LinkNo paper link provided

Get summaries of the top AI models delivered straight to your inbox:

Model Overview

ESG-BERT is a domain-specific BERT language model developed by Mukut Mukherjee, Charan Pothireddi, and for text mining in sustainable investing. It is a fine-tuned version of the parent BERT language model.

The model was shared by the maintainer nbroad on the HuggingFace platform. ESG-BERT is a cased language model trained on text data relevant to environmental, social, and governance (ESG) investing.

Model Inputs and Outputs


  • Text: The model takes in text data as input, such as news articles, research reports, or other documents related to sustainable investing.


  • Text classification: The primary output of ESG-BERT is text classification, where the model can categorize input text into relevant ESG-related topics or themes.


ESG-BERT is designed to excel at text mining tasks in the domain of sustainable investing. It can be used to automatically analyze large volumes of text data to identify relevant ESG-related information, trends, and insights. For example, the model could be used to classify news articles by their ESG focus, extract key ESG-related concepts and entities from investment research reports, or gauge sentiment around ESG issues in social media posts.

What Can I Use It For?

The applications of ESG-BERT extend beyond just text classification. The model can be fine-tuned to perform various other downstream NLP tasks in the domain of sustainable investing, such as:

  • ESG-focused topic modeling: Identify the main themes and topics discussed in large ESG-related text corpora.
  • ESG risk assessment: Automatically assess the ESG-related risks and exposures discussed in corporate disclosures or news reports.
  • Sustainable investment research: Extract relevant ESG data and insights from research reports to inform investment decisions.
  • ESG compliance monitoring: Automate the monitoring of ESG regulatory and disclosure requirements across industries.

Things to Try

One interesting aspect of ESG-BERT is its domain-specific training, which allows it to better understand and contextualize the language used in sustainable investing. Compared to a more general language model, ESG-BERT may be better equipped to handle ESG-related jargon, identify subtle nuances in ESG discussions, and pick up on relevant ESG themes and concepts.

Researchers and practitioners in the sustainable investing space could experiment with fine-tuning ESG-BERT on their own ESG-related text data to further enhance its performance on domain-specific tasks. The model's strong base in BERT combined with its specialized training could make it a powerful tool for extracting valuable insights from the growing body of ESG-focused information.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Models




Total Score


The bert-base-cased model is a base-sized BERT model that has been pre-trained on a large corpus of English text using a masked language modeling (MLM) objective. It was introduced in this paper and first released in this repository. This model is case-sensitive, meaning it can distinguish between words like "english" and "English". The BERT model learns a bidirectional representation of text by randomly masking 15% of the words in the input and then training the model to predict those masked words. This is different from traditional language models that process text sequentially. By learning to predict masked words in their full context, BERT can capture deeper semantic relationships in the text. Compared to similar models like bert-base-uncased, the bert-base-cased model preserves capitalization information, which can be useful for tasks like named entity recognition. The distilbert-base-uncased model is a compressed, faster version of BERT that was trained to mimic the behavior of the original BERT base model. The xlm-roberta-base model is a multilingual version of RoBERTa, capable of understanding 100 different languages. Model inputs and outputs Inputs Text**: The model takes raw text as input, which is tokenized and converted to token IDs that the model can process. Outputs Masked word predictions**: When used for masked language modeling, the model outputs probability distributions over the vocabulary for each masked token in the input. Sequence classifications**: When fine-tuned on downstream tasks, the model can output classifications for the entire input sequence, such as sentiment analysis or text categorization. Token classifications**: The model can also be fine-tuned to output classifications for individual tokens in the sequence, such as named entity recognition. Capabilities The bert-base-cased model is particularly well-suited for tasks that require understanding the full context of a piece of text, such as sentiment analysis, text classification, and question answering. Its bidirectional nature allows it to capture nuanced relationships between words that sequential models may miss. For example, the model can be used to classify whether a restaurant review is positive or negative, even if the review contains negation (e.g. "The food was not good"). By considering the entire context of the sentence, the model can understand that the reviewer is expressing a negative sentiment. What can I use it for? The bert-base-cased model is a versatile base model that can be fine-tuned for a wide variety of natural language processing tasks. Some potential use cases include: Text classification**: Classify documents, emails, or social media posts into categories like sentiment, topic, or intent. Named entity recognition**: Identify and extract entities like people, organizations, and locations from text. Question answering: Build a system that can answer questions by understanding the context of a given passage. Summarization**: Generate concise summaries of long-form text. Companies could leverage the model's capabilities to build intelligent chatbots, content moderation systems, or automated customer service applications. Things to try One interesting aspect of the bert-base-cased model is its ability to capture nuanced relationships between words, even across long-range dependencies. For example, try using the model to classify the sentiment of reviews that contain negation or sarcasm. You may find that it performs better than simpler models that only consider the individual words in isolation. Another interesting experiment would be to compare the performance of the bert-base-cased model to the bert-base-uncased model on tasks where capitalization is important, such as named entity recognition. The cased model may be better able to distinguish between proper nouns and common nouns, leading to improved performance.

Read more

Updated Invalid Date




Total Score


The bert-base-uncased model is a pre-trained BERT model from Google that was trained on a large corpus of English data using a masked language modeling (MLM) objective. It is the base version of the BERT model, which comes in both base and large variations. The uncased model does not differentiate between upper and lower case English text. The bert-base-uncased model demonstrates strong performance on a variety of NLP tasks, such as text classification, question answering, and named entity recognition. It can be fine-tuned on specific datasets for improved performance on downstream tasks. Similar models like distilbert-base-cased-distilled-squad have been trained by distilling knowledge from BERT to create a smaller, faster model. Model inputs and outputs Inputs Text Sequences**: The bert-base-uncased model takes in text sequences as input, typically in the form of tokenized and padded sequences of token IDs. Outputs Token-Level Logits**: The model outputs token-level logits, which can be used for tasks like masked language modeling or sequence classification. Sequence-Level Representations**: The model also produces sequence-level representations that can be used as features for downstream tasks. Capabilities The bert-base-uncased model is a powerful language understanding model that can be used for a wide variety of NLP tasks. It has demonstrated strong performance on benchmarks like GLUE, and can be effectively fine-tuned for specific applications. For example, the model can be used for text classification, named entity recognition, question answering, and more. What can I use it for? The bert-base-uncased model can be used as a starting point for building NLP applications in a variety of domains. For example, you could fine-tune the model on a dataset of product reviews to build a sentiment analysis system. Or you could use the model to power a question answering system for an FAQ website. The model's versatility makes it a valuable tool for many NLP use cases. Things to try One interesting thing to try with the bert-base-uncased model is to explore how its performance varies across different types of text. For example, you could fine-tune the model on specialized domains like legal or medical text and see how it compares to its general performance on benchmarks. Additionally, you could experiment with different fine-tuning strategies, such as using different learning rates or regularization techniques, to further optimize the model's performance for your specific use case.

Read more

Updated Invalid Date




Total Score


CryptoBERT is a pre-trained natural language processing (NLP) model designed to analyze the language and sentiments of cryptocurrency-related social media posts and messages. It was built by further training the vinai/bertweet-base language model on a corpus of over 3.2M unique cryptocurrency-related social media posts. This model can be useful for monitoring market sentiment and identifying potential trends or investment opportunities in the cryptocurrency space. Similar models include twitter-XLM-roBERTa-base-sentiment for general sentiment analysis on Twitter data, and BTLM-3B-8k-base for large-scale language modeling. However, CryptoBERT is specifically tailored for the cryptocurrency domain, making it potentially more accurate for tasks like cryptocurrency sentiment analysis. Model inputs and outputs Inputs Text**: The model takes in text, such as social media posts or messages, related to cryptocurrencies. Outputs Sentiment classification**: The model outputs a sentiment classification of the input text, with labels "Bearish", "Neutral", or "Bullish". Classification scores**: Along with the sentiment label, the model also outputs the probability scores for each sentiment class. Capabilities CryptoBERT can be used to analyze the sentiment of cryptocurrency-related text, which can be useful for monitoring market trends, identifying potential investment opportunities, or understanding public perception of specific cryptocurrencies. The model was trained on a large corpus of cryptocurrency-related social media posts, giving it a strong understanding of the language and sentiment in this domain. What can I use it for? You can use CryptoBERT for a variety of applications related to cryptocurrency market analysis and sentiment tracking. For example, you could use it to: Monitor social media sentiment around specific cryptocurrencies or the broader cryptocurrency market. Identify potential investment opportunities by detecting shifts in market sentiment. Analyze the sentiment of news articles, blog posts, or other cryptocurrency-related content. Incorporate sentiment data into trading strategies or investment decision-making processes. The model's maintainer has also provided a classification example, which you can use as a starting point for integrating the model into your own applications. Things to try One interesting thing to try with CryptoBERT is to compare its sentiment predictions with actual cryptocurrency market movements. You could track the model's sentiment output over time and see how well it correlates with changes in cryptocurrency prices or trading volume. This could help you understand the model's strengths and limitations in predicting market sentiment and identify potential areas for improvement. Another idea is to experiment with fine-tuning the model on additional cryptocurrency-related data, such as company announcements, developer forums, or industry reports. This could further enhance the model's understanding of the language and nuances of the cryptocurrency space, potentially improving its sentiment analysis capabilities.

Read more

Updated Invalid Date



Total Score


bert-base-multilingual-uncased is a BERT model pretrained on the top 102 languages with the largest Wikipedia using a masked language modeling (MLM) objective. It was introduced in this paper and first released in this repository. This model is uncased, meaning it does not differentiate between English and english. Similar models include the BERT large uncased model, the BERT base uncased model, and the BERT base cased model. These models vary in size and language coverage, but all use the same self-supervised pretraining approach. Model inputs and outputs Inputs Text**: The model takes in text as input, which can be a single sentence or a pair of sentences. Outputs Masked token predictions**: The model can be used to predict the masked tokens in an input sequence. Next sentence prediction**: The model can also predict whether two input sentences were originally consecutive or not. Capabilities The bert-base-multilingual-uncased model is able to understand and represent text from 102 different languages. This makes it a powerful tool for multilingual text processing tasks such as text classification, named entity recognition, and question answering. By leveraging the knowledge learned from a diverse set of languages during pretraining, the model can effectively transfer to downstream tasks in different languages. What can I use it for? You can fine-tune bert-base-multilingual-uncased on a wide variety of multilingual NLP tasks, such as: Text classification**: Categorize text into different classes, e.g. sentiment analysis, topic classification. Named entity recognition**: Identify and extract named entities (people, organizations, locations, etc.) from text. Question answering**: Given a question and a passage of text, extract the answer from the passage. Sequence labeling**: Assign a label to each token in a sequence, e.g. part-of-speech tagging, relation extraction. See the model hub to explore fine-tuned versions of the model on specific tasks. Things to try Since bert-base-multilingual-uncased is a powerful multilingual model, you can experiment with applying it to a diverse range of multilingual NLP tasks. Try fine-tuning it on your own multilingual datasets or leveraging its capabilities in a multilingual application. Additionally, you can explore how the model's performance varies across different languages and identify any biases or limitations it may have.

Read more

Updated Invalid Date