Castorini

Average Model Cost: $0.0000

Number of Runs: 59,318

Models by this creator

monot5-base-msmarco-10k

castorini

monot5-base-msmarco-10k is a T5-base reranker fine-tuned on the MS MARCO passage dataset for 10,000 steps (one epoch). It targets zero-shot use: according to its model card, it usually performs better than monot5-base-msmarco on datasets that differ from MS MARCO. It can be used for document ranking. The model card links to usage examples and guidelines.
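To make the reranking step concrete, here is a minimal sketch of how a monoT5-style reranker is typically driven: each query-passage pair is turned into a text prompt, the model's logits for the target tokens "true" and "false" are compared, and passages are sorted by the resulting relevance probability. The prompt template and the true/false scoring are assumptions taken from the monoT5 paper, not from this page, and `toy_logit_fn` is a hypothetical stand-in for a real model call so that the example runs without downloading weights.

```python
import math
from typing import Callable, List, Tuple


def build_monot5_input(query: str, passage: str) -> str:
    """Format one query-passage pair with the (assumed) monoT5 prompt template."""
    return f"Query: {query} Document: {passage} Relevant:"


def relevance_from_logits(true_logit: float, false_logit: float) -> float:
    """Softmax over the 'true' and 'false' token logits; returns P('true')."""
    top = max(true_logit, false_logit)  # subtract the max for numerical stability
    exp_true = math.exp(true_logit - top)
    exp_false = math.exp(false_logit - top)
    return exp_true / (exp_true + exp_false)


def rerank(
    query: str,
    passages: List[str],
    logit_fn: Callable[[str], Tuple[float, float]],
) -> List[Tuple[float, str]]:
    """Score every passage against the query and sort by descending relevance."""
    scored = [
        (relevance_from_logits(*logit_fn(build_monot5_input(query, p))), p)
        for p in passages
    ]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)


def toy_logit_fn(prompt: str) -> Tuple[float, float]:
    """Hypothetical stand-in for the model: word overlap acts as the 'true' logit."""
    body = prompt[len("Query: "):-len(" Relevant:")]
    query, passage = body.split(" Document: ", 1)
    overlap = len(set(query.lower().split()) & set(passage.lower().split()))
    return float(overlap), 0.0


ranked = rerank(
    "what is bm25",
    ["cats sleep a lot", "bm25 is a ranking function"],
    toy_logit_fn,
)
for score, passage in ranked:
    print(f"{score:.3f}  {passage}")
```

In real use, `logit_fn` would wrap a forward pass of the fine-tuned T5 model and read the first decoding step's logits for the two target tokens; only that function would need to change.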

$-/run

26.5K

Huggingface

ance-msmarco-passage

Model Card for ance-msmarco-passage

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Model Details

Model Description: Pyserini is primarily designed to provide effective, reproducible, and easy-to-use first-stage retrieval in a multi-stage ranking architecture.
Developed by: Castorini
Shared by [Optional]: Hugging Face
Model type: Information retrieval
Language(s) (NLP): en
License: More information needed
Related Models: More information needed
Parent Model: RoBERTa
Resources for more information: GitHub Repo, Associated Paper

Uses

Direct Use: More information needed
Downstream Use [Optional]: More information needed
Out-of-Scope Use: More information needed

Bias, Risks, and Limitations

Significant research has explored bias and fairness issues with language models (see, e.g., Sheng et al. (2021) and Bender et al. (2021)). Predictions generated by the model may include disturbing and harmful stereotypes across protected classes; identity characteristics; and sensitive, social, and occupational groups.

Recommendations: Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.

Training Details

Training Data: More information needed
Training Procedure
Preprocessing: More information needed
Speeds, Sizes, Times: More information needed

Evaluation

Testing Data, Factors & Metrics
Testing Data: The model creators note in the associated Paper that:
Factors: More information needed
Metrics: More information needed
Results: More information needed

Model Examination: More information needed

Environmental Impact

Carbon emissions can be estimated using the Machine Learning Impact calculator presented in Lacoste et al. (2019).
Hardware Type: More information needed
Hours used: More information needed
Cloud Provider: More information needed
Compute Region: More information needed
Carbon Emitted: More information needed

Technical Specifications [optional]

Model Architecture and Objective: More information needed
Compute Infrastructure: More information needed
Hardware: More information needed
Software: For bag-of-words sparse retrieval, we have built in Anserini (written in Java) custom parsers and ingestion pipelines for common document formats used in IR research,

Citation

BibTeX:

Glossary [optional]: More information needed
More Information [optional]: More information needed
Model Card Authors [optional]: Castorini in collaboration with Ezi Ozoani and the Hugging Face team.
Model Card Contact: More information needed

How to Get Started with the Model

Use the code below to get started with the model.
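As a rough illustration of the dense first-stage retrieval role the card describes, the sketch below ranks passages by the inner product between a query vector and precomputed passage vectors, then keeps the top k for a later reranking stage. This is not the card's getting-started code and it does not load castorini/ance-msmarco-passage: the vectors are hypothetical toy values, the function names are illustrative, and inner-product scoring is an assumption about how ANCE embeddings are compared rather than something stated on this page.

```python
from typing import Dict, List, Sequence, Tuple


def dot(u: Sequence[float], v: Sequence[float]) -> float:
    """Inner product of two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same dimension")
    return sum(a * b for a, b in zip(u, v))


def dense_retrieve(
    query_vec: Sequence[float],
    passage_vecs: Dict[str, Sequence[float]],
    k: int,
) -> List[Tuple[str, float]]:
    """Return the k passage ids with the highest inner-product score."""
    scored = [(pid, dot(query_vec, vec)) for pid, vec in passage_vecs.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical 2-dimensional embeddings; a real encoder produces
# much higher-dimensional vectors.
toy_index = {
    "p1": [0.9, 0.1],
    "p2": [0.1, 0.9],
    "p3": [0.5, 0.5],
}
toy_query = [1.0, 0.0]

for pid, score in dense_retrieve(toy_query, toy_index, k=2):
    print(pid, round(score, 3))
```

A brute-force scan like this is only practical for small collections; at MS MARCO scale the passage vectors would normally live in a vector index, with the candidates handed to a reranker afterwards.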

$-/run

2.0K

Huggingface

Similar creators