LLaVA 13b Delta V0 Science_qa

liuhaotian

LLaVA-13b-delta-v0-science_qa

NOTE: This "delta model" cannot be used directly. Users have to apply it on top of the original LLaMA weights to get actual LLaVA weights. See https://github.com/haotian-liu/LLaVA#llava-weights for instructions.

LLaVA Model Card

Model details

Model type: LLaVA is an open-source chatbot trained by fine-tuning LLaMA/Vicuna on GPT-generated multimodal instruction-following data. It is an auto-regressive language model based on the transformer architecture. This model is fine-tuned on the ScienceQA dataset.

Model date: LLaVA was trained in April 2023.

Paper or resources for more information: https://llava-vl.github.io/

License: Apache License 2.0

Where to send questions or comments about the model: https://github.com/haotian-liu/LLaVA/issues

Intended use

Primary intended uses: The primary use of LLaVA is research on large multimodal models and chatbots.

Primary intended users: The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.

Training dataset

595K filtered image-text pairs from CC3M.
ScienceQA dataset.

Evaluation dataset

A preliminary evaluation of model quality uses a set of 90 visual reasoning questions built from 30 unique images randomly sampled from COCO val 2014. Each image is associated with three types of questions: conversational, detailed description, and complex reasoning. We use GPT-4 to judge the model outputs. We also evaluate our model on the ScienceQA dataset. Our synergy with GPT-4 sets a new state of the art on the dataset. See https://llava-vl.github.io/ for more details.
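The note at the top of the model card says the delta has to be applied on top of the original LLaMA weights. The idea can be sketched in a few lines of Python. This is a toy illustration only, not the project's actual conversion script: the function name `apply_delta`, the tensor names, and the use of plain lists instead of real tensors are all assumptions made for the example. The linked instructions remain the authoritative procedure.

```python
from typing import Dict, List

# Hypothetical stand-in for a model state dict: tensor name -> flat list of values.
Weights = Dict[str, List[float]]


def apply_delta(base: Weights, delta: Weights) -> Weights:
    """Return target weights: base + delta for shared tensors, delta alone otherwise."""
    target: Weights = {}
    for name, delta_values in delta.items():
        if name not in base:
            # Assumption: tensors that exist only in the delta (for example, newly
            # added multimodal layers) are copied over unchanged.
            target[name] = list(delta_values)
            continue
        base_values = base[name]
        if len(base_values) != len(delta_values):
            raise ValueError(f"shape mismatch for {name!r}")
        # Element-wise sum recovers the fine-tuned weight from base and delta.
        target[name] = [b + d for b, d in zip(base_values, delta_values)]
    return target


# Toy values, not real model weights.
base = {"layer.weight": [1.0, 2.0, 3.0]}
delta = {"layer.weight": [0.5, -1.0, 0.25], "projector.weight": [0.1, 0.2]}
target = apply_delta(base, delta)
print(target)
```

Distributing only the difference means the published files are not usable on their own, which is why the card insists on obtaining the base LLaMA weights first.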
text-generation

Pricing

Cost per run: $- USD
Avg run time: - Seconds
Prediction hardware: -

Creator Models

Model | Cost | Runs
LLaVA 13b Delta V0 | $? | 505
LLaVA Pretrained Projectors | $? | 0
Llava Vicuna 7b V1.1 Lcs_558k Instruct_80k_1e Lora Preview_alpha | $? | 32
LLaVA 7b Delta V0 | $? | 667
LLaVA Lightning MPT 7B Preview | $? | 1,820

Overview

Summary of this model and related resources.

Property | Value
Creator | liuhaotian
Model Name | LLaVA 13b Delta V0 Science_qa
Description | NOTE: This "delta model" cannot be used directly. Users have to apply it on ...
Tags | text-generation
Model Link | View on HuggingFace
API Spec | View on HuggingFace
Github Link | No Github link provided
Paper Link | No paper link provided

Popularity

How popular is this model, by number of runs? How popular is the creator, by the sum of all their runs?

Property | Value
Runs | 57
Model Rank |
Creator Rank |

Cost

How much does it cost to run this model? How long, on average, does it take to complete a run?

Property | Value
Cost per Run | $-
Prediction Hardware | -
Average Completion Time | -