[](#triplaneguassian-model-card)TriplaneGuassian Model Card
===========================================================

[**Project Page**](https://zouzx.github.io/TriplaneGaussian/) **|** [**Paper (ArXiv)**](https://arxiv.org/abs/2312.09147) **|** [**Code**](https://github.com/VAST-AI-Research/TriplaneGaussian) **|** [**Gradio demo**](https://huggingface.co/spaces/VAST-AI/TriplaneGaussian)

[](#introduction)Introduction
-----------------------------

TGS enables fast reconstruction from single-view image in a few seconds based on a hybrid Triplane-Gaussian 3D representation.

[](#examples)Examples
---------------------

### [](#results-on-images-generated-by-midjourney)Results on Images Generated by [Midjourney](https://www.midjourney.com/)

### [](#results-on-captured-real-world-images)Results on Captured Real-world Images

[](#model-details)Model Details
-------------------------------

The model `model_lvis_rel.ckpt` is trained on Objaverse-LVIS dataset, which only includes ~45K synthetic objects.

[](#usage)Usage
---------------

You can directly download the model in this repository or employ the model in python script by:

    from huggingface_hub import hf_hub_download
    MODEL_CKPT_PATH = hf_hub_download(repo_id="VAST-AI/TriplaneGaussian", filename="model_lvis_rel.ckpt", repo_type="model")
    

More details can be found in our [Github repository](https://github.com/VAST-AI-Research/TriplaneGaussian).

[](#citation)Citation
---------------------

If you find this work helpful, please consider citing our paper:

    @article{zou2023triplane,
      title={Triplane Meets Gaussian Splatting: Fast and Generalizable Single-View 3D Reconstruction with Transformers},
      author={Zou, Zi-Xin and Yu, Zhipeng and Guo, Yuan-Chen and Li, Yangguang and Liang, Ding and Cao, Yan-Pei and Zhang, Song-Hai},
      journal={arXiv preprint arXiv:2312.09147},
      year={2023}
    }

## Model overview

The `TriplaneGaussian` model, developed by VAST-AI, enables fast 3D reconstruction from single-view images in a few seconds. It uses a hybrid Triplane-Gaussian 3D representation to achieve this. Similar models like [TripoSR](https://aimodels.fyi/models/huggingFace/triposr-stabilityai) and [LGM](https://aimodels.fyi/models/huggingFace/lgm-ashawkey) also leverage Gaussian Splatting for efficient 3D generation, while [InstantMesh](https://aimodels.fyi/models/huggingFace/instantmesh-tencentarc) and [Stable-Dreamfusion](https://aimodels.fyi/models/huggingFace/stable-dreamfusion-webaverse) focus on 3D mesh generation from images or text.

## Model inputs and outputs

The `TriplaneGaussian` model takes a single-view 2D image as input and generates a 3D reconstruction based on a hybrid Triplane-Gaussian representation. This allows for fast reconstruction in just a few seconds, making it suitable for applications that require real-time 3D content creation.

### Inputs
- Single-view 2D image

### Outputs
- 3D reconstruction based on a hybrid Triplane-Gaussian representation

## Capabilities

The `TriplaneGaussian` model has been demonstrated to work well on images generated by Midjourney, as well as captured real-world images. It can generate 3D reconstructions from these inputs in a matter of seconds, making it a powerful tool for rapid 3D content creation.

## What can I use it for?

The `TriplaneGaussian` model could be useful for a variety of applications that require fast 3D reconstruction from 2D inputs, such as 3D asset creation, virtual reality, and augmented reality. Its ability to work with both synthetic and real-world images makes it a versatile tool for both content creators and developers.

## Things to try

Experimenting with the `TriplaneGaussian` model on a variety of 2D inputs, including both synthetic and real-world images, could yield interesting results and insights. Comparing its performance to similar models like [TripoSR](https://aimodels.fyi/models/huggingFace/triposr-stabilityai), [LGM](https://aimodels.fyi/models/huggingFace/lgm-ashawkey), [InstantMesh](https://aimodels.fyi/models/huggingFace/instantmesh-tencentarc), and [Stable-Dreamfusion](https://aimodels.fyi/models/huggingFace/stable-dreamfusion-webaverse) could also provide valuable insights into the strengths and limitations of each approach.