tapex-large-finetuned-wtq

Maintainer: microsoft

Total Score

51

Last updated 5/28/2024


| Property | Value |
|---|---|
| Model Link | View on HuggingFace |
| API Spec | View on HuggingFace |
| Github Link | No Github link provided |
| Paper Link | No paper link provided |


Model overview

The tapex-large-finetuned-wtq model is a large-sized TAPEX model fine-tuned on the WikiTableQuestions dataset. TAPEX is a pre-training approach proposed by researchers from Microsoft that aims to empower models with table reasoning skills. The model is based on the BART architecture, a transformer encoder-decoder model with a bidirectional encoder and autoregressive decoder.

Similar models include the TAPAS large model fine-tuned on WikiTableQuestions (WTQ) and the TAPAS base model fine-tuned on WikiTableQuestions (WTQ), which instead use the TAPAS pre-training approach for table question answering tasks.

Model inputs and outputs

Inputs

  • Table: The model takes a table as input, represented in a flattened format.
  • Question: The model also takes a natural language question about the table as input.

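As a rough illustration of what a "flattened format" means here, a table can be serialized into a single token sequence of headers and cells. The separator scheme below follows the linearization described in the TAPEX paper ("col : ... row 1 : ..."), but treat it as an illustrative assumption rather than the tokenizer's guaranteed internal format:

```python
# Sketch of a TAPEX-style table linearization. The separators are an
# assumption based on the paper's description, not the exact tokenizer output.

def flatten_table(headers, rows):
    """Serialize a table into one flat string of headers and numbered rows."""
    parts = ["col : " + " | ".join(headers)]
    for i, row in enumerate(rows, start=1):
        parts.append(f"row {i} : " + " | ".join(str(c) for c in row))
    return " ".join(parts)

headers = ["year", "city"]
rows = [[1896, "athens"], [1900, "paris"]]
flat = flatten_table(headers, rows)
print(flat)
# col : year | city row 1 : 1896 | athens row 2 : 1900 | paris
```

The flattened string and the question are then fed to the model together as one input sequence.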
Outputs

  • Answer: The model generates the answer to the given question based on the provided table.

Capabilities

The tapex-large-finetuned-wtq model is capable of answering complex questions about tables. It can handle a variety of question types, such as those that require numerical reasoning, aggregation, or multi-step logic. The model has demonstrated strong performance on the WikiTableQuestions benchmark, outperforming many previous table-based QA models.
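To make "numerical reasoning" and "aggregation" concrete, these are the kinds of operations the model must implicitly perform when decoding its answer. A plain-Python equivalent over a toy table (hypothetical data) looks like:

```python
# The model answers such questions end-to-end from text; this sketch only
# spells out the underlying table operations for a toy example.
table = [
    {"country": "china", "population_m": 1412},
    {"country": "india", "population_m": 1408},
    {"country": "usa",   "population_m": 332},
]

# "How many countries are listed?" -> aggregation (COUNT)
count = len(table)

# "Which country has the largest population?" -> argmax reasoning
largest = max(table, key=lambda r: r["population_m"])["country"]

# "What is the combined population of china and india?" -> filter + SUM
combined = sum(r["population_m"] for r in table
               if r["country"] in {"china", "india"})

print(count, largest, combined)  # 3 china 2820
```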

What can I use it for?

You can use the tapex-large-finetuned-wtq model for table question answering tasks, where you have a table and need to answer natural language questions about the content of the table. This could be useful in a variety of applications, such as:

  • Providing intelligent search and question-answering capabilities for enterprise data tables
  • Enhancing business intelligence and data analytics tools with natural language interfaces
  • Automating the extraction of insights from tabular data in research or scientific domains

Things to try

One interesting aspect of the TAPEX model is its ability to learn table reasoning skills through pre-training on a synthetic corpus of executable SQL queries. You could experiment with fine-tuning the model on your own domain-specific tabular data, leveraging this pre-trained table reasoning capability to improve performance on your specific use case.
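The pre-training idea can be sketched as executing SQL queries against tables to produce (query, answer) supervision pairs. Here is a minimal sketch using sqlite3, with hypothetical query templates standing in for the actual TAPEX synthetic corpus:

```python
import sqlite3

# A small in-memory table standing in for a sampled web table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE olympics (year INTEGER, city TEXT)")
conn.executemany("INSERT INTO olympics VALUES (?, ?)",
                 [(1896, "athens"), (1900, "paris"), (1904, "st louis")])

# Sample SQL templates; executing them yields the answers that a
# TAPEX-style pre-training objective learns to reproduce.
templates = [
    "SELECT city FROM olympics WHERE year = 1900",
    "SELECT COUNT(*) FROM olympics",
    "SELECT MAX(year) FROM olympics",
]

pairs = []
for sql in templates:
    answer = conn.execute(sql).fetchone()[0]
    pairs.append((sql, answer))

for sql, answer in pairs:
    print(sql, "->", answer)
```

Each pair couples an executable query with its ground-truth result, which is what gives the model its table reasoning signal before any natural language fine-tuning.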

Additionally, you could explore combining the tapex-large-finetuned-wtq model with other language models or task-specific architectures to create more powerful table-based question-answering systems. The modular nature of transformer-based models makes it easy to experiment with different model configurations and integration approaches.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Models


tapas-large-finetuned-wtq

google

Total Score

93

The tapas-large-finetuned-wtq model is a large version of the TAPAS model, fine-tuned on the WikiTable Questions (WTQ) dataset. TAPAS is a BERT-like transformer model pretrained on a large corpus of English data from Wikipedia, with the goal of learning to understand and reason about tables. The model was first pretrained on masked language modeling (MLM) and an "intermediate pretraining" task, then fine-tuned sequentially on the SQA, WikiSQL, and finally the WTQ datasets. This allows the model to learn to answer questions about the contents of tables effectively. Smaller versions of the TAPAS model are also available, ranging from tapas-base-finetuned-wtq down to tapas-tiny-finetuned-wtq, which trade off model size against performance. The tapas-large-finetuned-wtq model achieves the highest performance on the WTQ dataset, with a dev accuracy of 50.97%.

Model inputs and outputs

Inputs

  • Question: A natural language question about the contents of a table
  • Table: A tabular dataset, represented as a flattened sequence of tokens

Outputs

  • Answer: The predicted answer to the input question, generated by the model

Capabilities

The tapas-large-finetuned-wtq model can answer questions about the contents of tables, leveraging its pretraining on large corpora of tabular data and fine-tuning on datasets like WTQ. This allows it to understand the semantics of tables and extract the information needed to answer questions. For example, given a table of countries and their populations, the model could answer questions like "What is the population of China?" or "Which country has the largest population?". Its strong performance on the WTQ benchmark demonstrates its ability to handle a wide range of table-based question answering tasks.

What can I use it for?

You can use the tapas-large-finetuned-wtq model for a variety of table-based question answering applications. Some potential use cases include:

  • Building intelligent search or question-answering systems that can understand and reason about tabular data, such as financial reports, scientific datasets, or product information
  • Enhancing business intelligence and data analysis tools by allowing users to query tables using natural language
  • Developing educational or tutoring applications that help students learn by answering questions about data presented in tables

The model could also be fine-tuned further on domain-specific datasets to adapt it to particular applications or industries.

Things to try

One interesting thing to try with the tapas-large-finetuned-wtq model is to explore how it handles different types of tables and questions. For example, you could feed it tables with varying structures (wide vs. tall, sparse vs. dense) and see how its performance changes. You could also experiment with different question types, such as those requiring numerical reasoning, aggregation, or multi-hop inference. Additionally, you could compare the different TAPAS model sizes (tapas-base-finetuned-wtq, tapas-medium-finetuned-wtq, etc.) to see how the trade-off between model size and accuracy plays out for your particular use case.
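Unlike TAPEX's generative decoding, TAPAS answers by selecting a set of table cells and predicting an aggregation operator over them (NONE, SUM, COUNT, AVERAGE). Applying such a prediction is straightforward; the sketch below uses made-up predictions, not actual model output:

```python
# Given TAPAS-style predictions (selected cell values plus an aggregation
# operator), compute the final answer. The operator set matches the one
# described for WTQ-style fine-tuning; the predictions are mocked.

def apply_aggregation(op, cells):
    if op == "NONE":
        return cells[0] if len(cells) == 1 else cells
    if op == "COUNT":
        return len(cells)
    if op == "SUM":
        return sum(cells)
    if op == "AVERAGE":
        return sum(cells) / len(cells)
    raise ValueError(f"unknown operator: {op}")

# "What is the combined population of china and india?" (hypothetical cells)
print(apply_aggregation("SUM", [1412, 1408]))      # 2820
print(apply_aggregation("COUNT", [1412, 1408]))    # 2
print(apply_aggregation("AVERAGE", [1412, 1408]))  # 1410.0
```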



tapas-base-finetuned-wtq

google

Total Score

183

The tapas-base-finetuned-wtq model is a fine-tuned version of the TAPAS base model, trained sequentially on the SQA, WikiSQL, and WikiTable Questions (WTQ) datasets. It is designed for table-based question answering, where the goal is to answer natural language questions based on the content of a given table.

Model inputs and outputs

Inputs

  • Table: A relational table with headers and cell values
  • Question: A natural language question about the contents of the table

Outputs

  • Answer: The model produces an answer to the input question, based on the information contained in the table

Capabilities

The tapas-base-finetuned-wtq model can effectively answer questions about the contents of tables, leveraging its understanding of table structure and semantics. It can handle a variety of table-based question types, including those that require reasoning across multiple cells or columns.

What can I use it for?

This model can be useful for building applications that involve question answering over tabular data, such as customer support chatbots, business intelligence tools, or educational resources. By integrating this model, you can let users quickly find answers to their questions without manually searching through tables.

Things to try

One interesting aspect of the tapas-base-finetuned-wtq model is its ability to handle questions that require reasoning across multiple cells or columns of a table. Try experimenting with questions that reference different parts of the table, and observe how the model understands the relationships between the various elements and provides a relevant answer.



t5-base-finetuned-wikiSQL

mrm8488

Total Score

52

The t5-base-finetuned-wikiSQL model is a variant of Google's T5 (Text-to-Text Transfer Transformer) model that has been fine-tuned on the WikiSQL dataset for English-to-SQL translation. T5 was introduced in the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer", which presented a unified framework for converting various NLP tasks into a text-to-text format, allowing the model to be applied to a wide range of tasks including summarization, question answering, and text classification. The t5-base-finetuned-wikiSQL model takes advantage of this text-to-text format by fine-tuning the base T5 model on WikiSQL, which contains pairs of natural language questions and their corresponding SQL queries. This teaches the model to translate natural language questions into SQL statements, making it useful for tasks like building user-friendly database interfaces or automating database queries.

Model inputs and outputs

Inputs

  • Natural language question: A question about data stored in a database

Outputs

  • SQL query: The SQL query that corresponds to the input question, which can then be executed against the database

Capabilities

The t5-base-finetuned-wikiSQL model has shown strong performance on the WikiSQL benchmark, demonstrating its ability to translate natural language questions into executable SQL queries. This is especially useful for building conversational interfaces or natural language query tools for databases, where users can interact with the system in plain language rather than learning SQL syntax.

What can I use it for?

The t5-base-finetuned-wikiSQL model can be used to build applications that let users interact with databases using natural language. Some potential use cases include:

  • Conversational database interfaces: Develop chatbots or voice assistants that answer questions and execute queries on a database by translating the user's natural language input into SQL
  • Automated report generation: Generate SQL queries from user prompts, then execute those queries to automatically produce reports or data summaries
  • Business intelligence tools: Integrate the model into BI dashboards or analytics platforms, allowing users to explore data by asking questions in plain language rather than writing SQL

Things to try

One interesting aspect of the t5-base-finetuned-wikiSQL model is its potential to handle more complex, multi-part questions that require combining information from different parts of a database. While the model was trained on WikiSQL, which focuses on single-table queries, it may be possible to fine-tune or adapt it to handle more sophisticated queries involving joins, aggregations, and subqueries. Another area to explore is combining the model with other language models or reasoning components to create more advanced database interaction systems; for example, pairing the SQL translation capabilities with a question answering model could let users not only execute queries but also receive natural language summaries of the results.
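The translate-then-execute flow behind these use cases can be sketched end to end with sqlite3. The translation step is mocked out here; in practice the model's generated SQL would replace `mock_translate`, whose name and lookup table are purely hypothetical:

```python
import sqlite3

def mock_translate(question):
    """Stand-in for the model's question-to-SQL translation step."""
    lookup = {
        "How many employees are in sales?":
            "SELECT COUNT(*) FROM employees WHERE dept = 'sales'",
        "What is the highest salary?":
            "SELECT MAX(salary) FROM employees",
    }
    return lookup[question]

# A toy database to execute the translated queries against.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE employees (name TEXT, dept TEXT, salary INTEGER)")
conn.executemany("INSERT INTO employees VALUES (?, ?, ?)", [
    ("ann", "sales", 50000),
    ("bob", "sales", 60000),
    ("eve", "eng",   90000),
])

sql = mock_translate("How many employees are in sales?")
print(conn.execute(sql).fetchone()[0])  # 2
```

Validating the generated SQL (for example, rejecting queries that reference unknown tables) before execution is a sensible safeguard in any real deployment.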



pip-sql-1.3b

PipableAI

Total Score

72

The pip-sql-1.3b model, developed by PipableAI, is a 1.3 billion parameter text-to-SQL model that outperforms most SQL expert models, and even GPT-3.5, on popular benchmarks. It is a distilled version of the DeepSeek base model, trained using a combination of softmax cross entropy, a modified policy gradient, and Q loss in an EM setup; this training approach has enabled the model to achieve exceptional performance on text-to-SQL tasks. Compared to similar models like distilbert-base-cased-distilled-squad, sqlcoder-70b-alpha, and sqlcoder, the pip-sql-1.3b model stands out for its significant performance improvements on SQL-related tasks, making it a valuable tool for analysts and developers working with SQL databases.

Model inputs and outputs

Inputs

  • Schema: The schema of the database that the SQL query will be executed against
  • Question: The natural language question that the model will translate into a SQL query

Outputs

  • SQL query: The SQL query generated by the model from the provided schema and question

Capabilities

The pip-sql-1.3b model excels at translating natural language questions into SQL queries. It outperforms most SQL expert models, and even GPT-3.5, on benchmarks like Semantic Evaluation for Text-to-SQL with Distilled Test Suites and Defog SQL-Eval. For example, on the Semantic Evaluation benchmark, it achieves an overall accuracy of 42.1% on the "hard" and "extra" difficulty questions, significantly higher than GPT-3.5's 31%.

What can I use it for?

The pip-sql-1.3b model can be a valuable tool for developers, analysts, and anyone working with SQL databases. It can quickly generate SQL queries from natural language questions, saving time and effort, particularly for non-technical users who need to extract data from a database but are not proficient in SQL. Its strong performance on SQL-related tasks also makes it a compelling choice for building applications that need natural language interfaces to databases, such as chatbots, voice assistants, or data visualization tools.

Things to try

One interesting aspect of the pip-sql-1.3b model is its novel training approach, which combines softmax cross entropy, a modified policy gradient, and Q loss in an EM setup. This approach has enabled the model to outperform even much larger models like GPT-3.5 on text-to-SQL tasks. Researchers and developers interested in advancing natural language interfaces for databases could explore ways to refine or build upon this training approach. Additionally, testing the model on a wider range of SQL-related tasks, or evaluating its robustness to different database schemas and query types, could provide valuable insights into its capabilities and limitations.
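Since the model takes both a schema and a question, the two must be packaged into a single prompt. The tag-based layout below is an assumption based on the pattern shown on the model's HuggingFace card; verify the exact tags against the current card before relying on them:

```python
# Build a pip-sql-1.3b style prompt. The <schema>/<question>/<sql> tag
# layout is an assumption drawn from the published model card.

def build_prompt(schema, question):
    return f"<schema>{schema}</schema><question>{question}</question><sql>"

schema = "CREATE TABLE sales (region TEXT, revenue INTEGER)"
question = "Which region had the highest revenue?"
prompt = build_prompt(schema, question)
print(prompt)
```

The trailing `<sql>` tag cues the model to continue the sequence with the generated query.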
