
chronos-t5-large

Maintainer: amazon

Total Score

53

Last updated 5/16/2024


  • Model Link: View on HuggingFace
  • API Spec: View on HuggingFace
  • Github Link: No Github link provided
  • Paper Link: No paper link provided


Model overview

The chronos-t5-large model is a time series forecasting model from Amazon that is based on the T5 architecture. Like other Chronos models, it transforms time series data into sequences of tokens using scaling and quantization, and then trains a language model on these tokens to learn patterns and generate future forecasts. The chronos-t5-large model has 710M parameters, making it the largest in the Chronos family, which also includes smaller variants like chronos-t5-tiny, chronos-t5-mini, and chronos-t5-base.
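
Conceptually, the scaling-and-quantization step turns real-valued observations into a fixed vocabulary of bin ids. The sketch below is a simplified illustration of that idea only; the bin count and value range are placeholders, not the exact settings Chronos uses.

```python
import numpy as np

def tokenize_series(values, num_bins=1024, low=-15.0, high=15.0):
    """Toy illustration of Chronos-style preprocessing:
    mean-scale the series, then map each scaled value to a bin id ("token")."""
    values = np.asarray(values, dtype=float)
    scale = np.mean(np.abs(values)) or 1.0          # mean scaling
    scaled = values / scale
    edges = np.linspace(low, high, num_bins - 1)    # uniform quantization grid
    tokens = np.digitize(scaled, edges)             # bin ids in [0, num_bins - 1]
    return tokens, scale

tokens, scale = tokenize_series([112.0, 118.0, 131.0, 125.0, 140.0])
print(tokens, scale)  # the language model is trained on sequences like `tokens`
```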

Chronos models are similar to other text-to-text transformer models like CodeT5-large and the original T5-large in their use of a unified text-to-text format and encoder-decoder architecture. However, Chronos is specifically designed and trained for time series forecasting tasks, while CodeT5 and T5 are more general-purpose language models.

Model inputs and outputs

Inputs

  • Time series data: The Chronos-T5 models accept sequences of numerical time series values as input, which are then transformed into token sequences for modeling.

Outputs

  • Probabilistic forecasts: The models generate future trajectories of the time series by autoregressively sampling tokens from the trained language model. This results in a predictive distribution over future values rather than a single point forecast.

Capabilities

The chronos-t5-large model and other Chronos variants have demonstrated strong performance on a variety of time series forecasting tasks, including datasets covering domains like finance, energy, and weather. By leveraging the large-scale T5 architecture, the models are able to capture complex patterns in the training data and generalize well to new time series. Additionally, the probabilistic nature of the outputs allows the models to capture uncertainty, which can be valuable in real-world forecasting applications.

What can I use it for?

The chronos-t5-large model and other Chronos variants can be used for a wide range of time series forecasting use cases, such as:

  • Financial forecasting: Predicting stock prices, exchange rates, or other financial time series
  • Energy demand forecasting: Forecasting electricity or fuel consumption for grid operators or energy companies
  • Demand planning: Forecasting product demand to optimize inventory and supply chain management
  • Weather and climate forecasting: Predicting weather patterns, temperature, precipitation, and other climate-related variables

To use the Chronos models, you can follow the example provided in the companion repository, which demonstrates how to load the model, preprocess your data, and generate forecasts.
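
For reference, a minimal sketch of that workflow, assuming the chronos-forecasting package and its ChronosPipeline interface, could look like the following:

```python
import torch
from chronos import ChronosPipeline  # pip install chronos-forecasting

# Load the pretrained checkpoint from the Hugging Face Hub.
pipeline = ChronosPipeline.from_pretrained(
    "amazon/chronos-t5-large",
    device_map="cuda",            # use "cpu" if no GPU is available
    torch_dtype=torch.bfloat16,
)

# Historical observations of a single time series.
context = torch.tensor([112.0, 118.0, 131.0, 125.0, 140.0, 152.0, 148.0, 160.0])

# Sample future trajectories; the result is shaped
# [num_series, num_samples, prediction_length].
forecast = pipeline.predict(context, prediction_length=12)
print(forecast.shape)
```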

Things to try

One key capability of the Chronos models is their ability to handle a wide range of time series data, from financial metrics to weather measurements. Try experimenting with different types of time series data to see how the model performs. You can also explore the impact of different preprocessing steps, such as scaling, quantization, and time series transformation, on the model's forecasting accuracy.

Another interesting aspect of the Chronos models is their probabilistic nature, which allows them to capture uncertainty in their forecasts. Try analyzing the predicted probability distributions and how they change based on the input data or model configuration. This information can be valuable for decision-making in real-world applications.
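
Continuing the hypothetical forecast tensor from the earlier sketch, the sampled trajectories can be summarized into a median path and a prediction interval with plain NumPy:

```python
import numpy as np

samples = forecast[0].numpy()   # [num_samples, prediction_length] for the first series
low, median, high = np.quantile(samples, [0.1, 0.5, 0.9], axis=0)

print("median forecast:", median)
print("80% prediction interval width:", high - low)
```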



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Models


codet5-large

Salesforce

Total Score

56

codet5-large is a large-sized encoder-decoder AI model developed by Salesforce that can be used for a variety of code-related tasks. It is part of the CodeT5 family of models introduced in the paper "CodeT5: Identifier-aware Unified Pre-trained Encoder-Decoder Models for Code Understanding and Generation". Compared to the smaller codet5-base and codet5-small models, codet5-large has 770 million parameters, making it a more capable and powerful model. It was pretrained on a large dataset of code from CodeSearchNet covering six programming languages, allowing it to understand and generate code more effectively than previous models. The CodeT5+ models, including the codet5p-16b and instructcodet5p-16b checkpoints, are an even more advanced version of the CodeT5 family. These models are pretrained with additional techniques like span denoising, contrastive learning, and instruction tuning to further improve performance on code-related tasks.

Model inputs and outputs

Inputs

  • Code snippet: The model takes in a code snippet, which can be in any of the six supported programming languages (Python, Java, JavaScript, PHP, Ruby, Go).

Outputs

  • Masked token prediction: The model can be used to predict missing tokens in a partially masked code snippet.
  • Code generation: The model can also be used to generate new code, given a natural language prompt or partial code snippet.

Capabilities

codet5-large can effectively understand and manipulate code, making it useful for a variety of applications. It can be used for tasks like:

  • Code summarization: Generating natural language descriptions of code snippets.
  • Code translation: Translating code from one programming language to another.
  • Code completion: Suggesting the next few tokens in a partially written code snippet.
  • Code refactoring: Automatically improving the style and structure of code.
  • Code defect detection: Identifying bugs and issues in code.

The model's strong performance on these tasks is due to its ability to capture the semantic meaning and structure of code, which it learns from the large pretraining dataset.

What can I use it for?

codet5-large and the broader CodeT5 family of models are well-suited for any project or application that involves working with code. This could include:

  • Developer tools: Integrating the model into IDEs, code editors, or other tools to assist developers with their daily tasks.
  • Automated programming: Using the model to generate or refine code based on high-level requirements or natural language descriptions.
  • Code search and recommendation: Building systems that can retrieve relevant code snippets or suggest code examples based on a user's query.
  • Code analysis and understanding: Applying the model to tasks like code summarization, defect detection, and clone detection to gain insights about codebases.

By leveraging the capabilities of codet5-large and related models, you can potentially automate and streamline various code-related workflows, boost developer productivity, and create novel applications that combine natural language and code.

Things to try

One interesting aspect of codet5-large is its ability to handle identifiers (variable names, function names, etc.) in a more sophisticated way. The model was pretrained with a novel "identifier-aware" objective, which allows it to better understand the semantic meaning and context of these important code elements. You could try experimenting with this capability, for example, by prompting the model to generate code that uses meaningful and contextual variable names, or by evaluating its performance on tasks like identifier prediction or recovery. Exploring how the model's identifier-awareness affects its overall code understanding and generation abilities could yield interesting insights.

Another interesting direction would be to investigate the model's cross-language capabilities. Since it was pretrained on code from multiple programming languages, codet5-large may be able to effectively translate code between languages or transfer knowledge from one language to another. Experimenting with cross-language tasks could unlock new use cases for the model.
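
As a quick illustration of the masked-span prediction described above, a minimal sketch with the Hugging Face transformers library (assuming the Salesforce/codet5-large checkpoint loads with the standard T5 classes) might look like this:

```python
from transformers import AutoTokenizer, T5ForConditionalGeneration

tokenizer = AutoTokenizer.from_pretrained("Salesforce/codet5-large")
model = T5ForConditionalGeneration.from_pretrained("Salesforce/codet5-large")

# <extra_id_0> is a T5 sentinel token marking the masked span to be filled in.
text = "def greet(user): print(f'hello <extra_id_0>!')"
input_ids = tokenizer(text, return_tensors="pt").input_ids

generated_ids = model.generate(input_ids, max_length=10)
print(tokenizer.decode(generated_ids[0], skip_special_tokens=True))
```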


t5-large

google-t5

Total Score

148

The t5-large model is a large language model developed by the Google T5 team. It is part of the Text-to-Text Transfer Transformer (T5) series, which reframes NLP tasks into a unified text-to-text format. The T5 model and its larger variant t5-large are trained on a massive corpus of text data and can be applied to a wide range of NLP tasks, from translation to summarization to question answering. Compared to the smaller T5-Base model, the t5-large has 770 million parameters, making it a more powerful and capable language model. It can handle tasks in multiple languages, including English, French, Romanian, and German.

Model inputs and outputs

Inputs

  • Text strings: The t5-large model takes text as input, which can be a sentence, paragraph, or longer passage.

Outputs

  • Text strings: The model generates text as output, which can be a translation, summary, answer to a question, or completion of a given prompt.

Capabilities

The t5-large model excels at a wide variety of NLP tasks due to its text-to-text format and large parameter count. It can be used for translation between supported languages, document summarization, question answering, text generation, and more. The model's capabilities make it a versatile tool for applications that require natural language processing.

What can I use it for?

The t5-large model can be utilized in many real-world applications that involve text-based tasks. For example, it could be used to build a multilingual chatbot that can translate between languages, answer questions, and engage in open-ended conversations. It could also be leveraged to automatically summarize long documents or generate high-quality content for marketing and creative purposes. Additionally, the model's text-to-text format allows it to be fine-tuned on specific datasets or tasks, unlocking even more potential use cases. Researchers and developers can explore using t5-large as a foundation for various NLP projects and applications.

Things to try

One interesting aspect of the t5-large model is its ability to handle different NLP tasks using the same architecture and training process. This allows for efficient transfer learning, where the model can be fine-tuned on specific tasks without the need to train from scratch. Developers could experiment with fine-tuning t5-large on domain-specific datasets, such as legal documents or scientific papers, to see how the model's performance and capabilities change. Additionally, exploring the model's few-shot and zero-shot learning abilities could yield interesting insights and applications, as the model may be able to adapt to new tasks with limited training data.
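
To make the text-to-text format concrete, here is a minimal translation sketch using the Hugging Face transformers library, where the task is selected with a plain-text prefix on the input:

```python
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-large")
model = T5ForConditionalGeneration.from_pretrained("t5-large")

# The task prefix tells T5 which of its pretraining tasks to perform.
input_ids = tokenizer(
    "translate English to German: The house is wonderful.",
    return_tensors="pt",
).input_ids

outputs = model.generate(input_ids, max_new_tokens=40)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```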



mt5-large

google

Total Score

72

Google's mT5 is a massively multilingual variant of the Text-To-Text Transfer Transformer (T5) model. It was pre-trained on the mC4 dataset, which covers 101 languages. Unlike T5, which was trained only on English data, mT5 can handle a wide range of languages, making it a powerful tool for multilingual natural language processing tasks. The mT5 model comes in several sizes, including mt5-small, mt5-base, and mt5-large. These models differ in the number of parameters, with the larger models generally performing better on more complex tasks. Unlike the original T5 models, mT5 was not fine-tuned on any supervised tasks during pre-training, so it must be fine-tuned on a specific task before it can be used.

Model inputs and outputs

The mT5 model follows the text-to-text format, where both the input and output are text strings. This allows the model to be used for a wide variety of NLP tasks, including machine translation, text summarization, question answering, and more.

Inputs

  • Text in any of the 101 supported languages, formatted or prefixed as appropriate for the task the model has been fine-tuned on.

Outputs

  • Text in the target language, generated based on the input.

Capabilities

mT5 is a powerful multilingual model that can be used for a wide range of NLP tasks. It has demonstrated state-of-the-art performance on many multilingual benchmarks, thanks to its large-scale pre-training on a diverse corpus of web data.

What can I use it for?

mT5 can be a valuable tool for anyone working on multilingual NLP projects. Some potential use cases include:

  • Machine translation: Translate text between any of the 101 supported languages.
  • Text summarization: Generate concise summaries of longer text in multiple languages.
  • Question answering: Answer questions in any of the supported languages.
  • Cross-lingual information retrieval: Search for and retrieve relevant content in multiple languages.

Things to try

One interesting thing to try with mT5 is zero-shot cross-lingual transfer, where the model is fine-tuned on a task in one language and then evaluated on another. For example, you could fine-tune mT5 on a question-answering task in English, and then use the fine-tuned model to answer questions in a different language, without any additional training. This showcases the model's impressive transfer learning capabilities. Another idea is to explore the model's multilingual capabilities in depth, by evaluating its performance across a range of languages and tasks. This could help identify strengths, weaknesses, and potential areas for improvement in the model.
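
Because mT5 ships without any supervised fine-tuning, a typical workflow is to load the checkpoint, fine-tune it on a task-specific dataset, and only then run inference. A minimal loading sketch with the Hugging Face transformers library might look like this (the "summarize:" prefix is illustrative and assumes you fine-tuned with that prompt format):

```python
from transformers import AutoTokenizer, MT5ForConditionalGeneration

tokenizer = AutoTokenizer.from_pretrained("google/mt5-large")
model = MT5ForConditionalGeneration.from_pretrained("google/mt5-large")

# ... fine-tune `model` on your task here; the raw checkpoint is not ready for use ...

inputs = tokenizer("summarize: <your document here>", return_tensors="pt")
summary_ids = model.generate(**inputs, max_new_tokens=60)
print(tokenizer.decode(summary_ids[0], skip_special_tokens=True))
```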



fastchat-t5-3b-v1.0

lmsys

Total Score

343

The fastchat-t5-3b-v1.0 is an open-source chatbot model developed by the lmsys team. It is based on the Flan-T5-XL model, which is a version of the T5 language model fine-tuned on a large set of instruction-following tasks. Compared to the original T5 model, the FLAN-T5 models have been further trained on over 1,000 additional tasks, giving them stronger few-shot and zero-shot performance. The fastchat-t5-3b-v1.0 model was trained by fine-tuning the Flan-T5-XL checkpoint on user-shared conversations from ShareGPT. This allows the model to engage in more open-ended and contextual dialogue, compared to the more task-oriented FLAN-T5 models. Similar models include the longchat-7b-v1.5-32k and the t5-small and t5-base checkpoints from the original T5 model.

Model inputs and outputs

Inputs

  • Text: The fastchat-t5-3b-v1.0 model takes natural language text as input, such as questions, statements, or instructions.

Outputs

  • Text: The model outputs generated text, which can be responses to the input, continuations of the input, or answers to questions.

Capabilities

The fastchat-t5-3b-v1.0 model is capable of engaging in open-ended dialogue and responding to a wide variety of prompts. It can understand context and generate coherent and relevant responses. The model has been fine-tuned on a large dataset of real conversations, allowing it to produce more natural and contextual language compared to the more task-oriented FLAN-T5 models.

What can I use it for?

The primary intended use of the fastchat-t5-3b-v1.0 model is for commercial chatbot and virtual assistant applications. The model's strong conversational abilities make it well-suited for customer service, virtual agents, and other interactive AI applications. Researchers in natural language processing and machine learning may also find the model useful for exploring the capabilities and limitations of large language models.

Things to try

One interesting aspect of the fastchat-t5-3b-v1.0 model is its ability to engage in multi-turn dialogues and maintain context over the course of a conversation. You could try providing the model with a series of related prompts and see how it responds, building upon the previous context. Additionally, you could experiment with giving the model open-ended instructions or tasks and observe how it interprets and carries them out.
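
Since the model is a fine-tuned T5, it can be loaded with the standard seq2seq classes in the Hugging Face transformers library. The rough sketch below skips the conversation template that the FastChat serving code normally applies, so treat it as a starting point only:

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Depending on your transformers version, this checkpoint may require use_fast=False.
tokenizer = AutoTokenizer.from_pretrained("lmsys/fastchat-t5-3b-v1.0", use_fast=False)
model = AutoModelForSeq2SeqLM.from_pretrained("lmsys/fastchat-t5-3b-v1.0")

prompt = "What are three good questions to ask in a job interview?"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```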
