granite-34b-code-instruct

Maintainer: ibm-granite

Total Score

61

Last updated 6/13/2024


  • Model Link: View on HuggingFace
  • API Spec: View on HuggingFace
  • Github Link: No Github link provided
  • Paper Link: No paper link provided


Model Overview

granite-34b-code-instruct is a 34B parameter model fine-tuned from the granite-34b-code-base model on a combination of permissively licensed instruction data to enhance its instruction following capabilities, including logical reasoning and problem-solving skills. It was developed by IBM Research.

Similar models include the granite-8b-code-instruct and CodeLlama-34B-Instruct-GPTQ models. The granite-8b-code-instruct model is an 8B parameter version of the code instruction model, while the CodeLlama-34B-Instruct-GPTQ model is a 34B parameter Meta model quantized by the community for faster inference.

Model Inputs and Outputs

Inputs

  • The model takes in text prompts, which can include instructions or coding tasks.

Outputs

  • The model generates text responses, which can include code snippets, explanations, or solutions to the given prompts.
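Instruction-tuned code models like this one expect the prompt to be wrapped in the model's chat template. The canonical template ships with the tokenizer on HuggingFace (via tokenizer.apply_chat_template); the plain Question/Answer framing below is an illustrative assumption, not the official format:

```python
def build_instruct_prompt(instruction: str) -> str:
    """Wrap a user instruction in a Question/Answer style prompt.

    The exact template is defined by the model's tokenizer on HuggingFace;
    this framing is an illustrative assumption for demonstration only.
    """
    return f"Question:\n{instruction}\n\nAnswer:\n"

prompt = build_instruct_prompt("Write a Python function that reverses a string.")
```

The resulting string would then be tokenized and passed to the model, whose generated continuation is the answer text.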

Capabilities

The granite-34b-code-instruct model is designed to excel at responding to coding-related instructions and can be used to build coding assistants. It has strong logical reasoning and problem-solving skills, allowing it to generate relevant and helpful code in response to prompts.

What can I use it for?

The granite-34b-code-instruct model could be used to develop a variety of coding assistant applications, such as:

  • Code generation and completion tools
  • Automated programming helpers
  • Natural language-to-code translation interfaces
  • Educational coding tutors

By leveraging the model's instruction following and problem-solving capabilities, developers can create tools that make it easier for users to write and understand code.
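One way to structure such a tool is to keep the application logic independent of the model backend. The sketch below assumes a generic `generate` callable standing in for a real inference backend (for example, a HuggingFace pipeline over granite-34b-code-instruct); the class name and prompt framing are illustrative assumptions:

```python
from typing import Callable

class CodingAssistant:
    """Thin wrapper turning any text-generation callable into a
    natural-language-to-code interface. The `generate` parameter
    stands in for a real model backend."""

    def __init__(self, generate: Callable[[str], str]):
        self.generate = generate

    def translate(self, request: str) -> str:
        # Frame the user's request as an explicit coding instruction.
        prompt = f"Question:\n{request}\n\nAnswer:\n"
        return self.generate(prompt)

# Stubbed backend for demonstration; a real deployment would call the model.
def fake_backend(prompt: str) -> str:
    return "def add(a, b):\n    return a + b"

assistant = CodingAssistant(fake_backend)
reply = assistant.translate("Write an add function.")
```

Injecting the backend this way makes it easy to swap between the 34B and 8B Granite variants, or to unit-test the surrounding tooling without loading a model.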

Things to Try

One interesting thing to try with the granite-34b-code-instruct model is to provide it with open-ended prompts about coding problems or tasks, and see how it responds. The model's ability to understand and reason about code-related instructions could lead to creative and unexpected solutions.

Another idea is to fine-tune the model further on domain-specific data or tasks, such as a particular programming language or software framework, to see if it can develop even more specialized capabilities.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Models


granite-8b-code-instruct

ibm-granite

Total Score

92

The granite-8b-code-instruct model is an 8 billion parameter language model fine-tuned by IBM Research to enhance instruction following capabilities, including logical reasoning and problem-solving skills. The model is built on the Granite-8B-Code-Base foundation model, which was pre-trained on a large corpus of permissively licensed code data. This fine-tuning process aimed to imbue the model with strong abilities to understand and execute coding-related instructions.

Model Inputs and Outputs

The granite-8b-code-instruct model is designed to accept natural language instructions and generate relevant code or text responses. Its inputs can include a wide range of coding-related prompts, such as requests to write functions, debug code, or explain programming concepts. The model's outputs are similarly broad, spanning generated code snippets, explanations, and other text-based responses.

Inputs

  • Natural language instructions or prompts related to coding and software development

Outputs

  • Generated code snippets
  • Text-based responses explaining programming concepts
  • Debugging suggestions or fixes for code issues

Capabilities

The granite-8b-code-instruct model excels at understanding and executing coding-related instructions. It can be used to build intelligent coding assistants that can help with tasks like generating boilerplate code, explaining programming concepts, and debugging issues. The model's strong logical reasoning and problem-solving skills make it well-suited for a variety of software development and engineering use cases.

What Can I Use It For?

The granite-8b-code-instruct model can be used to build a wide range of applications, from intelligent coding assistants to automated code generation tools. Developers could leverage the model to create conversational interfaces that help users write, understand, and troubleshoot code. Researchers could explore the model's capabilities in areas like program synthesis, code summarization, and language-guided software engineering.

Things to Try

One interesting application of the granite-8b-code-instruct model could be to use it as a foundation for building a collaborative, AI-powered coding environment. By integrating the model's instruction following and code generation abilities, developers could create a tool that assists with tasks like pair programming, code review, and knowledge sharing. Another potential use case could be to fine-tune the model further on domain-specific datasets to create specialized code intelligence models for industries like finance, healthcare, or manufacturing.




CodeLlama-34B-Instruct-GPTQ

TheBloke

Total Score

75

The CodeLlama-34B-Instruct-GPTQ model is a GPT-based language model created by Meta and quantized for improved efficiency by TheBloke. It is part of the larger CodeLlama family of models, which includes versions optimized for general code synthesis and understanding, as well as a Python-focused variant. This Instruct version has been fine-tuned for following instructions and generating safer, more helpful responses. Compared to similar models like the Llama-2-13B-GPTQ and Phind-CodeLlama-34B-v2-GPTQ, the CodeLlama-34B-Instruct-GPTQ model is larger, more specialized for instruction following, and has been quantized with various GPTQ configurations to balance performance and efficiency.

Model inputs and outputs

Inputs

  • Text: The model accepts natural language text as input, which can include instructions, questions, or other prompts.

Outputs

  • Text: The model generates relevant text as a response, often in the form of code, explanations, or answers to the provided input.

Capabilities

The CodeLlama-34B-Instruct-GPTQ model is capable of a wide range of text-based tasks, including code generation, code understanding, answering questions, and following instructions. It excels at coding-related tasks and can be used to assist with programming, software development, and engineering projects.

What can I use it for?

This model can be used for a variety of applications, such as:

  • Code assistant: Use the model to generate, explain, or debug code snippets in response to natural language prompts.
  • Technical Q&A: Deploy the model to power a question-answering system for technical topics, such as programming languages, software, or engineering concepts.
  • Automated programming: Integrate the model into a system that can automatically generate code to solve specific problems or implement desired functionality.
  • Educational tools: Leverage the model's capabilities to create interactive learning experiences, coding exercises, or programming tutorials.

Things to try

One interesting aspect of the CodeLlama-34B-Instruct-GPTQ model is its ability to follow instructions and generate helpful, safe responses. Try providing the model with prompts that involve complex, multi-step tasks or safety-critical scenarios, and observe how it handles the instructions and generates appropriate outputs. Another useful feature to explore is the model's versatility in handling different programming languages. Try prompting the model with requests involving a variety of languages, such as Python, C++, JavaScript, and more, to see how it adapts and responds.
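When experimenting with instruction prompts like the ones suggested above, the input should follow the Llama-2 style [INST] convention that CodeLlama-Instruct models use. The helper below sketches that format; the optional <<SYS>> system block mirrors the Llama-2 convention and is an assumption, not taken verbatim from this model card:

```python
from typing import Optional

def build_codellama_prompt(instruction: str, system: Optional[str] = None) -> str:
    """Build a CodeLlama-Instruct style prompt using the Llama-2
    [INST] convention. The <<SYS>> block is optional and follows the
    general Llama-2 chat convention (an assumption here)."""
    if system:
        instruction = f"<<SYS>>\n{system}\n<</SYS>>\n\n{instruction}"
    return f"[INST] {instruction} [/INST]"

prompt = build_codellama_prompt("Write a quicksort function in JavaScript.")
```

In practice the model's tokenizer or serving framework may apply this template automatically, so check how your inference stack handles chat formatting before wrapping prompts by hand.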



CodeLlama-34b-Instruct-hf

codellama

Total Score

267

The CodeLlama-34b-Instruct-hf is a large language model developed by codellama as part of the Code Llama collection. This 34 billion parameter model is designed specifically for general code synthesis and understanding tasks. It builds upon the base Code Llama model and adds specialized instruction-following capabilities for safer and more controlled deployment as a code assistant application. Other variants in the Code Llama family include the Python-focused 34B model and the 7B and 13B instruct-tuned versions.

Model inputs and outputs

The CodeLlama-34b-Instruct-hf model takes in text input and generates text output. It is particularly adept at code-related tasks like completion, infilling, and following instructions. The model can handle a wide range of programming languages, but is specialized for Python.

Inputs

  • Text prompts for the model to continue or complete

Outputs

  • Generated text, often in the form of code snippets or responses to instructions

Capabilities

The CodeLlama-34b-Instruct-hf model is capable of a variety of code-related tasks. It can complete partially written code, fill in missing code segments, and follow instructions to generate new code. The model also has strong language understanding abilities, allowing it to engage in code-related dialog and assist with programming tasks.

What can I use it for?

The CodeLlama-34b-Instruct-hf model can be used for a wide range of applications related to code generation and understanding. Potential use cases include code completion tools, programming assistants, and even automated programming. Developers could integrate the model into their workflows to boost productivity and creativity. However, as with all large language models, care must be taken when deploying the CodeLlama-34b-Instruct-hf to ensure safety and ethical use. Developers should review the Responsible Use Guide before integrating the model.

Things to try

One interesting aspect of the CodeLlama-34b-Instruct-hf model is its ability to handle code-related instructions and dialog. Developers could experiment with prompting the model to explain programming concepts, debug code snippets, or even pair program by taking turns generating code. The model's strong language understanding capabilities make it well-suited for these types of interactive coding tasks.
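The infilling capability mentioned above works by surrounding the missing region with sentinel markers so the model generates the middle span. The sketch below follows the <PRE>/<SUF>/<MID> scheme described for the Code Llama family; in real use the tokenizer inserts these as special tokens, so this plain-string version is only illustrative:

```python
def build_infill_prompt(prefix: str, suffix: str) -> str:
    """Assemble a fill-in-the-middle prompt for a Code Llama model.

    The <PRE>/<SUF>/<MID> sentinel layout mirrors the Code Llama
    infilling scheme; the tokenizer normally handles these as special
    tokens, so this string form is an illustrative assumption.
    """
    return f"<PRE> {prefix} <SUF>{suffix} <MID>"

prompt = build_infill_prompt(
    "def fibonacci(n):\n    ",
    "\n    return result",
)
```

The model's completion after <MID> is the text that belongs between the prefix and suffix, which is how editor-style "fill in the blank" completion tools are typically built on top of these models.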
