[](#stable-code-instruct-3b)**Stable Code Instruct 3B**
=======================================================

[Try it out here: https://huggingface.co/spaces/stabilityai/stable-code-instruct-3b](https://huggingface.co/spaces/stabilityai/stable-code-instruct-3b)

[![image/png](https://cdn-uploads.huggingface.co/production/uploads/63466107f7bd6326925fc770/O7ZkLgqoJprQEWAttX7Hj.png)](https://cdn-uploads.huggingface.co/production/uploads/63466107f7bd6326925fc770/O7ZkLgqoJprQEWAttX7Hj.png)

[](#model-description)Model Description
---------------------------------------

`stable-code-instruct-3b` is a 2.7B billion parameter decoder-only language model tuned from [`stable-code-3b`](https://huggingface.co/stabilityai/stable-code-3b/). This model was trained on a mix of publicly available datasets, synthetic datasets using [Direct Preference Optimization (DPO)](https://arxiv.org/abs/2305.18290).

This instruct tune demonstrates state-of-the-art performance (compared to models of similar size) on the MultiPL-E metrics across multiple programming languages tested using [BigCode's Evaluation Harness](https://github.com/bigcode-project/bigcode-evaluation-harness/tree/main), and on the code portions of [MT Bench](https://klu.ai/glossary/mt-bench-eval). The model is finetuned to make it useable in tasks like,

*   General purpose Code/Software Engineering like conversations.
*   SQL related generation and conversation.

Please note: For commercial use, please refer to [https://stability.ai/membership](https://stability.ai/membership).

[](#usage)Usage
---------------

Here's how you can run the model use the model:

    
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer
    tokenizer = AutoTokenizer.from_pretrained("stabilityai/stable-code-instruct-3b", trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained("stabilityai/stable-code-instruct-3b", torch_dtype=torch.bfloat16, trust_remote_code=True)
    model.eval()
    model = model.cuda()
    
    messages = [
        {
            "role": "system",
            "content": "You are a helpful and polite assistant",
        },
        {
            "role": "user",
            "content": "Write a simple website in HTML. When a user clicks the button, it shows a random joke from a list of 4 jokes."
        },
    ]
    
    prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True, tokenize=False)
    
    inputs = tokenizer([prompt], return_tensors="pt").to(model.device)
    
    tokens = model.generate(
        **inputs,
        max_new_tokens=1024,
        temperature=0.5,
        top_p=0.95,
        top_k=100,
        do_sample=True,
        use_cache=True
    )
    
    output = tokenizer.batch_decode(tokens[:, inputs.input_ids.shape[-1]:], skip_special_tokens=False)[0]
    

[](#model-details)Model Details
-------------------------------

*   **Developed by**: [Stability AI](https://stability.ai/)
*   **Model type**: `Stable Code Instruct 3B` model is an auto-regressive language model based on the transformer decoder architecture.
*   **Language(s)**: English
*   **Paper**: [Stable Code Technical Report](https://drive.google.com/file/d/16-DGsR5-qwoPztZ6HcM7KSRUxIXrjlSm/view)
*   **Library**: [Alignment Handbook](https://github.com/huggingface/alignment-handbook.git)
*   **Finetuned from model**: [https://huggingface.co/stabilityai/stable-code-3b](https://huggingface.co/stabilityai/stable-code-3b)
*   **License**: [StabilityAI Non-Commercial Research Community License](https://huggingface.co/stabilityai/stable-code-instruct-3b/blob/main/LICENSE).
*   **Commercial License**: to use this model commercially, please refer to [https://stability.ai/membership](https://stability.ai/membership)
*   **Contact**: For questions and comments about the model, please email `lm@stability.ai`

[](#performance)Performance
---------------------------

### [](#multi-pl-benchmark)Multi-PL Benchmark:

Model

Size

Avg

Python

C++

JavaScript

Java

PHP

Rust

Codellama Instruct

7B

0.30

0.33

0.31

0.31

0.29

0.31

0.25

Deepseek Instruct

1.3B

0.44

0.52

**0.52**

0.41

**0.46**

0.45

0.28

Stable Code Instruct (SFT)

3B

0.44

0.55

0.45

0.42

0.42

0.44

0.32

Stable Code Instruct (DPO)

3B

**0.47**

**0.59**

0.49

**0.49**

0.44

**0.45**

**0.37**

### [](#mt-bench-coding)MT-Bench Coding:

Model

Size

Score

DeepSeek Coder

1.3B

4.6

Stable Code Instruct (DPO)

3B

**5.8**(ours)

Stable Code Instruct (SFT)

3B

5.5

DeepSeek Coder

6.7B

**6.9**

CodeLlama Instruct

7B

3.55

StarChat2

15B

5.7

### [](#sql-performance)SQL Performance

Model

Size

Date

Group By

Order By

Ratio

Join

Where

Stable Code Instruct (DPO)

3B

24.0%

54.2%

68.5%

40.0%

54.2%

42.8%

DeepSeek-Coder Instruct

1.3B

24.0%

37.1%

51.4%

34.3%

45.7%

45.7%

SQLCoder

7B

64.0%

82.9%

74.3%

54.3%

74.3%

74.3%

[](#how-to-cite)How to Cite
---------------------------

    @misc{stable-code-instruct-3b,
          url={[https://huggingface.co/stabilityai/stable-code-3b](https://huggingface.co/stabilityai/stable-code-instruct-3b)},
          title={Stable Code 3B},
          author={Phung, Duy, and Pinnaparaju, Nikhil and Adithyan, Reshinth and Zhuravinskyi, Maksym and Tow, Jonathan and Cooper, Nathan}
    }

## Model Overview

`stable-code-instruct-3b` is a 2.7 billion parameter decoder-only language model tuned from the [`stable-code-3b`](https://aimodels.fyi/models/huggingFace/stable-code-3b-stabilityai) model. This model was trained on a mix of publicly available datasets and synthetic datasets using [Direct Preference Optimization (DPO)](https://arxiv.org/abs/2305.18290). The model demonstrates state-of-the-art performance on the MultiPL-E metrics across multiple programming languages tested using [BigCode's Evaluation Harness](https://github.com/bigcode-project/bigcode-evaluation-harness/tree/main), and on the code portions of [MT Bench](https://klu.ai/glossary/mt-bench-eval). 

This instruct-tuned model is optimized for general purpose code and software engineering tasks, as well as SQL-related generation and conversation. It outperforms similar-sized models on a range of programming-focused benchmarks.

## Model Inputs and Outputs

### Inputs
- Text prompts for code generation, including instructions, software requirements, or other context

### Outputs
- Generated code snippets or complete programs in a variety of programming languages
- Responses to prompts related to software engineering tasks, such as answering questions or providing explanations

## Capabilities

`stable-code-instruct-3b` is capable of generating high-quality code in multiple programming languages, including Python, C++, JavaScript, Java, and PHP. It can assist with a wide range of software engineering tasks, such as writing functions, implementing algorithms, and solving coding challenges. The model also demonstrates strong conversational abilities, allowing users to engage in back-and-forth dialogues about code-related topics.

## What Can I Use It For?

You can use `stable-code-instruct-3b` to aid in your software development workflows. Some potential use cases include:

- Generating starter code for new projects or features
- Assisting with debugging and troubleshooting by explaining code or suggesting fixes
- Automating repetitive coding tasks, such as boilerplate generation
- Enhancing productivity by allowing you to explore and validate ideas through interactive prompts

When using the model commercially, please refer to [https://stability.ai/membership](https://stability.ai/membership) for licensing information.

## Things to Try

One interesting capability of `stable-code-instruct-3b` is its ability to handle "Fill in the Middle" (FIM) prompts, where the model is tasked with generating the middle portion of a code snippet while the beginning and end are provided. This can be a useful feature when exploring different approaches to a problem or when trying to understand how a specific algorithm or data structure might be implemented.

Another interesting aspect of the model is its strong performance on SQL-related tasks. You can try prompting the model with database schema information or SQL queries and see how it responds, potentially generating new queries or suggesting optimizations.