



The grounded_sam model is an image-to-image model that performs mask prompting based on Grounding DINO and Segment Anything. Users supply an image URL and a text prompt naming the regions to mask (for example, "clothes, shoes"). They can also add a negative prompt (such as "pants"). The model then generates masks for the specified regions of the image, and users can modify the masking with an adjustment factor. The output is a series of URLs pointing to four images: the original image with the mask applied, the mask with negative prompts applied, the isolated mask, and the inverted mask.
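The relationship between the four outputs can be sketched on small binary masks. This is an illustration only, not the model's actual implementation: the function names (`subtract`, `invert`, `adjust`) are hypothetical, and the reading of the adjustment factor as "positive grows the mask, negative shrinks it, one pixel per step" is an assumption rather than something the description above confirms.

```python
from typing import List

Mask = List[List[int]]  # 1 = masked pixel, 0 = background


def subtract(positive: Mask, negative: Mask) -> Mask:
    """Keep pixels set in `positive` but not in `negative` (negative prompt applied)."""
    return [[p & (1 - n) for p, n in zip(p_row, n_row)]
            for p_row, n_row in zip(positive, negative)]


def invert(mask: Mask) -> Mask:
    """Swap masked and background pixels."""
    return [[1 - v for v in row] for row in mask]


def _dilate_once(mask: Mask) -> Mask:
    """Grow the mask by one pixel using the 4-connected neighbourhood."""
    height, width = len(mask), len(mask[0])
    out = [row[:] for row in mask]
    for y in range(height):
        for x in range(width):
            if mask[y][x]:
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if 0 <= ny < height and 0 <= nx < width:
                        out[ny][nx] = 1
    return out


def adjust(mask: Mask, factor: int) -> Mask:
    """Assumed semantics: factor > 0 dilates, factor < 0 erodes, 0 is a no-op."""
    if factor < 0:
        # Erosion expressed as dilation of the background.
        return invert(adjust(invert(mask), -factor))
    for _ in range(factor):
        mask = _dilate_once(mask)
    return mask


# Hypothetical masks: `positive` stands in for "clothes, shoes", `negative` for "pants".
positive = [[0, 0, 0, 0, 0],
            [0, 1, 1, 1, 0],
            [0, 1, 1, 1, 0],
            [0, 0, 0, 0, 0]]
negative = [[0, 0, 0, 0, 0],
            [0, 0, 0, 1, 0],
            [0, 0, 0, 1, 0],
            [0, 0, 0, 0, 0]]

final_mask = subtract(positive, negative)   # mask with negative prompt applied
inverted_mask = invert(final_mask)          # inverted mask
grown_mask = adjust(final_mask, 1)          # mask after a positive adjustment
```

In this sketch the negative mask removes the right-hand column of the positive region, and the inverted mask is simply its complement. Expressing erosion as dilation of the background keeps the code short, at the cost of not eroding the mask from the image border.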

Use cases

The grounded_sam model is designed primarily for prompt-based image masking, which makes it possible to highlight or suppress specific elements in an image. For example, an online retailer can use it to separate articles of clothing in product images, such as accentuating shoes while minimising the presence of pants. Similarly, fashion stylists can use the model to focus on individual elements of an ensemble when creating lookbooks or style guides. Another potential use case is surveillance and security, where the model can mask out non-critical elements to give a clearer focus on objects of interest.

Extensions of this model could be applied to photo-editing software, augmenting existing tools with the ability to highlight or suppress elements in an image. The model could also be integrated into web crawlers or image search algorithms to filter and sort image data based on the presence or absence of specific elements. Beyond these, industries such as real estate and interior design might use it to add or remove focus around furniture or specific room elements in property images.




Creator Models

No other models by this creator



Summary of this model and related resources.

Model Name: grounded_sam
Description: Mask prompting based on Grounding DINO & Segment Anything
Model Link: View on Replicate
API Spec: View on Replicate
GitHub Link: View on GitHub
Paper Link: View on arXiv


How popular is this model, by number of runs? How popular is the creator, by the sum of all their runs?

Model Rank
Creator Rank


How much does it cost to run this model? How long, on average, does it take to complete a run?

Cost per Run: $-
Prediction Hardware: -
Average Completion Time: -