Segment Anything, from Meta AI, is an AI model tailored for computer vision research. It allows users to segment objects within any image using a single click.
The model is a promptable segmentation system with zero-shot generalization: it works on unfamiliar objects and images without additional training. It accepts several kinds of input prompts that specify what to segment in an image, including interactive points and boxes. When a prompt is ambiguous, it can return multiple valid masks.
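As a rough illustration of the single-click workflow described above, the sketch below sends one foreground point to the model and keeps the highest-scoring of the candidate masks. It assumes the open-source `segment_anything` Python package, NumPy, and a downloaded ViT-H checkpoint, none of which are named in this listing; `checkpoint_path`, `segment_with_click`, and `pick_best_mask` are hypothetical names chosen for the example.

```python
from typing import Sequence


def pick_best_mask(scores: Sequence[float]) -> int:
    """Return the index of the highest-scoring candidate mask."""
    if not scores:
        raise ValueError("no candidate masks returned")
    return max(range(len(scores)), key=lambda i: scores[i])


def segment_with_click(image, x: int, y: int, checkpoint_path: str):
    """Segment the object under a single foreground click.

    Assumptions: the `segment_anything` package and NumPy are installed,
    `checkpoint_path` points to a downloaded ViT-H checkpoint (placeholder),
    and `image` is an HxWx3 uint8 RGB array.
    """
    # Imported lazily so the helper above works without these packages.
    import numpy as np
    from segment_anything import SamPredictor, sam_model_registry

    sam = sam_model_registry["vit_h"](checkpoint=checkpoint_path)
    predictor = SamPredictor(sam)
    predictor.set_image(image)

    # One point prompt; label 1 marks foreground, 0 would mark background.
    masks, scores, _ = predictor.predict(
        point_coords=np.array([[x, y]]),
        point_labels=np.array([1]),
        multimask_output=True,  # request several masks for ambiguous prompts
    )
    return masks[pick_best_mask(scores.tolist())]
```

A box prompt would follow the same pattern, passing `box=np.array([x0, y0, x1, y1])` to `predict` in place of the point arguments.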
The resulting masks can serve as inputs to other AI systems, track objects in video, drive image editing, be lifted to 3D, or support other creative work.
The model is designed to be efficient enough to power its data engine. It is split into an image encoder that runs once per image and a lightweight mask decoder that can run in a web browser, answering each prompt in a few milliseconds.
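The encoder/decoder split can be pictured as a cache: the expensive encoding happens once per image, and every later prompt reuses it. The following is a minimal sketch of that pattern with toy stand-in functions, not the model's actual code; `CachedSegmenter`, `toy_encode`, and `toy_decode` are hypothetical names.

```python
from typing import Callable, Dict, Hashable, Tuple

Point = Tuple[int, int]


class CachedSegmenter:
    """Runs the expensive encoder once per image, then reuses its output."""

    def __init__(
        self,
        encode: Callable[[object], object],
        decode: Callable[[object, Point], object],
    ) -> None:
        self._encode = encode
        self._decode = decode
        self._embeddings: Dict[Hashable, object] = {}
        self.encoder_calls = 0  # counts how often the heavy step ran

    def segment(self, image_id: Hashable, image: object, point: Point) -> object:
        # Encode only the first time this image is seen.
        if image_id not in self._embeddings:
            self._embeddings[image_id] = self._encode(image)
            self.encoder_calls += 1
        # Every prompt goes through the cheap decoder.
        return self._decode(self._embeddings[image_id], point)


def toy_encode(image):
    """Stand-in for the image encoder: sums all pixel values."""
    return sum(sum(row) for row in image)


def toy_decode(embedding, point):
    """Stand-in for the mask decoder: pairs the embedding with the prompt."""
    return (embedding, point)


segmenter = CachedSegmenter(toy_encode, toy_decode)
image = [[1, 2], [3, 4]]
results = [segmenter.segment("img-0", image, p) for p in [(0, 0), (1, 1), (0, 1)]]
print(segmenter.encoder_calls)
```

Three prompts on the same image trigger a single encoder call here, which is the property that keeps interactive prompting fast.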
The image encoder is implemented in PyTorch and needs a GPU for efficient inference. The prompt encoder and mask decoder can run directly in PyTorch, or be converted to ONNX and run efficiently on CPU or GPU on any platform that supports ONNX Runtime.
The model was trained on the SA-1B dataset, which contains more than 11 million licensed, privacy-preserving images and over 1.1 billion segmentation masks, or roughly 100 masks per image on average.
Pros
Advanced capabilities in image segmentation
Segmentation of objects with a single click
Zero-shot generalization capabilities

Cons
GPU is required for the image encoder
Focuses solely on image segmentation
Does not generate mask labels
