Segment Anything: Adaptation Challenge

Prerequisites

Get access to a machine with a decent NVIDIA GPU
Install Refiners: you will need it to solve this challenge

Background

Segment Anything (SAM) has been designed to solve downstream tasks via prompt engineering which could imply relying on other components like an object detector:

For example, if one has a bounding box detector for cats, cat instance segmentation can be solved by providing the detector's box output as a prompt to our model.

See section 2 from the official paper for more details.

As an alternative to this zero-shot approach, it is possible to improve SAM performance on certain tasks by using adapters: this is the focus of this challenge.

Task: Product Segmentation from Packshot Images

Packshots represent a single product on a white or neutral background:

Challenge

The goal is to train a SAM adapter with Refiners to accurately segment the product by just using the entire image area as a box prompt (which is natural since packshot images are object-centric).

By default, SAM (ViT-H) does not perform well on such kind of inputs, so your adapter should address some or all of these non-exhaustive issues (see packshots/ for some inputs/outputs):

Inverted Foreground-Background

By using the entire image as box prompt (not tight around the product), SAM generally returns a negated mask:

Left = input, right = SAM output mask (the product should be depicted with white pixels)

Shadows and Reflections

Drop shadows usually fool SAM:

Left = input, right = SAM output mask (the mask incorporates the shadow)

Likewise with reflection:

Inaccurate Boundaries

SAM could output inaccurate boundaries and/or mask errors like holes:

Mask overlaid on top of the input image to showcase the inaccuracies

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
issues		issues
packshots		packshots
LICENSE		LICENSE
README.md		README.md
example.jpg		example.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Segment Anything: Adaptation Challenge

Prerequisites

Background