Effective Methods for Background Removal on Images

Question

I'm interested in learning about how background removal works on images taken of clothing items. Do we need a specific color difference between the background and the clothing item in order to be able to determine object edges? How are edges determined?

Can somebody link me to algorithms/models/methods used effectively?

Yes, I read this post a few days back, and they said it's very helpful if there's a difference between the background and the main object: https://towardsdatascience.com/background-removal-with-deep-learning-c4f2104b3157 - Aditya, Sep 06 '18 at 00:52

score 1 · Answer 1 · answered Sep 06 '18 at 00:52

This problem used to be solved with analytic methods, as you can find here. These are dated (and, frankly, a bit boring), but for simpler cases they still work very well and are much faster to implement.
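To give a feel for what a simple analytic approach can look like, here is a minimal sketch that thresholds on color distance from the background. It is not necessarily the method from the link above. It assumes a plain, roughly uniform background that is visible in all four corners of the image, and the function name and the `tol` value are made up for illustration. An approach like this does depend on a clear color difference between the garment and the background.

```python
import numpy as np

def remove_background_by_color(image, tol=30.0):
    """Return a boolean foreground mask for an H x W x 3 uint8 image.

    Assumption: the four corner pixels show only background.
    """
    img = image.astype(np.float64)
    # Estimate the background color as the mean of the four corner pixels.
    corners = np.stack([img[0, 0], img[0, -1], img[-1, 0], img[-1, -1]])
    background_color = corners.mean(axis=0)
    # Euclidean distance of each pixel's color from the background estimate.
    distance = np.linalg.norm(img - background_color, axis=-1)
    # Pixels far enough from the background color count as foreground.
    return distance > tol

# Tiny synthetic example: white background, dark red "garment" in the middle.
image = np.full((6, 6, 3), 255, dtype=np.uint8)
image[2:4, 2:4] = (120, 20, 20)
mask = remove_background_by_color(image)
```

A threshold like this breaks down as soon as the garment contains colors close to the background, which is one reason a learned model tends to be more robust.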

Using deep learning is much more powerful for this particular case, and you don't actually need that much data, although you will need to take some time to prepare it.

The concept

We will train the model on a set of smaller image patches extracted from the original images, where the target labels are foreground and background.

Preparing the data

First, you will need to take your images and draw a mask over them in order to identify your labels. You can do this using Paint, I suppose. You will take your original images and color the foreground in white and the background in black. These will be labels 1 and 0, respectively.

In Python you will load the original images and their respective label images. You will then split each image and its labels into patches of size $k \times k$. You can pick whatever patch size you think is best suited for your kind of data. This is a hyper-parameter you will need to tune using cross-validation. Each patch has an associated label, which is the label of the pixel at the center of the patch.
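The patch extraction could be sketched roughly as follows with NumPy. This assumes the image and its mask are already loaded as arrays (the mask converted to 0/1, for example with `label_image > 127`), that $k$ is odd so each patch has a well-defined center pixel, and that pixels too close to the border are simply skipped. The function name and the `stride` parameter are illustrative choices, not part of any library.

```python
import numpy as np

def extract_patches(image, mask, k, stride=1):
    """Cut k x k patches from `image`; label each with the mask value at its center.

    image: H x W x C array; mask: H x W array of 0/1; k: odd patch size.
    """
    if k % 2 == 0:
        raise ValueError("k should be odd so that the patch has a center pixel")
    half = k // 2
    height, width = mask.shape
    patches, labels = [], []
    # Only visit centers whose full k x k neighborhood lies inside the image.
    for row in range(half, height - half, stride):
        for col in range(half, width - half, stride):
            patches.append(image[row - half:row + half + 1, col - half:col + half + 1])
            labels.append(mask[row, col])
    return np.stack(patches), np.array(labels)

# Synthetic stand-ins for a loaded image and its hand-drawn mask.
image = np.random.rand(8, 8, 3)
mask = np.zeros((8, 8), dtype=np.uint8)
mask[3:6, 3:6] = 1
patches, labels = extract_patches(image, mask, k=3)
```

With `stride=1` neighboring patches overlap heavily, so a larger stride or random subsampling is a reasonable way to keep the training set manageable.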

Build the model

Then you will build a standard convolutional neural network model where the inputs are the image patches and the output is the label.
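As one possible shape for such a network, here is a small sketch in PyTorch. The framework, the layer sizes, the patch size of 15, and the class name `PatchClassifier` are all assumptions made for illustration; the answer does not prescribe any of them.

```python
import torch
from torch import nn

class PatchClassifier(nn.Module):
    """Small CNN mapping a k x k patch to one foreground/background logit."""

    def __init__(self, in_channels=3, k=15):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(in_channels, 16, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),  # halves the spatial size (rounding down)
            nn.Conv2d(16, 32, kernel_size=3, padding=1),
            nn.ReLU(),
            nn.MaxPool2d(2),
        )
        reduced = k // 2 // 2  # spatial size left after the two pooling layers
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(32 * reduced * reduced, 1),
        )

    def forward(self, x):
        return self.classifier(self.features(x))

model = PatchClassifier(in_channels=3, k=15)
batch = torch.rand(4, 3, 15, 15)  # 4 patches, channels first
logits = model(batch)
probabilities = torch.sigmoid(logits)  # probability of "foreground"
```

For training, the single logit pairs naturally with `nn.BCEWithLogitsLoss`, using the 0/1 center-pixel labels as targets. Note that PyTorch expects channels first, so patches stored as $k \times k \times C$ arrays need their axes reordered before being fed in.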

Segmenting new images

To segment a new image, split it into patches and predict the label of each one. The pixels at the centers of all patches predicted as $label = 1$ form the foreground.
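A rough sketch of this sliding-window prediction is shown below. The `predict_patch` argument stands in for the trained model, and `toy_predict_patch` is a hypothetical placeholder used only to make the example runnable. Reflect-padding the image so that border pixels also get a full patch is an assumption added here, and $k$ is assumed to be odd.

```python
import numpy as np

def segment_image(image, predict_patch, k):
    """Label every pixel by classifying the k x k patch centered on it."""
    half = k // 2
    # Reflect-pad so that border pixels also get a full patch.
    padded = np.pad(image, ((half, half), (half, half), (0, 0)), mode="reflect")
    height, width = image.shape[:2]
    mask = np.zeros((height, width), dtype=np.uint8)
    for row in range(height):
        for col in range(width):
            # In padded coordinates this window is centered on pixel (row, col).
            patch = padded[row:row + k, col:col + k]
            mask[row, col] = predict_patch(patch)
    return mask

def toy_predict_patch(patch):
    # Hypothetical stand-in for the model: "foreground" if the center pixel is dark.
    center = patch[patch.shape[0] // 2, patch.shape[1] // 2]
    return int(center.mean() < 0.5)

image = np.ones((6, 6, 3))
image[2:4, 2:4] = 0.1  # dark "garment" on a bright background
mask = segment_image(image, toy_predict_patch, k=3)
foreground_only = image * mask[..., None]  # background pixels set to zero
```

Calling the model once per pixel is slow for large images, so in practice the patches would be collected and predicted in batches.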

Alternative method

Alternatively, you can predict the values for the entire patch at once. This means the output of your network will be the same size as its input, so you will have a label for each pixel in the patch.
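For this variant the training targets change from a single center label to a full mask patch. A minimal sketch of the data preparation, assuming non-overlapping tiles and dropping any leftover border that does not fill a whole tile (the function name is illustrative):

```python
import numpy as np

def extract_patch_pairs(image, mask, k):
    """Tile image and mask into non-overlapping k x k patches with per-pixel targets."""
    height, width = mask.shape
    inputs, targets = [], []
    for row in range(0, height - k + 1, k):
        for col in range(0, width - k + 1, k):
            inputs.append(image[row:row + k, col:col + k])
            # The target is the whole mask patch, not just its center value.
            targets.append(mask[row:row + k, col:col + k])
    return np.stack(inputs), np.stack(targets)

image = np.random.rand(8, 8, 3)
mask = np.zeros((8, 8), dtype=np.uint8)
mask[2:6, 2:6] = 1
inputs, targets = extract_patch_pairs(image, mask, k=4)
```

The model then needs an output layer with one value per input pixel, for example a network built only from size-preserving convolutions.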