The goal of interactive segmention is to obtain hight-quality masks based on user inputs in multiple interaction rounds. Benchmarking interactive segmention methods requires user inputs, however, gathering real-user data is impractical.
In practice segmentation quality is assessed with a baseline clicking strategy, when in each round clicks are put in the center of the largest erroneous area. This strategy does not accuretely model real user behavior, and segmentation methods may perfom worse in real–world scenarios compared to a benchmark based on baseline clicking.
To enable more accurate evaluation of interactive segmentation methods we propose a highly realistic simulator of user clicks.
Our model predicts a clickability map. Clickability map is a singlechannel image s.t. the value of each pixel corresponds to the probability that the user will click on it.
Our model is the best compared with uniform distribution (UD), distance transform (DT), and saliency map (SM).
Using clickability map, we obtain clicking groups $\{G_i\}^{10}_{i=1}$. Each clicking group corresponds to clicks in some probabilty interval that has 10% of total probability mass.
Using clicking groups, we calculate the following metrics:@inproceedings{antonov2024rclicks,
title={RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation},
author={Antonov, Anton and Moskalenko, Andrey and Shepelev, Denis and Krapukhin, Alexander and Soshin, Konstantin and Konushin, Anton and Shakhuro, Vlad},
booktitle={NeurIPS 2024},
year={2024}
}