paraafPeter A. van der Helm Demo link

About Teaching Research Publications Presentations



T-junctions: Cues for segmentation rather than for occlusion



In everyday situations, we experience a world of whole objects, even though many of these objects are partly occluded by each other. The research field of visual pattern completion focuses on the question of how a partly occluded object is completed amodally (i.e., beyond the visual input) into a whole object. In this research field, a distinction is made between local and global approaches.

Local approaches focus on the junctions where the visible edges of different objects meet. The figure below highlights two so-called T-junctions between, as far as visible, a red rectangle and a blue cross. According to local approaches, T-junctions are cues for occlusion. In the figure below, this implies that the rectangle is predicted to be the occluder, and that the visible edges of the cross are smoothly continued behind the occluder until they meet, which here yields a non-cross as whole object.

Global approaches focus on the shapes of all candidate whole objects, starting from the visible contours (as highlighted in the figure below). For each candidate occluder and each candidate occluded, global approaches quantify the goodness-of-shape and then select the best-of-shape combination. In the figure below, the (already visible) cross has a higher goodness-of-shape than the (partly visible) non-cross, which implies that no occlusion is predicted here.


Mosaic or occlusion


Does this mean that, according to global approaches, T-junctions are not relevant to amodal completion? The amodal completion model within the global approach of SIT model leads to a balanced answer along the following line of reasoning.

To explain amodal completion, this model considers all candidate occluder+occluded combinations for a given stimulus. For each combination, it first quantifies the goodness-of-shape of the occluder and the occluded separately, namely, by the structural complexity of the simplest SIT code for each shape. The sum of these complexities yields the total shape complexity Ishape. Then, it quantifies the structural complexity of the relative position of the occluder and occluded, yielding the positional complexity Iposition. Finally, the combination with the smallest sum Ishape + Iposition is predicted to be the preferred stimulus interpretaton.

SIT's amodal completion model has shown to have considerable predictive power. Without going into quantification details, the next figure shows its predictions for some stimuli. These stimuli consist of edges that, at their point of contact, form one of four different junction types (as highlighted by the circles). For each stimulus, the "two objects" hypothesis and the "one object" hypothesis are considered, and the boxed complexities indicate the predicted interpretations.


Junctions


Thus, for instance, the third stimulus is predicted to be seen as one L-shape rather than as two objects with an L-junction at their point of contact. The second stimulus, however, is predicted to be seen as two objects with a T-junction at their point of contact rather than as one T-shape. This implies, inversely, that a T-junction is a cue for segmentation, that is, a cue for the presence of two objects.

Hence, according to SIT's global approach, T-junctions are relevant to amodal completion, albeit it not as direct cues for occlusion but merely as cues for segmentation. By consequence, whether or not the resulting segments are seen as belonging to partly-overlapping objects, must co-depend on the shapes of the hypothesized whole objects.


For further demos on these issues, see Object versus viewer and Occam versus Bayes

For a full account of SIT's amodal completion model, see Perception 1994
For a multidisciplinary and historical embedding of this occlusion model, see Psychological Bulletin 2000