Full interpretation of minimal images
- PMID: 29107889
- DOI: 10.1016/j.cognition.2017.10.006
Abstract
The goal of this work is to model the process of 'full interpretation' of object images, which is the ability to identify and localize all semantic features and parts that are recognized by human observers. The task is approached by dividing the interpretation of the complete object into the interpretation of multiple reduced but interpretable local regions. In such reduced regions, interpretation is simpler, since the number of semantic components is small and the variability of possible configurations is low. We model the interpretation process by identifying primitive components and relations that play a useful role in local interpretation by humans. To identify these components and relations, we consider the interpretation of 'minimal configurations': these are reduced local regions, which are minimal in the sense that further reduction renders them unrecognizable and uninterpretable. We show that such minimal interpretable images have useful properties, which we use to identify informative features and relations used for full interpretation. We describe our interpretation model and show results of detailed interpretations of minimal configurations, produced automatically by the model. Finally, we discuss possible extensions and implications of full interpretation for difficult visual tasks, such as recognizing social interactions, which are beyond the scope of current models of visual recognition.
Keywords: Image interpretation; Minimal images; Parts and relations; Top-down processing.
Copyright © 2017. Published by Elsevier B.V.