메뉴 건너뛰기




Volumn 358, Issue 6368, 2017, Pages

A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; ARTIFICIAL NEURAL NETWORK; DATA; HEURISTICS; LEARNING; MACHINE LEARNING; MODEL; PROBABILITY; SEGMENTATION;

EID: 85032354846     PISSN: 00368075     EISSN: 10959203     Source Type: Journal    
DOI: 10.1126/science.aag2612     Document Type: Article
Times cited : (276)

References (62)
  • 1
    • 84949683101 scopus 로고    scopus 로고
    • Human-level concept learning through probabilistic program induction
    • pmid: 26659050
    • B. M. Lake, R. Salakhutdinov, J. B. Tenenbaum, Human-level concept learning through probabilistic program induction. Science 350, 1332-1338 (2015). doi: 10.1126/science.aab3050; pmid: 26659050
    • (2015) Science , vol.350 , pp. 1332-1338
    • Lake, B.M.1    Salakhutdinov, R.2    Tenenbaum, J.B.3
  • 4
    • 0041431049 scopus 로고    scopus 로고
    • Recognizing objects in adversarial clutter: Breaking a visual CAPTCHA
    • IEEE Computer Society
    • G. Mori, J. Malik, “Recognizing objects in adversarial clutter: Breaking a visual CAPTCHA,” in 2003 IEEE Conference on Computer Vision and Pattern Recognition (IEEE Computer Society, 2003), pp. I-134–I-141.
    • (2003) 2003 IEEE Conference on Computer Vision and Pattern Recognition , pp. I134-I141
    • Mori, G.1    Malik, J.2
  • 9
    • 0042565834 scopus 로고    scopus 로고
    • Hierarchical Bayesian inference in the visual cortex
    • pmid: 12868647
    • T. S. Lee, D. Mumford, Hierarchical Bayesian inference in the visual cortex. JOSA A 20, 1434–1448 (2003). doi: 10.1364/JOSAA.20.001434; pmid: 12868647
    • (2003) JOSA A , vol.20 , pp. 1434-1448
    • Lee, T.S.1    Mumford, D.2
  • 10
    • 77955281093 scopus 로고    scopus 로고
    • Probabilistic models of cognition: Exploring representations and inductive biases
    • pmid: 20576465
    • T. L. Griffiths, N. Chater, C. Kemp, A. Perfors, J. B. Tenenbaum, Probabilistic models of cognition: Exploring representations and inductive biases. Trends Cogn. Sci. 14, 357–364 (2010). doi: 10.1016/j.tics.2010.05.004; pmid: 20576465
    • (2010) Trends Cogn. Sci. , vol.14 , pp. 357-364
    • Griffiths, T.L.1    Chater, N.2    Kemp, C.3    Perfors, A.4    Tenenbaum, J.B.5
  • 11
    • 84937720055 scopus 로고    scopus 로고
    • The visual system’s internal model of the world
    • IEEE
    • T. S. Lee, “The visual system’s internal model of the world,” in Proceedings of the IEEE, vol. 103 (IEEE, 2015), pp. 1359–1378
    • (2015) Proceedings of The IEEE , vol.103 , pp. 1359-1378
    • Lee, T.S.1
  • 12
    • 0038396223 scopus 로고    scopus 로고
    • Bayesian models of object perception
    • pmid: 12744967
    • D. Kersten, A. Yuille, Bayesian models of object perception. Curr. Opin. Neurobiol. 13, 150–158 (2003). doi: 10.1016/S0959-4388(03)00042-4; pmid: 12744967
    • (2003) Curr. Opin. Neurobiol. , vol.13 , pp. 150-158
    • Kersten, D.1    Yuille, A.2
  • 13
    • 84876664193 scopus 로고    scopus 로고
    • Top-down influences on visual processing
    • pmid: 23595013
    • C. D. Gilbert, W. Li, Top-down influences on visual processing. Nat. Rev. Neurosci. 14, 350–363 (2013). doi: 10.1038/nrn3476; pmid: 23595013
    • (2013) Nat. Rev. Neurosci. , vol.14 , pp. 350-363
    • Gilbert, C.D.1    Li, W.2
  • 14
    • 0034333184 scopus 로고    scopus 로고
    • The distinct modes of vision offered by feedforward and recurrent processing
    • pmid: 11074267
    • V. A. F. Lamme, P. R. Roelfsema, The distinct modes of vision offered by feedforward and recurrent processing. Trends Neurosci. 23, 571–579 (2000). doi: 10.1016/S0166-2236(00)01657-X; pmid: 11074267
    • (2000) Trends Neurosci , vol.23 , pp. 571-579
    • Lamme, V.A.F.1    Roelfsema, P.R.2
  • 15
    • 0032563929 scopus 로고    scopus 로고
    • Object-based attention in the primary visual cortex of the macaque monkey
    • pmid: 9759726
    • P. R. Roelfsema, V. A. F. Lamme, H. Spekreijse, Object-based attention in the primary visual cortex of the macaque monkey. Nature 395, 376–381 (1998). doi: 10.1038/26475; pmid: 9759726
    • (1998) Nature , vol.395 , pp. 376-381
    • Roelfsema, P.R.1    Lamme, V.A.F.2    Spekreijse, H.3
  • 16
    • 84926677566 scopus 로고    scopus 로고
    • Neural mechanisms of object-based attention
    • pmid: 24217991
    • E. H. Cohen, F. Tong, Neural mechanisms of object-based attention. Cereb. Cortex 25, 1080–1092 (2015). doi: 10.1093/cercor/bht303; pmid: 24217991
    • (2015) Cereb. Cortex , vol.25 , pp. 1080-1092
    • Cohen, E.H.1    Tong, F.2
  • 17
    • 0027420016 scopus 로고
    • Contour integration by the human visual system: Evidence for a local “association field”
    • pmid: 8447091
    • D. J. Field, A. Hayes, R. F. Hess, Contour integration by the human visual system: Evidence for a local “association field”. Vision Res. 33, 173–193 (1993). doi: 10.1016/0042-6989(93)90156-Q; pmid: 8447091
    • (1993) Vision Res. , vol.33 , pp. 173-193
    • Field, D.J.1    Hayes, A.2    Hess, R.F.3
  • 18
    • 0024343494 scopus 로고
    • Columnar specificity of intrinsic horizontal and corticocortical connections in cat visual cortex
    • pmid: 2746337
    • C. D. Gilbert, T. N. Wiesel, Columnar specificity of intrinsic horizontal and corticocortical connections in cat visual cortex. J. Neurosci. 9, 2432–2442 (1989). pmid: 2746337
    • (1989) J. Neurosci. , vol.9 , pp. 2432-2442
    • Gilbert, C.D.1    Wiesel, T.N.2
  • 19
    • 34447526771 scopus 로고    scopus 로고
    • A neural model of figure-ground organization
    • pmid: 17442769
    • E. Craft, H. Schütze, E. Niebur, R. von der Heydt, A neural model of figure-ground organization. J. Neurophysiol. 97, 4310–4326 (2007). doi: 10.1152/jn.00203.2007; pmid: 17442769
    • (2007) J. Neurophysiol. , vol.97 , pp. 4310-4326
    • Craft, E.1    Schütze, H.2    Niebur, E.3    Von Der Heydt, R.4
  • 20
    • 0032784928 scopus 로고    scopus 로고
    • Separate processing dynamics for texture elements, boundaries and surfaces in primary visual cortex of the macaque monkey
    • pmid: 10426419
    • V. A. F. Lamme, V. Rodriguez-Rodriguez, H. Spekreijse, Separate processing dynamics for texture elements, boundaries and surfaces in primary visual cortex of the macaque monkey. Cereb. Cortex 9, 406–413 (1999). doi: 10.1093/cercor/9.4.406; pmid: 10426419
    • (1999) Cereb. Cortex , vol.9 , pp. 406-413
    • Lamme, V.A.F.1    Rodriguez-Rodriguez, V.2    Spekreijse, H.3
  • 21
    • 0023950326 scopus 로고
    • Concurrent processing streams in monkey visual cortex
    • pmid: 2471327
    • E. A. DeYoe, D. C. Van Essen, Concurrent processing streams in monkey visual cortex. Trends Neurosci. 11, 219–226 (1988). doi: 10.1016/0166-2236(88)90130-0; pmid: 2471327
    • (1988) Trends Neurosci. , vol.11 , pp. 219-226
    • DeYoe, E.A.1    Van Essen, D.C.2
  • 22
    • 52949145321 scopus 로고    scopus 로고
    • V1 response timing and surface filling-in
    • pmid: 18509081
    • X. Huang, M. A. Paradiso, V1 response timing and surface filling-in. J. Neurophysiol. 100, 539–547 (2008). doi: 10.1152/jn.00997.2007; pmid: 18509081
    • (2008) J. Neurophysiol. , vol.100 , pp. 539-547
    • Huang, X.1    Paradiso, M.A.2
  • 23
    • 0034280490 scopus 로고    scopus 로고
    • Coding of border ownership in monkey visual cortex
    • pmid: 10964965
    • H. Zhou, H. S. Friedman, R. von der Heydt, Coding of border ownership in monkey visual cortex. J. Neurosci. 20, 6594–6611 (2000). pmid: 10964965
    • (2000) J. Neurosci. , vol.20 , pp. 6594-6611
    • Zhou, H.1    Friedman, H.S.2    Von Der Heydt, R.3
  • 25
    • 80051550318 scopus 로고    scopus 로고
    • Recursive compositional models for vision: Description and review of recent work
    • L. L. Zhu, Y. Chen, A. Yuille, Recursive compositional models for vision: Description and review of recent work. J. Math. Imaging Vis. 41, 122–146 (2011). doi: 10.1007/s10851-011-0282-2
    • (2011) J. Math. Imaging Vis. , vol.41 , pp. 122-146
    • Zhu, L.L.1    Chen, Y.2    Yuille, A.3
  • 30
    • 17444392134 scopus 로고    scopus 로고
    • Image parsing: Unifying segmentation, detection, and recognition
    • Z. Tu, X. Chen, A. L. Yuille, S.-C. Zhu, Image parsing: Unifying segmentation, detection, and recognition. Int. J. Comput. Vis. 63, 113–140 (2005). doi: 10.1007/s11263-005-6642-x
    • (2005) Int. J. Comput. Vis. , vol.63 , pp. 113-140
    • Tu, Z.1    Chen, X.2    Yuille, A.L.3    Zhu, S.-C.4
  • 33
    • 85037571827 scopus 로고    scopus 로고
    • Supplementary materials
    • Supplementary materials.
  • 34
    • 84880878477 scopus 로고    scopus 로고
    • Learning AND-OR templates for object recognition and detection
    • Z. Si, S.-C. Zhu, Learning AND-OR templates for object recognition and detection. IEEE Trans. Pattern Anal. Mach. Intell. 35, 2189–2205 (2013).
    • (2013) IEEE Trans. Pattern Anal. Mach. Intell. , vol.35 , pp. 2189-2205
    • Si, Z.1    Zhu, S.-C.2
  • 35
    • 34248586201 scopus 로고    scopus 로고
    • Invariance and selectivity in the ventral visual pathway
    • pmid: 17336506
    • S. Geman, Invariance and selectivity in the ventral visual pathway. J. Physiol. Paris 100, 212–224 (2006). doi: 10.1016/j.jphysparis.2007.01.001; pmid: 17336506
    • (2006) J. Physiol. Paris , vol.100 , pp. 212-224
    • Geman, S.1
  • 37
    • 34047107296 scopus 로고    scopus 로고
    • Primal sketch: Integrating structure and texture
    • C. Guo, S.-C. Zhu, Y. N. Wu, Primal sketch: Integrating structure and texture. Comput. Vis. Image Underst. 106, 5–19 (2007). doi: 10.1016/j.cviu.2005.09.004
    • (2007) Comput. Vis. Image Underst. , vol.106 , pp. 5-19
    • Guo, C.1    Zhu, S.-C.2    Wu, Y.N.3
  • 39
    • 77954860497 scopus 로고    scopus 로고
    • A numerical study of the bottom-up and top-down inference processes in and-or graphs
    • T. Wu, S.-C. Zhu, A numerical study of the bottom-up and top-down inference processes in and-or graphs. Int. J. Comput. Vis. 93, 226–252 (2011). doi: 10.1007/s11263-010-0346-6
    • (2011) Int. J. Comput. Vis. , vol.93 , pp. 226-252
    • Wu, T.1    Zhu, S.-C.2
  • 40
    • 84986299897 scopus 로고    scopus 로고
    • Representing higher-order dependencies in networks
    • pmid: 27386539
    • J. Xu, T. L. Wickramarathne, N. V. Chawla, Representing higher-order dependencies in networks. Sci. Adv. 2, e1600028 (2016). doi: 10.1126/sciadv.1600028; pmid: 27386539
    • (2016) Sci. Adv. , vol.2 , pp. e1600028
    • Xu, J.1    Wickramarathne, T.L.2    Chawla, N.V.3
  • 43
    • 84874508893 scopus 로고    scopus 로고
    • Computational models of visual attention
    • J. Tsotsos, A. Rothenstein, Computational models of visual attention. Scholarpedia 6, 6201 (2011). doi: 10.4249/scholarpedia.6201
    • (2011) Scholarpedia , vol.6 , pp. 6201
    • Tsotsos, J.1    Rothenstein, A.2
  • 44
    • 0027379242 scopus 로고
    • A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information
    • pmid: 8229193
    • B. A. Olshausen, C. H. Anderson, D. C. Van Essen, A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information. J. Neurosci. 13, 4700–4719 (1993). pmid: 8229193
    • (1993) J. Neurosci. , vol.13 , pp. 4700-4719
    • Olshausen, B.A.1    Anderson, C.H.2    Van Essen, D.C.3
  • 45
    • 0032203257 scopus 로고    scopus 로고
    • Gradient-based learning applied to document recognition
    • Y. LeCun, L. Bottou, Y. Bengio, P. Haffner, Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998). doi: 10.1109/5.726791
    • (1998) Proc. IEEE , vol.86 , pp. 2278-2324
    • LeCun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 51
    • 1842733191 scopus 로고    scopus 로고
    • Greedy learning of multiple objects in images using robust statistics and factorial learning
    • pmid: 15070509
    • C. K. Williams, M. K. Titsias, Greedy learning of multiple objects in images using robust statistics and factorial learning. Neural Comput. 16, 1039–1062 (2004). doi: 10.1162/089976604773135096; pmid: 15070509
    • (2004) Neural Comput , vol.16 , pp. 1039-1062
    • Williams, C.K.1    Titsias, M.K.2
  • 56
    • 84959238467 scopus 로고    scopus 로고
    • Semantic part segmentation using compositional model combining shape and appearance
    • IEEE
    • J. Wang, A. Yuille, “Semantic part segmentation using compositional model combining shape and appearance,” in 2015 IEEE Conference on Computer Vision and Pattern Recognition (IEEE, 2015), pp. 1788–1797.
    • (2015) 2015 IEEE Conference on Computer Vision and Pattern Recognition , pp. 1788-1797
    • Wang, J.1    Yuille, A.2
  • 57
    • 84938559532 scopus 로고    scopus 로고
    • Adding discriminative power to a generative hierarchical compositional model using histograms of compositions
    • D. Tabernik, A. Leonardis, M. Boben, D. Skočaj, M. Kristan, Adding discriminative power to a generative hierarchical compositional model using histograms of compositions. Comput. Vis. Image Underst. 138, 102–113 (2015). doi: 10.1016/j.cviu.2015.04.006
    • (2015) Comput. Vis. Image Underst. , vol.138 , pp. 102-113
    • Tabernik, D.1    Leonardis, A.2    Boben, M.3    Skočaj, D.4    Kristan, M.5
  • 60
    • 0021700041 scopus 로고
    • Visual routines
    • pmid: 6543165
    • S. Ullman, Visual routines. Cognition 18, 97–159 (1984). doi: 10.1016/0010-0277(84)90023-4; pmid: 6543165
    • (1984) Cognition , vol.18 , pp. 97-159
    • Ullman, S.1
  • 61
    • 77953750826 scopus 로고    scopus 로고
    • Cortical circuitry implementing graphical models
    • pmid: 19686065
    • S. Litvak, S. Ullman, Cortical circuitry implementing graphical models. Neural Comput. 21, 3010–3056 (2009). doi: 10.1162/neco.2009.05-08-783; pmid: 19686065
    • (2009) Neural Comput , vol.21 , pp. 3010-3056
    • Litvak, S.1    Ullman, S.2
  • 62
    • 73349116544 scopus 로고    scopus 로고
    • Towards a mathematical theory of cortical micro-circuits
    • pmid: 19816557
    • D. George, J. Hawkins, Towards a mathematical theory of cortical micro-circuits. PLOS Comput. Biol. 5, e1000532 (2009). doi: 10.1371/journal.pcbi.1000532; pmid: 19816557
    • (2009) PLOS Comput. Biol. , vol.5
    • George, D.1    Hawkins, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.