메뉴 건너뛰기




Volumn 40, Issue 4, 2018, Pages 834-848

DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs

Author keywords

atrous convolution; conditional random fields; Convolutional neural networks; semantic segmentation

Indexed keywords

CONVOLUTION; DEEP LEARNING; DEEP NEURAL NETWORKS; NEURAL NETWORKS; RANDOM PROCESSES; SEMANTICS;

EID: 85042712042     PISSN: 01628828     EISSN: None     Source Type: Journal    
DOI: 10.1109/TPAMI.2017.2699184     Document Type: Article
Times cited : (18309)

References (102)
  • 1
    • 0032203257 scopus 로고    scopus 로고
    • Gradient-based learning applied to document recognition
    • Nov.
    • Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based learning applied to document recognition, " in Proc. IEEE, vol. 86, no. 11, pp. 2278-2324, Nov. 1998.
    • (1998) Proc. IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • LeCun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 6
    • 84959218210 scopus 로고    scopus 로고
    • Modeling local and global deformations in deep learning: Epitomic convolution, multiple instance learning, and sliding window detection
    • G. Papandreou, I. Kokkinos, and P.-A. Savalle, "Modeling local and global deformations in deep learning: Epitomic convolution, multiple instance learning, and sliding window detection, " in Proc. IEEE Conf. Comput. Vis. Pattern Recog., 2015, pp. 390-399.
    • (2015) Proc. IEEE Conf. Comput. Vis. Pattern Recog. , pp. 390-399
    • Papandreou, G.1    Kokkinos, I.2    Savalle, P.-A.3
  • 13
    • 84906489074 scopus 로고    scopus 로고
    • Visualizing and understanding convolutional networks
    • M. D. Zeiler and R. Fergus, "Visualizing and understanding convolutional networks, " in Proc. Eur. Conf. Comput. Vis., 2014, pp. 818-833.
    • (2014) Proc. Eur. Conf. Comput. Vis. , pp. 818-833
    • Zeiler, M.D.1    Fergus, R.2
  • 18
    • 85083952789 scopus 로고    scopus 로고
    • Pushing the boundaries of boundary detection using deep learning
    • I. Kokkinos, "Pushing the boundaries of boundary detection using deep learning, " in Proc. Int. Conf. Learn. Representations, 2016.
    • (2016) Proc. Int. Conf. Learn. Representations
    • Kokkinos, I.1
  • 20
    • 84906508687 scopus 로고    scopus 로고
    • Spatial pyramid pooling in deep convolutional networks for visual recognition
    • K. He, X. Zhang, S. Ren, and J. Sun, "Spatial pyramid pooling in deep convolutional networks for visual recognition, " in Proc. Eur. Conf. Comput. Vis., 2014, pp. 346-361.
    • (2014) Proc. Eur. Conf. Comput. Vis. , pp. 346-361
    • He, K.1    Zhang, X.2    Ren, S.3    Sun, J.4
  • 22
    • 85162351107 scopus 로고    scopus 로고
    • Efficient inference in fully con-nected CRFs with Gaussian edge potentials
    • P. Krahenbuhl and V. Koltun, "Efficient inference in fully con-nected CRFs with Gaussian edge potentials, " in Proc. Advances Neural Inf. Process. Syst., 2011, pp. 109-117
    • (2011) Proc. Advances Neural Inf. Process. Syst. , pp. 109-117
    • Krahenbuhl, P.1    Koltun, V.2
  • 23
    • 12844262766 scopus 로고    scopus 로고
    • GrabCut: Interactive foreground extraction using iterated graph cuts
    • C. Rother, V. Kolmogorov, and A. Blake, "GrabCut: Interactive foreground extraction using iterated graph cuts, " in Proc. ACM SIGGRAPH, 2004, pp. 309-314.
    • (2004) Proc. ACM SIGGRAPH , pp. 309-314
    • Rother, C.1    Kolmogorov, V.2    Blake, A.3
  • 24
    • 58149151266 scopus 로고    scopus 로고
    • Textonboost for image understanding: Multi-class object recognition and segmentation by jointly modeling texture, layout, and context
    • J. Shotton, J. Winn, C. Rother, and A. Criminisi, "Textonboost for image understanding: Multi-class object recognition and segmentation by jointly modeling texture, layout, and context, " Int. J. Comput. Vis., vol. 81, pp. 2-23, 2009.
    • (2009) Int. J. Comput. Vis. , vol.81 , pp. 2-23
    • Shotton, J.1    Winn, J.2    Rother, C.3    Criminisi, A.4
  • 25
  • 29
    • 84855348351 scopus 로고    scopus 로고
    • Fast approxi-mate energy minimization with label costs
    • A. Delong, A. Osokin, H. N. Isack, and Y. Boykov, "Fast approxi-mate energy minimization with label costs, " Int. J. Comput. Vis., vol. 96, pp. 1-27, 2012.
    • (2012) Int. J. Comput. Vis. , vol.96 , pp. 1-27
    • Delong, A.1    Osokin, A.2    Isack, H.N.3    Boykov, Y.4
  • 31
    • 61349174704 scopus 로고    scopus 로고
    • Robust higher order potentials for enforcing label consistency
    • P. Kohli, P. H. Torr, and L. Ladick, "Robust higher order potentials for enforcing label consistency, " Int. J. Comput. Vis., vol. 82, no. 3, pp. 302-324, 2009.
    • (2009) Int. J. Comput. Vis. , vol.82 , Issue.3 , pp. 302-324
    • Kohli, P.1    Torr, P.H.2    Ladick, L.3
  • 32
    • 84898816122 scopus 로고    scopus 로고
    • Learning a dictionary of shape epitomes with applications to image labeling
    • L.-C. Chen, G Papandreou, and A. Yuille, "Learning a dictionary of shape epitomes with applications to image labeling, " in Proc. IEEE Int. Conf. Comput. Vis., 2013, pp. 337-344.
    • (2013) Proc. IEEE Int. Conf. Comput. Vis. , pp. 337-344
    • Chen, L.-C.1    Papandreou, G.2    Yuille, A.3
  • 35
    • 84911444024 scopus 로고    scopus 로고
    • The role of context for object detection and semantic segmentation in the wild
    • R. Mottaghi, et al., "The role of context for object detection and semantic segmentation in the wild, " in Proc. IEEE Conf. Comput. Vis. Pattern Recog., 2014, pp. 891-898.
    • (2014) Proc. IEEE Conf. Comput. Vis. Pattern Recog. , pp. 891-898
    • Mottaghi, R.1
  • 37
    • 84986255616 scopus 로고    scopus 로고
    • The cityscapes dataset for semantic urban scene understanding
    • M. Cordts, et al., "The cityscapes dataset for semantic urban scene understanding, " in Proc. IEEE Conf. Comput. Vis. Pattern Recog., 2016.
    • (2016) Proc. IEEE Conf. Comput. Vis. Pattern Recog.
    • Cordts, M.1
  • 42
    • 77956051102 scopus 로고    scopus 로고
    • Auto-context and its application to high-level vision tasks and 3D brain image segmentation
    • Oct.
    • Z. Tu and X. Bai, "Auto-context and its application to high-level vision tasks and 3D brain image segmentation, " IEEE Trans. Pattern Anal. Mach. Intell., vol. 32, no. 10, pp. 1744-1757, Oct. 2010.
    • (2010) IEEE Trans. Pattern Anal. Mach. Intell. , vol.32 , Issue.10 , pp. 1744-1757
    • Tu, Z.1    Bai, X.2
  • 46
    • 84861335581 scopus 로고    scopus 로고
    • CPMC: Automatic object seg-mentation using constrained parametric min-cuts
    • Jul.
    • J. Carreira and C. Sminchisescu, "CPMC: Automatic object seg-mentation using constrained parametric min-cuts, " IEEE Trans. Pattern Anal. Mach. Intell., vol. 34, no. 7, pp. 1312-1328, Jul. 2012.
    • (2012) IEEE Trans. Pattern Anal. Mach. Intell. , vol.34 , Issue.7 , pp. 1312-1328
    • Carreira, J.1    Sminchisescu, C.2
  • 54
    • 0026151642 scopus 로고
    • Parallel and deterministic algorithms from MRFs: Surface reconstruction
    • May
    • D. Geiger and F. Girosi, "Parallel and deterministic algorithms from MRFs: Surface reconstruction, " IEEE Trans. Pattern Anal. Mach. Intell., vol. 13, no. 5, pp. 401-412, May 1991.
    • (1991) IEEE Trans. Pattern Anal. Mach. Intell. , vol.13 , Issue.5 , pp. 401-412
    • Geiger, D.1    Girosi, F.2
  • 55
    • 0026201666 scopus 로고
    • A common framework for image segmentation
    • D. Geiger and A. Yuille, "A common framework for image segmentation, " Int. J. Comput. Vis., vol. 6, no. 3, pp. 227-243, 1991.
    • (1991) Int. J. Comput. Vis. , vol.6 , Issue.3 , pp. 227-243
    • Geiger, D.1    Yuille, A.2
  • 56
    • 44649169686 scopus 로고    scopus 로고
    • Computational analysis and learning for a biologically motivated model of boundary detection
    • I. Kokkinos, R. Deriche, O. Faugeras, and P. Maragos, "Computational analysis and learning for a biologically motivated model of boundary detection, " Neurocomputing, vol. 71, no. 10, pp. 1798-1812, 2008.
    • (2008) Neurocomputing , vol.71 , Issue.10 , pp. 1798-1812
    • Kokkinos, I.1    Deriche, R.2    Faugeras, O.3    Maragos, P.4
  • 59
    • 84973861983 scopus 로고    scopus 로고
    • Conditional random fields as recurrent neural networks
    • S. Zheng, et al., "Conditional random fields as recurrent neural networks, " in Proc. IEEE Int. Conf. Comput. Vis., 2015, pp. 1529-1537.
    • (2015) Proc. IEEE Int. Conf. Comput. Vis. , pp. 1529-1537
    • Zheng, S.1
  • 60
    • 84973890848 scopus 로고    scopus 로고
    • Boxsup: Exploiting bounding boxes to supervise convolutional networks for semantic segmentation
    • J. Dai, K. He, and J. Sun, "Boxsup: Exploiting bounding boxes to supervise convolutional networks for semantic segmentation, " in Proc. Int. Conf. Comput. Vis., 2015.
    • (2015) Proc. Int. Conf. Comput. Vis.
    • Dai, J.1    He, K.2    Sun, J.3
  • 61
    • 84973879016 scopus 로고    scopus 로고
    • Learning deconvolution network for semantic segmentation
    • H. Noh, S. Hong, and B. Han, "Learning deconvolution network for semantic segmentation, " in Proc. IEEE Int. Conf. Comput. Vis., 2015, pp. 1520-1528.
    • (2015) Proc. IEEE Int. Conf. Comput. Vis. , pp. 1520-1528
    • Noh, H.1    Hong, S.2    Han, B.3
  • 63
    • 84986275144 scopus 로고    scopus 로고
    • Semantic image segmentation with task-specific edge detection using cnns and a discriminatively trained domain transform
    • L.-C. Chen, J. T. Barron, G. Papandreou, K. Murphy, and A. L. Yuille, "Semantic image segmentation with task-specific edge detection using cnns and a discriminatively trained domain transform, " in Proc. IEEE Conf. Comput. Vis. Pattern Recog., 2016, pp. 4545-4554.
    • (2016) Proc. IEEE Conf. Comput. Vis. Pattern Recog. , pp. 4545-4554
    • Chen, L.-C.1    Barron, J.T.2    Papandreou, G.3    Murphy, K.4    Yuille, A.L.5
  • 67
    • 80051902233 scopus 로고    scopus 로고
    • Domain transform for edge-aware image and video processing
    • E. S. L. Gastal and M. M. Oliveira, "Domain transform for edge-aware image and video processing, " in Proc. ACM SIGGRAPH, 2011, Art. no. 69.
    • (2011) Proc. ACM SIGGRAPH , pp. 69
    • Gastal, E.S.L.1    Oliveira, M.M.2
  • 68
    • 84973888826 scopus 로고    scopus 로고
    • High-for-low and low-for-high: Efficient boundary detection from deep object features and its applications to high-level vision
    • G Bertasius, J. Shi, and L. Torresani, "High-for-low and low-for-high: Efficient boundary detection from deep object features and its applications to high-level vision, " in Proc. IEEE Int. Conf. Comput. Vis., 2015.
    • (2015) Proc. IEEE Int. Conf. Comput. Vis.
    • Bertasius, G.1    Shi, J.2    Torresani, L.3
  • 71
    • 84965099276 scopus 로고    scopus 로고
    • Decoupled deep neural network for semi-supervised semantic segmentation
    • S. Hong, H. Noh, and B. Han, "Decoupled deep neural network for semi-supervised semantic segmentation, " in Proc. 28th Int. Conf. Neural Inf. Process. Syst., 2015, pp. 1495-1503.
    • (2015) Proc. 28th Int. Conf. Neural Inf. Process. Syst. , pp. 1495-1503
    • Hong, S.1    Noh, H.2    Han, B.3
  • 72
  • 74
    • 27644560241 scopus 로고    scopus 로고
    • The redundant discrete wavelet transform and additive noise
    • Sep.
    • J. E. Fowler, "The redundant discrete wavelet transform and additive noise, " IEEE Signal Process. Lett., vol. 12, no. 9, pp. 629-632, Sep. 2005.
    • (2005) IEEE Signal Process. Lett. , vol.12 , Issue.9 , pp. 629-632
    • Fowler, J.E.1
  • 75
    • 0025244687 scopus 로고
    • Multirate digital filters, filter banks, polyphase networks, and applications: A tutorial
    • Jan.
    • P. P. Vaidyanathan, "Multirate digital filters, filter banks, polyphase networks, and applications: a tutorial, " Proc. IEEE, vol. 78, no. 1, pp. 56-93, Jan. 1990.
    • (1990) Proc. IEEE , vol.78 , Issue.1 , pp. 56-93
    • Vaidyanathan, P.P.1
  • 82
    • 0026938667 scopus 로고
    • The discrete wavelet transform: Wedding the a trous and Mallat algorithms
    • Oct.
    • M. J. Shensa, "The discrete wavelet transform: Wedding the a trous and Mallat algorithms, " IEEE Trans. Signal Process., vol. 40, no. 10, pp. 2464-2482, Oct. 1992.
    • (1992) IEEE Trans. Signal Process. , vol.40 , Issue.10 , pp. 2464-2482
    • Shensa, M.J.1
  • 84
    • 77952844108 scopus 로고    scopus 로고
    • Fast high-dimensional filtering using the permutohedral lattice
    • A. Adams, J. Baek, and M. A. Davis, "Fast high-dimensional filtering using the permutohedral lattice, " in Eurographics, vol. 29, pp. 753-762, 2010.
    • (2010) Eurographics , vol.29 , pp. 753-762
    • Adams, A.1    Baek, J.2    Davis, M.A.3
  • 87
    • 84906493406 scopus 로고    scopus 로고
    • Microsoft COCO: Common objects in context
    • T.-Y. Lin, et al., "Microsoft COCO: Common objects in context, " in Proc. Eur. Conf. Comput. Vis., 2014, pp. 740-755.
    • (2014) Proc. Eur. Conf. Comput. Vis. , pp. 740-755
    • Lin, T.-Y.1
  • 98
    • 84959238467 scopus 로고    scopus 로고
    • Semantic part segmentation using compositional model combining shape and appearance
    • J. Wang and A. Yuille, "Semantic part segmentation using compositional model combining shape and appearance, " in Proc. IEEE Conf. Comput. Vis. Pattern Recog., 2015.
    • (2015) Proc. IEEE Conf. Comput. Vis. Pattern Recog.
    • Wang, J.1    Yuille, A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.