



Volume 40, Issue 1, 2014, Pages 1-9

Parallel L-BFGS-B algorithm on GPU

Author keywords

CVT; GPU; L-BFGS-B; Nonlinear optimization

Indexed keywords

COMPUTER GRAPHICS; COMPUTER GRAPHICS EQUIPMENT; CONJUGATE GRADIENT METHOD; NONLINEAR PROGRAMMING; PROGRAM PROCESSORS; VARIABLE SPEED TRANSMISSIONS;

EID: 84893541927     PISSN: 00978493     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.cag.2014.01.002     Document Type: Article
Times cited : (55)

References (60)
  • 2
    • 33646887390
    • On the limited memory BFGS method for large scale optimization
    • D.C. Liu, and J. Nocedal On the limited memory BFGS method for large scale optimization Math Program 45 1 1989 503 528 (Pubitemid 20660315)
    • (1989) Mathematical Programming, Series B , vol.45 , Issue.3 , pp. 503-528
    • Liu, D.C.1    Nocedal, J.2
  • 3
    • 0000732463
    • A limited memory algorithm for bound constrained optimization
    • R.H. Byrd, P. Lu, J. Nocedal, and C. Zhu A limited memory algorithm for bound constrained optimization SIAM J Sci Comput 16 5 1995 1190 1208
    • (1995) SIAM J Sci Comput , vol.16 , Issue.5 , pp. 1190-1208
    • Byrd, R.H.1    Lu, P.2    Nocedal, J.3    Zhu, C.4
  • 4
    • 0031345518
    • Algorithm 778: L-BFGS-B: Fortran Subroutines for Large-Scale Bound-Constrained Optimization
    • C. Zhu, R.H. Byrd, P. Lu, and J. Nocedal Algorithm 778 L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization ACM Trans Math Softw 23 4 1997 550 560 (Pubitemid 127447768)
    • (1997) ACM Transactions on Mathematical Software , vol.23 , Issue.4 , pp. 550-560
    • Zhu, C.1    Byrd, R.H.2    Lu, P.3    Nocedal, J.4
  • 6
    • 0000169232
    • An algorithm for least-squares estimation of nonlinear parameters
    • D.W. Marquardt An algorithm for least-squares estimation of nonlinear parameters SIAM J Soc Ind Appl Math 11 2 1963 431 441
    • (1963) SIAM J Soc Ind Appl Math , vol.11 , Issue.2 , pp. 431-441
    • Marquardt, D.W.1
  • 7
    • 70049086698
    • On centroidal Voronoi tessellation - Energy smoothness and fast computation
    • Y. Liu, W. Wang, B. Lévy, F. Sun, D. Yan, and L. Lu et al. On centroidal Voronoi tessellation - energy smoothness and fast computation ACM Trans Graph 28 4 2009 101
    • (2009) ACM Trans Graph , vol.28 , Issue.4 , pp. 101
    • Liu, Y.1    Wang, W.2    Lévy, B.3    Sun, F.4    Yan, D.5    Lu, L.6
  • 12
    • 17044416773
    • Centroidal Voronoi tessellation based algorithms for vector fields visualization and segmentation
    • IEEE Visualization 2004 - Proceedings, VIS 2004
    • Du Q, Wang X. Centroidal Voronoi tessellation based algorithms for vector fields visualization and segmentation. In: Proceedings of Vis 04. IEEE; 2004. p. 43-50. (Pubitemid 40497906)
    • (2004) IEEE Visualization 2004 - Proceedings, VIS 2004 , pp. 43-50
    • Du, Q.1    Wang, X.2
  • 13
    • 0033317391
    • Centroidal Voronoi tessellations: Applications and algorithms
    • Q. Du, V. Faber, and M. Gunzburger Centroidal Voronoi tessellations applications and algorithms SIAM Rev 41 4 1999 637 676 (Pubitemid 30536738)
    • (1999) SIAM Review , vol.41 , Issue.4 , pp. 637-676
    • Du, Q.1    Faber, V.2    Gunzburger, M.3
  • 14
    • 33645231106
    • Centroidal Voronoi tessellation algorithms for image compression, segmentation, and multichannel restoration
    • Q. Du, M. Gunzburger, L. Ju, and X. Wang Centroidal Voronoi tessellation algorithms for image compression, segmentation, and multichannel restoration J Math Imaging Vis 24 2 2006 177 194
    • (2006) J Math Imaging Vis , vol.24 , Issue.2 , pp. 177-194
    • Du, Q.1    Gunzburger, M.2    Ju, L.3    Wang, X.4
  • 15
    • 68149098829
    • An edge-weighted centroidal Voronoi tessellation model for image segmentation
    • J. Wang, L. Ju, and X. Wang An edge-weighted centroidal Voronoi tessellation model for image segmentation IEEE Trans Image Process 18 8 2009 1844 1858
    • (2009) IEEE Trans Image Process , vol.18 , Issue.8 , pp. 1844-1858
    • Wang, J.1    Ju, L.2    Wang, X.3
  • 17
    • 19644365876
    • Anisotropic centroidal voronoi tessellations and their applications
    • DOI 10.1137/S1064827503428527
    • Q. Du, and D. Wang Anisotropic centroidal Voronoi tessellations and their applications SIAM J Sci Comput 26 3 2005 737 761 (Pubitemid 40738105)
    • (2005) SIAM Journal on Scientific Computing , vol.26 , Issue.3 , pp. 737-761
    • Du, Q.1    Wang, D.2
  • 18
    • 77956357359
    • Lp centroidal Voronoi tessellation and its applications
    • B. Lévy, and Y. Liu Lp centroidal Voronoi tessellation and its applications ACM Trans Graph 29 4 2010 119
    • (2010) ACM Trans Graph , vol.29 , Issue.4 , pp. 119
    • Lévy, B.1    Liu, Y.2
  • 19
    • 0141839118
    • Beyond stippling methods for distributing objects on the plane
    • S. Hiller, H. Hellwig, and O. Deussen Beyond stippling methods for distributing objects on the plane Comput Graph Forum 22 3 2003 515 522
    • (2003) Comput Graph Forum , vol.22 , Issue.3 , pp. 515-522
    • Hiller, S.1    Hellwig, H.2    Deussen, O.3
  • 21
    • 37249078696
    • Digital mosaic frameworks - An overview
    • DOI 10.1111/j.1467-8659.2007.01021.x
    • Battiato S, Di Blasi G, Farinella GM, Gallo G. Digital mosaic frameworks - an overview. In: Comput graph forum, vol. 26. Wiley Online Library; 2007, p. 794-812. (Pubitemid 350274204)
    • (2007) Computer Graphics Forum , vol.26 , Issue.4 , pp. 794-812
    • Battiato, S.1    Di Blasi, G.2    Farinella, G.M.3    Gallo, G.4
  • 24
    • 0242533311
    • Sparse matrix solvers on the GPU conjugate gradients and multigrid
    • J. Bolz, I. Farmer, E. Grinspun, and P. Schröder Sparse matrix solvers on the GPU conjugate gradients and multigrid ACM Trans Graph 22 3 2003 917 924
    • (2003) ACM Trans Graph , vol.22 , Issue.3 , pp. 917-924
    • Bolz, J.1    Farmer, I.2    Grinspun, E.3    Schröder, P.4
  • 25
    • 79551687914
    • GPU accelerated non-rigid registration for the evaluation of cardiac function
    • Li B, Young AA, Cowan BR. GPU accelerated non-rigid registration for the evaluation of cardiac function. In: Proceedings of MICCAI 08, 2008. p. 880-7.
    • (2008) Proceedings of MICCAI 08 , pp. 880-887
    • Li, B.1    Young, A.A.2    Cowan, B.R.3
  • 26
    • 84855220977
    • Remark on "Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound constrained optimization"
    • J.L. Morales, and J. Nocedal Remark on "Algorithm 778 L-BFGS-B: Fortran subroutines for large-scale bound constrained optimization" ACM Trans Math Softw 38 1 2011 1 4
    • (2011) ACM Trans Math Softw , vol.38 , Issue.1 , pp. 1-4
    • Morales, J.L.1    Nocedal, J.2
  • 29
    • 0000302437
    • On the local and superlinear convergence of quasi-Newton methods
    • C. Broyden, J. Dennis Jr., and J. Moré On the local and superlinear convergence of quasi-Newton methods IMA J Appl Math 12 3 1973 223 245
    • (1973) IMA J Appl Math , vol.12 , Issue.3 , pp. 223-245
    • Broyden, C.1    Dennis Jr., J.2    Moré, J.3
  • 31
    • 85086684614
    • An improved implementation of the LBFGS algorithm for automatic history matching
    • Gao G, Reynolds A. An improved implementation of the LBFGS algorithm for automatic history matching. In: Proceedings of ATCE 04, 2004. p. 1-18.
    • (2004) Proceedings of ATCE 04 , pp. 1-18
    • Gao, G.1    Reynolds, A.2
  • 32
    • 72449211086
    • A stochastic quasi-Newton method for online convex optimization
    • Schraudolph N, Yu J, Günter S. A stochastic quasi-Newton method for online convex optimization. In: Proceedings of AISTATS 07, 2007. p. 433-40.
    • (2007) Proceedings of AISTATS 07 , pp. 433-440
    • Schraudolph, N.1    Yu, J.2    Günter, S.3
  • 33
    • 84893583031
    • Liu Y. HLBFGS. 2010 {http://research.microsoft.com/en-us/UM/people/yangliu/software/HLBFGS/}.
    • (2010) HLBFGS
    • Liu, Y.1
  • 34
    • 31244437221
    • A numerical study of limited memory BFGS methods
    • J. Morales A numerical study of limited memory BFGS methods Appl Math Lett 15 4 2002 481 487
    • (2002) Appl Math Lett , vol.15 , Issue.4 , pp. 481-487
    • Morales, J.1
  • 35
    • 2942640959
    • Nonlinear optimization framework for image-based modeling on programmable graphics hardware
    • Hillesland KE, Molinov S, Grzeszczuk R. Nonlinear optimization framework for image-based modeling on programmable graphics hardware. In: ACM SIGGRAPH 05 courses, 2005.
    • (2005) ACM SIGGRAPH 05 Courses
    • Hillesland, K.E.1    Molinov, S.2    Grzeszczuk, R.3
  • 36
  • 37
    • 0242533310
    • Linear algebra operators for GPU implementation of numerical algorithms
    • J. Krüger, and R. Westermann Linear algebra operators for GPU implementation of numerical algorithms ACM Trans Graph 22 3 2003 908 916
    • (2003) ACM Trans Graph , vol.22 , Issue.3 , pp. 908-916
    • Krüger, J.1    Westermann, R.2
  • 38
    • 57849103463
    • Multigrid on GPU: Tackling power grid analysis on parallel SIMT platforms
    • Feng Z, Li P. Multigrid on GPU: tackling power grid analysis on parallel SIMT platforms. In: Proceedings of ICCAD 08, 2008. p. 647-54.
    • (2008) Proceedings of ICCAD 08 , pp. 647-654
    • Feng, Z.1    Li, P.2
  • 39
    • 77952611196
    • Concurrent number cruncher a GPU implementation of a general sparse linear solver
    • L. Buatois, G. Caumon, and B. Levy Concurrent number cruncher a GPU implementation of a general sparse linear solver Int J Parallel Emergent Distrib Syst 24 3 2009 205 223
    • (2009) Int J Parallel Emergent Distrib Syst , vol.24 , Issue.3 , pp. 205-223
    • Buatois, L.1    Caumon, G.2    Levy, B.3
  • 40
    • 77952662514
    • A parallel preconditioned conjugate gradient solver for the Poisson problem on a multi-GPU platform
    • Ament M, Knittel G, Weiskopf D, Strasser W. A parallel preconditioned conjugate gradient solver for the Poisson problem on a multi-GPU platform. In: Proceedings of PDP 10, 2010. p. 583-92.
    • (2010) Proceedings of PDP 10 , pp. 583-592
    • Ament, M.1    Knittel, G.2    Weiskopf, D.3    Strasser, W.4
  • 41
    • 79955529942
    • Enhancing the performance of conjugate gradient solvers on graphic processing units
    • M. Dehnavi, M. Fernandez, and D. Giannacopoulos Enhancing the performance of conjugate gradient solvers on graphic processing units IEEE Trans Magn 47 5 2011 1162 1165
    • (2011) IEEE Trans Magn , vol.47 , Issue.5 , pp. 1162-1165
    • Dehnavi, M.1    Fernandez, M.2    Giannacopoulos, D.3
  • 42
    • 77952956190
    • High performance conjugate gradient solver on multi-GPU clusters using hypergraph partitioning
    • A. Cevahir, A. Nukada, and S. Matsuoka High performance conjugate gradient solver on multi-GPU clusters using hypergraph partitioning Comput Sci Res Dev 25 1 2010 83 91
    • (2010) Comput Sci Res Dev , vol.25 , Issue.1 , pp. 83-91
    • Cevahir, A.1    Nukada, A.2    Matsuoka, S.3
  • 43
    • 84865545671
    • Analysis and performance estimation of the conjugate gradient method on multiple GPUs
    • M. Verschoor, and A. Jalba Analysis and performance estimation of the conjugate gradient method on multiple GPUs Parallel Comput 38 10-11 2012 552 575
    • (2012) Parallel Comput , vol.38 , Issue.10-11 , pp. 552-575
    • Verschoor, M.1    Jalba, A.2
  • 44
    • 84870693491
    • GPU accelerated nonlinear optimization in radio interferometric calibration
    • Yatawatta S, Kazemi S, Zaroubi S. GPU accelerated nonlinear optimization in radio interferometric calibration. In: Proceedings of IPC 12, 2012. p. 1-6.
    • (2012) Proceedings of IPC 12 , pp. 1-6
    • Yatawatta, S.1    Kazemi, S.2    Zaroubi, S.3
  • 48
    • 82955212653
    • NVIDIA Corporation
    • CUDA C programming guide. NVIDIA Corporation; 2007 {http://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html}.
    • (2007) CUDA C programming guide
  • 51
    • 84893590582
    • CUBLAS Library. NVIDIA Corporation
    • CUBLAS Library. NVIDIA Corporation; 2008 {http://docs.nvidia.com/cuda/cublas/index.html}.
    • (2008)
  • 52
    • 70350771131
    • Benchmarking GPUs to tune dense linear algebra
    • Volkov V, Demmel JW. Benchmarking GPUs to tune dense linear algebra. In: Proceedings of SC 08, 2008. p. 31:1-11.
    • (2008) Proceedings of SC 08 , pp. 31:1-11
    • Volkov, V.1    Demmel, J.W.2
  • 54
    • 84893615087
    • NVIDIA Corporation
    • Thrust. NVIDIA Corporation; 2009 {http://docs.nvidia.com/cuda/thrust/index.html}.
    • (2009) Thrust
  • 56
    • 33748688196
    • Jump flooding in GPU with applications to Voronoi diagram and distance transform
    • Rong G, Tan TS. Jump flooding in GPU with applications to Voronoi diagram and distance transform. In: Proceedings of I3D 06, 2006. p. 109-116.
    • (2006) Proceedings of I3D 06 , pp. 109-116
    • Rong, G.1    Tan, T.S.2
  • 57
    • 47849106060
    • Variants of jump flooding algorithm for computing discrete Voronoi diagrams
    • Rong G, Tan TS. Variants of jump flooding algorithm for computing discrete Voronoi diagrams. In: Proceedings of ISVD 07, 2007. p. 176-81.
    • (2007) Proceedings of ISVD 07 , pp. 176-181
    • Rong, G.1    Tan, T.S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.