메뉴 건너뛰기




Volumn 10, Issue 6, 2017, Pages 969-983

Reliable computing service in massive-scale systems through rapid low-cost failover

Author keywords

Cloud computing; Failover; Reliability; Resource management; Services

Indexed keywords

CLOUD COMPUTING; COST EFFECTIVENESS; LARGE SCALE SYSTEMS; NATURAL RESOURCES MANAGEMENT; RELIABILITY; RESOURCE ALLOCATION;

EID: 85027256100     PISSN: 19391374     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSC.2016.2544313     Document Type: Article
Times cited : (18)

References (48)
  • 1
    • 79956268427 scopus 로고    scopus 로고
    • Intercloud: Utilityoriented federation of cloud computing environments for scaling of application services
    • R. Buyya, R. Ranjan, and R. N. Calheiros, "Intercloud: Utilityoriented federation of cloud computing environments for scaling of application services," in Proc. 10th Int. Conf. Algorithms Archit. Parallel Process., 2010, pp. 13-31.
    • (2010) Proc. 10th Int. Conf. Algorithms Archit. Parallel Process. , pp. 13-31
    • Buyya, R.1    Ranjan, R.2    Calheiros, R.N.3
  • 5
    • 84905826576 scopus 로고    scopus 로고
    • Fuxi: A faulttolerant resource management and job scheduling system at internet scale
    • Z. Zhang, C. Li, Y. Tao, R. Yang, H. Tang, and J. Xu, "Fuxi: A faulttolerant resource management and job scheduling system at internet scale," in Proc. Int. Conf. Very Large Databases, 2014, pp. 1393-1404.
    • (2014) Proc. Int. Conf. Very Large Databases , pp. 1393-1404
    • Zhang, Z.1    Li, C.2    Tao, Y.3    Yang, R.4    Tang, H.5    Xu, J.6
  • 7
    • 79951839892 scopus 로고    scopus 로고
    • Fault tolerance and scaling in e-science cloud applications: Observations from the continuing development of modisazure
    • J. Li, M. Humphrey, Y.-W. Cheah, Y. Ryu, D. Agarwal, K. Jackson, and C. van Ingen, "Fault tolerance and scaling in e-science cloud applications: Observations from the continuing development of modisazure," in Proc. IEEE 6th Int. Conf. e-Sci., 2010, pp. 246-253.
    • (2010) Proc. IEEE 6th Int. Conf. E-Sci. , pp. 246-253
    • Li, J.1    Humphrey, M.2    Cheah, Y.-W.3    Ryu, Y.4    Agarwal, D.5    Jackson, K.6    Van Ingen, C.7
  • 8
    • 84978952293 scopus 로고    scopus 로고
    • Computing at massive scale: Scalability and dependability challenges
    • Oxford, U.K.
    • R. Yang and J. Xu, "Computing at massive scale: Scalability and dependability challenges," in presented at the IEEE 10th Int. Symp. Service Oriented System Engineering, Oxford, U.K., 2016.
    • (2016) IEEE 10th Int. Symp. Service Oriented System Engineering
    • Yang, R.1    Xu, J.2
  • 10
    • 0001314414 scopus 로고
    • The evolution of the recovery block concept
    • New York, NY, USA: Wiley
    • B. Randell and J. Xu, "The evolution of the recovery block concept," in Softw. Fault Tolerance, New York, NY, USA: Wiley, 1995.
    • (1995) Softw. Fault Tolerance
    • Randell, B.1    Xu, J.2
  • 14
    • 85044264755 scopus 로고    scopus 로고
    • (2013). [Online].Available: Https://issues.apache.org/jira/browse/YARN-556
    • (2013)
  • 15
    • 85044300760 scopus 로고    scopus 로고
    • (2013). [Online].Available: Https://issues.apache.org/jira/browse/YARN-1336
    • (2013)
  • 19
    • 78649459815 scopus 로고    scopus 로고
    • Cdrm: A cost-effective dynamic replication management scheme for cloud storage cluster
    • Q. Wei, B. Veeravalli, B. Gong, L. Zeng, and D. Feng, "Cdrm: A cost-effective dynamic replication management scheme for cloud storage cluster," in Proc. IEEE Int. Conf. Cluster Comput., 2010, pp. 188-196.
    • (2010) Proc. IEEE Int. Conf. Cluster Comput. , pp. 188-196
    • Wei, Q.1    Veeravalli, B.2    Gong, B.3    Zeng, L.4    Feng, D.5
  • 20
    • 80053400298 scopus 로고    scopus 로고
    • Adaptive fault tolerance in real time cloud computing
    • S. Malik and F. Huet, "Adaptive fault tolerance in real time cloud computing," in Proc. IEEE World Congr. Servi., 2011, pp. 280-287.
    • (2011) Proc. IEEE World Congr. Servi. , pp. 280-287
    • Malik, S.1    Huet, F.2
  • 23
    • 76849100508 scopus 로고    scopus 로고
    • Failure-aware resource management for high-availability computing clusters with distributed virtual machines
    • S. Fu, "Failure-aware resource management for high-availability computing clusters with distributed virtual machines," J. Parallel Distrib. Comput., vol. 70, no. 4, pp. 384-393, 2010.
    • (2010) J. Parallel Distrib. Comput. , vol.70 , Issue.4 , pp. 384-393
    • Fu, S.1
  • 24
    • 85044273960 scopus 로고    scopus 로고
    • (2013). Amazon web services suffers outage [Online]. Available: Http://www.zdnet.com/article/amazon-web-services-suffersoutage-takes-d own-vine-instagram-others-with-it/
    • (2013) Amazon Web Services Suffers Outage
  • 25
    • 0003217728 scopus 로고
    • The methodology of n-version programming
    • Hoboken, NJ, USA: Wiley
    • A. Avizienis, "The methodology of n-version programming," in Software Fault Tolerance, Hoboken, NJ, USA: Wiley, 1995.
    • (1995) Software Fault Tolerance
    • Avizienis, A.1
  • 28
    • 0029212717 scopus 로고
    • Reliability analysis of a complex standby redundant systems
    • R. Subramanian and V. Anantharaman, "Reliability analysis of a complex standby redundant systems," Rel. Eng. Syst. Safety, vol. 48, no. 1, pp. 57-70, 1995.
    • (1995) Rel. Eng. Syst. Safety , vol.48 , Issue.1 , pp. 57-70
    • Subramanian, R.1    Anantharaman, V.2
  • 30
  • 31
    • 84930247783 scopus 로고    scopus 로고
    • An analysis of failure-related energy waste in a large-scale cloud environment
    • Jun.
    • P. Garraghan, I. S. Moreno, P. Townend, and J. Xu, "An analysis of failure-related energy waste in a large-scale cloud environment," IEEE Trans. Emerging Topics Comput., vol. 2, no. 2, pp. 166-180, Jun. 2014.
    • (2014) IEEE Trans. Emerging Topics Comput. , vol.2 , Issue.2 , pp. 166-180
    • Garraghan, P.1    Moreno, I.S.2    Townend, P.3    Xu, J.4
  • 32
    • 0023090161 scopus 로고
    • Checkpointing and rollback-recovery for distributed systems
    • Jan.
    • R. Koo and S. Toueg, "Checkpointing and rollback-recovery for distributed systems," IEEE Trans. Softw. Eng., vol. SE-13, no. 1, pp. 23-31, Jan. 1987.
    • (1987) IEEE Trans. Softw. Eng. , vol.SE13 , Issue.1 , pp. 23-31
    • Koo, R.1    Toueg, S.2
  • 34
    • 84946125131 scopus 로고    scopus 로고
    • Service-oriented computing: Concepts, characteristics and directions
    • M. P. Papazoglou, "Service-oriented computing: Concepts, characteristics and directions," in Proc. 4th Int. Conf. Web Inform. Syst. Eng., 2003, 3-12.
    • (2003) Proc. 4th Int. Conf. Web Inform. Syst. Eng. , pp. 3-12
    • Papazoglou, M.P.1
  • 35
    • 0036601844 scopus 로고    scopus 로고
    • Grid services for distributed system integration
    • Jun.
    • I. Foster, C. Kesselman, J. M. Nick, and S. Tuecke, "Grid services for distributed system integration," in IEEE Comput., vol. 35, no. 6, pp. 37-46, Jun. 2002.
    • (2002) IEEE Comput. , vol.35 , Issue.6 , pp. 37-46
    • Foster, I.1    Kesselman, C.2    Nick, J.M.3    Tuecke, S.4
  • 40
    • 84988273398 scopus 로고    scopus 로고
    • Consnap: Taking continuous snapshots for running state protection of virtual machines
    • J. Li, J. Zheng, L. Cui, and R. Yang, "Consnap: Taking continuous snapshots for running state protection of virtual machines," in Proc. IEEE 20th Int. Conf. Parallel Distrib. Syst., 2014, pp. 677-684.
    • (2014) Proc. IEEE 20th Int. Conf. Parallel Distrib. Syst. , pp. 677-684
    • Li, J.1    Zheng, J.2    Cui, L.3    Yang, R.4
  • 43
    • 85031898917 scopus 로고    scopus 로고
    • Towards characterizing cloud backend workloads: Insights from google compute clusters
    • A. K. Mishra, J. L. Hellerstein, W. Cirne, and C. R. Das, "Towards characterizing cloud backend workloads: Insights from google compute clusters," ACM SIGMETRICS Perform. Eval. Rev., vol. 37, no. 4, pp. 34-41, 2010.
    • (2010) ACM SIGMETRICS Perform. Eval. Rev. , vol.37 , Issue.4 , pp. 34-41
    • Mishra, A.K.1    Hellerstein, J.L.2    Cirne, W.3    Das, C.R.4
  • 44
    • 84965042403 scopus 로고    scopus 로고
    • Analysis, modeling and simulation of workload patterns in a large-scale utility cloud
    • Apr.-Jun.
    • I. Solis Moreno, P. Garraghan, P. Townend, and J. Xu, "Analysis, modeling and simulation of workload patterns in a large-scale utility cloud," IEEE Trans. Cloud Comput., vol. 2, no. 2, pp. 208-221, Apr.-Jun. 2014.
    • (2014) IEEE Trans. Cloud Comput. , vol.2 , Issue.2 , pp. 208-221
    • Solis Moreno, I.1    Garraghan, P.2    Townend, P.3    Xu, J.4
  • 45
    • 84881145178 scopus 로고    scopus 로고
    • An analysis of the server characteristics and resource utilization in Google cloud
    • P. Garraghan, P. Townend, and J. Xu, "An analysis of the server characteristics and resource utilization in Google cloud," in Proc. IEEE Int. Conf. Cloud Eng., 2013, pp. 124-131.
    • (2013) Proc. IEEE Int. Conf. Cloud Eng. , pp. 124-131
    • Garraghan, P.1    Townend, P.2    Xu, J.3
  • 47
    • 84873622276 scopus 로고    scopus 로고
    • The tail at scale
    • J. Dean and L. A. Barroso, "The tail at scale," in ACM Commun., vol. 56, no. 2, pp. 74-80, 2013.
    • (2013) ACM Commun. , vol.56 , Issue.2 , pp. 74-80
    • Dean, J.1    Barroso, L.A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.