Volume 43, Issue 2, 1996, Pages 225-267

Unreliable failure detectors for reliable distributed systems

Author keywords

Agreement problem; Algorithms; Asynchronous systems; Atomic broadcast; Byzantine Generals' problem; Commit problem; Consensus problem; Crash failures; Failure detection; Fault tolerance; Message passing; Partial synchrony; Processor failures; Reliability; Theory

Indexed keywords

ALGORITHMS; AUTOMATA THEORY; COMPUTER OPERATING SYSTEMS; CONCURRENT ENGINEERING; DATA PROCESSING; DATABASE SYSTEMS; FAULT TOLERANT COMPUTER SYSTEMS; RELIABILITY;

EID: 0030102105     PISSN: 00045411     EISSN: None     Source Type: Journal    
DOI: 10.1145/226643.226647     Document Type: Article
Times cited: 1835

References (47)
  • 3
    • 6144272372
    • Bounds on the time to reach agreement in the presence of timing uncertainty
    • (New Orleans, La., May 6-8). ACM, New York
    • ATTIYA, H., DWORK, C., LYNCH, N., AND STOCKMEYER, L. 1991. Bounds on the time to reach agreement in the presence of timing uncertainty. In Proceedings of the 23rd ACM Symposium on Theory of Computing (New Orleans, La., May 6-8). ACM, New York, pp. 359-369.
    • (1991) Proceedings of the 23rd ACM Symposium on Theory of Computing , pp. 359-369
    • Attiya, H.1    Dwork, C.2    Lynch, N.3    Stockmeyer, L.4
  • 4
    • 0020976844
    • Another advantage of free choice: Completely asynchronous agreement protocols
    • (Montreal, Que., Canada, Aug. 17-19). ACM, New York
    • BEN-OR, M. 1983. Another advantage of free choice: Completely asynchronous agreement protocols. In Proceedings of the 2nd ACM Symposium on Principles of Distributed Computing (Montreal, Que., Canada, Aug. 17-19). ACM, New York, pp. 27-30.
    • (1983) Proceedings of the 2nd ACM Symposium on Principles of Distributed Computing , pp. 27-30
    • Ben-Or, M.1
  • 6
    • 85032186474
    • A combinatorial characterization of the distributed tasks which are solvable in the presence of one faulty processor
    • (Toronto, Ont., Canada, Aug. 15-17). ACM, New York
    • BIRAN, O., MORAN, S., AND ZAKS, S. 1988. A combinatorial characterization of the distributed tasks which are solvable in the presence of one faulty processor. In Proceedings of the 7th ACM Symposium on Principles of Distributed Computing (Toronto, Ont., Canada, Aug. 15-17). ACM, New York, pp. 263-275.
    • (1988) Proceedings of the 7th ACM Symposium on Principles of Distributed Computing , pp. 263-275
    • Biran, O.1    Moran, S.2    Zaks, S.3
  • 8
    • 0023287946
    • Reliable communication in the presence of failures
    • Feb.
    • BIRMAN, K. P., AND JOSEPH, T. A. 1987. Reliable communication in the presence of failures. ACM Trans. Comput. Syst. 5, 1 (Feb.), 47-76.
    • (1987) ACM Trans. Comput. Syst. , vol.5 , Issue.1 , pp. 47-76
    • Birman, K.P.1    Joseph, T.A.2
  • 9
    • 0022144724
    • Asynchronous consensus and broadcast protocols
    • Oct.
    • BRACHA, G., AND TOUEG, S. 1985. Asynchronous consensus and broadcast protocols. J. ACM 32, 4 (Oct.), 824-840.
    • (1985) J. ACM , vol.32 , Issue.4 , pp. 824-840
    • Bracha, G.1    Toueg, S.2
  • 10
    • 85032206498
    • Fault-tolerant decision making in totally asynchronous distributed systems
    • (Vancouver, B.C., Canada, Aug. 10-12). ACM, New York
    • BRIDGLAND, M. F., AND WATRO, R. J. 1987. Fault-tolerant decision making in totally asynchronous distributed systems. In Proceedings of the 6th ACM Symposium on Principles of Distributed Computing (Vancouver, B.C., Canada, Aug. 10-12). ACM, New York, pp. 52-63.
    • (1987) Proceedings of the 6th ACM Symposium on Principles of Distributed Computing , pp. 52-63
    • Bridgland, M.F.1    Watro, R.J.2
  • 12
    • 0026966839
    • The weakest failure detector for solving consensus
    • Technical Report 92-1293 (July), Department of Computer Science, Cornell University. (Vancouver, B.C., Canada, Aug. 10-12). ACM, New York
    • CHANDRA, T. D., HADZILACOS, V., AND TOUEG, S. 1992. The weakest failure detector for solving consensus. Technical Report 92-1293 (July), Department of Computer Science, Cornell University. Available from ftp://ftp.cs.cornell.edu/pub/chandra/failure.detectors.weakest.dvi.Z. A preliminary version appeared in the Proceedings of the 11th ACM Symposium on Principles of Distributed Computing (Vancouver, B.C., Canada, Aug. 10-12). ACM, New York, pp. 147-158.
    • (1992) Proceedings of the 11th ACM Symposium on Principles of Distributed Computing , pp. 147-158
    • Chandra, T.D.1    Hadzilacos, V.2    Toueg, S.3
  • 14
    • 6144246271
    • CHANDRA, T. D., AND LARREA, M. 1994. E-mail correspondence. Showed that ◇W cannot be used to solve non-blocking atomic commit.
  • 16
    • 84976768445
    • Reliable broadcast protocols
    • Aug.
    • CHANG, J., AND MAXEMCHUK, N. 1984. Reliable broadcast protocols. ACM Trans. Comput. Syst. 2, 3 (Aug.), 251-273.
    • (1984) ACM Trans. Comput. Syst. , vol.2 , Issue.3 , pp. 251-273
    • Chang, J.1    Maxemchuk, N.2
  • 17
    • 0012242251
    • Randomization in Byzantine agreement
    • CHOR, B., AND DWORK, C. 1989. Randomization in Byzantine agreement. Adv. Comput. Res. 5, 443-497.
    • (1989) Adv. Comput. Res. , vol.5 , pp. 443-497
    • Chor, B.1    Dwork, C.2
  • 18
    • 6144289783
    • Issues in the design of highly available computing services
    • July. Also IBM Res. Rep. RJ5856. Thomas J. Watson Research Center, Hawthorne, N.Y.
    • CRISTIAN, F. 1987. Issues in the design of highly available computing services. In Annual Symposium of the Canadian Information Processing Society (July), pp. 9-16. Also IBM Res. Rep. RJ5856. Thomas J. Watson Research Center, Hawthorne, N.Y.
    • (1987) Annual Symposium of the Canadian Information Processing Society , pp. 9-16
    • Cristian, F.1
  • 19
    • 0022246697
    • Atomic broadcast: From simple message diffusion to Byzantine agreement
    • June 1985. A revised version appears as IBM Research Laboratory Technical Report RJ5244 (April 1989). Thomas J. Watson Research Center, Hawthorne, N.Y.
    • CRISTIAN, F., AGHILI, H., STRONG, R., AND DOLEV, D. 1985/1989. Atomic broadcast: From simple message diffusion to Byzantine agreement. In Proceedings of the 15th International Symposium on Fault-Tolerant Computing (June 1985), pp. 200-206. A revised version appears as IBM Research Laboratory Technical Report RJ5244 (April 1989). Thomas J. Watson Research Center, Hawthorne, N.Y.
    • (1985) Proceedings of the 15th International Symposium on Fault-Tolerant Computing , pp. 200-206
    • Cristian, F.1    Aghili, H.2    Strong, R.3    Dolev, D.4
  • 20
  • 21
    • 0023168356
    • On the minimal synchronism needed for distributed consensus
    • Jan.
    • DOLEV, D., DWORK, C., AND STOCKMEYER, L. 1987. On the minimal synchronism needed for distributed consensus. J. ACM 34, 1 (Jan.), 77-97.
    • (1987) J. ACM , vol.34 , Issue.1 , pp. 77-97
    • Dolev, D.1    Dwork, C.2    Stockmeyer, L.3
  • 22
    • 0022754708
    • Reaching approximate agreement in the presence of faults
    • July
    • DOLEV, D., LYNCH, N. A., PINTER, S. S., STARK, E. W., AND WEIHL, W. E. 1986. Reaching approximate agreement in the presence of faults. J. ACM 33, 3 (July), 499-516.
    • (1986) J. ACM , vol.33 , Issue.3 , pp. 499-516
    • Dolev, D.1    Lynch, N.A.2    Pinter, S.S.3    Stark, E.W.4    Weihl, W.E.5
  • 23
    • 0023994903
    • Consensus in the presence of partial synchrony
    • Apr.
    • DWORK, C., LYNCH, N. A., AND STOCKMEYER, L. 1988. Consensus in the presence of partial synchrony. J. ACM 35, 2 (Apr.), 288-323.
    • (1988) J. ACM , vol.35 , Issue.2 , pp. 288-323
    • Dwork, C.1    Lynch, N.A.2    Stockmeyer, L.3
  • 25
    • 0022045868
    • Impossibility of distributed consensus with one faulty process
    • Apr.
    • FISCHER, M. J., LYNCH, N. A., AND PATERSON, M. S. 1985. Impossibility of distributed consensus with one faulty process. J. ACM 32, 2 (Apr.), 374-382.
    • (1985) J. ACM , vol.32 , Issue.2 , pp. 374-382
    • Fischer, M.J.1    Lynch, N.A.2    Paterson, M.S.3
  • 27
    • 84947902519
    • Revisiting the relationship between non blocking atomic commitment and consensus
    • (Sept.). Springer-Verlag, New York
    • GUERRAOUI, R. 1995. Revisiting the relationship between non blocking atomic commitment and consensus. In Proceedings of the 9th International Workshop on Distributed Algorithms (Sept.). Springer-Verlag, New York, pp. 87-100.
    • (1995) Proceedings of the 9th International Workshop on Distributed Algorithms , pp. 87-100
    • Guerraoui, R.1
  • 28
    • 0003070483
    • Fault-tolerant broadcasts and related problems
    • Chap. 5, S. J. MULLENDER, Ed., Addison-Wesley, Reading, Mass.
    • HADZILACOS, V., AND TOUEG, S. 1993. Fault-tolerant broadcasts and related problems. In Distributed Systems, Chap. 5, S. J. MULLENDER, Ed., Addison-Wesley, Reading, Mass., pp. 97-145.
    • (1993) Distributed Systems , pp. 97-145
    • Hadzilacos, V.1    Toueg, S.2
  • 29
    • 0003806377
    • Tech. Rep. 94-1425 (May), Computer Science Department, Cornell University, Ithaca, N.Y.
    • HADZILACOS, V., AND TOUEG, S. 1994. A modular approach to fault-tolerant broadcasts and related problems. Tech. Rep. 94-1425 (May), Computer Science Department, Cornell University, Ithaca, N.Y. Available by anonymous ftp from ftp://ftp.db.toronto.edu/pub/vassos/fault.tolerant.broadcasts.dvi.Z. (An earlier version is also available in Hadzilacos and Toueg [1993]).
    • (1994) A Modular Approach to Fault-tolerant Broadcasts and Related Problems
    • Hadzilacos, V.1    Toueg, S.2
  • 30
    • 0025460546
    • Knowledge and common knowledge in a distributed environment
    • July
    • HALPERN, J. Y., AND MOSES, Y. 1990. Knowledge and common knowledge in a distributed environment. J. ACM 37, 3 (July), 549-587.
    • (1990) J. ACM , vol.37 , Issue.3 , pp. 549-587
    • Halpern, J.Y.1    Moses, Y.2
  • 31
    • 0017972109
    • The implementation of reliable distributed multiprocess systems
    • LAMPORT, L. 1978. The implementation of reliable distributed multiprocess systems. Comput. Netw. 2, 95-114.
    • (1978) Comput. Netw. , vol.2 , pp. 95-114
    • Lamport, L.1
  • 33
    • 57649094701
    • Using failure detectors to solve consensus in asynchronous shared-memory systems
    • (Sept.), Springer-Verlag, New York
    • LO, W. K., AND HADZILACOS, V. 1994. Using failure detectors to solve consensus in asynchronous shared-memory systems. In Proceedings of the 8th International Workshop on Distributed Algorithms (Sept.), Springer-Verlag, New York, pp. 280-295. Available from ftp://ftp.db.toronto.edu/pub/vassos/failure.detectors.shared.memory.ps.Z.
    • (1994) Proceedings of the 8th International Workshop on Distributed Algorithms , pp. 280-295
    • Lo, W.K.1    Hadzilacos, V.2
  • 34
    • 0001662644
    • Memory requirements for agreement among unreliable asynchronous processes
    • LOUI, M., AND ABU-AMARA. 1987. Memory requirements for agreement among unreliable asynchronous processes. Adv. Comput. Res. 4, 163-183.
    • (1987) Adv. Comput. Res. , vol.4 , pp. 163-183
    • Loui, M.1    Abu-Amara2
  • 35
    • 0022562250
    • Cheating husbands and other stories: A case study of knowledge, action, and communication
    • MOSES, Y., DOLEV, D., AND HALPERN, J. Y. 1986. Cheating husbands and other stories: a case study of knowledge, action, and communication. Distrib. Comput. 1, 3, 167-176.
    • (1986) Distrib. Comput. , vol.1 , Issue.3 , pp. 167-176
    • Moses, Y.1    Dolev, D.2    Halpern, J.Y.3
  • 38
    • 38249018396
    • Automatically increasing the fault-tolerance of distributed algorithms
    • Sept.
    • NEIGER, G., AND TOUEG, S. 1990. Automatically increasing the fault-tolerance of distributed algorithms. J. Algorithms 11, 3 (Sept.), 374-419.
    • (1990) J. Algorithms , vol.11 , Issue.3 , pp. 374-419
    • Neiger, G.1    Toueg, S.2
  • 39
    • 84976810569
    • Reaching agreement in the presence of faults
    • Apr.
    • PEASE, M., SHOSTAK, R., AND LAMPORT, L. 1980. Reaching agreement in the presence of faults. J. ACM 27, 2 (Apr.), 228-234.
    • (1980) J. ACM , vol.27 , Issue.2 , pp. 228-234
    • Pease, M.1    Shostak, R.2    Lamport, L.3
  • 40
    • 0024716226
    • Preserving and using context information in interprocess communication
    • Aug.
    • PETERSON, L. L., BUCHOLZ, N. C., AND SCHLICHTING, R. D. 1989. Preserving and using context information in interprocess communication. ACM Trans. Comput. Syst. 7, 3 (Aug.), 217-246.
    • (1989) ACM Trans. Comput. Syst. , vol.7 , Issue.3 , pp. 217-246
    • Peterson, L.L.1    Bucholz, N.C.2    Schlichting, R.D.3
  • 41
    • 0024606853
    • Reliable scheduling in a TMR database system
    • Feb.
    • PITTELLI, F., AND GARCIA-MOLINA, H. 1989. Reliable scheduling in a TMR database system. ACM Trans. Comput. Syst. 7, 1 (Feb.), 25-60.
    • (1989) ACM Trans. Comput. Syst. , vol.7 , Issue.1 , pp. 25-60
    • Pittelli, F.1    Garcia-Molina, H.2
  • 43
    • 6144283613
    • Tech. Rep. RJ 3673 (Nov.), IBM Research Laboratory, Thomas J. Watson Research Center, Hawthorne, N.Y.
    • REISCHUK, R. 1982. A new solution for the Byzantine general's problem. Tech. Rep. RJ 3673 (Nov.), IBM Research Laboratory, Thomas J. Watson Research Center, Hawthorne, N.Y.
    • (1982) A New Solution for the Byzantine General's Problem
    • Reischuk, R.1
  • 44
    • 84976863070
    • Using process groups to implement failure detection in asynchronous environments
    • (Montreal, Que., Canada, Aug. 19-21). ACM, New York
    • RICCIARDI, A., AND BIRMAN, K. P. 1991. Using process groups to implement failure detection in asynchronous environments. In Proceedings of the 10th ACM Symposium on Principles of Distributed Computing (Montreal, Que., Canada, Aug. 19-21). ACM, New York, pp. 341-354.
    • (1991) Proceedings of the 10th ACM Symposium on Principles of Distributed Computing , pp. 341-354
    • Ricciardi, A.1    Birman, K.P.2
  • 45
    • 0003838267
    • Tech. Rep. TR95-411 (Feb.). Univ. California at San Diego. San Diego, Calif.
    • SABEL, L., AND MARZULLO, K. 1995. Election vs. consensus in asynchronous systems. Tech. Rep. TR95-411 (Feb.). Univ. California at San Diego. San Diego, Calif. Available at ftp://ftp.cs.cornell.edu/pub/sabel/tr94-1413.ps.
    • (1995) Election vs. Consensus in Asynchronous Systems
    • Sabel, L.1    Marzullo, K.2
  • 46
    • 0025564050
    • Implementing fault-tolerant services using the state machine approach: A tutorial
    • Dec.
    • SCHNEIDER, F. B. 1990. Implementing fault-tolerant services using the state machine approach: A tutorial. ACM Comput. Surv. 22, 4 (Dec.), 299-319.
    • (1990) ACM Comput. Surv. , vol.22 , Issue.4 , pp. 299-319
    • Schneider, F.B.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.