메뉴 건너뛰기




Volumn 4, Issue 4, 2007, Pages 266-279

Automated rule-based diagnosis through a distributed monitor system

Author keywords

Distributed system diagnosis; Fault injection based evaluation; Hierarchical Monitor system; Runtime monitoring

Indexed keywords

COMPUTER AIDED DIAGNOSIS; COMPUTER ARCHITECTURE; ERROR ANALYSIS; GRAPH THEORY; NETWORK PROTOCOLS;

EID: 36248945561     PISSN: 15455971     EISSN: None     Source Type: Journal    
DOI: 10.1109/TDSC.2007.70211     Document Type: Article
Times cited : (31)

References (46)
  • 2
    • 36248963536 scopus 로고    scopus 로고
    • Costs of Computer Downtime to American Businesses, FIND/SVP, 1993.
    • Costs of Computer Downtime to American Businesses, FIND/SVP, 1993.
  • 3
    • 2642518761 scopus 로고    scopus 로고
    • Failure Handling in a Reliable Multicast Protocol for Improving Buffer Utilization and Accommodating Heterogeneous Receivers
    • G. Khanna, J. Rogers, and S. Bagchi, "Failure Handling in a Reliable Multicast Protocol for Improving Buffer Utilization and Accommodating Heterogeneous Receivers," Proc. 10th IEEE Pacific Rim Dependable Computing Conf. (PRDC '04), pp. 15-24, 2004.
    • (2004) Proc. 10th IEEE Pacific Rim Dependable Computing Conf. (PRDC '04) , pp. 15-24
    • Khanna, G.1    Rogers, J.2    Bagchi, S.3
  • 5
    • 0028755449 scopus 로고
    • Observer - A Concept for Formal On-Line Validation of Distributed Systems
    • Dec
    • M. Diaz, G. Juanole, and J.-P. Courtiat, "Observer - A Concept for Formal On-Line Validation of Distributed Systems," IEEE Trans. Software Eng., vol. 20, no. 12, pp. 900-913, Dec. 1994.
    • (1994) IEEE Trans. Software Eng , vol.20 , Issue.12 , pp. 900-913
    • Diaz, M.1    Juanole, G.2    Courtiat, J.-P.3
  • 10
    • 0030102105 scopus 로고    scopus 로고
    • Unreliable Failure Detectors for Reliable Distributed Systems
    • T. Chandra and S. Toueg, "Unreliable Failure Detectors for Reliable Distributed Systems," J. ACM, vol. 43, no. 2, pp. 225-267, 1996.
    • (1996) J. ACM , vol.43 , Issue.2 , pp. 225-267
    • Chandra, T.1    Toueg, S.2
  • 11
    • 0022144724 scopus 로고
    • Asynchronous Consensus and Broadcast Protocols
    • G. Bracha and S. Toueg, "Asynchronous Consensus and Broadcast Protocols," J. ACM, vol. 32, no. 4, pp. 824-840, 1985.
    • (1985) J. ACM , vol.32 , Issue.4 , pp. 824-840
    • Bracha, G.1    Toueg, S.2
  • 16
    • 0029516767 scopus 로고
    • Schemes for Fault Identification in Communication Networks
    • I. Katzela and M. Schwartz, "Schemes for Fault Identification in Communication Networks," IEEE/ACM Trans. Networking, vol. 3, no. 6, pp. 753-764, 1995.
    • (1995) IEEE/ACM Trans. Networking , vol.3 , Issue.6 , pp. 753-764
    • Katzela, I.1    Schwartz, M.2
  • 17
    • 84938017623 scopus 로고
    • On the Connection Assignment Problem of Diagnosable Systems
    • F.P. Preparata, G. Metze, and R.T. Chien, "On the Connection Assignment Problem of Diagnosable Systems," IEEE Trans. Electronic Computers, vol. 16, no. 6, pp. 848-854, 1967.
    • (1967) IEEE Trans. Electronic Computers , vol.16 , Issue.6 , pp. 848-854
    • Preparata, F.P.1    Metze, G.2    Chien, R.T.3
  • 18
    • 0016926333 scopus 로고
    • On Models for Diagnosable Systems and Probabilistic Fault Diagnosis
    • S. Maheshwari and S. Hakimi, "On Models for Diagnosable Systems and Probabilistic Fault Diagnosis," IEEE Trans. Computers, vol. 25, pp. 228-236, 1976.
    • (1976) IEEE Trans. Computers , vol.25 , pp. 228-236
    • Maheshwari, S.1    Hakimi, S.2
  • 20
    • 0027610755 scopus 로고
    • The Consensus Problem in Fault-Tolerant Computing
    • June
    • M. Barborak, A. Dahbura, and M. Malek, "The Consensus Problem in Fault-Tolerant Computing," ACM Computing Surveys, vol. 25, no. 2, pp. 171-220, June 1993.
    • (1993) ACM Computing Surveys , vol.25 , Issue.2 , pp. 171-220
    • Barborak, M.1    Dahbura, A.2    Malek, M.3
  • 22
    • 0023345716 scopus 로고
    • Measurement-Based Analysis of Error Latency
    • May
    • R. Chillarege and R.K. Iyer, "Measurement-Based Analysis of Error Latency," IEEE Trans. Computers, vol. 36, no. 5, May 1987.
    • (1987) IEEE Trans. Computers , vol.36 , Issue.5
    • Chillarege, R.1    Iyer, R.K.2
  • 23
    • 0028452737 scopus 로고
    • On Probabilistic Diagnosis of Multi-processor Systems Using Multiple Syndromes
    • June
    • S. Lee and K.G. Shin, "On Probabilistic Diagnosis of Multi-processor Systems Using Multiple Syndromes," IEEE Trans. Parallel and Distributed Systems, vol. 5, no. 6, pp. 630-638, June 1994.
    • (1994) IEEE Trans. Parallel and Distributed Systems , vol.5 , Issue.6 , pp. 630-638
    • Lee, S.1    Shin, K.G.2
  • 24
    • 0022719806 scopus 로고
    • Dependable Computing: From Concepts to Design Diversity
    • A. Avizienis and J.-C. Laprie, "Dependable Computing: From Concepts to Design Diversity," Proc. IEEE, vol. 74, no. 5, pp. 629-638, 1986.
    • (1986) Proc. IEEE , vol.74 , Issue.5 , pp. 629-638
    • Avizienis, A.1    Laprie, J.-C.2
  • 27
    • 84952324793 scopus 로고    scopus 로고
    • An Active Approach to Characterizing Dynamic Dependencies for Problem Determination in a Distributed Environment
    • A. Brown, G. Kar, and A. Keller, "An Active Approach to Characterizing Dynamic Dependencies for Problem Determination in a Distributed Environment," Proc. Int'l Symp. Integrated Network Management (IM '01), 2001.
    • (2001) Proc. Int'l Symp. Integrated Network Management (IM '01)
    • Brown, A.1    Kar, G.2    Keller, A.3
  • 30
    • 0031672517 scopus 로고    scopus 로고
    • A Hierarchical Adaptive Distributed System-Level Diagnosis Algorithm
    • Jan
    • E.P. Duarte and T. Nanya, "A Hierarchical Adaptive Distributed System-Level Diagnosis Algorithm," IEEE Trans. Computers, vol. 47, no. 1, pp. 34-45, Jan. 1998.
    • (1998) IEEE Trans. Computers , vol.47 , Issue.1 , pp. 34-45
    • Duarte, E.P.1    Nanya, T.2
  • 31
    • 35248841292 scopus 로고    scopus 로고
    • A General-Purpose Algorithm for Quantitative Diagnosis of Performance Problems
    • J.L. Hellerstein, "A General-Purpose Algorithm for Quantitative Diagnosis of Performance Problems," J. Network and Systems Management, 2003.
    • (2003) J. Network and Systems Management
    • Hellerstein, J.L.1
  • 37
    • 0017996760 scopus 로고
    • Time, Clocks, and the Ordering of Events in a Distributed System
    • July
    • L. Lamport, "Time, Clocks, and the Ordering of Events in a Distributed System," Comm. ACM, vol. 21, no. 7, pp. 558-565, July 1978.
    • (1978) Comm. ACM , vol.21 , Issue.7 , pp. 558-565
    • Lamport, L.1
  • 39
    • 0027633047 scopus 로고
    • Optimal and Efficient Probabilistic Distributed Diagnosis Schemes
    • July
    • S. Lee and K. Shin, "Optimal and Efficient Probabilistic Distributed Diagnosis Schemes," IEEE Trans. Computers, vol. 42, no. 7, pp. 882-886, July 1993.
    • (1993) IEEE Trans. Computers , vol.42 , Issue.7 , pp. 882-886
    • Lee, S.1    Shin, K.2
  • 41
    • 33746780102 scopus 로고    scopus 로고
    • Astrolabe: A Robust and Scalable Technology for Distributed System Monitoring, Management, and Data Mining
    • R.V. Renesse, K.P. Birman, and W. Vogels, "Astrolabe: A Robust and Scalable Technology for Distributed System Monitoring, Management, and Data Mining," ACM Trans. Computer Systems, vol. 21, no. 2, pp. 164-206, 2003.
    • (2003) ACM Trans. Computer Systems , vol.21 , Issue.2 , pp. 164-206
    • Renesse, R.V.1    Birman, K.P.2    Vogels, W.3
  • 43
    • 0010644927 scopus 로고    scopus 로고
    • Automatic Generation of Software Tests from Formal Specifications,
    • PhD dissertation, The Queen's Univ. of Belfast
    • C. Meudec, "Automatic Generation of Software Tests from Formal Specifications," PhD dissertation, The Queen's Univ. of Belfast, 1997.
    • (1997)
    • Meudec, C.1
  • 44
    • 0042078549 scopus 로고    scopus 로고
    • A Survey of Rollback-Recovery Protocols in Message-Passing Systems
    • Sept
    • E.N. Etnozahy, L. Alvisi, Y.M. Wang, and D.B. Johnson, "A Survey of Rollback-Recovery Protocols in Message-Passing Systems," ACM Computing Surveys, vol. 34, no. 3, pp. 375-408, Sept. 2002.
    • (2002) ACM Computing Surveys , vol.34 , Issue.3 , pp. 375-408
    • Etnozahy, E.N.1    Alvisi, L.2    Wang, Y.M.3    Johnson, D.B.4
  • 45
    • 12244279838 scopus 로고
    • Detecting Causal Relationships in Distributed Computations: In Search of the Holy Grail
    • R. Schwarz and F. Mattern, "Detecting Causal Relationships in Distributed Computations: In Search of the Holy Grail," Distributed Computing, vol. 7, no. 3, pp. 149-174, 1994.
    • (1994) Distributed Computing , vol.7 , Issue.3 , pp. 149-174
    • Schwarz, R.1    Mattern, F.2
  • 46
    • 0001811152 scopus 로고
    • Detecting Global States of Distributed System: Fundamental Concepts and Mechanisms
    • Addison-Wesley, pp
    • O. Babaoglu and K. Marzullo, "Detecting Global States of Distributed System: Fundamental Concepts and Mechanisms," Distributed Systems, Addison-Wesley, pp. 55-96, 1993.
    • (1993) Distributed Systems , pp. 55-96
    • Babaoglu, O.1    Marzullo, K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.