Book chapters details

  • Debugging of Parallel and Distributed Programs
  • Jan 2001
  • This chapter surveys the main issues involved in correctness debugging of parallel and distributed programs. Distributed debugging is an instance of the more general problem of observation of a distributed computation. This chapter briefly summarizes the theoretical foundations of the distributed debugging activity. Then a survey is presented of the main methodologies used for parallel and distributed debugging, including state and event based debugging, deterministic re-execution, systematic state exploration, and correctness predicate evaluation. Such approaches are complementary to one another, and the chapter discusses how they can be supported using distinct techniques for observation and control.
  • Parallel Program Development for Cluster Computing: Methodology, Tools and Integrated Environments
  • Nova Science
  • José Cardoso e Cunha, João Lourenço, Vítor Duarte
  • José C.Cunha, P.Kacsuk, and S.Winter
  • Advances in Computation: Theory and Practice
  • 5
  • 1-56072-865-5
  • 97 to 129
  • 1 Jan 2001