18-849 Reading List Fall 2008

Course Home Page


Course Introduction & Example Paper Discussion

Required:

Note: Read Wallace & Kuhn before reading Sullivan & Chillarege.

Supplemental:


Case Studies

Required:

Supplemental:


HW Fault Tolerance & Dependability Modeling

Required:

Supplemental:

Pending:


Fault Tolerant Middleware

Required:

Supplemental:


Distributed Time

Required:

Supplemental:


SW Fault tolerance: rollback/rollforward

Required:

Supplemental


SW Fault Tolerance: other techniques

Required:

Other High-Level Discussions

Supplemental:

See also: Exception handling; Fault Injection


Exception Handling

Required:

Supplemental:

Other sources:


Byzantine Agreement

Required:

Part 1:

Part 2:

Highly Recommended:

Supplemental:


Group Membership

Required:

Recommended:

Supplemental:


Synchronous/Asynchronous Systems

Required:

Supplemental:


Exception Handling Studies

Required:

Supplemental:


Dependable Upgrade

Required:

Supplemental:


Multi-Version Software

Required:

Recommended:

Supplemental:

Other sources:


Fault Injection

Required:

Supplemental:


Boeing 777: Case Study

Required:

Supplemental:


Graceful Degradation & Self-Healing Systems

Required:

Supplemental:

ALSO: see papers on "self-healing systems" although as of the last update this area was not quite mature enough to have generated a paper for inclusion in this list. A more recent term being used is "autonomic systems" although those are mostly in the IT space and not in the embedded system area.


Robustness Testing

Required:

Supplemental:

Other sources:


Distributed Real Time Computing & Scheduling

Required:

Other High-Level Summaries:

Supplemental:

Other Sources:


Verification & Validation

Required:

Supplemental:

Supplemental Formal Methods papers:

Other sources:


Case Study: Three Mile Island

Required:

Recommended:


User Interface & Operator Errors

Required:

Supplemental:

Other Reading:


Software Safety

Required:

Other High-Level Summaries:

Supplemental:


Embedded Security

Required:

Supplemental:


Safety cases & risk management

Required:

You can see a rail "safety case" at: http://www.tfl.gov.uk/tube/company/reports/safety-case.asp

Supplemental:

Risk Management Tools


Societal Issues

Required:

Supplemental:

Other References:

http://www.badsoftware.com/ has several papers that talk about UCITA, which is an attempt to regulate software that will have an effect on embedded system software.


TTP & Automotive Protocols: Case Study

Required:

Supplemental:


Very Dependable Systems

Required:

Perrow paper: http://www3.interscience.wiley.com/journal/119973206/abstract

Supplemental:


Miscellaneous

Error Coding