Course details
Fault Tolerant Systems
SOD Acad. year 2024/2025 Summer semester
Principles of fault tolerance, data and circuit structures, coding techniques. Codes for control and correction of information, information redundancy. Linear block codes: Hamming codes, oarity codes. Matrix description of codes. Principles of finite fields construction. Cyclic codes: principles and properties, CRC, BCH and Reed-Solomon codes. Architecture of codes for Flash memories and CDROMs. Introduction to quantum computing, quantum-inspired wrror correction codes. Fault tolerance at VLSI level. Radiation safety and fault tolerance. Fault tolerant communication networks, distributed tolerant systems.
State doctoral exam - Final interview topics:
- Principe's, approaches and parameters of safe and fault tolerant systems.
- Parity codes, multidimensional parity codes, low-density parity codes, arithmentic codes, Raptor codes.
- Hamming codes, byte error correction codes, matrix notation of of coding and decoding.
- Cyclic codes, basic and fast CRC calculation.
- Galois finite field GF(n) construction, minimum polynomials.
- Construction and applications of BCH and RS codes.
- Time redundancy, radiation tolerant circuits and systems.
- Fault tolerance in VLSI structures - memories and multiprocessors, reconfiguration, fault and error containment.
- Fault tolerance in communication systems.
- Software implemented fault tolerance, Byzantine agreement.
Guarantor
Language of instruction
Completion
Time span
- 39 hrs lectures
Assessment points
- 100 pts final exam
Department
Learning objectives
To inform the students about different types of redundancy and its application for the design of computer systems being able to function correctly even under presence of faults and data errors. To give the students literary sources and principles of advanced topics in the area of fault and error tolerance for the choice of up-to-date research topics.
Skills and approaches to building fault tolerance using hardware and codes. To research new techniques and their applications.
To get know a novel approaches to ensure availability and safety of technical means.
Prerequisite knowledge and skills
Computer design and software tools.
Study literature
- Nicolaidis M.: Soft Errors in Modern Electronic Systems, Spribger, 2011
- Shokrollahi A., Luby M.: Raptor Codes, NOW Publishers, 2011
- Szefer J.: Principles of Secure Processor Architecture Design, Morgan & Claypool, 2019
Syllabus of lectures
- FT design methodology, structures and techniques.
- Error control codes. Parity codes, multidimensional parity codes, arithmetic codes.
- Residue codes, Hamming codes, sparse parity codes. Raptor codes.
- Cyclic codes, Fire codes.
- Galois fields GF(n) and their construction, BCH and Reed-Solomon codes, byte error detection.
- Time redundancy, alternating logic.
- Reliability modeling, combinatorial models, MIL-HDBK-217. Markov reliability models.
- Safe systems.
- FT architectures.
- VLSI fault tolerance. Radiation fault tolerance.
- FT in computer units, in memorie, in computer and communication systems.
- Fault tolerant and secure control systems.
- Distributed FT systems.
- Software implemented fault tolerance.
Progress assessment
Project topic selection and systematic consultatiobs.
Additional sessions after cunsultations wuth the lecturer.
Final exam, project submission and presentation.
Course inclusion in study plans
- Programme DIT, any year of study, Compulsory-Elective group O
- Programme DIT, any year of study, Compulsory-Elective group O
- Programme DIT-EN (in English), any year of study, Compulsory-Elective group O
- Programme DIT-EN (in English), any year of study, Compulsory-Elective group O
- Programme VTI-DR-4, field DVI4, any year of study, Elective
- Programme VTI-DR-4, field DVI4, any year of study, Elective
- Programme VTI-DR-4 (in English), field DVI4, any year of study, Elective