Manuel Gericota's personal page

Design of Self-Fault-Tolerant Systems based on Self-Reconfigurable FPGAs (Field Programmable Gate Arrays)

In the last few years, Application Specific Integrated Circuits (ASICs), with their lengthy design and development time, escalating development costs and low flexibility, have seen their use restricted to high-volume manufacturing of chips requiring cutting edge-of-the-technology densities and speed processing.

Meanwhile, the exponential growth in density and performance of configurable logic devices, such as SRAM-based Field Programmable Gate Arrays (FPGAs), and the addition of new features, greatly expanded the areas where they can advantageously replace ASICs. FPGAs have lower development costs, faster time-to-market, and an unparalleled flexibility. Recently, two new features were added: dynamic reconfiguration, which allowed the implementation in real-time of dynamic resource allocation strategies, enabling multiple independent functions from different applications to share the same logic resources in the space and temporal domain; and self-reconfiguration, which added the possibility of self-adaptation of the FPGA, dynamically and almost instantaneously, to new functional requirements.

These new FPGAs, while enabling their use in a wide range of applications, such as reconfigurable hardware platforms, also create new challenges to test. The nanometre technologies used in their manufacturing increases their vulnerability to soft errors, due to environmental radiation, and make them more prone to defects emerging from small manufacturing imperfections not detected during production tests, giving rise to transient or permanent changes in the configured functionality. Therefore, their expanding use, even in critical systems, requires the design of fault tolerant circuits able to assure a high reliability and availability. This goal involves the online concurrent detection of permanent and transient faults and their masking, to avoid their propagation, while triggering a test procedure to determine their origin, either functional or structural, and to assure the repair of their cause(s), avoiding cumulative effects that may lead to a general system's failure.

Therefore, it is imperative to study the specific fault inducement mechanisms of these devices, to correlate a set of faults and, fully exploring new FPGAs' features and performance, to develop innovative test methodologies tailored to their unique architecture and to the new sort of applications they enable to implement. These methods have to be able to guarantee both fault tolerance and repair when complex functions are configured, and also to avoid previously detected structural errors when reusing the same hardware resources to implement new incoming functions required by each new application.

The incorporation of self-reconfiguration capabilities in recent FPGAs, allied to the use of a controller core, enables the development of self-contained fault tolerant reconfigurable systems, with automatic rerouting and floorplanning being performed by this controller, and thus enabling the implementation of fault detection, test and repair procedures in a transparent and autonomous way.

Related project publications:

Reliability and Availability in Reconfigurable Computing: a Basis for a Common Solution

Manuel G. Gericota, Gustavo R. Alves, Miguel L. Silva, José M. Ferreira, "Reliability and Availability in Reconfigurable Computing: a Basis for a Common Solution", in: IEEE Transactions on Very Large Scale Integration (VLSI) Systems, Vol. 16, No. 11, November 2008, pp. 1545-1558. ISSN 1063-8210

Manuel G. Gericota, Luis F. Lemos, Gustavo R. Alves, José M. Ferreira, "On-Line Self-Healing of Circuits Implemented on Reconfigurable FPGAs", Proceedings of the 13th IEEE On-Line Testing Symposium (IOLTS'2007), Hersonissos-Heraklion, Crete, Greece, July 2007, pp. 217-222. ISBN 0-7695-2918-6

Manuel G. Gericota, Luis F. Lemos, Gustavo R. Alves, José M. Ferreira, "A Framework for Self-Healing Radiation-Tolerant Implementations on Reconfigurable FPGAs", Proceedings of the 2007 IEEE Conference on Design and Diagnostics of Electronic Circuits and Systems (DDECS'2007), Krakow, Poland, April 2007, pp. 301-306. ISBN 1-4244-1161-0

Manuel G. Gericota, Luis F. Lemos, Gustavo R. Alves, José M. Ferreira, "A Framework for Implementing Radiation-Tolerant Circuits on Reconfigurable FPGAs", Proceedings of the XXI Conference on Design of Circuits and Integrated Systems (DCIS'2006), Barcelona, Spain, November 2006. ISBN 978-84-690-4144-4

Manuel G. Gericota, Luis F. Lemos, Gustavo R. Alves, Mário M. Barbosa, José M. Ferreira, "A Framework for Fault Tolerant Real-Time Systems Based on Reconfigurable FPGAs", Proceedings of the 11th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA'2006), Praga, The Czech Republic, September 2006, pp. 131-138. ISBN: 1-4244-0681-1

Manuel G. Gericota, Gustavo R. Alves, Luís F. Lemos, José M. Ferreira, "A New Approach to Assess Defragmentation Strategies in Dynamically Reconfigurable FPGAs", in: "Reconfigurable Computing: Architectures and Applications". Editors: Koen Bertels, João M. P. Cardoso, Stamatis Vassiliadis. Revised selected papers of the International Workshop on Applied Reconfigurable Computing (ARC'2006), Delft, The Netherlands, March 2006. 469 p. Lecture Notes in Computer Science 3985. Springer, March 2006. pp. 117-129. ISBN: 3-540-36708-X

Manuel G. Gericota, Gustavo R. Alves, Luís F. Lemos, José M. Ferreira, "Assessing Defragmentation Strategies for FPGAs", Actas das II Jornadas sobre Sistemas Reconfiguráveis (REC'2006), Porto, Portugal, February 2006, pp. 99-102. ISBN 972-752-084-7

Manuel G. Gericota, Gustavo R. Alves, José M. Ferreira, "Robust Configurable System Design with Built-In Self-Healing", Proceedings of the XX Conference on Design of Circuits and Integrated Systems (DCIS'2005), Lisboa, Portugal, November 2005, ISBN 972-99387-2-5

Manuel G. Gericota, Gustavo R. Alves, José M. Ferreira, "A Self-Healing Real-Time System Based on Run-Time Self-Reconfiguration", Proceedings of the 10th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA'2005), Catania, Italy, September 2005, Vol. 1, pp. 1039-1042, ISBN 0-7803-9402-X

Manuel G. Gericota, José M. Ferreira, "Restoring Reliability in Fault Tolerant Reconfigurable Systems", Actas das Jornadas sobre Sistemas Reconfiguráveis 2005 (REC'2005), Faro, Portugal, February 2005, pp. 79-82, ISBN 972-9341-41-9

André R. Guerra, José M. Ferreira, Manuel G. Gericota, "Techniques to improve the reliability of fault-tolerant systems based on self-reconfigurable FPGAs", Actas das I Jornadas sobre Sistemas Reconfiguráveis (REC'2005), Faro, Portugal, February 2005, pp. 125-126, ISBN 972-9341-41-9