Adaptive reliability for fault tolerant multicore systems

Ihsen Alouani, Thomas Wild, Andreas Herkersdorf, Smail Niar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Citations (Scopus)

Abstract

In an era of continuously shrinking technology and escalating power density, Multiprocessor System on Chips (MPSoCs) suffer from a growing prominence of device defects and increase of dependability-related issues. This paper tackles the dependability challenge by suggesting an adaptive reliability enhancement strategy for multicore systems. We dynamically adapt the reliability enhancement to the actual tasks requirements as well as cores runtime operating conditions. As reliability improvement may adversely affect the parameters of embedded systems, we suggest a runtime recovery method. In fact, we implement a 3-mode mapping technique to limit redundancy overheads through judicious task migrating and dropping. Our experiments show promising results in terms of error mitigation with controllable power and thermal overheads.

Original languageEnglish
Title of host publicationProceedings of the 20th Euromicro Conference on Digital System Design, DSD 2017
EditorsHana Kubátová, Martin Novotný, Amund Skavhaug
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages538-542
Number of pages5
ISBN (Electronic)9781538621462
ISBN (Print)9781538621479
DOIs
Publication statusPublished - 28 Sept 2017
Externally publishedYes
Event20th Euromicro Conference on Digital System Design - Vienna, Austria
Duration: 30 Aug 201701 Sept 2017
https://doi.org/10.1109/DSD42682.2017

Publication series

NameEuromicro Symposium on Digital System Design: Proceedings
PublisherIEEE

Conference

Conference20th Euromicro Conference on Digital System Design
Abbreviated titleDSD
Country/TerritoryAustria
CityVienna
Period30/08/201701/09/2017
Internet address

Keywords

  • dependability
  • mapping
  • Multicore
  • reliability

ASJC Scopus subject areas

  • Computer Science Applications
  • Control and Systems Engineering
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Adaptive reliability for fault tolerant multicore systems'. Together they form a unique fingerprint.

Cite this