Evaluation of a Connectionless NoC for a Real-Time Distributed Shared Memory Many-Core System

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

6 Citations (Scopus)
38 Downloads (Pure)

Abstract

Real-time embedded systems like smartphones tend to comprise an ever increasing number of processing cores. For scalability and the need for guaranteed performance, the use of a connection-oriented network-on-chip (NoC) is advocated. Furthermore, a distributed shared memory architecture is preferred as it simplifies software development for a multi-core system. In this paper, experimental evidence is provided, showing that replacing a connection-oriented NoC by a connectionless one in a distributed shared memory system reduces the hardware costs and improves the performance. We observed that our FPGA could only support an 8-core system with a connection-oriented NoC. We exchanged the NoC with our tree-shaped, connectionless network and a ring, allowing a 32- core system in the same FPGA, mainly because of a reduced number of physical connections. Although the analytical worst-case performance slightly decreased, measurements show that the latency of latency-critical memory reads was reduced by 52% on average.
Original languageUndefined
Title of host publicationProceedings of the 15th Euromicro Conference on Digital System Design, DSD 2012
Place of PublicationUSA
PublisherIEEE Computer Society
Pages727-730
Number of pages4
ISBN (Print)978-0-7695-4798-5
DOIs
Publication statusPublished - 5 Sep 2012
Event15th EUROMICRO Conference on Digital System Design, DSD 2012 - Çeşme, Izmir, Turkey
Duration: 5 Sep 20128 Sep 2012
Conference number: 15
http://www.univ-valenciennes.fr/congres/dsd2012/

Publication series

Name
PublisherIEEE Computer Society

Conference

Conference15th EUROMICRO Conference on Digital System Design, DSD 2012
Abbreviated titleDSD
CountryTurkey
CityÇeşme, Izmir
Period5/09/128/09/12
Internet address

Keywords

  • IR-81381
  • EWI-22211
  • METIS-287994

Cite this

Rutgers, J. H., Bekooij, M. J. G., & Smit, G. J. M. (2012). Evaluation of a Connectionless NoC for a Real-Time Distributed Shared Memory Many-Core System. In Proceedings of the 15th Euromicro Conference on Digital System Design, DSD 2012 (pp. 727-730). USA: IEEE Computer Society. https://doi.org/10.1109/DSD.2012.54
Rutgers, J.H. ; Bekooij, Marco Jan Gerrit ; Smit, Gerardus Johannes Maria. / Evaluation of a Connectionless NoC for a Real-Time Distributed Shared Memory Many-Core System. Proceedings of the 15th Euromicro Conference on Digital System Design, DSD 2012. USA : IEEE Computer Society, 2012. pp. 727-730
@inproceedings{3c06771c43ae4937ad1d4ecf1e0f5da7,
title = "Evaluation of a Connectionless NoC for a Real-Time Distributed Shared Memory Many-Core System",
abstract = "Real-time embedded systems like smartphones tend to comprise an ever increasing number of processing cores. For scalability and the need for guaranteed performance, the use of a connection-oriented network-on-chip (NoC) is advocated. Furthermore, a distributed shared memory architecture is preferred as it simplifies software development for a multi-core system. In this paper, experimental evidence is provided, showing that replacing a connection-oriented NoC by a connectionless one in a distributed shared memory system reduces the hardware costs and improves the performance. We observed that our FPGA could only support an 8-core system with a connection-oriented NoC. We exchanged the NoC with our tree-shaped, connectionless network and a ring, allowing a 32- core system in the same FPGA, mainly because of a reduced number of physical connections. Although the analytical worst-case performance slightly decreased, measurements show that the latency of latency-critical memory reads was reduced by 52{\%} on average.",
keywords = "IR-81381, EWI-22211, METIS-287994",
author = "J.H. Rutgers and Bekooij, {Marco Jan Gerrit} and Smit, {Gerardus Johannes Maria}",
note = "10.1109/DSD.2012.54",
year = "2012",
month = "9",
day = "5",
doi = "10.1109/DSD.2012.54",
language = "Undefined",
isbn = "978-0-7695-4798-5",
publisher = "IEEE Computer Society",
pages = "727--730",
booktitle = "Proceedings of the 15th Euromicro Conference on Digital System Design, DSD 2012",
address = "United States",

}

Rutgers, JH, Bekooij, MJG & Smit, GJM 2012, Evaluation of a Connectionless NoC for a Real-Time Distributed Shared Memory Many-Core System. in Proceedings of the 15th Euromicro Conference on Digital System Design, DSD 2012. IEEE Computer Society, USA, pp. 727-730, 15th EUROMICRO Conference on Digital System Design, DSD 2012, Çeşme, Izmir, Turkey, 5/09/12. https://doi.org/10.1109/DSD.2012.54

Evaluation of a Connectionless NoC for a Real-Time Distributed Shared Memory Many-Core System. / Rutgers, J.H.; Bekooij, Marco Jan Gerrit; Smit, Gerardus Johannes Maria.

Proceedings of the 15th Euromicro Conference on Digital System Design, DSD 2012. USA : IEEE Computer Society, 2012. p. 727-730.

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

TY - GEN

T1 - Evaluation of a Connectionless NoC for a Real-Time Distributed Shared Memory Many-Core System

AU - Rutgers, J.H.

AU - Bekooij, Marco Jan Gerrit

AU - Smit, Gerardus Johannes Maria

N1 - 10.1109/DSD.2012.54

PY - 2012/9/5

Y1 - 2012/9/5

N2 - Real-time embedded systems like smartphones tend to comprise an ever increasing number of processing cores. For scalability and the need for guaranteed performance, the use of a connection-oriented network-on-chip (NoC) is advocated. Furthermore, a distributed shared memory architecture is preferred as it simplifies software development for a multi-core system. In this paper, experimental evidence is provided, showing that replacing a connection-oriented NoC by a connectionless one in a distributed shared memory system reduces the hardware costs and improves the performance. We observed that our FPGA could only support an 8-core system with a connection-oriented NoC. We exchanged the NoC with our tree-shaped, connectionless network and a ring, allowing a 32- core system in the same FPGA, mainly because of a reduced number of physical connections. Although the analytical worst-case performance slightly decreased, measurements show that the latency of latency-critical memory reads was reduced by 52% on average.

AB - Real-time embedded systems like smartphones tend to comprise an ever increasing number of processing cores. For scalability and the need for guaranteed performance, the use of a connection-oriented network-on-chip (NoC) is advocated. Furthermore, a distributed shared memory architecture is preferred as it simplifies software development for a multi-core system. In this paper, experimental evidence is provided, showing that replacing a connection-oriented NoC by a connectionless one in a distributed shared memory system reduces the hardware costs and improves the performance. We observed that our FPGA could only support an 8-core system with a connection-oriented NoC. We exchanged the NoC with our tree-shaped, connectionless network and a ring, allowing a 32- core system in the same FPGA, mainly because of a reduced number of physical connections. Although the analytical worst-case performance slightly decreased, measurements show that the latency of latency-critical memory reads was reduced by 52% on average.

KW - IR-81381

KW - EWI-22211

KW - METIS-287994

U2 - 10.1109/DSD.2012.54

DO - 10.1109/DSD.2012.54

M3 - Conference contribution

SN - 978-0-7695-4798-5

SP - 727

EP - 730

BT - Proceedings of the 15th Euromicro Conference on Digital System Design, DSD 2012

PB - IEEE Computer Society

CY - USA

ER -

Rutgers JH, Bekooij MJG, Smit GJM. Evaluation of a Connectionless NoC for a Real-Time Distributed Shared Memory Many-Core System. In Proceedings of the 15th Euromicro Conference on Digital System Design, DSD 2012. USA: IEEE Computer Society. 2012. p. 727-730 https://doi.org/10.1109/DSD.2012.54