TY - GEN
T1 - Optimizing Big Data Network Transfers in FPGA SoC Clusters
T2 - 6th Latin American High Performance Computing Conference, CARLA 2019
AU - León-Vega, Luis G.
AU - Alfaro-Badilla, Kaleb
AU - Chacón-Rodríguez, Alfonso
AU - Salazar-García, Carlos
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020
Y1 - 2020
N2 - Spiking Neural Network (SSN) simulators based on clusters of FPGA-based System-on-Chip (SoC) involve the transmission of large amounts of data (from hundreds of MB to tens of GB per second) from and to a data host, usually a PC or a server. TECBrain is an SNN simulator which currently uses Ethernet for transmitting results from its simulations, which can potentially take hours if the effective connection speed is around 100 Mbps. This paper proposes data transfer techniques that optimize data transmissions by grouping data into packages making the most of the payload size and the use of thread-level parallelism, trying to minimize the impact of multiple clients transmitting at the same time. The proposed method achieves its highest throughput when inserting simulation results directly into a No-SQL database. Using the proposed optimization techniques over an Ethernet connection, the minimum overhead reached is 2.93% (out of the theoretical 2.47%) for five nodes sending data simultaneously from C++, with speeds up to 95 Mbps on a network at 100 Mbps. Besides, the maximum database insertion speed reached is 32.5 MB/s, using large packages and parallelism, which is 26% of the bandwidth of the connection link at 1 Gbps.
AB - Spiking Neural Network (SSN) simulators based on clusters of FPGA-based System-on-Chip (SoC) involve the transmission of large amounts of data (from hundreds of MB to tens of GB per second) from and to a data host, usually a PC or a server. TECBrain is an SNN simulator which currently uses Ethernet for transmitting results from its simulations, which can potentially take hours if the effective connection speed is around 100 Mbps. This paper proposes data transfer techniques that optimize data transmissions by grouping data into packages making the most of the payload size and the use of thread-level parallelism, trying to minimize the impact of multiple clients transmitting at the same time. The proposed method achieves its highest throughput when inserting simulation results directly into a No-SQL database. Using the proposed optimization techniques over an Ethernet connection, the minimum overhead reached is 2.93% (out of the theoretical 2.47%) for five nodes sending data simultaneously from C++, with speeds up to 95 Mbps on a network at 100 Mbps. Besides, the maximum database insertion speed reached is 32.5 MB/s, using large packages and parallelism, which is 26% of the bandwidth of the connection link at 1 Gbps.
KW - Embedded software
KW - High perfomance computing
KW - High-speed networks
KW - No-SQL
UR - http://www.scopus.com/inward/record.url?scp=85081171173&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-41005-6_4
DO - 10.1007/978-3-030-41005-6_4
M3 - Contribución a la conferencia
AN - SCOPUS:85081171173
SN - 9783030410049
T3 - Communications in Computer and Information Science
SP - 49
EP - 62
BT - High Performance Computing - 6th Latin American Conference, CARLA 2019, Revised Selected Papers
A2 - Crespo-Mariño, Juan Luis
A2 - Meneses-Rojas, Esteban
PB - Springer
Y2 - 25 September 2019 through 27 September 2019
ER -