Sep 23 – 27, 2024
ESRF Auditorium
Europe/Paris timezone

The High-throughput Data I/O framework for HEPS

Sep 25, 2024, 4:55 PM
15m
Hybrid event (ESRF Auditorium)

Hybrid event

ESRF Auditorium

EPN Campus ESRF - ILL 71 Av. des Martyrs, 38000 Grenoble
Talk Data Reduction Data Reduction

Speaker

Shiyuan Fu (IHEP)

Description

The High Energy Photon Source (HEPS) is estimated to produce a substantial volume of raw data, presenting significant computational challenges in scientific research. To address this problem, we have developed a high-throughput data I/O framework specifically tailored for HEPS, aimed at mitigating the I/O bottlenecks.Firstly, within this framework, we have devised a unified I/O interface for computational tasks, which serves to shield the difference in underlying data sources and formats. Subsequently, an asynchronous prefetch method has been integrated into the framework to expedite data read and write speeds. This includes the dynamic adjustment of prefetch data volume based on computational tasks and memory space, thereby optimizing the utilization of computational node memory space. Lastly, in order to overcome the issue of slow data access resulting from the process of writing data to disk and subsequent reading, the framework has been extended to encompass a streaming data module. This module dynamically parses data streams from the DAQ and stores them in a distributed cache pool, thereby further accelerating the data retrieval process through the utilization of data streams.

Abstract publication I agree that the abstract will be published on the web site

Primary authors

Shiyuan Fu (IHEP) Yu Hu (IHEP, CAS) Rui Liu (IHEP) Hao-Kai Sun (Computing Center, Institute of High Energy Physics, Chinese Academy of Sciences) Jian Liu (中国科学院高能物理研究所) Lei Wang (IHEP) Shuang Wang (IHEP)

Presentation materials