PEDM: A Parallelized Event Data Management Framework for Efficient High-Energy Physics Data Processing

Sunday 09 March 2025


The quest for faster data processing has been a long-standing challenge in the field of physics, particularly in high-energy particle colliders. The sheer volume of data generated by these experiments requires sophisticated software frameworks to efficiently process and analyze the information. In recent years, researchers have made significant strides in developing parallelized event data management systems to tackle this issue.


One such system is PEDM (Parallelized Event Data Management), a framework designed to support large-scale data processing for non-collider physics experiments. Developed by a team of scientists from Shandong University, PEDM integrates the MT-SNIPER framework with PODIO, a toolkit for generating event data models. By leveraging these technologies, PEDM enables parallelized data processing, allowing multiple threads to work simultaneously on different events.


The core component of PEDM is the GlobalStore, which manages the caching and retrieval of event data objects. This system is designed to ensure thread safety and prevent data races, a common issue in parallel computing. The GlobalInputAlg and GlobalOutputAlg services are responsible for reading and writing event data to files, respectively. These services are triggered by worker threads, which process events independently.


PEDM’s architecture is flexible and scalable, making it suitable for various applications. In its implementation, the team used Intel Threading Building Blocks (TBB) to create parallel tasks and manage thread synchronization. This approach allows PEDM to effectively utilize multi-core CPU resources and reduce processing times.


To evaluate PEDM’s performance, the researchers applied it to the offline software of the Super Tau Charm Facility (STCF), a next-generation electron-positron collider. The results showed that PEDM significantly accelerated data processing, with a speedup ratio increasing linearly as the number of worker threads increased.


The development of PEDM has significant implications for future high-energy physics experiments. As the volume and complexity of data continue to grow, efficient data management systems like PEDM will become increasingly important. By leveraging parallel computing techniques and sophisticated software frameworks, researchers can unlock new insights into the fundamental nature of matter and the universe.


In addition to its application in particle colliders, PEDM’s design and architecture can be adapted for other fields that require large-scale data processing, such as artificial intelligence, machine learning, and genomics. As the demand for fast and efficient data processing continues to rise, PEDM serves as a valuable example of how innovative software solutions can address complex scientific challenges.


Cite this article: “PEDM: A Parallelized Event Data Management Framework for Efficient High-Energy Physics Data Processing”, The Science Archive, 2025.


High-Energy Physics, Data Processing, Parallel Computing, Event Data Management, Pedm, Particle Colliders, Software Frameworks, Thread Safety, Data Races, Multi-Core Cpu Resources.


Reference: Qianqian Shi, Teng Li, Xingtao Huang, “Parallelized Event Data Management System Based on MT-SNiPER Framework and PODIO” (2025).


Leave a Reply