State-of-the-Art SoccerNet Game State Reconstruction: A Step Towards Automated Football Analysis

Tuesday 22 April 2025


The pursuit of accurate game state reconstruction in sports video analysis has been a longstanding challenge for researchers and developers alike. Recently, a team of scientists from Constructor Tech, Sofia, made significant strides in this area by presenting an innovative pipeline that leverages a fine-tuned YOLOv5m object detector, a custom SegFormer-based camera parameter estimator, and a DeepSORT-enhanced tracking framework.


At its core, the proposed method relies on a multi-stage approach to tackle the complexities of reconstructing game states from broadcast videos. The first stage involves using a custom SegFormer model to generate an initial estimate of the camera parameters, which includes position, orientation, and field of view information. This initial estimation is then refined through a process involving keypoint detection and alignment.


The second stage focuses on detecting keypoints on the football pitch, such as intersections of pitch lines with grass lines. These keypoints serve as anchors for the camera parameter refinement process, ensuring accurate mapping between 2D image coordinates and 3D world coordinates.


In the final stage, the tracking framework employs a DeepSORT-based approach to associate individual players across fragmented trajectories. This association is achieved by recognizing jersey numbers, orientations, and team affiliations, ultimately enabling precise player positioning on the pitch.


The researchers’ synthetic dataset, comprising over 1,000 images with corresponding camera parameters, plays a crucial role in training their model. By leveraging this diverse set of configurations, the algorithm learns to generalize across various camera placements and field conditions.


One notable aspect of this work is its emphasis on robustness and adaptability. The proposed pipeline demonstrates remarkable resilience in the face of challenging scenarios, such as occlusions, varying lighting conditions, and dynamic scene content. This capability stems from the combination of sophisticated object detection, segmentation, and tracking techniques.


The team’s achievement has far-reaching implications for sports analytics, enabling more accurate player performance evaluation, tactical decision-making, and training strategy optimization. As the authors note, their method can be extended to other sports and applications where game state reconstruction is crucial.


While this work represents a significant milestone in the field of sports video analysis, it also underscores the importance of continued innovation and refinement. As researchers continue to push the boundaries of what is possible, we can expect even more sophisticated solutions to emerge, further empowering coaches, analysts, and fans alike.


Cite this article: “State-of-the-Art SoccerNet Game State Reconstruction: A Step Towards Automated Football Analysis”, The Science Archive, 2025.


Sports Video Analysis, Game State Reconstruction, Object Detection, Segmentation, Tracking, Camera Parameters, Football Pitch, Keypoints, Deepsort, Yolov5M


Reference: Vladimir Golovkin, Nikolay Nemtsev, Vasyl Shandyba, Oleg Udin, Nikita Kasatkin, Pavel Kononov, Anton Afanasiev, Sergey Ulasen, Andrei Boiarov, “From Broadcast to Minimap: Achieving State-of-the-Art SoccerNet Game State Reconstruction” (2025).


Leave a Reply