AI News HubLIVE
In-site rewrite3 min read

Enhanced YOLO framework boosts 360-degree small-object detection to 90% accuracy

Researchers from Shibaura Institute of Technology developed an enhanced YOLO framework using transfer learning and a custom dataset, significantly improving the detection of small moving objects in omnidirectional videos. The model achieves 90% accuracy and maintains performance up to 50 meters, outperforming standard YOLO versions.

May 22, 2026

Enhanced YOLO framework boosts 360-degree small-object detection to 90% accuracy

by Shibaura Institute of Technology

edited by Sadie Harley, reviewed by Robert Egan

Sadie Harley

Scientific Editor

Robert Egan

Associate Editor

Editors' notes

This article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the content's credibility:

fact-checked

trusted source

proofread

The GIST

Add as preferred source

Researchers from Shibaura Institute of Technology, Japan, developed a newly trained You Only Look Once (YOLO) model using transfer learning that can detect small moving objects with improved accuracy compared to standard models. The proposed method maintains high accuracy even at 50 meters, whereas standard YOLO models showed a steep drop in accuracy beyond 40 meters. The graph shows the accuracy comparison with respect to distance. Credit: Professor Chinthaka Premachandra, Shibaura Institute of Technology, Japan

Omnidirectional cameras are widely popular as they capture a full 360-degree view. They are often utilized for surveillance, traffic analysis, and autonomous systems. But the same wide-angle vision also leads to a technical problem. Objects far from the camera often appear distorted and tiny, making it difficult for computer vision systems to accurately recognize them.

The challenge is especially serious for moving objects such as pedestrians, bicycles, motorcycles, and cars in outdoor scenes like road intersections.

You Only Look Once (YOLO) is a popular, high-speed, and accurate real-time object detection algorithm. Although YOLO is known for speed and strong general performance, it struggles with small object detection/classification in omnidirectional videos, as it divides an image into grid cells.

When several tiny objects fall within the same grid, some of their visual information can be lost. In omnidirectional footage, this weakness becomes even more pronounced because distant objects already suffer from low resolution.

To address this issue, a team of researchers led by Professor Chinthaka Premachandra from Shibaura Institute of Technology, Japan, designed an enhanced framework that combined a custom-built training dataset with transfer learning.

"In many countries, including Japan, road intersections are extremely accident-prone areas due to the complex interactions of vehicles, pedestrians, and cyclists moving from multiple directions. Some of these road users may suddenly appear from blind spots at intersections, further increasing the likelihood of accidents. Our research was initiated to solve this particular issue," said Dr. Premachandra, while talking about the motivation behind this study.

The paper was published in the journal IEEE Open Journal of Intelligent Transportation Systems.

For developing the model, a dataset of about 4,000 annotated images was created, covering four categories—humans, cars, bicycles, and motorcycles. Importantly, the annotations were not generic. Omnidirectional cameras exhibit a rapid decrease in resolution as the distance between the camera and the object increases, and the objects are often misidentified.

To mitigate this issue, the team defined characteristic features for each moving object class to help the model learn what to look for under difficult conditions. For example, a human needed at least one arm or leg to be visible, a car needed two or more tires to be visible, and bicycles and motorcycles needed both front and rear wheels to be visible.

The research team also strengthened the dataset by cropping images and including objects viewed from multiple angles so that small and less frequent targets would be better represented. The dataset was used for training via transfer learning, a method that adapts the knowledge of an existing model to a new domain. Finally, the trained model was compared against conventional models for accuracy.

In direct comparison, the proposed model reached an overall accuracy of 90%, while YOLOv5 achieved 46% and YOLOv8 achieved 53% for objects larger than 8 × 8 pixels. For small moving objects specifically, ranging from 8 × 8 to 32 × 32 pixels, the proposed model achieved an accuracy of 0.81, which is significantly higher compared to 0.39 for YOLOv5 and 0.42 for YOLOv8.

The study also found that while standard YOLO models showed a steep drop in accuracy beyond 40 meters, the new model maintained useful performance up to 50 meters.

This research addresses a critical limitation in current perception systems by improving the detection of small and distant objects across a full 360° field of view.

"This approach can be effectively applied to intelligent transportation systems, autonomous driving, and robotic navigation, where reliable omnidirectional perception is essential. Specifically, it is well-suited for intersection monitoring and safety assistance, where vehicles, pedestrians, and cyclists may approach from multiple directions simultaneously," explains Dr. Premachandra, while talking about the application of this research.

Let us hope the research progresses quickly to develop models with enhanced detection accuracy for objects smaller than 8 × 8 pixels, which can further reduce the risks of accidents and improve road safety.

More information

Yui Nakaoka et al, E-YOLO to OMOD: An Enhanced YOLO Framework for Small Moving Object Detection in Omnidirectional Videos, IEEE Open Journal of Intelligent Transportation Systems (2026). DOI: 10.1109/ojits.2026.3670217

Key concepts

Computational 3D visionMachine learning methodologies

Provided by Shibaura Institute of Technology

Citation: Enhanced YOLO framework boosts 360-degree small-object detection to 90% accuracy (2026, May 22) retrieved 22 May 2026 from https://techxplore.com/news/2026-05-yolo-framework-boosts-degree-small.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Detecting distant vehicles for safer roads: A new camera-based technique sees farther

3 shares

Facebook

Twitter

Email

Feedback to editors