Review: YOLOv8 (Object Detection)

Outperforms YOLOv5, YOLOv6 and YOLOv7

Sik-Ho Tsang
3 min readApr 11, 2024
YOLOv8 by ultralytics (Image from ultralytics)

YOLOv8, 2023, by ultralytics (Sik-Ho Tsang @ Medium)

Object Detection
20142021 [Scaled-YOLOv4] [PVT, PVTv1] [Deformable DETR] [HRNetV2, HRNetV2p] [MDETR] [TPH-YOLOv5] 2022 [Pix2Seq] [MViTv2] [SF-YOLOv5] [GLIP] [TPH-YOLOv5++] [YOLOv6] 2023 [YOLOv7]
==== My Other Paper Readings Are Also Over Here ====

  • While there is no paper or document descirbing YOLOv8 model architecture and training strategies so far, I started to find some other papers utilizing YOLOv8 for transfer learning, see if they have mentioned a little more about what kinds of new stuffs or technologies have been involved in YOLOv8 compared with the previous YOLOs.
  • After all of this, I found a paper named: Real-Time Flying Object Detection with YOLOv8. It mentioned about some differences between YOLOv8 and YOLOv5.
  • There is also a GitHub by mmyolo having a model diagram of YOLOv8 architecture.

Outline

  1. YOLOv8
  2. Results

1. YOLOv8

YOLOv8-P5 (Image from mmyolo GitHub)
  • (Since it is difficult to go into details without any papers/docs, if there is something wrong, please feel free to tell me.)
  • Authors in Real-Time Flying Object Detection with YOLOv8 mention that: YOLOv8 was trained on a blend of the COCO dataset and several other datasets, while YOLOv5 was trained primarily on the COCO dataset.
  • For the neck, similar to YOLOv5, YOLOv8 also uses the methods of FPN and PAN.
  • For the head, simlar to YOLOv6 and YOLOX, decoupled head is used.
  • Similar to YOLOv6, YOLOv8 is also a anchor-free object detector that directly predicts the center of an object instead of the offset from a known anchor box which reduces the number of box predictions, and that speeds up the post processing process.
  • YOLOv8 uses Soft-NMS which is a variant of the NMS technique used in YOLOv5. Soft-NMS applies a soft threshold to the overlapping bounding boxes instead of discarding them outright.
  • The loss function in YOLOv8 is:
  • This loss function includes the CIoU (complete IoU) loss proposed by Zheng et al. [22] as the box loss, the standard binary cross entropy for multi-label classification as the classification loss (allowing each cell to predict more than 1 class), and the distribution focal loss proposed by Li et al. [10] as the 3rd term.

2. Results

Performance of YOLOv8 Variants
mAP Against Model Parameters or Time
  • By plotting results from different YOLOv8 sizes, it is observed that YOLOv8 outperforms YOLOv5, YOLOv6 and YOLOv7.

--

--

Sik-Ho Tsang

PhD, Researcher. I share what I learn. :) Linktree: https://linktr.ee/shtsang for Twitter, LinkedIn, etc.