YOLO to ONNX Conversion

From LogicalDOC Community Wiki
Revision as of 07:39, 26 June 2026 by Giuseppe (talk | contribs)
Jump to navigationJump to search

YOLO to ONNX Conversion

Converting a YOLO Model to ONNX


This guide describes an example workflow for training a custom YOLO model and preparing it for use with LogicalDOC.

Please be aware that this procedure is not coverded by the standard support contract. LogicalDOC cannot provide assistance with issues related to dataset preparation, training failures, model quality, GPU configuration, or third-party tools such as Label Studio, Ultralytics YOLO, or ONNX Runtime.

If you require professional assistance, please contact sales@logicaldoc.com to request a quotation for consulting services.



Licensing notice : Ultralytics YOLO is distributed under its own licensing terms. Commercial use requires a commercial license from Ultralytics. Before using YOLO in a commercial environment, review the licensing information available on the official Ultralytics website: https://www.ultralytics.com/


LogicalDOC performs object detection using the ONNX Runtime Java API. Therefore, a trained YOLO model must be converted from the native PyTorch (`.pt`) format to the Open Neural Network Exchange (ONNX) format before it can be imported into LogicalDOC.

ONNX (Open Neural Network Exchange) is an open standard for representing machine learning models. It enables models trained in one framework, such as PyTorch, to be executed efficiently in different programming languages and runtime environments.

The conversion process does not retrain or modify the model. Instead, it exports the trained network, including its learned weights and computational graph, into a portable format that can be executed by ONNX Runtime.

Why ONNX?

Ultralytics trains YOLO models using PyTorch. However, LogicalDOC is implemented in Java and performs inference using the ONNX Runtime library.

Using ONNX provides several advantages:

  • No dependency on the Python runtime during inference.
  • Native Java support through the ONNX Runtime API.
  • Faster startup and lower memory consumption than embedding a Python interpreter.
  • Cross-platform model portability.
  • Support for hardware acceleration when available.

Export the Model

The following script converts the trained best.pt model into the ONNX format:

from ultralytics import YOLO

model = YOLO("runs/detect/target/weights/best.pt")

model.export(
format="onnx",
simplify=True,
nms=True
)

Where:

  • format="onnx" exports the model in the ONNX format.
  • simplify=True simplifies the exported computation graph to improve compatibility and potentially reduce inference overhead.
  • nms=True embeds the Non-Maximum Suppression (NMS) step into the exported model, allowing the ONNX Runtime to return the final detections directly.

After the conversion completes, the generated model (best.onnx ) can be imported into LogicalDOC for inference.