sebastientaylor commited on
Commit
95e71e7
Β·
verified Β·
1 Parent(s): 8d2554f

Update model card for yolo26-det

Browse files
Files changed (1) hide show
  1. README.md +96 -157
README.md CHANGED
@@ -9,14 +9,11 @@ tags:
9
  - onnx
10
  - int8
11
  - yolo
12
- - gstreamer
13
  - edgefirst
14
  - nxp
15
  - hailo
16
  - jetson
17
- - real-time
18
  - embedded
19
- - multiplatform
20
  model-index:
21
  - name: yolo26-det
22
  results:
@@ -28,244 +25,186 @@ model-index:
28
  metrics:
29
  - name: "mAP@0.5 (Nano ONNX FP32)"
30
  type: map_50
31
- value: 54.9
32
  - name: "mAP@0.5-0.95 (Nano ONNX FP32)"
33
  type: map
34
- value: 39.7
35
- - name: "mAP@0.5 (Nano TFLite INT8)"
36
  type: map_50
37
- value: 51.5
38
- - name: "mAP@0.5-0.95 (Nano TFLite INT8)"
39
  type: map
40
- value: 34.9
 
 
 
 
 
 
41
  ---
42
 
43
- # YOLO26 Detection β€” EdgeFirst Edge AI
44
 
45
- **NXP i.MX 8M Plus** | **NXP i.MX 93** | **NXP i.MX 95** | **NXP Ara240** | **RPi5 + Hailo-8/8L** | **NVIDIA Jetson**
46
- YOLO26 Detection models optimized for edge AI deployment across multiple hardware platforms. All sizes from Nano to XLarge, in ONNX FP32 and TFLite INT8 formats, with platform-specific compiled models for NPU acceleration.
 
47
 
48
- Trained on [COCO 2017](https://test.edgefirst.studio/public/projects/2839/home) (80 classes). Part of the [EdgeFirst Model Zoo](https://huggingface.co/spaces/EdgeFirst/Models).
49
  > [!TIP]
50
- > **Training session**: [View on EdgeFirst Studio](https://test.edgefirst.studio/public/projects/2839/experiment/training/list?exp_id=4657) β€” dataset, training config, metrics, and exported artifacts.
51
 
52
  > [!NOTE]
53
- > end2end=False required for INT8. Fastest architecture.
54
 
55
  ---
56
 
57
- ## Size Comparison
58
 
59
- All models validated on COCO val2017 (5000 images, 80 classes).
60
 
61
- | Size | Params | GFLOPs | ONNX FP32 mAP@0.5 | ONNX FP32 mAP@0.5-0.95 | TFLite INT8 mAP@0.5 | TFLite INT8 mAP@0.5-0.95 |
62
- |------|--------|--------|--------------------|-------------------------|----------------------|--------------------------|
63
- | Nano | 2.7M | 7.6 | 54.9% | 39.7% | 51.5% | 34.9% |
64
- | Small | 10.3M | 27.0 | β€” | β€” | β€” | β€” |
65
- | Medium | 24.5M | 74.4 | β€” | β€” | β€” | β€” |
66
  | Large | 42.5M | 155.0 | β€” | β€” | β€” | β€” |
67
  | XLarge | 67.5M | 244.0 | β€” | β€” | β€” | β€” |
68
 
69
  ---
70
 
71
- ## On-Target Performance
72
-
73
- Full pipeline timing: pre-processing + inference + post-processing.
74
-
75
- | Size | Platform | Pre-proc (ms) | Inference (ms) | Post-proc (ms) | Total (ms) | FPS |
76
- |------|----------|---------------|----------------|-----------------|------------|-----|
77
- | β€” | β€” | β€” | β€” | β€” | β€” | β€” |
78
-
79
- *Measured with [EdgeFirst Perception](https://github.com/EdgeFirstAI) stack. Timing includes full GStreamer pipeline overhead.*
80
-
81
- ---
82
-
83
- ## Downloads
84
-
85
- <details open>
86
- <summary><strong>ONNX FP32</strong> β€” Any platform with ONNX Runtime.</summary>
87
-
88
- | Size | File | Status |
89
- |------|------|--------|
90
- | Nano | `yolo26n-det-coco.onnx` | [Download](https://huggingface.co/EdgeFirst/yolo26-det/resolve/main/onnx/yolo26n-det-coco.onnx) |
91
- | Small | `yolo26s-det-coco.onnx` | [Download](https://huggingface.co/EdgeFirst/yolo26-det/resolve/main/onnx/yolo26s-det-coco.onnx) |
92
- | Medium | `yolo26m-det-coco.onnx` | [Download](https://huggingface.co/EdgeFirst/yolo26-det/resolve/main/onnx/yolo26m-det-coco.onnx) |
93
- | Large | `yolo26l-det-coco.onnx` | [Download](https://huggingface.co/EdgeFirst/yolo26-det/resolve/main/onnx/yolo26l-det-coco.onnx) |
94
- | XLarge | `yolo26x-det-coco.onnx` | [Download](https://huggingface.co/EdgeFirst/yolo26-det/resolve/main/onnx/yolo26x-det-coco.onnx) |
95
-
96
- </details>
97
-
98
- <details>
99
- <summary><strong>TFLite INT8</strong> β€” CPU or NPU via runtime delegate (i.MX 8M Plus VX Delegate).</summary>
100
-
101
- | Size | File | Status |
102
- |------|------|--------|
103
- | Nano | `yolo26n-det-coco.tflite` | [Download](https://huggingface.co/EdgeFirst/yolo26-det/resolve/main/tflite/yolo26n-det-coco.tflite) |
104
- | Small | `yolo26s-det-coco.tflite` | [Download](https://huggingface.co/EdgeFirst/yolo26-det/resolve/main/tflite/yolo26s-det-coco.tflite) |
105
- | Medium | `yolo26m-det-coco.tflite` | [Download](https://huggingface.co/EdgeFirst/yolo26-det/resolve/main/tflite/yolo26m-det-coco.tflite) |
106
- | Large | `yolo26l-det-coco.tflite` | [Download](https://huggingface.co/EdgeFirst/yolo26-det/resolve/main/tflite/yolo26l-det-coco.tflite) |
107
- | XLarge | `yolo26x-det-coco.tflite` | [Download](https://huggingface.co/EdgeFirst/yolo26-det/resolve/main/tflite/yolo26x-det-coco.tflite) |
108
 
109
- </details>
110
 
111
- <details>
112
- <summary><strong>NXP i.MX 95 (eIQ Neutron)</strong> β€” eIQ Neutron NPU optimized.</summary>
113
-
114
- | Size | File | Status |
115
- |------|------|--------|
116
- | Nano | `yolo26n-det-coco.imx95.tflite` | [Download](https://huggingface.co/EdgeFirst/yolo26-det/resolve/main/imx95/yolo26n-det-coco.imx95.tflite) |
117
- | Small | `yolo26s-det-coco.imx95.tflite` | Coming Soon |
118
- | Medium | `yolo26m-det-coco.imx95.tflite` | Coming Soon |
119
- | Large | `yolo26l-det-coco.imx95.tflite` | Coming Soon |
120
- | XLarge | `yolo26x-det-coco.imx95.tflite` | Coming Soon |
121
-
122
- </details>
123
 
 
 
 
 
 
 
 
124
 
125
 
126
  ---
127
 
128
- ## Deploy with EdgeFirst Perception
129
 
130
- Copy-paste [GStreamer](https://github.com/EdgeFirstAI/gstreamer) pipeline examples for each platform.
131
 
132
- ### NXP i.MX 8M Plus β€” Camera to Detection with Vivante NPU
 
 
133
 
134
- ```bash
135
- gst-launch-1.0 \
136
- v4l2src device=/dev/video0 ! video/x-raw,width=640,height=480 ! \
137
- edgefirstcameraadaptor ! \
138
- tensor_filter framework=tensorflow-lite \
139
- model=yolo26n-det-coco.tflite \
140
- custom=Delegate:External,ExtDelegateLib:libvx_delegate.so ! \
141
- edgefirstdetdecoder ! edgefirstoverlay ! waylandsink
142
- ```
143
 
144
- ### RPi5 + Hailo-8L
145
 
146
- ```bash
147
- gst-launch-1.0 \
148
- v4l2src device=/dev/video0 ! video/x-raw,width=640,height=480 ! \
149
- hailonet hef-path=yolo26n-det-coco.hailo8l.hef ! \
150
- hailofilter function-name=yolo26_nms ! \
151
- hailooverlay ! videoconvert ! autovideosink
152
- ```
153
-
154
- ### NVIDIA Jetson (TensorRT)
155
 
156
- ```bash
157
- gst-launch-1.0 \
158
- v4l2src device=/dev/video0 ! video/x-raw,width=640,height=480 ! \
159
- edgefirstcameraadaptor ! \
160
- nvinfer config-file-path=yolo26n-det-coco-config.txt ! \
161
- edgefirstdetdecoder ! edgefirstoverlay ! nveglglessink
162
- ```
163
 
 
164
 
165
- *Full pipeline documentation: [EdgeFirst GStreamer Plugins](https://github.com/EdgeFirstAI/gstreamer)*
166
 
167
  ---
168
 
169
- ## Foundation (HAL) Python Integration
170
 
171
  ```python
172
  from edgefirst.hal import Model, TensorImage
173
 
174
- # Load model β€” metadata (labels, decoder config) is embedded in the file
175
- model = Model("yolo26n-det-coco.tflite")
176
 
177
  # Run inference on an image
178
  image = TensorImage.from_file("image.jpg")
179
  results = model.predict(image)
180
 
181
- # Access detections
182
  for det in results.detections:
183
  print(f"{det.label}: {det.confidence:.2f} at {det.bbox}")
184
  ```
185
 
186
- *[EdgeFirst HAL](https://github.com/EdgeFirstAI/hal) β€” Hardware abstraction layer with accelerated inference delegates.*
187
 
188
  ---
189
 
190
- ## CameraAdaptor
191
-
192
- EdgeFirst [CameraAdaptor](https://github.com/EdgeFirstAI/cameraadaptor) enables training and inference directly on native sensor formats (GREY, YUYV, etc.) β€” skipping the ISP color conversion pipeline entirely. This reduces latency and power consumption on edge devices.
193
 
194
- CameraAdaptor variants are included alongside baseline RGB models:
195
 
196
- | Variant | Input Format | Use Case |
197
- |---------|-------------|----------|
198
- | `yolo26n-det-coco.onnx` | RGB (3ch) | Standard camera input |
199
- | `yolo26n-det-coco-grey.onnx` | GREY (1ch) | Monochrome / IR sensors |
200
- | `yolo26n-det-coco-yuyv.onnx` | YUYV (2ch) | Raw sensor bypass |
201
 
202
- *Train CameraAdaptor models with [EdgeFirst Studio](https://edgefirst.studio) β€” the CameraAdaptor layer is automatically inserted during training.*
203
 
204
  ---
205
 
206
- ## Train Your Own with EdgeFirst Studio
207
 
208
- Train on your own dataset with [**EdgeFirst Studio**](https://edgefirst.studio):
209
 
210
- - **Free tier** includes YOLO training with automatic INT8 quantization and edge deployment
211
- - Upload datasets via [EdgeFirst Recorder](https://github.com/EdgeFirstAI/recorder) or COCO/YOLO format
212
- - AI-assisted annotation with auto-labeling
213
- - CameraAdaptor integration for native sensor format training
214
- - Deploy trained models to edge devices via [EdgeFirst Client](https://github.com/EdgeFirstAI/client)
 
 
 
215
 
216
  ---
217
 
218
- ## See Also
219
 
220
- Other models in the [EdgeFirst Model Zoo](https://huggingface.co/spaces/EdgeFirst/Models):
221
 
222
- | Model | Task | Best Nano Metric | Link |
223
- |-------|------|-------------------|------|
224
- | YOLOv5 Detection | Detection | 49.6% mAP@0.5 (ONNX) | [EdgeFirst/yolov5-det](https://huggingface.co/EdgeFirst/yolov5-det) |
225
- | YOLOv8 Detection | Detection | 50.2% mAP@0.5 (ONNX) | [EdgeFirst/yolov8-det](https://huggingface.co/EdgeFirst/yolov8-det) |
226
- | YOLOv8 Segmentation | Segmentation | 34.1% Mask mAP@0.5-0.95 (ONNX) | [EdgeFirst/yolov8-seg](https://huggingface.co/EdgeFirst/yolov8-seg) |
227
- | YOLO11 Detection | Detection | 53.4% mAP@0.5 (ONNX) | [EdgeFirst/yolo11-det](https://huggingface.co/EdgeFirst/yolo11-det) |
228
- | YOLO11 Segmentation | Segmentation | 35.5% Mask mAP@0.5-0.95 (ONNX) | [EdgeFirst/yolo11-seg](https://huggingface.co/EdgeFirst/yolo11-seg) |
229
- | YOLO26 Segmentation | Segmentation | 37.0% Mask mAP@0.5-0.95 (ONNX) | [EdgeFirst/yolo26-seg](https://huggingface.co/EdgeFirst/yolo26-seg) |
230
 
231
  ---
232
 
233
- ## Technical Details
 
 
234
 
235
- ### Quantization Pipeline
236
 
237
- All TFLite INT8 models are produced by EdgeFirst's custom quantization pipeline ([details](https://github.com/EdgeFirstAI/studio-ultralytics)):
 
 
 
 
238
 
239
- 1. **ONNX Export** β€” Standard Ultralytics export with `simplify=True`
240
- 2. **TF-Wrapped ONNX** β€” Box coordinates normalized to [0,1] inside DFL decode via `tf_wrapper` (~1.2% better mAP than post-hoc normalization)
241
- 3. **Split Decoder** β€” Boxes, scores, and mask coefficients split into separate output tensors for independent INT8 quantization scales
242
- 4. **Smart Calibration** β€” 500 images selected via greedy coverage maximization from COCO val2017
243
- 5. **Full INT8** β€” `uint8` input (raw pixels), `int8` output (per-tensor scales), MLIR quantizer
244
 
245
- ### Split Decoder Output Format
246
 
247
- **Detection** (e.g., yolo26n):
248
- - Boxes: `(1, 4, 8400)` β€” normalized [0,1] coordinates
249
- - Scores: `(1, 80, 8400)` β€” class probabilities
250
 
251
- Each tensor has independent quantization scale and zero-point. EdgeFirst HAL handles dequantization and reassembly automatically.
252
 
253
- ### Metadata
254
 
255
- - **TFLite**: `edgefirst.json`, `labels.txt`, and `edgefirst.yaml` embedded via ZIP (no `tflite-support` dependency)
256
- - **ONNX**: `edgefirst.json` embedded via `model.metadata_props`
257
 
258
- No standalone metadata files β€” models are self-contained.
259
 
260
  ---
261
 
262
  ## Limitations
263
 
264
- - **COCO bias** β€” Models trained on COCO (80 classes) inherit its biases: Western-centric scenes, specific object distributions, limited weather/lighting diversity
265
- - **INT8 accuracy loss** β€” Full-integer quantization typically degrades mAP by 6-12% relative to FP32; actual loss depends on model architecture and dataset
266
- - **Thermal variation** β€” On-target performance varies with device temperature; sustained inference may throttle on passively-cooled devices
267
- - **Input resolution** β€” All models expect 640Γ—640 input; other resolutions require letterboxing or may reduce accuracy
268
- - **CameraAdaptor variants** β€” GREY/YUYV models trade color information for latency; accuracy may differ from RGB baseline depending on the task
269
 
270
  ---
271
 
@@ -273,7 +212,7 @@ No standalone metadata files β€” models are self-contained.
273
 
274
  ```bibtex
275
  @software{edgefirst_yolo26_det,
276
- title = { {YOLO26 Detection β€” EdgeFirst Edge AI} },
277
  author = {Au-Zone Technologies},
278
  url = {https://huggingface.co/EdgeFirst/yolo26-det},
279
  year = {2026},
 
9
  - onnx
10
  - int8
11
  - yolo
 
12
  - edgefirst
13
  - nxp
14
  - hailo
15
  - jetson
 
16
  - embedded
 
17
  model-index:
18
  - name: yolo26-det
19
  results:
 
25
  metrics:
26
  - name: "mAP@0.5 (Nano ONNX FP32)"
27
  type: map_50
28
+ value: 55.06
29
  - name: "mAP@0.5-0.95 (Nano ONNX FP32)"
30
  type: map
31
+ value: 39.71
32
+ - name: "mAP@0.5 (Small ONNX FP32)"
33
  type: map_50
34
+ value: 63.6
35
+ - name: "mAP@0.5-0.95 (Small ONNX FP32)"
36
  type: map
37
+ value: 47.16
38
+ - name: "mAP@0.5 (Medium ONNX FP32)"
39
+ type: map_50
40
+ value: 68.89
41
+ - name: "mAP@0.5-0.95 (Medium ONNX FP32)"
42
+ type: map
43
+ value: 51.88
44
  ---
45
 
46
+ # YOLO26 Detection β€” EdgeFirst Model Zoo
47
 
48
+ YOLO26 Detection models trained on [COCO 2017](https://test.edgefirst.studio/public/projects/2839/home) (80 classes) and validated on real edge hardware through the EdgeFirst Profiler + Validator pipeline. Each row in the tables below cites the EdgeFirst Studio validation session (`v-XXXX`) that produced the measurement.
49
+
50
+ Part of the [EdgeFirst Model Zoo](https://huggingface.co/spaces/EdgeFirst/Models).
51
 
 
52
  > [!TIP]
53
+ > **Training experiment**: [View on EdgeFirst Studio](https://test.edgefirst.studio/public/projects/2839/experiment/training/list?exp_id=4657) β€” dataset, training configuration, metrics, and exported artifacts.
54
 
55
  > [!NOTE]
56
+ > End-to-end attention head. `end2end=False` required for INT8 export.
57
 
58
  ---
59
 
60
+ ## Reference accuracy β€” ONNX FP32
61
 
62
+ Accuracy ceiling for each size, measured against COCO `val2017` (5,000 images) with `pycocotools`. Quantized and compiled artifacts (TFLite INT8, HEF, etc.) are graded against this reference per the EdgeFirst publication rule.
63
 
64
+ | Size | Params | GFLOPs | mAP@0.5 | mAP@0.5-0.95 | mAP@0.75 | Source |
65
+ |------|--------|--------|---------|--------------|----------|--------|
66
+ | Nano | 2.7M | 7.6 | 55.06% | 39.71% | 42.87% | [v-1d3b](https://test.edgefirst.studio/public/validation/v-1d3b/details?mode=metrics) |
67
+ | Small | 10.3M | 27.0 | 63.60% | 47.16% | 51.14% | [v-1d3c](https://test.edgefirst.studio/public/validation/v-1d3c/details?mode=metrics) |
68
+ | Medium | 24.5M | 74.4 | 68.89% | 51.88% | 56.41% | [v-1d3e](https://test.edgefirst.studio/public/validation/v-1d3e/details?mode=metrics) |
69
  | Large | 42.5M | 155.0 | β€” | β€” | β€” | β€” |
70
  | XLarge | 67.5M | 244.0 | β€” | β€” | β€” | β€” |
71
 
72
  ---
73
 
74
+ ## On-target validation results
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
75
 
76
+ Each row is one EdgeFirst Studio validation session. Click the **Source** link to inspect the full session β€” model artifact, dataset version, parameters, per-stage Perfetto trace, and the host hardware description (hostname, kernel version, SoC, NPU, profiler version).
77
 
78
+ Cells rendered as `β€”` are sessions that did not meet the EdgeFirst publication threshold; the underlying session is still linked in the Source column for inspection.
 
 
 
 
 
 
 
 
 
 
 
79
 
80
+ | Size | Platform | mAP@0.5 | Ξ” vs FP32 (pp) | mAP@0.5-0.95 | Inference (ms) | FPS (median) | Source |
81
+ |------|----------|---------|----------------|--------------|----------------|--------------|--------|
82
+ | Nano | NXP i.MX 8M Plus + VeriSilicon NPU | 50.07% | -4.99 | 32.27% | 105.65 | 8.5 | [v-1d54](https://test.edgefirst.studio/public/validation/v-1d54/details?mode=metrics) |
83
+ | Nano | NXP i.MX 95 + eIQ Neutron NPU | β€” | β€” | β€” | β€” | β€” | [v-1d57](https://test.edgefirst.studio/public/validation/v-1d57/details?mode=metrics) |
84
+ | Nano | Raspberry Pi 5 + Hailo-8L NPU | 51.60% | -3.46 | 35.41% | 23.73 | 41.3 | [v-1d47](https://test.edgefirst.studio/public/validation/v-1d47/details?mode=metrics) |
85
+ | Small | Raspberry Pi 5 + Hailo-8L NPU | 60.03% | -3.57 | 42.81% | 48.70 | 20.3 | [v-1d48](https://test.edgefirst.studio/public/validation/v-1d48/details?mode=metrics) |
86
+ | Medium | Raspberry Pi 5 + Hailo-8L NPU | 65.64% | -3.25 | 47.16% | 90.55 | 11.0 | [v-1d4d](https://test.edgefirst.studio/public/validation/v-1d4d/details?mode=metrics) |
87
 
88
 
89
  ---
90
 
91
+ ## Validation pipeline
92
 
93
+ These results are produced by the EdgeFirst on-target validation pipeline:
94
 
95
+ 1. **EdgeFirst Profiler** runs on the target hardware, executes the full inference pipeline (preprocess β†’ inference β†’ postprocess), and emits per-image predictions in EdgeFirst Arrow/Parquet plus a Perfetto trace.
96
+ 2. **EdgeFirst Validator** consumes the predictions and trace, computes `pycocotools` accuracy metrics and per-stage timing summaries, and publishes the results to the Studio validation session.
97
+ 3. **EdgeFirst HAL** ([open source](https://github.com/EdgeFirstAI/hal)) provides the hardware-accelerated preprocessing and post-decoding primitives used at both validation and deployment time, so the timings measured here reflect the same accelerated paths a production runtime would take.
98
 
99
+ Inference latency is reported as the on-accelerator inference time. FPS is the measured end-to-end pipelined throughput from the Perfetto trace, which generally exceeds `1000 / (preprocess + inference + postprocess)` because the runtime overlaps stages across frames.
 
 
 
 
 
 
 
 
100
 
101
+ See [EdgeFirst Studio](https://edgefirst.studio) for the full validation pipeline.
102
 
103
+ ---
 
 
 
 
 
 
 
 
104
 
105
+ ## Downloads
 
 
 
 
 
 
106
 
107
+ Artifacts are organized by deployment target. Each model file embeds the EdgeFirst `edgefirst.json` metadata (training session, dataset version, calibration artifact, converter chain) so a single file is sufficient for deployment β€” no sidecar configuration required.
108
 
109
+ *Per-artifact download links are populated from the Studio artifact registry. To see the live download table, regenerate this card with `--studio` against an authenticated Studio session.*
110
 
111
  ---
112
 
113
+ ## Inference example (Python)
114
 
115
  ```python
116
  from edgefirst.hal import Model, TensorImage
117
 
118
+ # Load the model β€” embedded edgefirst.json carries labels and decoder config
119
+ model = Model("yolo26n-det-int8.tflite")
120
 
121
  # Run inference on an image
122
  image = TensorImage.from_file("image.jpg")
123
  results = model.predict(image)
124
 
125
+ # Iterate detections
126
  for det in results.detections:
127
  print(f"{det.label}: {det.confidence:.2f} at {det.bbox}")
128
  ```
129
 
130
+ [EdgeFirst HAL](https://github.com/EdgeFirstAI/hal) β€” Hardware abstraction layer with accelerated inference delegates.
131
 
132
  ---
133
 
134
+ ## Traceability
 
 
135
 
136
+ Every measurement in the tables above is reachable through the EdgeFirst Studio validation framework. The `v-XXXX` Source link on each row resolves to a public Studio URL of the form:
137
 
138
+ ```
139
+ https://test.edgefirst.studio/public/validation/v-XXXX/details?mode=metrics
140
+ ```
 
 
141
 
142
+ From there, the full provenance chain is one click deeper: training session ID, dataset version, calibration artifact, converter chain (e.g. TFLite quantizer + Neutron compile), validation parameters, and the host hardware description (hostname, kernel version, SoC, NPU, profiler version). The same model file you download from this repository embeds the same chain in its `edgefirst.json` metadata.
143
 
144
  ---
145
 
146
+ ## See also
147
 
148
+ Other model families in the [EdgeFirst Model Zoo](https://huggingface.co/spaces/EdgeFirst/Models):
149
 
150
+ | Model | Task | Link |
151
+ |-------|------|------|
152
+ | YOLOv5 Detection | Detection | [EdgeFirst/yolov5-det](https://huggingface.co/EdgeFirst/yolov5-det) |
153
+ | YOLOv8 Detection | Detection | [EdgeFirst/yolov8-det](https://huggingface.co/EdgeFirst/yolov8-det) |
154
+ | YOLOv8 Segmentation | Segmentation | [EdgeFirst/yolov8-seg](https://huggingface.co/EdgeFirst/yolov8-seg) |
155
+ | YOLO11 Detection | Detection | [EdgeFirst/yolo11-det](https://huggingface.co/EdgeFirst/yolo11-det) |
156
+ | YOLO11 Segmentation | Segmentation | [EdgeFirst/yolo11-seg](https://huggingface.co/EdgeFirst/yolo11-seg) |
157
+ | YOLO26 Segmentation | Segmentation | [EdgeFirst/yolo26-seg](https://huggingface.co/EdgeFirst/yolo26-seg) |
158
 
159
  ---
160
 
161
+ ## Train your own with EdgeFirst Studio
162
 
163
+ Train on your own dataset with [**EdgeFirst Studio**](https://edgefirst.studio):
164
 
165
+ - Free tier includes YOLO training with automatic INT8 quantization and edge deployment.
166
+ - Upload datasets via [EdgeFirst Recorder](https://github.com/EdgeFirstAI/recorder) or COCO/YOLO format.
167
+ - AI-assisted annotation with auto-labeling.
168
+ - CameraAdaptor integration for native sensor format training.
169
+ - Deploy trained models to edge devices via [EdgeFirst Client](https://github.com/EdgeFirstAI/client).
 
 
 
170
 
171
  ---
172
 
173
+ ## Technical notes
174
+
175
+ ### Quantization pipeline
176
 
177
+ All TFLite INT8 models are produced by EdgeFirst's quantization pipeline ([details](https://github.com/EdgeFirstAI/studio-ultralytics)):
178
 
179
+ 1. **ONNX export** β€” standard Ultralytics export with `simplify=True`
180
+ 2. **TF-wrapped ONNX** β€” box coordinates normalized to `[0, 1]` inside DFL decode
181
+ 3. **Split decoder** β€” boxes, scores, and mask coefficients split into separate output tensors so each receives an independent INT8 quantization scale
182
+ 4. **Smart calibration** β€” calibration samples selected via greedy coverage maximization; the artifact is content-addressed by parameter hash and cached in Studio for deterministic reuse
183
+ 5. **Full integer INT8** β€” `uint8` input, `int8` output, MLIR quantizer
184
 
185
+ ### Split decoder output format
 
 
 
 
186
 
187
+ **Detection** (e.g. yolo26n):
188
 
189
+ - `boxes` β€” `(1, 4, 8400)` normalized `[0, 1]` coordinates
190
+ - `scores` β€” `(1, 80, 8400)` per-class probabilities
 
191
 
192
+ Each tensor has its own quantization scale and zero point. The EdgeFirst HAL handles dequantization and reassembly automatically; no application code change is required across NPU targets.
193
 
194
+ ### Embedded metadata
195
 
196
+ - **TFLite**: `edgefirst.json` and `labels.txt` embedded in the ZIP-format model file
197
+ - **ONNX**: `edgefirst.json` embedded in `model.metadata_props`
198
 
199
+ No sidecar files required; the model artifact is self-contained.
200
 
201
  ---
202
 
203
  ## Limitations
204
 
205
+ - **COCO bias** β€” models trained on COCO (80 classes) inherit the dataset's biases (Western-centric scenes, particular object distributions, limited weather/lighting diversity).
206
+ - **INT8 quantization loss** β€” full-integer quantization introduces accuracy loss relative to FP32; the magnitude per platform is shown in the *Ξ” vs FP32* column above.
207
+ - **Input resolution** β€” all models expect 640Γ—640 input; other resolutions require letterboxing.
 
 
208
 
209
  ---
210
 
 
212
 
213
  ```bibtex
214
  @software{edgefirst_yolo26_det,
215
+ title = { {YOLO26 Detection β€” EdgeFirst Model Zoo} },
216
  author = {Au-Zone Technologies},
217
  url = {https://huggingface.co/EdgeFirst/yolo26-det},
218
  year = {2026},