YOLOv8n Training Report — Phase 3 Ablation

Synthetic LEGO object detection · Scene diversity vs. per-scene augmentation at fixed 100-image budget.

Source: outputs/phase3/results.json · Split review

Ablation Results — mAP@50

Subset	Fold 0	Fold 1	Fold 2	Fold 3	Fold 4	Mean
2-view (84 img)	0.279	—	—	—	—	0.279
5-view (100 img)	0.343	—	—	—	—	0.343
10-view (100 img)	0.312	—	—	—	—	0.312

Best mean highlighted in green. Fold rows use per-val-fold mAP50 from CV runs.

Bar chart not available.

Final runs when available; falls back to best CV fold. Sorted by mean AP.

Run	GPU	Compute (s)	Wall (s)	Epochs	mAP50	mAP50:95	Cost (USD)	Status
fold0_10view	T4	194	198	100	0.312	0.271	$0.0318	✓ ok
fold0_2view	T4	168	176	100	0.279	0.244	$0.0275	✓ ok
fold0_5view	T4	203	211	100	0.343	0.307	$0.0333	✓ ok
final_10view	T4	195	200	100	0.256	0.228	$0.0320	✓ ok
final_2view	T4	178	182	100	0.314	0.280	$0.0291	✓ ok
final_5view	T4	202	206	100	0.331	0.300	$0.0330	✓ ok
Total							$0.1867

T4 rate: $0.59/hr = $0.000164/s. Cost = modal_compute_s × rate.

Total scenes

Train pool

Val hold-out

Val images

Classes (nc)

CV folds

Subset	Views/scene	Scenes	Images
2view	2	42	84
5view	5	20	100
10view	10	10	100

Val hold-out: scenes 042–049 (fixed, never in training).