“An In-Depth Comparison of Plain CNN, Fine-Tune VGG16, and Vision Transformer Models in Object Detection”. 2026. Bayero Journal of Engineering and Technology 21 (1): 46-54. https://bjet.ng/index.php/jet/article/view/142.