TY - GEN
T1 - Object Detection in Pineapple Fields Drone Imagery Using Few Shot Learning and the Segment Anything Model
AU - Fallas-Moya, Fabian
AU - Calderon-Ramirez, Saul
AU - Sadovnik, Amir
AU - Qi, Hairong
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Deep Learning Object Detection relies on extensive, manual annotation of datasets, a time-consuming and costly process prone to human inconsistencies. Auto-labeling using Visual Foundation Models offers a promising alternative but often falls short in object detection tasks. This research introduces a novel framework that uses the Segment Anything Model (SAM) with minimal annotated images to create an effective object detector. Despite the capabilities of Visual Foundation Models in downstream tasks, our research reveals their poor performance in object detection when operating within a different domain. Additionally, we demonstrate that with only a few labeled images, we can create a much better and simpler object detection system. We also prove that our model outperforms the best existing object detectors when it comes to analyzing drone images taken in pineapple fields.
AB - Deep Learning Object Detection relies on extensive, manual annotation of datasets, a time-consuming and costly process prone to human inconsistencies. Auto-labeling using Visual Foundation Models offers a promising alternative but often falls short in object detection tasks. This research introduces a novel framework that uses the Segment Anything Model (SAM) with minimal annotated images to create an effective object detector. Despite the capabilities of Visual Foundation Models in downstream tasks, our research reveals their poor performance in object detection when operating within a different domain. Additionally, we demonstrate that with only a few labeled images, we can create a much better and simpler object detection system. We also prove that our model outperforms the best existing object detectors when it comes to analyzing drone images taken in pineapple fields.
KW - few-shot
KW - object detection
KW - segment anything
UR - http://www.scopus.com/inward/record.url?scp=85190143017&partnerID=8YFLogxK
U2 - 10.1109/ICMLA58977.2023.00247
DO - 10.1109/ICMLA58977.2023.00247
M3 - Contribución a la conferencia
AN - SCOPUS:85190143017
T3 - Proceedings - 22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023
SP - 1635
EP - 1642
BT - Proceedings - 22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023
A2 - Arif Wani, M.
A2 - Boicu, Mihai
A2 - Sayed-Mouchaweh, Moamar
A2 - Abreu, Pedro Henriques
A2 - Gama, Joao
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023
Y2 - 15 December 2023 through 17 December 2023
ER -