无人机小目标检测-VisDrone

模型描述

无人机俯拍图像由于其空中拍摄的独特视角，能够对道路、交通等场景进行完整的呈现,对无人机的图像数据进行检测分析具有重要的应用价值。在此，我们提供基于DAMO-YOLO的无人机小目标检测场景模型。

模型评测

Model	size	mAP^val 0.5:0.95	Latency(ms) T4-TRT-FP16	FLOPs (G)	Parameters(M)
DAMO-YOLO-S	640	24.6	3.83	37.8	16.3
DAMO-YOLO-S	1280	31.4	–	151.2	16.3
DAMO-YOLO-S	1920	34.9	–	340.2	16.3

表中汇报的mAP是COCO2017 val集上的结果。
表中汇报的latency不包括后处理（nms）时间，其测试条件为：T4 GPU，TensorRT=7.2.1.6， CUDA=10.2, CUDNN=8.0.0.1。

使用范围

本模型适用于无人机俯拍场景的物体检测，能对图片中的行人、车辆进行定位。

使用方法

在ModelScope框架上，提供输入图片，即可以通过简单的Pipeline调用使用当前模型。具体代码示例如下：

from modelscope.pipelines import pipeline
from modelscope.utils.constant import Tasks
# the default model is based on 640 resolution
object_detect = pipeline(Tasks.image_object_detection,model='XianzheXu/cv_tinynas_object-detection_damoyolo_visdrone')
img_path ='./assets/visdrone_1.jpg'
result = object_detect(img_path)

训练数据介绍

模型训练和评测使用VisDrone-2019数据集，该数据集包含6471张训练图片，548张验证图片，共10个类别，包括人、行人、箱式货车、轿车、巴士、摩托车、自行车、有蓬三轮车、三轮车、卡车。

模型可视化效果

引用

 @article{damoyolo,
  title={DAMO-YOLO: A Report on Real-Time Object Detection Design},
  author={Xianzhe Xu, Yiqi Jiang, Weihua Chen, Yilun Huang, Yuan Zhang and Xiuyu Sun},
  journal={arXiv preprint arXiv:2211.15444v2},
  year={2022}
}