ViDT人脸检测模型

基于ViDT系列算法训练的人脸检测模型，支持对人脸的检测。输入一张图片，返回人脸所在的位置。
本项目实现的模型为ViDT+，采用的Backbone为Swin-Tiny。

模型描述

ViDT+的模型结构如下所示：

期望模型使用方式以及适用范围

如何使用

基于 ModelScope 框架，通过调用预定义的 Pipeline 可实现快速调用。

代码范例

from modelscope.pipelines import pipeline

vidt_pipeline = pipeline('image-object-detection',
                         'damo/ViDT-face-detection')
result = vidt_pipeline('http://maao-pub.oss-cn-zhangjiakou.aliyuncs.com/projects/vidt-vidt-plus/resources/modelscope/images/vidt_test1.jpg')
print(f'Output: {result}.')

训练数据介绍

OpenImage Face，https://storage.googleapis.com/openimages/web/index.html。
Wider Face，http://shuoyang1213.me/WIDERFACE/。

相关论文以及引用信息

如果该模型对您有所帮助，请引用下面的相关的论文：

@inproceedings{song2022vidt,
  title={ViDT: An Efficient and Effective Fully Transformer-based Object Detector},
  author={Song, Hwanjun and Sun, Deqing and Chun, Sanghyuk and Jampani, Varun and Han, Dongyoon and Heo, Byeongho and Kim, Wonjae and Yang, Ming-Hsuan},
  booktitle={International Conference on Learning Representation},
  year={2022}
}
@article{song2022vidtplus,
  title={An Extendable, Efficient and Effective Transformer-based Object Detector},
  author={Song, Hwanjun and Sun, Deqing and Chun, Sanghyuk and Jampani, Varun and Han, Dongyoon and Heo, Byeongho and Kim, Wonjae and Yang, Ming-Hsuan},
  journal={arXiv preprint arXiv:2204.07962},
  year={2022}
}