arXiv

Weakly Supervised Video Individual Counting

Authors: Liu, Xinyan; Li, Guorong; Qi, Yuankai; Yan, Ziheng; Han, Zhenjun; van den Hengel, Anton; Yang, Ming-Hsuan; Huang, Qingming

Affiliations: University of Chinese Academy of Sciences, Beijing, China; Australian Institute for Machine Learning, The University of Adelaide, Australia; Key Lab of Intell. Info. Process., Inst. of Comput. Tech., CAS, Beijing, China; Peng Cheng Laboratory, Shenzhen, China; University of California, Merced, United States

Publication: arXiv

Year/Volume/Issue: 2023

Subject: Supervised learning

Abstract: Video Individual Counting (VIC) aims to predict the number of unique individuals in a single video. Existing methods learn representations based on trajectory labels for individuals, which are expensive to annotate. To provide a more realistic reflection of the underlying practical challenge, we introduce a weakly supervised VIC task, wherein trajectory labels are not provided. Instead, two types of labels are provided to indicate traffic entering the field of view (inflow) and leaving the field of view (outflow). We also propose the first solution as a baseline that formulates the task as a weakly supervised contrastive learning problem under group-level matching. In doing so, we devise an end-to-end trainable soft contrastive loss to drive the network to distinguish inflow, outflow, and the remaining individuals. To facilitate future study in this direction, we generate annotations from the existing VIC datasets SenseCrowd and CroHD and also build a new dataset, UAVVIC. Extensive results show that our baseline weakly supervised method outperforms supervised methods, and thus little information is lost in the transition to the more practically relevant weakly supervised task. The code and trained model will be made public at CGNet. © 2023, CC BY.
