Dominated point cloud-based 3D object detectors in autonomous driving scenarios rely heavily on the huge amount of accurately labeled samples, however, 3D annotation in the point cloud is extremely tedious, expensive ...
详细信息
Unmanned Aerial Vehicle (UAV) offers lots of applications in both commerce and recreation. Therefore, perception of the status of UAVs is crucially important. In this paper, we consider the task of tracking UAVs, prov...
详细信息
Recently, Transformers have shown promising performance in various vision tasks. To reduce the quadratic computation complexity caused by the global self-attention, various methods constrain the range of attention wit...
详细信息
Gluten, surimi and mixted protein were analyzed for changes in protein structure after treatment with konjac glucomannan (KGM), ultrasound (U) and konjac glucomannan-ultrasound (UKGM). In this study, molecular weight,...
详细信息
Transformers have shown impressive performance in various natural language processing and computer vision tasks, due to the capability of modeling long-range dependencies. Recent progress has demonstrated that combini...
详细信息
Computer assisted pronunciation training system (CAPT) can detect the wrong pronunciation produced by nonnative speakers and provide positive feedback. CAPT is helpful to improve the pronunciation level for L2 learner...
详细信息
Semi-supervised learning (SSL) has been proven beneficial for mitigating the issue of limited labeled data especially on the task of volumetric medical image segmentation. Unlike previous SSL methods which focus on ex...
详细信息
Accurate detection of obstacles in 3D is an essential task for autonomous driving and intelligent transportation. In this work, we propose a general multimodal fusion framework FusionPainting to fuse the 2D RGB image ...
详细信息
3D object detection from a single image is an important task in Autonomous Driving (AD), where various approaches have been proposed. However, the task is intrinsically ambiguous and challenging as single image depth ...
详细信息
A picture is worth a thousand words, thus, it is crucial for conversational agents to understand, perceive, and effectively respond with pictures. However, we find that directly employing conventional image generation...
详细信息
暂无评论