检索结果-内蒙古大学图书馆

arXiv 2020年

作者： Cheng, Mingmei Hui, Le Xie, Jin Yang, Jian Kong, Hui PCA Lab Key Lab of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education Jiangsu Key Lab of Image and Video Understanding for Social Security School of Computer Science and Engineering Nanjing University of Science and Technology Nanjing210094 China

In this paper, we propose a cascaded non-local neural network for point cloud segmentation. The proposed network aims to build the long-range dependencies of point clouds for the accurate segmentation. Specifically, we develop a novel cascaded non-local module, which consists of the neighborhood-level, superpoint-level and global-level non-local blocks. First, in the neighborhood-level block, we extract the local features of the centroid points of point clouds by assigning different weights to the neighboring points. The extracted local features of the centroid points are then used to encode the superpoint-level block with the non-local operation. Finally, the global-level block aggregates the non-local features of the superpoints for semantic segmentation in an encoder-decoder framework. Benefiting from the cascaded structure, geometric structure information of different neighborhoods with the same label can be propagated. In addition, the cascaded structure can largely reduce the computational cost of the original non-local operation on point clouds. Experiments on different indoor and outdoor datasets show that our method achieves state-of-the-art performance and effectively reduces the time consumption and memory occupation. Copyright © 2020, The Authors. All rights reserved.

关键词： Semantic Segmentation

来源：评论

学校读者我要写书评

暂无评论

Probabilistic Margins for Instance Reweighting in Adversarial Training

arXiv

引用

arXiv 2021年

作者： Wang, Qizhou Liu, Feng Han, Bo Liu, Tongliang Gong, Chen Niu, Gang Zhou, Mingyuan Sugiyama, Masashi Department of Computer Science Hong Kong Baptist University Hong Kong DeSI Lab Australian Artificial Intelligence Institute University of Technology Sydney Australia TML Lab School of Computer Science Faculty of Engineering The University of Sydney Australia PCA Lab Key Lab of Intelligent Perception and Systems for High-Dimensional Information of MoE Jiangsu Key Lab of Image and Video Understanding for Social Security School of Computer Science and Engineering Nanjing University of Science and Technology China McCombs School of Business The University of Texas at Austin United States Graduate School of Frontier Sciences The University of Tokyo Japan

Reweighting adversarial data during training has been recently shown to improve adversarial robustness, where data closer to the current decision boundaries are regarded as more critical and given larger weights. However, existing methods measuring the closeness are not very reliable: they are discrete and can take only a few values, and they are path-dependent, i.e., they may change given the same start and end points with different attack paths. In this paper, we propose three types of probabilistic margin (PM), which are continuous and path-independent, for measuring the aforementioned closeness and reweighting adversarial data. Specifically, a PM is defined as the difference between two estimated class-posterior probabilities, e.g., such a probability of the true label minus the probability of the most confusing label given some natural data. Though different PMs capture different geometric properties, all three PMs share a negative correlation with the vulnerability of data: data with larger/smaller PMs are safer/riskier and should have smaller/larger weights. Experiments demonstrated that PMs are reliable and PM-based reweighting methods outperformed state-of-the-art counterparts. Copyright © 2021, The Authors. All rights reserved.

关键词： Probability

来源：评论

学校读者我要写书评

暂无评论

Assignment problem based deep embedding 1

引用

2nd Chinese Conference on Pattern Recognition and computer vision, PRCV 2019

作者： Zheng, Ruishen Xie, Jin Qian, Jianjun Yang, Jian PCA Lab Key Lab of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education Jiangsu Key Lab of Image and Video Understanding for Social Security School of Computer Science and Engineering Nanjing University of Science and Technology Nanjing210094 China

ISBN: (数字)9783030317232

ISBN: (纸本)9783030317225

How to measure the similarity of samples is a fundamental problem in many computer vision tasks such as retrieval and clustering. Due to the rapid development of deep neural networks, deep metric learning has been widely studied. Some studies focus on the hard sample mining strategy for triplet loss. We observe that hard mining strategies are also vital for contrastive loss. But the hardest mining strategy for contrastive loss is sensitive to outliers. In this paper, based on combinatorial information of sample pairs, we propose a novel linear assignment problem based hard sample mining strategy for contrastive loss to learn feature embeddings. Specifically, our method can assign 0/1 weight to sample pairs for the hard sample selection by maximizing a linear assignment loss and ensure that each sample is only included by one pair for the optimization. Our method can obtain the state-of-the-art performance on the CUB-200-2011, Cars196, and In-shop datasets with the GoogLeNet network. © Springer Nature Switzerland AG 2019.

关键词： Combinatorial optimization

来源：评论

学校读者我要写书评

暂无评论

Multi-scale dynamic feature encoding network for image demoiréing 17

Multi-scale dynamic feature encoding network for image demoi...

引用

17th IEEE/CVF International Conference on computer vision Workshop, ICCVW 2019

作者： Cheng, Xi Fu, Zhenyong Yang, Jian PCA Lab Key Lab of Intelligent Percept. and Syst. for High-Dimensional Information of Ministry of Education Jiangsu Key Lab of Image and Video Understanding for Social Security School of Computer Science and Engineering Nanjing University of Science and Technology Nanjing China

ISBN: (纸本)9781728150239

The prevalence of digital sensors, such as digital cameras and mobile phones, simplifies the acquisition of photos. Digital sensors, however, suffer from producing Moire when photographing objects having complex textures, which deteriorates the quality of photos. Moire spreads across various frequency bands of images and is a dynamic texture with varying colors and shapes, which pose two main challenges in demoireing - an important task in image restoration. In this paper, towards addressing the first challenge, we design a multi-scale network to process images at different spatial resolutions, obtaining features in different frequency bands, and thus our method can jointly remove moire in different frequency bands. Towards solving the second challenge, we propose a dynamic feature encoding module (DFE), embedded in each scale, for dynamic texture. Moire pattern can be eliminated more effectively via *** proposed method, termed Multi-scale convolutional network with Dynamic feature encoding for image DeMoireing (MDDM), can outperform the state of the arts in fidelity as well as perceptual on benchmarks. © 2019 IEEE.

关键词： image reconstruction

来源：评论

学校读者我要写书评

暂无评论

Fine-grained image analysis with deep learning: A survey

arXiv

引用

arXiv 2021年

作者： Wei, Xiu-Shen Song, Yi-Zhe Aodha, Oisin Mac Wu, Jianxin Peng, Yuxin Tang, Jinhui Yang, Jian Belongie, Serge PCA Lab Key Lab of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education Jiangsu Key Lab of Image Video Understanding for Social Security School of Computer Science and Engineering Nanjing University of Science and Technology China University of Surrey United Kingdom The University of Edinburgh United Kingdom The State Key Laboratory for Novel Software Technology Nanjing University China Peking University China Nanjing University of Science and Technology China The University of Copenhagen The Pioneer Centre for AI Denmark

Fine-grained image analysis (FGIA) is a longstanding and fundamental problem in computer vision and pattern recognition, and underpins a diverse set of real-world applications. The task of FGIA targets analyzing visual objects from subordinate categories, e.g., species of birds or models of cars. The small inter-class and large intra-class variation inherent to fine-grained image analysis makes it a challenging problem. Capitalizing on advances in deep learning, in recent years we have witnessed remarkable progress in deep learning powered FGIA. In this paper we present a systematic survey of these advances, where we attempt to re-define and broaden the field of FGIA by consolidating two fundamental fine-grained research areas – fine-grained image recognition and fine-grained image retrieval. In addition, we also review other key issues of FGIA, such as publicly available benchmark datasets and related domain-specific applications. We conclude by highlighting several research directions and open problems which need further exploration from the community. © 2021, CC BY.

关键词： image recognition

来源：评论

学校读者我要写书评

暂无评论

Probabilistic margins for instance reweighting in adversarial training 21

Probabilistic margins for instance reweighting in adversaria...

引用

Proceedings of the 35th International Conference on Neural Information Processing Systems

作者： Qizhou Wang Feng Liu Bo Han Tongliang Liu Chen Gong Gang Niu Mingyuan Zhou Masashi Sugiyama Department of Computer Science Hong Kong Baptist University DeSI Lab Australian Artificial Intelligence Institute University of Technology Sydney TML Lab School of Computer Science Faculty of Engineering The University of Sydney PCA Lab Key Lab of Intelligent Perception and Systems for High-Dimensional Information of MoE and Jiangsu Key Lab of Image and Video Understanding for Social Security School of Computer Science and Engineering Nanjing University of Science and Technology RIKEN Center for Advanced Intelligence Project (AIP) McCombs School of Business The University of Texas at Austin RIKEN Center for Advanced Intelligence Project (AIP) and Graduate School of Frontier Sciences The University of Tokyo

ISBN: (纸本)9781713845393

关键词：

来源：评论

学校读者我要写书评

暂无评论

Adversarial Learning of Structure-Aware Fully Convolutional Networks for Landmark Localization

引用

IEEE transactions on pattern analysis and machine intelligence 2020年第7期42卷 1654-1669页

作者： Yu Chen Chunhua Shen Hao Chen Xiu-Shen Wei Lingqiao Liu Jian Yang Key Lab of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education Jiangsu Key Lab of Image and Video Understanding for Social Security Nanjing University of Science and Technology Nanjing Jiangsu China School of Computer Science The University of Adelaide Adelaide SA Australia Megvii Research Nanjing Megvii Technology Nanjing Jiangsu China

Landmark/pose estimation in single monocular images has received much effort in computer vision due to its important applications. It remains a challenging task when input images come with severe occlusions caused by, e.g., adverse camera views. Under such circumstances, biologically implausible pose predictions may be produced. In contrast, human vision is able to predict poses by exploiting geometric constraints of landmark point inter-connectivity. To address the problem, by incorporating priors about the structure of pose components, we propose a novel structure-aware fully convolutional network to implicitly take such priors into account during training of the deep network. Explicit learning of such constraints is typically challenging. Instead, inspired by how human identifies implausible poses, we design discriminators to distinguish the real poses from the fake ones (such as biologically implausible ones). If the pose generator G generates results that the discriminator fails to distinguish from real ones, the network successfully learns the priors. Training of the network follows the strategy of conditional Generative Adversarial Networks (GANs). The effectiveness of the proposed network is evaluated on three pose-related tasks: 2D human pose estimation, 2D facial landmark estimation and 3D human pose estimation. The proposed approach significantly outperforms several state-of-the-art methods and almost always generates plausible pose predictions, demonstrating the usefulness of implicit learning of structures using GANs.

关键词： computer vision Face Recognition Learning Artificial Intelligence Neural Nets Pose Estimation Landmark Localization computer vision Geometric Constraints Landmark Point Inter Connectivity Conditional Generative Adversarial Networks 2 D Facial Landmark Estimation Pose Predictions Structure Aware Fully Convolutional Network 3 D Human Pose Estimation 2 D Human Pose Estimation Pose Estimation Two Dimensional Displays Three Dimensional Displays Heating Systems Task Analysis Training Pose Estimation Landmark Localization Structure Aware Network Adversarial Training Multi Task Learning Deep Convolutional Networks

来源：评论

学校读者我要写书评

暂无评论

Norm-Aware Embedding for Efficient Person Search

Norm-Aware Embedding for Efficient Person Search

引用

Conference on computer vision and Pattern Recognition (CVPR)

作者： Di Chen Shanshan Zhang Jian Yang Bernt Schiele PCA Lab Key Lab of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education School of Computer Science and Engineering Nanjing University of Science and Technology Max Planck Institute for Informatics Saarland Informatics Campus Jiangsu Key Lab of Image and Video Understanding for Social Security

ISBN: (数字)9781728171685

ISBN: (纸本)9781728171692

Person Search is a practically relevant task that aims to jointly solve Person Detection and Person Re-identification (re-ID). Specifically, it requires to find and locate all instances with the same identity as the query person in a set of panoramic gallery images. One major challenge comes from the contradictory goals of the two sub-tasks, i.e., person detection focuses on finding the commonness of all persons while person re-ID handles the differences among multiple identities. Therefore, it is crucial to reconcile the relationship between the two sub-tasks in a joint person search model. To this end, We present a novel approach called Norm-Aware Embedding to disentangle the person embedding into norm and angle for detection and re-ID respectively, allowing for both effective and efficient multi-task training. We further extend the proposal-level person embedding to pixel-level, whose discrimination ability is less affected by mis-alignment. We outperform other one-step methods by a large margin and achieve comparable performance to two-step methods on both CUHK-SYSU and PRW. Also, Our method is easy to train and resource-friendly, running at 12 fps on a single GPU.

关键词： Feature extraction Task analysis Training Detectors Standards Proposals Search problems

来源：评论

学校读者我要写书评

暂无评论

Tips and Tricks for Webly-Supervised Fine-Grained Recognition: Learning from the WebFG 2020 Challenge

arXiv

引用

arXiv 2020年

作者： Wei, Xiu-Shen Xu, Yu-Yan Yao, Yazhou Wei, Jia Xi, Si Xu, Wenyuan Zhang, Weidong Lv, Xiaoxin Fu, Dengpan Li, Qing Chen, Baoying Guo, Haojie Xue, Taolue Jing, Haipeng Wang, Zhiheng Zhang, Tianming Zhang, Mingwen PCA Lab Key Lab of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education School of Computer Science and Engineering Nanjing University of Science and Technology China Jiangsu Key Lab of Image Video Understanding for Social Security China Netease Games AI Lab Netease Yidun AI Lab

WebFG 2020 is an international challenge hosted by Nanjing University of Science and Technology, University of Edinburgh, Nanjing University, The University of Adelaide, Waseda University, etc. This challenge mainly pays attention to the webly-supervised fine-grained recognition problem. In the literature, existing deep learning methods highly rely on large-scale and high-quality labeled training data, which poses a limitation to their practicability and scalability in real world applications. In particular, for fine-grained recognition, a visual task that requires professional knowledge for labeling, the cost of acquiring labeled training data is quite high. It causes extreme difficulties to obtain a large amount of high-quality training data. Therefore, utilizing free web data to train fine-grained recognition models has attracted increasing attentions from researchers in the fine-grained community. This challenge expects participants to develop webly-supervised fine-grained recognition methods, which leverages web images in training fine-grained recognition models to ease the extreme dependence of deep learning methods on large-scale manually labeled datasets and to enhance their practicability and scalability. In this technical report, we have pulled together the top WebFG 2020 solutions of total 54 competing teams, and discuss what methods worked best across the set of winning teams, and what surprisingly did not help. © 2020, CC BY.

关键词： Scalability

来源：评论

学校读者我要写书评

暂无评论

image Restoration for Terahertz image Based on Complex-Valued Deconvolution 8

Image Restoration for Terahertz Image Based on Complex-Value...

引用

8th Asia-Pacific Conference on Antennas and Propagation, APCAP 2019

作者： Ning, Wei Qi, Feng Wang, Jinkuan Northeastern University School of Computer Science and Engineering Shenyang110169 China Chinese Academy of Sciences Shenyang Institute of Automation Shenyang110016 China Institutes for Robotics and Intelligent Manufacturing Chinese Academy of Sciences Shenyang110016 China Key Laboratory of Opto-Electronic Information Processing Chinese Academy of Sciences Shenyang110016 China Key Lab of Image Understanding and Computer Vision Liaoning Province Shenyang110016 China

ISBN: (纸本)9781665400541

According to the unique characteristics of terahertz (THz) waves, THz imaging has become a hot topic in widely application areas. However, the imaging resolution is constrained by its long wavelength. Generally, the deconvolution is always used to reconstruct the object function. In this paper, a method that combining the classic deconvolution algorithm with the complex-value data processing is investigated. It is posed that the proposed complex-number deconvolution achieves better performance compared to the real-valued deconvolution. © 2019 IEEE.

关键词： Deconvolution

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：