检索结果-内蒙古大学图书馆

arXiv 2018年

作者： He, Anfeng Luo, Chong Tian, Xinmei Zeng, Wenjun CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System University of Science and Technology of China Hefei Anhui China Microsoft Research Beijing China

Recently, Siamese network based trackers have received tremendous interest for their fast tracking speed and high performance. Despite the great success, this tracking framework still suffers from several limitations. First, it cannot properly handle large object rotation. Second, tracking gets easily distracted when the background contains salient objects. In this paper, we propose two simple yet effective mechanisms, namely angle estimation and spatial masking, to address these issues. The objective is to extract more representative features so that a better match can be obtained between the same object from different frames. The resulting tracker, named Siam-BM, not only significantly improves the tracking performance, but more importantly maintains the realtime capability. Evaluations on the VOT2017 dataset show that Siam-BM achieves an EAO of 0.335, which makes it the best-performing realtime tracker to date. Copyright © 2018, The Authors. All rights reserved.

关键词： Deep neural networks

来源：评论

学校读者我要写书评

暂无评论

COOPERATIVE HYBRID DIGITAL-ANALOG VIDEO TRANSMISSION IN D2D NETWORKS

COOPERATIVE HYBRID DIGITAL-ANALOG VIDEO TRANSMISSION IN D2D ...

引用

IEEE International Conference on Image processing

作者： Jian Shen Fei Liang Chong Luo Houqiang Li Wenjun Zeng CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System University of Science and Technology of China Hefei China Microsoft Research Asia Beijing China

In this paper, we propose a cooperative video transmission scheme in D2D networks. This research is motivated by the growing interests in hybrid digital-analog video transmissions and device-to-device (D2D) communications. The framework of D2D communications can be generally modeled as a three-node network. In this network, coset coding is used to allow the destination to exploit the correlations between the video signals received in two phases. We have done some work of further optimization to improve the video quality at destination in this network. First, we derive a closed form of the reconstruction error at the destination. This provides a theoretical foundation for finding the optimal quantization step size in coset coding. Then, based on the accurate analysis on the coset coding we design a new power allocation algorithm. Experimental results verify that our scheme outperforms the recently proposed WCVC and DCVC.

关键词： Encoding Resource management Relays Distortion Quantization (signal) Device-to-device communication Decoding

来源：评论

学校读者我要写书评

暂无评论

A twofold siamese network for real-time object tracking

arXiv

引用

arXiv 2018年

Observing that Semantic features learned in an image classification task and Appearance features learned in a similarity matching task complement each other, we build a twofold Siamese network, named SA-Siam, for real-time object tracking. SA-Siam is composed of a semantic branch and an appearance branch. Each branch is a similarity-learning Siamese network. An important design choice in SA-Siam is to separately train the two branches to keep the heterogeneity of the two types of features. In addition, we propose a channel attention mechanism for the semantic branch. Channel-wise weights are computed according to the channel activations around the target position. While the inherited architecture from SiamFC [3] allows our tracker to operate beyond real-time, the twofold design and the attention mechanism significantly improve the tracking performance. The proposed SA-Siam outperforms all other real-time trackers by a large margin on OTB-2013/50/100 benchmarks. Copyright © 2018, The Authors. All rights reserved.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Convolutional Neural Network-Based Invertible Half-Pixel Interpolation Filter for Video Coding

Convolutional Neural Network-Based Invertible Half-Pixel Int...

引用

IEEE International Conference on Image processing

作者： Ning Yan Dong Liu Houqiang Li Tong Xu Feng Wu Bin Li CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System University of Science and Technology of China Hefei China Microsoft Research Asia Beijing China

Fractional-pixel interpolation has been widely used in the modern video coding standards to improve the accuracy of motion compensated prediction. Traditional interpolation filters are designed based on the signal processing theory. However, video signal is non-stationary, making the traditional methods less effective. In this paper, we reveal that the interpolation filter can not only generate the fractional pixels from the integer pixels, but also reconstruct the integer pixels from the fractional ones. This property is called invertibility. Inspired by the invertibility of fractional-pixel interpolation, we propose an end-to-end scheme based on convolutional neural network (CNN) to derive the invertible interpolation filter, termed CNNInvIF. CNNlnvIF does not need the “ground-truth” of fractional pixels for training. Experimental results show that the proposed CNNInvIF can achieve up to 4.6% and on average 2.2% BD-rate reduction than HEVC under the low-delay P configuration.

关键词： Interpolation Video coding Training Image reconstruction geometry Convolutional neural networks Image coding

来源：评论

学校读者我要写书评

暂无评论

SALIENT SEED EXTRACTION BASED TARGET DETECTION IN SAR IMAGES

SALIENT SEED EXTRACTION BASED TARGET DETECTION IN SAR IMAGES

引用

IEEE International geoscience and Remote Sensing Symposium

作者： Zongxu Pan Bin Lei Key Laboratory of Technology in Geo-spatial Information Processing and Application System Chinese Academy of Sciences Beijing China University of Chinese Academy of Sciences Beijing China

A salient seed extraction based target detection method is proposed in this paper, aiming to distinguish target points from background points in SAR images. Different from recent superpixel based method which generates superpixels firstly, and for each superpixel decides whether it belongs to part of a target. The proposed method employs a salient point to region scheme. At first, salient seeds are extracted by mean-shift and region feature based approach. Then, pixels are assigned to the most similar seed and those assigned to the salient seeds are extracted to form the foreground region. Finally, constant false alarm rate (CFAR) operation is employed to detect the target points from the foreground region. The effectiveness of the proposed method is validated by comparing with five state-of-the-art methods on TerraSAR-X images.

关键词： Target detection Salient seed extraction Synthetic aperture radar (SAR)

来源：评论

学校读者我要写书评

暂无评论

CONVOLUTIONAL NEURAL NETWORK-BASED ARITHMETIC CODING OF DC COEFFICIENTS FOR HEVC INTRA CODING

CONVOLUTIONAL NEURAL NETWORK-BASED ARITHMETIC CODING OF DC C...

引用

IEEE International Conference on Image processing

作者： Changyue Ma Dong Liu Xiulian Peng Feng Wu CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System University of Science and Technology of China Hefei China Microsoft Research Asia Beijing China

In the state-of-the-art video coding standard-High Efficiency Video Coding (HEVC), context-adaptive binary arithmetic coding (CABAC) is adopted as the entropy coding tool. In CABAC, the binarization processes are manually designed, and the context models are empirically crafted, both of which incur that the probability distribution of the syntax elements may not be estimated accurately, and restrict the coding efficiency. In this paper, we adopt a convolutional neural network-based arithmetic coding (CNNAC) strategy, and conduct studies on the coding of the DC coefficients for HEVC intra coding. Instead of manually designing binarization process and context model, we propose to directly estimate the probability distribution of the value of the DC coefficient using densely connected convolutional networks. The estimated probability together with the real DC coefficient are then input into a multi-level arithmetic codec to fulfill entropy coding. Simulation results show that our proposed CNNAC leads to on average 22.47% bits saving compared with CABAC for the bits of DC coefficients, which corresponds to 1.6% BD-rate reduction than the HEVC anchor.

关键词： Context modeling Probability distribution Entropy coding Codecs Training data Tools

来源：评论

学校读者我要写书评

暂无评论

Identity Regularized Sparse Representation for Automatic Target Recognition in Sar Images

Identity Regularized Sparse Representation for Automatic Tar...

引用

IEEE International Symposium on geoscience and Remote Sensing (IGARSS)

作者： Zongxu Pan Lei Liu Bin Lei Key Laboratory of Technology in Geo-spatial Information Processing and Application System Chinese Academy of Sciences Beijing China University of Chinese Academy of Sciences Beijing China

An identity regularized sparse representation (IRSR) based SAR target recognition method is proposed in this paper. The method aims to find a transformation that can map the data to a transformed space, in which targets from the same class are close with each other, no matter the distance of them in the original space. This identity constraint can be formulated as a ℓ 1 -norm minimization problem. By decoupling the problem into the sparse coding problem and the dictionary learning problem, the solution can be obtained iteratively. The solution is simply the weighted average of the sparse coding of all training data. Experimental results demonstrate that the proposed method is superior to several related methods.

关键词： Synthetic aperture radar Target recognition Linear programming Airplanes Remote sensing Support vector machines Minimization

来源：评论

学校读者我要写书评

暂无评论

Layer-wise coordination between encoder and decoder for neural machine translation 18

Layer-wise coordination between encoder and decoder for neur...

引用

Proceedings of the 32nd International Conference on Neural information processing systems

作者： Tianyu He Xu Tan Yingce Xia Di He Tao Qin Zhibo Chen Tie-Yan Liu CAS Key Laboratory of Technology in Geo-spatial Information Processing and Application System University of Science and Technology of China Microsoft Research Key Laboratory of Machine Perception MOE School of EECS Peking University

Neural Machine Translation (NMT) has achieved remarkable progress with the quick evolvement of model structures. In this paper, we propose the concept of layer-wise coordination for NMT, which explicitly coordinates the learning of hidden representations of the encoder and decoder together layer by layer, gradually from low level to high level. Specifically, we design a layer-wise attention and mixed attention mechanism, and further share the parameters of each layer between the encoder and decoder to regularize and coordinate the learning. Experiments show that combined with the state-of-the-art Transformer model, layer-wise coordination achieves improvements on three IWSLT and two WMT translation tasks. More specifically, our method achieves 34.43 and 29.01 BLEU score on WMT16 English-Romanian and WMT14 English-German tasks, outperforming the Transformer baseline.

关键词：

来源：评论

学校读者我要写书评

暂无评论

A DESIGN OF THE COMPLETE POLARIZATION CONVERTER USING DIELECTRIC PERIODIC STRUCTURES

A DESIGN OF THE COMPLETE POLARIZATION CONVERTER USING DIELEC...

引用

IEEE International geoscience and Remote Sensing Symposium

作者： Yueting Zhang Chibiao Ding Bin Lei Weihai Fang Key Laboratory of Geo-spatial Information Processing and Application System Technology Chinese Academy of Sciences University of Chinese Academy of Sciences Beijing General Institute of Electronic Engineering

Polarization converter is used in the applications of the polar SAR observations. There exists coupling between TE and TM modes when plane wave is oblique incident on the surface of dielectric periodic structure, the single TE or TM polarized wave incident will cause TE and TM mixed transmission wave. In some proper incident conditions, complete polarization conversion can be realized between TE and TM mode. In this work, a design of complete polarization converter by using dielectric periodic structure is designed and it is carefully investigated by a method which combines the multimode network theory with the rigorous mode matching method. We revealed TE/TM complete polarization conversion characteristics of dielectric periodic structure, and also analyzed the effects of structure parameters. These investigations provide important guideline for accurate designing new millimeter wave polarization converters.

关键词： Dielectrics Polarization Periodic structures Optical waveguides Microwave filters Reflection

来源：评论

学校读者我要写书评

暂无评论

NTIRE 2020 Challenge on Image and Video Deblurring

arXiv

引用

arXiv 2020年

作者： Seungjun, Nah Sanghyun, Son Radu, Timofte Kyoung Mu, Lee Tseng, Yu Xu, Yu-Syuan Chiang, Cheng-Ming Tsai, Yi-Min Brehm, Stephan Scherer, Sebastian Xu, Dejia Chu, Yihao Sun, Qingyan Jiang, Jiaqin Duan, Lunhao Yao, Jian Purohit, Kuldeep Suin, Maitreya Rajagopalan, A.N. Ito, Yuichi Hrishikesh, P.S. Puthussery, Densen Akhil, K.A. Jiji, C.V. Kim, Guisik Deepa, P.L. Xiong, Zhiwei Huang, Jie Liu, Dong Kim, Sangmin Nam, Hyungjoon Kim, Jisu Jeong, Jechang Huang, Shihua Fan, Yuchen Yu, Jiahui Yu, Haichao Huang, Thomas S. Zhou, Ya Li, Xin Liu, Sen Chen, Zhibo Dutta, Saikat Das, Sourya Dipta Garg, Shivam Sprague, Daniel Patel, Bhrij Huck, Thomas Department of ECE ASRI SNU Korea Republic of Computer Vision Lab ETH Zurich Switzerland MediaTek Inc University of Augsburg Chair for Multimedia Computing and Computer Vision Lab Germany Peking University China Beijing University of Posts and Telecommunications China Beijing Jiaotong University China Wuhan University China Indian Institute of Technology Madras India Vermilion College of Engineering Trivandrum India CVML Chung-Ang University Korea Republic of APJ Abdul Kalam Technological University India University of Science and Technology of China China Image Communication Signal Processing Laboratory Hanyang University Korea Republic of Southern University of Science and Technology China University of Illinois at Urbana-Champaign United States CAS Key Laboratory of Technology in Geo-Spatial Information Processing and Application System University of Science and Technology of China China IIT Madra Jadavpur University India University of Texas Austin United States Duke University Computer Science Department United States

Motion blur is one of the most common degradation artifacts in dynamic scene photography. This paper reviews the NTIRE 2020 Challenge on Image and Video Deblurring. In this challenge, we present the evaluation results from 3 competition tracks as well as the proposed solutions. Track 1 aims to develop single-image deblurring methods focusing on restoration quality. On Track 2, the image deblurring methods are executed on a mobile platform to find the balance of the running speed and the restoration accuracy. Track 3 targets developing video deblurring methods that exploit the temporal relation between input frames. In each competition, there were 163, 135, and 102 registered participants and in the final testing phase, 9, 4, and 7 teams competed. The winning methods demonstrate the state-of-the-art performance on image and video deblurring tasks. Copyright © 2020, The Authors. All rights reserved.

关键词： Image enhancement

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：