检索结果-内蒙古大学图书馆

Frequency-Aware Physics-Inspired Degradation Model for Real-World image Super-Resolution

学校读者我要写书评

暂无评论

arXiv 2021年

作者： Dong, Zhenxing Cao, Hong Shen, Wang Gan, Yu Ling, Yuye Zhai, Guangtao Su, Yikai John Hopcroft Center for Computer Science Shanghai Jiao Tong University China Institute of Image Communication and Network Engineering Shanghai Jiao Tong University China Department of Electrical and Computer Engineering University of Alabama TuscaloosaAL35401 United States State Key Lab of Advanced Optical Communication Systems and Networks Shanghai Jiao Tong University China

Current learning-based single image super-resolution (SISR) algorithms underperform on real data due to the deviation in the assumed degradation process from that in the real-world scenario. Conventional degradation processes consider applying blur, noise, and downsampling (typically bicubic downsampling) on high-resolution (HR) images to synthesize low-resolution (LR) counterparts. However, few works on degradation modelling have taken the physical aspects of the optical imaging system into consideration. In this paper, we analyze the imaging system optically and exploit the characteristics of the real-world LR-HR pairs in the spatial frequency domain. We formulate a real-world physics-inspired degradation model by considering both optics and sensor degradation;The physical degradation of an imaging system is modelled as a low-pass filter, whose cut-off frequency is dictated by the object distance, the focal length of the lens, and the pixel size of the image sensor. In particular, we propose to use a convolutional neural network (CNN) to learn the cutoff frequency of real-world degradation process. The learned network is then applied to synthesize LR images from unpaired HR images. The synthetic HR-LR image pairs are later used to train an SISR network. We evaluate the effectiveness and generalization capability of the proposed degradation model on real-world images captured by different imaging systems. Experimental results showcase that the SISR network trained by using our synthetic data performs favorably against the network using the traditional degradation model. Moreover, our results are comparable to that obtained by the same network trained by using real-world LR-HR pairs, which are challenging to obtain in real scenes. © 2021, CC BY.

关键词： Cutoff frequency

A multi-user oriented live free-viewpoint video streaming system based on view interpolation

学校读者我要写书评

暂无评论

arXiv 2021年

作者： Hu, Jingchuan Guo, Shuai Dong, Yu Zhou, Kai Xu, Jun Song, Li Institute of Image Communication and Network Engineering Shanghai Jiao Tong University Shanghai200240 China Cooperative Medianet Innovation Center Shanghai Jiao Tong University Shanghai200240 China MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University Shanghai200240 China

As an important application form of immersive multimedia services, free-viewpoint video(FVV) enables users with great immersive experience by strong interaction. However, the computational complexity of virtual view synthesis algorithms poses a significant challenge to the real-time performance of an FVV system. Furthermore, the individuality of user interaction makes it difficult to serve multiple users simultaneously for a system with conventional architecture. In this paper, we novelly introduce a CNN-based view interpolation algorithm to synthesis dense virtual views in real time. Based on this, we also build an end-to-end live free-viewpoint system with a multi-user oriented streaming strategy. Our system can utilize a single edge server to serve multiple users at the same time without having to bring a large view synthesis load on the client side. We analysis the whole system and show that our approaches give the user a pleasant immersive experience, in terms of both visual quality and latency. Copyright © 2021, The Authors. All rights reserved.

关键词： Multimedia services

ROAD CENTRAL CONTOUR EXTRACTION FROM HIGH RESOLUTION SATELLITE image USING TENSOR VOTING FRAMEWORK

学校读者我要写书评

暂无评论

ROAD CENTRAL CONTOUR EXTRACTION FROM HIGH RESOLUTION SATELLI...

2006 International Conference on Machine Learning and Cybernetics(IEEE第五届机器学习与控制论坛)

作者： SHENG ZHENG JIAN LIU WEN-ZHONG SHI GUANG-XI ZHU Electronic & Information Engineering Dept. Huazhong University of Science and Technology Wuhan 430 State Education Commission Key Laboratory for Image Processing and Intelligent Control Inst.for Pat Advanced Research Center for Spatial Information Technology Department of Land Surveying and Geo-In Chinese Key lab of optic & electronics Huazhong University of Science and Technology Wuhan 430074

In this paper, a unique road contour extraction approach from high resolution satellite image is proposed, in which the road contour was extracted in two steps. Firstly, support vector machines (SVM) was employed merely to classify the image into two groups of categories: a road group and a non-road group. The identified road group images are the discrete and irregularly distributed sampled points, and they are an uncompleted data set for the road. Secondly, the road contour was extracted from the road group images using the tensor voting framework, since the tensor voting technique is superior to the traditional methods in extracting the geometrical structure from the uncompleted data set. The experimental results on the high resolution satellite image demonstrate that the proposed approach worked well with images comprised by both rural and urban area features.

关键词： Tensor voting framework Road central line extraction High-resolution satellite image

Online Attentive Kernel-Based Temporal Difference Learning

学校读者我要写书评

暂无评论

arXiv 2022年

作者： Yang, Guang Chen, Xingguo Yang, Shangdong Wang, Huihui Dong, Shaokang Gao, Yang The the Jiangsu Key Laboratory of Big Data Security & Intelligent Processing Nanjing University of Posts and Telecommunications National Engineering Laboratory for Agri-Product Quality Traceability Beijing Technology and Business University China The State Key Laboratory for Novel Software Technology Nanjing University China The PCA Lab Key Lab of Intelligent Perception and Systems for High-Dimensional Information Ministry of Education Jiangsu Key Lab of Image and Video Understanding for Social Security School of Computer Science and Engineering Nanjing University of Science and Technology China

With rising uncertainty in the real world, online Reinforcement Learning (RL) has been receiving increasing attention due to its fast learning capability and improving data efficiency. However, online RL often suffers from complex Value Function Approximation (VFA) and catastrophic interference, creating difficulty for the deep neural network to be applied to an online RL algorithm in a fully online setting. Therefore, a simpler and more adaptive approach is introduced to evaluate value function with the kernel-based model. Sparse representations are superior at handling interference, indicating that competitive sparse representations should be learnable, non-prior, non-truncated and explicit when compared with current sparse representation methods. Moreover, in learning sparse representations, attention mechanisms are utilized to represent the degree of sparsification, and a smooth attentive function is introduced into the kernel-based VFA. In this paper, we propose an Online Attentive Kernel-Based Temporal Difference (OAKTD) algorithm using two-timescale optimization and provide convergence analysis of our proposed algorithm. Experimental evaluations showed that OAKTD outperformed several Online Kernel-based Temporal Difference (OKTD) learning algorithms in addition to the Temporal Difference (TD) learning algorithm with Tile Coding on public Mountain Car, Acrobot, CartPole and Puddle World tasks. Copyright © 2022, The Authors. All rights reserved.

关键词： Learning algorithms

Observer-Based Robust Containment Control of Multi-agent Systems With Input Saturation

学校读者我要写书评

暂无评论

Observer-Based Robust Containment Control of Multi-agent Sys...

Chinese Control Conference (CCC)

作者： Juan Qian Xiaoling Wang Guo-Ping Jiang Housheng Su College of Automation and College of Artificial Intelligence Nanjing University of Posts and Telecommunications and Jiangsu Engineering Lab for IOT Intelligent Robots(IOTRobot) Nanjing PR China School of Artificial Intelligence and Automation Image Processing and Intelligent Control Key Laboratory of Education Ministry ofChina Huazhong University of Science and Technology Wuhan PR China

ISBN: (数字)9789881563903

ISBN: (纸本)9781728165233

In this paper, the robust containment control problem of the leader-following multi-agent systems with input saturation and input additive disturbance is addressed, where the followers can be informed by multiple leaders. With the help of the lowand-high gain feedback technique and the high-gain observer approach, a distributed control algorithm for each agent is firstly designed by using the observed output information, then sufficient conditions are provided to guarantee the semi-global robust containment of the system. Finally, some numerical simulations are given to verify the correctness of the theoretical results.

关键词： Multi-agent systems Robustness Observers Electronic mail State feedback Additives

Understanding the Robustness of 3D Object Detection with Bird’s-Eye-View Representations in Autonomous Driving

学校读者我要写书评

暂无评论

arXiv 2023年

作者： Zhu, Zijian Zhang, Yichi Chen, Hai Dong, Yinpeng Zhao, Shu Ding, Wenbo Zhong, Jiachen Zheng, Shibao Institute of Image Communication and Network Engineering Shanghai Jiao Tong University China Dept. of Comp. Sci. and Tech. Institute for AI THBI Lab BNRist Center Tsinghua University China Key Laboratory of Intelligent Computing and Signal Processing Ministry of Education School of Computer Science and Technology Anhui University Information Materials and Intelligent Sensing Laboratory of Anhui Province China SAIC Motor AI Lab China Zhongguancun Laboratory China

3D object detection is an essential perception task in autonomous driving to understand the environments. The Bird’s-Eye-View (BEV) representations have significantly improved the performance of 3D detectors with camera inputs on popular benchmarks. However, there still lacks a systematic understanding of the robustness of these vision-dependent BEV models, which is closely related to the safety of autonomous driving systems. In this paper, we evaluate the natural and adversarial robustness of various representative models under extensive settings, to fully understand their behaviors influenced by explicit BEV features compared with those without BEV. In addition to the classic settings, we propose a 3D consistent patch attack by applying adversarial patches in the 3D space to guarantee the spatiotemporal consistency, which is more realistic for the scenario of autonomous driving. With substantial experiments, we draw several findings: 1) BEV models tend to be more stable than previous methods under different natural conditions and common corruptions due to the expressive spatial representations;2) BEV models are more vulnerable to adversarial noises, mainly caused by the redundant BEV features;3) Camera-LiDAR fusion models have superior performance under different settings with multi-modal inputs, but BEV fusion model is still vulnerable to adversarial noises of both point cloud and image. These findings alert the safety issue in the applications of BEV detectors and could facilitate the development of more robust models. Code available at https: //***/zzj403/BEV_Robust. Copyright © 2023, The Authors. All rights reserved.

关键词： Birds

Ensemble of deep convolutional neural networks for automatic pavement crack detection and measurement

学校读者我要写书评

暂无评论

arXiv 2020年

作者： Fan, Zhun Li, Chong Chen, Ying Di Mascio, Paola Chen, Xiaopeng Zhu, Guijie Loprencipe, Giuseppe Key Lab of Digital Signal and Image Processing of Guangdong Province Shan’tou515063 China College of Engineering Shantou University Shan’tou515063 China Department of Civil Construction and Environmental Engineering Sapienza University of Rome Rome00184 Italy Department of Industrial Engineering Pusan National University Busan609735 Korea Republic of

Automated pavement crack detection and measurement are important road issues. Agencies have to guarantee the improvement of road safety. Conventional crack detection and measurement algorithms can be extremely time-consuming and low efficiency. Therefore, recently, innovative algorithms have received increased attention from researchers. In this paper, we propose an ensemble of convolutional neural networks (without a pooling layer) based on probability fusion for automated pavement crack detection and measurement. Specifically, an ensemble of convolutional neural networks was employed to identify the structure of small cracks with raw images. Secondly, outputs of the individual convolutional neural network model for the ensemble were averaged to produce the final crack probability value of each pixel, which can obtain a predicted probability map. Finally, the predicted morphological features of the cracks were measured by using the skeleton extraction algorithm. To validate the proposed method, some experiments were performed on two public crack databases (CFD and AigleRN) and the results of the different state-of-the-art methods were compared. To evaluate the efficiency of crack detection methods, three parameters were considered: precision (Pr), recall (Re) and F1 score (F1). For the two public databases of pavement images, the proposed method obtained the highest values of the three evaluation parameters: for the CFD database, Pr = 0.9552, Re = 0.9521 and F1 = 0.9533 (which reach values up to 0.5175 higher than the values obtained on the same database with the other methods), for the AigleRN database, Pr = 0.9302, Re = 0.9166 and F1 = 0.9238 (which reach values up to 0.7313 higher than the values obtained on the same database with the other methods). The experimental results show that the proposed method outperforms the other methods. For crack measurement, the crack length and width can be measure based on different crack types (complex, common, thin, and inter

关键词： Convolution

VG-Swarm: A Vision-based Gene Regulation Network for UAVs Swarm Behavior Emergence

学校读者我要写书评

暂无评论

arXiv 2022年

作者： Cai, Yuwei Li, Huanlin Hong, Juncao Xu, Peng Cheng, Hui Zhu, Xiaomin Hu, Bingliang Hao, Zhifeng Fan, Zhun Key Lab of Digital Signal and Image Processing of Guangdong Province College of Engineering University of Shantou Guangdong China School of Computer Science and Engineering University of Sun Yat-Sen Guangdong China College of Systems Engineering National University of Defense Technology Hunan Changsha China Xi’an Institute of Optics and Precision Mechanics Shanxi China

Unmanned Aerial Vehicles (UAVs) dynamic encirclement is an emerging field with great potential. Researchers often get inspiration from biological systems, either from macro-world like fish schools or bird flocks etc, or from micro-world like gene regulatory networks (GRN). However, most swarm control algorithms rely on centralized control, global information acquisition, and communications among neighboring agents. In this work, we propose a distributed swarm control method based purely on vision and GRN without any direct communications, in which swarm agents of e.g. UAVs can generate an entrapping pattern to encircle an escaping target of UAV based purely on their installed omnidirectional vision sensors. A finite-state-machine (FSM) describing the behavioral model of each drone is also designed so that a swarm of drones can accomplish searching and entrapping of the target collectively in an integrated way. We verify the effectiveness and efficiency of the proposed method in various simulation and real-world experiments. Copyright © 2022, The Authors. All rights reserved.

关键词： Drones

Robust multispectral palmprint identification system by jointly using contourlet decomposition & Gabor Filter response

学校读者我要写书评

暂无评论

Robust multispectral palmprint identification system by join...

International Conference on Security and Cryptography (SECRYPT)

作者： Abdallah Meraoumia Salim Chitroub Ahmed Bouridane Fac. des nouvelles technologies de l'information et de la communication Lab. de Génie Electrique Ouargla Algeria Electronics and Computer Science Faculty Signal and Image Processing Laboratory Algiers Algeria Department of Computer Science and Digital Technologies Northumbria University Newcastle Newcastle upon Tyne U.K

In current society, reliable identification and verification of individuals are becoming more and more necessary tasks for many fields, not only in police environment, but also in civilian applications, such as access control or financial transactions. Biometric systems are used nowadays in these fields, offering greater convenience and several advantages over traditional security methods based on something that you know (password) or something that you have (keys). In this paper, we propose an efficient online personal identification system based on Multi-Spectral Palmprint (MSP) images using Contourlet Transform (CT) and Gabor Filter (GF) response. In this study, the spectrum image is characterized by the contourlet coefficients sub-bands. Then, we use the Hidden Markov Model (HMM) for modeling the observation vector. In addition, the same spectrum is filtered by the Gabor filter. The real and imaginary responses of the filtering image are used to create another observation vector. Subsequently, the two sub-systems are integrated in order to construct an efficient multi-modal identification system based on matching score level fusion. Our experimental results show the effectiveness and reliability of the proposed method, which brings both high identification and accuracy rate.

关键词： Biometrics (access control) Feature extraction Hidden Markov models Databases Computational modeling Computed tomography Hamming distance