Search Results - Inner Mongolia University Library

30th IEEE International Conference on Image Processing (ICIP)

Authors: Hussain, Sadia; Lall, Brejesh. Affiliations: IIT Delhi BSTTM Delhi India; IIT Delhi Dept EE Delhi India

ISBN: (print) 9781728198354

Convolutional neural networks have proven proficient at extracting low-level concepts in an image. Given the strong performance of transformers in exploiting long-range correlations in an image, many methods have been explored that exploit the benefits of both architectures. Therefore, to strengthen our network, we add an important feature to transformers, wherein single image super-resolution (SISR) is performed using band grouping that leverages a simple CNN architecture. This paper aims to train a set of simple residual modelling architectures and then integrate them into a transformer architecture to solve the super-resolution problem in hyperspectral imagery (HSI). We take a step forward to analyse how to adapt SwinIR to fully exploit the information derived from band grouping for efficient SISR.

Keywords: hyperspectral restoration; image super-resolution; transformers; convolutional neural networks; spatial super-resolution
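
The abstract above relies on band grouping but does not say how the groups are formed. As a rough illustration only, the sketch below splits the spectral band indices of a hyperspectral cube into consecutive groups that may overlap. The function name `group_bands` and the parameters `group_size` and `overlap` are hypothetical and are not taken from the paper.

```python
from typing import List


def group_bands(num_bands: int, group_size: int, overlap: int = 0) -> List[List[int]]:
    """Split band indices 0..num_bands-1 into consecutive groups.

    Adjacent groups share `overlap` bands. The final group is aligned with
    the last band so that every group holds exactly `group_size` bands.
    This grouping rule is an assumption made for illustration.
    """
    if not 0 <= overlap < group_size <= num_bands:
        raise ValueError("need 0 <= overlap < group_size <= num_bands")
    step = group_size - overlap
    groups: List[List[int]] = []
    start = 0
    while True:
        if start + group_size >= num_bands:
            # Final group: shift it left so it ends on the last band.
            groups.append(list(range(num_bands - group_size, num_bands)))
            break
        groups.append(list(range(start, start + group_size)))
        start += step
    return groups
```

Each group could then be passed to its own small residual CNN before the results are merged, which is one plausible reading of the pipeline described in the abstract.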

GRAPHIC - GRAPH-BASED REPRESENTATION FOR ANALYZING PEOPLE'S HIGH-LEVEL INTERACTIONS IN CROWDS

2024 International Conference on Image Processing

Authors: Longobardi, Francesco; Riccio, Daniel. Affiliation: Univ Naples Federico II Via Claudio 21 I-80125 Naples Italy

ISBN: (print) 9798350349405; 9798350349399

The need for automated systems to aid law enforcement during densely packed events arises from the inherent danger of large crowds, evidenced by historical instances of stampedes and crushes. Existing methods vary from basic crowd statistics extraction to detailed anomaly detection in behavior classification, but often focus on single, pre-segmented scenes. Our work addresses classifying crowd behaviors in environments where multiple behaviors coexist within a single scene, defined as a multi-class crowd motion characterization challenge. We use a microscopic approach for scenes captured by drones at varying altitudes, without prior manipulation. This approach combines graph-based representations of individuals and flow images, facilitating classification of diverse crowd behaviors in unsegmented scenes. Tested on a public dataset, our method shows promising results in analyzing complex crowd dynamics.

Keywords: Crowd behaviour classification; Graph neural networks; Drone video analysis
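
The abstract above mentions graph-based representations of individuals without giving the construction rule. A minimal sketch, assuming that two people are linked when they stand within a fixed distance of each other, could look like this. The function name `build_proximity_graph` and the `radius` threshold are hypothetical.

```python
import math
from typing import Dict, List, Tuple


def build_proximity_graph(
    positions: List[Tuple[float, float]], radius: float
) -> Dict[int, List[int]]:
    """Return an undirected adjacency list over detected individuals.

    Nodes are indices into `positions`; an edge joins two individuals whose
    Euclidean distance is at most `radius`. The distance rule is an assumed
    stand-in for whatever linking criterion the paper uses.
    """
    graph: Dict[int, List[int]] = {i: [] for i in range(len(positions))}
    for i in range(len(positions)):
        for j in range(i + 1, len(positions)):
            if math.dist(positions[i], positions[j]) <= radius:
                graph[i].append(j)
                graph[j].append(i)
    return graph
```

A graph neural network would then operate on this adjacency structure together with per-node features such as position or local flow.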

Color image encryption based on lite dense-ResNet and bit-XOR diffusion

MULTIMEDIA TOOLS AND APPLICATIONS, 2024, Vol. 83, No. 5, pp. 12819-12848

Authors: Bao, Zhenjie; Xue, Ru; Hu, Jingyun; Liu, Yue. Affiliation: Xizang Minzu Univ Sch Informat Engn Xianyang Shaanxi Peoples R China

Images contain a wealth of information and are frequently targeted by malicious attackers when transmitted over public networks. Fortunately, image encryption prevents confidential information from being acquired by attackers. Deep learning-based image encryption is a relatively new research area, but recently proposed methods have not achieved satisfactory levels of generalization, security, and efficiency. To address these limitations, we employ a lite dense residual network (Dense-ResNet) to rearrange image pixels, thereby reducing the amount of computation. In addition, we design a weight-adjustable loss function model, which combines the encryption loss function, decryption loss function, and total variation loss function. We then adopt bit-XOR diffusion to further encrypt the intermediate ciphertext image obtained by the encryption network. We trained and tested the encryption and decryption neural networks on a dataset of images with no fixed category. Experiments show that our method can complete image encryption and decryption tasks in various scenarios. Additionally, the proposed approach exhibits broad generalization abilities with high encryption and decryption quality, aided by the decryption total variation loss function. Compared to recently proposed deep learning-based image encryption approaches, our method demonstrates faster processing times for both image encryption and decryption, with at least a 2.7% and 7.5% increase in efficiency, respectively. Furthermore, our method improves decryption performance by at least 1.0% and 0.5% in the peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) indicators, respectively, while maintaining a high level of security. Moreover, our method enhances the traceability of data loss or noise attacks, since such attacks leave a noticeable trail on decrypted images produced by our method.

Keywords: Color image encryption; Lite dense-ResNet; Bit-XOR diffusion; Deep learning
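
The abstract above names bit-XOR diffusion but not its exact form. The sketch below shows one common chained variant, in which each cipher byte depends on the plain byte, a keystream byte, and the previous cipher byte. The keystream here comes from Python's `random.Random`, which is an assumption made to keep the example self-contained and is not suitable for real encryption. The names `xor_diffuse`, `xor_undiffuse`, `seed`, and `iv` are hypothetical.

```python
import random
from typing import List


def keystream(seed: int, length: int) -> List[int]:
    """Deterministic byte stream for illustration only (not secure)."""
    rng = random.Random(seed)
    return [rng.randrange(256) for _ in range(length)]


def xor_diffuse(pixels: List[int], seed: int, iv: int = 0) -> List[int]:
    """Chained XOR: c[i] = p[i] ^ k[i] ^ c[i-1], with c[-1] = iv."""
    out: List[int] = []
    prev = iv
    for p, k in zip(pixels, keystream(seed, len(pixels))):
        prev = p ^ k ^ prev
        out.append(prev)
    return out


def xor_undiffuse(cipher: List[int], seed: int, iv: int = 0) -> List[int]:
    """Inverse of xor_diffuse: p[i] = c[i] ^ k[i] ^ c[i-1]."""
    out: List[int] = []
    prev = iv
    for c, k in zip(cipher, keystream(seed, len(cipher))):
        out.append(c ^ k ^ prev)
        prev = c
    return out
```

Because of the chaining, changing one input byte changes that cipher byte and every cipher byte after it, which is the diffusion property this step is meant to provide.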

A Hybrid CNN-Tree Based Model for Enhanced Image Classification Performance

32nd IEEE Signal Processing and Communications Applications Conference (SIU)

Authors: Aydin, Musa; Kus, Zeki; Akcelik, Zeliha Kaya. Affiliation: Fatih Sultan Mehmet Vakif Univ Bilgisayar Muhendisligi Istanbul Turkiye

ISBN: (print) 9798350388978; 9798350388961

Blood cells play an essential role in various bodily functions, such as protection against infections and the body's defense. The accurate classification of blood cells, generally grouped as red blood cells, white blood cells, and platelets, is important for clinical diagnosis and hematological analysis. However, identifying these cells is a specialized and time-consuming process. Therefore, high-precision automatic blood cell classification is an active research topic. Convolutional neural networks (CNNs) are deep learning models used for visual data analysis and are very powerful in extracting features from data. In this study, we propose a hybrid classification model that combines the feature extraction power of CNNs with the ensemble-based prediction capabilities of the Random Forest and XGBoost algorithms. The proposed hybrid model is compared with different methods on the BloodMNIST dataset in terms of classification performance and inference time. The results show that the tree-based methods outperform CNN by up to 8.49 and 11.62 points and achieve up to 82.9 times better inference times than other methods.

Keywords: CNN feature extraction; Random forest; XGBoost; Blood cell classification

Efficient Underground Target Detection of Urban Roads in Ground-Penetrating Radar Images Based on Neural Networks

REMOTE SENSING, 2023, Vol. 15, No. 5, p. 1346

Authors: Xue, Wei; Chen, Kehui; Li, Ting; Liu, Li; Zhang, Jian. Affiliations: China Univ Geosci Sch Automat Wuhan 430074 Peoples R China; Hubei Key Lab Adv Control & Intelligent Automat Co Wuhan 430074 Peoples R China; Minist Educ Engn Res Ctr Intelligent Technol Geoexplorat Wuhan 430074 Peoples R China; Wuhan Univ Sch Elect Informat Wuhan 430072 Peoples R China

Ground-penetrating radar (GPR) is an important nondestructive testing (NDT) tool for the underground exploration of urban roads. However, due to the large amount of GPR data, traditional manual interpretation is time-consuming and laborious. To address this problem, an efficient underground target detection method for urban roads based on neural networks is proposed in this paper. First, robust principal component analysis (RPCA) is used to suppress the clutter in the B-scan image. Then, three time-domain statistics of each A-scan signal are calculated as its features, and one backpropagation (BP) neural network is adopted to recognize A-scan signals to obtain the horizontal regions of targets. Next, the fusion and deletion (FAD) algorithm is used to further optimize the horizontal regions of targets. Finally, three time-domain statistics of each segmented A-scan signal in the horizontal regions of targets are extracted as the features, and another BP neural network is employed to recognize the segmented A-scan signals to obtain the vertical regions of targets. The proposed method is verified with both simulation and real GPR data. The experimental results show that the proposed method can effectively locate the horizontal ranges and vertical depths of underground targets for urban roads and has higher recognition accuracy and less processing time than the traditional segmentation recognition methods.

Keywords: ground-penetrating radar; underground target detection; urban road; neural network; robust principal component analysis; fusion and deletion algorithm
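
The abstract above says a fusion and deletion (FAD) step refines the horizontal target regions but gives no rules for it. One plausible reading, shown here purely as an assumption, is to fuse regions separated by a small gap and then delete regions that are too narrow. The function name `fuse_and_delete` and the thresholds `max_gap` and `min_width` are hypothetical.

```python
from typing import List, Tuple

Region = Tuple[int, int]  # inclusive (first_trace, last_trace) indices


def fuse_and_delete(regions: List[Region], max_gap: int, min_width: int) -> List[Region]:
    """Merge nearby horizontal regions, then drop the narrow ones.

    Two regions are fused when the gap between them is at most `max_gap`
    traces. A fused region is kept only if it spans at least `min_width`
    traces. Both rules are illustrative assumptions.
    """
    fused: List[Region] = []
    for start, end in sorted(regions):
        if fused and start - fused[-1][1] <= max_gap:
            # Close enough to the previous region: extend it.
            fused[-1] = (fused[-1][0], max(fused[-1][1], end))
        else:
            fused.append((start, end))
    return [(s, e) for s, e in fused if e - s + 1 >= min_width]
```

In this reading, the per-trace labels produced by the first BP network would be converted to regions, cleaned with this step, and then passed to the second network for depth estimation.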

FREQUENCY-AWARE RE-PARAMETERIZATION FOR OVER-FITTING BASED IMAGE COMPRESSION

30th IEEE International Conference on Image Processing (ICIP)

Authors: Ye, Yun; Pan, Yanjie; Jiang, Qually; Lu, Ming; Fang, Xiaoran; Xu, Beryl. Affiliation: Intel Corp Shanghai Peoples R China

ISBN: (print) 9781728198354

Over-fitting-based image compression requires weight compactness for compression and fast convergence for practical use, posing challenges for methods based on deep convolutional neural networks (CNNs). This paper presents a simple re-parameterization method to train CNNs with reduced weight storage and accelerated convergence. The convolution kernels are re-parameterized as a weighted sum of discrete cosine transform (DCT) kernels, enabling direct optimization in the frequency domain. Combined with L1 regularization, the proposed method surpasses vanilla convolutions by achieving significantly improved rate-distortion performance at low computational cost. The proposed method is verified with extensive experiments of over-fitting-based image restoration on various datasets, achieving up to -46.12% BD-rate on top of HEIF with only 200 iterations.

Keywords: Image Compression; Over-fitting based Compression; Convolutional Neural Networks; Rate-Distortion
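
The re-parameterization described above, a kernel written as a weighted sum of DCT kernels, can be sketched with the orthonormal 2D DCT-II basis. This is an illustrative reconstruction: the abstract does not state which DCT variant or normalization the authors use, and the names `dct_basis` and `kernel_from_coefficients` are hypothetical.

```python
import math
from typing import List

Kernel = List[List[float]]


def dct_basis(n: int, u: int, v: int) -> Kernel:
    """Orthonormal 2D DCT-II basis kernel of size n x n for frequency (u, v)."""

    def alpha(k: int) -> float:
        return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)

    return [
        [
            alpha(u) * alpha(v)
            * math.cos((2 * x + 1) * u * math.pi / (2 * n))
            * math.cos((2 * y + 1) * v * math.pi / (2 * n))
            for y in range(n)
        ]
        for x in range(n)
    ]


def kernel_from_coefficients(coeffs: Kernel) -> Kernel:
    """Build a spatial kernel as sum over (u, v) of coeffs[u][v] * basis(u, v)."""
    n = len(coeffs)
    kernel = [[0.0] * n for _ in range(n)]
    for u in range(n):
        for v in range(n):
            basis = dct_basis(n, u, v)
            for x in range(n):
                for y in range(n):
                    kernel[x][y] += coeffs[u][v] * basis[x][y]
    return kernel
```

In training, the frequency coefficients `coeffs` would be the learnable parameters. An L1 penalty on them encourages many coefficients to become zero, which is what makes the stored weights compact.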

A two-stage CNN method for MRI image segmentation of prostate with lesion

BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, Vol. 82, No. 1

Authors: Wang, Zixuan; Wu, Ruofan; Xu, Yanran; Liu, Yi; Chai, Ruimei; Ma, He. Affiliations: Northeastern Univ Coll Med & Biol Informat Engn Shenyang 110819 Liaoning Peoples R China; China Med Univ Hosp 1 Dept Radiol Shenyang 110002 Liaoning Peoples R China; Minist Educ Key Lab Intelligent Comp Med Image Shenyang 110819 Liaoning Peoples R China

Prostate magnetic resonance imaging (MRI) is widely used in the diagnosis of prostate cancer and other prostate diseases. The automatic segmentation of images from prostate MRI plays an important role in the auxiliary diagnosis of prostate diseases. Currently, there are two commonly used methods for automatic segmentation of prostate MRI, which are 2D image segmentation and 3D image segmentation. In this paper, a two-stage CNN method for MRI image segmentation of prostate with lesion is proposed. At the first stage, we used a CNN model incorporating the Squeeze-Excitation module to discriminate whether the image contains prostate or not. At the second stage, we proposed a Residual-Attention U-Net for segmentation of images containing prostate. Eventually, the 3D prostate MRI segmentation results are obtained and fully automated segmentation is accomplished. We evaluated our proposed method and other common 2D and 3D segmentation methods on the test dataset and compared their results based on Dice Similarity Coefficient (DSC) value. Our method performed the best and achieved the DSC metric value of 0.860.

Keywords: Prostate MRI images; Medical image segmentation; Convolutional neural networks; Squeeze-excitation module; Residual-attention U-net
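
The Dice Similarity Coefficient used for evaluation above is defined as DSC = 2|A ∩ B| / (|A| + |B|) for a predicted mask A and a reference mask B. A minimal sketch for binary masks stored as nested lists follows. Returning 1.0 when both masks are empty is a convention chosen here, not something stated in the abstract.

```python
from typing import Sequence


def dice_coefficient(pred: Sequence[Sequence[int]], target: Sequence[Sequence[int]]) -> float:
    """Dice similarity between two binary masks of identical shape."""
    flat_pred = [bool(v) for row in pred for v in row]
    flat_target = [bool(v) for row in target for v in row]
    if len(flat_pred) != len(flat_target):
        raise ValueError("masks must have the same number of pixels")
    intersection = sum(p and t for p, t in zip(flat_pred, flat_target))
    total = sum(flat_pred) + sum(flat_target)
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * intersection / total
```

For a 3D volume, the same formula applies after stacking the per-slice masks, so slices that the first stage rejects simply contribute empty masks.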

Diagnosis of Parkinson's Disease Based on Hybrid Fusion Approach of Offline Handwriting Images

IEEE SIGNAL PROCESSING LETTERS, 2024, Vol. 31, pp. 3179-3183

Authors: Dong, Shanyu; Liu, Jin; Wang, Jianxin. Affiliation: Cent South Univ Sch Comp Sci & Engn Hunan Prov Key Lab Bioinformat Changsha 410083 Peoples R China

Handwriting images are commonly used to diagnose Parkinson's disease due to their intuitive nature and easy accessibility. However, existing methods have not explored the potential of the fusion of different handwriting image sources for diagnosis. To address this issue, this study proposes a hybrid fusion approach that makes use of the visual information derived from different handwriting images and handwriting templates, significantly enhancing the performance in diagnosing Parkinson's disease. The proposed method involves several key steps. Initially, different preprocessed handwriting images undergo pixel-level fusion using Laplacian transformation. Subsequently, the fused and original images are fed into a pre-trained CNN separately to extract visual features. Finally, feature-level fusion is performed by concatenating the feature vectors extracted from the flatten layer, and the fused feature vectors are input into SVM to obtain classification results. Our experimental results validate that the proposed method achieves excellent performance by only utilizing visual features from images, with 95.45% accuracy on the NewHandPD. Furthermore, the results obtained on our dataset verify the strong generalizability of the proposed approach.

Keywords: Feature extraction; Diseases; Visualization; Vectors; Image color analysis; Convolutional neural networks; Accuracy; Image edge detection; Writing; Transforms; Handwriting images; hybrid fusion approach; Laplacian transformation; Parkinson's disease; pre-trained CNN

Stochastic Window Transformer for Image Restoration

36th Conference on Neural Information Processing Systems (NeurIPS)

Authors: Xiao, Jie; Fu, Xueyang; Wu, Feng; Zha, Zheng-Jun. Affiliation: Univ Sci & Technol China Hefei Peoples R China

ISBN: (print) 9781713871088

Thanks to their powerful representation capabilities, transformers have made impressive progress in image restoration. However, existing transformer-based methods do not carefully consider the particularities of image restoration. In general, an ideal image restoration approach should be translation-invariant to the degradation, i.e., the undesirable degradation should be removed irrespective of its position within the image. Furthermore, local relationships also play a vital role and should be faithfully exploited for recovering clean images. Nevertheless, most transformers adopt either local attention with a fixed local window strategy or global attention, which unfortunately breaks the translation invariance and causes a huge loss of local relationships. To address these issues, we propose an elegant stochastic window strategy for transformers. Specifically, we first introduce the window partition with stochastic shift to replace the original fixed window partition for training. Then, we design a new layer expectation propagation algorithm to efficiently approximate the expectation of the induced stochastic transformer for testing. Our stochastic window transformer not only enjoys powerful representation but also maintains the desired properties of translation invariance and locality. Experiments validate that the stochastic window strategy consistently improves performance on various image restoration tasks (deraining, denoising and deblurring) by significant margins. The code is available at https://***/jiexiaou/Stoformer.

Keywords: image reconstruction
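
The window partition with stochastic shift described above can be illustrated on pixel coordinates. This sketch assumes a cyclic shift drawn uniformly from one window period and does not attempt the layer expectation propagation used at test time. The function names and the cyclic wrap-around are assumptions, not details confirmed by the abstract.

```python
import random
from typing import List, Tuple

Coord = Tuple[int, int]


def partition_windows(
    height: int, width: int, window: int, shift: Tuple[int, int] = (0, 0)
) -> List[List[Coord]]:
    """Group pixel coordinates into window x window blocks after a cyclic shift."""
    if height % window or width % window:
        raise ValueError("height and width must be multiples of window")
    dy, dx = shift
    windows: List[List[Coord]] = []
    for wy in range(0, height, window):
        for wx in range(0, width, window):
            windows.append(
                [
                    ((wy + y + dy) % height, (wx + x + dx) % width)
                    for y in range(window)
                    for x in range(window)
                ]
            )
    return windows


def stochastic_partition(
    height: int, width: int, window: int, rng: random.Random
) -> Tuple[Tuple[int, int], List[List[Coord]]]:
    """Draw a random shift in [0, window) per axis and partition with it."""
    shift = (rng.randrange(window), rng.randrange(window))
    return shift, partition_windows(height, width, window, shift)
```

With a zero shift this reduces to the fixed partition. Drawing a new shift at each training step means no pixel is permanently tied to a window border, which is the intuition behind restoring translation invariance.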

A gradient-free training approach for optical neural networks based on stochastic functions

Optoelectronic Devices and Integration XIII 2024

Authors: Qi, Ji; Wang, Shuang; Liu, Zheng; Zhang, Yunfan; Cai, Xiaomin; Liu, Tiegen; Xu, Tianhua. Affiliations: School of Precision Instruments and Opto-Electronics Engineering Tianjin University Tianjin 300072 China; Tianjin Key Laboratory of Space Environment Simulation Technology Tianjin 300450 China; School of Engineering University of Warwick Coventry CV4 7AL United Kingdom

ISBN: (print) 9781510682009

Because of their increasingly large computational demands, modern neural networks are severely constrained in processing speed and energy consumption. Optical neural networks (ONNs), which use photonic structures to process signals at the physical level as an alternative to the computation in the electronic domain provided by traditional neural networks, are an attractive approach to implementing ultra-high-speed, low-energy parallel computation. Nevertheless, current training processes for electronic-domain neural networks rely on gradient-based training methods, such as backpropagation, which are not compatible with gradient-free ONNs. In this work, a stochastic function-based gradient-free training method, i.e., stochastic function direct feedback alignment (SF-DFA), is demonstrated and evaluated. SF-DFA trains a gradient-free system using stochastic matrices and functions to replace the weights and gradients of the nodes in neural networks. Thus, it is feasible to train ONNs without prior knowledge of the photonic system and its gradients. In addition, implementing such a training process on optical hardware is also known to be possible. A series of studies have been carried out for a spectral slicing neural network (SS-NN) architecture trained by SF-DFA. The SS-NN system uses bandpass filters embedded in optical fiber micro rings to enable slicing of the optical signal spectrum. Our results demonstrate that the training of ONNs using SF-DFA can converge efficiently, with higher processing speed and lower energy consumption compared to backpropagation. © 2024 SPIE. All rights reserved.

Keywords: Photonic devices
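
For background, the sketch below shows the hidden-layer learning signal in plain direct feedback alignment (DFA), where a fixed random matrix carries the output error straight to a hidden layer in place of the transposed forward weights. It is not the SF-DFA method from the abstract: standard DFA still multiplies by the derivative of the activation (tanh is assumed here), whereas SF-DFA is said to replace such gradients with stochastic functions. All names are hypothetical.

```python
import math
import random
from typing import List

Matrix = List[List[float]]


def random_feedback(hidden_size: int, output_size: int, seed: int) -> Matrix:
    """Fixed random feedback matrix B of shape hidden_size x output_size."""
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(output_size)] for _ in range(hidden_size)]


def dfa_hidden_signal(
    feedback: Matrix, output_error: List[float], hidden_pre: List[float]
) -> List[float]:
    """Plain-DFA learning signal: (B @ e) * tanh'(a) for each hidden unit.

    `output_error` is e = prediction - target and `hidden_pre` holds the
    hidden pre-activations a. No forward weights are needed to compute it.
    """
    projected = [sum(b * e for b, e in zip(row, output_error)) for row in feedback]
    return [p * (1.0 - math.tanh(a) ** 2) for p, a in zip(projected, hidden_pre)]
```

The hidden-layer weight update is then the outer product of this signal with the layer input, scaled by a learning rate. The feedback matrix stays fixed throughout training.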
