检索结果-内蒙古大学图书馆

Proceedings of the 2020 4th International Conference on Vision, image and signal processing

作者： Yimin Yuan Chaoying Tang Shuhang Xia Zhou Chen Tong Qi Automation College Nanjing University of Aeronautics and Astronautics Nanjing China

ISBN: (纸本)9781450389532

Biometric identification is the technology that differentiates individuals by body parts or behavioral characteristics. Hand has been proved to be a successful biometric for verification and identification because of the rich features such as fingerprint, palmprint, dorsal vein, etc. This paper presents a system for identifying individuals based on their hand images. Firstly, after image preprocessing with guided filter and CLAHE method, hand images taken under visible light and near-infrared (NIR) light were normalized. Secondly, a convolutional neural network structure was designed and trained on a large dataset. Using hand images as the input of the network, different depth features were extracted, including the feature from the fusion layer. Thirdly, SVM classifiers were adopted to get the classification results. A fusion strategy was used to make use of different SVM classifiers. The proposed algorithm was tested on different datasets and the experimental results showed that high accuracy can be obtained from the fusion of features. It shows that the hand image is a strong biometric for verification and identification.

关键词： Fusion Hand images Convolutional neural Network Biometrics SVM Classifier

来源：评论

学校读者我要写书评

暂无评论

Diffuse optical tomography in the human brain: A briefly review from the neurophysiology to its applications

引用

Brain Science Advances 2021年第4期6卷 289 - 305页

作者： Estefania Hernandez-Martin José Luis Gonzalez-Mora Department of Basic Medical Science Faculty of Health Science Medicine Section Universidad de La Laguna 38071 Spain

The present work describes the use of noninvasive diffuse optical tomography (DOT) technology to measure hemodynamic changes, providing relevant information which helps to understand the basis of neurophysiology in the human brain. Advantages such as portability, direct measurements of hemoglobin state, temporal resolution, non‐restricted movements as occurs in magnetic resonance imaging (MRI) devices mean that DOT technology can be used in research and clinical fields. In this review we covered the neurophysiology, physical principles underlying optical imaging during tissue‐light interactions, and technology commonly used during the construction of a DOT device including the source‐detector requirements to improve the image quality. DOT provides 3D cerebral activation images due to complex mathematical models which describe the light propagation inside the tissue head. Moreover, we describe briefly the use of Bayesian methods for raw DOT data filtering as an alternative to linear filters widely used in signal processing, avoiding common problems such as the filter selection or a false interpretation of the results which is sometimes due to the interference of background physiological noise with neural activity.

关键词： diffuse optical imaging image reconstruction algorithms filtering DOT data biomedical applications

来源：评论

学校读者我要写书评

暂无评论

Cervical lesion segmentation via transformer-based network with attention and boundary-aware modules

引用

Biomedical signal processing and Control 2025年 109卷

作者： Gao, Huayu Li, Jing Shen, Nanyan Lu, Wei Ma, Juanjuan Yang, Ying Shanghai Key Laboratory of Intelligent Manufacturing and Robotics Shanghai University Shanghai China School of Mechatronic Engineering and Automation Shanghai University Shanghai China Shuguang Hospital Affiliated to Shanghai University of Traditional Chinese Medicine Shanghai China

Colposcopic diagnosis and directed biopsy is the foundation of cervical cancer screening. In the procedure of colposcopy, automatic segmentation of cervical lesion in colposcopic images can provide great assistance and convenience especially in underdeveloped region. However, the existing methods based on Convolutional neural Networks only differentiate the abnormality from healthy tissue, which is hard to further subdivide the lesion. In this paper, a Transformer-based network TABNet is proposed which can precisely extract the cervical lesion and recognize the corresponding category of each lesion. Unlike the other CNN-based methods, a more powerful vision transformer is adopted as the encoder. Three effective modules in decoder are constructed to integrate the advance in attention mechanism and boundary-aware prior knowledge. Extensive experiments on a large clinical colposcopic image dataset show that TABNet outperforms the existing state-of-art methods and achieves great improvement. Compared with nnUNet, our proposed model improves the mean DSC by 7.74 % and mean IoU by 8.51 %, respectively. © 2025 Elsevier Ltd

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

FWUA : A Flexible Winograd-Based Uniform Accelerator for 1D/2D/3D CNNs

FWUA : A Flexible Winograd-Based Uniform Accelerator for 1D/...

引用

IEEE International Conference on Integrated Circuits, Technologies and Applications (ICTA)

作者： Jian Wang Huipeng Deng Huafeng Ye Shanlin Xiao Zhiyi Yu School of Electronics and Information Technology Sun Yat-sen University School of Microelectronics Science and Technology Sun Yat-sen University Guangdong Provincial Key Laboratory of Optoelectronic Information Processing Chips and Systems

ISBN: (纸本)9781665417488

Convolutional neural networks (CNNs) have proven to be promising in various applications such as audio recognition, image classification, and video understanding. Different dimensions of CNNs (e.g., 1D, 2D, and 3D CNNs) are proposed to adapt to these applications. To accelerate different dimensional convolution, a uniform accelerator is necessary. Nevertheless, the implementation poses a significant challenge due to several observations. Firstly, computational complexity, network mapping methods, and data reuse strategies vary greatly among different dimensional convolutional neural networks. Secondly, various efficient algorithms such as Winograd have been proposed to accelerate CNNs, but their implementations lack flexible support for different network types. Typically, the Winograd-base accelerator is designed for 1-stride and the non-1-stride methods haven’t been implemented on 3D CNNs. To address these challenges, we propose a flexible Winograd-based uniform accelerator (FWUA) for 1D/2D/3D CNNs. With adaptive support for different dimensions, strides, and filter sizes, FWUA is runtime-reconfigurable for different dimensions of CNNs applications, i.e., audio, image, and video. The FWUA is verified on the Xilinx ZCU102 evaluation board FPGA. Our design achieves 1.51/1.13/0.66 (GOPS/DSP) DSP-efficiency and 242/181/105 (GOPS/W) energy-efficiency in C3D, VGG-16, and HAR-CNNs, which are up to 2x comparing to state-of-the-art FPGA works.

关键词： Integrated circuit technology Three-dimensional displays image recognition Convolution Digital signal processors signal processing algorithms Energy efficiency

来源：评论

学校读者我要写书评

暂无评论

A Computational Algorithm Based on Convolutional neural Networks Aimed at Estimating the MOS Quality Parameter According to the Norm UIT-T P.862 22

A Computational Algorithm Based on Convolutional Neural Netw...

引用

22nd Symposium on image, signal processing and Artificial Vision (STSIVA)

作者： Gutierrez, Rodrigo Asca, Brallan Kemper, Guillermo Univ Peruana Ciencias Aplicadas Sch Elect Engn Fac Engn Lima Peru

ISBN: (纸本)9781728114910

This paper proposes an algorithm based on convolutional neural networks for the estimation of the quality level of voice signals transmitted through cellular communication systems. The objective is to take advantage of artificial intelligence methods to estimate the MOS parameter and obtain a similar accuracy to that obtained by methods and procedures established in the international norms and international licensed standards. The proposed algorithm uses the MOS results obtained by the method detailed in the ITU-T P.862 standard. The values were obtained for different signals acquired at different reception points. With this information we proceeded to design and train a convolutional neuronal network of 4 layers, achieving very satisfactory results. For the validation, the mean square error was used to measure the degree of similarity of the MOS values obtained by ITU-T P.862 and by the proposed algorithm. The results show a mean square error of 0.00007 for the proposed algorithm.

关键词： Convolutional neural network MOS PESQ UIT-T P.862 QoE

来源：评论

学校读者我要写书评

暂无评论

Ghost edge imaging with untrained neural networks

引用

Optics and Laser Technology 2025年 191卷

作者： Yang, Yao Zhao, Zhiyan Wang, Le Zhao, Shengmei School of Communications and Information Engineering Nanjing University of Posts and Telecommunications (NUPT) Nanjing 210003 China Portland Institute Nanjing University of Posts and Telecommunications (NUPT) Nanjing 210023 China

Edge detection based on ghost imaging technology can directly capture the edge details of a target without acquiring the entire image of the object. In this paper, we propose a method of ghost edge imaging based on untrained neural network. The method initially generates a set of shifted random binary speckle patterns, then illuminates the object to obtain eight sets of detection values. These eight sets are recombined into two sets of detection values, which respectively contain horizontal and vertical edge information and are fed into a manually designed, pre-training-free neural network for processing to yield sharper edges. We implemented the proposed method through simulations and experiments, demonstrating its ability to successfully recover the edges of target objects at lower compression ratios than traditional methods. This method outperforms some widely used edge detection methods based on ghost imaging in terms of signal-to-noise ratio. The neural network used in this method does not require pre-training and exhibits good generalization capability. © 2025 Elsevier Ltd

关键词： Edge detection Ghost imaging Single pixel imaging Untrained neural network

来源：评论

学校读者我要写书评

暂无评论

All You Need is RAW: Defending Against Adversarial Attacks with Camera image Pipelines

arXiv

引用

arXiv 2021年

作者： Zhang, Yuxuan Dong, Bo Heide, Felix Princeton University United States

Existing neural networks for computer vision tasks are vulnerable to adversarial attacks: adding imperceptible perturbations to the input images can fool these models to make a false prediction on an image that was correctly predicted without the perturbation. Various defense methods have proposed image-to-image mapping methods, either including these perturbations in the training process or removing them in a preprocessing step. In doing so, existing methods often ignore that the natural RGB images in today’s datasets are not captured but, in fact, recovered from RAW color filter array captures that are subject to various degradations in the capture. In this work, we exploit this RAW data distribution as an empirical prior for adversarial defense. Specifically, we proposed a model-agnostic adversarial defensive method, which maps the input RGB images to Bayer RAW space and back to output RGB using a learned camera image signal processing (ISP) pipeline to eliminate potential adversarial patterns. The proposed method acts as an off-the-shelf preprocessing module and, unlike model-specific adversarial training methods, does not require adversarial images to train. As a result, the method generalizes to unseen tasks without additional retraining. Experiments on large-scale datasets (e.g., imageNet, COCO) for different vision tasks (e.g., classification, semantic segmentation, object detection) validate that the method significantly outperforms existing methods across task domains. © 2021, CC BY.

关键词： Cameras

来源：评论

学校读者我要写书评

暂无评论

XRL: Explainable Reinforcement Learning for AI Autonomy

XRL: Explainable Reinforcement Learning for AI Autonomy

引用

作者： Kolter, J. Z Ravikumar, P Carnegie Mellon University Pittsburgh PA AIR FORCE RESEARCH LAB ROME NY Air Force Research Laboratory - Information Directorate Rome NY

Understanding the decision of AI classifiers is fundamental to the reliable and robust application of ML methods across a wide variety of domains and end-uses. This report describes work on a specific area of interest conducted under the CMU XAI program, that of detecting and understanding the ability of adversaries to intentionally poison pre-trained classifiers with malicious triggers that allow them full control over the practical use of such systems. We show that by exploiting our developed XAI techniques, it is possible to reliably detect and avoid the use of such classifiers, or indeed to create triggers that are equally capable of breaking the systems. In addition, we present a broader survey of several different approaches to XAI methods, well beyond the scope of the classifier poisoning work, which was additionally developed throughout the course of the program.

关键词： neural networks Information systems Machine learning Information processing Artificial intelligence Information science Kernel functions Artificial intelligence software Data sets Dimensionality reduction Game theory image processing Operations research Probability signal processing Training Algorithms

来源：评论

学校读者我要写书评

暂无评论

S²-aware network for visual recognition

引用

signal processing-image COMMUNICATION 2021年 99卷 116458-116458页

作者： Zhao, Wenyi Yang, Huihua Pan, Xipeng Li, Lingqiao Beijing Univ Posts & Telecommun Sch Artificial Intelligence Beijing Peoples R China Guilin Univ Elect Technol Sch Comp Sci & Informat Secur Guilin Peoples R China

Capturing the comprehensive information of various sizes and shapes of images in the same convolution layer is typically a challenging task in computer vision. There are two main kinds of methods for capturing those features. The first uses the inception structure and its variants. The second utilizes larger convolution kernels on specific layers or stacks with more convolution blocks. However, these methods can result in computationally intensive or vanishing gradients. In this paper, to accommodate feature distributions with different sizes, shapes and reduce computational cost, we propose a width-and depth-aware module named the WD-module to match feature distributions. Moreover, the proposed WD-module consumes less computational cost and parameters compared with traditional residual convolution layers. To verify the effectiveness of our proposed method, a size-and shape-aware backbone network named S(2)A-Net was built, which was obtained by stacking the WD-modules. By visualizing heat maps and features, the proposed S(2)A-Net can adapt to objects with different sizes and shapes in visual recognition tasks and learn more comprehensive characteristics. Experimental results show that the proposed method has higher accuracy in image recognition and outperforms other state-of-the-art networks with the same numbers of layers.

关键词： Convolution neural network Size aware Shape aware Light weight

来源：评论

学校读者我要写书评

暂无评论

多维感知-空间解耦单样本人体动作识别模型

引用

信号处理 2025年第4期41卷 683-693页

作者：胡正平王雨露张琦明许凌峰陈代萍燕山大学信息科学与工程学院河北秦皇岛066004 燕山大学河北省信息传输与信号处理重点实验室河北秦皇岛066004

基于骨骼数据的人体动作识别方法因其能够消除与动作无关的视觉信息来降低训练复杂性越来越受到人们关注,然而大规模骨骼动作数据收集和注释面临挑战,基于骨骼的单样本动作识别旨在仅用单个训练样本识别人体动作,可以使机器人对新颖动... 详细信息

基于骨骼数据的人体动作识别方法因其能够消除与动作无关的视觉信息来降低训练复杂性越来越受到人们关注,然而大规模骨骼动作数据收集和注释面临挑战,基于骨骼的单样本动作识别旨在仅用单个训练样本识别人体动作,可以使机器人对新颖动作类别积极反应改善人机交互。针对基于卷积神经网络编码器进行人类活动分类数据稀缺问题,考虑将单样本动作识别问题表述为骨骼序列紧凑表示和深度度量学习范式,基于自注意力Transformer机制和空间解耦约束重新审视骨骼动力学图像建模向新颖活动类别传输,提出多维感知-空间解耦单样本人体动作识别模型。首先,将3D骨骼序列坐标映射为紧凑图像表示;其次,基于骨干网络将输入投影到低维特征空间,提取初级动作特征;接着,设计融合多层感知机与Transformer的嵌入编码器,在嵌入空间中捕捉关节时间空间依赖关系,增强模型对时空信息感知能力,得到高层次多维嵌入特征;然后,基于最近邻搜索完成样本间相似性度量;最后,结合多相似性损失、三元组边界损失、交叉熵损失和空间解耦损失的混合深度度量学习优化模型。实验在公共大规模数据集NTU RGB+D 120上进行评估,提出方法较Skeleton-DML提高3.8%,在使用40个训练类别时较Skeleton-DML提高7.5%。研究表明,提出方法能够在数据稀缺情况下充分利用骨骼序列紧凑表示信息,提高单样本动作识别匹配精度。

关键词：动作识别单样本学习度量学习 Transformer 空间解耦

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：