检索结果-内蒙古大学图书馆

arXiv 2022年

作者： Lu, Haoyu Zhou, Qiongyi Fei, Nanyi Lu, Zhiwu Ding, Mingyu Wen, Jingyuan Du, Changde Zhao, Xin Sun, Hao He, Huiguang Wen, Ji-Rong Gaoling School of Artificial Intelligence Renmin University of China Beijing100872 China Beijing Key Laboratory of Big Data Management and Analysis Methods Beijing100872 China Research Center for Brain-inspired Intelligence National Laboratory of Pattern Recognition Institute of Automation Chinese Academy of Sciences Beijing100190 China School of Artificial Intelligence University of Chinese Academy of Sciences Beijing100049 China School of Information Renmin University of China Beijing100872 China The University of Hong Kong Pokfulam Hong Kong Beijing Academy of Artificial Intelligence Beijing China

Multimodal learning, especially large-scale multimodal pre-training, has developed rapidly over the past few years and led to the greatest advances in artificial intelligence (AI). Despite its effectiveness, understanding the underlying mechanism of multimodal pre-training models still remains a grand challenge. Revealing the explainability of such models is likely to enable breakthroughs of novel learning paradigms in the AI field. To this end, given the multimodal nature of the human brain, we propose to explore the explainability of multimodal learning models with the aid of non-invasive brain imaging technologies such as functional magnetic resonance imaging (fMRI). Concretely, we first present a newly-designed multimodal foundation model pre-trained on 15 million image-text pairs, which has shown strong multimodal understanding and generalization abilities in a variety of cognitive downstream tasks. Further, from the perspective of neural encoding (based on our foundation model), we find that both visual and lingual encoders trained multimodally are more brain-like compared with unimodal ones. Particularly, we identify a number of brain regions where multimodally-trained encoders demonstrate better neural encoding performance. This is consistent with the findings in existing studies on exploring brain multi-sensory integration. Therefore, we believe that multimodal foundation models are more suitable tools for neuroscientists to study the multimodal signal processing mechanisms in the human brain. Our findings also demonstrate the potential of multimodal foundation models as ideal computational simulators to promote both AI-for-brain and brain-for-AI research. Copyright © 2022, The Authors. All rights reserved.

关键词： Brain

来源：评论

学校读者我要写书评

暂无评论

Mathematical mixed-integer programming for solving a new optimization model of selective image restoration: modelling and resolution by CHN and GA

引用

CIRCUITS SYSTEMS AND signal processing 2019年第5期38卷 2072-2096页

作者： Joudar, Nour-eddine Ettaouil, Mohamed Univ Sidi Mohamed Ben Abdellah Fez Fac Sci & Technol Dept Math Fes Morocco

In grey-level image restoration, a prior knowledge of degraded areas allows, thanks to the selective filtering, to achieve a good protection of the image features. In this paper, we propose a quadratic programming-based technique that deals with the issue of details preservation during the restoration process. Based on the classical model of image restoration, we build a modified model by introducing a set of binary variables that indicate the pixel categories. We combine each pixel with the median of its neighbours in a decision rule so that one of them generates the optimal solution. The obtained model is a nonlinear mixed-integer problem where resolution by exact methods is not feasible. In this regard, we use both of the continuous Hopfield neural network and the genetic algorithm to solve the suggested model. Performance of our method is demonstrated numerically and visually by several computational tests.

关键词： image restoration Optimization Selective image restoration Median Continuous Hopfield neural network Penalty function Quadratic programming Genetic algorithm

来源：评论

学校读者我要写书评

暂无评论

Variational probabilistic generative framework for single image super-resolution

引用

signal processing 2019年 156卷 92-105页

作者： Wang, Zhengjue Chen, Bo Zhang, Hao Liu, Hongwei Xidian Univ Natl Lab Radar Signal Proc Xian 710071 Shaanxi Peoples R China Xidian Univ Collaborat Innovat Ctr Informat Sensing & Underst Xian 710071 Shaanxi Peoples R China

In this paper, a general variational probabilistic generative framework parameterized by deep networks is proposed for single image super-resolution, which assembles the advantages of coding-based methods and regression-based methods. We use probabilistic generative networks to model the joint full likelihood of a pair of low-resolution (LR) and high-resolution (HR) patches which are generated from a shared latent representation. An inference model is applied to infer the stochastic distribution of the latent representation. By jointly optimizing the generative and inference models, a regression process to the distribution of the HR patch is implied during the learning phase, which provides an efficient forward mapping to accomplish the super-resolution task. We use our framework as a guidance and develop a new model called PGM-CP, with the help of an informative conditional prior and a consistent recognition model. We likewise show how three existing popular example-based SR methods can be "reinvented" under our framework. The effectiveness and efficiency of the proposed method is examined based on three public datasets. Experimental results demonstrate that our model is competitive with state-of-the-art approaches, especially when the image is corrupted by noise. (C) 2018 Elsevier B.V. All rights reserved.

关键词： Probabilistic generative model image super-resolution Conditional prior Recognition model

来源：评论

学校读者我要写书评

暂无评论

End-to-End Conditional GAN-based Architectures for image Colourisation 21

End-to-End Conditional GAN-based Architectures for Image Col...

引用

IEEE 21st International Workshop on Multimedia signal processing (MMSP)

作者： Blanch, Marc Gorriz Mrak, Marta Smeaton, Alan F. O'Connor, Noel E. BBC Res & Dev London England Dublin City Univ Dublin Ireland

ISBN: (纸本)9781728118178

In this work recent advances in conditional adversarial networks are investigated to develop an end-to-end architecture based on Convolutional neural Networks (CNNs) to directly map realistic colours to an input greyscale image. Observing that existing colourisation methods sometimes exhibit a lack of colourfulness, this paper proposes a method to improve colourisation results. In particular, the method uses Generative Adversarial neural Networks (GANs) and focuses on improvement of training stability to enable better generalisation in large multi-class image datasets. Additionally, the integration of instance and batch normalisation layers in both generator and discriminator is introduced to the popular U-Net architecture, boosting the network capabilities to generalise the style changes of the content. The method has been tested using the ILSVRC 2012 dataset, achieving improved automatic colourisation results compared to other methods based on GANs.

关键词： Colourisation Conditional GANs CNNs

来源：评论

学校读者我要写书评

暂无评论

基于红色通道注意力机制的水下图像增强

引用

数字海洋与水下攻防 2023年第1期6卷 48-55页

作者：王浩瀚王国栋张镡月董浩青岛大学计算机科学技术学院

水下图像增强因其在海洋勘测和水下机器人中的重要意义而备受关注。在过去的几年中，已经提出了许多水下图像增强算法。已有的深度学习方法由于忽略水下图像的预处理过程和对红色通道信息的增强或者弱化了这个过程，导致增强结果并不显... 详细信息

水下图像增强因其在海洋勘测和水下机器人中的重要意义而备受关注。在过去的几年中，已经提出了许多水下图像增强算法。已有的深度学习方法由于忽略水下图像的预处理过程和对红色通道信息的增强或者弱化了这个过程，导致增强结果并不显著，其往往只适应特定的场景，缺乏泛化能力。为此，基于卷积神经网络建立了一种全新的水下图像增强算法，为了充分利用特征图的通道信息，在相同维度的特征图之间采用不同尺寸的卷积核获取更多通道数目的特征。然后，基于红色通道构建了注意力机制，以加强对于图像中容易丢失信息的红色通道的特征提取。最后，在EUVP,UFO120数据集做了消融实验，证明了红色通道注意力机制的有效性。通过对对比实验的增强结果进行各项指标分析，证明增强结果有着更高的结构相似性和峰值信噪比，并且在无参考指标方面有着更高的颜色平衡、清晰度以及对比度，综合性能优于以往的方法。

关键词：图像处理水下图像增强卷积神经网络

来源：评论

学校读者我要写书评

暂无评论

A Deep Learning Pipeline for Identification of Motor Units in Musculoskeletal Ultrasound

引用

IEEE ACCESS 2020年 8卷 170595-170608页

作者： Ali, Hazrat Umander, Johannes Rohlen, Robin Gronlund, Christer Umea Univ Dept Radiat Sci S-90187 Umea Sweden

Skeletal muscles are functionally regulated by populations of so-called motor units (MUs). An MU comprises a bundle of muscle fibers controlled by a neuron from the spinal cord. Current methods to diagnose neuromuscular diseases and monitor rehabilitation, and study sports sciences rely on recording and analyzing the bio-electric activity of the MUs. However, these methods provide information from a limited part of a muscle. Ultrasound imaging provides information from a large part of the muscle. It has recently been shown that ultrafast ultrasound imaging can be used to record and analyze the mechanical response of individual MUs using blind source separation. In this work, we present an alternative method - a deep learning pipeline - to identify active MUs in ultrasound image sequences, including segmentation of their territories and signal estimation of their mechanical responses (twitch train). We train and evaluate the model using simulated data mimicking the complex activation pattern of tens of activated MUs with overlapping territories and partially synchronized activation patterns. Using a slow fusion approach (based on 3D CNNs), we transform the spatiotemporal image sequence data to 2D representations and apply a deep neural network architecture for segmentation. Next, we employ a second deep neural network architecture for signal estimation. The results show that the proposed pipeline can effectively identify individual MUs, estimate their territories, and estimate their twitch train signal at low contraction forces. The framework can retain spatio-temporal consistencies and information of the mechanical response of MU activity even when the ultrasound image sequences are transformed into a 2D representation for compatibility with more traditional computer vision and image processing techniques. The proposed pipeline is potentially useful to identify simultaneously active MUs in whole muscles in ultrasound image sequences of voluntary skeletal muscle cont

关键词： Machine learning image segmentation image sequences Ultrasonic imaging Muscles Pipelines Convolution Motor unit decomposition ultrafast ultrasound medical imaging deep learning mechanical response neural networks recurrent neural networks

来源：评论

学校读者我要写书评

暂无评论

JOINT DEMOSAICKING AND BLIND DEBLURRING USING DEEP CONVOLUTIONAL neural NETWORK 26

JOINT DEMOSAICKING AND BLIND DEBLURRING USING DEEP CONVOLUTI...

引用

26th IEEE International Conference on image processing (ICIP)

作者： Chi, Zhixiang Shu, Xiao Wu, Xiaolin McMaster Univ Hamilton ON Canada Shanghai Jiao Tong Univ Shanghai Peoples R China

ISBN: (纸本)9781538662496

Despite extensive research efforts, blind image deblurring remains a challenge without general robust solutions. A longoverlooked problem of existing deblurring methods is that they are all designed to work on fully sampled RGB input images for simplicity. But, in practice, most RGB color images are reconstructed from Bayer mosaic data hence riddled with various high-frequency demosaicking artifacts, such as zippering and moir ' e patterns, which can easily derail a deblurring algorithm. In this paper, we propose a novel multiscale deep convolutional neural network to solve demosaicking and deblurring jointly. By processing Bayer raw images directly, our method is free of the interference of demosaicking artifacts. Extensive experiments show that the joint approach greatly outperforms the simple cascade of state-of-art demosaicking and deblurring methods.

关键词： Blind deblurring demosaicking multiscale recursive neural network

来源：评论

学校读者我要写书评

暂无评论

From CNNs to Adaptive Filter Design for Digital image Denoising Using Reinforcement Q-Learning

From CNNs to Adaptive Filter Design for Digital Image Denois...

引用

IEEE Southeastcon

作者： Muhammad Alolaiwy Murat Tanik Leon Jololian The University of Alabama at Birmingham Birmingham USA

Multi-modal image acquisition techniques have allowed digital images to penetrate domains from micro-scale medical imaging to mega-scale satellite imaging. For postprocessing, deep learning techniques have widely been used for image denoising and artifact suppression. However, far little work has been done to summarize their effectiveness concerning adaptive filter design, e.g., salt and pepper noise, stochastic Poisson, or additive white noise. Because different images, natural or urban, structured or unstructured scenes, and objects produce different types of noise, from the modality as well as from the imaging medium, devising a single method for all noise types is impractical. This paper proposes to use reinforcement learning (Q-learning) to adaptively design filters of a convolutional neural network (CNN). In contrast to the popular state of the art methods that use filter designs based on the noise model, CNN filters lack the power to do so. We have attempted to address this limitation of CNN by introducing a new modality of reinforcement learning for adaptive filter design. The qualitative and quantitative analysis of the proposed method is done and its efficacy is demonstrated using the following evaluation metrics: Peak signal to Noise Ratio (PSNR), Contrast to Noise Ratio (CNR), and Structure Similarity Index (SSIM).

关键词： Adaptation models Satellites PSNR Digital images Noise reduction Adaptive filters Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

Retinal blood vessel segmentation using pixel-based feature vector

引用

BIOMEDICAL signal processing AND CONTROL 2021年第0期70卷 103053-103053页

作者： Toptas, Buket Hanbay, Davut Bandirma Onyedi Eylul Univ Engn & Nat Sci Fac Comp Engn Dept Balikesir Turkey Inonu Univ Engn Fac Comp Engn Dept TR-44280 Malatya Turkey

A lot of important disease information can be accessed by performing retinal blood vessel analysis on fundus images. Diabetic retinopathy is one of the diseases understood by retinal blood vessel analysis. If this disease is detected at an early stage, vision loss can be prevented. In this paper, a method that performs retinal blood vessel analysis with classical methods is proposed. In this proposed system, pixel-based feature extraction is performed. Five different feature groups are used for feature extraction. These feature groups are edge detection, morphological, statistical, gradient, and Hessian matrix. An 18-D feature vector is created for each pixel. This feature vector is given to the artificial neural network for training. Using test images, the system is tested on two publicly available datasets. Sensitivity, Specificity, and Accuracy performance measures were used as success measures. The similarity index between the segmented image and the ground truth is measure using Dice and Jaccard. The accuracy of the system was measured as 96.18% for DRIVE and 94.56% for STARE, respectively. Experimental results show that the proposed algorithm achieves satisfactory results. This method can be used as an automated retinal blood vessel segmenting system.

关键词： Biomedical imaging Retinal blood vessel segmentation image segmentation Feature extraction

来源：评论

学校读者我要写书评

暂无评论

Discriminative Color Space Learning for Face Anti-Spoofing via Convolutional neural Networks 20

Discriminative Color Space Learning for Face Anti-Spoofing v...

引用

5th International Conference on Biomedical signal and image processing, ICBIP 2020

作者： Ren, Yifeng Huang, Dong Sun, Lei Liu, Zhe Li, Qingyan Department of Electronical and Information Engineering Xijing University Shaanxi Xi'an China School of Electronics and Information Northwestern Polytechnical University Shaanxi Xi'an China Beijing Microelectronics Technology Institute Beijing China

ISBN: (纸本)9781450387767

Face spoofing detection is gaining an increasing attention in the biometric research. Various approaches have been proposed in the literatures. In these methods, the color variation of facial regions, caused by the defect of medium of fake face, is a vitally important clue. The traditional color spaces (e.g. RGB, HSV and YCbCr) are used in many spoofing detection approaches, however, it is not very discriminative to distinguish real and fake faces in these existing color spaces. So, in this paper, we propose a novel method to learn a new color space, which is suitable for face anti-spoofing and can be discriminative between real and fake faces. Different from other color learning methods, our novel method is based on convolutional neural networks and can nonlinearly project the real and fake face images into a distinguishable color s-pace. Extensive experiments are conducted on two publicly available databases, showing very interesting performance compared to other existing color spaces and state-of-the-art. © 2020 ACM.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：