检索结果-内蒙古大学图书馆

2024 International Conference on image processing

作者： Blanchard, Martin Delezay, Olivier Ducottet, Christophe Muselet, Damien Univ Jean Monnet St Etienne CNRS Inst Optique Grad Sch Lab Hubert CurienUMR 5516 St Etienne France Univ Jean Monnet Lab Sainboise INSERM UMR 1059 St Etienne France

ISBN: (纸本)9798350349405;9798350349399

Deep learning for automated cell imaging analysis has become a tool of choice to process large amounts of data. But many of these methods lack explainability, slowing down their deployment for tasks such as diagnosis. We present a prototype-based framework to analyze structural changes which addresses the specific challenges of explainability in the context of cell imaging. Our method relies on classification between two distinct cell populations in a weakly supervised context where no label for individual cells is available. Our model extracts typical features from each population, representing intra-cellular structure, and provides an explanation on its classification decision by creating visualization of the local textures corresponding to the structures of interest. We show a real application where it effectively highlights a change in the organization of the actin content of the cells.

关键词： Visual Feature Learning Biocellular Imaging Explainable AI Deep neural Networks

来源：评论

学校读者我要写书评

暂无评论

TARGET OPTIMIZATION DIRECTION GUIDED TRANSFER LEARNING FOR image CLASSIFICATION 49

TARGET OPTIMIZATION DIRECTION GUIDED TRANSFER LEARNING FOR I...

引用

49th IEEE International Conference on Acoustics, Speech, and signal processing (ICASSP)

作者： Han, Kelvin Ting Zuo Zhang, Shengxuming Freixas, Gerard Marcos Feng, Zunlei Jin, Cheng Fudan Univ Sch Comp Sci Shanghai Peoples R China Zhejiang Univ Coll Comp Sci & Technol Hangzhou Peoples R China MCT Innovat Ctr Callig & Painting Creat Technol Beijing Peoples R China

ISBN: (纸本)9798350344868;9798350344851

At present, deep learning has made impressive achievements in various fields;however, effectively training deep neural networks on small data sets remains a significant challenge. Transfer learning, as a method of efficient training across multiple tasks, has been widely used to solve this problem. However, when the domain gap or the data volume difference between the two tasks is too large, the transfer learning may not perform well, and other optimization methods will be required to improve the performance. In this paper, we propose a new transfer learning method guided by the direction of objective optimization from the perspective of gradient. This method guides the gradient direction of the source task towards the gradient direction of the target task. In several similar and conflicting tasks, this method has achieved good results in efficiency and performance. In comparison with other transfer learning methods, the results shown by this method are generally better.

关键词： Transfer learning Deep learning GradMF Gradient projection

来源：评论

学校读者我要写书评

暂无评论

stochastic Super-Resolution For Gaussian Textures 48

Stochastic Super-Resolution For Gaussian Textures

引用

48th IEEE International Conference on Acoustics, Speech and signal processing, ICASSP 2023

作者： Pierret, Émile Galerne, Bruno Université de Tours Cnrs Institut Denis Poisson Université d'Orléans France France

ISBN: (纸本)9781728163277

Super-resolution (SR) is an ill-posed inverse problem which consists in proposing high-resolution images consistent with a given low-resolution one. While most SR algorithms are deterministic, stochastic SR deals with designing a stochastic sampler generating any realistic SR solution. The goal of this paper is to show that stochastic SR is a well-posed and solvable problem when restricting to Gaussian stationary textures. Using Gaussian conditional sampling and exploiting the stationarity assumption, we propose an efficient algorithm based on fast Fourier transform. We also demonstrate the practical relevance of the approach for SR with a reference image. Although limited to stationary microtextures, our approach compares favorably in terms of speed and visual quality to some state of the art methods designed for a larger class of images. © 2023 IEEE.

关键词： Kriging

来源：评论

学校读者我要写书评

暂无评论

ADAPID: AN ADAPTIVE PID OPTIMIZER FOR TRAINING DEEP neural NETWORKS 47

ADAPID: AN ADAPTIVE PID OPTIMIZER FOR TRAINING DEEP NEURAL N...

引用

47th IEEE International Conference on Acoustics, Speech and signal processing (ICASSP)

作者： Weng, Boxi Sun, Jian Sadeghi, Alireza Wang, Gang Beijing Inst Technol Sch Automat Beijing 100081 Peoples R China Chongqing Innovat Ctr Beijing Inst Technol Chongqing 401120 Peoples R China Univ Minnesota Dept Elect & Comp Engn Minneapolis MN 55455 USA

ISBN: (纸本)9781665405409

Deep neural networks (DNNs) have well-documented merits in learning nonlinear functions in high-dimensional spaces. stochastic gradient descent (SGD)-type optimization algorithms are the 'workhorse' for training DNNs. Nonetheless, such algorithms often suffer from slow convergence, sizable fluctuations, and abundant local solutions, to name a few. In this context, the present paper draws ideas from adaptive control of dynamical systems, and develops an adaptive proportional-integral-derivative (AdaPID) solver for fast, stable, and effective training of DNNs. AdaPID relies on second-order moment estimates of gradients to adaptively adjust the PID coefficients. Numerical tests corroborate the merits of AdaPID on several tasks such as image generation using generative adversarial networks (GANs) and image classification using convolutional neural networks (CNNs) as well as long-short term memories (LSTMs).

关键词： Deep neural network PID control adaptive control stochastic optimization adaptive learning rate

来源：评论

学校读者我要写书评

暂无评论

Learning active contour models based on self-attention for breast ultrasound image segmentation

引用

BIOMEDICAL signal processing AND CONTROL 2024年 89卷

作者： Zhao, Yu Shen, Xiaoyan Chen, Jiadong Qian, Wei Sang, Liang Ma, He Northeastern Univ Coll Med & Biol Informat Engn Shenyang 110819 Liaoning Peoples R China Dongguan Univ Technol Sch Life & Hlth Technol Dongguan 523808 Guangdong Peoples R China China Med Univ Hosp 1 Dept Ultrasound Shenyang 110002 Liaoning Peoples R China Northeastern Univ Key Lab Intelligent Comp Med Image Minist Educ Shenyang 110819 Liaoning Peoples R China

Computer-aided diagnosis (CAD) systems based on ultrasound have been developed and widely promoted in breast cancer screening. Due to the characteristics of low contrast and speckle noises, breast ultrasound image segmentation, one of the crucial steps of CAD systems, has always been challenging. Recently, the emerging Transformer-based medical segmentation methods, which have a better ability to model long dependencies than convolutional neural networks (CNNs), have shown significant value for medical image segmentation. However, due to the limited data with the high-quality label, Transformer performs weakly on breast ultrasound image segmentation without pretraining. Thus, we propose the Attention-Gate Medical Transformer (AGMT) for small breast ultrasound datasets, which introduces the attention-gate (AG) module to suppress background information and the average radial derivative increment (Delta ARD) loss function to enhance shape information. We evaluate the AGMT on both a private dataset A and a public dataset B. On dataset A, the AGMT outperforms MT on the metrics of true positive ratio, jaccard index (JI) and dice similarity coefficient (DSC) by 6.4%, 2.3% and 1.9%, respectively. Meanwhile, when compared with UNet, the AGMT improves JI and DSC by 5.3% and 4.9%, respectively. The results show performance has significantly improved compared with mainstream models. In addition, we also conduct ablation experiments on the AG module and Delta ARD, which prove their effectiveness.

关键词： Breast ultrasound image segmentation Transformer Loss function Average radial derivative

来源：评论

学校读者我要写书评

暂无评论

Noise-robust registration of microscopic height data using convolutional neural networks

Noise-robust registration of microscopic height data using c...

引用

SPIE Conference on Future Sensing Technologies

作者： Siemens, Stefan Kaestner, Markus Reithmeier, Eduard Leibniz Univ Hannover Inst Measurement & Automat Control Univ 1 D-30823 Hannover Germany

ISBN: (纸本)9781510657229

In this work, a deep convolutional neural network is proposed to improve the registration of microtopographic data. For this purpose, different mechanical surfaces were optically measured using a confocal laser scanning microscope. A wide range of surfaces with different materials, processing methods, and topographic properties, such as isotropy and anisotropy or stochastic and deterministic features, are included. Training and testing datasets with known homographies are generated from these measurements by cropping a fixed and moving image patch from each topography and then randomly perturbing the latter. A pseudo-siamese network architecture based on the VGG Net is then used to predict these homographies. The network is trained with a supervised learning approach where the Euclidean distance between the predicted and the ground truth gives the loss function. The 4-point homography parameterization is used to improve the loss convergence. Furthermore, different amounts of image noise are added to enhance the prediction's robustness and prevent overfitting. The effectiveness of the proposed method is evaluated through different experiments. First, the network performance is compared to intensity-based and feature-based conventional registration algorithms regarding the resulting error, the noise-robustness, and the processing speed. In addition, images from the Microsoft Common Objects in Context (COCO) dataset are used to verify the network's generalization capability to new image types and contents. The results show that the learning-based approach offers much higher robustness regarding image noise and a much lower processing time. In contrast, conventional algorithms have a smaller registration error without image noise.

关键词： image registration Homography estimation Surface metrology Machine learning Convolutional neural Network Confocal Laser Scanning Microscopy

来源：评论

学校读者我要写书评

暂无评论

3D FACE RECONSTRUCTION BASED ON WEAKLY-SUPERVISED LEARNING MORPHABLE FACE MODEL 30

3D FACE RECONSTRUCTION BASED ON WEAKLY-SUPERVISED LEARNING M...

引用

30th IEEE International Conference on image processing (ICIP)

作者： Liang, Kai-Wen Li, Pin-Hsuan Lo, Chung-Hsun Wang, Chien-Yao Chen, Yung-Fang Wang, Jia-Ching Chang, Pao-Chi Natl Cent Univ Dept Commun Engn Taoyuan Taiwan Natl Cent Univ Dept Comp Sci & Informat Engn Taoyuan Taiwan Acad Sinica Inst Informat Sci Taipei Taiwan

ISBN: (纸本)9781728198354

In this paper, we propose a system for 3D face model reconstruction. Earlier studies on reconstruction methods included the software modeling methods or the instrument scanning modeling methods. But both of the above methods require a lot of development resources and time costs. Therefore, we develop a reconstruction system using a weakly supervised approach combining Convolutional neural Networks (CNN) and 3D Morphable Face Models (3DMM). Given a sufficient number of 2D face images to train and learn the main features of the face, our system is capable of rapidly constructing 3D face models. The proposed method enhances the efficiency of preprocessing and improves the performance of loss function through image depth feature extraction and regression coefficients. Using two datasets for model evaluation and analysis, this study efficiently reconstructs faces without ground-truth labels.

关键词： 3D Face Reconstruction 3D Morphable Face Model Deep Learning Convolutional neural Network

来源：评论

学校读者我要写书评

暂无评论

3D POSE ESTIMATION FROM MONOCULAR VIDEO WITH CAMERA-BONE ANGLE REGULARIZATION ON THE image FEATURE 49

3D POSE ESTIMATION FROM MONOCULAR VIDEO WITH CAMERA-BONE ANG...

引用

49th IEEE International Conference on Acoustics, Speech, and signal processing (ICASSP)

作者： Ishii, Asuka Ikeda, Hiroo NEC Corp Ltd Tokyo Japan

ISBN: (纸本)9798350344868;9798350344851

In this paper, we propose a monocular 3D pose estimation method which explicitly takes into account the angles between the camera optical axis and bones (camera-bone angles) as well as temporal information. The proposed method combines a 2D-to-3D-based method, which predicts a 3D pose from a sequence of 2D poses, and convolutional neural network (CNN) and includes novel regularization loss to enable the CNN to extract camera-bone-angle information. The camera-bone-angle and temporal information suppress ambiguity of 2D-to-3D-based methods where the same 2D pose can be mapped to multiple 3D poses. Experiments on the Human3.6M and MPI-INF-3DHP datasets showed that the proposed method improved the performance by 5.1 mm and 2.1 mm in terms of mean per joint position error (MPJPE) respectively.

关键词： 3D pose estimation pose estimation monocular regularization

来源：评论

学校读者我要写书评

暂无评论

Powerful Lossy Compression for Noisy images

Powerful Lossy Compression for Noisy Images

引用

IEEE International Conference on Multimedia and Expo (ICME)

作者： Cai, Shilv Liang, Xiaoguo Cao, Shuning Yan, Luxin Zhong, Sheng Chen, Liqun Zu, Xu Huazhong Univ Sci & Technol Wuhan Peoples R China Natl Key Lab Multispectral Informat Intelligent P Beijing Peoples R China

ISBN: (纸本)9798350390155;9798350390162

image compression and denoising represent fundamental challenges in image processing with many real-world applications. To address practical demands, current solutions can be categorized into two main strategies: 1) sequential method;and 2) joint method. However, sequential methods have the disadvantage of error accumulation as there is information loss between multiple individual models. Recently, the academic community began to make some attempts to tackle this problem through end-to-end joint methods. Most of them ignore that different regions of noisy images have different characteristics. To solve these problems, in this paper, our proposed signal-to-noise ratio (SNR) aware joint solution exploits local and non-local features for image compression and denoising simultaneously. We design an end-to-end trainable network, which includes the main encoder branch, the guidance branch, and the signal-to-noise ratio (SNR) aware branch. We conducted extensive experiments on both synthetic and real-world datasets, demonstrating that our joint solution outperforms existing state-of-the-art methods.

关键词： Joint Solution image Compression image Denoising neural Networks

来源：评论

学校读者我要写书评

暂无评论

Deep Learning Network Optimization Combining 3D Imaging and Multidimensional signal processing 5th

Deep Learning Network Optimization Combining 3D Imaging and ...

引用

5th International Conference on 3D Imaging Technologies—Multidimensional signal processing and Deep Learning, 3DIT-MSP and DL 2023

作者： Hou, Juncheng Yang, Diansheng Chen, Wei Shanghai Yuansi Standard Science and Technology Co. Ltd. Shanghai China Shaoguan University Zhenjiang District Shaoguan China Shanghai Research Institute of Criminal Science and Technology Shanghai200083 China

ISBN: (纸本)9789819751808

This research aims to optimize the deep learning network by combining three-dimensional imaging technology and multidimensional signal processing methods to improve the processing capabilities of complex three-dimensional data. We propose a model based on 3D convolutional neural network (CNN), which is structurally and functionally optimized specifically for 3D image data. In the model design, we introduced an efficient feature extraction mechanism and an improved network training strategy, including batch normalization and regularization techniques, to improve the model's generalization ability and training efficiency. Experimental results show that this model exhibits higher accuracy and robustness than traditional two-dimensional CNN and multi-layer perceptron (MLP) models when processing three-dimensional imaging data. Furthermore, we provide an in-depth analysis of the model's performance and discuss its potential issues and limitations in practical applications. Overall, this study provides an effective deep learning solution for 3D image processing and lays the foundation for future research in a wider range of application fields. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：