检索结果-内蒙古大学图书馆

2nd International Conference on Machine Vision, image processing and Imaging Technology, MVIPIT 2024

作者： Cao, Rui Luo, Renze School of Electrical and Information Southwest Petroleum University Chengdu China School of Faculty of Earth Sciences and Technology Southwest Petroleum University Chengdu China

ISBN: (纸本)9798331543037

In order to solve the problems of irregular targets and fuzzy boundaries in bone scintigraphy segmentation, an improved TransUNet model was proposed. The feature extraction part of the encoder is replaced with an asymmetric convolution residual module to enhance feature capture in different directions and avoid gradient vanishing. At the same time, the cross-fusion module is used to replace the jump connection, which strengthens the deep connection between the encoder and the decoder, suppresses redundant information and improves fine-grained feature capture. In addition, the maximization decision-making method in the two-channel output dimension is used to improve the richness of classification information, capture the uncertain region and reduce the influence of class imbalance in the case of fuzzy boundaries, and obtain a segmentation graph..Experimental results show that the improved Transunet segmentation algorithm has improved the Intersection over Union (loU), DSC coefficient (Dice Similarity Cofficient), pixel accuracy (CPA) and recall rate (Recall), reaching 0.498, 0.667, 0.662 and 0.761, respectively, which is better than the current mainstream segmentation algorithms. It has certain practical application value. ©2024 IEEE.

关键词： deep learning

来源：评论

学校读者我要写书评

暂无评论

Deere learning approach for hyperspectral image demosaicking, spectral correction and high-resolution RGB reconstruction

引用

COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION 2022年第4期10卷 409-417页

作者： Li, Peichao Ebner, Michael Noonan, Philip Horgan, Conor Bahl, Anisha Ourselin, Sebastien Shapey, Jonathan Vercauteren, Tom Kings Coll London Sch Biomed Engn & Imaging Sci London England Hypervis Surg Ltd London England Kings Coll Hosp NHS Fdn Trust Dept Neurosurg London England

Hyperspectral imaging is one of the most promising techniques for intraoperative tissue characterisation. Snapshot mosaic cameras, which can capture hyperspectral data in a single exposure, have the potential to make a real-time hyperspectral imaging system for surgical decision-making possible. However, optimal exploitation of the captured data requires solving an ill-posed demosaicking problem and applying additional spectral corrections. In this work, we propose a supervised learning-based image demosaicking algorithm for snapshot hyperspectral images. Due to the lack of publicly available medical images acquired with snapshot mosaic cameras, a synthetic image generation approach is proposed to simulate snapshot images from existing medical image datasets captured by high-resolution, but slow, hyperspectral imaging devices. image reconstruction is achieved using convolutional neural networks for hyperspectral image super-resolution, followed by spectral correction using a sensor-specific calibration matrix. The results are evaluated both quantitatively and qualitatively, showing clear improvements in image quality compared to a baseline demosaicking method using linear interpolation. Moreover, the fast processing time of 45 ms of our algorithm to obtain super-resolved RGB or oxygenation saturation maps per image for a state-of-the-art snapshot mosaic camera demonstrates the potential for its seamless integration into real-time surgical hyperspectral imaging applications.

关键词： Hyperspectral imaging demosaicking deep learning surgical imaging

来源：评论

学校读者我要写书评

暂无评论

SSLCT: A Convolutional Transformer for Synthetic Speech Localization 7

SSLCT: A Convolutional Transformer for Synthetic Speech Loca...

引用

7th IEEE International Conference on Multimedia Information processing and Retrieval (MIPR)

作者： Bhagtani, Kratika Yadav, Amit Kumar Singh Bestagini, Paolo Delp, Edward J. Purdue Univ Video & Image Proc Lab VIPER Sch Elect & Comp Engn W Lafayette IN 47907 USA Politecn Milan Dipartimento Elettron Informaz & Bioingn Milan Italy

ISBN: (纸本)9798350351439;9798350351422

deep learning methods can now generate high quality synthetic speech which is perceptually indistinguishable from real speech. As synthetic speech can be used for nefarious purposes, speech forensics methods to detect fully synthetic speech have been developed. Speech editing tools can also create partially synthetic speech in which only a part of the speech signal is synthetic. Detecting these short synthetic segments within a speech signal requires specialized methods to determine the temporal location of the synthetic speech. In this paper, we propose the Synthetic Speech Localization Convolutional Transformer (SSLCT), a neural network and transformer method for synthetic speech localization. SSLCT can temporally localize synthetic speech segments as small as 20 milliseconds. We demonstrate that SSLCT achieves less than 10% Equal Error Rate (EER), which is an improvement over several existing methods.

关键词： Synthetic speech localization speech forensics deepfake speech PartialSpoof transformer

来源：评论

学校读者我要写书评

暂无评论

PROAUG: PROTOTYPE-BASED AUGMENTATION FOR LONG-TAILED image CLASSIFICATION 49

PROAUG: PROTOTYPE-BASED AUGMENTATION FOR LONG-TAILED IMAGE C...

引用

49th IEEE International Conference on Acoustics, Speech, and Signal processing (ICASSP)

作者： Hong, Yan Zhang, Jianfu Sun, Zhongyi Yan, Ke Ant Grp Hangzhou Peoples R China Shanghai Jiao Tong Univ Shanghai Peoples R China Youtu Lab Tencent Beijing Peoples R China

ISBN: (纸本)9798350344868;9798350344851

real-world data often exhibit long-tailed distributions with heavy class imbalance, which deteriorates the generalization performance of the classifier. To mitigate this problem, we propose a novel Prototype-based Augmentation framework (ProAug) to address the data scarcity issue by augmenting the feature space for tail classes. Our ProAug consists of a prototype construction branch and a dynamic augmentation branch. The prototype-based dictionary is optimized with category-aware margin loss to learn multi-center and discriminative prototypes for each category. In the dynamic augmentation branch, we aim to produce high-quality tail-class features by dynamically composing context-similar prototypes with an attention mechanism. Moreover, to further improve the reliability of prototypes and the quality of augmented features, a meta-update strategy is adopted to calibrate two branches of ProAug to boost performance. Extensive empirical results on CIFAR-LT-10/100, imageNet-LT, and iNaturalist 2018 demonstrate the effectiveness of our method.

关键词： Long-tail classification margin loss prototype meta-update strategy deep learning

来源：评论

学校读者我要写书评

暂无评论

Mitigating Cyberbullying in Social Media: A deep Contextual learning Approach for Severity Level Classification in Textual Data 5

Mitigating Cyberbullying in Social Media: A Deep Contextual ...

引用

5th International Conference on Electronics and Sustainable Communication Systems, ICESC 2024

作者： Agrawal, Prashant Kumar, Awanit Tripathi, Arun Kr. Sangam University Computer Science & Engineering Rajasthan India Sangam University Bhilwara Computer Science & Engineering Rajasthan India Kiet Group of Institution Department of Computer Applications Delhi-NCR Ghaziabad India

ISBN: (纸本)9798350379945

This research introduces a novel approach that integrates deep Contextual learning (DCL), specifically the DCL-256-32 model with an embedding model to accurately classify offense levels within the textual data. The DCL-256-32 model employs a SoftMax function to assign probabilities to distinct severity classes, ranging from critical to negligible. The proposed model incorporates two endpoints: an embedding model for generating semantic representations of input text and a set of pre-trained DCL-256-32 models for predicting offense levels. By averaging these predictions and associating them with humanreadable labels, this study proposes a robust and scalable framework for real-time text analysis. The proposed model demonstrates high performance compared to existing methods, contributing to the advancement of Natural Language processing (NLP) and classification. This research study offers a practical solution for enhancing digital safety and combating online harassment. © 2024 IEEE.

关键词： Contrastive learning

来源：评论

学校读者我要写书评

暂无评论

Quantized spiral-phase-modulation based deep learning for real-time defocusing distance prediction

引用

OPTICS EXPRESS 2022年第15期30卷 26931-26940页

作者： Zhang, Zezheng Chan, Ryan K. Y. Wong, Kenneth K. Y. Univ Hong Kong Dept Elect & Elect Engn Pokfulam Rd Hong Kong Peoples R China Adv Biomed Instrumentat Ctr Shatin Hong Kong Sci Pk Hong Kong Peoples R China

Whole slide imaging (WSI) has become an essential tool in pathological diagnosis, owing to its convenience on remote and collaborative review. However, how to bring the sample at the optimal position in the axial direction and image without defocusing artefacts is still a challenge, as traditional methods are either not universal or time-consuming. Until recently, deep learning has been shown to be effective in the autofocusing task in predicting defocusing distance. Here, we apply quantized spiral phase modulation on the Fourier domain of the captured images before feeding them into a light-weight neural network. It can significantly reduce the average predicting error to be lower than any previous work on an open dataset. Also, the high predicting speed strongly supports it can be applied on an edge device for real-time tasks with limited computational source and memory footprint. (C) 2022 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement

关键词： High numerical aperture optics image processing Neural networks Phase modulation Spiral phase Stochastic gradient descent

来源：评论

学校读者我要写书评

暂无评论

A Robust Large-scale Dataset for Assessing Light Field image Quality

A Robust Large-scale Dataset for Assessing Light Field Image...

引用

9th International Conference on Signal and image processing (ICSIP)

作者： Ma, Jian Liu, Zhuochi Xu, Yaozong Yu, Jiajun Wang, Yang Liu, Yuhao Anhui Univ Sch Internet Hefei Peoples R China

ISBN: (纸本)9798350350920

In recent years, light field imaging has gained significant attention in the scientific community due to its ability to provide a more immersive representation of the 3D world. However, ensuring the quality of light field images is crucial for their subsequent processing and applications. deep learning methods, leveraging neural networks, have shown promising performance in image Quality Assessment (IQA). However, the unique characteristics of light field data pose a challenge for existing IQA methods. To address this challenge, we propose a Robust Large-scale Dataset for Assessing Light Field image Quality, named RLSD, specifically designed for evaluating the quality of light field images. The dataset comprises both real and synthetic scenes, covering a wide range of key low attributes and including three representative distortions: compression, noise, and blur. To obtain subjective evaluations, we adopt the single stimulus continuous quality evaluation (SSCQE) method and compute the Mean Opinion Score (MOS). We performed statistical analysis on the dataset and experimental results indicate that our proposed RLSD dataset includes various common scenes and distortion levels, making it suitable for designing and evaluating LF-IQA algorithms. The dataset is publicly available at the following link: "https://***/s/1kJmx4qsy8ywLPba-HwGCEg" (password: XY28).

关键词： Light field image quality assessment RLSD distortion Single stimulus continuous quality evaluation

来源：评论

学校读者我要写书评

暂无评论

Research on Automobile Intelligent Manufacturing Defect Detection Algorithm Based on deep learning 4

Research on Automobile Intelligent Manufacturing Defect Dete...

引用

4th IEEE International Conference on Data Science and Computer Application, ICDSCA 2024

作者： Lv, Yuanyuan Geng, Xiao Zhang, Xiaorong Dai, Jiarong Shandong Vocational and Technical University of Engineering Jinan Shandong China Jinan Yiheng Technology Co. Ltd Jinan Shandong China

ISBN: (纸本)9798350368239

This research puts forward a deep-learning-centered automotive manufacturing defect detection algorithm. It utilizes the SSD (Single Shot MultiBox Detector) algorithm to realize the efficient detection of surface flaws on automotive components. Initially, the system employs CNN to extract image features and combines multi-scale features for the purpose of strengthening the recognition capability of diverse defects. Subsequently, the model incorporates automatic tagging and data augmentation techniques to enhance the generalization ability of the model with respect to different defect types. This research has designed a range of experimental scenarios and has verified the effectiveness of the algorithm through accurate data analysis. The experimental outcomes indicate that the proposed algorithm has attained a high level in terms of accuracy, recall rate and other metrics. In comparison with traditional detection methods, it exhibits greater robustness and real-time performance. Precisely, in the actual test set, the algorithm has achieved a detection accuracy of 95.6% and a recall rate of 92.8%, which has effectively enhanced the detection efficiency and decreased the false detection rate. The research findings demonstrate that the defect detection algorithm based on deep learning has extensive application prospects in the automotive intelligent manufacturing field, and is anticipated to significantly boost the automation level of the manufacturing process and the reliability of product quality. ©2024 IEEE.

关键词： Smart manufacturing

来源：评论

学校读者我要写书评

暂无评论

Enhanced Pomegranate Grading Using InceptionResNetV2 with Transformer Integration

Enhanced Pomegranate Grading Using InceptionResNetV2 with Tr...

引用

2024 IEEE International Women in Engineering (WIE) Conference on Electrical and Computer Engineering, WIECON-ECE 2024

作者： Kiran, V. Gopi Babu, T. Hemanth Vidula, N.A. Singh, Rimjhim Padam Amrita School of Computing Department of Computer Science and Engineering Amrita Vishwa Vidyapeetham Bengaluru India

ISBN: (纸本)9798331535476

Sorting pomegranates based on quality grades is a crucial stage in their export preparation and packing process. Traditionally reliant on manual sorting methods which are time-consuming, prone to inaccuracies and incur very high costs due to the need for a large workforce prove to be very inefficient. Moreover, manual grading of export-quality of foods poses additional challenges. This paper addresses such issues by proposing an automated system employing modern digital image processing and deep learning techniques for pomegranate quality grading and classification. Initially, pomegranate images are subjected to several augmentation techniques like extracting HSV color features, Local Binary Pattern features, horizontal and vertical transformations, etc. Then a novel classification model having InceptionResNetv2 convolutional network as its backbone and integrated with optimally placed transformer blocks is proposed. The augmented dataset is utilized to train six state-of-art deep learning models namely, Vision transformers, XceptionNet, EfficientNetV2B0, DenseNet121, InceptionResNetV2 and NasNet-Large. It is observed that the proposed InceptionResNetV2 model containing transformer blocks exhibited superior performance, achieving an accuracy of 90.56%. This underscores the ability of the proposed deep learning model in automating quality evaluation and sorting procedures for pomegranates. © 2024 IEEE.

关键词： deep learning

来源：评论

学校读者我要写书评

暂无评论

6th IEEE International Conference on image processing, Applications and Systems, IPAS 2025 - Proceedings

6th IEEE International Conference on Image Processing, Appli...

引用

6th IEEE International Conference on image processing, Applications and Systems, IPAS 2025

ISBN: (纸本)9798331506520

The proceedings contain 86 papers. The topics discussed include: robust real-time monitoring of complex human activities using multi modal video analytics;a robust approach for classifying laparoscopic video distortions using ResNet-50;enhancing x-ray image classification through neural architecture;revolutionary MRI imaging for Alzheimer’s: cutting-edge GANs and vision transformer solutions;advanced deep learning strategies for breast cancer image analysis;identifying surgical instruments in pedagogical cataract surgery videos through an optimized aggregation network;enhancing auxiliary cancer classification task for multi-task breast ultrasound diagnosis network;and bioinspired computer vision for effective extended reality applications.

关键词：

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：