检索结果-内蒙古大学图书馆

4th International Conference on Signal processing and Machine learning, CONF-SPML 2024

ISBN: (纸本)9781510674721

The proceedings contain 28 papers. The topics discussed include: phototropic bionics: realization of intelligent machine detection and obstacle avoidance;a study of model predictive control and reinforcement learning control system;advancements and challenges in speech emotion recognition: a comprehensive review;revolutionizing ADHD diagnosis: deep learning in 3D medical imaging;improving robustness in emotion recognition via adversarial training;real image improvement study based on pivotal tuning inversion;a review of 3D printing slicing algorithms;and analysis of two variants of U-net for pulmonary nodule segmentation: attention U-net and dense-attention U-net.

关键词：

来源：评论

学校读者我要写书评

暂无评论

real-time processing of image, Depth and Video Information 2023

Real-time Processing of Image, Depth and Video Information 2...

引用

real-time processing of image, Depth and Video Information 2023

ISBN: (纸本)9781510662629

The proceedings contain 16 papers. The topics discussed include: evolution of real-time processing of visual information over four decades: a retrospective as outlook to the future of real-time imaging;real-time embedded large-scale place recognition for autonomous ground vehicles using a spatial descriptor;real-time video super-resolution reconstruction using wavelet transforms and sparse representation;development of light-field motion tracking technology for use in laboratory studies of planet formation;towards learning-based denoising of light fields;real-time onboard visual parking space detection: a performance study;an automated AI and video measurement techniques for monitoring social distancing, mask detection, and facial temperature screening for COVID-19;computational efficient deep learning-based super resolution approach;and in-sensor neural network for real-time KWS by image processing.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Lightweight Indoor Positioning System Based on Multiple Self-learning Features and Key Frame Classification

Lightweight Indoor Positioning System Based on Multiple Self...

引用

2024 Mid-term Symposium on Spatial Information to Empower the Metaverse

作者： Wang, Chenzhe Bi, Kai Zhao, Bianli Li, Ming Chen, Yujia Tao, Shiliang Yang, Juntao National Geomatics Center of China Beijing China North China University of Science and Technology Tangshan China Baidu. Inc. Beijing China Shandong University of Science and Technology Qingdao China

Traditional indoor positioning technologies mostly require advanced installation of hardware devices, resulting in high costs and long-term maintenance. With advancements in image recognition and deep learning technologies, indoor visual positioning based on image recognition has become increasingly mature. This method offers the benefits of low cost and does not require additional hardware installation. However, it still has inherent defects, such as cumbersome data collection, complex algorithms, and universality. To minimize indoor information pre-collection cost, improve versatility, and enable rapid deployment in low-performance mobile devices, this paper proposes a lightweight indoor positioning system based on multiple self-learning features and key frame classification. The system is divided into two stages: preprocessing and real-time positioning. In the preprocessing stage, image information is collected for the entire indoor environment, and a key-frame recognizer is trained based on the image information. Simultaneously, an environmental feature information database is established. In the real-time positioning stage, the system first uses mobile devices such as smartphones to obtain real-time video streams. A key frame recognizer based on convolutional neural networks identifies key frames in each video stream frame, thereby obtaining approximate positions for rough positioning. Second, feature points are identified in each frame of the video stream and matched with feature points with location information in the built environmental feature information database to calculate precise positions for fine positioning. It has significant optimizations compared with conventional visual solutions in terms of preprocessing data collection, algorithm performance consumption, and versatility. © Author(s) 2024.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

Adaptive Edge Systems for Smart IoT Applications

Adaptive Edge Systems for Smart IoT Applications

引用

作者： Liu, Miaomiao University of California Merced

学位级别：Ph.D., Doctor of Philosophy

The proliferation of the Internet of Things (IoT) and cloud services has given rise to the edge computing paradigm, where data is processed partly or entirely at the edge of the network, rather than solely in the cloud. Edge computing can address problems such as latency, limited battery life of mobile devices, bandwidth costs, security, and privacy. Typical applicable scenarios based on edge computing include video analytics, smart home, smart city, and collaborative *** the development of deep learning techniques, research on employing deep learning to develop intelligent edge systems is emerging. In this dissertation, we aim to investigate how deep learning can process data on source-constrained individual edge devices in real time and how deep learning can process data by utilizing collaborative edge devices to provide better *** build several critical systems, including video analytics, driving anomaly detection, arm posture tracking, and device orientation tracking. In the video analytics system, we combine deep learning with traditional image processing techniques to achieve real-time object detection on mobile devices without offloading. In the driving anomaly detection system, we train deep learning models for driving anomaly detection by leveraging the information from collaborative peer devices to provide better accuracy. In the arm posture tracking system, we employ multitask learning to track the orientation and location of the wrist simultaneously, which significantly improves the latency compared to the conventional methods. In the device orientation tracking system, we develop a deep reinforcement learning framework to train an agent that adjusts the parameters of a conventional orientation tracking method in response to changing *** IoT systems continue to grow in complexity and size, preserving training data has become an increasingly important challenge. In our future work, we plan to investigate the use of representation

关键词： deep learning Edge computing Internet of Things Mobile devices real-time performance System adaptation

来源：评论

学校读者我要写书评

暂无评论

The study on ultrasound image classification using a dual-branch model based on Resnet50 guided by U-net segmentation results

引用

BMC MEDICAL IMAGING 2024年第1期24卷 1-16页

作者： Yang, Xu Qu, Shuo'ou Wang, Zhilin Li, Lingxiao An, Xiaofeng Cong, Zhibin Changchun Univ Sci & Technol Sch Elect & Informat Engn Changchun 130022 Peoples R China CCUCM Human Resources Dept Affiliated Hosp 3 Changchun 130117 Peoples R China Jilin Engn Normal Univ Educ Qual Monitoring Ctr Changchun 130052 Peoples R China Univ Tradit Chinese Med Dept Electrodiag Affiliated Hosp Changchun Changchun 130021 Peoples R China

In recent years, the incidence of nodular thyroid diseases has been increasing annually. Ultrasonography has become a routine diagnostic tool for thyroid nodules due to its high real-time capabilities and low invasiveness. However, thyroid images obtained from current ultrasound tests often have low resolution and are plagued by significant noise interference. Regional differences in medical conditions and varying levels of physician experience can impact the accuracy and efficiency of diagnostic results. With the advancement of deep learning technology, deep learning models are used to identify whether a nodule in a thyroid ultrasound image is benign or malignant. This helps to close the gap between doctors' experience and equipment differences, improving the accuracy of the initial diagnosis of thyroid nodules. To cope with the problem that thyroid ultrasound images contain complex background and noise as well as poorly defined local features. in this paper, we first construct an improved ResNet50 classification model that uses a two-branch input and incorporates a global attention lightening module. This model is used to improve the accuracy of benign and malignant nodule classification in thyroid ultrasound images and to reduce the computational effort due to the two-branch *** constructed a U-net segmentation model incorporating our proposed ACR module, which uses hollow convolution with different dilation rates to capture multi-scale contextual information for feature extraction of nodules in thyroid ultrasound images and uses the results of the segmentation task as an auxiliary branch for the classification task to guide the classification model to focus on the lesion region more efficiently in the case of weak local features. The classification model is guided to focus on the lesion region more efficiently, and the classification and segmentation sub-networks are respectively improved specifically for this study, which is used to improve the accurac

关键词： deep learning Thyroid ultrasound images Resnet50 U-net: attention mechanism

来源：评论

学校读者我要写书评

暂无评论

A novel deep learning-based technique for efficient characterization of engineered cementitious composites cracks for durability assessment

引用

STRUCTURAL CONCRETE 2025年第2期26卷 2107-2123页

作者： Das, Avik Kumar Leung, Christopher K. Y. Tsinghua Univ Inst Ocean Engn SIGS Shenzhen Peoples R China HKUST Dept Civil & Environm Engn Hong Kong Peoples R China

Engineered Cementitious Composites also known as Strain-hardening cementitious composites (SHCCs) has unique cracking patterns like cracks that have tiny widths and showcase high density. All of this makes it difficult and laborious to compute crack parameters from crack patterns. Unfortunately, this is an essential part of assessing durability and micromechanical modeling. SHSnet is developed to perform end-to-end semantic segmentation of SHCC cracks. SHSnet is efficient, attention based deep encoder-decoder network with large receptive field. Loss function based on Tversky function were used for training the model. SHSnet with loss function shows promising result with mPrecision, mF1Score and mIoU of 0.87, 0.84 and 0.83 respectively for complex SHCC cracks while requiring at least an order of fewer computational parameters than those in the literature. An image processing unit is then used to estimate the width, number, and length of the cracks from the segmentation mask. Test results show that the computed crack parameters with SHSnet are exactly the same as that computed with an optical microscope but require similar to 100x less time. Results demonstrate that SHSnet works equally well in SHCCs with different surface textures, crack density, and widths;the final result was far superior to a conventional technique. This technique also shows promising results in an automatic evaluation of crack parameters relevant to durability and visualizing crack patterns even in the presence of artifacts during progressive testing. The results also demonstrate the necessity to accurately and densely calculate crack length and maximum crack width;else the durability results are expected to be significantly more conservative than the actual value.

关键词： deep learning engineered cementitious composites multiple-thin- tortuous crack characterization strain hardening cementitious composites

来源：评论

学校读者我要写书评

暂无评论

Decentralized Federated deep learning image Recognition Models 4

Decentralized Federated Deep Learning Image Recognition Mode...

引用

4th International Conference on Artificial Intelligence, Robotics and Control, AIRC 2023

作者： Kugan, Sharun Islam, Md Quyyum Ul Kashef, Rasha Toronto Metropolitan University Department of Electrical Computer Biomedical Engineering Toronto Canada

ISBN: (纸本)9798350348248

In the era of IoT, numerous frameworks and cutting-edge models have been introduced to enhance user experience and privacy and reduce the risk of data breaches. Over time, IoT device usage has grown tremendously, and a flood of data has been sent to servers for processing. Federated learning has been deployed for efficient decentralization while preserving privacy. Federated learning has been applied in various IoT-related applications such as image classification, object segmentation, object detection, and sensor analytics. Existing centralized image recognition models fall short of providing accurate image classification with acceptable processing time for real-time deployment while preserving privacy. In this paper, we designed two decentralized deep learning models using federated learning, the CNN-TFF and the VGG16-TFF. With around 250 training iterations, we achieved a high accuracy rate of up to 90% with a decrease in the loss value for the CIFAR-100 dataset using the VGG16-TFF model while maintaining data privacy using federated learning. © 2023 IEEE.

关键词： image classification

来源：评论

学校读者我要写书评

暂无评论

Research on Computer deep learning Algorithm to Optimize the Fault Prediction Model of Nuclear Power Plant Emergency Feed Pump 2

Research on Computer Deep Learning Algorithm to Optimize the...

引用

2nd IEEE International Conference on image processing and Computer Applications, ICIPCA 2024

作者： Gao, Chao Song, Xianjun Zhang, Zhiqiang Li, Chunguang Chang, Xincai Cgn Digital Technology Co. Ltd Cgnpc Beijing China

ISBN: (纸本)9798350360240

This project takes the typical failure forms of pumps in nuclear power plants such as abnormal vibration, friction and wear as the research object. The most readily available pump housing acceleration signal frequency domain information is used as input. The system is mainly composed of four modules: input module, convolution module, multi-head self-attention module and output module. The attention mechanism of characteristic frequency domain data based on the fusion of deep neural network and attention network is studied. At the same time, the pattern recognition model of pump faults in nuclear power plant is constructed. Compared with the existing research results, using frequency-domain data as input, using frequency-domain data attention network and other technologies can shorten the amount of input data, and make the recognition accuracy of the model up to 100% in the test set, which has obvious advantages over other fault recognition models based on deep neural networks. © 2024 IEEE.

关键词： Nuclear energy

来源：评论

学校读者我要写书评

暂无评论

Multi-Stream Temporally Enhanced Network for Video Salient Object Detection

引用

Computers, Materials & Continua 2024年第1期78卷 85-104页

作者： Dan Xu Jiale Ru Jinlong Shi School of Computer Science Jiangsu University of Science and TechnologyZhenjiang212100China

Video salient object detection(VSOD)aims at locating the most attractive objects in a video by exploring the spatial and temporal *** poses a challenging task in computer vision,as it involves processing complex spatial data that is also influenced by temporal *** the progress made in existing VSOD models,they still struggle in scenes of great background diversity within and between ***,they encounter difficulties related to accumulated noise and high time consumption during the extraction of temporal features over a long-term *** propose a multi-stream temporal enhanced network(MSTENet)to address these *** investigates saliency cues collaboration in the spatial domain with a multi-stream structure to deal with the great background diversity challenge.A straightforward,yet efficient approach for temporal feature extraction is developed to avoid the accumulative noises and reduce time *** distinction between MSTENet and other VSOD methods stems from its incorporation of both foreground supervision and background supervision,facilitating enhanced extraction of collaborative saliency *** notable differentiation is the innovative integration of spatial and temporal features,wherein the temporal module is integrated into the multi-stream structure,enabling comprehensive spatial-temporal interactions within an end-to-end *** experimental results demonstrate that the proposed method achieves state-of-the-art performance on five benchmark datasets while maintaining a real-time speed of 27 fps(Titan XP).Our code and models are available at https://***/RuJiaLe/MSTENet.

关键词： Video salient object detection deep learning temporally enhanced foreground-background collaboration

来源：评论

学校读者我要写书评

暂无评论

Measurement of beam offset using wavefront distribution of vortex beam

Measurement of beam offset using wavefront distribution of v...

引用

2024 Advanced Fiber Laser Conference, AFL 2024

作者： Cao, Yousheng Li, Xiaoji Key Laboratory of Cognitive Radio and Information Processing Ministry of Education Guilin University of Electronic Technology Guilin541004 China

ISBN: (数字)9781510688889

ISBN: (纸本)9781510688872

In the field of underwater wireless optical communication, optical transmitters and optical receivers need to track and align in real time, which poses challenges to the real-time and accuracy of the measurement method for beam offset. To address this issue, a two-dimensional signal cross-correlation method based on Fourier transform is proposed to measure the beam shift using the unique spatial distribution of vortex beams, and an experimental setup is established to verify the effectiveness of the proposed method. © 2025 SPIE.

关键词： Underwater optical wireless communication

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：