检索结果-内蒙古大学图书馆

Proceedings of the 2023 12th International Conference on Computing and Pattern Recognition

作者： Xiangqian Zhao Xuebin Xu Reyihanguli Kasenmu Xinjiang University China Xinjiang Key Laboratory of Signal Detection and Processing(Xinjiang University) Xinjiang University China Xinjiang Teachers college China

ISBN: (纸本)9798400707988

In the process of printed document image retrieval, the traditional algorithm, SURF algorithm combined with violent matching, has the problems of low retrieval accuracy and low retrieval efficiency. This paper proposes a FAST + PCA-SURF combined with KNN algorithm based on FLANN for multilingual document image retrieval. Based on FAST, the feature points are detected, and the feature descriptors after dimensionality reduction are extracted by ***, the KNN algorithm based on FLANN is used for feature matching, and finally the appropriate matching results are output. The experimental results show that the proposed algorithm is improved in terms of time complexity and retrieval accuracy compared with the traditional algorithm. The average time complexity and retrieval accuracy of the traditional algorithm are 0.1783 s and 71.8%, respectively, while the proposed algorithm is 0.0464 s and 77.8%, indicating that the proposed algorithm achieves better experimental results in multilingual document image retrieval.

关键词： FAST FLANN matching PCA-SURF Printed multilingual document images keyword search.

来源：评论

学校读者我要写书评

暂无评论

GCW-YOLOv8n: Lightweight Safety Helmet Wearing detection Algorithm

引用

Computer Engineering and Applications 2025年第3期61卷 144-154页

作者： Xu, Zhuang Qian, Yurong Yan, Feng Key Laboratory of Software Engineering Xinjiang University Urumqi830091 China Key Laboratory of Signal Detection and Processing in Xinjiang Uygur Autonomous Region Xinjiang University Urumqi830046 China School of Computer Science and Technology Xinjiang University Urumqi830046 China

China is a major industrial country in the world. In various construction environments, the falling of construction materials and collisions on construction sites are the main causes of casualties. Accidents caused by head injuries often occur, and wearing safety helmets can ensure the safety of construction personnel to the greatest extent possible. In order to solve the problems of poor timeliness and low management efficiency of manual management, the existing models have strict requirements for computing power, large memory requirements, and handling of load and data transmission delay of industrial equipment, and to achieve edge computing and real-time control, a modified helmet wearing detection algorithm based on YOLOv8n is proposed. Firstly, a new GS-C2f module is proposed, which introduces GhostConv and SE (squeeze and excitation) attention mechanism, effectively reducing the computational complexity of the model and helping the network extract features effectively. Secondly, the CBAM attention mechanism is introduced in the Neck section to enhance the model focus on effective features. Finally, Wise-IoUv3 is introduced to further improve the accuracy of the model. Through experiments, compared with the original YOLOv8n model, this model achieves a 21.24% reduction in computational parameters and a 0.01 improvement in recognition accuracy, achieving satisfactory results between model accuracy and complexity. © 2025 Journal of Computer Engineering and Applications Beijing Co., Ltd.;Science Press. All rights reserved.

关键词： Office equipment

来源：评论

学校读者我要写书评

暂无评论

Unsupervised Domain Adaptive Learning for Image Desnowing with Real-World Data

Unsupervised Domain Adaptive Learning for Image Desnowing wi...

引用

IEEE International Conference on Image processing

作者： Jingxu Ren Gang Zhou Yusen Zhu Yangxin Liu Juan Chen Zhenhong Jia Key Laboratory of Signal Detection and Processing Xinjiang University Urumqi China

Snow images usually contain snow grains, snow streaks, and mist, which greatly affect the visibility of images. Currently, supervised learning with synthetic data often faces limitations when it comes to handling real-world snow images. To address this crucial issue, this work proposes an unsupervised domain adaptation image snow removal framework. The framework improves the performance on real-world images by learning a domain classifier in adversarial training manner. Additionally, considering the diversity of snowflake shapes and sizes in real-world snow images, we design a multiple-kernel dilated convolution module. Extensive experiments on three representative datasets have validated that our model can achieve better results than existing desnowing methods. More importantly, experiments on real datasets show that the proposed method obtains state-of-the-art performance in real-world desnowing.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Improved Self-Consistency Training with Selective Feature Fusion for Sound Event detection

Improved Self-Consistency Training with Selective Feature Fu...

引用

IEEE International Conference on Information Communication and signal processing (ICICSP)

作者： Mingyu Wang Yunlong Li Ying Hu Key Laboratory of signal detection and processing Xinjiang University Urumqi China

Sound event detection (SED) is a joint task of identifying the categories and time boundaries of sound events within an audio clip. In this paper, we propose an improved self-consistency training (ISCT) strategy for semi-supervised SED based on Mean Teacher (MT) method. For teacher and student models, each adopts two branches with the same CRNN structure, the two branches help training the model by means of consistency regularization. ISCT strategy incorporates self-consistency loss on the basis of MT loss to improve the generalization performance of the model. A selective feature fusion (SFF) module is designed for applying in the shallow layers of the feature extraction part to selectively fuse the features with different scales. A parallel attention (PA) module is designed for applying in the deep layers of the feature extraction part to obtain much richer high-level features by the channel and spatial-wise attention. Ablation experiments verify the effectiveness of our proposed ISCT strategy, SFF and PA modules. In addition, compared with four methods, our proposed method achieves competitive performance on the DCASE 2020 task4 dataset.

关键词：

来源：评论

学校读者我要写书评

暂无评论

MSFSAN: A Novel Multi-Scale Spatio-Temporal Feature Screening Attention Network for Urban Carbon Emission Prediction

MSFSAN: A Novel Multi-Scale Spatio-Temporal Feature Screenin...

引用

IEEE International Conference on Systems, Man and Cybernetics

作者： Ben Wang Xizhong Qin Jiwei Qin Xiaoyu Zhang Haodong Ma Xinjiang Key Laboratory of Signal Detection and Processing College of Computer Science and Technology Xinjiang University Ürümqi China

ISBN: (数字)9781665410205

ISBN: (纸本)9781665410212

In order to cope with the increasingly severe global energy conservation and emission reduction problems, research on urban carbon emission prediction is of great significance. The existing methods mainly use time series analysis to predict urban carbon emission, but there is a strong spatial correlation between the carbon emission data of several cities. Therefore, this paper designs a multi-scale spatial-temporal feature screening attention network to predict target cities' future carbon emission data. Firstly, this paper combines the daily carbon emission data of the near-neighbouring cities and the daily homologous emission data of the target city to analyze the urban carbon emission data from a spatio-temporal perspective. Then, this paper designs a multi-scale spatial interactive convolution module and a multi-scale temporal convolution module to extract multi-scale spatio-temporal features effectively. In addition, the feature screening module is designed to reduce the adverse effects of redundant features. Finally, the multi-scale features are used to predict the future carbon emissions of the target city through a predictor. The experimental results show that our prediction model is superior to the existing methods in six datasets.

关键词： Correlation Convolution Urban areas Time series analysis Energy conservation Carbon dioxide Predictive models Feature extraction Cybernetics

来源：评论

学校读者我要写书评

暂无评论

EEMD and Double Thresholds Integrated Voice Activity detection 3

EEMD and Double Thresholds Integrated Voice Activity Detecti...

引用

3rd International Conference on Pattern Recognition and Machine Learning, PRML 2022

作者： Meng, Shan Ablimit, Mijit Hamdulla, Askar Xinjiang University Xinjiang Key Laboratory of Signal Detection and Processing School of Information Science and Engineering Xinjiang Urumqi China Xinjiang University Xinjiang Key Laboratory of Multilingual Information Technology School of Information Science and Engineering Xinjiang Urumqi China

ISBN: (数字)9781665499507

ISBN: (纸本)9781665499507

Voice activity detection (VAD) is an important preprocessing for voice applications. Anti-noise performance is the most important evaluation index of VAD algorithm. The traditional dual-threshold-based VAD algorithm has very low detection accuracy in a low signal-to-noise ratio environment. This paper proposes a voice activity detection algorithm based on Ensemble Empirical Mode Decomposition (EEMD) combined with the dual-threshold method, which integrates the decomposition of EEMD. The denoising feature is combined with VAD based on dual thresholds, and dual thresholds are set for VAD to improve the anti-noise performance and accuracy of the algorithm. VAD is divided into three categories: based on feature parameters, based on pattern recognition, and based on deep learning. The VAD algorithms based on pattern recognition and deep learning require the support of big training data to achieve good detection results. And it is too complex and requires a large amount of computation, thus its application and real-time deployment have been greatly affected. The traditional VAD methods based on feature parameters only needs to use the short-term energy and short-term zero-crossing rate as the judgment criteria for activity detection. Thise algorithm is simple, easy to deploy, and applicable even if it is dozens of voices. At the same time, the EEMD changes the extreme point characteristics of the signal by adding different white noises of the same amplitude each time, and then performs the overall average of the corresponding IMF(intrinsic mode function) obtained by multiple EMDs to offset the added white noise, therefor effectively suppressing the mode-mixing. Thus the production of state aliasing can better improve the anti-noise performance of the VAD algorithm. © 2022 IEEE.

关键词： Empirical mode decomposition

来源：评论

学校读者我要写书评

暂无评论

MFDPonzi: Detecting Ethereum Ponzi Schemes Using Static Features from Novel Opcode Sequences

MFDPonzi: Detecting Ethereum Ponzi Schemes Using Static Feat...

引用

International Conference on Acoustics, Speech, and signal processing (ICASSP)

作者： Longwei Cao Jiwei Qin Xuzi Zhang College of Computer Science and Technology Xinjiang University Urumqi China Xinjiang Key Laboratory of Signal Detection and Processing Xinjiang University Urumqi China

ISBN: (数字)9798350368741

ISBN: (纸本)9798350368758

Ethereum, the first blockchain platform to support smart contracts, has become a target for various cybercrimes, particularly financial frauds like Ponzi schemes. Ponzi schemes on Ethereum are known as Smart Ponzi Schemes (or Ponzi Contracts) and have caused huge financial losses. Current Ponzi contract detection models face three main challenges: simple opcode sequence processing does not effectively distinguish Ponzi from non-Ponzi contracts, single-feature-based models lack accuracy, and reliance on transaction records hinders early detection. To address these issues, this paper proposes a Multi-Feature Ponzi Scheme detection Model (MFDPonzi). MFDPonzi tracks the changes in stack, memory, and storage parameters during the execution of smart contracts, reconstructing opcode sequences and extracting diverse features, including semantic and developer features. Finally, a multi-feature fusion algorithm is used to enhance model stability. Additionally, MFDPonzi can identify Ponzi contracts at the early stage of smart contract creation without relying on transaction data. Experimental results show that MFDPonzi achieves an 85.9% recall and an 88.7% F-score on Ethereum smart contracts, outperforming baselines in both performance and robustness.

关键词： Biological system modeling Smart contracts signal processing algorithms Feature extraction Robustness Stability analysis Blockchains Logic Speech processing Thermal stability

来源：评论

学校读者我要写书评

暂无评论

Research of Scene Text detection Algorithms 5

Research of Scene Text Detection Algorithms

引用

5th International Conference on Intelligent Robotics and Control Engineering, IRCE 2022

作者： Chen, Mengmeng Ibrayim, Mayire Hamdulla, Askar School of Information Science and Engineering Xinjiang University Xinjiang Key Laboratory of Signal Detection and Processing Xinjiang Urumqi China School of Information Science and Engineering Xinjiang University Xinjiang Key Laboratory of Multilingual Information Technology Xinjiang Urumqi China

ISBN: (数字)9781665469951

ISBN: (纸本)9781665469951

With the development of artificial intelligence, obtaining textual information from natural scenes has become a hot topic. There are still huge challenges for curved text and arbitrary orientation text detection in real-world scenes. In this work, we briefly introduce the development of text detection algorithms in recent years and study a typical lightweight detection algorithm. We have improved the algorithm to different degrees to make it more suitable for detection of the curved text. Finally, we select the three most representative public text datasets for experiments. Experiments conducted on CTW1500, ICDAR2015, and MSRA-TD500 have demonstrated the superiority of this model. © 2022 IEEE.

关键词： Deep learning

来源：评论

学校读者我要写书评

暂无评论

Exploring Turkish Speech Recognition via Hybrid CTC/Attention Architecture and Multi-feature Fusion Network 3

Exploring Turkish Speech Recognition via Hybrid CTC/Attentio...

引用

3rd International Conference on Frontiers of Electronics, Information and Computation Technologies, ICFEICT 2023

作者： Yolwas, Nurmement Ren, Zeyu Wang, Huiru Slamu, Wushour Xinjiang University Xinjiang Key Laboratory of Signal Detection College of Information Science and Engineering Urumqi China Xinjiang University Xinjiang Multilingual Info. Technology Laboratory College of Information Science and Engineering Urumqi China

ISBN: (纸本)9798350302356

In recent years, there has been rapid development in deep learning-based End-To-End speech recognition technology. However, the performance of Turkish speech recognition systems has been hindered by the lack of Turkish speech data. To address this, this paper focuses on several speech recognition tuning techniques. Experimental results demonstrate that the best performance is achieved when utilizing a combination of speed perturbation and noise addition for data augmentation, along with the beam search width set to 16. Furthermore, to maximize the utilization of effective feature information and enhance feature extraction accuracy, a novel feature extractor called LSPC is proposed. By combining LSPC with the LiGRU network, a shared encoder structure is formed, and model compression is achieved. The results indicate that when using only Fbank features, the performance of LSPC surpasses that of MSPC and VGGnet, resulting in a respective improvement of 1.01% and 2.53% in the word error rate (WER). Finally, building upon the aforementioned advancements, a new multifeature fusion network is proposed as the primary encoder structure. The results demonstrate that the WER of the proposed feature fusion network, based on LSPC, further improves by 0.82% and 1.94% when compared to single-feature extraction using LSPC, specifically utilizing Fbank features and spectrogram features. As a result, our model achieves comparable performance to advanced End-To-End models. © 2023 IEEE.

关键词： Speech recognition

来源：评论

学校读者我要写书评

暂无评论

LE-CAM++: A Lighter and More Efficient CAM++ for Speaker Verification 14

LE-CAM++: A Lighter and More Efficient CAM++ for Speaker Ver...

引用

14th International Symposium on Chinese Spoken Language processing, ISCSLP 2024

作者： Liu, Shuanghong Song, Zhida Fang, Zhihua He, Liang School of Computer Science and Technology Xinjiang University Urumqi China School of Intelligence Science and Technology Xinjiang University Urumqi China Xinjiang Key Laboratory of Signal Detection and Processing Urumqi China Department of Electronic Engineering Beijing National Research Center for Information Science and Technology Tsinghua University Beijing China

ISBN: (纸本)9798331516826

Due to its superior performance and fewer parameters, CAM++ has become the state-of-the-art model for speaker verification tasks. This model uses 2D convolutional blocks to extract front-end features, which are then fed into a densely connected time-delay neural network backbone to extract deep features. However, the simple stacking of 2D convolutions may lead to the generation of a significant amount of redundant features, which is detrimental to efficient feature extraction. Furthermore, although CAM++ already has a relatively small number of parameters, there is still room for further optimization. To address these issues, this paper first employs depthwise separable convolutions to replace the dilated convolutions in the backend network of CAM++, making the model more lightweight. Next, we introduce spatial and channel reconstruction convolution (SCConv) in the ResBlock module of CAM++ to reduce redundant features and optimize the feature extraction process. Finally, after SCConv, we apply squeeze and excitation attention mechanism to model the interdependencies between channels and recalibrate each channel, further enhancing the model’s representational capacity. We name the resulting model LE-CAM++. Our proposed model achieves an EER of 0.686 and a minDCF of 0.084 on the VoxCeleb1-O dataset. Compared to the baseline model CAM++, the EER is reduced by 11%, and the minDCF is reduced by 28%. Additionally, the model parameters are reduced by 8%. ©2024 IEEE.

关键词： Deep neural networks

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：