检索结果-内蒙古大学图书馆

Colored Edge Detection Using Thresholding Techniques

Recent Advances in computer science and Communications 2023年第4期16卷 33-41页

作者： Fenyi, Adolf Fenyi, Isaac Asante, Michael Department of Computer Science and Information Technology Kwame Nkrumah University of Science and Technology Kumasi Ghana

Background: In this research, a novel algorithm is formulated through the combination of gradient and adaptive thresholding. A set of 5 X 5 convolution kernels were generated to determine the gradients in the four main directions of the image. Objectives: The researcher converted the gaussian equation into a normalized kernel, which was convolved with the gradients to suppress the impact of noise. Methods: The edges derived were partitioned into a set of 5 x 5 matrices. A weighted variance was calculated for each local window in the image. The pixel that generated the minimum variance was used for the segmentation process in each local window. The researcher then trimmed multiple pixel width edges into singles by developing a set of 5 X 5 Structuring Elements (SE). These elements were placed over the image to remove boundary pixels. In order to produce colored edges, the algorithm was executed over all the channels and the results were concatenated to produce the skeletal colored edges. Results: From the evaluations conducted, the proposed algorithm exhibited better performance than most of the recent algorithms with respect to Human Perception Clarity and time complexity in both noisy and non-uniform illuminated images. Conclusion: The reason for this performance is that it is able to extract edges moving in the various directions of images. It also ensures that identified edges are single pixel width instead of multiple. © 2023 Bentham science Publishers.

关键词： Pixels

来源：评论

学校读者我要写书评

暂无评论

Deepfake Audio Detection for Urdu Language Using Deep Neural Networks

引用

IEEE Access 2025年 13卷 97765-97778页

作者： Ahmad, Omair Khan, Muhammad Sohail Jan, Salman Khan, Inayat University of Engineering and Technology Department of Computer Software Engineering Mardan Pakistan Arab Open University Faculty of Computer Studies A’Ali732 Bahrain University of Engineering and Technology Department of Computer Science Mardan Pakistan

Audio Deepfakes, which are highly realistic fake audio recordings driven by AI tools that clone human voices, With Advancements in Text-Based Speech Generation (TTS) and Vocal Conversion (VC) technologies have enabled it easier to create realistic synthetic and imitative speech, making audio Deepfakes a common and potentially dangerous form of deception. Well-known people, like politicians and celebrities, are often targeted. They get tricked into saying controversial things in fake recordings, causing trouble on social media. Even kids’ voices are cloned to scam parents into ransom payments, etc. Therefore, developing effective algorithms to distinguish Deepfake audio from real audio is critical to preventing such frauds. Various Machine learning (ML) and Deep learning (DL) techniques have been created to identify audio Deepfakes. However, most of these solutions are trained on datasets in English, Portuguese, French, and Spanish, expressing concerns regarding their correctness for other languages. The main goal of the research presented in this paper is to evaluate the effectiveness of deep learning neural networks in detecting audio Deepfakes in the Urdu language. Since there’s no suitable dataset of Urdu audio available for this purpose, we created our own dataset (URFV) utilizing both genuine and fake audio recordings. The Urdu Original/real audio recordings were gathered from random youtube podcasts and generated as Deepfake audios using the RVC model. Our dataset has three versions with clips of 5, 10, and 15 seconds. We have built various deep learning neural networks like (RNN+LSTM, CNN+attention, TCN, CNN+RNN) to detect Deepfake audio made through imitation or synthetic techniques. The proposed approach extracts Mel-Frequency-Cepstral-Coefficients (MFCC) features from the audios in the dataset. When tested and evaluated, Our models’ accuracy across datasets was noteworthy. 97.78% (5s), 98.89% (10s), and 98.33% (15s) were remarkable results for the RNN+LSTM

关键词： Deep neural networks

来源：评论

学校读者我要写书评

暂无评论

Workout Action Recognition in Video Streams Using an Attention Driven Residual DC-GRU Network

引用

computers, Materials & Continua 2024年第5期79卷 3067-3087页

作者： Arnab Dey Samit Biswas Dac-Nhuong Le Department of Computer Science and Technology Indian Institute of Engineering Science and TechnologyShibpurHowrah711103India Faculty of Information Technology Haiphong UniversityHaiphong180000Vietnam

Regular exercise is a crucial aspect of daily life, as it enables individuals to stay physically active, lowers thelikelihood of developing illnesses, and enhances life expectancy. The recognition of workout actions in videostreams holds significant importance in computer vision research, as it aims to enhance exercise adherence, enableinstant recognition, advance fitness tracking technologies, and optimize fitness routines. However, existing actiondatasets often lack diversity and specificity for workout actions, hindering the development of accurate recognitionmodels. To address this gap, the Workout Action Video dataset (WAVd) has been introduced as a significantcontribution. WAVd comprises a diverse collection of labeled workout action videos, meticulously curated toencompass various exercises performed by numerous individuals in different settings. This research proposes aninnovative framework based on the Attention driven Residual Deep Convolutional-Gated Recurrent Unit (ResDCGRU)network for workout action recognition in video streams. Unlike image-based action recognition, videoscontain spatio-temporal information, making the task more complex and challenging. While substantial progresshas been made in this area, challenges persist in detecting subtle and complex actions, handling occlusions,and managing the computational demands of deep learning approaches. The proposed ResDC-GRU Attentionmodel demonstrated exceptional classification performance with 95.81% accuracy in classifying workout actionvideos and also outperformed various state-of-the-art models. The method also yielded 81.6%, 97.2%, 95.6%, and93.2% accuracy on established benchmark datasets, namely HMDB51, Youtube Actions, UCF50, and UCF101,respectively, showcasing its superiority and robustness in action recognition. The findings suggest practicalimplications in real-world scenarios where precise video action recognition is paramount, addressing the persistingchallenges in the field. TheWAVd datas

关键词： Workout action recognition video stream action recognition residual network GRU attention

来源：评论

学校读者我要写书评

暂无评论

Prioritization and offloading in P4 switch integrated with NFV

引用

Telecommunication Systems 2024年第3期86卷 571-584页

作者： Neha, Farhin Faiza Lai, Yuan-Cheng Hossain, Md. Shohrab Lin, Ying-Dar Department of Computer Science and Engineering Bangladesh University of Engineering and Technology Dhaka Bangladesh Department of Information Management National Taiwan University of Science and Technology Taipei Taiwan Department of Computer Science National Yang Ming Chiao Tung University Hsinchu Taiwan

The architecture of integrating Software Defined Networking (SDN) with Network Function Virtualization (NFV) is excellent because the former virtualizes the control plane, and the latter virtualizes the data plane. As Programming Protocol-independent Packet Processors (P4) become popular, the architecture integrating SDN with NFV may shift from traditional switches to P4 switches. In this architecture, which integrates P4 switch and NFV (P4 + NFV), network functions can be provided in both P4 switches (PNF) and NFV (VNF). Thus, to minimize packet delay, an offloading problem between P4 switches and NFV in this P4 + NFV should be addressed. This paper tackles this offloading problem and figures out the prioritization mechanism between newly arriving packets and packets that require VNF for minimizing packet delay. We model and analyze the P4 + NFV architecture using an M/M/1 queuing model with non-preemptive priority. Also, we propose an optimization solution based on gradient descent to find the optimal offloading probability of going to VNF. Results show that optimal offloading from P4 switch to NFV can reduce the average packet delay from 13.74 to 40.73%, when packets requiring VNF are given higher priority than newly arriving packets. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： Software defined networking

来源：评论

学校读者我要写书评

暂无评论

Enhanced Differentiable Architecture Search Based on Asymptotic Regularization

引用

computers, Materials & Continua 2024年第2期78卷 1547-1568页

作者： Cong Jin Jinjie Huang Yuanjian Chen Yuqing Gong School of Computer Science and Technology Harbin University of Science and TechnologyHarbin150006China School of Automation Harbin University of Science and TechnologyHarbin150006China

In differentiable search architecture search methods,a more efficient search space design can significantly improve the performance of the searched architecture,thus requiring people to carefully define the search space with different complexity according to various *** rationalizing the search strategies to explore the well-defined search space will further improve the speed and efficiency of architecture *** this in mind,we propose a faster and more efficient differentiable architecture search method,***,we introduce a more efficient search space enriched by the introduction of two redefined convolution ***,we utilize a more efficient architectural parameter regularization method,mitigating the overfitting problem during the search process and reducing the error brought about by gradient ***,we introduce a natural exponential cosine annealing method to make the learning rate of the neural network training process more suitable for the search ***,group convolution and data augmentation are employed to reduce the computational ***,through extensive experiments on several public datasets,we demonstrate that our method can more swiftly search for better-performing neural network architectures in a more efficient search space,thus validating the effectiveness of our approach.

关键词： Differentiable architecture search allegro search space asymptotic regularization natural exponential cosine annealing

来源：评论

学校读者我要写书评

暂无评论

Fuzzy Reliability and Availability of System under a Calendar-based Inspection Involving Multiple Failures and Its Application to Wind Turbine System

引用

Journal of Systems science and Systems Engineering 2024年第2期33卷 187-206页

作者： Mintu Kumar Himani Pant S.B.Singh Department of Mathematics Statistics and Computer Science G.B.Pant University of AgricultureTechnologyPantnagarUttarakhandIndia

Uncertainty is an important factor that needs to be considered while analyzing the performance of any engineering *** order to quantify uncertainty,fuzzy set theory is frequently used by most of researchers,including energy system *** to the classical reliability theory,component lifetimes have crisp parameters,but due to uncertainty and inaccuracy in data,it is sometimes very difficult to determine the exact values of these parameters in real-world *** overcome this difficulty in the current research,failure and repair rates were taken as triangular fuzzy numbers to determine the fuzzy availability of a system undergoing calendar-based periodic inspection subject to multiple failure modes(FMs).It was assumed that each component in the system had an exponential failure rate and repair rate with fuzzy *** FMs were explicitly taken into account when a functional state of the system was *** FM had a random failure *** the occurrence of any failure,a random time was selected for the relevant corrective repair *** proposed research was studied for one of the major sources of green energy,namely a wind turbine system wherein all the derived propositions have been implemented on it.

关键词： Fuzzy reliability fuzzy availability wind turbine system multiple failure modes triangular fuzzy number

来源：评论

学校读者我要写书评

暂无评论

Modern Standard Arabic speech disorders corpus for digital speech processing applications

引用

International Journal of Speech technology 2024年第1期27卷 157-170页

作者： Alqudah, Assal A. M. Alshraideh, Mohammad A. M. Abushariah, Mohammad A. M. Sharieh, Ahmad A. S. Department of Computer Science King Abdullah II School of Information Technology The University of Jordan Amman Jordan Department of Computer Information Systems King Abdullah II School of Information Technology The University of Jordan Amman Jordan Department of Computer Science Faculty of Science and Information Technology Al-Zaytoonah University of Jordan Amman Jordan

Digital speech processing applications including automatic speech recognition (ASR), speaker recognition, speech translation, and others, essentially require large volumes of speech data for training and testing purposes. Although there are available speech corpora, speech data for speakers suffering speech disorders are hardly available for many languages including Arabic language. Consequently, developing digital speech processing applications that target the entire society becomes hard due to the unavailability of speech corpora that contain sufficient speakers’ variations including healthy and disordered speech. This research presents our work towards developing a Modern Standard Arabic (MSA) speech corpus for speakers suffering distortion and substitution articulation disorders. The speech corpus was recorded by 40 (20 male and 20 female) Jordanian speakers who suffer either distortion or/and substitution articulation disorders. This speech corpus can be used for various applications including ASR, speech and hearing, and others. Part of this speech corpus is used for developing and evaluating an ASR for MSA using the Carnegie Mellon University (CMU) Pocketsphinx tools based on Mel-Frequency Cepstral Coefficients (MFCC) and Hidden Markov Model (HMM) techniques. Furthermore, Linear Discriminant Analysis (LDA) and Maximum Likelihood Linear Transform (MLLT) optimization techniques were applied. Using three different testing data sets, this work obtained 98.38% and 1.76% average word recognition correctness rate (WRCR) and average Word Error Rate (WER), respectively, for speaker-dependent and text-independent. For speaker-independent and text-dependent, this work obtained 99.37% and 0.68% average WRCR and average WER, respectively, whereas for speaker-independent and text-independent this work obtained 96.53% and 4.00% average WRCR and average WER, respectively. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Natur

关键词： Hidden Markov models

来源：评论

学校读者我要写书评

暂无评论

PLD-Det: plant leaf disease detection in real time using an end-to-end neural network approach based on improved YOLOv7

引用

Neural Computing and Applications 2024年第34期36卷 21885-21898页

作者： Mehedi, Md Humaion Kabir Nawer, Nafisa Ahmed, Shafi Khan, Md Shakiful Islam Hasib, Khan Md Mridha, M.F. Alam, Md. Golam Rabiul Nguyen, Thanh Thi Department of Computer Science and Engineering BRAC University Dhaka Bangladesh Department of Computer Science and Engineering Bangladesh University of Business and Technology Dhaka Bangladesh Department of Computer Science American International University Dhaka Bangladesh Department of Data Science and AI Monash University Wellington Rd Clayton MelbourneVIC3800 Australia

In order to maintain sustainable agriculture, it is vital to monitor plant health. Since all species of plants are prone to characteristic diseases, it necessitates regular surveillance to search for any symptoms, which is utterly challenging and time-consuming. Besides, farmers may struggle to identify the type of plant disease and its potential symptoms. Hence, the interest in research like image-based computer-aided automated plant leaf disease detection by analyzing the early symptoms has increased enormously. However, limitations in the plant leaf image database, for instance, unfitting backgrounds, blurry images, and so on, sometimes cause underprivileged feature extraction, misclassification, and overfitting issues in existing models. As a result, we have proposed a real-time plant leaf disease detection architecture incorporating proposed PLD-Det model, which is based on improved YOLOv7 with the intention of assisting farmers while reducing the issues in existing models. The architecture has been trained on the widely used PlantVillage dataset, which resulted in an accuracy of 98.53%. Furthermore, SHapley Additive exPlanations (SHAP) values have been analyzed as a unified measure of feature significance. According to the experimental findings, the proposed PLD-Det model, which is an improved YOLOv7 architecture, outperformed the original YOLOv7 model in test accuracy by approximately 4%. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.

关键词： Seed

来源：评论

学校读者我要写书评

暂无评论

Interactive teaching design of mobile English based on deep learning

引用

Journal of Ambient Intelligence and Humanized Computing 2025年第1期16卷 1-14页

作者： Yan, Zhang Bing, Yan Shimeng, Pan Department of Computer Science and Technology Taiyuan University Shanxi Taiyuan030032 China University of Science and Technology Liaoning Liaoning Anshan114051 China

Aiming to enhance the management stage of Mobile English Interactive Educating in the intelligent flipped classroom mode, a design method of Mobile English Interactive Teaching Based on deep learning is proposed. Extract the information entropy of the distribution of Mobile English interactive teaching resources, analyze the feature quantities of mobile information interaction and channel equilibrium scheduling by the methods of deep and enhanced tracking learnings, construct the parameter set of phase space feature reorganization allocation by using the adaptive resource allocation equilibrium control and big data merging scheduling methods, the dynamic allocation of teaching resources is realized by priority scheduling and dynamic allocation. Realize software development and design under the Linux and net framework 4.0 framework. The simulation results show that the proposed method can improve the dynamic allocation and mining ability of Mobile English interactive teaching resources, the data merging level of the platform is high, the channel balance of Mobile English interaction is good, and the information interaction ability is strong. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.

关键词： computer operating systems

来源：评论

学校读者我要写书评

暂无评论

A Convolutional Neural Network Model for Wheat Crop Disease Prediction

引用

computers, Materials & Continua 2023年第5期75卷 3867-3882页

作者： Mahmood Ashraf Mohammad Abrar Nauman Qadeer Abdulrahman A.Alshdadi Thabit Sabbah Muhammad Attique Khan Department of Computer Science and Artificial Intelligence College of Computer Science and EngineeringUniversity of JeddahJeddah21577Saudi Arabia Department of Computer Science Bacha Khan University CharsaddaKhyber Pakhtunkhwa24461Pakistan Department of Computer Science Federal Urdu University of ArtsScience&TechnologyIslamabad45570Pakistan Department of Information Systems and Technology College of Computer Science and EngineeringUniversity of JeddahJeddah21577Saudi Arabia Faculty of Technology and Applied Sciences Al-Quds Open UniversityAl-Bireh1804Palestine Department of Computer Science HITEC UniversityTaxila47080Pakistan

Wheat is the most important cereal crop,and its low production incurs import pressure on the *** fulfills a significant portion of the daily energy requirements of the human *** wheat disease is one of the major factors that result in low production and negatively affects the national ***,timely detection of wheat diseases is necessary for improving *** CNN-based architectures showed tremendous achievement in the image-based classification and prediction of crop ***,these models are computationally expensive and need a large amount of training *** this research,a light weighted modified CNN architecture is proposed that uses eight layers particularly,three convolutional layers,three SoftMax layers,and two flattened layers,to detect wheat diseases *** high-resolution images were collected from the fields in Azad Kashmir(Pakistan)and manually annotated by three human *** convolutional layers use 16,32,and 64 *** filter uses a 3×3 kernel *** strides for all convolutional layers are set to *** this research,three different variants of datasets are *** variants S1-70%:15%:15%,S2-75%:15%:10%,and S3-80%:10%:10%(train:validation:test)are used to evaluate the performance of the proposed *** extensive experiments revealed that the S3 performed better than S1 and S2 datasets with 93%*** experiment also concludes that a more extensive training set with high-resolution images can detect wheat diseases more accurately.

关键词： Machine learning wheat crop disease prediction convolutional neural network artificial intelligence

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：