检索结果-内蒙古大学图书馆

Deepfake Audio Detection for Urdu Language Using Deep Neural Networks

IEEE Access 2025年 13卷 97765-97778页

作者： Ahmad, Omair Khan, Muhammad Sohail Jan, Salman Khan, Inayat University of Engineering and Technology Department of Computer Software Engineering Mardan Pakistan Arab Open University Faculty of Computer Studies A’Ali732 Bahrain University of Engineering and Technology Department of Computer Science Mardan Pakistan

Audio Deepfakes, which are highly realistic fake audio recordings driven by AI tools that clone human voices, With Advancements in Text-Based Speech Generation (TTS) and Vocal Conversion (VC) technologies have enabled it easier to create realistic synthetic and imitative speech, making audio Deepfakes a common and potentially dangerous form of deception. Well-known people, like politicians and celebrities, are often targeted. They get tricked into saying controversial things in fake recordings, causing trouble on social media. Even kids’ voices are cloned to scam parents into ransom payments, etc. Therefore, developing effective algorithms to distinguish Deepfake audio from real audio is critical to preventing such frauds. Various Machine learning (ML) and Deep learning (DL) techniques have been created to identify audio Deepfakes. However, most of these solutions are trained on datasets in English, Portuguese, French, and Spanish, expressing concerns regarding their correctness for other languages. The main goal of the research presented in this paper is to evaluate the effectiveness of deep learning neural networks in detecting audio Deepfakes in the Urdu language. Since there’s no suitable dataset of Urdu audio available for this purpose, we created our own dataset (URFV) utilizing both genuine and fake audio recordings. The Urdu Original/real audio recordings were gathered from random youtube podcasts and generated as Deepfake audios using the RVC model. Our dataset has three versions with clips of 5, 10, and 15 seconds. We have built various deep learning neural networks like (RNN+LSTM, CNN+attention, TCN, CNN+RNN) to detect Deepfake audio made through imitation or synthetic techniques. The proposed approach extracts Mel-Frequency-Cepstral-Coefficients (MFCC) features from the audios in the dataset. When tested and evaluated, Our models’ accuracy across datasets was noteworthy. 97.78% (5s), 98.89% (10s), and 98.33% (15s) were remarkable results for the RNN+LSTM

关键词： Deep neural networks

来源：评论

学校读者我要写书评

暂无评论

Scalable data management in global health crises: Leveraging blockchain technology

IET Blockchain

引用

IET Blockchain 2024年第S1期4卷 596-615页

作者： Al Jobaid, Sakib Kabir, Upama Jahan, Mosarrat Department of Computer Science and Engineering University of Dhaka Dhaka Bangladesh

Effective data management is crucial in navigating any health crisis. With proper data management protocols in place, stakeholders can swiftly adapt to evolving circumstances during challenging times. A recent event like the COVID-19 pandemic has unequivocally revealed its significance. It is essential to conduct disease surveillance, practice preventive measures, and devise policies to contain the situation. As the process involves massive data growth, it demands an acute level of oversight and control. Monitoring this vast sensitive data faces multifaceted limitations, namely data tampering, breach of privacy, and centralized data stewardship. In response to these challenges, we propose an innovative blockchain-enabled scalable data management scheme in light of the COVID-19 scenario. However, blockchain cannot scale in a large ecosystem due to storing all contents in every participating node. This work addresses this shortcoming by proposing a lightweight solution that groups nodes into clusters, resulting in less memory and processing overhead. Moreover, it adopts an off-chaining technique to reduce the memory load of every node and, thereby, the entire network. The experimental results demonstrate that it attains approximately 85% and 94% storage reduction per node and the whole network, respectively, and an 87% reduction in transaction processing time. © 2024 The Author(s). IET Blockchain published by John Wiley & Sons Ltd on behalf of The Institution of engineering and Technology.

关键词： COVID-19

来源：评论

学校读者我要写书评

暂无评论

Emotion mining for early suicidal threat detection on both social media and suicide notes using context dynamic masking-based transformer with deep learning

引用

Multimedia Tools and applications 2025年第13期84卷 11729-11752页

作者： Kodati, Dheeraj Tene, Ramakrishnudu Department of Computer Science and Engineering National Institute of Technology Warangal India

Suicide represents a poignant societal issue deeply entwined with mental well-being. While existing research primarily focuses on identifying suicide-related texts, there is a gap in the advanced detection of mental health states leading to suicidal tendencies. The purpose is to recognize the emotions influencing individuals’ mental states, potentially leading to suicidal behavior. This study seeks to identify specific emotions like anger, depression, anxiety, guilt, fear, stress, and sadness within suicide-related texts sourced from both social media and suicide notes. We introduce a novel suicide severity assessment method emphasizing emotion detection across categories like suicidal ideation, suicide planning, attempted, committed, suicide notes, describing, and cannot dare. By identifying high-intensity negative emotions, our approach enables early detection of potential suicide threats. Existing studies often struggle to capture nuances and semantic relationships, particularly in suicide-related contexts. To address this, we propose a context dynamic masking based on bidirectional long short-term memory along with a multi-head self-attention and convolutional neural network (CoDyn-BMHSA-CNN) model, capturing bidirectional contexts and maintaining variable-length sequences. Our model achieves accuracies of 98.2% and 97.4%, with F1 scores of 95.5% and 93.8% for social media and suicide note datasets, respectively. We perform ablation tests to pinpoint the key components of our proposed models. Furthermore, a thorough comparative analysis evaluates the model’s performance across various contexts and platforms. Experimental results illustrate the superior effectiveness of our model in identifying emotions from text sequences compared to state-of-the-art approaches. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Wanet: weight and attention network for video summarization

引用

Discover Artificial Intelligence 2024年第1期4卷 5页

作者： Basu, Arpan Pramanik, Rishav Sarkar, Ram Department of Computer Science and Engineering Jadavpur University Kolkata India

In this paper, we propose a deep learning-based model, called Weight and Attention Network (WANet), for video summarization. The network comprises a simple multi-head attention mechanism, followed by a feed-forward network to obtain the frame importance scores. Summary keyshots are obtained from the scores using a combination of kernel temporal segmentation and the knapsack algorithm. Contrary to past methods, we first enrich the input frames with similar information as opposed to letting the model learn all the features by itself. A novel weight assignment mechanism is introduced to assign weights to the input frames based on their similarity before passing the same to the model. Experimental results on the SumMe and TVSum datasets indicate the effectiveness of the present method when compared to state-of-the-art methods applied to the same datasets. © The Author(s) 2024.

关键词： Video recording

来源：评论

学校读者我要写书评

暂无评论

Virtual machine workload prediction using deep learning

引用

International Journal of Cloud Computing 2024年第6期13卷 549-565页

作者： Abhilash, C.S. Chaithra Garag, Veena Priyanka, H. Department of Computer Science and Engineering PES University Bengaluru India

This paper presents a novel approach to optimise resource allocation in virtualised systems, aiming to maximise performance and minimise operational expenses. Leveraging deep learning models, specifically long-short-term memory (LSTM) and bidirectional gated recurrent unit (bi-GRU), the method focuses on forecasting CPU load patterns in virtual machines (VMs). Accurate predictions are crucial for proactive resource management in dynamic cloud-based infrastructures. LSTM and bi-GRU excel in handling time series forecasting due to their ability to detect temporal connections in sequential data. Using pre-processed historical CPU load data, the models undergo training with hyperparameter adjustments to enhance performance. Experimental results demonstrate that the proposed models outperform others, achieving lower average root mean square error (RMSE) values (0.05636) and mean absolute error (MAE) values (0.03721). Comparative analysis with LSTM, GRU, bi-LSTM, bi-GRU, LSTM-GRU, and bi-LSTM-GRU confirms the high predictive capabilities of LSTM and bi-GRU, with the bidirectional architecture of bi-GRU enhancing accuracy by capturing connections between previous and upcoming time steps. Copyright © 2024 Inderscience Enterprises Ltd.

关键词： Resource allocation

来源：评论

学校读者我要写书评

暂无评论

Exploring Nonuniform Structure of NSRTDs for Characterizing MACA Rules in the Null-Boundary Condition

引用

Complex Systems 2025年第1期34卷 91-127页

作者： Hazra, Suvadip Banerjee, Som Dalui, Mamata Chakraborty, Bidesh Department of Computer Science and Engineering Indian Institute of Information Technology Karnataka Dharwad58009 India Department of Computer Science and Engineering National Institute of Technology West Bengal Durgapur713209 India Department of Computer Science and Engineering Haldia Institute of Technology West Bengal Haldia721657 India

The cellular automaton (CA), a discrete model, is gaining popularity in simulations and scientific exploration across various domains, including cryptography, error-correcting codes, VLSI design and test pattern generation. This paper examines a two-state three-neighborhood nonuniform finite CA with a specified number of fixed points. We utilize a graph-based tool known as the next state rule minterm transition diagram (NSRTD) to analyze the spatiotemporal behavior of these cellular automata (CAs) across all lengths. We devise methods for synthesizing nonuniform single length cycle multi-attractor CAs (MACAs), a specialized class of irreversible CA with a predefined number of fixed points for any arbitrary length. Despite the method’s exponential worstcase time complexity, it offers the advantage of selecting rules for each CA cell from a set of candidate rules, ensuring the desired number of fixed points. © 2025, Complex Systems Publications, Inc. All rights reserved.

关键词： cellular automata fixed-point attractor length cycle multi-attractor cellular automata MACA next state rule minterm transition diagram NSRTD

来源：评论

学校读者我要写书评

暂无评论

Efficient Heuristic Replication Techniques for High Data Availability in Cloud

引用

computer Systems science & engineering 2023年第6期45卷 3151-3164页

作者： H.L.Chandrakala R.Loganathan Department of Computer Science and Engineering School of EngineeringPresidency UniversityIndia Department of Computer Science and Engineering HKBK College of EngineeringIndia

Most social networks allow connections amongst many people based on shared *** networks have to offer shared data like videos,photos with minimum latency to the group,which could be challenging as the storage cost has to be minimized and hence entire data replication is not a *** replication of data across a network of read-intensive can potentially lead to increased savings in cost and energy and reduce the end-user’s response *** simple and adaptive replication strategies exist,the solution is non-deter-ministic;the replicas of the data need to be optimized to the data usability,perfor-mance,and stability of the application *** resolve the non-deterministic issue of replication,metaheuristics are *** this work,Harmony Search and Tabu Search algorithms are used optimizing the replication process.A novel Har-mony-Tabu search is proposed for effective placement and replication of *** on large datasets show the effectiveness of the proposed *** is seen that the bandwidth saving for proposed harmony-Tabu replication per-forms better in the range of 3.57%to 18.18%for varying number of cloud data-centers when compared to simple replication,Tabu replication and Harmony replication algorithm.

关键词： Cloud computing data replication bandwidth saving Tabu search Harmony search hybrid Harmony-Tabu search

来源：评论

学校读者我要写书评

暂无评论

Video surveillance in smart cities: current status, challenges & future directions

引用

Multimedia Tools and applications 2025年第16期84卷 15787-15832页

作者： Sharma, Himani Kanwal, Navdeep Department of Computer Science & Engineering Punjabi University Punjab Patiala India

People across the world aspire to settle in urban areas for better opportunities in career, education, and healthcare facilities. The increased proportion of people living in urban areas requires an improvement of smart habitat(s) robust enough to deal with the daily needs of citizens such as personal information management, security and surveillance to deal with anomalous activities in real-time. The present paper aims to provide an extensive review based on 213 research articles published from 2001 to 2023 highlighting various technologies for smart cities and intelligent video surveillance techniques, in order to: (i) Highlight the significance of smart city surveillance as well as the current research tendencies in this field. (ii) To present and explicate a standardized model of a smart city based on video surveillance. (iii) To analyze the current status and highlight challenges, and limitations of surveillance systems in different smart city applications. The paper outlines the critical role of video surveillance as a necessary feature of every subdomain of the smart city model. The fundamental element that defines the soon-to-come victorious period is the most recent technological developments for the detection of anomalous activity, fire, digital tampering, and objects, which are thoroughly examined in existing research papers and elucidated. The article further presents a well outlined bibliographic classification of state-of-the-art techniques. A comparison of the existing video surveillance datasets has also been thoroughly analyzed. Finally, the current work identifies major research challenges and future opportunities in this domain. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： Object detection

来源：评论

学校读者我要写书评

暂无评论

A Survey of Federated Learning for IoT: Addressing Resource Constraints and Heterogeneous Challenges

Informatica (Slovenia)

引用

Informatica (Slovenia) 2025年第17期49卷 1-10页

作者： Vashisth, Sristi Goyal, Anjali Sharda University Department of Computer Science and Engineering Greater Noida India

Federated Learning (FL) has emerged as a promising approach to address the challenges of data privacy, security, and scalability in Internet of Things (IoT) environments. This paper provides a comprehensive survey of recent advances in FL for resource-constrained IoT systems, focusing on addressing the challenges of heterogeneous data, limited computational resources, and dynamic network environments. The survey highlights key achievements, including accuracy improvements of over 90% in domains such as smart homes, industrial IoT, and healthcare. Furthermore, FL solutions leveraging edge and fog computing have demonstrated significant energy efficiency improvements, reducing power consumption by up to 30%. A comparative analysis of state-of-the-art FL frameworks is presented, identifying critical research gaps in scalability, adaptive frameworks, and the integration of blockchain for enhanced security. Finally, the paper proposes future research directions to develop robust, efficient, and scalable FL solutions tailored for diverse IoT applications. © 2025 Slovene Society Informatika. All rights reserved.

关键词： Federated learning

来源：评论

学校读者我要写书评

暂无评论

Monitoring COVID-19 environment: a real-time facial mask detection using YOLO models

引用

Research on Biomedical engineering 2025年第1期41卷 1-18页

作者： Pandiyan, P. Thangaraj, Rajasekaran V, Balasubramanian V. K, Manavalasundaram S, Balakrishnan Department of Electrical and Electronics Engineering KPR Institute of Engineering and Technology Coimbatore India Department of Computer Science and Engineering Nandha Engineering College Erode India Department of Information Technology Velalar College of Engineering and Technology Erode India Department of Computer Science and Engineering Aarupadai Veedu Institute of Technology Chennai India Department of Computer Science and Engineering Sri Ramakrishna Institute of Technology Coimbatore India

Purpose: The rapid spread of COVID-19 has resulted in significant harm and impacted tens of millions of people globally. In order to prevent the transmission of the virus, individuals often wear masks as a protective measure for themselves and others. Coronavirus protection guidelines have been published by the World Health Organization (WHO). According to WHO standards, COVID-19 can be prevented by wearing a mask in public places and congested regions. In these places, it is very difficult to personally check to see if people are wearing face masks or not. Methods: The objective of this research work is to build a powerful, efficient, and real-time approach for detecting people not wearing masks. Three cutting-edge object identification models, namely YOLOv4, Tiny-YOLOv4, and YOLOv5, are employed in this study for the identification of masked faces. Result: The proposed YOLOv5 model is evaluated using real-time images captured using a smartphone or tablet. The test images include both single and multiple people with and without masks. The YOLOv5 model achieved recognition accuracy of 88.90% with an average detection speed of 0.0316 s per image, whereas the YOLOv4 and Tiny-YOLOv4 produced recognition accuracy of 82.24% and 74.80% with an average detection speed of 0.0530 s and 0.0541 s per image, respectively. Conclusion: The comparative performance suggests that the YOLOv5 model has a maximum recognition accuracy of 88.90% in face mask identification tasks compared to other models such as the YOLOv4 and Tiny-YOLOv4. © The Author(s), under exclusive licence to The Brazilian Society of Biomedical engineering 2025.

关键词： COVID-19

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：