Search Results - Inner Mongolia University Library

Optimizing BERT for Bengali Emotion Classification: Evaluating Knowledge Distillation, Pruning, and Quantization

Computer Modeling in Engineering & Sciences, 2025, Vol. 142, No. 2, pp. 1637-1666

Authors: Md Hasibur Rahman, Mohammed Arif Uddin, Zinnat Fowzia Ria, Rashedur M. Rahman. Department of Electrical and Computer Engineering, North South University, Dhaka 1229, Bangladesh

The rapid growth of digital data necessitates advanced natural language processing (NLP) models like BERT (Bidirectional Encoder Representations from Transformers), known for its superior performance in text ***, BERT’s size and computational demands limit its practicality, especially in resource-constrained *** research compresses the BERT base model for Bengali emotion classification through knowledge distillation (KD), pruning, and quantization *** Bengali being the sixth most spoken language globally, NLP research in this area is *** approach addresses this gap by creating an efficient BERT-based model for Bengali *** have explored 20 combinations for KD, quantization, and pruning, resulting in improved speedup, fewer parameters, and reduced memory *** best results demonstrate significant improvements in both speed and *** instance, in the case of mBERT, we achieved a 3.87× speedup and 4× compression ratio with a combination of Distil+Prune+Quant that reduced parameters from 178 to 46 M, while the memory size decreased from 711 to 178 *** results offer scalable solutions for NLP tasks in various languages and advance the field of model compression, making these models suitable for real-world applications in resource-limited environments.

Keywords: Bengali NLP; black-box distillation; emotion classification; model compression; post-training quantization; unstructured pruning
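The reduction figures quoted in the abstract above can be sanity-checked with simple arithmetic. The sketch below is only an illustration: the helper name `reduction_ratio` is hypothetical, the abstract does not state how its "compression ratio" is defined, and the unit of the memory figures is elided in the source, so the code works with unit-free ratios.

```python
def reduction_ratio(before: float, after: float) -> float:
    """Return how many times smaller `after` is than `before`."""
    if after <= 0:
        raise ValueError("after must be positive")
    return before / after


# Figures quoted in the abstract for mBERT with Distil+Prune+Quant.
params_before_m, params_after_m = 178, 46  # millions of parameters
memory_before, memory_after = 711, 178     # unit is elided in the source

param_ratio = reduction_ratio(params_before_m, params_after_m)
memory_ratio = reduction_ratio(memory_before, memory_after)

print(f"parameter reduction: {param_ratio:.2f}x")
print(f"memory reduction:    {memory_ratio:.2f}x")
```

Under these assumptions, the memory ratio comes out close to the 4× compression ratio reported in the abstract.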

DALTON - Deep Local Learning in SNNs via local Weights and Surrogate-Derivative Transfer

IEEE Transactions on Emerging Topics in Computing, 2024, pp. 1-12

Authors: Gaurav, Ramashish; Do, Duy Anh; Doan, Thinh; Yi, Yang. Department of Electrical and Computer Engineering, Virginia Tech, USA

Direct training of Spiking Neural Networks (SNNs) is a challenging task because of their inherent temporality. In addition, vanilla Back-propagation based methods are not applicable, due to the non-differentiability of the spikes in SNNs. Surrogate-Derivative based methods with Backpropagation Through Time (BPTT) address these direct training challenges quite well; however, such methods are not neuromorphic-hardware friendly for the On-chip training of SNNs. Recently formalized Three-Factor based Rules (TFR) for direct local-training of SNNs are neuromorphic-hardware friendly; however, they do not effectively leverage the depth of the SNN architectures (we show it empirically here), and thus are limited. In this work, we present an improved version of a conventional three-factor rule for local learning in SNNs, which effectively leverages depth in the context of learning features hierarchically. Taking inspiration from the Back-propagation algorithm, we theoretically derive our improved, local, three-factor based learning method, named DALTON (Deep LocAl Learning via local WeighTs and SurrOgate-Derivative TraNsfer), which employs weights and surrogate-derivative transfer from the local layers. Along the lines of TFR, our proposed method DALTON is also amenable to neuromorphic-hardware implementation. Through extensive experiments on static (MNIST, FMNIST, & CIFAR10) and event-based (N-MNIST, DVS128-Gesture, & DVSCIFAR10) datasets, we show that our proposed local-learning method DALTON makes effective use of the depth in Convolutional SNNs, compared to the vanilla TFR implementation.

Keywords: System-on-chip

Video captioning using transformer-based GAN

Multimedia Tools and Applications, 2025, Vol. 84, No. 10, pp. 7091-7113

Authors: Babavalian, Mohammad Reza; Kiani, Kourosh. Electrical and Computer Engineering Department, Semnan University, Semnan, Iran

Video captioning is the process of automatically generating natural language descriptions of video content. Historically, most video captioning methods have relied on extending Sequence-to-Sequence (Seq2Seq) models. However, such approaches possess limitations due to the sequential nature of the captions, which leads to less accurate captions. To address this limitation, this paper introduces a novel end-to-end architecture for video captioning that combines conditional Wasserstein Generative Adversarial Networks (cWGAN) with a transformer model. The proposed architecture consists of two modules: feature extraction and caption generation. The feature extraction module aims to obtain an encoded feature vector representing the video contents, while the caption generation module generates human-readable captions from the encoded feature vector. To the best of our knowledge, this is the first architecture for generative video captioning that integrates a transformer model with a GAN. The results of the proposed model based on the BLEU-4, METEOR, ROUGE-L, and CIDEr metrics, on two datasets, MSVD (BLEU-4 = 61.2, METEOR = 41.6) and MSR-VTT (BLEU-4 = 61.2, METEOR = 31.1), compared to state-of-the-art approaches, demonstrate the effectiveness of the transformer with a generative model in generating accurate and human-readable captions. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

Keywords: Generative adversarial networks

Realtime and Integrated Framework for LiDAR-based Object Tracking

Journal of Institute of Control, Robotics and Systems, 2025, Vol. 31, No. 3, pp. 196-205

Authors: Lee, Gyuseok; Kim, Kana; Lee, Jejun; Kim, Hakil. Department of Electrical and Computer Engineering, Inha University, Republic of Korea

This study proposes a real-time integrated framework for LiDAR-based object tracking in autonomous driving environments. Advancements in LiDAR sensors are increasing point cloud data collection, leading to a demand for reliable real-time processing methods. The proposed framework applies voxelization and ground removal techniques to reduce computational load and integrates clustering and deep learning-based object recognition to ensure stability. Combining the point cloud data from LiDAR and the IMU data corrects distortions and refines real-time object movement, enabling accurate tracking in dynamic environments. This framework supports a maximum detection range of 100 m, with a computation time of 52 ms, a positional error of 1.06 m, a heading error of 3.79°, a relative velocity error of 1.46 m/s, and an average tracking frame count of 101, thereby improving object recognition accuracy and tracking performance while fulfilling real-time processing requirements. © ICROS 2025.

Keywords: 3D object detection; autonomous driving; LiDAR; object tracking
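The abstract above mentions voxelization as a way to reduce computational load before clustering and recognition. As a rough illustration only, the sketch below shows a generic centroid-based voxel grid downsampling step. The abstract does not specify the authors' actual method, so the function name `voxel_downsample`, the centroid rule, and the voxel size are all assumptions.

```python
from collections import defaultdict
from math import floor

Point = tuple[float, float, float]


def voxel_downsample(points: list[Point], voxel_size: float) -> list[Point]:
    """Replace all points that fall in the same cubic voxel by their centroid."""
    if voxel_size <= 0:
        raise ValueError("voxel_size must be positive")

    # Group points by the integer index of the voxel that contains them.
    buckets: dict[tuple[int, int, int], list[Point]] = defaultdict(list)
    for x, y, z in points:
        key = (floor(x / voxel_size), floor(y / voxel_size), floor(z / voxel_size))
        buckets[key].append((x, y, z))

    # Emit one centroid per occupied voxel, in sorted voxel order for determinism.
    centroids: list[Point] = []
    for key in sorted(buckets):
        pts = buckets[key]
        n = len(pts)
        centroids.append((
            sum(p[0] for p in pts) / n,
            sum(p[1] for p in pts) / n,
            sum(p[2] for p in pts) / n,
        ))
    return centroids


# Hypothetical toy cloud: two points share a voxel, one lies in a neighbouring voxel.
cloud = [(0.1, 0.1, 0.1), (0.3, 0.3, 0.3), (1.2, 0.0, 0.0)]
print(voxel_downsample(cloud, voxel_size=1.0))
```

A larger `voxel_size` merges more points and lowers the downstream cost, at the price of spatial detail. The right trade-off depends on sensor resolution and detection range.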

MCVL: Multi-directional Camera-based Visual Localization Resistant to Seasonal and Illumination Changes

Journal of Institute of Control, Robotics and Systems, 2025, Vol. 31, No. 2, pp. 153-159

Authors: Mun, Giyoung; Kim, Hakil. Department of Electrical and Computer Engineering, Inha University, Republic of Korea

Global navigation satellite system-based technology has inherent limitations due to its reliance on radio signals. In contrast, visual localization operates independently of radio communication, presenting a viable solution to overcome these limitations. However, it is susceptible to seasonal and illumination variations, highlighting the need for research to address these challenges. Therefore, this paper proposes multi-directional camera-based visual localization, which is robust against seasonal and illumination changes. The proposed method combines images from multiple directions and extracts global deep learning features. Subsequently, local deep learning features are extracted to preserve the characteristics of each combined image, allowing for the identification of geographically similar images. This approach utilizes multi-directional cameras, enabling resilient performance under various constraints. Moreover, it demonstrates an improvement of 7.56% in recall rate at a 1-meter threshold compared to existing methods. © ICROS 2025.

Keywords: Global positioning system

Deep Learning-Based Cloud Security: Innovative Attack Detection and Privacy Focused Key Management

IEEE Transactions on Computers, 2025, Vol. 74, No. 6, pp. 1978-1989

Authors: Ahmad, Shahnawaz; Arif, Mohd; Mehfuz, Shabana; Ahmad, Javed; Nazim, Mohd. Bennett University, School of Computer Science Engineering and Technology, Greater Noida 201310, India; Galgotias University, School of Computer Science & Engineering, Greater Noida 203201, India; Jamia Millia Islamia, Department of Electrical Engineering, New Delhi 110025, India; Sharda University, Department of Computer Science and Engineering, Sharda School of Engineering and Technology, Greater Noida 201310, India; Noida Institute of Engineering and Technology, Department of Computer Science and Engineering, School of Computer Science and IT, Greater Noida 201306, India

Cloud Computing (CC) is widely adopted in sectors like education, healthcare, and banking due to its scalability and cost-effectiveness. However, its internet-based nature exposes it to cyber threats, necessitating advanced security frameworks. Traditional models suffer from high false positives and limited adaptability. To address these challenges, VECGLSTM, an attack detection model integrating Variable Long Short-Term Memory (VLSTM), capsule networks, and the Enhanced Gannet Optimization Algorithm (EGOA), is introduced. This hybrid approach enhances accuracy, reduces false positives, and dynamically adapts to evolving threats. EGOA is employed for its superior optimization capability, ensuring faster convergence and resilience. Additionally, Chaotic Cryptographic Pelican Tunicate Swarm Optimization (CCPTSO) is proposed for privacy-preserving key management. This model combines chaotic cryptographic techniques with the Pelican Tunicate Swarm Optimization Algorithm (PTSOA), leveraging the pelican algorithm’s exploration strength and the tunicate swarm’s exploitation ability for optimal encryption security. Performance evaluation demonstrates 99.675% accuracy, 99.5175% recall, 99.7075% precision, and 99.615% F1-score, along with reduced training (1.79s), encryption (0.986s), and decryption (1.029s) times. This research significantly enhances CC security by providing a scalable, adaptive framework that effectively counters evolving cyber threats while ensuring efficient key management. © 1968-2012 IEEE.

Keywords: Cloud security

IEEE Transactions on Artificial Intelligence

IEEE Transactions on Artificial Intelligence, 2024, Vol. 5, No. 9, pp. 4354-4363

Authors: Roy, Ayush; Mohiuddin, S.K.; Sarkar, Ram. Jadavpur University, Department of Electrical Engineering, Kolkata 700032, India; Asutosh College, Department of Computer Science, Kolkata 700026, India; Jadavpur University, Department of Computer Science and Engineering, Kolkata 700032, India

The process of modifying digital images has been made significantly easier by the availability of a wide range of image editing software. However, in a variety of contexts, including journalism, judicial processes, and historical documentation, the authenticity of images is of utmost importance. In particular, copy-move forgery is a distinct type of image manipulation, where a portion of an image is copied and pasted into another area of the same image, creating a fictitious or altered version of the original. In this research, we present a lightweight MultiResUnet architecture with a similarity-based positional attention module (SPAM) for copy-move forgery detection (CMFD). By using a similarity measure across the patches of the features, this attention module identifies the patches where a forged region is present. The lightweight network also aids in resource-efficient training and transforms the model into one that can be used in real time. We have employed four commonly used but extremely difficult CMFD datasets, namely CoMoFoD, COVERAGE, CASIA v2, and MICC-F600, to assess the effectiveness of our model. The proposed model significantly lowers false positives, thereby improving the pixel-level accuracy and dependability of CMFD tools. © 2020 IEEE.

Keywords: Real time systems

Optimal monitoring and attack detection of networks modeled by Bayesian attack graphs

Cybersecurity, 2024, Vol. 7, No. 1, pp. 1-15

Authors: Armita Kazeminajafabadi, Mahdi Imani. Department of Electrical and Computer Engineering, Northeastern University, Boston, MA, USA

Early attack detection is essential to ensure the security of complex networks, especially those in critical *** is particularly crucial in networks with multi-stage attacks, where multiple nodes are connected to external sources, through which attacks could enter and quickly spread to other network *** attack graphs (BAGs) are powerful models for security risk assessment and mitigation in complex networks, which provide the probabilistic model of attackers’ behavior and attack progression in the *** attack detection techniques developed for BAGs rely on the assumption that network compromises will be detected through routine monitoring, which is unrealistic given the ever-growing complexity of *** paper derives the optimal minimum mean square error (MMSE) attack detection and monitoring policy for the most general form of *** exploiting the structure of BAGs and their partial and imperfect monitoring capacity, the proposed detection policy achieves the MMSE optimality possible only for linear-Gaussian state space models using Kalman *** adaptive resource monitoring policy is also introduced for monitoring nodes if the expected predictive error exceeds a user-defined *** and efficient matrix-form computations of the proposed policies are provided, and their high performance is demonstrated in terms of the accuracy of attack detection and the most efficient use of available resources using synthetic Bayesian attack graphs with different topologies.

Keywords: Multi-stage attacks; Bayesian attack graph; Attack detection; Optimal monitoring

Capacitorless Solid-state Power Filter for Single-phase DC-AC Converters

CES Transactions on Electrical Machines and Systems, 2024, Vol. 8, No. 3, pp. 367-377

Authors: Haitham Kanakri, Euzeli C. Dos Santos. Purdue University, Electrical and Computer Engineering Department, Indianapolis, IN 46202, USA

Converters rely on passive filtering as a crucial element due to the high-frequency operational characteristics of power *** filtering methods involve a dual inductor-capacitor (LC) cell or an inductor-capacitor-inductor (LCL) ***, capacitors are susceptible to wear-out mechanisms and failure ***, the necessity for monitoring and regular replacement adds to an elevated cost of ownership for such *** utilization of an active output power filter can be used to diminish the dimensions of the LC filter and the electrolytic dc-link capacitor, even though the inclusion of capacitors remains an indispensable part of the *** paper introduces capacitorless solid-state power filter (SSPF) for single-phase dc-ac *** proposed configuration is capable of generating a sinusoidal ac voltage without relying on *** proposed filter, composed of a planar transformer and an H-bridge converter operating at high frequency, injects voltage harmonics to attain a sinusoidal output *** design parameters of the planar transformer are incorporated, and the impact of magnetizing and leakage inductances on the operation of the SSPF is *** analysis, supported by simulation and experimental results, are provided for a design example for a single-phase *** total harmonic distortion observed in the output voltage is well below the IEEE 519 *** system operation is experimentally tested under both steady-state and dynamic conditions. A comparison with existing technology is presented, demonstrating that the proposed topology reduces the passive components used for filtering.

Keywords: Solid-state power filter (SSPF); Capacitorless topology; Active output power filter (AOF); Planar transformer; Electrolytic capacitor; Lifetime; Wear-out mechanisms

Refinement modeling and verification of secure operating systems for communication in digital twins

Digital Communications and Networks, 2024, Vol. 10, No. 2, pp. 304-314

Authors: Zhenjiang Qian, Gaofei Sun, Xiaoshuang Xing, Gaurav Dhiman. School of Computer Science and Engineering, Changshu Institute of Technology, Suzhou 215500, China; University Centre for Research and Development, Department of Computer Science and Engineering, Chandigarh University, Mohali 140413, India; Department of Computer Science and Engineering, Graphic Era Deemed to be University, Dehradun 248002, India

In traditional digital twin communication system testing, we can apply test cases as completely as possible in order to ensure the correctness of the system implementation, and even then, there is no guarantee that the digital twin communication system implementation is completely *** verification is currently recognized as a method to ensure the correctness of software systems for communication in digital twins because it uses rigorous mathematical methods to verify the correctness of systems for communication in digital twins and can effectively help system designers determine whether the system is designed and implemented *** this paper, we use the interactive theorem proving tool Isabelle/HOL to construct the formal model of the X86 architecture, and to model the related assembly *** verification result shows that the system states obtained after the operations of relevant assembly instructions are consistent with the expected states, indicating that the system meets the design expectations.

Keywords: Theorem proving; Isabelle/HOL; Formal verification; System modeling; Correctness verification
