Search results - Inner Mongolia University Library

Revisiting face detection: Supercharging Viola-Jones with particle swarm optimization for enhanced performance

Journal of Intelligent and Fuzzy Systems, 2024, Vol. 46, No. 4, pp. 10727-10741

Authors: Mohana, M.; Subashini, P.; Shukla, Diksha. Affiliations: Centre for Machine Learning and Intelligence, Department of Computer Science, Avinashilingam Institute, Tamil Nadu, Coimbatore, India; Department of Electrical Engineering and Computer Science, University of Wyoming, Laramie, WY, United States

In recent years, face detection has emerged as a prominent research field within Computer Vision (CV) and Deep Learning. Detecting faces in images and video sequences remains a challenging task due to various factors such as pose variation, varying illumination, occlusion, and scale differences. Despite the development of numerous face detection algorithms in deep learning, the Viola-Jones algorithm, with its simple yet effective approach, continues to be widely used in real-time camera applications. The conventional Viola-Jones algorithm employs AdaBoost for classifying faces in images and videos. The challenge lies in working with cluttered real-time facial images. AdaBoost needs to search through all possible thresholds for all samples to find the minimum training error when receiving features from Haar-like detectors. Therefore, this exhaustive search consumes significant time to discover the best threshold values and optimize feature selection to build an efficient classifier for face detection. In this paper, we propose enhancing the conventional Viola-Jones algorithm by incorporating Particle Swarm Optimization (PSO) to improve its predictive accuracy, particularly in complex face images. We leverage PSO in two key areas within the Viola-Jones framework. Firstly, PSO is employed to dynamically select optimal threshold values for feature selection, thereby improving computational efficiency. Secondly, we adapt the feature selection process using AdaBoost within the Viola-Jones algorithm, integrating PSO to identify the most discriminative features for constructing a robust classifier. Our approach significantly reduces the feature selection process time and search complexity compared to the traditional algorithm, particularly in challenging environments. We evaluated our proposed method on a comprehensive face detection benchmark dataset, achieving impressive results, including an average true positive rate of 98.73% and a 2.1% higher average prediction accura

Keywords: Face recognition

Multi-Representation Spatial-Temporal Graph Convolutional Networks for Network Traffic Prediction

IEEE Internet of Things Journal, 2025, Vol. 12, No. 13, pp. 23085-23099

Authors: Yang, Yang; He, Yechen; Zhao, Binnan; Wu, Celimuge; Gao, Zhipeng; Rui, Lanlan. Affiliations: Beijing University of Posts and Telecommunications, School of Computer Science, Beijing 100876, China; University of Electro-Communications, Department of Computer and Network Engineering, Tokyo 1828585, Japan

With the rapid proliferation of the Internet of Things, network traffic prediction has become crucial for intelligent network management, enabling more reliable and flexible services for a vast array of IoT devices and applications. The heterogeneous and dynamic nature of IoT networks introduces complex spatial relations and underlying periodic dependencies in spatial-temporal graphs that existing methods struggle to model effectively. In this paper, we propose Multi-Representation Spatial-Temporal Graph Convolutional Networks (MRSTGCN), a novel unified framework specifically designed to address these challenges. MRSTGCN integrates a Multi-Representation Graph Convolutional Network (MRGCN) module to model node heterogeneity and complex traffic propagation, and two complementary embedding modules - Historical Embedding and Temporal Embedding - to capture and fuse periodic dependencies across different fine-grained temporal cycles. Extensive experiments are conducted on two network traffic datasets, and the results demonstrate that MRSTGCN achieves state-of-the-art performance with obvious improvements in MAE, RMSE and MAPE on three prediction horizons. © 2025 IEEE.

Keywords: Convolutional neural networks

Adaptive Flowing Traffic Prediction in Contention Random Access for Optimizing Virtual/Physical Resource in B5G/5G New Radio and Core Network

IEEE Transactions on Network Science and Engineering, 2024, Vol. 11, No. 2, pp. 1934-1946

Authors: Chang, Ben-Jye; Lin, Yu-Ting. Affiliations: National Yunlin University of Science and Technology, Department of Computer Science and Information Engineering, Douliu City 64002, Taiwan

For differentiating and customizing different classes of traffic and virtualizing physical resources of networks and machines, B5G/5G specifies several novel mechanisms, including VNF, SDN, Service Function Chaining, Network Slicing, MEC, etc. In B5G/5G, the out-of-band contention based random access using PHY subframe preambles is specified before data transmissions. However, B5G/5G requires dynamically efficient approaches for realizing real 5G, including randomly contending limited sharing channel preambles, dynamically determining flow slicing of SFC, dynamically determining PM/VNFi/VNFc resources, while minimizing carrying cost and energy consumption via multi-link/multi-path 5G network. This paper thus proposes the efficient Adaptive Slice Flowing Prediction in Contention Random Access for Optimizing Virtual/Physical Resource in 5G (APROR) consisting of two phases: Dynamic Flow Random Contention of 5G (dFRC) and Adaptive Real-time Predictive Flow Data Rate of diverse types of slicing (aRPF). Numerical results show APROR outperforms all approaches in performance metrics: contention collision probability, access delay, E2E delay, loss, throughput, etc. Consequently, several contributions are definitely achieved: 1) to dynamically determine the preamble configuration modes and dynamic flow backoff, 2) to adaptively differentiate collision domains for different types of flows and minimize collision probability, and 3) to determine the optimal Virtual/Physical Resource achieving optimal SFC corresponding to network slices. © 2013 IEEE.

Keywords: Real time systems

HoLens: A visual analytics design for higher-order movement modeling and visualization

Computational Visual Media, 2024, Vol. 10, No. 6, pp. 1079-1100

Authors: Zezheng Feng; Fang Zhu; Hongjun Wang; Jianing Hao; Shuang-Hua Yang; Wei Zeng; Huamin Qu. Affiliations: Department of Computer Science and Engineering, The Hong Kong University of Science and Technology, Hong Kong, China; Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen 518055, China; Thrust of Computational Media and Arts (CMA), The Hong Kong University of Science and Technology (Guangzhou), Guangzhou 511458, China; Department of Computer Science, University of Reading, Berkshire RG6 6AH, UK; Shenzhen Key Laboratory of Safety and Security for Next Generation of Industrial Internet, Southern University of Science and Technology, Shenzhen 518055, China

Higher-order patterns reveal sequential multistep state transitions, which are usually superior to origin-destination analyses that depict only first-order geospatial movement *** methods for higher-order movement modeling first construct a directed acyclic graph (DAG) of movements and then extract higher-order patterns from the ***, DAG-based methods rely heavily on identifying movement keypoints, which are challenging for sparse movements and fail to consider the temporal variants critical for movements in urban *** overcome these limitations, we propose HoLens, a novel approach for modeling and visualizing higher-order movement patterns in the context of an urban *** mainly makes twofold contributions: First, we designed an auto-adaptive movement aggregation algorithm that self-organizes movements hierarchically by considering spatial proximity, contextual information, and temporal ***, we developed an interactive visual analytics interface comprising well-established visualization techniques, including the H-Flow for visualizing the higher-order patterns on the map and the higher-order state sequence chart for representing the higher-order state *** real-world case studies demonstrate that the method can adaptively aggregate data and exhibit the process of exploring higher-order patterns using *** also demonstrate the feasibility, usability, and effectiveness of our approach through expert interviews with three domain experts.

Keywords: data visualization; movement modeling; state sequence visualization; movement visualization; urban visual analytics

A Survey on Enhancing Image Captioning with Advanced Strategies and Techniques

Computer Modeling in Engineering & Sciences, 2025, Vol. 142, No. 3, pp. 2247-2280

Authors: Alaa Thobhani; Beiji Zou; Xiaoyan Kui; Amr Abdussalam; Muhammad Asim; Sajid Shah; Mohammed ELAffendi. Affiliations: School of Computer Science and Engineering, Central South University, Changsha 410083, China; Electronic Engineering and Information Science Department, University of Science and Technology of China, Hefei 230026, China; EIAS Data Science Lab, College of Computer and Information Sciences, Prince Sultan University, Riyadh 11586, Saudi Arabia

Image captioning has seen significant research efforts over the last *** goal is to generate meaningful semantic sentences that describe visual content depicted in photographs and are syntactically *** real-world applications rely on image captioning, such as helping people with visual impairments to see their *** formulate a coherent and relevant textual description, computer vision techniques are utilized to comprehend the visual content within an image, followed by natural language processing *** approaches and models have been developed to deal with this multifaceted *** models prove to be state-of-the-art solutions in this *** work offers an exclusive perspective emphasizing the most critical strategies and techniques for enhancing image caption *** than reviewing all previous image captioning work, we analyze various techniques that significantly improve image caption generation and achieve significant performance improvements, including encompassing image captioning with visual attention methods, exploring semantic information types in captions, and employing multi-caption generation ***, advancements such as neural architecture search, few-shot learning, multi-phase learning, and cross-modal embedding within image caption networks are examined for their transformative *** comprehensive quantitative analysis conducted in this study identifies cutting-edge methodologies and sheds light on their profound impact, driving forward the forefront of image captioning technology.

Keywords: Image captioning; semantic attention; multi-caption; natural language processing; visual attention methods

A Construction of Object Detection Model for Acute Myeloid Leukemia

Intelligent Automation & Soft Computing, 2023, Vol. 36, No. 4, pp. 543-560

Authors: K. Venkatesh; S. Pasupathy; S. P. Raja. Affiliations: Department of Computer Science and Engineering, Annamalai University, Chidambaram 608002, India; School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, India

The evolution of bone marrow morphology is necessary in Acute Myeloid Leukemia (AML) *** takes an enormous number of times to analyze with the standardization and inter-observer ***, we proposed a novel AML detection model using a Deep Convolutional Neural Network (D-CNN). The proposed Faster R-CNN (Faster Region-Based CNN) models are trained with Morphological *** proposed Faster R-CNN model is trained using the augmented *** overcoming the Imbalanced Data problem, data augmentation techniques are *** Faster R-CNN performance was compared with existing transfer learning *** results show that the Faster R-CNN performance was significant than other *** number of images in each class is *** example, the Neutrophil (segmented) class consists of 8,486 images, and Lymphocyte (atypical) class consists of eleven *** dataset is used to train the CNN for single-cell morphology classifi*** proposed work implies the high-class performance server called Nvidia Tesla V100 GPU (Graphics processing unit).

Keywords: Acute myeloid leukemia (AML); convolutional neural network (CNN); Nvidia Tesla V100 GPU

Text Augmentation-Based Model for Emotion Recognition Using Transformers

Computers, Materials & Continua, 2023, Vol. 76, No. 9, pp. 3523-3547

Authors: Fida Mohammad; Mukhtaj Khan; Safdar Nawaz Khan Marwat; Naveed Jan; Neelam Gohar; Muhammad Bilal; Amal Al-Rasheed. Affiliations: Department of Computer Science, The University of Haripur, Haripur 22620, Pakistan; Department of Computer Systems Engineering, Faculty of Electrical and Computer Engineering, University of Engineering and Technology Peshawar, Peshawar 25120, Pakistan; Department of Electronics Engineering Technology, University of Technology, Nowshera 24100, Pakistan; Department of Computer Science, Shaheed Benazir Bhutto Women University, Peshawar 25000, Pakistan; Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh 11671, Saudi Arabia

Emotion Recognition in Conversations (ERC) is fundamental in creating emotionally ***-Based Network (GBN) models have gained popularity in detecting conversational contexts for ERC ***, their limited ability to collect and acquire contextual information hinders their *** propose a Text Augmentation-based computational model for recognizing emotions using transformers (TA-MERT) to address *** proposed model uses the Multimodal Emotion Lines Dataset (MELD), which ensures a balanced representation for recognizing human *** used text augmentation techniques to produce more training data, improving the proposed model's *** encoders train the deep neural network (DNN) model, especially Bidirectional Encoder (BE) representations that capture both forward and backward contextual *** integration improves the accuracy and robustness of the proposed ***, we present a method for balancing the training dataset by creating enhanced samples from the original *** balancing the dataset across all emotion categories, we can lessen the adverse effects of data imbalance on the accuracy of the proposed *** results on the MELD dataset show that TA-MERT outperforms earlier methods, achieving a weighted F1 score of 62.60% and an accuracy of 64.36%. Overall, the proposed TA-MERT model solves the GBN models' weaknesses in obtaining contextual data for ***-MERT model recognizes human emotions more accurately by employing text augmentation and transformer-based *** balanced dataset and the additional training samples also enhance its *** findings highlight the significance of transformer-based approaches for special emotion recognition in conversations.

Keywords: Emotion recognition in conversation; graph-based network; text augmentation-based model; multimodal emotion lines dataset; bidirectional encoder representation for transformer

Tightening QC Relaxations of AC Optimal Power Flow Through Improved Linear Convex Envelopes

IEEE Transactions on Power Systems, 2025, Vol. 40, No. 2, pp. 1465-1480

Authors: Narimani, Mohammad Rasoul; Molzahn, Daniel K.; Davis, Katherine R.; Crow, Mariesa L. Affiliations: Department of Electrical and Computer Engineering, Northridge, CA 91330, United States; Georgia Institute of Technology, School of Electrical and Computer Engineering, Atlanta, GA 30332, United States; Texas A&M University, Electrical and Computer Engineering Department, College Station, TX 77840, United States; Missouri University of Science and Technology, Electrical and Computer Engineering Department, Rolla, MO 65409, United States

AC optimal power flow (AC OPF) is a fundamental problem in power system operations. Accurately modeling the network physics via the AC power flow equations makes AC OPF a challenging nonconvex problem. To search for global optima, recent research has developed various convex relaxations that bound the optimal objective values of AC OPF problems. The QC relaxation convexifies the AC OPF problem by enclosing the non-convex terms within convex envelopes. The QC relaxation's accuracy strongly depends on the tightness of these envelopes. This paper proposes two improvements for tightening QC relaxations of OPF problems. We first consider a particular nonlinear function whose projections are the nonlinear expressions appearing in the polar representation of the power flow equations. We construct a polytope-shaped convex envelope around this nonlinear function and derive convex expressions for the nonlinear terms using its projections. Second, we use sine and cosine expression properties, along with changes in their curvature, to tighten this convex envelope. We also propose a coordinate transformation to tighten the envelope by rotating power flow equations based on individual bus-specific angles. We compare these enhancements to a state-of-the-art QC relaxation method using PGLib-OPF test cases, revealing improved optimality gaps in 68% of the cases. © 1969-2012 IEEE.

Keywords: Load flow optimization

A Novel Light Weight CNN Framework Integrated with Marine Predator Optimization for the Assessment of Tear Film-Lipid Layer Patterns

Computer Modeling in Engineering & Sciences, 2023, Vol. 136, No. 7, pp. 87-106

Authors: Bejoy Abraham; Jesna Mohan; Linu Shine; Sivakumar Ramachandran. Affiliations: Department of Computer Science and Engineering, College of Engineering Muttathara, Thiruvananthapuram, Kerala 695008, India; Department of Computer Science and Engineering, Mar Baselios College of Engineering and Technology, Thiruvananthapuram, Kerala 695015, India; Department of Electronics and Communication Engineering, College of Engineering Trivandrum, Kerala 695016, India

Tear film, the outermost layer of the eye, is a complex and dynamic structure responsible for tear *** tear film lipid layer is a vital component of the tear film that provides a smooth optical surface for the cornea and wetting the ocular *** eye syndrome (DES) is a symptomatic disease caused by reduced tear production, poor tear quality, or excessive *** diagnosis is a difficult task due to its multifactorial *** of several clinical tests available, the evaluation of the interference patterns of the tear film lipid layer forms a potential tool for DES *** instrument known as Tearscope Plus allows the rapid assessment of the lipid layer. A grading scale composed of five categories is used to classify lipid layer *** reported work proposes the design of an automatic system employing light weight convolutional neural networks (CNN) and nature inspired optimization techniques to assess the tear film lipid layer patterns by interpreting the images acquired with the Tearscope *** designed framework achieves promising results compared with the existing state-of-the-art techniques.

Keywords: Dry-eye syndrome; tearscope plus; tear film; deep neural networks

Artificial Intelligence Enabled Future Wireless Electric Vehicles with Multi-Model Learning and Decision Making Models

Tsinghua Science and Technology, 2024, Vol. 29, No. 6, pp. 1776-1784

Authors: Gajula Ramesh; Anil Kumar Budati; Shayla Islam; Louai A. Maghrabi; Abdullah Al-Atwai. Affiliations: Department of Computer Science and Engineering, Gokaraju Rangaraju Institute of Engineering & Technology, Hyderabad 500090, India; Institute of Computer Science and Digital Innovation (ICSDI), UCSI University, Kuala Lumpur 56000, Malaysia, also with Department of ECE, Koneru Lakshmaiah Education Foundation, Hyderabad 500090, India; ICSDI, UCSI University, Kuala Lumpur 56000, Malaysia; Department of Software Engineering, College of Engineering, University of Business and Technology, Jeddah 21448, Kingdom of Saudi Arabia; Department of Computer Science, Applied College, University of Tabuk, Tabuk 47512, Kingdom of Saudi Arabia

In the contemporary era, driverless vehicles are a reality due to the proliferation of distributed technologies, sensing technologies, and Machine to Machine (M2M) ***, the emergence of deep learning techniques provides more scope in controlling and making such vehicles energy *** existing methods, it is understood that there have been many approaches found to automate safe driving in autonomous and electric vehicles and also their energy ***, the models focus on different aspects *** is need for a comprehensive framework that exploits multiple deep learning models in order to have better control using Artificial Intelligence (AI) on autonomous driving and energy *** this end, we propose an AI-based framework for autonomous electric vehicles with multi-model learning and decision *** focuses on both safe driving in highway scenarios and energy *** deep learning based framework is realized with many models used for localization, path planning at high level, path planning at low level, reinforcement learning, transfer learning, power control, and speed *** reinforcement learning, state-action-feedback play important role in decision *** simulation implementation reveals that the efficiency of the AI-based approach towards safe driving of autonomous electric vehicle gives better performance than that of the normal electric vehicles.

Keywords: wireless vehicles; deep learning; multi-model learning; reinforcement learning; Artificial Intelligence (AI)
