Search Results - Inner Mongolia University Library

Earthworm Optimization with Improved SqueezeNet Enabled Facial Expression Recognition Model

Computer Systems Science & Engineering, 2023, Vol. 46, No. 8, pp. 2247-2262

Authors: N. Sharmili, Saud Yonbawi, Sultan Alahmari, E. Laxmi Lydia, Mohamad Khairi Ishak, Hend Khalid Alkahtani, Ayman Aljarbouh, Samih M. Mostafa. Affiliations: Computer Science and Engineering Department, Gayatri Vidya Parishad College of Engineering for Women, Visakhapatnam, Andhra Pradesh, India; Department of Software Engineering, College of Computer Science and Engineering, University of Jeddah, Jeddah, Saudi Arabia; King Abdul Aziz City for Science and Technology, Riyadh, Kingdom of Saudi Arabia; Department of Computer Science and Engineering, Vignan’s Institute of Information Technology, Visakhapatnam 530049, India; School of Electrical and Electronic Engineering, Engineering Campus, Universiti Sains Malaysia (USM), Nibong Tebal, Penang 14300, Malaysia; Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh 11564, Saudi Arabia; Department of Computer Science, University of Central Asia, Naryn 722600, Kyrgyzstan; Faculty of Computers and Information, South Valley University, Qena 83523, Egypt

Facial expression recognition (FER) remains a hot research area among computer vision researchers and still becomes a challenge because of high intraclass *** techniques for this problem depend on hand-crafted features, namely, LBP, SIFT, and HOG, along with that a classifier trained on a database of videos or *** execute perform well on image datasets captured in a controlled condition; however not perform well in the more challenging dataset, which has partial faces and image ***, many studies presented an endwise structure for facial expression recognition by utilizing DL ***, this study develops an earthworm optimization with an improved SqueezeNet-based FER (EWOISN-FER) *** presented EWOISN-FER model primarily applies the contrast-limited adaptive histogram equalization (CLAHE) technique as a pre-processing *** addition, the improved SqueezeNet model is exploited to derive an optimal set of feature vectors, and the hyperparameter tuning process is performed by the stochastic gradient boosting (SGB) ***, EWO with sparse autoencoder (SAE) is employed for the FER process, and the EWO algorithm appropriately chooses the SAE ***-ranging experimental analysis is carried out to examine the performance of the proposed *** experimental outcomes indicate the supremacy of the presented EWOISN-FER technique.

Keywords: facial expression recognition; deep learning; computer vision; earthworm optimization; hyperparameter optimization


An Enhanced Searching Strategy for Multi-Agent Mobile Applications


China Communications, 2022, Vol. 19, No. 11, pp. 282-296

Authors: Xiaoyu Zhang, Wei Liu, Fangchun Yang. Affiliations: School of Computer Science (National Pilot Software Engineering School), Beijing University of Posts and Telecommunications, Beijing 100876, China; State Key Laboratory of Networking and Switching Technology (Beijing University of Posts and Telecommunications), Beijing 100876, China

Multi-agent mobile applications play an essential role in mobile applications and have attracted more and more researchers’ *** work has always focused on multi-agent applications with perfect *** are usually based on human-designed rules to provide decision-making searching ***, existing methods for solving perfect-information mobile applications cannot be directly applied to imperfect-information mobile ***, we take the Contact Bridge, a multi-agent application with imperfect information, for the case *** propose an enhanced searching strategy to deal with multi-agent applications with imperfect *** design a self-training bidding system model and apply a Recurrent Neural Network (RNN) to model the bidding *** bridge system model consists of two parts, a bidding prediction system based on imitation learning to get a contract quickly and a visualization system for hands understanding to realize regular communication between ***, to dynamically analyze the impact of other players’ unknown hands on our final reward, we design a Monte Carlo sampling algorithm based on the bidding system model (BSM) to deal with imperfect *** the same time, a double-dummy analysis model is designed to efficiently evaluate the results of *** results indicate that our searching strategy outperforms the top rule-based mobile applications.

Keywords: multi-agent mobile applications; imperfect information; deep neural network; Monte Carlo; Contact Bridge
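As an illustration of the Monte Carlo idea mentioned in this abstract, the following is a minimal sketch and not the authors' algorithm. It assumes uniform random deals of the unseen cards, an optional hypothetical `is_consistent` filter standing in for whatever constraints a bidding model would impose, and a caller-supplied hypothetical `evaluate` function standing in for a double-dummy style evaluator.

```python
import random
from typing import Callable, Dict, List, Sequence

Deal = Dict[str, List[str]]


def sample_hidden_hands(known_hand: Sequence[str], deck: Sequence[str],
                        players: Sequence[str], rng: random.Random) -> Deal:
    """Deal every card outside known_hand uniformly at random to the other players."""
    known = set(known_hand)
    unseen = [card for card in deck if card not in known]
    rng.shuffle(unseen)
    size = len(unseen) // len(players)
    return {p: unseen[i * size:(i + 1) * size] for i, p in enumerate(players)}


def estimate_reward(known_hand: Sequence[str], deck: Sequence[str],
                    players: Sequence[str],
                    evaluate: Callable[[Sequence[str], Deal], float],
                    is_consistent: Callable[[Deal], bool] = lambda deal: True,
                    n_samples: int = 200, seed: int = 0) -> float:
    """Average evaluate() over sampled deals accepted by is_consistent().

    Both callables are hypothetical placeholders, not part of the cited paper.
    """
    rng = random.Random(seed)
    total, accepted = 0.0, 0
    for _ in range(n_samples):
        deal = sample_hidden_hands(known_hand, deck, players, rng)
        if is_consistent(deal):  # simple rejection step
            total += evaluate(known_hand, deal)
            accepted += 1
    if accepted == 0:
        raise ValueError("no sampled deal satisfied is_consistent")
    return total / accepted
```

Rejection sampling is used here only because it is the simplest way to condition on side information; a real system would likely sample more efficiently.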


User Information Perception in Virtual Reality Environment

2022 IEEE SmartWorld, 19th IEEE International Conference on Ubiquitous Intelligence and Computing, 2022 IEEE International Conference on Autonomous and Trusted Vehicles Conference, 22nd IEEE International Conference on Scalable Computing and Communications, 2022 IEEE International Conference on Digital Twin, 8th IEEE International Conference on Privacy Computing and 2022 IEEE International Conference on Metaverse, SmartWorld/UIC/ATC/ScalCom/DigitalTwin/PriComp/Metaverse 2022

Authors: Chang, Enyao; Che, Xiaoping; Cui, Jingzhi; Qu, Chenxin. Affiliations: Beijing Jiaotong University, School of Software Engineering, Beijing, China; Beihang University, School of Computer Science, Beijing, China

ISBN: (print) 9798350346558

With the development of virtual reality (VR) technology, panoramic video, a new method that fuses VR technology and panoramic video technology, has gradually emerged and developed rapidly. Nowadays, VR panoramic video is becoming the most important application of virtual reality technology. Although the amount of video information is large, the user acceptance rate is relatively low and the perception is weak, which has a certain negative impact on the popularity of VR. Currently, there are few studies in the field of VR information perception, and the existing studies lack a detailed division of user characteristics and panoramic video types. Therefore, this paper mainly discusses the following two questions: Is the user's information perception level in the VR environment significantly better than that in the traditional media environment? Do event types and user characteristics in VR videos affect users' perception of information? To answer these questions, a total of 20 participants were recruited for the research. Based on statistical analysis, this paper concludes that media type has a significant impact on user information perception and that perception in the VR environment is significantly better than in traditional media. In addition, this paper finds that user characteristics and the proportion of event types in the video affect the user information perception effect. The results show that the higher the level of education, the better the information perception effect (ages between 20 and 30); and the higher the proportion of mobile events and emergencies in the video, the better the user information perception effect. © 2022 IEEE.

Keywords: Virtual reality


Mask guided diverse face image synthesis


Frontiers of Computer Science, 2022, Vol. 16, No. 3, pp. 67-75

Authors: Song SUN, Bo ZHAO, Muhammad MATEEN, Xin CHEN, Junhao WEN. Affiliations: School of Big Data & Software Engineering, Chongqing University, Chongqing 401331, China; Department of Computer Science, The University of British Columbia, Vancouver, BC V6T 1Z4, Canada; Department of Computer Science, Air University Multan Campus, Multan 60000, Pakistan

Recent studies have shown remarkable success in face image generation ***, existing approaches have limited diversity, quality and controllability in generating *** address these issues, we propose a novel end-to-end learning framework to generate diverse, realistic and controllable face images guided by face *** face mask provides a good geometric constraint for a face by specifying the size and location of different components of the face, such as eyes, nose and *** framework consists of four components: style encoder, style decoder, generator and *** style encoder generates a style code which represents the style of the result face; the generator translates the input face mask into a real face based on the style code; the style decoder learns to reconstruct the style code from the generated face image; and the discriminator classifies an input face image as real or *** the style code, the proposed model can generate different face images matching the input face mask, and by manipulating the face mask, we can finely control the generated face *** empirically demonstrate the effectiveness of our approach on the mask-guided face image synthesis task.

Keywords: face image generation; image translation; generative adversarial networks


Revisiting adversarial patches for designing camera-agnostic attacks against person detection

Proceedings of the 38th International Conference on Neural Information Processing Systems

Authors: Hui Wei, Zhixiang Wang, Kewei Zhang, Jiaqi Hou, Yuanwei Liu, Hao Tang, Zheng Wang. Affiliations: National Engineering Research Center for Multimedia Software, School of Computer Science, Wuhan University; The University of Tokyo; School of Computer Science, Peking University

ISBN: (print) 9798331314385

Physical adversarial attacks can deceive deep neural networks (DNNs), leading to erroneous predictions in real-world scenarios. To uncover potential security risks, attacking the safety-critical task of person detection has garnered significant attention. However, we observe that existing attack methods overlook the pivotal role of the camera, involving capturing real-world scenes and converting them into digital images, in the physical adversarial attack workflow. This oversight leads to instability and challenges in reproducing these attacks. In this work, we revisit patch-based attacks against person detectors and introduce a camera-agnostic physical adversarial attack to mitigate this limitation. Specifically, we construct a differentiable camera Image Signal Processing (ISP) proxy network to compensate for the physical-to-digital transition gap. Furthermore, the camera ISP proxy network serves as a defense module, forming an adversarial optimization framework with the attack module. The attack module optimizes adversarial patches to maximize effectiveness, while the defense module optimizes the conditional parameters of the camera ISP proxy network to minimize attack effectiveness. These modules engage in an adversarial game, enhancing cross-camera stability. Experimental results demonstrate that our proposed Camera-Agnostic Patch (CAP) attack effectively conceals persons from detectors across various imaging hardware, including two distinct cameras and four smartphones.


MedKAN: An Advanced Kolmogorov-Arnold Network for Medical Image Classification


arXiv, 2025

Authors: Yang, Zhuoqin; Zhang, Jiansong; Luo, Xiaoling; Lu, Zheng; Shen, Linlin. Affiliations: School of Computer Science & Software Engineering, Shenzhen University, Shenzhen, China; School of Computer Science, University of Nottingham Ningbo China, Ningbo, China

Recent advancements in deep learning for image classification predominantly rely on convolutional neural networks (CNNs) or Transformer-based architectures. However, these models face notable challenges in medical imaging, particularly in capturing intricate texture details and contextual features. Kolmogorov-Arnold Networks (KANs) represent a novel class of architectures that enhance nonlinear transformation modeling, offering improved representation of complex features. In this work, we present MedKAN, a medical image classification framework built upon KAN and its convolutional extensions. MedKAN features two core modules: the Local Information KAN (LIK) module for fine-grained feature extraction and the Global Information KAN (GIK) module for global context integration. By combining these modules, MedKAN achieves robust feature modeling and fusion. To address diverse computational needs, we introduce three scalable variants: MedKAN-S, MedKAN-B, and MedKAN-L. Experimental results on nine public medical imaging datasets demonstrate that MedKAN achieves superior performance compared to CNN- and Transformer-based models, highlighting its effectiveness and generalizability in medical image analysis. Copyright © 2025, The Authors. All rights reserved.

Keywords: Image classification


Flame Image Detection Algorithm Based on Computer Vision


IAENG International Journal of Computer Science, 2023, Vol. 50, No. 4, p. 1

Authors: Sun, Xiaoqing; Cui, Wenhua; Tao, Ye; Wang, Zhaoyang. Affiliation: School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan, China

Fire is one of the most common disasters for human beings. It is also one of the disasters that cameras can easily catch. In order to detect a series of building fires efficiently, a fire image detection algorithm based on YOLOv4 is proposed in this paper. This algorithm can realize the real-time fire warning by identifying all images in the video. The comparison of different evaluation results found that the YOLOv4 fire image detection algorithm using both optimization algorithms achieved higher AP and recall rates. © 2023, International Association of Engineers. All rights reserved.

Keywords: computer vision


Efficient Scheduling Mapping Algorithm for Row Parallel Coarse-Grained Reconfigurable Architecture


Tsinghua Science and Technology, 2021, Vol. 26, No. 5, pp. 724-735

Authors: Naijin Chen, Zhen Wang, Ruixiang He, Jianhui Jiang, Fei Cheng, Chenghao Han. Affiliations: School of Computer and Information Science, Anhui Polytechnic University, Wuhu 241000, China; School of Computer Science and Technology, Shanghai University of Electric Power, Shanghai 200090, China; School of Software Engineering, Tongji University, Shanghai 201804, China

Row Parallel Coarse-Grained Reconfigurable Architecture (RPCGRA) has the advantages of maximum parallelism and programmable *** an efficient algorithm to map the diverse applications onto RPCGRA is difficult due to a number of RPCGRA hardware *** solve this problem, the nodes of the data flow graph must be partitioned and scheduled onto the *** this paper, we present a Depth-First Greedy Mapping (DFGM) algorithm that simultaneously considers the communication costs and the use times of the Reconfigurable Cell Array (RCA). Compared with level breadth mapping, the performance of DFGM is *** percentage of maximum improvement in the use times of RCA is 33% and the percentage of maximum improvement in non-original input and output times is 64.4% (given Discrete Cosine Transform 8 (DCT8), and the area of reconfigurable processing unit is 56). Compared with level-based depth mapping, DFGM also obtains the lowest averages of use times of RCA, non-original input and output times, and the reconfigurable time.

Keywords: temporal mapping; Reconfigurable Cell Array (RCA); listed scheduling; communication costs


Archer: Adaptive Memory Compression with Page-Association-Rule Awareness for High-Speed Response of Mobile Devices

23rd USENIX Conference on File and Storage Technologies, FAST 2025

Authors: Li, Changlong; Zhu, Zongwei; Wang, Chao; Liu, Fangming; Xu, Fei; Sha, Edwin H.-M.; Zhou, Xuehai. Affiliations: School of Computer Science and Technology, East China Normal University, China; Jianghuai Advance Technology Center, Hefei 230026, China; MoE Engineering Research Center of Hardware/Software Co-Design Technology and Application, China; School of Software Engineering, University of Science and Technology of China, Hefei 230026, China; Suzhou Institute for Advanced Research, University of Science and Technology of China, Suzhou 215123, China; Huazhong University of Science and Technology, China; Peng Cheng Laboratory, China

ISBN: (print) 9781939133458

In mobile systems, memory can be compressed page-by-page to save space. This approach is widely adopted because memory data is accessed by page. However, this paper shows that the system response speed is significantly limited by page-grained compression. In this paper, we observe that approximately a quarter of anonymous memory pages are highly correlated, even though the association is implicit. Inspired by this, we propose Archer, an association-rule-aware memory compression framework in mobile systems. Archer demonstrates that memory in mobile devices should be compressed using flexible granularity, rather than relying solely on traditional page compression. To further integrate association-rule mining techniques into system design, we redesign the LRU mechanism and propose an adaptive memory compression region. Experimental results show that the average app launching speed is 1.55x faster when enabling Archer, and the average photographic speed and frame rate increase by 1.42x and 1.31x, respectively, compared to the state-of-the-art. © 2025 FAST. All Rights Reserved.

Keywords: Portable equipment


An Efficient Privacy-Preserving Access Control Scheme for Cloud Computing Services


IEEE Transactions on Consumer Electronics, 2025

Authors: Xiong, Ling; Wang, JunKai; Yu, Linsheng; Xiong, Neal; Wu, Hanzhou. Affiliations: Xihua University, School of Computer and Software Engineering, Chengdu 610039, China; Sul Ross State University, School of Department of Computer Science and Mathematics, Alpine, TX 79830, United States; Shanghai University, School of Computer and Software, Shanghai, China

To meet the future system requirements of Cloud Computing Services (CCSs) for large numbers of users, multiple services and high efficiency, authentication and access control technologies will evolve in a more secure and efficient direction. The integration of authentication and access control into a single module is critical to improving privacy and efficiency. However, current schemes may involve trade-offs and hard choices between privacy and efficiency. To overcome such limitations, our work proposes an efficient privacy-preserving scheme with integrated authentication and access control for CCS environments. Unlike traditional authentication techniques, our scheme determines not only the legitimacy of the user's identity but also the access permissions of remote users during the authentication process. Users can access multiple cloud computing providers with a single credential that includes the corresponding identity and permissions. Moreover, the access permissions of different cloud service providers contained within a single credential are protected by a key-based Merkle tree technology to protect users' privacy. The security evaluation indicates that the proposed framework is secure in the random oracle model. Furthermore, the proposed framework exhibits better computational and communication efficiency than related schemes. Therefore, this work effectively enhances the efficiency of services, which is important for the use of access control technology in CCS environments. © 2025 IEEE.

Keywords: Differential privacy
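The abstract does not say how its key-based Merkle tree is built, so the following is only a generic sketch of a keyed Merkle tree, not the scheme from the paper. It assumes HMAC-SHA256 as the keyed hash, duplicates the last node on odd-sized levels, and uses hypothetical permission strings; all function names are illustrative.

```python
import hashlib
import hmac
from typing import List, Tuple

Proof = List[Tuple[bytes, bool]]  # (sibling hash, True if current node is the left child)


def _h(key: bytes, data: bytes) -> bytes:
    """Keyed hash; HMAC-SHA256 is an assumption of this sketch."""
    return hmac.new(key, data, hashlib.sha256).digest()


def build_levels(key: bytes, leaves: List[bytes]) -> List[List[bytes]]:
    """Return all tree levels, from hashed leaves up to the single root."""
    if not leaves:
        raise ValueError("at least one leaf is required")
    level = [_h(key, b"\x00" + leaf) for leaf in leaves]  # 0x00 marks a leaf
    levels = [level]
    while len(level) > 1:
        if len(level) % 2:
            level = level + [level[-1]]  # pad odd levels by repeating the last node
        level = [_h(key, b"\x01" + level[i] + level[i + 1])  # 0x01 marks an inner node
                 for i in range(0, len(level), 2)]
        levels.append(level)
    return levels


def prove(levels: List[List[bytes]], index: int) -> Proof:
    """Collect the sibling hashes needed to recompute the root from one leaf."""
    proof: Proof = []
    for level in levels[:-1]:
        sib = index ^ 1
        sibling = level[sib] if sib < len(level) else level[index]
        proof.append((sibling, index % 2 == 0))
        index //= 2
    return proof


def verify(key: bytes, leaf: bytes, proof: Proof, root: bytes) -> bool:
    """Recompute the root from a leaf and its proof, then compare in constant time."""
    node = _h(key, b"\x00" + leaf)
    for sibling, node_is_left in proof:
        pair = node + sibling if node_is_left else sibling + node
        node = _h(key, b"\x01" + pair)
    return hmac.compare_digest(node, root)
```

With a tree like this, a holder can reveal a single permission plus its proof to one provider without disclosing the other leaves, and a party without the key cannot recompute or test guesses against the root.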
