检索结果-内蒙古大学图书馆

Vehicle color recognition based on smooth modulation neural network with multi-scale feature fusion

Frontiers of computer science 2023年第3期17卷 91-102页

作者： Mingdi HU Long BAI Jiulun FAN Sirui ZHAO Enhong CHEN School of Communications and Information Engineering&School of Artificial Intelligence Xi’an University of Posts&TelecommunicationsXi’an 710121China School of Computer Science and Technology University of Science and Technology of ChinaHefei 230026China

Vehicle Color Recognition(VCR)plays a vital role in intelligent traffic management and criminal investigation ***,the existing vehicle color datasets only cover 13 classes,which can not meet the current actual ***,although lots of efforts are devoted to VCR,they suffer from the problem of class imbalance in *** address these challenges,in this paper,we propose a novel VCR method based on Smooth Modulation Neural Network with Multi-Scale Feature Fusion(SMNN-MSFF).Specifically,to construct the benchmark of model training and evaluation,we first present a new VCR dataset with 24 vehicle classes,Vehicle Color-24,consisting of 10091 vehicle images from a 100-hour urban road surveillance ***,to tackle the problem of long-tail distribution and improve the recognition performance,we propose the SMNN-MSFF model with multiscale feature fusion and smooth *** former aims to extract feature information from local to global,and the latter could increase the loss of the images of tail class instances for training with ***,comprehensive experimental evaluation on Vehicle Color-24 and previously three representative datasets demonstrate that our proposed SMNN-MSFF outperformed state-of-the-art VCR *** extensive ablation studies also demonstrate that each module of our method is effective,especially,the smooth modulation efficiently help feature learning of the minority or tail *** Color-24 and the code of SMNN-MSFF are publicly available and can contact the author to obtain.

关键词： vehicle color recognition benchmark dataset multi-scale feature fusion long-tail distribution improved smooth l1 loss

来源：评论

学校读者我要写书评

暂无评论

A Privacy Preservation Method for Attributed Social Network Based on Negative Representation of Information

引用

computer Modeling in Engineering & sciences 2024年第7期140卷 1045-1075页

作者： Hao Jiang Yuerong Liao Dongdong Zhao Wenjian Luo Xingyi Zhang Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education School of Computer Science and TechnologyAnhui UniversityHefei230601China Chongqing Research Institute School of Computer Science and Artificial IntelligenceWuhan University of TechnologyWuhan430070China Guangdong Provincial Key Laboratory of Novel Security Intelligence Technologies School of Computer Science and TechnologyHarbin Institute of TechnologyShenzhen518055China

Due to the presence of a large amount of personal sensitive information in social networks,privacy preservation issues in social networks have attracted the attention of many *** by the self-nonself discrimination paradigmin the biological immune system,the negative representation of information indicates features such as simplicity and efficiency,which is very suitable for preserving social network ***,we suggest a method to preserve the topology privacy and node attribute privacy of attribute social networks,called ***,a negative survey-based method is developed to disturb the relationship between nodes in the social network so that the topology structure can be kept ***,a negative database-based method is proposed to hide node attributes,so that the privacy of node attributes can be preserved while supporting the similarity estimation between different node attributes,which is crucial to the analysis of social *** evaluate the performance of the AttNetNRI,empirical studies have been conducted on various attribute social networks and compared with several state-of-the-art methods tailored to preserve the privacy of social *** experimental results show the superiority of the developed method in preserving the privacy of attribute social networks and demonstrate the effectiveness of the topology disturbing and attribute hiding *** experimental results show the superiority of the developed methods in preserving the privacy of attribute social networks and demonstrate the effectiveness of the topological interference and attribute-hiding components.

关键词： Attributed social network topology privacy node attribute privacy negative representation of information negative survey negative database

来源：评论

学校读者我要写书评

暂无评论

Incomplete Multi-View Clustering via Auto-Weighted Fusion in Partition Space

引用

Tsinghua science and technology 2023年第3期28卷 595-611页

作者： Dongxue Xia Yan Yang Shuhong Yang School of Computing and Artificial Intelligence Southwest Jiaotong UniversityChengdu 611756China School of Computer Guangxi University of Science and TechnologyLiuzhou 545006China

As a class of effective methods for incomplete multi-view clustering,graph-based algorithms have recently drawn wide ***,most of them could use further improvement regarding the following ***,in some graph-based models,all views are forced to share a common similarity graph regardless of the severe consistency degeneration due to incomplete ***,similarity graph construction and cluster analysis are sometimes performed ***,the contribution difference of individual views is not always carefully *** address these issues simultaneously,this paper proposes an incomplete multi-view clustering algorithm based on auto-weighted fusion in partition *** our algorithm,the information of cluster structure is introduced into the process of similarity learning to construct a desirable similarity graph,information fusion is performed in partition space to alleviate the negative impact brought about by consistency degradation,and all views are adaptively weighted to reflect their different contributions to clustering ***,all the subtasks are collaboratively optimized in a united framework to reach an overall optimal *** results show that the proposed method compares favorably with the state-of-the-art methods.

关键词： Incomplete Multi-view Clustering(IMC) partition space auto-weighted fusion collaborative optimization

来源：评论

学校读者我要写书评

暂无评论

Evolutionary Particle Swarm Optimization Algorithm Based on Collective Prediction for Deployment of Base Stations

引用

computers, Materials & Continua 2025年第1期82卷 345-369页

作者： Jiaying Shen Donglin Zhu Yujia Liu Leyi Wang Jialing Hu Zhaolong Ouyang Changjun Zhou Taiyong Li School of Computer Science and Technology Zhejiang Normal UniversityJinhua321004China School of Future Technologies Jiangxi Institute of Applied Science and TechnologyNanchang330000China School of Computing and Artificial Intelligence Southwestern University of Finance and EconomicsChengdu611130China

The wireless signals emitted by base stations serve as a vital link connecting people in today’s society and have been occupying an increasingly important role in real *** development of the Internet of Things(IoT)relies on the support of base stations,which provide a solid foundation for achieving a more intelligent way of *** a specific area,achieving higher signal coverage with fewer base stations has become an urgent ***,this article focuses on the effective coverage area of base station signals and proposes a novel Evolutionary Particle Swarm Optimization(EPSO)algorithm based on collective prediction,referred to herein as *** a new strategy called neighbor-based evolution prediction(NEP)addresses the issue of premature convergence often encountered by *** also employs a strengthening evolution(SE)strategy to enhance the algorithm’s global search capability and efficiency,ensuring enhanced robustness and a faster convergence speed when solving complex optimization *** better adapt to the actual communication needs of base stations,this article conducts simulation experiments by changing the number of base *** experimental results demonstrate thatunder the conditionof 50 ormore base stations,ECPPSOconsistently achieves the best coverage rate exceeding 95%,peaking at 99.4400%when the number of base stations reaches *** results validate the optimization capability of the ECPPSO algorithm,proving its feasibility and *** ablative experiments and comparisons with other algorithms highlight the advantages of ECPPSO.

关键词： Particle swarm optimization effective coverage area global optimization base station deployment

来源：评论

学校读者我要写书评

暂无评论

PrompTHis: Visualizing the Process and Influence of Prompt Editing during Text-to-Image Creation

引用

IEEE Transactions on Visualization and computer Graphics 2024年 PP卷 1-12页

作者： Guo, Yuhan Shao, Hanning Liu, Can Xu, Kai Yuan, Xiaoru National Key Laboratory of General Artificial Intelligence and School of Intelligence Science and Technology Peking University China

Generative text-to-image models, which allow users to create appealing images through a text prompt, have seen a dramatic increase in popularity in recent years. However, most users have a limited understanding of how such models work and often rely on trial and error strategies to achieve satisfactory results. The prompt history contains a wealth of information that could provide users with insights into what has been explored and how the prompt changes impact the output image, yet little research attention has been paid to the visual analysis of such process to support users. We propose the Image Variant Graph, a novel visual representation designed to support comparing prompt-image pairs and exploring the editing history. The Image Variant Graph models prompt differences as edges between corresponding images and presents the distances between images through projection. Based on the graph, we developed the PrompTHis system through co-design with artists. Based on the review and analysis of the prompting history, users can better understand the impact of prompt changes and have a more effective control of image generation. A quantitative user study and qualitative interviews demonstrate that PrompTHis can help users review the prompt history, make sense of the model, and plan their creative process. IEEE

关键词： Visualization

来源：评论

学校读者我要写书评

暂无评论

Scene text recognition via dual character counting-aware visual and semantic modeling network

引用

science China(Information sciences) 2024年第3期67卷 313-314页

作者： Ke XIAO Anna ZHU Brian Kenji IWANA Cheng-Lin LIU School of Computer Science and Artificial Intelligence Wuhan University of Technology Human Interface Laboratory Kyushu University State Key Laboratory of Multimodal Artificial Intelligence Systems Institute of AutomationChinese Academy of Sciences School of Artificial Intelligence University of Chinese Academy of Sciences

Scene text recognition(STR) is drawing increasing attention nowadays due to its wide application in real life. Character counting information, as auxiliary information, has been shown to be effective in boosting text ... 详细信息

关键词： Decoding

来源：评论

学校读者我要写书评

暂无评论

LesionAid: vision transformers-based skin lesion generation and classification – A practical review

引用

Multimedia Tools and Applications 2025年 1-22页

作者： K, Mallikharjuna Rao Krishna, Ghanta Sai Supriya, Kundrapu Sorgile, Meetiksha Data Science and Artificial Intelligence International Institute of Information Technology Naya Raipur India Computer Science and Engineering International Institute of Information Technology Naya Raipur India

Skin cancer is one of the most prevalent forms of human cancer. It is recognized mainly visually, beginning with clinical screening and continuing with the dermoscopic examination, histological assessment, and specimen collection. Deep convolutional neural networks (CNNs) perform highly segregated and potentially universal tasks against a classified fine-grained object. This study suggests a novel multi-class prediction framework that uses ViT and ViTGAN to categories skin lesions. To address the class disparity, GANs (Generative Adversarial Networks) based on vision transformers are used. The framework comprises four main phases: ViTGANs, Image processing, and explainable AI. Phase 1 consists of generating synthetic images using ViTGAN to balance all the classes in the dataset. To enhance the amount of the data, the second phase involves using various morphological processes and data augmentation techniques. In phases three and four, a ViT model for edge computing systems that can recognize patterns and classify skin lesions from the user's skin that is visible in the picture is developed. In phase 3, after classifying the lesions into the desired class with ViT, we will use explainable AI (XAI) that leads to more explainable results (using activation maps, etc.) while ensuring high predictive accuracy. The results demonstrate that the model used for generating synthetic images has achieved an FID score of 13.32 and the ViT model has achieved 99.2% as its training accuracy and 97.4% as its validation accuracy. The whole framework is compared with the existing frameworks for skin lesion detection. And explainable AI (XAI) is used in the proposed framework in order to increase model transparency and boost trust among users by illustrating the main factors affecting predictions. This interpretability helps physicians in making more informed decisions by providing clear insights into the model's reasoning. © The Author(s), under exclusive licence to Springer science+Bu

关键词： Generative adversarial networks

来源：评论

学校读者我要写书评

暂无评论

A Comprehensive Exploration of Real-Time 3-D View Reconstruction Methods

IEEE Transactions on Artificial Intelligence

引用

IEEE Transactions on artificial intelligence 2024年第12期5卷 5915-5927页

作者： Agrawal, Arya Sharma, Teena Verma, Nishchal K. Vellore Institute of Technology School of Computer Science and Engineering Vellore632014 India Indian Institute of Technology Mehta Family School of Data Science and Artificial Intelligence Guwahati781039 India Indian Institute of Technology Kanpur Department of Electrical Engineering 208016 India

Real-time 3-D view reconstruction in an unfamiliar environment poses complexity for various applications due to varying conditions such as occlusion, latency, precision, etc. This article thoroughly examines and tests contemporary methodologies addressing challenges in 3-D view reconstruction. The methods being explored in this article are categorized into volumetric and mesh, generative adversarial network based, and open source library based methods. The exploration of these methods undergoes detailed discussions, encompassing methods, advantages, limitations, and empirical results. The real-time testing of each method is done on benchmarked datasets, including ShapeNet, Pascal 3D+, Pix3D, etc. The narrative highlights the crucial role of 3-D view reconstruction in domains such as robotics, virtual and augmented reality, medical imaging, cultural heritage preservation, etc. The article also anticipates future scopes by exploring generative models, unsupervised learning, and advanced sensor fusion to increase the robustness of the algorithms. © 2020 IEEE.

关键词： Generative adversarial networks

来源：评论

学校读者我要写书评

暂无评论

SinGRAV: Learning a Generative Radiance Volume from a Single Natural Scene

引用

Journal of computer science & technology 2024年第2期39卷 305-319页

作者：王玉洁陈学霖陈宝权 School of Computer Science and Technology Shandong UniversityQingdao 266237China State Key Laboratory of General Artificial Intelligence Beijing 100871China School of Intelligence Science and Technology Peking UniversityBeijing 100871China Tencent AI Lab Tencent Holdings LimitedShenzhen 518057China

We present SinGRAV, an attempt to learn a generative radiance volume from multi-view observations of a single natural scene, in stark contrast to existing category-level 3D generative models that learn from images of many object-centric scenes. Inspired by SinGAN, we also learn the internal distribution of the input scene, which necessitates our key designs w.r.t. the scene representation and network architecture. Unlike popular multi-layer perceptrons (MLP)-based architectures, we particularly employ convolutional generators and discriminators, which inherently possess spatial locality bias, to operate over voxelized volumes for learning the internal distribution over a plethora of overlapping regions. On the other hand, localizing the adversarial generators and discriminators over confined areas with limited receptive fields easily leads to highly implausible geometric structures in the spatial. Our remedy is to use spatial inductive bias and joint discrimination on geometric clues in the form of 2D depth maps. This strategy is effective in improving spatial arrangement while incurring negligible additional computational cost. Experimental results demonstrate the ability of SinGRAV in generating plausible and diverse variations from a single scene, the merits of SinGRAV over state-of-the-art generative neural scene models, and the versatility of SinGRAV by its use in a variety of applications. Code and data will be released to facilitate further research.

关键词： generative model neural radiance field 3D scene generation

来源：评论

学校读者我要写书评

暂无评论

A Dynamic YOLO-Based Sequence-Matching Model for Efficient Coverless Image Steganography

引用

computers, Materials & Continua 2024年第11期81卷 3221-3240页

作者： Jiajun Liu Lina Tan Zhili Zhou Weijin Jiang Yi Li Peng Chen School of Computer Science Hunan University of Technology and BusinessChangsha410205China Institute of Artificial Intelligence Guangzhou UniversityGuangzhou510555China

Many existing coverless steganography methods establish a mapping relationship between cover images and hidden *** issue with these methods is that as the steganographic capacity increases,the number of images stored in the database grows *** makes it challenging to build and manage a large image *** improve the image library utilization and anti-attack capability of the steganography system,we propose an efficient coverless scheme based on dynamically matched *** utilize You Only Look Once(YOLO)for selecting optimal objects and create a mapping dictionary between these objects and scrambling *** this dictionary,each image is effectively assigned to a specific scrambling factor,which is then used to scramble the receiver’s sequence *** achieve sufficient steganography capability with a limited image library,all substrings of the scrambled sequences have the potential to hide *** matching the secret information,the ideal number of stego images will be obtained from the *** to experimental results,this technology outperforms most previous works in terms of data load,transmission security,and hiding *** can recover an average of 79.85%of secret information under typical geometric attacks,and only approximately 200 random images are needed to achieve a capacity of 19 bits per image.

关键词： Coverless steganography object detection YOLO

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：