检索结果-内蒙古大学图书馆

Robust Multi-Graph Contrastive Network for Incomplete Multi-View Clustering

IEEE Transactions on Multimedia 2025年 27卷 2747-2759页

作者： Xue, Zhe Li, Yawen Guan, Zhongchao Li, Wenling Liang, Meiyu Zhou, Hai Beijing University of Posts and Telecommunications Beijing Key Laboratory of Intelligent Telecommunication Software and Multimedia School of Computer Science Beijing100876 China Beijing University of Posts and Telecommunications School of Economics and Management Beijing100876 China Beihang University School of Automation Science and Electrical Engineering Beijing100191 China

Food categorization is pivotal in numerous aspects of everyday life, assisting in the selection of food, managing diets, and addressing essential survival requirements. By leveraging the complementary information of various views, multi-view learning usually achieves superior performance compared to the single-view learning methods. However, characterized by the unrestrained openness of internet platforms and potential inconsistencies in food data collection processes, multi-view features often suffer from data loss, resulting in incomplete multi-view food data. Conventional multi-view clustering methods often falter in effectively capitalizing on the diverse correlations contained in food data, and exhibit limitations in dealing with the noise and irregularities pervading different views. Addressing these challenges, this paper presents the Robust Multi-Graph Contrastive network (RMGC) for multi-view food clustering. RMGC artfully combines multi-view representation learning with multi-graph contrastive regularization, creating a cohesive framework to manage incomplete multi-view data. By developing a multi-view encoding network, RMGC seamlessly blends various views into a cohesive representation, astutely assessing the significance of each view. More importantly, the proposed robust multi-graph contrastive regularization enhances the precision of the learned representation and successfully counteracts the noise and unreliability in multi-view data. The experiments conducted across several multi-view datasets manifest the effectiveness of RMGC, showing its superiority over existing methods. Our method not only making an advancement in food categorization but also contributes to the broader field of multi-view learning, offering innovative solutions for handling incomplete and noisy multi-view data. © 1999-2012 IEEE.

关键词： Cluster analysis

来源：评论

学校读者我要写书评

暂无评论

ER-Net:Efficient Recalibration Network for Multi-ViewMulti-Person 3D Pose Estimation

引用

Computer Modeling in Engineering & sciences 2023年第8期136卷 2093-2109页

作者： Mi Zhou Rui Liu Pengfei Yi Dongsheng Zhou National and Local Joint Engineering Laboratory of Computer Aided Design School of Software EngineeringDalian UniversityDalian116622China School of Computer Science and Technology Dalian University of TechnologyDalian116024China

Multi-view multi-person 3D human pose estimation is a hot topic in the field of human pose estimation due to its wide range of application *** the introduction of end-to-end direct regression methods,the field has entered a new stage of ***,the regression results of joints that are more heavily influenced by external factors are not accurate enough even for the optimal *** this paper,we propose an effective feature recalibration module based on the channel attention mechanism and a relative optimal calibration strategy,which is applied to themulti-viewmulti-person 3D human pose estimation task to achieve improved detection accuracy for joints that are more severely affected by external ***,it achieves relative optimal weight adjustment of joint feature information through the recalibration module and strategy,which enables the model to learn the dependencies between joints and the dependencies between people and their corresponding *** call this method as the Efficient Recalibration Network(ER-Net).Finally,experiments were conducted on two benchmark datasets for this task,Campus and Shelf,in which the PCP reached 97.3% and 98.3%,respectively.

关键词： Multi-view multi-person pose estimation attention mechanism computer vision

来源：评论

学校读者我要写书评

暂无评论

Automated Functionality and Security Evaluation of Large Language Models 9

Automated Functionality and Security Evaluation of Large Lan...

引用

9th IEEE International Conference on Smart Cloud, SmartCloud 2024

作者： Ding, Minjie Shen, Ying Chen, Mingang Shanghai Key Laboratory of Computer Software Testing and Evaluating Shanghai Development Center of Computer Software Technology Shanghai China

ISBN: (纸本)9798350389500

Natural language processing (NLP) is rapidly developing. A series of Large Language Models (LLMs) have emerged, represented by ChatGPT, which have made significant breakthroughs in natural language understanding and generation, enabling fluent dialogue with humans, understanding human intentions, and completing complex tasks. However, in addition to the fairness and toxicity of traditional language models, some new problems, including hallucination, have also emerged in LLMs, making them hard to use. Evaluating LLMs manually is challenging due to subjectivity and inefficiency. In this paper, we focused on the fuzzy matching, toxicity detection, and hallucination detection in the evaluation of LLMs automatically, and fine-tune the Mixtral-8x7B Model, which can be deployed in private cloud environment, and prove the effectiveness of our method through experiments. © 2024 IEEE.

关键词： Toxicity

来源：评论

学校读者我要写书评

暂无评论

Interpretability Research of Variational Autoencoder Generation Process Based on Feature Disentanglement

Interpretability Research of Variational Autoencoder Generat...

引用

Artificial Intelligence, Networking and Information Technology (AINIT), International Seminar on

作者： Sicheng Xi Yanhui Peng Chongqing Key Laboratory of Computational Intelligence College of Computer Science and Technology Chongqing University of Posts and Telecommunications Chongqing China Chongqing Key Laboratory of Computational Intelligence School of Software Engineering Chongqing University of Posts and Telecommunications Chongqing China

ISBN: (数字)9798350385557

ISBN: (纸本)9798350385564

Variational Autoencoder (VAE), as one of the main generative models, has a powerful representation learning capability. However, the hidden space representation learned by VAE is a high-dimensional and complex vector space, which makes it difficult to explain how the model gradually learns and composes the final generated results on different semantic features. To address this problem, firstly, this paper increases the degree of decoupling between different semantic features by increasing the independence between the hidden variables of the modal hermitian space, and explains the learning process of the model on different hermitian spaces by visualization based on the feature decoupling model. In addition, this paper also proposes a hidden variable contribution index to measure the influence of different dimensional hidden variables on the generation results, so as to explain the learning process of the model.

关键词： Training Seminars Representation learning Visualization Correlation Semantics Vectors

来源：评论

学校读者我要写书评

暂无评论

Business Scenario Driven Reinforcement Learning Testing Method 26

Business Scenario Driven Reinforcement Learning Testing Meth...

引用

26th ACIS International Winter Conference on software Engineering, Artificial Intelligence, Networking and parallel/Distributed Computing, SNPD-Winter 2023

作者： Cai, Lizhi Wang, Jin School of Information Science and Engineering East China University of Science and Technology Shanghai Key Laboratory of Computer Software Testing and Evaluating Shanghai China

ISBN: (纸本)9798350345865

Reinforcement learning has been successfully applied in software testing, but the existing testing methods cannot perform effective testing according to the characteristics of applications, and using outdated interactive experience during training, resulting in inefficient testing. In this paper, we propose BSDRTesting. Firstly, the demonstration experience of human users is collected according to the functional scenarios and business logic of each application, and combining reinforcement learning and imitation learning to maximize rewards while imitating user behavior, experience replay aims to sample experiences from the agent's self-exploration and expert demonstrations to improve sampling efficiency. At the same time, according to the input rules, the black-box testing method is used to fully test the input events, and finally an experience filtering mechanism is proposed, and the reward value and TD-Error are used as the basis for priority sampling. The experimental results on 10 open source applications show BSDRTesting has achieved significant improvements in code coverage and branch coverage compared with existing methods. © 2023 IEEE.

关键词： Reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

Dual modality prompt learning for visual question-grounded answering in robotic surgery

引用

Visual Computing for Industry,Biomedicine,and Art 2024年第1期7卷 316-328页

作者： Yue Zhang Wanshu Fan Peixi Peng Xin Yang Dongsheng Zhou Xiaopeng Wei National and Local Joint Engineering Laboratory of Computer Aided Design School of Software EngineeringDalian UniversityDalian 116622LiaoningChina School of Computer Science and Technology Dalian University of TechnologyDalian 116081LiaoningChina

With recent advancements in robotic surgery,notable strides have been made in visual question answering(VQA).Existing VQA systems typically generate textual answers to questions but fail to indicate the location of the relevant content within the *** limitation restricts the interpretative capacity of the VQA models and their abil-ity to explore specific image *** address this issue,this study proposes a grounded VQA model for robotic surgery,capable of localizing a specific region during answer *** inspiration from prompt learning in language models,a dual-modality prompt model was developed to enhance precise multimodal information ***,two complementary prompters were introduced to effectively integrate visual and textual prompts into the encoding process of the model.A visual complementary prompter merges visual prompt knowl-edge with visual information features to guide accurate *** textual complementary prompter aligns vis-ual information with textual prompt knowledge and textual information,guiding textual information towards a more accurate inference of the ***,a multiple iterative fusion strategy was adopted for comprehensive answer reasoning,to ensure high-quality generation of textual and grounded *** experimental results vali-date the effectiveness of the model,demonstrating its superiority over existing methods on the EndoVis-18 and End-oVis-17 datasets.

关键词： Prompt learning Visual prompt Textual prompt Grounding-answering Visual question answering

来源：评论

学校读者我要写书评

暂无评论

Fake Information Analysis and Detection on Pandemic in Twitter

引用

SN Computer science 2022年第6期3卷 456页

作者： Jeyasudha, J. Seth, Prashnim Usha, G. Tanna, Pranesh Department of Computational Intelligence SRM Institute of Science and Technology Tamil Nadu Chennai India Department of Software Engineering SRM Institute of Science and Technology Tamil Nadu Chennai India Department of Computational Technologies SRM Institute of Science and Technology Tamil Nadu Chennai India

Twitter has become a popular platform to receive daily updates. The more the people rely on it, the more critical it becomes to get genuine information out. False information can easily be shared on Twitter, which influences people's feelings, especially if fake information is linked to COVID-19. Therefore, it is of utmost importance to detect fake information before it becomes uncontrollable. Real-time tweets were used as part of this study. A few features like tweet’s text, sentiment etc., were extracted and analyzed. The project returns a set of statistics determining the tweet’s veracity. In this study, various classifiers have been used to see which of them works best with the proposed model in classifying the used dataset. The proposed model achieved the best accuracy of 84.54% and the highest F1-score of 0.842 with Random Forest. With careful analysis while feature selection and using few features, the model developed is equivalent in performance to the other models that use a lot of features. This confirms that the model developed is less complex and highly dependable. © 2022, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.

关键词： Classification Fake information Machine learning Pandemic Random forest Twitter

来源：评论

学校读者我要写书评

暂无评论

Wireless Charging Scheduling for Long-term Utility Optimization

引用

ACM Transactions on Sensor Networks 2025年第1期21卷 1-31页

作者： Xu, Jia Chen, Wenbin Dai, Haipeng Xu, Lijie Xiao, Fu Liu, Linfeng Jiangsu Key Laboratory of Big Data Security and Intelligent Processing Nanjing University of Posts and Telecommunications Nanjing China The State Key Laboratory for Novel Software Technology Nanjing University Nanjing China

Wireless power transmission has been widely used to replenish energy for wireless sensor networks, where the energy consumption rate of sensor nodes is usually time varying and indefinite. However, few works have investigated the problem of long-term charging scheduling with random variable. This article designs an optimization model for the long-term scheduling of chargers to maximize the time-averaged charging utility while ensuring its time-averaged constraints of budget and response rate. The Lyapunov optimization technique is adopted to transform the stochastic optimization problem into a deterministic optimization problem, which remains NP-hard. Thus, an approximation algorithm following greedy approach is proposed to solve the deterministic optimization problem. We further provide the theoretical analysis of feasibility and performance guarantee of the proposed scheduling algorithm. The simulation results show that our algorithm outperforms three comparison algorithms by 6.53%, 20.04%, and 19.97% in terms of time-averaged charging utility, as well as by 11.25%, 4.42%, and 3.73% in terms of time-averaged response rate on average. © 2025 Copyright held by the owner/author(s). Publication rights licensed to ACM.

关键词： Inductive power transmission

来源：评论

学校读者我要写书评

暂无评论

Pretraining Billion-Scale Geospatial Foundational Models on Frontier

Pretraining Billion-Scale Geospatial Foundational Models on ...

引用

2024 IEEE International parallel and Distributed Processing Symposium Workshops, IPDPSW 2024

作者： Tsaris, Aristeidis Dias, Philipe Ambrozio Potnis, Abhishek Yin, Junqi Wang, Feiyi Lunga, Dalton National Center for Computational Sciences Oak Ridge National Laboratory Oak RidgeTN United States Geospatial Science and Human Security Oak Ridge National Laboratory Oak RidgeTN United States

ISBN: (纸本)9798350364606

As AI workloads increase in scope, generalization capability becomes challenging for small task-specific models and their demand for large amounts of labeled training samples increases. On the contrary, Foundation Models (FMs) are trained with internet-scale unlabeled data via self-supervised learning and have been shown to adapt to various tasks with minimal fine-tuning. Although large FMs have demonstrated significant impact in natural language processing and computer vision, efforts toward FMs for geospatial applications have been restricted to smaller size models, as pretraining larger models requires very large computing resources equipped with state-of-the-art hardware accelerators. Current satellite constellations collect 100+TBs of data a day, resulting in images that are billions of pixels and multimodal in nature. Such geospatial data poses unique challenges opening up new opportunities to develop FMs. We investigate billion scale FMs and HPC training profiles for geospatial applications by pretraining on publicly available data. We studied from end-to-end the performance and impact in the solution by scaling the model size. Our larger 3B parameter size model achieves up to 30% improvement in top1 scene classification accuracy when comparing a 100M parameter model. Moreover, we detail performance experiments on the Frontier supercomputer, America's first exascale system, where we study different model and data parallel approaches using PyTorch's Fully Sharded Data parallel library. Specifically, we study variants of the Vision Transformer architecture (ViT), conducting performance analysis for ViT models with size up to 15B parameters. By discussing throughput and performance bottlenecks under different parallelism configurations, we offer insights on how to leverage such leadership-class HPC resources when developing large models for geospatial imagery applications. © 2024 IEEE.

关键词： Remote sensing

来源：评论

学校读者我要写书评

暂无评论

A Hybrid Method to Measure Distribution Consistency of Mixed-Attribute Datasets

IEEE Transactions on Artificial Intelligence

引用

IEEE Transactions on Artificial Intelligence 2023年第1期4卷 182-196页

作者： He, Yulin Ye, Xuan Huang, Defa Fournier-Viger, Philippe Huang, Joshua Zhexue College of Computer Science and Software Engineering Shenzhen University Shenzhen518060 China National Engineering Laboratory for Big Data System Computing Technology Shenzhen University Shenzhen518060 China

Random sample partition (RSP) is a newly developed data management and processing model for Big Data processing and analysis. To apply the RSP model for Big Data computation tasks, it is very important to measure the distribution consistency of different datasets. Existing measurement methods for continuous-attribute and discrete-attribute datasets cannot directly deal with mixed-attribute datasets. In this article, we design a hybrid method to measure the distribution consistency among different mixed-attribute datasets by using a multilayer extreme learning machine (MLELM) and the generalized maximum mean discrepancy (GMMD) criterion, abbreviated as MLELM-GMMD. MLELM is first used to transform original mixed-attribute datasets into corresponding deep encoding datasets. Then, the GMMD criterion is applied to check the distribution consistency of the deep encoding datasets. Four experiments have been done to validate the feasibility and effectiveness of MLELM-GMMD, i.e., the impact of MLELM on the amount of information during mixed-attribute data transformation, the impact of MLELM on distributions of mixed-attribute data, the distribution consistencies of RSP and non-RSP data blocks, and the comparison with other measurement methods. Experimental results show that the proposed MLELM-GMMD method can measure the distribution consistency of mixed-attribute datasets more accurately than one-hot encoding-based methods. © 2022 IEEE.

关键词： Data mining

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：