Search Results - Inner Mongolia University Library

2025 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2025

Authors: Liu, Guoshan; Yin, Hailong; Zhu, Bin; Chen, Jingjing; Ngo, Chong-Wah; Jiang, Yu-Gang. Affiliations: School of Computer Science, Fudan University, Shanghai Key Lab of Intelligent Information Processing, China; Shanghai Collaborative Innovation Center on Intelligent Visual Computing, China; Singapore Management University, Singapore

ISBN: (print) 9798331510831

The growing interest in generating recipes from food images has drawn substantial research attention in recent years. Existing works for recipe generation primarily utilize a two-stage training method - first predicting ingredients from a food image and then generating instructions from both the image and ingredients. Large Multi-modal Models (LMMs), which have achieved notable success across a variety of vision and language tasks, shed light on generating both ingredients and instructions directly from images. Nevertheless, LMMs still face the common issue of hallucinations during recipe generation, leading to suboptimal performance. To tackle this issue, we propose a retrieval augmented large multimodal model for recipe generation. We first introduce Stochastic Diversified Retrieval Augmentation (SDRA) to retrieve recipes semantically related to the image from an existing datastore as a supplement, integrating them into the prompt to add diverse and rich context to the input image. Additionally, a Self-Consistency Ensemble Voting mechanism is proposed to determine the most confident prediction recipes as the final output. It calculates the consistency among generated recipe candidates, which use different retrieval recipes as context for generation. Extensive experiments validate the effectiveness of our proposed method, which demonstrates state-of-the-art (SOTA) performance in recipe generation on the Recipe1M dataset. © 2025 IEEE.

Keywords: Food ingredients
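The voting step described in the abstract above (scoring each generated candidate by its consistency with the others and keeping the most consistent one) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the token-level Jaccard similarity and the function names `jaccard` and `select_most_consistent` are assumptions made for this example, and the abstract does not say which consistency measure the paper uses.

```python
def jaccard(a: str, b: str) -> float:
    """Token-level Jaccard similarity (an assumed stand-in for the paper's consistency score)."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    union = sa | sb
    return len(sa & sb) / len(union) if union else 1.0


def select_most_consistent(candidates: list[str]) -> str:
    """Return the candidate with the highest mean similarity to all other candidates."""
    best_idx, best_score = 0, -1.0
    for i, cand in enumerate(candidates):
        others = [jaccard(cand, o) for j, o in enumerate(candidates) if j != i]
        score = sum(others) / len(others) if others else 0.0
        if score > best_score:  # ties keep the earliest candidate
            best_idx, best_score = i, score
    return candidates[best_idx]


# Hypothetical candidates, each imagined as generated with a different retrieved recipe as context.
candidates = [
    "mix flour eggs bake",
    "mix flour eggs",
    "mix flour sugar",
    "grill fish",
]
print(select_most_consistent(candidates))
```

In this toy input the second candidate overlaps most with the rest, so it is selected; an outlier such as the last candidate scores zero against every other output.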


Joint AP Clustering and Beamforming Design for RIS-Aided Cell-Free Networks


IEEE Transactions on Vehicular Technology, 2025, Vol. 74, No. 5, pp. 8315-8320

Authors: Xu, Chunmei; Jia, Yuanqi; Chen, Youjia; Huang, Wei. Affiliations: Southeast University, National Mobile Communications Research Laboratory, School of Information Science and Engineering, Nanjing 210096, China; Southeast University, School of Information Science and Engineering, Nanjing 210096, China; Fuzhou University, Fujian Key Lab for Intelligent Processing and Wireless Transmission of Media Information, College of Physics and Information Engineering, Fuzhou 350108, China; Hefei University of Technology, School of Computer Science and Information Engineering, Hefei 230601, China

Cell-free networks and reconfigurable intelligent surfaces (RIS) are two promising techniques for future wireless communications. The integration of RIS into cell-free networks, termed RIS-aided cell-free networks, offers the potential to significantly enhance network performance. However, the realization of this potential is constrained by the limited capacities of the fronthaul links. To address this challenge, we investigate the joint design of access point (AP) clustering, transmit and passive beamforming in RIS-aided cell-free networks. The objective is to maximize the weighted sum-rate performance while minimizing the number of clustered APs to alleviate the fronthaul overhead. The problem is formulated in a group sparse manner, employing a mixed zero-norm/two-norm term to represent the number of the clustered APs. To solve this problem, we first approximate the mixed zero-norm/two-norm term by the mixed one-norm/two-norm term and provide its equivalent formulation by introducing receive beamforming vectors and weight parameters. Then, an iterative method is proposed based on the block coordinate descent (BCD) technique, which is guaranteed to converge to a stationary point. Simulation results demonstrate the effectiveness of the proposed method. © 1967-2012 IEEE.

Keywords: Iterative methods
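To make the group-sparse formulation in the abstract above concrete, the sketch below contrasts the mixed zero-norm/two-norm term (a count of APs with a nonzero beamformer) with its mixed one-norm/two-norm approximation (the sum of per-AP beamformer norms). The matrix layout (one row per AP) and the function names are assumptions for illustration only; the paper's actual notation is not given in the abstract.

```python
import numpy as np


def clustered_ap_count(W: np.ndarray, tol: float = 1e-9) -> int:
    """Mixed zero-norm/two-norm: number of rows (APs) whose two-norm is nonzero."""
    return int(np.sum(np.linalg.norm(W, axis=1) > tol))


def group_sparsity_surrogate(W: np.ndarray) -> float:
    """Mixed one-norm/two-norm: sum of row-wise two-norms, used as a relaxation of the count."""
    return float(np.sum(np.linalg.norm(W, axis=1)))


# Hypothetical beamformers for three APs (rows); the second AP is switched off.
W = np.array([[3.0, 4.0],
              [0.0, 0.0],
              [0.0, 1.0]])
print(clustered_ap_count(W))        # 2 active APs
print(group_sparsity_surrogate(W))  # 5.0 + 0.0 + 1.0 = 6.0
```

Penalizing the surrogate drives entire rows toward zero, which is why it serves as a tractable proxy for reducing the number of clustered APs.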


IRSEnet: Differentially Private Image Generation with Multi-Scale Feature Extraction and Residual Channel Attention

13th International Conference on Intelligent Control and Information Processing, ICICIP 2025

Authors: Li, Jiahao; Wang, Zhongshuai; Ghazali, Kamarul Hawari Bin; Yan, Suqing; Lan, Rushi; Sun, Xiyan; Luo, Xiaonan. Affiliations: Guangxi Key Lab. of Image and Graphic Intelligent Processing, Guilin University of Electronic Technology, Guilin 541004, China; Centre for Advanced Industrial Technology, University of Malaysia Pahang Al-Sultan Abdullah, Pahang, Pekan 26600, Malaysia; Int. Joint Research Lab. of Spatio-temporal Information and Intelligent Location Services, Guilin University of Electronic Technology, Guilin 541004, China

ISBN: (print) 9798331516147

Privacy-preserving image generation is particularly crucial in fields like healthcare, where data are both sensitive and limited. However, effective privacy preservation often compromises the visual quality and utility of the generated images due to privacy budget constraints. To address this issue, we propose a novel network architecture, IRSEnet, which combines multi-scale feature extraction and residual channel attention mechanisms, aiming to enhance the visual quality of generated images and improve the performance of downstream classification tasks under differential privacy. The differential privacy mechanism ensures the security of sensitive data during training, while the multi-scale feature extraction module enhances feature extraction capabilities through parallel convolutional layers at multiple scales. Additionally, the channel attention module dynamically adjusts channel weights to focus on the most discriminative features. Experimental results demonstrate that this model significantly improves the utility of generated images and the accuracy of downstream classification tasks while preserving privacy. Future work will explore the application of this approach on larger datasets and across more diverse tasks. © 2025 IEEE.

Keywords: Differential privacy
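The channel attention idea mentioned in the abstract above (re-weighting channels so the most discriminative ones dominate) is commonly realized as a squeeze-and-excitation style gate with a residual connection. The NumPy sketch below shows that generic pattern under stated assumptions; it is not IRSEnet's actual module, and the weight shapes and the name `residual_channel_attention` are hypothetical.

```python
import numpy as np


def residual_channel_attention(x: np.ndarray, w1: np.ndarray, w2: np.ndarray) -> np.ndarray:
    """Generic squeeze-and-excitation gate with a skip connection.

    x  : feature map of shape (C, H, W)
    w1 : reduction weights of shape (C // r, C)   (assumed layout)
    w2 : expansion weights of shape (C, C // r)   (assumed layout)
    """
    squeezed = x.mean(axis=(1, 2))                  # global average pool -> (C,)
    hidden = np.maximum(w1 @ squeezed, 0.0)         # ReLU bottleneck -> (C // r,)
    gate = 1.0 / (1.0 + np.exp(-(w2 @ hidden)))     # sigmoid channel weights -> (C,)
    return x + x * gate[:, None, None]              # residual: input plus re-weighted input


# Toy usage with random weights (C = 4 channels, reduction ratio r = 2).
rng = np.random.default_rng(0)
x = rng.random((4, 8, 8))
out = residual_channel_attention(x, rng.standard_normal((2, 4)), rng.standard_normal((4, 2)))
print(out.shape)
```

Because the gate lies in (0, 1), each output channel is the input scaled by a factor between 1 and 2, so the skip path keeps the original signal intact while the attention path emphasizes selected channels.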


A Flexible Knowledge Graph Error Detection Framework Combined with Semantic Information

12th CCF Conference on BigData, BigData 2024

Authors: Zhao, Yangwu; Liu, Yang; Ao, Xiang; He, Qing. Affiliations: Henan Institute of Advanced Technology, Zhengzhou University, Zhengzhou, China; Beijing, China; Key Lab of Intelligent Information Processing, Institute of Computing Technology, CAS, Beijing, China

ISBN: (print) 9789819610235

Knowledge graphs (KGs) are extensively utilized in numerous applications, including question-answering systems and recommender systems. However, knowledge graphs are often constructed through web crawling or crowdsourcing, leading to errors in the data. The task of knowledge graph error detection aims to identify inaccurate triplets in KGs and has received substantial attention in recent years. However, the majority of current error detection methods overlook the semantic information of the triplets, which can be vital for accurate error detection. In this paper, we introduce a Flexible Knowledge Graph Error Detection Framework that integrates Semantic information (FKED), which combines both structural and semantic information to detect errors within the knowledge graph. FKED first extracts the structural information of KGs using a graph embedding model. Next, FKED employs a pre-trained language model (PLM) to extract semantic information from the triplets. Then the structural and semantic information are combined to detect errors within the knowledge graph. FKED can be flexibly added to other structure-based error detection models, enhancing their capabilities in downstream tasks of knowledge graphs. We assess FKED using two benchmark datasets: FB15k-237 and WN18RR. Experimental results indicate that our method is both superior and effective, substantially improving the performance of the base models. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

Keywords: Knowledge graph


Complete Gait Phases Recognition Based on Muscle Synergy Using PSO-CNN-LSTM Algorithm


IEEE Sensors Journal, 2025, Vol. 25, No. 10, pp. 16775-16786

Authors: Zhang, Kewen; Li, Xiaoling; Chen, Xin; Yu, Longjie; Jin, Yinan; Fan, Bingfei; Du, Mingyu; Bao, Guanjun; Wu, Xinyu; Cai, Shibo. Affiliations: Zhejiang University of Technology, College of Mechanical Engineering, Key Laboratory of Special Purpose Equipment and Advanced Processing Technology, Ministry of Education, Hangzhou, China; Zhejiang University of Technology, College of Mechanical Engineering, Hangzhou, China; ZJUT Yinhu Research Institute of Innovation and Entrepreneurship, Fuyang District, Hangzhou 311400, China; Chinese Academy of Sciences, Key Laboratory of Human-Machine-Intelligence Synergic Systems, Shenzhen Institutes of Advanced Technology, Shenzhen, China; Chinese Academy of Sciences, Guangdong Provincial Key Lab of Robotics and Intelligent System, Shenzhen Institutes of Advanced Technology, Shenzhen, China

Accurately recognizing gait phases, by applying proper instrumentation and measurement, is significant in walking rehabilitation training for patients with impaired mobility. In this study, seven phases of the complete stand-walk-stand cycle, as well as continuous daily walking, were recognized based on muscle synergy and a PSO-CNN-LSTM model. Firstly, the complete stand-walk-stand walking cycle was divided into the starting phase, terminal swing phase, loading response phase, mid-stance phase, terminal stance phase, initial swing phase, and stopping phase, based on the features of surface electromyography (sEMG) collected by a portable sEMG acquisition system and measured motion data of the lower limb. Secondly, a muscle weight matrix and an activation sequence matrix were calculated by using non-negative matrix factorization (NMF). Finally, a PSO-CNN-LSTM network was designed to recognize the complete stand-walk-stand cycle by applying the above-mentioned matrices as input. Fourteen subjects volunteered to perform linear walking experiments under procedure to verify the feasibility of the proposed approach by comparing results with other classifiers and features. Experimental results show that the proposed approach was capable of achieving an average recognition accuracy of up to 85.384%. This work will offer a promising gait recognition method for rehabilitation robots to achieve natural and flexible human-robot interaction. © 2001-2012 IEEE.

Keywords: Non-negative matrix factorization
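The NMF step in the abstract above factors a non-negative sEMG matrix into a muscle weight matrix and an activation sequence matrix. A minimal sketch using the standard Lee-Seung multiplicative updates is shown below. The matrix orientation (channels by time samples), the number of synergies, and the synthetic data are assumptions for illustration; the abstract does not state which NMF solver or settings the authors used.

```python
import numpy as np


def nmf(V: np.ndarray, k: int, iters: int = 200, seed: int = 0, eps: float = 1e-9):
    """Factor V (channels x samples) as W @ H with W, H >= 0 via multiplicative updates.

    W (channels x k) plays the role of the muscle weight matrix,
    H (k x samples) the activation sequence matrix.
    """
    rng = np.random.default_rng(seed)
    m, n = V.shape
    W = rng.random((m, k)) + eps
    H = rng.random((k, n)) + eps
    for _ in range(iters):
        H *= (W.T @ V) / (W.T @ W @ H + eps)   # update activations
        W *= (V @ H.T) / (W @ H @ H.T + eps)   # update muscle weights
    return W, H


# Synthetic stand-in for rectified sEMG envelopes: 8 channels, 50 samples, 2 underlying synergies.
rng = np.random.default_rng(1)
V = rng.random((8, 2)) @ rng.random((2, 50))
W, H = nmf(V, k=2)
print(W.shape, H.shape)
print(np.linalg.norm(V - W @ H) / np.linalg.norm(V))  # relative reconstruction error
```

The two factors returned here correspond to the matrices that the abstract describes as the network input.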


Retrieval Augmented Recipe Generation

IEEE Workshop on Applications of Computer Vision (WACV)

Authors: Guoshan Liu; Hailong Yin; Bin Zhu; Jingjing Chen; Chong-Wah Ngo; Yu-Gang Jiang. Affiliations: Shanghai Key Lab of Intelligent Information Processing, School of Computer Science, Fudan University; Shanghai Collaborative Innovation Center on Intelligent Visual Computing; Singapore Management University

ISBN: (digital) 9798331510831

ISBN: (print) 9798331510848

The growing interest in generating recipes from food images has drawn substantial research attention in recent years. Existing works for recipe generation primarily utilize a two-stage training method - first predicting ingredients from a food image and then generating instructions from both the image and ingredients. Large Multi-modal Models (LMMs), which have achieved notable success across a variety of vision and language tasks, shed light on generating both ingredients and instructions directly from images. Nevertheless, LMMs still face the common issue of hallucinations during recipe generation, leading to suboptimal performance. To tackle this issue, we propose a retrieval augmented large multimodal model for recipe generation. We first introduce Stochastic Diversified Retrieval Augmentation (SDRA) to retrieve recipes semantically related to the image from an existing datastore as a supplement, integrating them into the prompt to add diverse and rich context to the input image. Additionally, a Self-Consistency Ensemble Voting mechanism is proposed to determine the most confident prediction recipes as the final output. It calculates the consistency among generated recipe candidates, which use different retrieval recipes as context for generation. Extensive experiments validate the effectiveness of our proposed method, which demonstrates state-of-the-art (SOTA) performance in recipe generation on the Recipe1M dataset.

Keywords: Training; Computer vision; Accuracy; Computational modeling; Stochastic processes; Predictive models; Reliability; Faces


Improving Event-Level Financial Sentiment Analysis with Retrieval-Augmented Multipath Chain-of-Thought Prompting

12th CCF Conference on BigData, BigData 2024

Authors: Zhang, Yiming; Ao, Xiang; Yu, Guoxin; He, Qing. Affiliations: Henan Institute of Advanced Technology, Zhengzhou University, Zhengzhou 450002, China; Beijing 100190, China; Key Lab of Intelligent Information Processing, Institute of Computing Technology, CAS, Beijing 100190, China

ISBN: (print) 9789819610235

Event-level Financial Sentiment Analysis (EFSA) aims to extract all the quintuples containing five sentiment elements from a given financial news text, which has gained prominence as an emerging domain recently. The present study utilizes a 4-hop Chain-of-Thought (CoT) prompting based on LLMs to predict sentiment elements in a fixed order, which neglects the interdependencies among the sentiment elements within a quintuple. Inspired by recent multi-view prompting (MvP) and CoT ideas, we propose a novel framework termed Retrieval-Augmented Multipath Chain-of-Thought (RMP-CoT) that aggregates quintuples generated by LLMs through different reasoning paths, leveraging a retrieval-augmented mechanism. Specifically, RMP-CoT integrates different element orders into CoT prompting to guide LLMs in generating multiple sentiment quintuples through the utilization of retrieval-augmented mechanism, and then selects the most plausible quintuples by voting. To investigate the effectiveness of our framework, we conduct extensive experiments on four benchmark tasks of EFSA. RMP-CoT pushes the state-of-the-art by over 6% F1 on the EFSA task and also performs quite effectively on the other sub-tasks of EFSA. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

Keywords: Chain-of-Thought; Event-level Sentiment Analysis; Financial Sentiment Analysis


Predicting Calibrated Conversion Rate of Online Advertising Using a Multi-task Mixture-of-Experts Calibration Model

12th CCF Conference on BigData, BigData 2024

Authors: Zhang, Xinyue; Guo, Yuyao; Ao, Xiang. Affiliations: School of Information Engineering, China University of Geosciences, Beijing, China; Beijing, China; Key Lab of Intelligent Information Processing, Institute of Computing Technology, CAS, Beijing, China

ISBN: (print) 9789819610235

Accurately predicting conversion rate (CVR) is paramount in online advertising. However, traditional models may face problems such as delayed feedback, where there is a delay of an indeterminate amount of time between click and conversion. Calibration is an effective way to optimize conversion rate estimates in online advertising. Unlike conversion delays, post-click user behaviors occur rapidly and are informative to conversion rate prediction. Our proposed solution, the Multi-Task Mixture-of-Experts Calibration (MTMEC) framework, integrates multi-task learning and mixture-of-experts models. It modifies CVR prediction using post-click user behavior data, utilizing streaming learning for real-time data access. Each task is weighted by a gating network, enabling adaptive loss functions through multi-task learning. Parametric scaling further minimizes calibration errors, enhancing prediction accuracy without excessive parameters. Experiments on real-world datasets validate the effectiveness of MTMEC. It improves model prediction and reduces calibration errors. This framework offers a robust solution for online advertising, bridging the gap between calibration accuracy and system responsiveness. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

Keywords: Multi-task learning
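The abstract above mentions parametric scaling as a way to reduce calibration error. One well-known parametric method is Platt scaling, which fits a slope and an offset on top of a model's logits; the sketch below uses it purely as an illustrative stand-in, since the abstract does not say which parametric form MTMEC adopts. The function names, learning rate, and toy data are all assumptions.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def log_loss(p, y, eps=1e-12):
    """Mean binary cross-entropy between predicted probabilities p and labels y."""
    p = np.clip(p, eps, 1.0 - eps)
    return float(-np.mean(y * np.log(p) + (1.0 - y) * np.log(1.0 - p)))


def fit_platt_scaling(logits, labels, lr=0.1, steps=500):
    """Fit calibrated_p = sigmoid(a * logit + b) by gradient descent on the log loss."""
    z = np.asarray(logits, dtype=float)
    y = np.asarray(labels, dtype=float)
    a, b = 1.0, 0.0  # start from the uncalibrated model
    for _ in range(steps):
        p = sigmoid(a * z + b)
        grad_a = np.mean((p - y) * z)
        grad_b = np.mean(p - y)
        a -= lr * grad_a
        b -= lr * grad_b
    return a, b


# Toy raw CVR logits and observed conversions (hypothetical data).
z = np.array([-2.0, -1.0, 0.0, 1.0, 2.0, 3.0])
y = np.array([0.0, 0.0, 1.0, 0.0, 1.0, 1.0])
a, b = fit_platt_scaling(z, y)
print(log_loss(sigmoid(z), y), log_loss(sigmoid(a * z + b), y))
```

Only two parameters are learned, which matches the abstract's point about improving calibration without adding many parameters.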


Domain-aware Node Representation Learning for Graph Out-of-Distribution Generalization

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Yi Qiao; Yang Liu; Qing He; Xiang Ao. Affiliations: Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, CAS, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; Institute of Intelligent Computing Technology, Suzhou, CAS

ISBN: (digital) 9798350368741

ISBN: (print) 9798350368758

Graph Neural Networks (GNNs) have demonstrated impressive success across diverse fields when data satisfies the in-distribution (ID) assumption. Nevertheless, GNN performance significantly declines in cases of distribution shifts between training and testing graph data. This degradation primarily stems from spurious correlations between irrelevant domain information and target labels in out-of-distribution (OOD) scenarios. Thus, maximizing the utilization of domain information becomes imperative. In light of this, we propose a novel approach named Domain-aware Node Representation Learning (DNRL), which comprehensively incorporates domain information to bolster generalization capability. Specifically, DNRL selectively interpolates nodes with the same label but different domains, extending training data into unseen domains and alleviating the effects caused by domain-related spurious correlations. Furthermore, by introducing a domain-aware contrastive learning strategy, our method implicitly decouples domain information from node information to learn domain-independent node representations. Extensive experiments on graph out-of-distribution benchmarks demonstrate that DNRL can achieve effective OOD generalization performance across diverse domains.

Keywords: Representation learning; Training; Degradation; Interpolation; Correlation; Training data; Signal processing; Graph neural networks; Data models; Speech processing
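The selective interpolation described in the abstract above (mixing nodes that share a label but come from different domains) resembles a label-preserving, cross-domain mixup. The sketch below illustrates that idea on plain feature vectors. It is an assumption-laden simplification: the partner selection rule, the fixed mixing weight `lam`, and the function name are hypothetical, and the abstract does not describe how DNRL picks pairs or weights.

```python
import numpy as np


def cross_domain_interpolate(X, labels, domains, lam=0.5):
    """For each node, mix its features with a same-label node from a different domain.

    Nodes with no eligible partner are skipped. The first eligible partner is used
    so the sketch stays deterministic; a real system would likely sample partners.
    """
    X = np.asarray(X, dtype=float)
    mixed = []
    for i in range(len(X)):
        partners = [j for j in range(len(X))
                    if labels[j] == labels[i] and domains[j] != domains[i]]
        if partners:
            j = partners[0]
            mixed.append(lam * X[i] + (1.0 - lam) * X[j])
    return np.array(mixed)


# Toy node features: nodes 0 and 1 share label 0 across domains 0 and 1; node 2 has no partner.
X = [[0.0, 0.0], [2.0, 2.0], [10.0, 10.0]]
new_nodes = cross_domain_interpolate(X, labels=[0, 0, 1], domains=[0, 1, 0])
print(new_nodes)
```

Each synthetic node keeps its class label while blending domain-specific feature content, which is the intuition behind extending the training data toward unseen domains.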


The chordata olfactory receptor database


Protein & Cell, 2025, Vol. 16, No. 4, pp. 283-292

Authors: Wei Han; Siyu Bao; Jintao Liu; Yiran Wu; Liting Zeng; Tao Zhang; Ningmeng Chen; Kai Yao; Shunguo Fan; Aiping Huang; Yuanyuan Feng; Guiquan Zhang; Ruiyi Zhang; Hongjin Zhu; Tian Hua; Zhijie Liu; Lina Cao; Xingxu Huang; Suwen Zhao. Affiliations: iHuman Institute, Shanghai Tech University; Research Center for Life Sciences Computing, Zhejiang Lab; Department of Intelligent Edge Cloud, China Telecom Cloud Technology Co. Ltd.; School of Life Science and Technology, Shanghai Tech University; School of Information Science and Technology, Shanghai Tech University; Shanghai Key Laboratory of High-Resolution Electron Microscopy, Shanghai Tech University; Zhejiang Provincial Key Laboratory of Pancreatic Disease, The First Affiliated Hospital and Institute of Translational Medicine, Zhejiang University School of Medicine; Shanghai Clinical Research and Trial Center

Introduction of database. Olfaction is one of the oldest chemosensory systems in chordates, playing crucial roles in their foraging, predator evasion, social communication, mating and parental care (Guo et al., 2023; Li and Liberles, 2015; Liberles, 2014). The initial step of olfaction is the binding and activation of olfactory receptors (ORs) by odorants in a combinatorial way (Malnic et al., 1999).
