Search Results - Inner Mongolia University Library

24th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2024

Authors: Ye, Xuming; Chaomurilige; Liu, Zheng; Luo, Haoyu; Dong, Jun; Luo, Yingzhe. Affiliations: Key Laboratory of Ethnic Language Intelligent Analysis and Security Governance of MOE, Minzu University of China, Beijing, China; Hainan International College of Minzu University of China, Li’an International Education Innovation Pilot Zone, Hainan 572499, China; Faculty of Data Science, City University of Macau, Macau, China

ISBN: (print) 9789819615278

Existing multimodal summarization methods primarily focus on multimodal fusion to efficiently utilize the visual information for summarization. However, they fail to exploit the deep interaction between the textual and visual modalities. Moreover, optimizing the model by maximum likelihood estimation (MLE) leads to exposure bias: during inference, the model generates each next word conditioned on previously generated words that may be erroneous. To address these challenges, we propose a novel modality-aware fusion module (MAF) with a summarization ranking (SumR) training objective. Specifically, the MAF module exploits the interaction in the multimodal input through multiple fusion layers, and SumR aims to align the probability order predicted by the model with actual quality metrics, thereby reducing the exposure bias problem during inference. Extensive experiments on a large-scale dataset demonstrate that our method outperforms existing models, achieving superior results in both automatic and human evaluation metrics. The generated multimodal summaries provide richer context and enhance users’ comprehension by combining the key textual information with the relevant visual content. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

Keywords: Maximum likelihood estimation
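
The idea of aligning the model's probability order with quality metrics can be illustrated with a pairwise margin ranking loss. This is only a minimal sketch and not the authors' SumR objective; the function name `pairwise_ranking_loss`, the margin value, and the toy scores are assumptions made for illustration.

```python
def pairwise_ranking_loss(model_scores, quality_scores, margin=0.1):
    """Penalize candidate pairs whose model-score order disagrees with quality order.

    model_scores: model scores (e.g. log-probabilities), one per candidate summary.
    quality_scores: metric values (e.g. ROUGE) for the same candidates.
    Both the loss form and the margin schedule are illustrative assumptions.
    """
    # Candidate indices sorted from best to worst according to the quality metric.
    order = sorted(range(len(quality_scores)),
                   key=lambda i: quality_scores[i], reverse=True)
    loss = 0.0
    for rank_i, i in enumerate(order):
        for rank_j in range(rank_i + 1, len(order)):
            j = order[rank_j]
            # Candidate i is better than j, so its model score should exceed
            # that of j by a margin that grows with the rank distance.
            gap = margin * (rank_j - rank_i)
            loss += max(0.0, model_scores[j] - model_scores[i] + gap)
    return loss


# Model order agrees with quality order: no penalty.
agree = pairwise_ranking_loss([-1.0, -2.0, -3.0], [0.5, 0.4, 0.3])
# Model order is reversed: every pair is penalized.
disagree = pairwise_ranking_loss([-3.0, -2.0, -1.0], [0.5, 0.4, 0.3])
```

A loss of this kind is computed on candidates the model itself generates, which is how ranking objectives expose the model to its own outputs during training.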

Boolean and Fp-Matrix Factorization: From Theory to Practice

IEEE International Conference on Fuzzy Systems (FUZZ-IEEE) / IEEE World Congress on Computational Intelligence (IEEE WCCI) / International Joint Conference on Neural Networks (IJCNN) / IEEE Congress on Evolutionary Computation (IEEE CEC)

Authors: Fomin, Fedor; Panolan, Fahad; Patil, Anurag; Tanveer, Adil. Affiliations: Univ Bergen, Dept Informat, Bergen, Norway; IIT Hyderabad, Dept CSE, Sangareddy, India; EdgeVerve Syst Ltd, Bengaluru, India; Amazon, Chennai, Tamil Nadu, India

ISBN: (print) 9781728186719

Boolean Matrix Factorization (BMF) aims to find an approximation of a given binary matrix as the Boolean product of two low-rank binary matrices. Binary data is ubiquitous in many fields, and representing data by binary matrices is common in medicine, natural language processing, bioinformatics, computer graphics, and many other areas. Factorizing a matrix into low-rank matrices is used to gain more information about the data, such as discovering relationships between features and samples, roles and users, topics and articles, etc. In many applications, the binary nature of the factor matrices can greatly increase the interpretability of the data. Unfortunately, BMF is computationally hard, and heuristic algorithms are used to compute Boolean factorizations. Very recently, a theoretical breakthrough was obtained independently by two research groups. Ban et al. (SODA 2019) and Fomin et al. (Trans. Algorithms 2020) show that BMF admits an efficient polynomial-time approximation scheme (EPTAS). However, despite the theoretical importance, the double-exponential dependence of the running times on the rank makes these algorithms unimplementable in practice. The primary research question motivating our work is whether the theoretical advances on BMF could lead to practical algorithms. The main conceptual contribution of our work is the following. While EPTAS for BMF is a purely theoretical advance, the general approach behind these algorithms could serve as the basis for designing better heuristics. We also use this strategy to develop new algorithms for the related Fp-Matrix Factorization. Here, given a matrix A over a finite field GF(p), where p is a prime, and an integer r, our objective is to find a matrix B over the same field with GF(p)-rank at most r minimizing some norm of A - B. Our empirical research on synthetic and real-world data demonstrates the advantage of the new algorithms over previous works on BMF and Fp-Matrix Factorization.

Keywords: Binary matrix factorization; Categorical data; Data mining
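
The two products mentioned in the abstract differ only in how sums are taken, which a small sketch can make concrete. The helper names `boolean_product`, `mod_p_product`, and `reconstruction_error`, and the toy matrices, are assumptions for illustration; this shows the objective being approximated, not the heuristics proposed in the paper.

```python
def boolean_product(U, V):
    """Boolean product of binary matrices U (m x r) and V (r x n): OR of ANDs."""
    m, r, n = len(U), len(V), len(V[0])
    return [[int(any(U[i][k] and V[k][j] for k in range(r))) for j in range(n)]
            for i in range(m)]


def mod_p_product(U, V, p):
    """Product over GF(p): ordinary matrix product with entries reduced modulo p."""
    m, r, n = len(U), len(V), len(V[0])
    return [[sum(U[i][k] * V[k][j] for k in range(r)) % p for j in range(n)]
            for i in range(m)]


def reconstruction_error(A, B):
    """Number of entries in which A and B differ."""
    return sum(a != b for row_a, row_b in zip(A, B) for a, b in zip(row_a, row_b))


# Toy rank-2 factors (hypothetical data).
U = [[1, 0], [1, 1], [0, 1]]
V = [[1, 1, 0], [0, 1, 1]]

bool_B = boolean_product(U, V)   # 1 OR 1 = 1 in the middle entry
gf2_B = mod_p_product(U, V, 2)   # 1 + 1 = 0 (mod 2) in the middle entry
```

The two results differ in exactly one entry here, the one where both rank-1 components overlap, which is where Boolean and GF(2) arithmetic disagree.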

Towards Robust Federated Learning: Investigating Poisoning Attacks Under Clients Data Heterogeneity

19th International Conference on Ubiquitous Information Management and Communication, IMCOM 2025

Authors: Soubih, Abdenour; Lahmer, Seyyid Ahmed; Abuhamad, Mohammed; Abuhmed, Tamer. Affiliations: College of Computing and Informatics, Sungkyunkwan University, Suwon, Republic of Korea; Department of Information Engineering, University of Padova, Padua, Italy; Department of Computer Science, Loyola University Chicago, Chicago, United States

ISBN: (print) 9798331507817

Federated Learning (FL) offers a privacy-preserving solution by enabling multiple clients to train a shared model collaboratively without centralizing data. However, the decentralized nature of FL presents challenges, particularly regarding security and performance under adversarial conditions. This paper investigates the effects of poisoning attacks under data heterogeneity. Our experiments evaluate the impact of varying malicious client fractions and poison concentration levels on the accuracy of the model. We explore the effects of poisoning attacks on FedAvg and FedNova models using medical imaging tasks. Our findings reveal that increasing data heterogeneity exacerbates the effects of poisoning, with FedNova demonstrating greater resilience compared to FedAvg. We found that the number of malicious clients plays a more significant role in degrading performance than the ratio of poisoning samples shared by each malicious client, suggesting that even modest levels of poisoning can be tolerated by most algorithms. The study highlights the importance of developing robust defense mechanisms to maintain model performance under adversarial conditions. © 2025 IEEE.

Keywords: Federated learning
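
The aggregation step of FedAvg, one of the two algorithms evaluated above, is a dataset-size-weighted average of client parameters. The sketch below shows only that step under simplifying assumptions: parameters are flat lists of floats, and the function name `fedavg` and the toy values are hypothetical.

```python
def fedavg(client_weights, client_sizes):
    """Average client parameter vectors, weighted by local dataset size."""
    total = sum(client_sizes)
    dim = len(client_weights[0])
    return [sum(w[d] * n for w, n in zip(client_weights, client_sizes)) / total
            for d in range(dim)]


# Two clients with 1 and 3 local samples (toy values).
global_weights = fedavg([[1.0, 2.0], [3.0, 4.0]], [1, 3])
```

Because every client contributes in proportion to its weight, a client that submits manipulated parameters shifts the global model directly, which is the mechanism poisoning attacks exploit.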

KnowLog: Knowledge Enhanced Pre-trained Language Model for Log Understanding

46th IEEE/ACM International Conference on Software Engineering, ICSE 2024

Authors: Ma, Lipeng; Jiang, Sihang; Yang, Weidong; Fei, Ben; Xu, Bo; Liang, Jiaqing; Zhou, Mingjie; Xiao, Yanghua. Affiliations: Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University, Shanghai, China; School of Computer Science and Technology, Donghua University, Shanghai, China; School of Data Science, Fudan University, Shanghai, China

ISBN: (print) 9798400702174

Logs as semi-structured text are rich in semantic information, making their comprehensive understanding crucial for automated log analysis. With the recent success of pre-trained language models in natural language processing, many studies have leveraged these models to understand logs. Despite their successes, existing pre-trained language models still suffer from three weaknesses. Firstly, these models fail to understand domain-specific terminology, especially abbreviations. Secondly, these models struggle to adequately capture the complete log context information. Thirdly, these models have difficulty in obtaining universal representations of different styles of the same logs. To address these challenges, we introduce KnowLog, a knowledge-enhanced pre-trained language model for log understanding. Specifically, to solve the first two challenges, we exploit abbreviations and natural language descriptions of logs from public documentation as local and global knowledge, respectively, and leverage this knowledge by designing novel pre-training tasks for enhancing the model. To solve the last challenge, we design a contrastive learning-based pre-training task to obtain universal representations. We evaluate KnowLog by fine-tuning it on six different log understanding tasks. Extensive experiments demonstrate that KnowLog significantly enhances log understanding and achieves state-of-the-art results compared to existing pre-trained language models without knowledge enhancement. Moreover, we conduct additional experiments in transfer learning and low-resource scenarios, showcasing the substantial advantages of KnowLog. Our source code and detailed experimental data are available at https://***/LeaperOvO/KnowLog. © 2024 IEEE Computer Society. All rights reserved.

Keywords: Computational linguistics

Multimodal Bias: Assessing Gender Bias in Computer Vision Models with NLP Techniques

25th International Conference on Multimodal Interaction (ICMI)

Authors: Mandal, Abhishek; Little, Suzanne; Leavy, Susan. Affiliations: Dublin City Univ, Sch Comp, Insight SFI Res Ctr Data Analyt, Dublin, Ireland; Univ Coll Dublin, Sch Informat & Commun Studies, Insight SFI Res Ctr Data Analyt, Dublin, Ireland

ISBN: (print) 9798400700552

Large multimodal deep learning models such as Contrastive Language Image Pretraining (CLIP) have become increasingly powerful with applications across several domains in recent years. CLIP works on visual and language modalities and forms a part of several popular models, such as DALL-E and Stable Diffusion. It is trained on a large dataset of millions of image-text pairs crawled from the internet. Such large datasets are often used for training purposes without filtering, leading to models inheriting social biases from internet data. Given that models such as CLIP are being applied in such a wide variety of applications ranging from social media to education, it is vital that harmful biases are detected. However, due to the unbounded nature of the possible inputs and outputs, traditional bias metrics such as accuracy cannot detect the range and complexity of biases present in the model. In this paper, we present an audit of CLIP using an established technique from natural language processing called Word Embeddings Association Test (WEAT) to detect and quantify gender bias in CLIP and demonstrate that it can provide a quantifiable measure of such stereotypical associations. We detected, measured, and visualised various types of stereotypical gender associations with respect to character descriptions and occupations and found that CLIP shows evidence of stereotypical gender bias.

Keywords: bias; fairness; multimodal models; trustworthiness
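
The WEAT statistic named in the abstract compares how strongly two target sets associate with two attribute sets in embedding space. The sketch below computes the standard WEAT effect size on toy 2-D vectors; the vectors, the function names, and the use of the sample standard deviation are assumptions for illustration, and in an audit of CLIP the inputs would be image or text embeddings rather than these hand-written values.

```python
import math
from statistics import mean, stdev


def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))


def association(w, A, B):
    """Mean similarity of w to attribute set A minus its mean similarity to B."""
    return mean(cosine(w, a) for a in A) - mean(cosine(w, b) for b in B)


def weat_effect_size(X, Y, A, B):
    """Difference of mean associations of targets X and Y, divided by the
    standard deviation of the associations over X and Y combined."""
    sx = [association(x, A, B) for x in X]
    sy = [association(y, A, B) for y in Y]
    return (mean(sx) - mean(sy)) / stdev(sx + sy)


# Toy embeddings (hypothetical): X leans toward A, Y leans toward B.
X = [[1.0, 0.0], [0.9, 0.1]]
Y = [[0.0, 1.0], [0.1, 0.9]]
A = [[1.0, 0.0]]
B = [[0.0, 1.0]]

effect = weat_effect_size(X, Y, A, B)
```

A positive effect size indicates that X is more associated with A than Y is; swapping X and Y flips the sign.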

Data Platforms for Real-time Insights in Healthcare: Systematic Review

14th International Conference on Ambient Systems, Networks and Technologies, ANT 2023 and The 6th International Conference on Emerging Data and Industry 4.0, EDI40 2023

Authors: Miranda, Rui; Alves, Carlos; Abelha, Antonio; Machado, Jose. Affiliations: Centro ALGORITMI, Escola de Engenharia, Universidade do Minho, Campus Azurem, Guimaraes 4800-058, Portugal

The ever-growing usage and popularity of Internet of Things devices, coupled with Big Data technologies and machine learning algorithms, have allowed data engineers to explore new opportunities in healthcare and continuous care. Furthermore, there is a need to reduce the time gap between when information is created and when actions and insights can be offered. However, a challenge in implementing a large-scale data processing architecture is deciding which tools are appropriate, and how to apply them in the best way possible. For example, streaming systems are now mature enough that hospitals worldwide can use their extremely large datasets, along with data producers, to predict and influence future events. Thus, the main objective of this systematic review is to identify the state-of-the-art in data platforms in healthcare that allow the creation of metrics and actions in real-time. The PRISMA guideline for reporting systematic reviews was implemented to deliver a transparent and consistent report, validating the technological advances in a critical sector. Multiple pertinent articles and papers were retrieved from the SCOPUS abstract and citation database on May 13, 2022, using several relevant keywords to identify potentially relevant documents published from January 2020 onward. These documents had to be already published, written in English, and accessible through the B-ON consortium that allows Portuguese students to legally download from most publishers. Over seven studies have been selected for deeper discussion based on their relevance and impact for this review, showcasing their main objectives, data sources, and tools used, as well as their approaches for interoperability and support of machine learning algorithms for decision support. In closing, the collected articles have shown that while Big Data is currently in use at health institutions of all sizes, the ability to process large amounts of data from sensors and events, a

Keywords: Data mining

PYRAMID TRANSFORMER DRIVEN MULTIBRANCH FUSION FOR POLYP SEGMENTATION IN COLONOSCOPIC VIDEO IMAGES

30th IEEE International Conference on Image Processing (ICIP)

Authors: Wang, Ao; Wu, Ming; Qi, Hao; Shi, Hong; Chen, Jianhua; Chen, Yinran; Luo, Xiongbiao. Affiliations: Xiamen Univ, Dept Comp Sci & Technol, Xiamen 361005, Peoples R China; Xiamen Univ, Natl Inst Data Sci Hlth & Med, Xiamen 361102, Peoples R China; Fujian Med Univ, Fujian Canc Hosp, Canc Hosp, Fuzhou 350014, Peoples R China

ISBN: (print) 9781728198354

Colonoscopic polyp segmentation is essential for the early diagnosis and treatment of colorectal cancer. It remains challenging to accurately extract these polyps due to their small sizes, irregular shapes, image artifacts, and illumination variations. This work proposes a new encoder-decoder architecture called pyramid transformer driven multibranch fusion to precisely segment different types of colorectal polyps during colonoscopy. Specifically, our architecture employs a simple, convolution-free pyramid transformer as its encoder, which is a flexible and powerful feature extractor. Next, a multibranch fusion decoder is employed to preserve the detailed appearance information and fuse semantic global cues, which can deal with blurred polyp edges caused by nonuniform illumination and the shaky colonoscope. Additionally, a hybrid spatial-frequency loss function is introduced for accurate training. We evaluate our proposed architecture on colonoscopic polyp images with four types of polyps with different pathological features, with the experimental results showing that our architecture significantly outperforms other deep learning models. Particularly, our method improves the average dice similarity and intersection over union to 90.7% and 0.848, respectively.

Keywords: Polyp segmentation; vision transformer; convolutional neural networks; colorectal cancer; colonoscopy
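
The two evaluation metrics reported above, dice similarity and intersection over union, can be computed from a predicted and a ground-truth binary mask as follows. This is a minimal sketch on flattened masks; the function name `dice_and_iou`, the toy masks, and the convention of returning 1.0 when both masks are empty are assumptions.

```python
def dice_and_iou(pred, target):
    """Dice similarity and intersection over union for flat binary masks (0/1)."""
    inter = sum(p and t for p, t in zip(pred, target))
    p_sum, t_sum = sum(pred), sum(target)
    union = p_sum + t_sum - inter
    # Assumed convention: two empty masks count as a perfect match.
    dice = 2 * inter / (p_sum + t_sum) if (p_sum + t_sum) else 1.0
    iou = inter / union if union else 1.0
    return dice, iou


# One overlapping pixel out of two predicted and two true pixels.
dice, iou = dice_and_iou([1, 1, 0, 0], [1, 0, 1, 0])
```

For a single pair of masks the two metrics are related by dice = 2 * iou / (1 + iou), so dice is never smaller than IoU.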

Trajectory Progress-Based Prioritizing and Intrinsic Reward Mechanism for Robust Training of Robotic Manipulations

IEEE Transactions on Automation Science and Engineering, 2025, vol. 22, pp. 9829-9842

Authors: Liang, Weixiang; Liu, Yinlong; Wang, Jikun; Yang, Zhi-Xin. Affiliations: Univ Macau, State Key Lab Internet Things Smart City, Macau, Peoples R China; Univ Macau, Dept Electromech Engn, Macau, Peoples R China

Training robots by model-free deep reinforcement learning (DRL) to carry out robotic manipulation tasks without sufficient successful experiences is challenging. Hindsight experience replay (HER) is introduced to enable DRL agents to learn from failure experiences. However, the HER-enabled model-free DRL still suffers from limited training performance due to its uniform sampling strategy and scarcity of reward information in the task environment. Inspired by the progress incentive mechanism in human psychology, we propose Progress Intrinsic Motivation-based HER (P-HER) in this work to overcome these difficulties. First, the Trajectory Progress-based Prioritized Experience Replay (TPPER) module is developed to prioritize sampling valuable trajectory data, thereby achieving more efficient training. Second, the Progress Intrinsic Reward (PIR) module is introduced in agent training to add extra intrinsic rewards that encourage the agents throughout the exploration of the task space. Experiments in challenging robotic manipulation tasks demonstrate that our P-HER method outperforms original HER and state-of-the-art HER-based methods in training performance. Our code of P-HER and its experimental videos in both virtual and real environments are available at https://***/weixiang-smart/P-HER. Note to Practitioners: This work is motivated by the need to develop a fast and effective learning method for intelligent robotic manipulation of typical industrial tasks, including pushing, picking, and placing workpieces, which are essential and fundamental processing plan activities for accomplishing robotic machining and assembly applications towards smart manufacturing. The introduction of reinforcement learning enables robots to learn manipulation tasks autonomously, which can save the effort for engineers to teach or hard-program the robot and also reduce labor costs. However, the existing HER-based reinforcement learning algorithms suffer from low training efficiency and performance due to

Keywords: Robots; Training; Trajectory; Service robots; Space exploration; Deep reinforcement learning; Psychology; Data models; Smart manufacturing; Programming; Hindsight experience replay; progress intrinsic motivation; deep reinforcement learning; robotic manipulations
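
The baseline that P-HER builds on, hindsight experience replay, turns a failed episode into useful training data by relabeling its goal with something the agent actually achieved. The sketch below shows the simplest "final state" relabeling with a sparse reward; it does not include the prioritized sampling (TPPER) or intrinsic reward (PIR) modules from the abstract, and the dictionary keys, function names, and tolerance are assumptions.

```python
def sparse_reward(achieved, goal, tol=1e-6):
    """Sparse reward: 0 if the achieved goal matches the desired goal, else -1."""
    return 0.0 if all(abs(a - g) <= tol for a, g in zip(achieved, goal)) else -1.0


def relabel_with_final_goal(episode):
    """Hindsight relabeling: treat the finally achieved state as if it had been the goal.

    episode: list of dicts with keys 'state', 'action', 'achieved_goal', 'goal'.
    Returns new transitions; the input episode is left unchanged.
    """
    final_achieved = episode[-1]["achieved_goal"]
    relabeled = []
    for step in episode:
        new_step = dict(step, goal=final_achieved)
        new_step["reward"] = sparse_reward(step["achieved_goal"], final_achieved)
        relabeled.append(new_step)
    return relabeled


# A failed two-step episode: the desired goal (5, 5) is never reached.
episode = [
    {"state": 0, "action": 1, "achieved_goal": (0.0, 0.0), "goal": (5.0, 5.0)},
    {"state": 1, "action": 1, "achieved_goal": (1.0, 0.0), "goal": (5.0, 5.0)},
]
hindsight = relabel_with_final_goal(episode)
```

After relabeling, the last transition carries a success reward even though the original goal was missed, which is what lets the agent learn from failures.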

A Distributed Adaptive Algorithm for Non-Smooth Spatial Filtering Problems

48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023

Authors: Hovine, Charles; Bertrand, Alexander. Affiliations: Leuven Belgium Signal Processing and Data Analytics Stadius Center for Dynamical Systems Leuven Belgium Leuven Belgium

ISBN: (print) 9781728163277

Computing the optimal solution to a spatial filtering problem in a Wireless Sensor Network can incur large bandwidth and computational requirements if an approach relying on data centralization is used. The so-called distributed adaptive signal fusion (DASF) algorithm solves this problem by having the nodes collaboratively solve low-dimensional versions of the original optimization problem, relying solely on the exchange of compressed views of the sensor data between the nodes. However, the DASF algorithm has only been shown to converge for filtering problems that can be expressed as smooth optimization problems. In this paper, we explore an extension of the DASF algorithm to a family of non-smooth spatial filtering problems, allowing the addition of non-smooth regularizers to the optimization problem, which could, for example, be used to perform node selection and eliminate nodes not contributing to the filter objective, thereby further reducing communication costs. We provide a convergence proof of the non-smooth DASF algorithm and validate its convergence via simulations in both a static and adaptive setting. © 2023 IEEE.

Keywords: Adaptive algorithms
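
One common way a non-smooth regularizer performs node selection is a group penalty on each node's filter coefficients, whose proximal operator either shrinks the block or sets it to zero. The abstract does not state which regularizer is used, so the group (l2-norm) penalty, the function name `group_soft_threshold`, and the toy values below are assumptions; this is not the DASF algorithm itself.

```python
import math


def group_soft_threshold(w, lam):
    """Proximal operator of lam * ||w||_2 for one node's coefficient block.

    The block is set to zero when its norm is at most lam (the node is
    dropped); otherwise it is shrunk toward zero by the factor 1 - lam / norm.
    """
    norm = math.sqrt(sum(x * x for x in w))
    if norm <= lam:
        return [0.0] * len(w)
    scale = 1.0 - lam / norm
    return [scale * x for x in w]


kept = group_soft_threshold([3.0, 4.0], 1.0)     # norm 5: shrunk by factor 0.8
dropped = group_soft_threshold([0.3, 0.4], 1.0)  # norm 0.5: zeroed out
```

A node whose block is zeroed contributes nothing to the filter output and would not need to transmit data, which is the link to reduced communication cost.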

Spatio-temporal feature fusion model based on Attention mechanism for RFID indoor positioning

27th International Conference on Computer Supported Cooperative Work in Design (CSCWD)

Authors: Chen, Houjin; Yang, Lvqing; Yang, Mulan; Hou, Xuehan; Chen, Sien; Dong, Wensheng; Yu, Bo; Wang, Qingkai. Affiliations: Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China; Xi An Jiao Tong Univ, Sch Econ & Finance, Xian 710049, Peoples R China; Xi An Jiao Tong Univ, Sch Software, Xian 710049, Peoples R China; Jimei Univ, Sch Nav, Xiamen 361021, Peoples R China; Xiamen Univ, Sch Management, Xiamen 361005, Peoples R China; Zijin Zhixin Xiamen Technol Co Ltd, Xiamen 361005, Peoples R China; State Key Lab Proc Automat Min & Met, Beijing 102600, Peoples R China; Beijing Key Lab Proc Automat Min & Met, Beijing 102600, Peoples R China

ISBN: (print) 9798350349184; 9798350349191

Amidst the rapid advancement of Internet of Things (IoT) technology, achieving precise indoor localization has emerged as a pivotal research area. Localization algorithms relying on the Radio Frequency Identification (RFID) received signal strength indicator (RSSI) have gained widespread adoption in numerous indoor positioning systems due to their straightforward implementation and cost-effectiveness. However, in indoor settings, challenges like building obstructions and multipath effects often lead to signal reception failures by RFID antennas, consequently compromising the reliability of positioning outcomes. Recent research has approached indoor localization as a regression problem, employing deep learning models for analysis and prediction. However, most current indoor localization models focus on either spatial or temporal features within RSSI data, leading to suboptimal localization outcomes. To tackle these challenges, this paper proposes an enhanced methodology that leverages Generative Adversarial Networks (GAN) to impute missing RSSI data. Additionally, Convolutional Neural Networks (CNN) are utilized to extract spatial domain features, while Long Short-Term Memory Networks (LSTM) are employed for extracting temporal domain features. Ultimately, this paper designs a novel model, GCLA, which integrates an Attention mechanism with a location coding strategy to fuse features for precise location prediction. Experimental results show that the proposed GCLA model can obtain stable localization results after a short training on a small number of datasets.

Keywords: RFID; Indoor positioning; Deep learning; Data imputation
