ISBN (Print): 9798331314385
With the rapid development of multimedia technology, audio-visual learning has emerged as a promising research topic within the field of multimodal analysis. In this paper, we explore parameter-efficient transfer learning for audio-visual learning and propose the Audio-visual Mixture of Experts (AVMoE) to flexibly inject adapters into pre-trained models. Specifically, we introduce unimodal and cross-modal adapters as multiple experts that specialize in intra-modal and inter-modal information, respectively, and employ a lightweight router to dynamically allocate the weight of each expert according to the specific demands of each task. Extensive experiments demonstrate that our proposed AVMoE achieves superior performance across multiple audio-visual tasks, including AVE, AVVP, AVS, and AVQA. Furthermore, visual-only experimental results also indicate that our approach can tackle challenging scenes where modality information is missing. The source code is available at https://***/yingchengy/AVMOE.
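The adapter-as-expert design with a lightweight router can be illustrated with a minimal PyTorch sketch. All module names, dimensions, and the way cross-modal context is consumed here are illustrative assumptions, not the authors' implementation:

```python
import torch
import torch.nn as nn

class Adapter(nn.Module):
    """Bottleneck adapter: down-project, non-linearity, up-project."""
    def __init__(self, dim, bottleneck=64):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)
        self.up = nn.Linear(bottleneck, dim)

    def forward(self, x, context=None):
        # A cross-modal expert would consume the other modality; this
        # sketch simply adds its mean-pooled features when provided.
        h = x if context is None else x + context.mean(dim=1, keepdim=True)
        return self.up(torch.relu(self.down(h)))

class AVMoELayer(nn.Module):
    """Mixture of unimodal and cross-modal adapter experts, mixed by a
    lightweight router that weighs each expert per input."""
    def __init__(self, dim, num_experts=4):
        super().__init__()
        self.experts = nn.ModuleList(Adapter(dim) for _ in range(num_experts))
        self.router = nn.Linear(dim, num_experts)  # lightweight router

    def forward(self, x, context=None):
        # Router produces one weight per expert from pooled token features.
        weights = torch.softmax(self.router(x.mean(dim=1)), dim=-1)  # (B, E)
        outs = []
        for i, expert in enumerate(self.experts):
            # By convention here, the second half are cross-modal experts.
            ctx = context if i >= len(self.experts) // 2 else None
            outs.append(expert(x, ctx))
        outs = torch.stack(outs, dim=-1)                  # (B, T, D, E)
        mixed = (outs * weights[:, None, None, :]).sum(-1)
        return x + mixed                                  # residual injection
```

A call such as layer(audio_tokens, context=visual_tokens) would then let the router trade off intra-modal and inter-modal experts for each input.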
Fast adaptation to unknown peers (partners or opponents) with different strategies is a key challenge in multi-agent games. To achieve this, it is crucial for the agent to probe and identify the peer's strategy efficiently, as this is the prerequisite for carrying out the best response in adaptation. However, exploring the strategies of unknown peers is difficult, especially when the games are partially observable and have a long horizon. In this paper, we propose a peer identification reward, which rewards the learning agent based on how well it can identify the behavior pattern of the peer over the historical context, such as observations over multiple episodes. This reward motivates the agent to learn a context-aware policy for effective exploration and fast adaptation, i.e., to actively seek and collect informative feedback from peers when uncertain about their policies, and to exploit the context to perform the best response when confident. We evaluate our method on diverse testbeds that involve competitive (Kuhn Poker), cooperative (PO-Overcooked), or mixed (Predator-Prey-W) games with peer agents. We demonstrate that our method induces more active exploration behavior, achieving faster adaptation and better outcomes than existing methods.
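As a rough illustration of the idea, such an intrinsic reward can be derived from a learned identifier's confidence in recognizing the peer from the interaction history. The identifier interface and shaping coefficient below are assumptions for the sketch, not the paper's exact formulation:

```python
import torch
import torch.nn.functional as F

def peer_identification_reward(identifier, history, true_peer_id, beta=0.1):
    """Intrinsic reward: how well a learned identifier recognizes the
    peer's behavior pattern from the historical context.

    identifier:   model mapping an encoded interaction history to logits
                  over known peer strategies (assumed trained jointly).
    history:      tensor encoding observations/actions over past episodes.
    true_peer_id: index of the peer strategy actually in play.
    """
    logits = identifier(history)                        # (num_peer_types,)
    log_prob = F.log_softmax(logits, dim=-1)[true_peer_id]
    # Higher log-likelihood of the correct peer type means a higher reward,
    # encouraging the agent to probe informatively when it is uncertain.
    return beta * log_prob.item()
```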
Large Language Models (LLMs) have demonstrated considerable cross-lingual alignment and generalization ability. Current research primarily focuses on improving LLMs' cross-lingual generalization capabilities. Howev...
Tool learning is widely acknowledged as a foundational approach for deploying large language models (LLMs) in real-world scenarios. While current research primarily emphasizes leveraging tools to augment LLMs, it freq...
Multilingual text recognition (MLTR) systems typically focus on a fixed set of languages, which makes it difficult to handle newly added languages or adapt to ever-changing data distributions. In this paper, we propose the Incremental MLTR (IMLTR) task in the context of incremental learning (IL), where different languages are introduced in batches. IMLTR is particularly challenging due to rehearsal imbalance, i.e., the uneven distribution of sample characters in the rehearsal set, which retains a small amount of old data as past memory. To address this issue, we propose a Multiplexed Routing Network (MRN). MRN trains a recognizer for each language seen so far. A language domain predictor is then learned on the rehearsal set to weigh the recognizers. Since the recognizers are derived from the original data, MRN effectively reduces the reliance on older data and better combats catastrophic forgetting, the core issue in IL. We extensively evaluate MRN on the MLT17 and MLT19 datasets. It outperforms existing general-purpose IL methods by large margins, with average accuracy improvements ranging from 10.3% to 35.8% under different settings. Code is available at https://***/simplify23/MRN.
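A minimal sketch of the multiplexed routing idea: per-language recognizers are kept as they are learned, and a domain predictor trained on the rehearsal set weighs their outputs. The recognizer interface and output shapes below are assumptions:

```python
import torch
import torch.nn as nn

class MultiplexedRoutingNetwork(nn.Module):
    """One recognizer per seen language; a language-domain predictor
    (trained on the small rehearsal set) weighs recognizer outputs."""
    def __init__(self, recognizers, feat_dim):
        super().__init__()
        self.recognizers = nn.ModuleList(recognizers)  # per-language models
        self.domain_predictor = nn.Linear(feat_dim, len(recognizers))

    def forward(self, image_feat, image):
        # Predict how likely each language domain is for this image.
        weights = torch.softmax(self.domain_predictor(image_feat), dim=-1)
        # Each recognizer emits per-timestep character logits (B, T, C);
        # mix them by the predicted domain weights.
        logits = torch.stack([r(image) for r in self.recognizers], dim=-1)
        return (logits * weights[:, None, None, :]).sum(-1)
```

Because routing relies only on the small rehearsal set, adding a language means training one new recognizer and re-fitting the predictor, rather than retraining on all old data.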
Multimodal Question Answering (MMQA) is crucial as it enables comprehensive understanding and accurate responses by integrating insights from diverse data representations such as tables, charts, and text. Most existin...
Text-to-image models encounter safety issues, including concerns related to copyright and Not-Safe-For-Work (NSFW) content. Although several methods have been proposed for erasing inappropriate concepts from diffusion ...
ISBN (Print): 9798350329964
A fault in a large online service system often triggers numerous alerts due to the complex business and component dependencies among services, a phenomenon known as an "alert storm". In a short time, an online service system may generate a huge amount of alert data. This poses a challenge for on-call engineers, who must identify the alerts associated with a system failure for root cause analysis. In this paper, we propose DyAlert, a dynamic graph neural network-based approach that links alerts likely triggered by the same fault, reducing the burden on on-call engineers during fault analysis. Our insight is that alerts are often triggered by alert propagation when a system failure occurs (e.g., alert a leads to the occurrence of alert b); whether two alerts should be linked depends on whether one is triggered by the propagation of the other. Leveraging this insight, we design a dynamic graph (the Alert-Metric Dynamic Graph) that describes the propagation process of alerts. Based on this dynamic graph, we train a neural network-based model to predict alert links. We evaluate DyAlert with real-world data collected from an online service system running 85 business units and about 30,000 different services in a large enterprise. The results show that DyAlert is effective in predicting alert links, outperforming state-of-the-art approaches with an average increase of 0.259 in F1-score.
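A simplified sketch of the link-prediction step: node embeddings produced by a dynamic graph encoder are scored pairwise to decide whether two alerts are propagation-linked. The scoring head below is an assumption; the abstract does not specify the model architecture:

```python
import torch
import torch.nn as nn

class AlertLinkPredictor(nn.Module):
    """Scores whether alert a and alert b are linked by propagation,
    given their embeddings from an alert-metric dynamic graph encoder."""
    def __init__(self, embed_dim, hidden=128):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(2 * embed_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, emb_a, emb_b):
        # Concatenate the two alert embeddings and predict a link probability.
        pair = torch.cat([emb_a, emb_b], dim=-1)
        return torch.sigmoid(self.mlp(pair))
```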
Vision-language pre-trained models (e.g., CLIP), trained on large-scale datasets via self-supervised learning, are drawing increasing research attention since they achieve superior performance on multi-modal downstream tasks. Nevertheless, we find that adversarial perturbations crafted on vision-language pre-trained models can be used to attack the corresponding downstream task models. Specifically, to investigate such adversarial transferability, we introduce a task-agnostic method named Global and Local Augmentation (GLA) attack to generate highly transferable adversarial examples on CLIP for attacking black-box downstream task models. GLA adopts random crop-and-resize at both the global and local patch levels to create more diversity and make the adversarial noise robust. GLA then generates adversarial perturbations by minimizing the cosine similarity between intermediate features of augmented adversarial and benign examples. Extensive experiments on three CLIP image encoders with different backbones and three different downstream tasks demonstrate the superiority of our method over other strong baselines. The code is available at https://***/yqlvcoding/GLAattack.
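The attack objective can be sketched as a PGD-style step: apply global and local crop-and-resize views to both adversarial and benign images, then descend on the cosine similarity between their features. The crop scales, step sizes, and the use of encoder outputs (rather than specific intermediate layers) are assumptions for this sketch:

```python
import torch
import torch.nn.functional as F
import torchvision.transforms.functional as TF
from torchvision.transforms import RandomResizedCrop

def gla_step(encoder, x_adv, x_clean, x_orig, alpha=1/255, eps=8/255):
    """One step of a global-local augmentation attack: minimize cosine
    similarity between features of matched adversarial/benign views."""
    x_adv = x_adv.clone().detach().requires_grad_(True)
    loss = 0.0
    # Global view keeps most of the image; local view crops small patches.
    for scale in ((0.5, 1.0), (0.1, 0.4)):
        i, j, h, w = RandomResizedCrop.get_params(
            x_adv, scale=scale, ratio=(3 / 4, 4 / 3))
        view_adv = TF.resized_crop(x_adv, i, j, h, w, [224, 224])
        with torch.no_grad():
            view_clean = TF.resized_crop(x_clean, i, j, h, w, [224, 224])
            f_clean = encoder(view_clean)
        f_adv = encoder(view_adv)
        # Minimizing similarity drives adversarial features away from benign.
        loss = loss + F.cosine_similarity(f_adv, f_clean, dim=-1).mean()
    loss.backward()
    with torch.no_grad():
        x_adv = x_adv - alpha * x_adv.grad.sign()            # descend
        x_adv = x_orig + (x_adv - x_orig).clamp(-eps, eps)   # eps-ball
    return x_adv.detach().clamp(0, 1)
```

Sharing one set of crop parameters across the adversarial and benign views keeps the feature comparison aligned, so the loss reflects the perturbation rather than a mismatch in crops.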