检索结果-内蒙古大学图书馆

arXiv 2023年

作者： Lei, Yang Li, Jiangtong Cheng, Dawei Ding, Zhijun Jiang, Changjun Department of Computer Science and Technology Tongji University Shanghai China Shanghai Artificial Intelligence Laboratory Shanghai China

Large language models (LLMs) have demonstrated great potential in the financial domain. Thus, it becomes important to assess the performance of LLMs in the financial tasks. In this work, we introduce CFBenchmark, to evaluate the performance of LLMs for Chinese financial assistant. The basic version of CFBenchmark is designed to evaluate the basic ability in Chinese financial text processing from three aspects (i.e. recognition, classification, and generation) including eight tasks, and includes financial texts ranging in length from 50 to over 1,800 characters. We conduct experiments on several LLMs available in the literature with CFBenchmark-Basic, and the experimental results indicate that while some LLMs show outstanding performance in specific tasks, overall, there is still significant room for improvement in basic tasks of financial text processing with existing models. In the future, we plan to explore the advanced version of CFBenchmark, aiming to further explore the extensive capabilities of language models in more profound dimensions as a financial assistant in Chinese. Our codes are released at https://***/TongjiFinLab/CFBenchmark1 Copyright © 2023, The Authors. All rights reserved.

关键词： Text processing

来源：评论

学校读者我要写书评

暂无评论

Towards Stability of Autoregressive Neural Operators

arXiv

引用

arXiv 2023年

作者： McCabe, Michael Harrington, Peter Subramanian, Shashank Brown, Jed Department of Computer Science University of Colorado Boulder United States Lawrence Berkeley National Laboratory United States

Neural operators have proven to be a promising approach for modeling spatiotemporal systems in the physical sciences. However, training these models for large systems can be quite challenging as they incur significant computational and memory expense—these systems are often forced to rely on autoregressive time-stepping of the neural network to predict future temporal states. While this is effective in managing costs, it can lead to uncontrolled error growth over time and eventual instability. We analyze the sources of this autoregressive error growth using prototypical neural operator models for physical systems and explore ways to mitigate it. We introduce architectural and application-specific improvements that allow for careful control of instability-inducing operations within these models without inflating the compute/memory expense. We present results on several scientific systems that include Navier-Stokes fluid flow, rotating shallow water, and a high-resolution global weather forecasting system. We demonstrate that applying our design principles to neural operators leads to significantly lower errors for long-term forecasts as well as longer time horizons without qualitative signs of divergence compared to the original models for these systems. We open-source our code for reproducibility. © 2023, CC BY.

关键词： Weather forecasting

来源：评论

学校读者我要写书评

暂无评论

A Novel Silhouettes Cluster Internal Evaluation Index Based on Granular-Ball

A Novel Silhouettes Cluster Internal Evaluation Index Based ...

引用

IEEE International Conference on Cloud Computing and Big Data Analysis (ICCCBDA)

作者： Pengfei Zhao Zizhong Chen Jiang Xie Shuyin Xia Guoyin Wang College of Computer Science and Technology Chongqing Key Laboratory of Computational Intelligence Chongqing University of Posts and Telecommunications Chongqing China Department of Computer Science and Engineering University of California Riverside Riverside California USA

Cluster internal evaluation index is used to evaluate and guide the results of clustering, which has been considered as one of the vital issues in the application of clustering. Granular-ball is the multi-granularity characterization of the data set, In this paper, the classic Silhouettes indexes were improved by using granular-ball to represent the grain, we proposed a Silhouettes cluster internal evaluation index based on granular-ball(GSCVI), GSCVI can effectively obtain the optimal number of clusters for arbitrary-shaped and noisy data sets, and it is superior to most of the existing indexes for both artificial and real data sets.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Defense against synonym substitution-based adversarial attacks via dirichlet neighborhood ensemble 59

Defense against synonym substitution-based adversarial attac...

引用

Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, ACL-IJCNLP 2021

作者： Zhou, Yi Zheng, Xiaoqing Hsieh, Cho-Jui Chang, Kai-Wei Huang, Xuanjing School of Computer Science Fudan University Shanghai China Shanghai Key Laboratory of Intelligent Information Processing China Department of Computer Science University of California Los Angeles United States

ISBN: (纸本)9781954085527

Although deep neural networks have achieved prominent performance on many NLP tasks, they are vulnerable to adversarial examples. We propose Dirichlet Neighborhood Ensemble (DNE), a randomized method for training a robust model to defense synonym substitution-based attacks. During training, DNE forms virtual sentences by sampling embedding vectors for each word in an input sentence from a convex hull spanned by the word and its synonyms, and it augments them with the training data. In such a way, the model is robust to adversarial attacks while maintaining the performance on the original clean data. DNE is agnostic to the network architectures and scales to large models (e.g., BERT) for NLP applications. Through extensive experimentation, we demonstrate that our method consistently outperforms recently proposed defense methods by a significant margin across different network architectures and multiple data sets. © 2021 Association for Computational Linguistics

关键词： Natural language processing systems

来源：评论

学校读者我要写书评

暂无评论

In-the-loop or on-the-loop? Interactional arrangements to support team coordination with a planning agent

In-the-loop or on-the-loop? Interactional arrangements to su...

引用

作者： Fischer, Joel E. Greenhalgh, Chris Jiang, Wenchao Ramchurn, Sarvapali D. Wu, Feng Rodden, Tom The Mixed Reality Laboratory School of Computer Science University of Nottingham Nottingham United Kingdom Agents Interaction and Complexity Group Department of Electronics and Computer Science University of Southampton Southampton United Kingdom School of Computer Science and Technology University of Science and Technology of China Hefei China

In this paper, we present the study of interactional arrangements that support the collaboration of headquarters (HQ), field responders, and a computational planning agent in a time-critical task setting created by a mixed-reality game. Interactional arrangements define the extent to which control is distributed between the collaborative parties. We provide 2 field trials, one to study an "on-the-loop" arrangement in which HQ monitors and intervenes in agent instructions to field players on demand and the other, to study a version that places HQ more tightly "in-the-loop." The studies provide an understanding of the sociotechnical collaboration between players and the agent in these interactional arrangements by conducting interaction analysis of video recordings and game log data. The first field trial focuses on the collaboration of field responders with the planning agent. Findings highlight how players negotiate the agent guidance within the social interaction of the collocated teams. The second field trial focuses on the collaboration between the automated planning agent and the HQ. We find that the human coordinator and the agent can successfully work together in most cases, with human coordinators inspecting and "correcting" the agent-proposed plans. Through this field trial-driven development process, we generalise interaction design implications of automated planning agents around the themes of supporting common ground and mixed-initiative planning. © 2017 The Authors. Concurrency and Computation: Practice and Experience Published by John Wiley & Sons, Ltd.

关键词： Mixed reality

来源：评论

学校读者我要写书评

暂无评论

Assessing the Degree of Feature Interactions that Determine a Model Prediction

Assessing the Degree of Feature Interactions that Determine ...

引用

IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW

作者： Krishna Khadka Sunny Shree Yu Lei Raghu N. Kacker D. Richard Kuhn Department of Computer Science and Engineering The University of Texas at Arlington Arlington USA Information Technology Laboratory National Institute of Standards and Technology Gaithersburg USA

ISBN: (数字)9798350344790

ISBN: (纸本)9798350344806

Machine Learning (ML) models rely on capturing important feature interactions to generate predictions. This study is focused on validating the hypothesis that model predictions often depend on interactions involving only a few features. This hypothesis is inspired by t-way combinatorial testing for software systems. In our study, we utilize the notion of Shapley Additive Explanations (SHAP) values to quantify each feature’s contribution to model prediction. We then use a greedy approach to identify a minimal subset of features (t) required to determine a model prediction. Our empirical evaluation is performed on three datasets: Adult Income, Mushroom, and Breast Cancer, and three classification models: Logistic Regression, XGBoost, and SVM. Through our experiments, we find that the majority of predictions are determined by interactions involving only a subset of features.

关键词： Support vector machines Logistic regression Additives Conferences Combinatorial testing Machine learning Predictive models

来源：评论

学校读者我要写书评

暂无评论

Dynamic Resource Management for Elastic Scientific Workflows using PMIx

Dynamic Resource Management for Elastic Scientific Workflows...

引用

IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum (IPDPSW)

作者： Rajat Bhattarai Howard Pritchard Sheikh Ghafoor Department of Computer Science Tennessee Tech University Cookeville Tennessee HPC Division Los Alamos National Laboratory Los Alamos New Mexico

ISBN: (数字)9798350364606

ISBN: (纸本)9798350364613

In current scientific workflows, the computational needs of tasks might not be known when it is submitted to a system for execution. Current resource management (RM) systems and workflow managers (WFMs) provide limited support for dynamic resource allocation in HPC systems, thus the common approach is to request the maximum resources needed for a maximum time, potentially wasting resources. However, in some cases, maximum resources may not be estimated a priori, as a result, a workflow may be completed after the deadline, or in cases, the task may terminated by the resource manager. A combination of workflow manager and resource management system that can accommodate a fine-grain elastic resource allocation during the execution of a workflow would alleviate this problem. This paper presents a dynamic elastic resource management framework based on the Parsl workflow manager and PMIx-enabled SLURM and reports the early evaluation of the framework using two workflow applications.

关键词： Distributed processing Runtime Conferences Prototypes Dynamic scheduling Resource management Task analysis

来源：评论

学校读者我要写书评

暂无评论

Multi-Agent Evolution Strategy With Cooperative and Cumulative Step Adaptation for Black-Box Distributed Optimization

引用

IEEE Transactions on Evolutionary Computation 2025年

作者： Chen, Tai-You Chen, Wei-Neng Hao, Jin-Kao Wang, Yang Zhang, Jun South China University of Technology School of Computer Science and Engineering Guangzhou510006 China Université d'Angers LERIA Laboratory Department of Computer Science Angers49045 France Northwestern Polytechnical University School of Management Xi'an710072 China Zhejiang Normal University Nankai University China Hanyang University ERICA Korea Republic of

In recent years, black-box distributed optimization (DBO) has been widely studied to solve complex optimization problems in multi-agent systems, such as hyperparameter optimization of distributed machine learning. However, most existing methods use a fixed or diminishing step size to sample and search in the black box optimization space, which makes it challenging to maintain optimization efficiency on different optimization problems. In this work, we propose a multi-agent evolution strategy with cooperative and cumulative step adaptation (). In, each agent executes the algorithm to sample and explores its local objective function, and communicates with other agents to optimize the global objective function cooperatively, which is the sum of local objective functions. To improve the sampling adaptability, we design a cooperative and cumulative step adaptation method (CCSA) consisting of inner adaptation and outer adaptation. By detecting the evolution path of the multi-agent system, CCSA decreases the step size when the evolution directions of agents are conflicting and increases the step size when consistent. In terms of theoretical analysis, we first discuss the working principle of CCSA, and then discuss the system consensus of. In terms of experimental verification, achieves better consensus performance and competitive solution quality compared with state-of-the-art algorithms for DBO. © 1997-2012 IEEE.

关键词： Optimization algorithms

来源：评论

学校读者我要写书评

暂无评论

Learning the Kalman Filter with Fine-Grained Sample Complexity

Learning the Kalman Filter with Fine-Grained Sample Complexi...

引用

American Control Conference (ACC)

作者： Xiangyuan Zhang Bin Hu Tamer Başar Department of Electrical and Computer Engineering and Coordinated Science Laboratory University of Illinois at Urbana-Champaign Urbana IL USA

We develop the first end-to-end sample complexity of model-free policy gradient (PG) methods in discrete-time infinite-horizon Kalman filtering. Specifically, we introduce the receding-horizon policy gradient (RHPG-KF) framework and demonstrate sample complexity for RHPG-KF in learning a stabilizing filter that is ϵ-close to the optimal Kalman filter. Notably, the proposed RHPG-KF framework does not require the system to be open-loop stable nor assume any prior knowledge of a stabilizing filter. Our results shed light on applying model-free PG methods to control a linear dynamical system where the state measurements could be corrupted by statistical noises and other (possibly adversarial) disturbances.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning

arXiv

引用

arXiv 2024年

作者： Ju, Zhaoxun Yang, Chao Wang, Hongbo Qiao, Yu Sun, Fuchun Academy for Engineering and Technology Fudan University China Shanghai Artificial Intelligence Laboratory China Department of Computer Science and Technology Tsinghua University China

Language-conditioned robot behavior plays a vital role in executing complex tasks by associating human commands or instructions with perception and actions. The ability to compose long-horizon tasks based on unconstrained language instructions necessitates the acquisition of a diverse set of general-purpose skills. However, acquiring inherent primitive skills in a coupled and long-horizon environment without external rewards or human supervision presents significant challenges. In this paper, we evaluate the relationship between skills and language instructions from a mathematical perspective, employing two forms of mutual information within the framework of language-conditioned policy learning. To maximize the mutual information between language and skills in an unsupervised manner, we propose an end-to-end imitation learning approach known as Language Conditioned Skill Discovery (LCSD). Specifically, we utilize vector quantization to learn discrete latent skills and leverage skill sequences of trajectories to reconstruct high-level semantic instructions. Through extensive experiments on language-conditioned robotic navigation and manipulation tasks, encompassing BabyAI, LORel, and CALVIN, we demonstrate the superiority of our method over prior works. Our approach exhibits enhanced generalization capabilities towards unseen tasks, improved skill interpretability, and notably higher rates of task completion success. Copyright © 2024, The Authors. All rights reserved.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：