ISBN: (Print) 9781450380539
Algorithms often have tunable parameters that impact performance metrics such as runtime and solution quality. For many algorithms used in practice, no parameter settings admit meaningful worst-case bounds, so the parameters are made available for the user to tune. Alternatively, parameters may be tuned implicitly within the proof of a worst-case guarantee. Worst-case instances, however, may be rare or nonexistent in practice. A growing body of research has demonstrated that data-driven algorithm design can lead to significant improvements in performance. This approach uses a training set of problem instances sampled from an unknown, application-specific distribution and returns a parameter setting with strong average performance on the training set. We provide a broadly applicable theory for deriving generalization guarantees that bound the difference between the algorithm's average performance over the training set and its expected performance on the unknown distribution. Our results apply no matter how the parameters are tuned, be it via an automated or manual approach. The challenge is that for many types of algorithms, performance is a volatile function of the parameters: slightly perturbing the parameters can cause a large change in behavior. Prior research (e.g., Gupta and Roughgarden, SICOMP'17; Balcan et al., COLT'17, ICML'18, EC'18) has proved generalization bounds by employing case-by-case analyses of greedy algorithms, clustering algorithms, integer programming algorithms, and selling mechanisms. We uncover a unifying structure which we use to prove extremely general guarantees, yet we recover the bounds from prior research. Our guarantees, which are tight up to logarithmic factors in the worst case, apply whenever an algorithm's performance is a piecewise-constant, piecewise-linear, or, more generally, piecewise-structured function of its parameters. Our theory also implies novel bounds for voting mechanisms and dynamic programming algorithms from computational biology.
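To make the setting concrete, here is a minimal sketch of the data-driven pipeline on a toy parameterized greedy knapsack heuristic, in the spirit of the rho-greedy family from the cited Gupta and Roughgarden line of work: candidate parameters are evaluated on a sampled training set and the empirically best one is returned. The instance generator, parameter grid, and function names are illustrative standitions of my own, not the paper's construction; the paper's contribution is the generalization guarantee controlling how far this training-set performance can drift from the true expected performance.

```python
import random

def greedy_knapsack(values, sizes, capacity, rho):
    """Parameterized greedy: take items in decreasing value / size**rho order.

    On any fixed instance, the achieved value is a piecewise-constant
    function of rho, the structure exploited by the theory above.
    """
    order = sorted(range(len(values)),
                   key=lambda i: values[i] / (sizes[i] ** rho),
                   reverse=True)
    total_value, remaining = 0.0, capacity
    for i in order:
        if sizes[i] <= remaining:
            total_value += values[i]
            remaining -= sizes[i]
    return total_value

def random_instance(n=50, capacity=25.0):
    """Stand-in for a draw from the unknown application-specific distribution."""
    values = [random.uniform(1, 10) for _ in range(n)]
    sizes = [random.uniform(0.5, 5) for _ in range(n)]
    return values, sizes, capacity

# Empirical parameter selection on a training set (the quantity whose
# generalization to the true distribution the paper bounds).
training_set = [random_instance() for _ in range(200)]
rho_grid = [k / 10 for k in range(0, 31)]
avg_perf = {
    rho: sum(greedy_knapsack(v, s, c, rho) for v, s, c in training_set)
         / len(training_set)
    for rho in rho_grid
}
best_rho = max(avg_perf, key=avg_perf.get)
print(f"best rho on training set: {best_rho:.1f}, avg value: {avg_perf[best_rho]:.2f}")
```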
Most optimization problems of practical significance are typically solved by highly configurable parameterized algorithms. To achieve the best performance on a problem instance, a trial-and-error configuration process is required, which is very costly and even prohibitive for problems that are already computationally intensive, e.g., problems associated with machine learning tasks. In the past decades, many studies have been conducted to accelerate the tedious configuration process by learning from a set of training instances. This article refers to these studies as learn to optimize and reviews the progress achieved.
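As a hedged illustration of what "configuration by learning from training instances" means in the simplest case, the sketch below tunes a single parameter of a toy solver offline on training instances and then reuses the learned configuration on a new instance. The solver, instance distribution, and candidate grid are hypothetical placeholders, not any particular method surveyed in the article.

```python
import random

def solve(problem, step_size, max_iters=500, tol=1e-6):
    """Toy parameterized solver: gradient descent on f(x) = 0.5 * a * x**2.

    Returns the number of iterations used, the cost we want to minimize.
    """
    a, x = problem, 1.0
    for it in range(max_iters):
        if abs(a * x) < tol:
            return it
        x -= step_size * a * x
    return max_iters

def sample_instance():
    """Stand-in for drawing a problem instance from the target distribution."""
    return random.uniform(0.5, 5.0)   # curvature of the quadratic

# Offline configuration: evaluate candidate configurations on training
# instances once, instead of trial-and-error tuning on every new instance.
training_instances = [sample_instance() for _ in range(100)]
candidate_steps = [0.05, 0.1, 0.2, 0.4]
avg_iters = {
    s: sum(solve(p, s) for p in training_instances) / len(training_instances)
    for s in candidate_steps
}
best_step = min(avg_iters, key=avg_iters.get)
print(f"configured step size: {best_step}, avg iterations: {avg_iters[best_step]:.1f}")

# The learned configuration is then applied directly to an unseen instance.
print("iterations on a new instance:", solve(sample_instance(), best_step))
```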
Metalearning of numerical algorithms for a given task consists of the data-driven identification and adaptation of an algorithmic structure and the associated hyperparameters. To limit the complexity of the metalearning problem, neural architectures with a certain inductive bias towards favorable algorithmic structures can, and should, be used. We generalize our previously introduced Runge-Kutta neural network to a recursively recurrent neural network superstructure for the design of customized iterative algorithms. In contrast to off-the-shelf deep learning approaches, it features a distinct division into modules for generation of information and for the subsequent assembly of this information towards a solution. Local information in the form of a subspace is generated by subordinate, inner, iterations of recurrent function evaluations starting at the current outer iterate. The update to the next outer iterate is computed as a linear combination of these evaluations, reducing the residual in this space, and constitutes the output of the network. We demonstrate that regular training of the weight parameters inside the proposed superstructure on input/output data of various computational problem classes yields iterations similar to Krylov solvers for linear equation systems, Newton-Krylov solvers for nonlinear equation systems, and Runge-Kutta integrators for ordinary differential equations. Due to its modularity, the superstructure can be readily extended with functionalities needed to represent more general classes of iterative algorithms traditionally based on Taylor series expansions.
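The inner/outer module structure described above can be illustrated without any learned weights. In the sketch below, the inner recurrence generates a local Krylov-style subspace by repeated matrix-vector evaluations starting from the current outer iterate, and the outer update is the linear combination that minimizes the residual in that subspace, i.e. a plain GMRES-style projection standing in for the network's learned combination. This is an illustration of the module structure under my own simplifying assumptions, not the authors' trained superstructure.

```python
import numpy as np

def modular_krylov_step(A, b, x, inner_steps=5):
    """One outer iteration mimicking the described inner/outer division.

    Inner module: generate local information (a Krylov-type subspace) by
    repeated function evaluations from the current iterate. A practical
    solver would orthogonalize this basis (Arnoldi); omitted for brevity.
    Outer module: combine the evaluations linearly so as to reduce the
    residual in that subspace (here via an explicit least-squares solve,
    where the paper instead uses learned weights).
    """
    r = b - A @ x
    V = [r]
    for _ in range(inner_steps - 1):   # subordinate, inner iterations
        V.append(A @ V[-1])
    V = np.column_stack(V)
    # Coefficients minimizing || b - A (x + V c) || over the subspace.
    c, *_ = np.linalg.lstsq(A @ V, r, rcond=None)
    return x + V @ c                   # next outer iterate

# Usage: a small symmetric positive-definite linear system.
rng = np.random.default_rng(0)
M = rng.standard_normal((20, 20))
A = M @ M.T + 20 * np.eye(20)
b = rng.standard_normal(20)
x = np.zeros(20)
for _ in range(5):                     # outer iterations
    x = modular_krylov_step(A, b, x)
print("residual norm:", np.linalg.norm(b - A @ x))
```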
Despite significant advances, deep networks remain highly susceptible to adversarial attack. One fundamental challenge is that small input perturbations can often produce large movements in the network's final-layer feature space. In this paper, we define an attack model that abstracts this challenge, to help understand its intrinsic properties. In our model, the adversary may move data an arbitrary distance in feature space but only in random low-dimensional subspaces. We prove such adversaries can be quite powerful: defeating any algorithm that must classify any input it is given. However, by allowing the algorithm to abstain on unusual inputs, we show such adversaries can be overcome when classes are reasonably well-separated in feature space. We further provide strong theoretical guarantees for setting algorithm parameters to optimize over accuracy-abstention trade-offs using data-driven methods. Our results provide new robustness guarantees for nearest-neighbor style algorithms, and also have application to contrastive learning, where we empirically demonstrate the ability of such algorithms to obtain high robust accuracy with low abstention rates. Our model is also motivated by strategic classification, where entities being classified aim to manipulate their observable features to produce a preferred classification, and we provide new insights into that area as well.
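As an illustration of the abstention mechanism and the data-driven parameter sweep mentioned above, the sketch below runs a 1-nearest-neighbor classifier that abstains whenever the nearest training point is farther than a radius threshold, and reports the accuracy/abstention trade-off over a few candidate radii on toy well-separated data. The data, radii, and helper names are hypothetical; the paper's contribution is the theoretical guarantee for choosing such parameters, not this particular classifier.

```python
import numpy as np

def nn_predict_with_abstention(x, X_train, y_train, radius):
    """1-nearest-neighbor classifier that abstains on unusual inputs.

    If the nearest training point is farther than `radius` in feature space,
    the input is declared unusual and the classifier returns None (abstain).
    `radius` is the kind of parameter one would set in a data-driven way to
    trade off accuracy against abstention rate.
    """
    dists = np.linalg.norm(X_train - x, axis=1)
    j = int(np.argmin(dists))
    return None if dists[j] > radius else y_train[j]

def evaluate(radius, X_train, y_train, X_val, y_val):
    """Accuracy on answered points and abstention rate for one radius."""
    preds = [nn_predict_with_abstention(x, X_train, y_train, radius) for x in X_val]
    answered = [(p, y) for p, y in zip(preds, y_val) if p is not None]
    abstain_rate = 1 - len(answered) / len(y_val)
    accuracy = sum(p == y for p, y in answered) / len(answered) if answered else 0.0
    return accuracy, abstain_rate

# Toy two-class data with reasonably well-separated classes (illustrative only).
rng = np.random.default_rng(1)
X_train = np.vstack([rng.normal(0, 0.5, (50, 2)), rng.normal(3, 0.5, (50, 2))])
y_train = np.array([0] * 50 + [1] * 50)
X_val = np.vstack([rng.normal(0, 0.5, (50, 2)), rng.normal(3, 0.5, (50, 2))])
y_val = np.array([0] * 50 + [1] * 50)

# Sweep of the abstention radius over the accuracy/abstention trade-off.
for radius in (0.3, 0.6, 1.0, 2.0):
    acc, abstain = evaluate(radius, X_train, y_train, X_val, y_val)
    print(f"radius={radius:.1f}  accuracy={acc:.2f}  abstention={abstain:.2f}")
```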