检索结果-内蒙古大学图书馆

Variable Selection for Distributed Sparse Regression Under Memory Constraints

Communications in Mathematics and statistics 2024年第2期12卷 307-338页

作者： Haofeng Wang Xuejun Jiang Min Zhou Jiancheng Jiang Department of Mathematics Harbin Institute of TechnologyHarbinPeople's Republic of China Department of Statistics and Data Science Southerm Univerity of Science and TechnologyShenzhenPeople's Republic of China Beijing Normal University-Hong Kong Baptist University United International College ZhuhaiPeople's Republic of China Department of Mathematics and Statistics University of North Carolina at CharlotteCharlotteUSA

This paper studies variable selection using the penalized likelihood method for dis-tributed sparse regression with large sample size n under a limited memory *** is a much needed research problem to be solved in the big data era.A naive divide-and-conquer method solving this problem is to split the whole data into N parts and run each part on one of N machines,aggregate the results from all machines via averaging,andﬁnally obtain the selected ***,it tends to select more noise variables,and the false discovery rate may not be well *** improve it by a special designed weighted average in *** the alternating direction method of multiplier can be used to deal with massive data in the literature,our proposed method reduces the computational burden a lot and performs better by mean square error in most ***,we establish asymptotic properties of the resulting estimators for the likelihood models with a diverging number of *** some regularity conditions,we establish oracle properties in the sense that our distributed estimator shares the same asymptotic efﬁciency as the estimator based on the full ***,a distributed penalized likelihood algorithm is proposed to reﬁne the results in the context of general ***,the proposed method is evaluated by simulations and a real example.

关键词： Variable selection Distributed sparse regression Memory constraints Distributed penalized likelihood algorithm

来源：评论

学校读者我要写书评

暂无评论

Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference 38

Latent Plan Transformer for Trajectory Abstraction: Planning...

引用

38th Conference on Neural Information Processing Systems, NeurIPS 2024

作者： Kong, Deqian Xu, Dehong Zhao, Minglu Pang, Bo Xie, Jianwen Lizarraga, Andrew Huang, Yuhao Xie, Sirui Wu, Ying Nian Department of Statistics and Data Science UCLA United States Salesforce Research United States Akool Research United States Xi'an Jiaotong University China Department of Computer Science UCLA United States

In tasks aiming for long-term returns, planning becomes essential. We study generative modeling for planning with datasets repurposed from offline reinforcement learning. Specifically, we identify temporal consistency in the absence of step-wise rewards as one key technical challenge. We introduce the Latent Plan Transformer (LPT), a novel model that leverages a latent variable to connect a Transformer-based trajectory generator and the final return. LPT can be learned with maximum likelihood estimation on trajectory-return pairs. In learning, posterior sampling of the latent variable naturally integrates sub-trajectories to form a consistent abstraction despite the finite context. At test time, the latent variable is inferred from an expected return before policy execution, realizing the idea of planning as inference. Our experiments demonstrate that LPT can discover improved decisions from suboptimal trajectories, achieving competitive performance across several benchmarks, including Gym-Mujoco, Franka Kitchen, Maze2D, and Connect Four. It exhibits capabilities in nuanced credit assignments, trajectory stitching, and adaptation to environmental contingencies. These results validate that latent variable inference can be a strong alternative to step-wise reward prompting. © 2024 Neural information processing systems foundation. All rights reserved.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Causally Disentangled Generative Variational AutoEncoder 26

Causally Disentangled Generative Variational AutoEncoder

引用

26th European Conference on Artificial Intelligence, ECAI 2023

作者： An, SeungHwan Song, Kyungwoo Jeon, Jong-June Department of Statistics University of Seoul Korea Republic of Department of Applied Statistics Department of Statistics and Data Science Yonsei University Korea Republic of

ISBN: (纸本)9781643684369

We present a new supervised learning technique for the Variational AutoEncoder (VAE) that allows it to learn a causally disentangled representation and generate causally disentangled outcomes simultaneously. We call this approach Causally Disentangled Generation (CDG). CDG is a generative model that accurately decodes an output based on a causally disentangled representation. Our research demonstrates that adding supervised regularization to the encoder alone is insufficient for achieving a generative model with CDG, even for a simple task. Therefore, we explore the necessary and sufficient conditions for achieving CDG within a specific model. Additionally, we introduce a universal metric for evaluating the causal disentanglement of a generative model. Empirical results from both image and tabular datasets support our findings. © 2023 The Authors.

关键词： Supervised learning

来源：评论

学校读者我要写书评

暂无评论

Deep Learning-Based Holistic Speaker Independent Visual Speech Recognition

IEEE Transactions on Artificial Intelligence

引用

IEEE Transactions on Artificial Intelligence 2023年第6期4卷 1705-1713页

作者： Nemani, Praneeth Krishna, Ghanta Sai Ramisetty, Nikhil Sai, B Digvijay Sri Kumar, Santosh IIIT Naya Raipur Department of Computer Science and Engineering Raipur493661 India IIIT Naya Raipur Department of Data Science and Artificial Intelligence Raipur493661 India

From a broader perspective, the objective of visual speech recognition (VSR) is to comprehend the speech spoken by an individual using visual deformations. However, some of the significant limitations of existing solutions include the dearth of training data, improper end-to-end deployed solutions, lack of holistic feature representation, and less accuracy. To resolve these limitations, this study proposes a novel, scalable, and robust VSR system that uses the videotape of the user to determine the word which is being spoken. In this regard, a customized 3-D convolutional neural network (3-D CNN) architecture is proposed by extracting the spatio-temporal features and eventually mapping the prediction probabilities of the elements in the corpus. We have created a customized dataset resembling the metadata contained in the MIRACL-VC1 dataset to validate the concept of person-independence. While being robust to a broad spectrum of lighting conditions across multiple devices, our model achieves a training accuracy of 80.2% and a testing accuracy of 77.9% in predicting the word spoken by the user. © 2020 IEEE.

关键词： Speech recognition

来源：评论

学校读者我要写书评

暂无评论

Disease Diagnosis from Facial Alternations Using Ensemble CNNs

Disease Diagnosis from Facial Alternations Using Ensemble CN...

引用

2024 International Research Conference on Smart Computing and Systems Engineering, SCSE 2024

作者： Surasinghe, Pabasara Krishnapillai, Keerthiha Sabapathippillai, Papiththira Thanikasalam, Kokul University of Vavuniya Faculty of Applied Science Department of Physical Science Vavuniya Sri Lanka University of Peradeniya Faculty of Science Department of Statistics and Computer Science Kandy Sri Lanka University of Jaffna Faculty of Science Department of Computer Science Jaffna Sri Lanka

ISBN: (纸本)9798350375688

Identifying health conditions from facial images is crucial for the early detection of certain diseases and provides crucial information for timely intervention. This study introduces a novel ensemble convolutional neural network (CNN) classifier with a visualization technique for diagnosing Bell's palsy, Parry-Romberg syndrome, and Moebius syndrome. As the first step of this study, a dataset was constructed using publicly available images due to the unavailability of benchmark datasets to detect these diseases. The proposed ensemble CNN classifier combines the strengths of ResNet-50, VGG-16, and DenseNet-121 to classify diseases with high accuracy. In addition, a visualization technique was developed to identify the most influential facial regions for detecting these diseases. The proposed ensemble CNN classifier achieved a classification accuracy of 91.96% on the test set of the constructed dataset. © 2024 IEEE.

关键词： Visualization

来源：评论

学校读者我要写书评

暂无评论

Conditional dependence learning with high-dimensional conditioning variables

引用

science China Mathematics 2025年

作者： Jianxin Bi Xingdong Feng Jingyuan Liu Department of Statistics and Data Science in School of Economics Xiamen University School of Statistica and Management Shanghai University of Finance and Economics MOE Key Laboratory of Econometrics Department of Statistics and Data Science in School of EconomicsLaboratory of Digital Finance and Fujian Key Laboratory of Statistical ScienceXiamen University

Conditional dependence plays a crucial role in various statistical procedures, including variable selection, network analysis and causal inference. However, there remains a paucity of relevant research in the context of high-dimensional conditioning variables, a common challenge encountered in the era of big data. To address this issue, many existing studies impose certain model structures, yet high-dimensional conditioning variables often introduce spurious correlations in these models. In this paper, we systematically study the estimation biases inherent in widely-used measures of conditional dependence when spurious variables are present under high-dimensional settings. We discuss the estimation inconsistency both intuitively and theoretically,demonstrating that the conditional dependencies can be either overestimated or underestimated under different scenarios. To mitigate these biases and attain consistency, we introduce a measure based on data splitting and refitting techniques for high-dimensional conditional dependence. A conditional independence test is also developed using the newly advocated measure, with a tuning-free asymptotic null distribution. Furthermore,the proposed test is applied to generating high-dimensional network graphs in graphical modeling. The superior performances of newly proposed methods are illustrated both theoretically and through simulation studies. We also utilize the method to construct the gene-gene networks using a dataset of breast invasive carcinoma, which contains interesting discoveries that are worth further scientific exploration.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Age-of-Information-Aware Federated Learning

引用

Journal of computer science & Technology 2024年第3期39卷 637-653页

作者：徐殷肖明军吴晨吴杰周津锐孙贺 Student Member CCF 1.School of Computer Science and TechnologyUniversity of Science and Technology of ChinaHefei 230026China Suzhou Institute for Advanced Research University of Science and Technology of ChinaSuzhou 215123China School of Data Science University of Science and Technology of ChinaHefei 230026China Department of Computer and Information Sciences Temple UniversityPhiladelphiaPA 19122U.S.A. CCF IEEE

Federated learning(FL)is an emerging privacy-preserving distributed computing paradigm,enabling numerous clients to collaboratively train machine learning models without the necessity of transmitting clients’private datasets to the central *** most existing research where the local datasets of clients are assumed to be unchanged over time throughout the whole FL process,our study addresses such scenarios in this paper where clients’datasets need to be updated periodically,and the server can incentivize clients to employ as fresh as possible datasets for local model *** primary objective is to design a client selection strategy to minimize the loss of the global model for FL loss within a constrained *** this end,we introduce the concept of“Age of Information”(AoI)to quantitatively assess the freshness of local datasets and conduct a theoretical analysis of the convergence bound in our AoI-aware FL *** on the convergence bound,we further formulate our problem as a restless multi-armed bandit(RMAB)***,we relax the RMAB problem and apply the Lagrangian Dual approach to decouple it into multiple ***,we propose a Whittle’s Index Based Client Selection(WICS)algorithm to determine the set of selected *** addition,comprehensive simulations substantiate that the proposed algorithm can effectively reduce training loss and enhance the learning accuracy compared with some state-of-the-art methods.

关键词： federated learning Age of Information restless multi-armed bandit Whittle’s index

来源：评论

学校读者我要写书评

暂无评论

Spatial-structural analysis of macroeconomic factors’ impact on carbon emissions in East Africa: a spatial econometric panel study

引用

Environmental science and Pollution Research 2024年第39期31卷 51883-51901页

作者： Shakiru, Twahil Hemed Liu, Xiaohui Liu, Qing Khan, Muhammad Asif Department of Statistics University of Dar Es Salaam Dar Es Salaam Tanzania United Republic of School of Statistics and Data Science Jiangxi University of Finance and Economics Jiangxi China Earth System and Global Change Lab School of Environmental Science and Engineering Southern University of Science and Technology Shenzhen China

Despite the abundance of research on reducing carbon emissions, there is a significant gap in understanding the influence of macroeconomic factors on carbon dioxide (CO2) emissions from a spatial-structural perspective. This study aims to contribute to the literature by investigating the impact of macroeconomic factors on carbon dioxide emissions in six East African countries between 1989 and 2020. Using spatial econometric panel models, the study analyzed spatial dependence among the variables. The empirical findings indicate that gross domestic product (GDP) per capita and electricity consumption have positive direct and indirect effects on carbon emissions, while fuel prices and exports have negative direct effects, but positive spillover effects on neighboring countries. Imports have a positive impact on local economies, but negative spillover effects. Additionally, the urban population has no significant impact on the environment. These findings provide important policy implications for optimizing spatial growth patterns and achieving a low-carbon economy in East African countries. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024.

关键词： Carbon dioxide

来源：评论

学校读者我要写书评

暂无评论

New Inner Bounds for the Extreme Eigenvalues of Real Symmetric Matrices

引用

IAENG International Journal of Applied Mathematics 2024年第5期54卷 871-876页

作者： Singh, Pravin Singh, Shivani Singh, Virath Department of Mathematics Statistics and Computer Science University of KwaZulu-Natal Private Bag X54001 KZN Durban4001 South Africa Department of Decision Science University of South Africa PO Box 392 Gauteng Pretoria0003 South Africa Department of Mathematics Statistics and Computer Science University of KwaZulu-Natal Private Bag X54001 KZN Durban4001 South Africa

In this paper, we advocate a new technique to determine inner bounds for the extreme eigenvalues of real symmetric matrices. Our method involves the matrix elements and compares favourably with existing methods. We al... 详细信息

关键词： Eigenvalues and eigenfunctions

来源：评论

学校读者我要写书评

暂无评论

Active, anytime-valid risk controlling prediction sets 38

Active, anytime-valid risk controlling prediction sets

引用

38th Conference on Neural Information Processing Systems, NeurIPS 2024

作者： Xu, Ziyu Karampatziakis, Nikos Mineiro, Paul Department of Statistics and Data Science Carnegie Mellon University United States Microsoft United States

Rigorously establishing the safety of black-box machine learning models concerning critical risk measures is important for providing guarantees about model behavior. Recently, Bates et. al. (JACM'24) introduced the notion of a risk controlling prediction set (RCPS) for producing prediction sets that are statistically guaranteed low risk from machine learning models. Our method extends this notion to the sequential setting, where we provide guarantees even when the data is collected adaptively, and ensures that the risk guarantee is anytime-valid, i.e., simultaneously holds at all time steps. Further, we propose a framework for constructing RCPSes for active labeling, i.e., allowing one to use a labeling policy that chooses whether to query the true label for each received data point and ensures that the expected proportion of data points whose labels are queried are below a predetermined label budget. We also describe how to use predictors (i.e., the machine learning model for which we provide risk control guarantees) to further improve the utility of our RCPSes by estimating the expected risk conditioned on the covariates. We characterize the optimal choices of label policy and predictor under a fixed label budget and show a regret result that relates the estimation error of the optimal labeling policy and predictor to the wealth process that underlies our RCPSes. Lastly, we present practical ways of formulating label policies and empirically show that our label policies use fewer labels to reach higher utility than naive baseline labeling strategies on both simulations and real data. © 2024 Neural information processing systems foundation. All rights reserved.

关键词：

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：