To deal with very large datasets, a mini-batch version of the Monte Carlo Markov Chain Stochastic Approximation Expectation-Maximization (MCMC-SAEM) algorithm for general latent variable models is proposed. For exponential models, the algorithm is shown to converge under classical conditions as the number of iterations increases. Numerical experiments illustrate the performance of the mini-batch algorithm in various models. In particular, we highlight that mini-batch sampling yields a substantial speed-up in the convergence of the sequence of estimators generated by the algorithm. Moreover, insights into the effect of the mini-batch size on the limiting distribution are presented. Finally, we illustrate how to use mini-batch sampling in practice to improve results when a constraint on computing time is given.
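As an illustrative sketch only (not the paper's algorithm), the core mechanics of a mini-batch stochastic-approximation EM can be shown on a toy exponential-family model: each iteration runs the E-step on a mini-batch only, updates running sufficient statistics with a decaying step size satisfying the classical conditions, and recovers the parameters in closed form. The two-component Gaussian setting, step-size schedule, and all variable names below are hypothetical choices for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: two 1-D Gaussian clusters (known unit variance), unknown means.
X = np.concatenate([rng.normal(-2.0, 1.0, 5000), rng.normal(3.0, 1.0, 5000)])
rng.shuffle(X)

K, n = 2, X.shape[0]
mu = np.array([-1.0, 1.0])          # initial component means
pi = np.full(K, 1.0 / K)            # mixture weights
# Running averages of the sufficient statistics (per component):
s0 = np.full(K, 1.0 / K)            # E[responsibility]
s1 = mu * s0                        # E[responsibility * x]

batch_size = 100
for k in range(1, 501):
    xb = X[rng.integers(0, n, batch_size)]      # mini-batch sampling
    # E-step on the mini-batch: responsibilities under current parameters.
    logp = -0.5 * (xb[:, None] - mu[None, :]) ** 2 + np.log(pi)[None, :]
    r = np.exp(logp - logp.max(axis=1, keepdims=True))
    r /= r.sum(axis=1, keepdims=True)
    # Stochastic-approximation update of the sufficient statistics.
    gamma = 1.0 / k ** 0.7                      # decaying step size
    s0 = (1 - gamma) * s0 + gamma * r.mean(axis=0)
    s1 = (1 - gamma) * s1 + gamma * (r * xb[:, None]).mean(axis=0)
    # M-step: parameters in closed form from the averaged statistics.
    pi = s0 / s0.sum()
    mu = s1 / s0

print(np.sort(mu))   # approaches the true means (-2, 3)
```

Because only a mini-batch is touched per iteration, each step is cheap, while the averaged statistics still drive the estimators toward the full-data solution.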
Machine learning and deep learning advancements have boosted Brain-Computer Interface (BCI) performance, but their wide-scale applicability is limited by factors such as individual health, hardware variations, and cultural differences affecting neural data. Studies often focus on single-site experiments in uniform settings, producing high performance that may not translate well to real-world diversity. Deep learning models aim to enhance BCI classification accuracy, and transfer learning has been suggested to adapt models to individual neural patterns using a base model trained on others' data. This approach promises better generalizability and reduced overfitting, yet challenges remain in handling diverse and imbalanced datasets from different equipment, subjects, multiple centres in different countries, and both healthy and patient populations for effective model transfer and tuning. In a setting characterized by maximal heterogeneity, we propose P300 wave detection in BCIs using a convolutional neural network fitted with adaptive transfer learning based on Poisson Disk Sampling (PDS), called Active Sampling (AS), which flexibly adjusts the transition from the source data to the target domain. For subject-adaptive transfer with 40% adaptive fine-tuning, the averaged classification accuracy improved by 5.36% and the standard deviation was reduced by 12.22% on two distinct, internationally replicated datasets. These results outperform the alternatives in classification accuracy, computational time, and training efficiency, mainly due to the proposed Active Sampling (AS) method for transfer learning.
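The Poisson-disk idea behind sample selection can be sketched in a few lines. The following is a hedged illustration, not the paper's Active Sampling method: it greedily keeps a candidate only if it lies at least a minimum radius from every sample already kept, which yields a spatially diverse subset of target-domain examples for fine-tuning. The function name, radius, and the random feature matrix standing in for EEG epoch embeddings are all hypothetical.

```python
import numpy as np

def poisson_disk_select(features, radius, rng):
    """Greedy Poisson-disk-style selection: keep a candidate only if it is
    at least `radius` away (Euclidean) from every sample kept so far."""
    order = rng.permutation(len(features))
    kept = []
    for i in order:
        if all(np.linalg.norm(features[i] - features[j]) >= radius
               for j in kept):
            kept.append(i)
    return np.array(kept)

rng = np.random.default_rng(1)
feats = rng.normal(size=(300, 8))   # stand-in for EEG epoch embeddings
chosen = poisson_disk_select(feats, radius=3.0, rng=rng)
print(len(chosen), "spatially diverse samples selected for fine-tuning")
```

Selecting well-spread samples, rather than random ones, is one way to avoid redundant fine-tuning data when only a fraction (e.g., 40%) of the target domain is used.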
The application of intelligent reflecting surfaces (IRSs) depends on knowledge of channel state information (CSI) and has been hindered by the heavy overhead of channel training, estimation, and feedback in fast-changing channels. This paper presents a new two-timescale beamforming approach to maximize the average achievable rate (AAR) of IRS-assisted MIMO systems, where the IRS is configured relatively infrequently based on statistical CSI (S-CSI) while the base station precoder and power allocation are updated frequently based on quickly outdated instantaneous CSI (I-CSI). The key idea is that we first reveal that the optimal small-timescale power allocation based on outdated I-CSI yields a water-filling structure. Given the optimal power allocation, a new mini-batch sampling (mbs)-based particle swarm optimization (PSO) algorithm is developed to optimize the large-timescale IRS configuration with fewer channel samples. In addition, we develop a model-driven PSO algorithm that optimizes the IRS configuration by maximizing a lower bound of the AAR using only the S-CSI, eliminating the need for channel samples. The model-driven PSO serves as a dependable lower bound for the mbs-PSO. Simulations corroborate the superiority of the new two-timescale beamforming strategy over its alternatives in terms of AAR and efficiency, and demonstrate the benefits of the IRS.
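The water-filling structure referred to above admits a compact numerical sketch. The toy example below (hypothetical channel gains and power budget, not the paper's system model) allocates a total power budget across effective eigen-channel gains by bisection on the water level, pouring power into strong channels first and skipping channels whose inverse gain exceeds the water level.

```python
import numpy as np

def water_filling(gains, p_total):
    """Classic water-filling: maximize sum(log2(1 + p_i * g_i)) subject to
    sum(p_i) = p_total, p_i >= 0, via bisection on the water level mu."""
    lo, hi = 0.0, p_total + 1.0 / gains.min()
    for _ in range(100):
        mu = 0.5 * (lo + hi)
        # Power on channel i is max(mu - 1/g_i, 0): fill up to the water level.
        if np.maximum(mu - 1.0 / gains, 0.0).sum() > p_total:
            hi = mu
        else:
            lo = mu
    return np.maximum(0.5 * (lo + hi) - 1.0 / gains, 0.0)

gains = np.array([2.0, 1.0, 0.25, 0.05])   # effective eigen-channel gains
p = water_filling(gains, p_total=4.0)
print(np.round(p, 3), "sum =", round(p.sum(), 3))
```

With these gains and a budget of 4, the two strongest channels receive 2.25 and 1.75 while the weakest two (inverse gains 4 and 20, above the water level 2.75) get zero.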
ISBN: (print) 9798400701030
In-batch contrastive learning is a state-of-the-art self-supervised method that brings semantically similar instances close while pushing dissimilar instances apart within a mini-batch. Its key to success is the negative-sharing strategy, in which every instance serves as a negative for the others within the mini-batch. Recent studies aim to improve performance by sampling hard negatives within the current mini-batch, whose quality is bounded by the mini-batch itself. In this work, we propose to improve contrastive learning by sampling mini-batches from the input data. We present BatchSampler to sample mini-batches of hard-to-distinguish (i.e., hard and true negatives to each other) instances. To give each mini-batch fewer false negatives, we design a proximity graph of randomly selected instances. To form the mini-batch, we leverage random walk with restart on the proximity graph to sample hard-to-distinguish instances. BatchSampler is a simple and general technique that can be directly plugged into existing contrastive learning models in vision, language, and graphs. Extensive experiments on datasets of three modalities show that BatchSampler can consistently improve the performance of powerful contrastive models, as shown by significant improvements of SimCLR on ImageNet-100, SimCSE on STS (language), and GraphCL and MVGRL on graph datasets.
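The proximity-graph-plus-random-walk idea can be sketched as follows. This is a minimal illustration under assumed choices (kNN proximity graph, Euclidean distances, uniform transitions, restart probability 0.3), not BatchSampler's actual implementation: a random walk with restart from a seed node collects nearby, hence hard-to-distinguish, instances into one mini-batch, while the restart keeps the walk local to the seed.

```python
import numpy as np

def rwr_batch(features, start, batch_size, k=10, restart=0.3, rng=None):
    """Sample a mini-batch of mutually close instances: build a kNN proximity
    graph over embeddings, then run a random walk with restart from `start`
    and collect the first `batch_size` distinct nodes visited."""
    rng = rng or np.random.default_rng()
    d = np.linalg.norm(features[:, None] - features[None, :], axis=-1)
    np.fill_diagonal(d, np.inf)
    nbrs = np.argsort(d, axis=1)[:, :k]        # k nearest neighbours per node
    batch, node, steps = {start}, start, 0
    while len(batch) < batch_size and steps < 100 * batch_size:
        steps += 1
        # Restart keeps the walk near the seed; otherwise hop to a neighbour.
        node = start if rng.random() < restart else rng.choice(nbrs[node])
        batch.add(int(node))
    return np.array(sorted(batch))

rng = np.random.default_rng(2)
emb = rng.normal(size=(200, 16))               # stand-in for instance embeddings
batch = rwr_batch(emb, start=0, batch_size=32, rng=rng)
print(batch.shape)
```

Such a batch is plugged into the usual in-batch negative sharing: because its members are close in embedding space, the shared negatives are harder than those of a uniformly random batch.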
Authors: Lee, Si Woon; Kim, Ha Young
Affiliations: Ajou Univ, Dept Artificial Intelligence & Data Sci, Worldcupro 206, Suwon 16499, South Korea; Yonsei Univ, Grad Sch Informat, Yonsei Ro 50, Seoul 03722, South Korea
Forecasting stock market indexes is an important issue for market participants, because even a small improvement in forecast accuracy may lead to better trading decisions than those of other participants. Rising interest in deep learning has led to its application in stock market forecasting. However, it is still challenging to use market-size time-series data to predict composite index prices. In this study, we propose a new stock market forecasting framework, NuNet, which can successfully learn high-level features from super-high dimensional time-series data. NuNet is an end-to-end integrated neural network framework consisting of two feature extractor modules: a super-high dimensional market information feature extractor and a target index feature extractor. In addition, we propose a mini-batch sampling technique, trend sampling, which probabilistically samples more recent data during training. Furthermore, we propose a novel regularization method, called column-wise random shuffling, which is a data augmentation technique that can be applied to convolutional neural networks. The experiments are comprehensively carried out in three aspects for three indexes, namely S&P500, KOSPI200, and FTSE100. The results demonstrate that the proposed model outperforms all baseline models. Specifically, for the S&P500, KOSPI200, and FTSE100, the overall mean squared error of our proposed model NuNet(DA, T) is 60.79%, 51.29%, and 43.36% lower than that of the baseline model SingleNet(R), respectively. Moreover, we employ trading simulations with realistic transaction costs. Our proposed model also outperforms the buy-and-hold strategy, being on average 2.57 times more profitable across the three indexes. (c) 2020 Elsevier Ltd. All rights reserved.
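Recency-biased mini-batch sampling of the kind trend sampling describes can be sketched briefly. This is a hedged illustration under an assumed geometric decay, not the paper's exact sampling distribution: each row's sampling weight decays with its age, so batches skew toward recent observations while older data still appears occasionally.

```python
import numpy as np

def trend_sample(n, batch_size, decay=0.999, rng=None):
    """Draw mini-batch indices with probability decaying geometrically in the
    age of the observation; higher index = more recent row."""
    rng = rng or np.random.default_rng()
    age = np.arange(n)[::-1]                 # 0 for the newest row
    w = decay ** age                         # assumed geometric decay in age
    return rng.choice(n, size=batch_size, replace=False, p=w / w.sum())

rng = np.random.default_rng(3)
idx = trend_sample(5000, batch_size=256, rng=rng)
print(idx.mean())   # well above 2500: batches skew toward recent rows
```

The decay constant trades off emphasis on the latest market regime against retention of older patterns; with `decay=0.999` the effective window here is roughly the most recent thousand rows.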
ISBN: (print) 9783030260606; 9783030260613
In Automatic Speech Recognition (ASR), the acoustic model (AM) is modeled by a Deep Neural Network (DNN). The DNN learns a posterior probability in a supervised fashion utilizing input features and ground-truth labels. Current approaches combine a DNN with a Hidden Markov Model (HMM) in a hybrid approach, which has achieved good results in recent years. Similar approaches using a discrete version, i.e., a Discrete Hidden Markov Model (DHMM), have been disregarded in the recent past. Our approach revisits the idea of a discrete system, more precisely the so-called Deep Neural Network Quantizer (DNNQ), demonstrating how a DNNQ is created and trained. We introduce a novel approach to train a DNNQ in a supervised fashion with an arbitrary output layer size even though suitable target values are not available. The proposed method provides a mapping function exploiting fixed ground-truth labels. Consequently, we are able to apply frame-based cross-entropy (CE) training. Our experiments demonstrate that the DNNQ reduces the Word Error Rate (WER) by 17.6% on monophones and by 2.2% on triphones, compared to a continuous HMM-Gaussian Mixture Model (GMM) system.
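One plausible reading of such a mapping function can be sketched on toy data. The example below is speculative and not the paper's method: it maps each fixed ground-truth label to the quantizer output unit that label most frequently maximizes, which turns the label sequence into frame-level targets in the (larger) quantizer output space and so enables frame-based cross-entropy training. All sizes, names, and the synthetic label/unit correlation are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(4)

# Toy stand-ins: 1000 frames, 3 ground-truth phone labels, a quantizer with
# 8 output units (arbitrary size, larger than the number of labels).
n_frames, n_labels, n_units = 1000, 3, 8
labels = rng.integers(0, n_labels, n_frames)      # fixed ground-truth labels
logits = rng.normal(size=(n_frames, n_units))     # untrained quantizer outputs
logits[np.arange(n_frames), labels] += 2.0        # synthetic correlation (toy)

# Mapping function (sketch): each ground-truth label is mapped to the output
# unit it most frequently maximizes, giving CE targets in the quantizer space.
winners = logits.argmax(axis=1)
mapping = np.array([np.bincount(winners[labels == c], minlength=n_units).argmax()
                    for c in range(n_labels)])
targets = mapping[labels]                          # frame-level CE targets

# Frame-based cross entropy against the mapped targets.
p = np.exp(logits - logits.max(axis=1, keepdims=True))
p /= p.sum(axis=1, keepdims=True)
ce = -np.log(p[np.arange(n_frames), targets]).mean()
print(mapping, round(float(ce), 3))
```

The point of the sketch is only the shape of the problem: supervised CE training becomes possible once the fixed labels are mapped into the quantizer's arbitrary-size output space.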