检索结果-内蒙古大学图书馆

2023 IEEE International Conference on Big Data, BigData 2023

作者： Zu, Yi Mi, Jiacong Song, Lingning Lu, Shan He, Jieyue Southeast University School of Computer Science and Engineering Key Lab of Computer Network and Information Integration MOE Nanjing China Southeast University School of Software Engineering Nanjing China Nanjing Fenghuo Tiandi Communication Technology Co. Ltd Nanjing China

ISBN: (纸本)9798350324457

The core of quantitative investment lies in predicting future trends in stock prices. The future trend of a stock is closely related to the industry it belongs to and its relationship with other stocks. Although some research has focused on stock trend prediction in recent years, most studies have only considered the stock's own time series feature, neglecting the spatial features between stocks. Some research has incorporated spatial information, but typically only considered predefined static relationships. At the same time, capturing dynamic spatial information in the market has been a long-standing challenge. Thus, we propose a spatio-temporal model, Finformer, in order to go beyond traditional time series models. We designed a sparse static-dynamic transformer to capture dynamic market spatial information as it changes over time and combined predefined relationships to extract highly correlated spatial features in the stock market. To effectively integrate spatial and temporal features, we introduced an adaptive spatio-temporal fusion module that dynamically fuses spatio-temporal features based on market conditions at different periods. Experiments on two real-world stock market datasets show that our proposed model outperforms the state-of-the-art baselines in the signal-based and portfolio-based metrics, which are widely concerned in the financial field. Ablation study and hyper-parameter study further reveal the effectiveness of each module in the model and the impact of hyper-parameters. The code will be made publicly available. 1 © 2023 IEEE.

关键词： computational finance deep learning stock trend prediction transformer

来源：评论

学校读者我要写书评

暂无评论

Early Prediction of Heart Disease via LSTM-XGBoost 23

Early Prediction of Heart Disease via LSTM-XGBoost

引用

9th International Conference on Computing and Artificial Intelligence, ICCAI 2023

作者： Zang, Xiaodong Du, Jin Song, Yuansheng School of Cyber Science and Engineering Qufu Normal University Key Laboratory of Computer Network and Information Integration Ministry of Education Southeast University Nanjing211189 China School of Cyber Science and Engineering Qufu Normal University China

ISBN: (纸本)9781450399029

With the development of information and technology, especially with the boom in big data, healthcare support systems are becoming much better. However, an early diagnosis is not an easy task because it is hard to find out the hidden patterns in the disease. The symptoms of heart failure are usually non-specific. Therefore, this study aims to devise an early prediction model for incident heart disease based on machine learning algorithms. In this study, we compared several powerful models, including support vector machines (SVM), k-nearest neighbor (KNN), decision tree, extreme gradient boosting (XGBoost), random forest, gradient boosting decision tree (GBDT), categorical boosting (CatBoost), light gradient boosting machine (LightGBM), and choose two best models, namely, XGBoost and LSTM as our prediction model. The XGBoost algorithm was used to extract a subset from the data of patients, which will be fed into the LSTM-XGBoost model for training. An experiment comparison is conducted based on the data from Kaggle. Experimental results demonstrate that the LSTM-XGBoost model can operate effectively with a prediction accuracy achieving 0.9896 and a loss rate achieving 0.0105, which is much better than XGBoost or LSTM, respectively. © 2023 Copyright held by the owner/author(s).

关键词： Forecasting

来源：评论

学校读者我要写书评

暂无评论

Fast Multi-Instance Partial-Label Learning 39

Fast Multi-Instance Partial-Label Learning

引用

39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025

作者： Yang, Yin-Fang Tang, Wei Zhang, Min-Ling School of Computer Science and Engineering Southeast University Nanjing210096 China Key Laboratory of Computer Network and Information Integration Southeast University Ministry of Education China

ISBN: (纸本)157735897X

Multi-instance partial-label learning (MIPL) is a paradigm where each training example is encapsulated as a multi-instance bag associated with the candidate label set, which includes one true label and several false positives. Current MIPL algorithms typically assume that all instances are independent, thereby neglecting the dependencies and heterogeneity inherent in MIPL data. Moreover, these algorithms often prove to be excessively time-consuming when dealing with complex datasets, significantly limiting the practical application of MIPL. In this paper, we propose FASTMIPL, a framework that employs mixed-effects model to explicitly capture the dependencies and heterogeneity among instances and bags. FASTMIPL is able to learn from MIPL data both effectively and efficiently by utilizing the predefined dependencies modeling module and leveraging the posterior predictive probability disambiguation strategy. Experiments show that the performance of FASTMIPL is highly competitive to state-of-the-art methods, while significantly reducing computational time in benchmark and the real-world datasets. Copyright © 2025, Association for the Advancement of Artificial Intelligence (***). All rights reserved.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Benchmarking Temporal Reasoning and Alignment Across Chinese Dynasties

arXiv

引用

arXiv 2025年

作者： Wang, Zhenglin Wu, Jialong Li, Pengfei Jiang, Yong Zhou, Deyu School of Computer Science and Engineering Key Laboratory of Computer Network and Information Integration Ministry of Education Southeast University China Tongyi Lab Alibaba Group China

Temporal reasoning is fundamental to human cognition and is crucial for various real-world applications. While recent advances in Large Language Models have demonstrated promising capabilities in temporal reasoning, existing benchmarks primarily rely on rule-based construction, lack contextual depth, and involve a limited range of temporal entities. To address these limitations, we introduce Chinese Time Reasoning (CTM), a benchmark designed to evaluate LLMs on temporal reasoning within the extensive scope of Chinese dynastic chronology. CTM emphasizes cross-entity relationships, pairwise temporal alignment, and contextualized and culturally-grounded reasoning, providing a comprehensive evaluation. Extensive experimental results reveal the challenges posed by CTM and highlight potential avenues for improvement. Copyright © 2025, The Authors. All rights reserved.

关键词： Benchmarking

来源：评论

学校读者我要写书评

暂无评论

Compositional metric learning for multi-label classification

引用

Frontiers of computer Science 2021年第5期15卷 1-12页

作者： Yan-Ping SUN Min-Ling ZHANG School of Computer Science and Engineering Southeast UniversityNanjing 210096China Key Laboratory of Computer Network and Information Integration(Southeast University) Ministry of EducationChina Collaborative Innovation Center for Wireless Communications Technology Nanjing 211100China

Multi-label classification aims to assign a set of proper labels for each instance,where distance metric learning can help improve the generalization ability of instance-based multi-label classification *** multi-label metric learning techniques work by utilizing pairwise constraints to enforce that examples with similar label assignments should have close distance in the embedded feature *** this paper,a novel distance metric learning approach for multi-label classification is proposed by modeling structural interactions between instance space and label *** one hand,compositional distance metric is employed which adopts the representation of a weighted sum of rank-1 PSD matrices based on com-ponent *** the other hand,compositional weights are optimized by exploiting triplet similarity constraints derived from both instance and label *** to the compositional nature of employed distance metric,the resulting problem admits quadratic programming formulation with linear optimization complexity *** number of training *** also derive the generalization bound for the proposed approach based on algorithmic robustness analysis of the compositional *** experiments on sixteen benchmark data sets clearly validate the usefulness of compositional metric in yielding effective distance metric for multi-label classification.

关键词： machine learning multi-label learning metric learning compositional metric positive semidefinite matrix decomposition

来源：评论

学校读者我要写书评

暂无评论

Calibration bottleneck: over-compressed representations are less calibratable 24

Calibration bottleneck: over-compressed representations are ...

引用

Proceedings of the 41st International Conference on Machine Learning

作者： Deng-Bao Wang Min-Ling Zhang School of Computer Science and Engineering Southeast University Nanjing China and Key Lab. of Computer Network and Information Integration (Southeast University) MOE China

Although deep neural networks have achieved remarkable success, they often exhibit a significant deficiency in reliable uncertainty calibration. This paper focus on model calibratability, which assesses how amenable a model is to be well recalibrated post-hoc. We find that the widely used weight decay regularizer detrimentally affects model calibratability, subsequently leading to a decline in final calibration performance after post-hoc calibration. To identify the underlying causes leading to poor calibratability, we delve into the calibratability of intermediate features across the hidden layers. We observe a U-shaped trend in the calibratability of intermediate features from the bottom to the top layers, which indicates that over-compression of the top representation layers significantly hinders model calibratability. Based on the observations, this paper introduces a weak classifier hypothesis, i.e., given a weak classification head that has not been over-trained, the representation module can be better learned to produce more calibratable features. Consequently, we propose a progressively layer-peeled training (PLP) method to exploit this hypothesis, thereby enhancing model calibratability. Our comparative experiments show the effectiveness of our method, which improves model calibration and also yields competitive predictive performance.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Binary decomposition: a problem transformation perspective for open-set semi-supervised learning 24

Binary decomposition: a problem transformation perspective f...

引用

Proceedings of the 41st International Conference on Machine Learning

作者： Jun-Yi Hang Min-Ling Zhang School of Computer Science and Engineering Southeast University Nanjing China and Key Laboratory of Computer Network and Information Integration Southeast University Ministry of Education China

Semi-supervised learning (SSL) is a classical machine learning paradigm dealing with labeled and unlabeled data. However, it often suffers performance degradation in real-world open-set scenarios, where unlabeled data contains outliers from novel categories that do not appear in labeled data. Existing studies commonly tackle this challenging open-set SSL problem with detect-and-filter strategy, which attempts to purify unlabeled data by detecting and filtering outliers. In this paper, we propose a novel binary decomposition strategy, which refrains from error-prone procedure of outlier detection by directly transforming the original open-set SSL problem into a number of standard binary SSL problems. Accordingly, a concise yet effective approach named BDMatch is presented. BDMatch confronts two attendant issues brought by binary decomposition, i.e. class-imbalance and representation-compromise, with adaptive logit adjustment and label-specific feature learning respectively. Comprehensive experiments on diversified benchmarks clearly validate the superiority of BDMatch as well as the effectiveness of our binary decomposition strategy.

关键词：

来源：评论

学校读者我要写书评

暂无评论

RankMatch: A Novel Approach to Semi-Supervised Label Distribution Learning Leveraging Inter-label Correlations

arXiv

引用

arXiv 2023年

作者： Kou, Zhiqiang Xie, Yucheng Wang, Jing Jia, Yuheng Shi, Boyu Geng, Xin MOE Key Laboratory of Computer Network and Information Integration School of Computer Science and Engineering Southeast University Nanjing China

This paper introduces RankMatch, an innovative approach for Semi-Supervised Label Distribution Learning (SSLDL). Addressing the challenge of limited labeled data, RankMatch effectively utilizes a small number of labeled examples in conjunction with a larger quantity of unlabeled data, reducing the need for extensive manual labeling in Deep Neural network (DNN) applications. Specifically, RankMatch introduces an ensemble learning-inspired averaging strategy that creates a pseudo-label distribution from multiple weakly augmented images. This not only stabilizes predictions but also enhances the model’s robustness. Beyond this, RankMatch integrates a pairwise relevance ranking (PRR) loss, capturing the complex inter-label correlations and ensuring that the predicted label distributions align with the ground truth. We establish a theoretical generalization bound for RankMatch, and through extensive experiments, demonstrate its superiority in performance against existing SSLDL methods. Copyright © 2023, The Authors. All rights reserved.

关键词： Deep neural networks

来源：评论

学校读者我要写书评

暂无评论

PROMIPL: A Probabilistic Generative Model for Multi-Instance Partial-Label Learning

PROMIPL: A Probabilistic Generative Model for Multi-Instance...

引用

IEEE International Conference on Data Mining (ICDM)

作者： Yin-Fang Yang Wei Tang Min-Ling Zhang School of Computer Science and Engineering Southeast University Nanjing China Key Laboratory of Computer Network Information Integration (Southeast University) Ministry of Education China

ISBN: (数字)9798331506681

ISBN: (纸本)9798331506698

Multi-instance partial-label learning (MIPL) tackles scenarios where each training sample is represented as a multiinstance bag associated with a candidate label set. This set contains one true label and several false positives. Existing MIPL algorithms have predominantly focused on mapping multiinstance bags to candidate label sets for disambiguation. However, these algorithms may not be adequately generalizable in intricate real-world situations due to their reliance on heuristic methods for identifying true labels. In this paper, we propose PROMIPL, i.e., a PRObabilistic generative model for Multi-instance partiallabel learning, to address these challenges. PROMIPL is the first attempt to explore the probabilistic generative model to infer latent ground-truth labeling information from the data generation process in multi-instance partial-label learning. Besides, the discovered underlying structures also provide improved explanations of the classification predictions. To circumvent the computationally intensive process of training the generative model, we formulate a unified variational lower bound within the stochastic gradient variational Bayesian framework for the model parameters. Experimental results from benchmark and realworld datasets show that our proposed PROMIPL is competitive or superior to the state-of-the-art methods.

关键词： Training Lower bound Computational modeling Heuristic algorithms Stochastic processes Probabilistic logic Data models Bayes methods Labeling Data mining

来源：评论

学校读者我要写书评

暂无评论

Joint Optimization of UAV Deployment and Task Computation Offloading Decision in UAV-assisted Edge Computing network

Joint Optimization of UAV Deployment and Task Computation Of...

引用

IEEE International Conference on High Performance Computing and Communications (HPCC)

作者： Yichuan Liu Jinbin Tu Yun Wang Key Lab of Computer Network and Information Integration MOE School of Computer Science and Engineering Southeast University Nanjing China

The UAVs' deployment decision and task computation offloading decision in the UAV-assisted edge computing network significantly impact the operating efficiency of edge network. On the basis of this, the Optimization Model for UAV Cluster Deployment and Computation Offloading Decision (OMUCDCOD) is established. The model jointly optimizes the number, location of UAV s, and task computation offloading decision. Different from previous studies, this model regards the terminal devices in the edge network as virtual MEC servers, and introduces the collaborative computing mode. The task computation offloading decision made can effectively utilize the computing capabilities offered by the edge network. Considering that the two problems of UAV deployment decision and task computation offloading decision are intricately interconnected, we propose a two-layer optimization algorithm combining K-Means and ant colony algorithm (ToKmAc) to solve OMUCDCOD. ToKmAc is divided into upper and lower layers to solve this optimization problem. The upper layer uses K-Means to solve the UAV deployment decision, that is, the number and location of UAVS; the lower layer employs ant colony algorithm to solve computation offloading decision. Finally, extensive experiments verify the effectiveness of OMUCDCOD and ToKmAc.

关键词：

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：