Search results - Inner Mongolia University Library

Learning Hierarchical Modular Networks for Video Captioning

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, vol. 46, no. 2, pp. 1049-1064

Authors: Li, Guorong; Ye, Hanhua; Qi, Yuankai; Wang, Shuhui; Qing, Laiyun; Huang, Qingming; Yang, Ming-Hsuan

Affiliations: Univ Chinese Acad Sci Sch Comp Sci & Technol Key Lab Big Data Min & Knowledge Management Beijing 100049 Peoples R China; Univ Adelaide Australian Inst Machine Learning Adelaide SA 5005 Australia; Chinese Acad Sci Inst Comp Technol Key Lab Intelligent Informat Proc Beijing 100045 Peoples R China; Univ Calif Merced Merced CA 95343 USA; Yonsei Univ Seoul 03722 South Korea; Google Mountain View CA 94043 USA

Video captioning aims to generate natural language descriptions for a given video clip. Existing methods mainly focus on end-to-end representation learning via word-by-word comparison between predicted captions and ground-truth texts. Although significant progress has been made, such supervised approaches neglect semantic alignment between visual and linguistic entities, which may negatively affect the generated captions. In this work, we propose a hierarchical modular network to bridge video representations and linguistic semantics at four granularities before generating captions: entity, verb, predicate, and sentence. Each level is implemented by one module to embed corresponding semantics into video representations. Additionally, we present a reinforcement learning module based on the scene graph of captions to better measure sentence similarity. Extensive experimental results show that the proposed method performs favorably against the state-of-the-art models on three widely used benchmark datasets: Microsoft Research Video Description Corpus (MSVD), MSR-Video to Text (MSR-VTT), and Video-and-TEXt (VATEX).

Keywords: Video captioning; hierarchical modular network; scene-graph reward; reinforcement learning

Supervised Feature Selection via Collaborative Neurodynamic Optimization

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, vol. 35, no. 5, pp. 6878-6892

Authors: Wang, Yadi; Wang, Jun; Pal, Nikhil R.

Affiliations: Henan Univ Sch Comp & Informat Engn Henan Key Lab Big Data Anal & Proc Kaifeng 475004 Peoples R China; Henan Univ Sch Comp & Informat Engn Inst Data & Knowledge Engn Kaifeng 475004 Peoples R China; City Univ Hong Kong Dept Comp Sci Kowloon Hong Kong Peoples R China; City Univ Hong Kong Sch Data Sci Kowloon Hong Kong Peoples R China; Indian Stat Inst Ctr Artificial Intelligence & Machine Learning Kolkata 700108 India; Indian Stat Inst Elect & Commun Sci Unit Kolkata 700108 India

As a crucial part of machine learning and pattern recognition, feature selection aims at selecting a subset of the most informative features from the set of all available features. In this article, supervised feature selection is first formulated as a mixed-integer optimization problem with an objective function of weighted feature redundancy and relevancy subject to a cardinality constraint on the number of selected features. It is equivalently reformulated as a bound-constrained mixed-integer optimization problem by augmenting the objective function with a penalty function for realizing the cardinality constraint. With additional bilinear and linear equality constraints for realizing the integrality constraints, it is further reformulated as a bound-constrained biconvex optimization problem with two more penalty terms. Two collaborative neurodynamic optimization (CNO) approaches are proposed for solving the formulated and reformulated feature selection problems. One of the proposed CNO approaches uses a population of discrete-time recurrent neural networks (RNNs), and the other uses a pair of continuous-time projection networks operating concurrently on two timescales. Experimental results on 13 benchmark datasets are elaborated to substantiate the superiority of the CNO approaches to several mainstream methods in terms of average classification accuracy with three commonly used classifiers.

Keywords: Feature extraction; Optimization; Redundancy; Neurodynamics; Recurrent neural networks; Collaboration; Mutual information; Biconvex optimization; collaborative neurodynamic optimization (CNO); feature selection; mixed-integer optimization

Multi-UAV-Assisted Federated Learning for Energy-Aware Distributed Edge Training

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, vol. 21, no. 1, pp. 280-294

Authors: Tang, Jianhang; Nie, Jiangtian; Zhang, Yang; Xiong, Zehui; Jiang, Wenchao; Guizani, Mohsen

Affiliations: Guizhou Univ State Key Lab Publ Big Data Guiyang 550025 Peoples R China; Nanyang Technol Univ Sch Comp Sci & Engn Singapore Singapore; Nanjing Univ Aeronaut & Astronaut Coll Comp Sci & Technol Nanjing 210000 Peoples R China; Singapore Univ Technol & Design Pillar Informat Syst Technol & Design Singapore Singapore; Mohamed Bin Zayed Univ Artificial Intelligence Machine Learning Dept Abu Dhabi U Arab Emirates

Unmanned aerial vehicle (UAV)-assisted mobile edge computing (MEC) has largely extended the border and capacity of artificial intelligence of things (AIoT) by providing a key element for enabling flexible distributed data inputs, computing capacity, and high mobility. To enhance data privacy for AIoT applications, federated learning (FL) is becoming a potential solution to perform training tasks locally on distributed IoT devices. However, with the limited onboard resources and battery capacity of each UAV node, optimization is required to achieve a large-scale and high-precision FL scheme. In this work, an optimized multi-UAV-assisted FL framework is designed, where regular IoT devices are in charge of performing training tasks, and multiple UAVs are leveraged to execute local and global aggregation tasks. An online resource allocation (ORA) algorithm is proposed to minimize the training latency by jointly selecting the clients and a global aggregation server. By leveraging the Lyapunov optimization technique, virtual energy queues are studied to depict the energy deficit. With the help of the actor-critic learning framework, a deep reinforcement learning (DRL) scheme is designed to improve per-round training performance. A deep neural network (DNN)-based actor module is designed to derive client selection decisions, and a critic module is proposed through a conventional optimization method to evaluate the obtained selection decisions. Moreover, a greedy scheme is developed to find the optimal global aggregation server. Finally, extensive simulation results demonstrate that the proposed ORA algorithm can achieve optimal training latency and energy consumption under various system settings.

Keywords: UAV; federated learning; resource allocation; client selection; DRL
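The abstract above mentions virtual energy queues that track an energy deficit under Lyapunov optimization. As a rough illustration only, the sketch below uses the textbook virtual-queue update, in which the queue grows when a round consumes more than a per-round budget and drains otherwise. The function name, the budget value, and the exact update rule are assumptions made for this example; the record does not give the paper's formulation.

```python
def update_energy_queue(queue, energy_used, energy_budget):
    """Textbook virtual-queue update (assumed form, not taken from the paper):
    the backlog grows by the overspend and can never go below zero."""
    return max(queue + energy_used - energy_budget, 0.0)


# Hypothetical per-round energy consumption of one UAV, with a budget of 2 units.
usage_per_round = [3.0, 1.0, 4.0, 0.0]
budget = 2.0

queue = 0.0
history = []
for used in usage_per_round:
    queue = update_energy_queue(queue, used, budget)
    history.append(queue)

print(history)
```

A large queue value signals accumulated overspending, so a scheduler built on this idea would penalize energy-hungry decisions more heavily in later rounds.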

Blockchain Assisted Data Edge Verification With Consensus Algorithm for Machine Learning Assisted IoT

IEEE ACCESS, 2023, vol. 11, pp. 55370-55379

Authors: Vaiyapuri, Thavavel; Shankar, K.; Rajendran, Surendran; Kumar, Sachin; Acharya, Srijana; Kim, Hyunil

Affiliations: Prince Sattam bin Abdulaziz Univ Coll Comp Engn & Sci Al Kharj 11942 Saudi Arabia; Saveetha Inst Med & Tech Sci Saveetha Sch Engn Dept Comp Sci & Engn Chennai 602105 India; South Ural State Univ Big Data & Machine Learning Lab Chelyabinsk 454080 Russia; Kongju Natl Univ Dept Convergence Sci Gongju Si 32588 Chungcheongnam South Korea

Internet of Things (IoT) devices are becoming increasingly ubiquitous in daily life. They are utilized in various sectors like healthcare, manufacturing, and transportation. The main challenges related to IoT devices are the potential for faults to occur and their reliability. In classical IoT fault detection, the client device must upload raw information to the central server for model training, which can reveal sensitive business information. Blockchain (BC) technology and a fault detection algorithm are applied to overcome these challenges. Generally, the fusion of BC technology and fault detection algorithms can give a secure and more reliable IoT ecosystem. Therefore, this study develops a new Blockchain Assisted Data Edge Verification with Consensus Algorithm for Machine Learning (BDEV-CAML) technique for IoT fault detection purposes. The presented BDEV-CAML technique integrates the benefits of blockchain, IoT, and ML models to enhance the IoT network's trustworthiness, efficacy, and security. In BC technology, IoT devices that possess a significant level of decentralized decision-making capability can attain a consensus on the efficiency of intrablock transactions. For fault detection in the IoT network, the deep directional gated recurrent unit (DBiGRU) model is used. Finally, the African vulture optimization algorithm (AVOA) technique is utilized for the optimal hyperparameter tuning of the DBiGRU model, which helps in improving the fault detection rate. A detailed set of experiments was carried out to highlight the enhanced performance of the BDEV-CAML algorithm. The comprehensive experimental results showed the improved performance of the BDEV-CAML technique over other existing models, with a maximum accuracy of 99.6%.

Keywords: Internet of Things; Fault detection; Logic gates; Blockchains; Security; Tuning; Consensus algorithm; Blockchain; consensus algorithm; fault detection; deep learning; hyperparameter tuning

MULTI-SCALE CLINICAL-GUIDED BINOCULAR FUSION FRAMEWORK FOR PREDICTING NEW-ONSET HYPERTENSION OVER A FOUR-YEAR PERIOD

21st IEEE International Symposium on Biomedical Imaging (ISBI)

Authors: Li, Haoshen; Chen, Zifan; Zhao, Jie; Chen, Heyun; Dong, Hexin; Yuan, Mingze; Dong, Bin; Zhang, Li

Affiliations: Peking Univ Ctr Data Sci Beijing Peoples R China; Peking Univ Natl Engn Lab Big Data Anal & Applicat Beijing Peoples R China; Peking Univ Beijing Int Ctr Math Res Beijing Peoples R China; Peking Univ Ctr Machine Learning Res Beijing Peoples R China; Peking Univ Changsha Inst Comp & Digital Econ Beijing Peoples R China

ISBN: (print) 9798350313345; 9798350313338

Hypertension is a major global health concern, linked to various cardiovascular diseases and associated with distinct ocular manifestations. While recent advances in artificial intelligence have enabled accurate diagnosis of current hypertension through fundus images, predicting the future onset of hypertension remains an uncharted domain. In this study, we introduce the multi-scale clinical-guided binocular fusion framework (MCBO), designed to predict the likelihood of developing hypertension within the next four years. MCBO uniquely integrates left and right fundus images and clinical data, utilizing a shared-weight multi-stage Transformer-based encoder. Our multi-scale clinical-guided module (MCM) ensures image feature extraction is clinically contextualized based on clinical information, and our binocular fusion module (BFM) fuses binocular information. Comparative performance against seven baseline models establishes MCBO's supremacy, with improvements of 6.7% in Area Under Curve (AUC), 6.9% in Accuracy (ACC), 5.1% in Sensitivity (SEN) and 5.5% in Specificity (SPE). This approach offers a promising avenue for proactive hypertension management, underscoring the potential of integrating deep learning with clinical data for enhanced healthcare outcomes. Our code is available at https://***/HaoshenLi/MCBO.

Keywords: Future New-Onset Hypertension Prediction; Multi-Scale Feature Fusion; Fundus Image

BIM: Improving Graph Neural Networks with Balanced Influence Maximization

40th IEEE International Conference on Data Engineering, ICDE 2024

Authors: Zhang, Wentao; Gao, Xinyi; Yang, Ling; Cao, Meng; Huang, Ping; Shan, Jiulong; Yin, Hongzhi; Cui, Bin

Affiliations: Peking University Center for Machine Learning Research China; Institute of Advanced Algorithms Research Shanghai China; National Engineering Laboratory for Big Data Analytics and Applications; The University of Queensland Australia; Peking University Key Lab of High Confidence Software Technologies China; Apple Inc.; Institute of Computational Social Science Peking University Qingdao China

ISBN: (print) 9798350317152

The imbalanced data classification problem has aroused lots of concerns from both academia and industry since data imbalance is a widespread phenomenon in many real-world scenarios. Although this problem has been well researched from the view of imbalanced class samples, we further argue that graph neural networks (GNNs) expose a unique source of imbalance from the influenced nodes of different classes of labeled nodes, i.e., labeled nodes are imbalanced in terms of the number of nodes they influenced during the influence propagation in GNNs. To tackle this previously unexplored influence-imbalance issue, we connect social influence maximization with the imbalanced node classification problem and propose balanced influence maximization (BIM). Specifically, BIM greedily assigns the pseudo label to the node which can maximize the number of influenced nodes in GNN training while making the influence of each class more balanced. Experimental results on five public datasets demonstrate the effectiveness of our method in relieving the influence-imbalance issue. For example, when training a GCN with an imbalance ratio of 0.1, BIM significantly outperforms the most competitive baseline by 0.6%-9.8% in five public datasets in terms of the F1 score. © 2024 IEEE.

Keywords: Graph neural networks

NC-ALG: Graph-Based Active Learning under Noisy Crowd

40th IEEE International Conference on Data Engineering, ICDE 2024

Authors: Zhang, Wentao; Wang, Yexin; You, Zhenbang; Li, Yang; Cao, Gang; Yang, Zhi; Cui, Bin

Affiliations: Center for Machine Learning Research Peking University China; Key Lab of High Confidence Software Technologies Peking University China; Institute of Advanced Algorithms Research Shanghai China; Institute of Computational Social Science Peking University Qingdao China; National Engineering Laboratory for Big Data Analytics and Applications China; TEG Tencent Inc. Department of Data Platform China; Beijing Academy of Artificial Intelligence China

ISBN: (print) 9798350317152

Graph Neural Networks (GNNs) have achieved great success in various data mining tasks but they heavily rely on a large number of annotated nodes, requiring considerable human efforts. Despite the effectiveness of existing GNN-based active learning (AL) methods, they assume that the annotated labels are always correct, which is contradictory to the error-prone labeling process in a practical crowdsourcing environment. Besides, due to this impractical assumption, existing works only focus on optimizing the node selection in AL but neglect optimizing the labeling process. Therefore, we present NC-ALG, the first GNN-based AL framework that optimizes both the node selection and node labeling process under a noisy crowd. For node selection, NC-ALG introduces a new measurement to model influence reliability and an effective influence maximization objective to select nodes. For node labeling, NC-ALG significantly reduces the labeling cost by considering the model-predicted labels and the labels of mirror nodes. To the best of our knowledge, this is the first attempt to consider GNN-based AL under the practical noisy crowd. Empirical studies on public datasets demonstrate that NC-ALG significantly outperforms existing methods in terms of labeling efficiency. Notably, it only takes NC-ALG one-third of the labeling budget that the competitive baseline GRAIN needs to achieve an accuracy of 70.7% on PubMed. © 2024 IEEE.

Keywords: Crowdsourcing

An efficient fine-grained searchable encryption for secure communication in IoT-based vehicular social networks

TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2024, vol. 35, no. 2

Authors: Mamta; Sharma, Shivani; Kumar, Sachin

Affiliations: Punjab Engn Coll Dept Comp Sci & Engn Chandigarh India; Thapar Inst Engn & Technol Dept Comp Sci & Engn Patiala Punjab India; South Ural State Univ Big Data & Machine Learning Lab Chelyabinsk Russia

Vehicle data security is a fundamental requirement in any vehicular social network (VSN). Encryption is the key technology to address this requirement. However, with encryption, we lose all the accessibility to the data. Further, we may need to provide differential access capabilities to the different users. The attribute-based searchable encryption (ABSE) method satisfies all these requirements. It is a method for safely searching through encrypted files stored in a networked repository. It is a multi-user encryption method that combines the advantages of attribute-based encryption (ABE) with searchable encryption (SE). However, ABSE has an inherent cost and cannot be applied in a resource-constrained setting. Therefore, the proposed scheme aims to reduce the computational cost by readily accommodating frequent changes in the access structure and using a secret key and search trapdoor of constant size. This, in turn, reduces the bandwidth requirement as well. In addition, the suggested technique requires constant pairing operations during the search phase, making the search operation fast. Quantitatively, the secret key and trapdoor storage costs have been reduced to two and four-source group elements, respectively, and the number of bilinear pairing operations in the search algorithm has been reduced to four.

Keywords: Digital storage

A Tailored Particle Swarm and Egyptian Vulture Optimization-Based Synthetic Minority-Oversampling Technique for Class Imbalance Problem

INFORMATION, 2022, vol. 13, no. 8, p. 386

Authors: Rout, Subhashree; Mallick, Pradeep Kumar; Reddy, Annapareddy V. N.; Kumar, Sachin

Affiliations: Deemed Univ Kalinga Inst Ind Technol KIIT Sch Comp Engn Bhubaneswar 751024 Odisha India; Lakireddy Bali Reddy Coll Engn Dept Informat Technol Mylavaram 521230 Andhra Pradesh India; South Ural State Univ Big Data & Machine Learning Lab Chelyabinsk 454080 Russia

Class imbalance is one of the significant challenges in classification problems. The uneven distribution of data samples in different classes may occur due to human error, improper/unguided collection of data samples, etc. The uneven distribution of class samples among classes may affect the classification accuracy of the developed model. The main motivation behind this study is the design and development of methodologies for handling class imbalance problems. In this study, a new variant of the synthetic minority oversampling technique (SMOTE) has been proposed with the hybridization of particle swarm optimization (PSO) and Egyptian vulture (EV). The proposed method has been termed SMOTE-PSOEV in this study. The proposed method generates an optimized set of synthetic samples from traditional SMOTE and augments the five datasets for verification and validation. The SMOTE-PSOEV is then compared with existing SMOTE variants, i.e., Tomek Link, Borderline SMOTE1, Borderline SMOTE2, Distance SMOTE, and ADASYN. After data augmentation to the minority classes, the performance of SMOTE-PSOEV has been evaluated using support vector machine (SVM), Naive Bayes (NB), and k-nearest-neighbor (k-NN) classifiers. The results illustrate that the proposed models achieved higher accuracy than existing SMOTE variants.

Keywords: class imbalance problem; data augmentation; SMOTE; particle swarm optimization; Egyptian vulture
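For readers unfamiliar with the baseline that this record builds on, the sketch below shows the interpolation step of plain SMOTE: a synthetic sample is placed at a random point on the segment between a minority sample and one of its nearest minority-class neighbours. It does not implement the PSO or Egyptian vulture optimization described in the abstract; the function name, parameters, and toy data are hypothetical and chosen only for illustration.

```python
import math
import random


def smote_samples(minority, n_new, k=3, seed=0):
    """Plain SMOTE interpolation (illustrative sketch, not SMOTE-PSOEV)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # The k nearest minority-class neighbours of the chosen base sample.
        others = [p for p in minority if p is not base]
        neighbours = sorted(others, key=lambda p: math.dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Toy two-dimensional minority class (made-up values).
minority = [[1.0, 1.0], [1.2, 0.9], [0.8, 1.1], [1.1, 1.3]]
new_points = smote_samples(minority, n_new=5)
print(new_points)
```

Because every synthetic point is a convex combination of two existing minority samples, it always lies inside the bounding box of the minority class.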

Weakly Supervised Video Individual Counting

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Liu, Xinyan; Li, Guorong; Qi, Yuankai; Yan, Ziheng; Han, Zhenjun; van den Hengel, Anton; Yang, Ming-Hsuan; Huang, Qingming

Affiliations: Univ Chinese Acad Sci Beijing Peoples R China; Macquarie Univ Sydney NSW Australia; UCAS Key Lab Big Data Min & Knowledge Management Beijing Peoples R China; Chinese Acad Sci Key Lab Intel Info Proc Inst Comput Tech Beijing Peoples R China; Univ Adelaide Australian Inst Machine Learning Adelaide SA Australia; Univ Calif Merced Merced CA USA

ISBN: (print) 9798350353006

Video Individual Counting (VIC) aims to predict the number of unique individuals in a single video. Existing methods learn representations based on trajectory labels for individuals, which are annotation-expensive. To provide a more realistic reflection of the underlying practical challenge, we introduce a weakly supervised VIC task, wherein trajectory labels are not provided. Instead, two types of labels are provided to indicate traffic entering the field of view (inflow) and leaving the field of view (outflow). We also propose the first solution as a baseline that formulates the task as a weakly supervised contrastive learning problem under group-level matching. In doing so, we devise an end-to-end trainable soft contrastive loss to drive the network to distinguish inflow, outflow, and the remaining. To facilitate future study in this direction, we generate annotations from the existing VIC datasets SenseCrowd and CroHD and also build a new dataset, UAVVIC. Extensive results show that our baseline weakly supervised method outperforms supervised methods, and thus, little information is lost in the transition to the more practically relevant weakly supervised task. The code and trained model can be found at CGNet.
