Search Results - Inner Mongolia University Library

Robust video question answering via contrastive cross-modality representation learning

Science China (Information Sciences), 2024, Vol. 67, No. 10, pp. 211-226

Authors: Xun YANG, Jianming ZENG, Dan GUO, Shanshan WANG, Jianfeng DONG, Meng WANG

Affiliations: School of Information Science and Technology, University of Science and Technology of China; Institute of Artificial Intelligence, Hefei Comprehensive National Science Center; School of Computer Science and Information Engineering, Hefei University of Technology; Institutes of Physical Science and Information Technology, Anhui University; School of Computer Science and Technology, Zhejiang Gongshang University

Video question answering (VideoQA) is a challenging yet important task that requires a joint understanding of low-level video content and high-level textual semantics. Despite the promising progress of existing efforts, recent studies revealed that current VideoQA models tend to over-rely on superficial correlations rooted in dataset bias while overlooking the key video content, which leads to unreliable results. Effectively understanding and modeling the temporal and semantic characteristics of a given video for robust VideoQA is crucial but, to our knowledge, has not been well investigated. To fill this research gap, we propose a robust VideoQA framework that effectively models cross-modality fusion and forces the model to focus on the temporal and global content of videos when making a QA decision instead of exploiting shortcuts in datasets. Specifically, we design a self-supervised contrastive learning objective to contrast the positive and negative pairs of multimodal input, where the fused representation of the original multimodal input is enforced to be closer to that of the intervened input based on video perturbation. We expect the fused representation to focus more on the global context of videos rather than on a few static keyframes. Moreover, we introduce an effective temporal order regularization to enforce the inherent sequential structure of videos for video representation. We also design a Kullback-Leibler divergence-based perturbation invariance regularization of the predicted answer distribution to improve the robustness of the model against temporal content perturbation of videos. Our method is model-agnostic and readily compatible with various VideoQA backbones. Extensive experimental results and analyses on several public datasets show the advantage of our method over state-of-the-art methods in terms of both accuracy and robustness.

Keywords: video question answering; cross-modality fusion; contrastive learning; cross-media reasoning
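The Kullback-Leibler perturbation invariance regularization mentioned in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the direction of the divergence, the answer scores, and the function names (`softmax`, `kl_divergence`, `perturbation_invariance_penalty`) are all assumptions made for illustration. The idea shown is only that the predicted answer distribution for the original video and for a perturbed copy should stay close.

```python
import math


def softmax(logits):
    """Convert raw answer scores into a probability distribution."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def perturbation_invariance_penalty(logits_original, logits_perturbed):
    """Penalty that grows as the two predicted answer distributions diverge."""
    p = softmax(logits_original)
    q = softmax(logits_perturbed)
    return kl_divergence(p, q)


# Hypothetical answer scores for one question, before and after
# perturbing the temporal content of the video.
original = [2.0, 0.5, -1.0]
perturbed = [1.8, 0.7, -0.9]
print(perturbation_invariance_penalty(original, perturbed))
```

In a training loop, a term like this would typically be added to the main QA loss with a weighting coefficient; the abstract does not state how the terms are combined.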

Privacy-preserving explainable AI: a survey

Science China (Information Sciences), 2025, Vol. 68, No. 1, pp. 23-56

Authors: Thanh Tam NGUYEN, Thanh Trung HUYNH, Zhao REN, Thanh Toan NGUYEN, Phi Le NGUYEN, Hongzhi YIN, Quoc Viet Hung NGUYEN

Affiliations: School of Information and Communication Technology, Griffith University; School of Computer and Communication Sciences, Ecole Polytechnique Federale de Lausanne; Faculty of Mathematics and Computer Science, University of Bremen; Faculty of Information Technology, HUTECH University; Department of Computer Science, Hanoi University of Science and Technology; School of Electrical Engineering and Computer Science, The University of Queensland

As the adoption of explainable AI (XAI) continues to expand, the urgency to address its privacy implications intensifies. Despite a growing corpus of research in AI privacy and explainability, little attention has been paid to privacy-preserving model explanations. This article presents the first thorough survey of privacy attacks on model explanations and their countermeasures. Our contribution to this field comprises a thorough analysis of research papers with a connected taxonomy that facilitates the categorization of privacy attacks and countermeasures based on the targeted explanations. This work also includes an initial investigation into the causes of privacy leaks. Finally, we discuss unresolved issues and prospective research directions uncovered in our analysis. This survey aims to be a valuable resource for the research community and offers clear insights for those new to this domain. To support ongoing research, we have established an online resource repository, which will be continuously updated with new and relevant findings.

Keywords: privacy-preserving explainable AI; privacy attacks; privacy defences; PrivEx; PPXAI

Modern Machine Learning Solution for Electricity Consumption Management in Smart Buildings

IEEE Engineering Management Review, 2025, Vol. 53, No. 1, pp. 54-62

Authors: Gautam, Sandeep Kumar; Shrivastava, Vinayak; Udmale, Sandeep S.; Singh, Amit Kumar; Singh, Sanjay Kumar

Affiliations: Department of Computer Science and Engineering, Varanasi 221005, India; Department of Computer Engineering and Information Technology, Mumbai 400019, India; National Institute of Technology, Department of Computer Science and Engineering, Patna 800005, India

Effective management of electricity consumption (EC) in smart buildings (SBs) is crucial for optimizing operational efficiency, saving costs, and ensuring sustainable resource utilization. Accurate EC prediction enables proactive decision-making, ensuring that resources are allocated efficiently to meet actual demand levels while maintaining occupant comfort. Population growth, building expansion, and technology usage swiftly escalate electricity demand, necessitating economical EC management strategies that help consumers better understand and strategically plan their EC. To address these challenges, this article proposes a novel approach based on a hybrid prediction model combining temporal convolutional networks (TCNs) and gated recurrent units (GRUs). This approach capitalizes on the strengths of both TCN and GRU. TCN is adept at efficiently identifying diverse patterns, particularly within the complex working environments of SBs, by effectively capturing high- and low-frequency information. Subsequently, GRU is leveraged to address the long-term dependencies within the data, enhancing the accuracy of EC prediction. The results demonstrate the effectiveness of the proposed hybrid model, which outperforms competitive methods in terms of mean absolute error. This underscores the potential of this approach to improve energy management practices significantly within SB environments, ultimately enhancing both operational efficiency and occupant satisfaction. © 1973-2011 IEEE.

Keywords: Forecasting

DNACDS: Cloud IoE big data security and accessing scheme based on DNA cryptography

Frontiers of Computer Science, 2024, Vol. 18, No. 1, pp. 157-170

Authors: Ashish SINGH, Abhinav KUMAR, Suyel NAMASUDRA

Affiliations: School of Computer Engineering, KIIT Deemed to be University, Bhubaneshwar 751024, India; Department of Computer Science and Engineering, Indian Institute of Information Technology Surat, Surat 394190, India; Department of Computer Science and Engineering, National Institute of Technology Agartala, Agartala 799046, India

The Internet of Everything (IoE) based cloud computing is one of the most prominent areas in the digital big data *** approach allows efficient infrastructure to store and access big real-time data and smart IoE services from the *** IoE-based cloud computing services are located at remote locations without the control of the data *** data owners mostly depend on the untrusted Cloud Service Provider (CSP) and do not know the implemented security *** lack of knowledge about security capabilities and control over data raises several security *** Acid (DNA) computing is a biological concept that can improve the security of IoE big *** IoE big data security scheme consists of the Station-to-Station Key Agreement Protocol (StS KAP) and Feistel cipher *** paper proposed a DNA-based cryptographic scheme and access control model (DNACDS) to solve IoE big data security and access *** experimental results illustrated that DNACDS performs better than other DNA-based security *** theoretical security analysis of the DNACDS shows better resistance capabilities.

Keywords: IoE based cloud computing; DNA cryptography; IoE big data security; StS KAP; Feistel cipher; IoE big data access

Fuzzy clustering for electric field characterization and its application to thunderstorm interpretability

Digital Communications and Networks, 2025, Vol. 11, No. 2, pp. 299-307

Authors: Xu Yang, Hongyan Xing, Xinyuan Ji, Wei Xu, Witold Pedrycz

Affiliations: School of Electronics and Information Engineering, Nanjing University of Information Science and Technology; Department of Electrical and Computer Engineering, University of Alberta; Systems Research Institute, Polish Academy of Sciences; Department of Computer Engineering, Faculty of Engineering and Natural Sciences, Istinye University

Changes in the Atmospheric Electric Field Signal (AEFS) are highly correlated with weather changes, especially with thunderstorm activities. However, little attention has been paid to the ambiguous weather information implicit in AEFS changes. In this paper, a Fuzzy C-Means (FCM) clustering method is used for the first time to develop an innovative approach to characterize the weather attributes carried by AEFS. First, a time series dataset is created in the time domain using AEFS attributes. The AEFS-based weather is evaluated according to the time-series Membership Degree (MD) changes obtained by inputting this dataset into the FCM. Second, thunderstorm intensities are reflected by the change in distance from a thunderstorm cloud point charge to an AEF apparatus. Thus, a matching relationship is established between the normalized distance and the thunderstorm dominant MD in the space domain. Finally, the rationality and reliability of the proposed method are verified by combining radar charts and expert experience. The results confirm that this method accurately characterizes the weather attributes and changes in the AEFS, and a negative distance-MD correlation is obtained for the first time. The detection of thunderstorm activity by AEF from the perspective of fuzzy set technology provides meaningful guidance for interpretable thunderstorms.

Keywords: Atmospheric electric field (AEF); Thunderstorm; Fuzzy C-means (FCM); Attribute
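The membership degrees (MD) that the abstract above obtains from Fuzzy C-Means can be sketched with the standard FCM update rules. This is a generic textbook version on hypothetical one-dimensional samples, not the paper's AEFS pipeline: the data values, the initial centers, the fuzzifier `m=2.0`, and the function names are assumptions chosen for illustration.

```python
def fcm_memberships(point, centers, m=2.0):
    """Membership degrees of one 1-D sample with respect to each cluster center."""
    distances = [abs(point - c) for c in centers]
    # A sample sitting exactly on a center belongs fully to that center.
    if any(d == 0.0 for d in distances):
        hits = [1.0 if d == 0.0 else 0.0 for d in distances]
        return [h / sum(hits) for h in hits]
    power = 2.0 / (m - 1.0)
    return [
        1.0 / sum((d_i / d_k) ** power for d_k in distances)
        for d_i in distances
    ]


def fuzzy_c_means(points, centers, m=2.0, iterations=50):
    """Alternate the membership update and the center update a fixed number of times."""
    for _ in range(iterations):
        u = [fcm_memberships(x, centers, m) for x in points]
        centers = [
            sum((u[i][j] ** m) * points[i] for i in range(len(points)))
            / sum(u[i][j] ** m for i in range(len(points)))
            for j in range(len(centers))
        ]
    return centers, [fcm_memberships(x, centers, m) for x in points]


# Hypothetical signal attribute values forming a "calm" and a "stormy" group.
samples = [0.1, 0.2, 0.15, 5.0, 5.2, 4.9]
final_centers, degrees = fuzzy_c_means(samples, centers=[0.0, 1.0])
print(final_centers)
print(degrees[0], degrees[3])
```

Each row of `degrees` sums to 1, so a sample between the two groups would receive a split membership rather than a hard label, which is the property the abstract relies on for "ambiguous" weather information.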

A Review of Image Steganography Based on Multiple Hashing Algorithm

Computers, Materials & Continua, 2024, Vol. 80, No. 8, pp. 2463-2494

Authors: Abdullah Alenizi, Mohammad Sajid Mohammadi, Ahmad A. Al-Hajji, Arshiya Sajid Ansari

Affiliations: Department of Information Technology, College of Computer and Information Sciences, Majmaah University, Al-Majmaah 11952, Saudi Arabia; Department of Computer Science, College of Engineering and Information Technology, Onaizah Colleges, Qassim 56312, Saudi Arabia

Steganography is a technique for hiding secret messages while sending and receiving communications through a cover *** ancient times to the present, the security of secret or vital information has always been a significant *** development of secure communication methods that keep recipient-only data transmissions secret has always been an area of ***, several approaches, including steganography, have been developed by researchers over time to enable safe data *** this review, we have discussed image steganography based on Discrete Cosine Transform (DCT) algorithm, *** have also discussed image steganography based on multiple hashing algorithms like the Rivest–Shamir–Adleman (RSA) method, the Blowfish technique, and the hash-least significant bit (LSB) *** this review, a novel method of hiding information in images has been developed with minimal variance in image bits, making our method secure and effective. A cryptography mechanism was also used in this *** encoding the data and embedding it into a carry image, this review verifies that it has been ***, embedded text in photos conveys crucial signals about the *** review employs hash table encryption on the message before hiding it within the picture to provide a more secure method of data *** the message is ever intercepted by a third party, there are several ways to stop this operation. A second level of security process implementation involves encrypting and decrypting steganography images using different hashing algorithms.

Keywords: Image steganography; multiple hashing algorithms; Hash-LSB approach; RSA algorithm; discrete cosine transform (DCT) algorithm; Blowfish algorithm
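The least significant bit (LSB) idea named in the abstract above can be illustrated with a minimal sketch. It shows plain sequential LSB embedding only, not the hash-LSB variant or any encryption step described in the review. The cover bytes stand in for pixel values, and the function names `embed_lsb` and `extract_lsb` are hypothetical.

```python
def embed_lsb(cover: bytes, message: bytes) -> bytes:
    """Hide each message bit in the lowest bit of one cover byte."""
    bits = [(byte >> shift) & 1 for byte in message for shift in range(7, -1, -1)]
    if len(bits) > len(cover):
        raise ValueError("cover is too small for the message")
    stego = bytearray(cover)
    for index, bit in enumerate(bits):
        # Clear the lowest bit, then set it to the message bit.
        stego[index] = (stego[index] & 0xFE) | bit
    return bytes(stego)


def extract_lsb(stego: bytes, message_length: int) -> bytes:
    """Read back message_length bytes from the lowest bits of the stego data."""
    bits = [byte & 1 for byte in stego[: message_length * 8]]
    out = bytearray()
    for start in range(0, len(bits), 8):
        value = 0
        for bit in bits[start:start + 8]:
            value = (value << 1) | bit
        out.append(value)
    return bytes(out)


# Hypothetical cover data: 64 "pixel" bytes.
cover = bytes(range(64))
stego = embed_lsb(cover, b"hi")
print(extract_lsb(stego, 2))  # prints b'hi'
```

Because only the lowest bit of each byte can change, every cover value moves by at most 1, which is the sense in which such schemes keep the variance in image bits small.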

A systematic review on deep learning implementation in brain tumor segmentation, classification and prediction

Multimedia Tools and Applications, 2025, pp. 1-40

Authors: Abid, Muhammad Adeel; Munir, Kashif

Affiliations: Institute of Computer Science, Khwaja Fareed University of Engineering and Information Technology, Rahim Yar Khan, Pakistan; Institute of Information Technology, Khwaja Fareed University of Engineering and Information Technology, Rahim Yar Khan, Pakistan

The brain is the central part of the body that controls the overall functionality of the human body. The formation of abnormal cells in the brain may lead to a brain tumor. Manual examination of a brain tumor is challenging and time-consuming. Deep learning, which is based on neural networks, is beneficial in identifying and diagnosing brain tumors. Different groups of researchers have put effort into implementing deep learning for the classification, segmentation, and prediction of brain tumors. They proposed different deep learning models and reported accuracy, in terms of Dice score or other evaluation parameters, that is quite reasonable and acceptable. The primary purpose of this study is to review the different articles implementing deep learning in the segmentation, classification, and prediction of brain tumors. Appropriate keywords are used to extract articles from January 2018 to September 2023. Evaluation is done based on the implementation of deep learning and the accuracy of the proposed model, and a complete data sheet is maintained for each article. A total of 154 articles were collected, and 80 research articles were selected for this study after complete analysis. Each selected article proposed models based on deep learning with reasonable and acceptable accuracy in identifying the brain tumor. 2-D CNN and 3-D CNN are used on Magnetic Resonance Imaging (MRI) images of the brain for brain tumor detection. Thus, much more attention is required in the future on this topic to improve the accuracy of brain tumor identification. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2025.

Keywords: Magnetic resonance imaging

Hybrid CatBoost and SVR Model for Earthquake Prediction Using the LANL Earthquake Dataset

Informatica (Slovenia), 2025, Vol. 49, No. 14, pp. 93-110

Authors: Kaushal, Arush; Gupta, Ashok Kumar; Sehgal, Vivek Kumar

Affiliations: Department of Computer Science and Information Technology, Jaypee University of Information Technology, Solan 171234, India; Department of Civil Engineering, Jaypee University of Information Technology, Solan 171234, India

Earthquakes have the potential to cause catastrophic structural and economic damage. This research explores the application of machine learning for earthquake prediction using the LANL (Los Alamos National Laboratory) dataset. The data, obtained from a laboratory stick-slip friction experiment, simulate real earthquakes through digitized acoustic signals recorded against the time to failure of a granular layer. We introduced a hybrid model combining CatBoost and Support Vector Regression (SVR) to predict the time of the next earthquake, evaluating its performance against individual CatBoost and SVR models. The hybrid model demonstrated superior accuracy with a Mean Absolute Error (MAE) of 0.0825, outperforming the individual models. We implemented feature engineering to optimize the predictive capability of the models. Additionally, we compared our hybrid model's performance with previous studies to validate its efficacy. Our findings underscore the potential of machine learning, particularly hybrid models, in enhancing earthquake prediction accuracy. This study highlights the robustness and effectiveness of the hybrid CatBoost-SVR model, paving the way for advanced AI algorithms in seismology and contributing to improved disaster preparedness and mitigation strategies. © 2025 Slovene Society Informatika. All rights reserved.

Keywords: Stick-slip
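The Mean Absolute Error (MAE) used in the abstract above to compare the hybrid and individual models can be sketched as follows. The abstract does not say how the CatBoost and SVR outputs are combined, so the weighted average in `blend` is purely an assumption, and all numbers are hypothetical values chosen so that the two models' errors cancel.

```python
def mean_absolute_error(actual, predicted):
    """Average absolute difference between targets and predictions."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def blend(pred_a, pred_b, weight=0.5):
    """Assumed combination rule: weighted average of two models' predictions."""
    return [weight * a + (1.0 - weight) * b for a, b in zip(pred_a, pred_b)]


# Hypothetical time-to-failure targets and two hypothetical model outputs.
time_to_failure = [1.0, 2.0, 3.0]
model_a = [1.5, 2.5, 3.5]  # overestimates by 0.5
model_b = [0.5, 1.5, 2.5]  # underestimates by 0.5

print(mean_absolute_error(time_to_failure, model_a))
print(mean_absolute_error(time_to_failure, model_b))
print(mean_absolute_error(time_to_failure, blend(model_a, model_b)))
```

The example is contrived: blending helps here only because the two hypothetical models err in opposite directions, which is the general motivation for combining models with different error patterns rather than a guarantee.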

ContextAug: model-domain failing test augmentation with contextual information

Frontiers of Computer Science, 2024, Vol. 18, No. 2, pp. 43-60

Authors: Zhuo ZHANG, Jianxin XUE, Deheng YANG, Xiaoguang MAO

Affiliations: School of Information Technology and Engineering, Guangzhou College of Commerce, Guangzhou 511363, China; School of Computer and Information Engineering, Institute for Artificial Intelligence, Shanghai Polytechnic University, Shanghai 201209, China; College of Computer, National University of Defense Technology, Changsha 410073, China

In the process of software development, the ability to localize faults is crucial for improving the efficiency of *** speaking, detecting and repairing errant behavior at an early stage of the development cycle considerably reduces costs and development *** have tried to utilize various methods to locate the faulty ***, failing test cases usually account for a small portion of the test suite, which inevitably leads to the class-imbalance phenomenon and hampers the effectiveness of fault ***, in this work, we propose a new fault localization approach named *** obtaining dynamic execution through test cases, ContextAug traces these executions to build an information model; subsequently, it constructs a failure context with propagation dependencies to intersect with new model-domain failing test samples synthesized by the minimum variability of the minority feature *** contrast to traditional test generation directly from the input domain, ContextAug seeks a new perspective to synthesize failing test samples from the model domain, which is much easier to augment test *** conducting empirical research on real large-sized programs with 13 state-of-the-art fault localization approaches, ContextAug could significantly improve fault localization effectiveness with up to 54.53%. Thus, ContextAug is verified as able to improve fault localization effectiveness.

Keywords: context; fault localization; test cases

Domain generalization with semi-supervised learning for people-centric activity recognition

Science China (Information Sciences), 2025, Vol. 68, No. 1, pp. 171-188

Authors: Jing LIU, Wei ZHU, Di LI, Xing HU, Liang SONG

Affiliations: Academy for Engineering & Technology, Fudan University; Shanghai East-bund Research Institute on Networking Systems of AI; School of Optoelectronic Information and Computer Engineering, University of Shanghai for Science & Technology

People-centric activity recognition is one of the most critical technologies in a wide range of real-world applications, including intelligent transportation systems, healthcare services, and brain-computer interfaces. Large-scale data collection and annotation make the application of machine learning algorithms prohibitively expensive when adapting to new tasks. One way of circumventing this limitation is to train the model in a semi-supervised learning manner that utilizes a percentage of unlabeled data to reduce the labeling burden in prediction tasks. Despite their appeal, these models often assume that labeled and unlabeled data come from similar distributions, which leads to the domain shift problem caused by the presence of distribution gaps. To address these limitations, we propose herein a novel method for people-centric activity recognition, called domain generalization with semi-supervised learning (DGSSL), that effectively enhances the representation learning and domain alignment capabilities of a model. We first design a new autoregressive discriminator for adversarial training between unlabeled and labeled source domains, extracting domain-specific features to reduce the distribution gaps. Second, we introduce two reconstruction tasks to capture the task-specific features to avoid losing information related to representation learning while maintaining task-specific consistency. Finally, benefiting from the collaborative optimization of these two tasks, the model can accurately predict both the domain and category labels of the source domains for the classification task. We conduct extensive experiments on three real-world sensing datasets. The experimental results show that DGSSL surpasses three state-of-the-art methods with better performance and generalization.

Keywords: activity recognition; deep learning; domain generalization; semi-supervised learning; adversarial training
