检索结果-内蒙古大学图书馆

Hyperspectral image restoration using noise gradient and dual priors under mixed noise conditions

CAAI Transactions on Intelligence technology 2025年第1期10卷 72-93页

作者： Hazique Aetesam Suman Kumar Maji V.B.Surya Prasath Computer Science and Engineering Birla Institute of Technology MesraBiharIndia Computer Science and Engineering Indian Institute of Technology PatnaBiharIndia Department of Computer Science University of CincinnatiCincinnatiOhioUSA

Images obtained from hyperspectral sensors provide information about the target area that extends beyond the visible portions of the electromagnetic ***,due to sensor limitations and imperfections during the image acquisition and transmission phases,noise is introduced into the acquired image,which can have a negative impact on downstream analyses such as classification,target tracking,and spectral *** in hyperspectral images(HSI)is modelled as a combination from several sources,including Gaussian/impulse noise,stripes,and *** HSI restoration method for such a mixed noise model is ***,a joint optimisation framework is proposed for recovering hyperspectral data corrupted by mixed Gaussian-impulse noise by estimating both the clean data as well as the sparse/impulse noise ***,a hyper-Laplacian prior is used along both the spatial and spectral dimensions to express sparsity in clean image ***,to model the sparse nature of impulse noise,anℓ_(1)−norm over the impulse noise gradient is *** the proposed methodology employs two distinct priors,the authors refer to it as the hyperspectral dual prior(HySpDualP)*** the best of authors'knowledge,this joint optimisation framework is the first attempt in this *** handle the non-smooth and nonconvex nature of the generalℓ_(p)−norm-based regularisation term,a generalised shrinkage/thresholding(GST)solver is ***,an efficient split-Bregman approach is used to solve the resulting optimisation *** results on synthetic data and real HSI datacube obtained from hyperspectral sensors demonstrate that the authors’proposed model outperforms state-of-the-art methods,both visually and in terms of various image quality assessment metrics.

关键词： hyper-laplacian prior hyperspectral images image restoration mixed noise variational approach

来源：评论

学校读者我要写书评

暂无评论

Robust video question answering via contrastive cross-modality representation learning

引用

science China(Information sciences) 2024年第10期67卷 211-226页

作者： Xun YANG Jianming ZENG Dan GUO Shanshan WANG Jianfeng DONG Meng WANG School of Information Science and Technology University of Science and Technology of China Institute of Artificial Intelligence Hefei Comprehensive National Science Center School of Computer Science and Information Engineering Hefei University of Technology Institutes of Physical Science and Information Technology Anhui University School of Computer Science and Technology Zhejiang Gongshang University

Video question answering(VideoQA) is a challenging yet important task that requires a joint understanding of low-level video content and high-level textual semantics. Despite the promising progress of existing efforts, recent studies revealed that current VideoQA models mostly tend to over-rely on the superficial correlations rooted in the dataset bias while overlooking the key video content, thus leading to unreliable results. Effectively understanding and modeling the temporal and semantic characteristics of a given video for robust VideoQA is crucial but, to our knowledge, has not been well investigated. To fill the research gap, we propose a robust VideoQA framework that can effectively model the cross-modality fusion and enforce the model to focus on the temporal and global content of videos when making a QA decision instead of exploiting the shortcuts in datasets. Specifically, we design a self-supervised contrastive learning objective to contrast the positive and negative pairs of multimodal input, where the fused representation of the original multimodal input is enforced to be closer to that of the intervened input based on video perturbation. We expect the fused representation to focus more on the global context of videos rather than some static keyframes. Moreover, we introduce an effective temporal order regularization to enforce the inherent sequential structure of videos for video representation. We also design a Kullback-Leibler divergence-based perturbation invariance regularization of the predicted answer distribution to improve the robustness of the model against temporal content perturbation of videos. Our method is model-agnostic and can be easily compatible with various VideoQA backbones. Extensive experimental results and analyses on several public datasets show the advantage of our method over the state-of-the-art methods in terms of both accuracy and robustness.

关键词： video question answering cross-modality fusion contrastive learning cross-media reasoning

来源：评论

学校读者我要写书评

暂无评论

openGauss:An Open-Source Database for the Era of Artificial Intelligence

引用

Journal of computer science & technology 2024年第5期39卷 1005-1006页

作者： Jian-Zhong Li Shenzhen Institute of Advanced Technology Shenzhen 518055China School of Computer Science and Technology Harbin Institute of TechnologyHarbin 150001China

Databases play a vital role in data management in many fields,such as finance,government,telecommunications,energy,electricity,transportation,*** the database management system has become a core foundational *** is an enterprise-grade open-source database,a product of deep integration of research and development from Huawei,Tsinghua University,and China Mobile in the past decade.

关键词： database Open finance

来源：评论

学校读者我要写书评

暂无评论

Efficient Maximum Vertex(k,ℓ)-Biplex Computation on Bipartite Graphs

引用

Tsinghua science and technology 2025年第2期30卷 569-584页

作者： Hongru Zhou Shengxin Liu Ruidi Cao School of Computer Science and Technology Harbin Institute of TechnologyShenzhen 518055China Science and Technology Office Harbin Institute of TechnologyShenzhen 518055China

Cohesive subgraph search is a fundamental problem in bipartite graph *** integers k andℓ,a(k,ℓ)-biplex is a cohesive structure which requires each vertex to disconnect at most k orℓvertices in the other ***(k,ℓ)-biplexes has been a popular research topic in recent years and has various ***,most existing studies considered the problem of finding(k,ℓ)-biplex with the largest number of *** this paper,we instead consider another variant and focus on the maximum vertex(k,ℓ)-biplex problem which aims to search for a(k,ℓ)-biplex with the maximum *** first show that this problem is Non-deterministic Polynomial-time hard(NP-hard)for any positive integers k andℓwhile max{k,ℓ}is at least *** by this negative result,we design an efficient branch-and-bound algorithm with a novel *** particular,we introduce a branching strategy based on whether there is a pivot in the current set,with which our proposed algorithm has the time complexity ofγ^(n)n^(O(1)),whereγ<*** addition,we also apply multiple speed-up techniques and various pruning ***,we conduct extensive experiments on various real datasets which demonstrate the efficiency of our proposed algorithm in terms of running time.

关键词： bipartite graphs cohesive subgraph search maximum vertex(k,ℓ)-biplex branch-and-bound algorithm

来源：评论

学校读者我要写书评

暂无评论

Transformer-Based Person Re-Identification: A Comprehensive Review

IEEE Transactions on Intelligent Vehicles

引用

IEEE Transactions on Intelligent Vehicles 2024年第7期9卷 1-19页

作者： Sarker, Prodip Kumar Zhao, Qingjie Uddin, Md. Kamal School of Computer Science and Technology Beijing Institute of Technology China Department of Computer Science and Telecommunication Engineering Noakhali Science and Technology University Bangladesh

In the evolving landscape of surveillance and security applications, the task of person re-identification(re-ID) has significant importance, but also presents notable difficulties. This task entails the process of accurately matching and identifying persons across several camera views that do not overlap with one another. This is of utmost importance to video surveillance, public safety, and person-tracking applications. However, vision-related difficulties, such as variations in appearance, occlusions, viewpoint changes, cloth changes, scalability, limited robustness to environmental factors, and lack of generalizations, still hinder the development of reliable person re-ID methods. There are few approaches have been developed based on these difficulties relied on traditional deep-learning techniques. Nevertheless, recent advancements of transformer-based methods, have gained widespread adoption in various domains owing to their unique architectural properties. Recently, few transformer-based person re-ID methods have developed based on these difficulties and achieved good results. To develop reliable solutions for person re-ID, a comprehensive analysis of transformer-based methods is necessary. However, there are few studies that consider transformer-based techniques for further investigation. This review proposes recent literature on transformer-based approaches, examining their effectiveness, advantages, and potential challenges. This review is the first of its kind to provide insights into the revolutionary transformer-based methodologies used to tackle many obstacles in person re-ID, providing a forward-thinking outlook on current research and potentially guiding the creation of viable applications in real-world scenarios. The main objective is to provide a useful resource for academics and practitioners engaged in person re-ID. IEEE

关键词： Cameras

来源：评论

学校读者我要写书评

暂无评论

P-IOC-FPMine: an association rule mining algorithm for item constraints based on the MapReduce framework

引用

International Journal of Wireless and Mobile Computing 2022年第1期23卷 57-66页

作者： Chu, Zenan Wang, Wei Department of Computer Science and Information Engineering Anyang Institute of Technology Henan Anyang China

In the application of big data, one of the most challenging problems is how to consider the requirements of users. To avoid this problem, we proposed IOC-FP-growth. This added user-defined pre-term or post-item constraints into the classic FP-growth algorithm. What’s more, a parallel data mining algorithm based on MapReduce, namely P-IOC-FPMine was proposed, which was a low-memory fast association rule mining algorithm. Finally, by evaluating the effectiveness of the method in the public data. The results showed that the IOC-FP-growth method can consider the user’s needs for association rules easily. Compared with the FP-growth, Recorder and PNARCMC, we only need to meet the requirements of users to extract accurate rules, which will bring huge advantages for data mining. After parallelisation, the performance of the P-IOC-FPMine was better than FP-growth on the data set. The results showed that the P-IOC-FPMine was more appropriate for handling large-scale data sets. Copyright © 2022 Inderscience Enterprises Ltd.

关键词： MapReduce

来源：评论

学校读者我要写书评

暂无评论

Multi-Task ConvMixer Networks with Triplet Attention for Low-Resource Keyword Spotting

引用

Tsinghua science and technology 2025年第2期30卷 875-893页

作者： Alexander Rogath Kivaisi Qingjie Zhao Yuanbing Zou School of Computer Science and Technology Beijing Institute of TechnologyBeijing 100081China

Customized keyword spotting needs to adapt quickly to small user *** methods primarily solve the problem under moderate noise *** work increases the level of difficulty in detecting keywords by introducing keyword ***,the current solution has been explored on large models with many parameters,making it unsuitable for deployment on small *** applying the current solution to lightweight models with minimal training data,the performance degrades compared to the baseline ***,we propose a light-weight multi-task architecture(<9.0×10^(4)parameters)created from integrating the triplet attention module in the ConvMixer networks and a new auxiliary mixed labeling encoding to address the *** results of our experiment show that the proposed model outperforms similar light-weight models for keyword spotting,with accuracy gains ranging from 0.73%to 2.95%for a clean set and from 2.01%to 3.37%for a mixed set under different scales of training ***,our model shows its robustness in different low-resource language datasets while converging faster.

关键词： KeyWord Spotting(KWS) multi-task learning cross-dimension attention low-resource mixed speech

来源：评论

学校读者我要写书评

暂无评论

Mode Management of Peripherals Based on State Transition Model in FRP Language for Embedded Systems

Computer Software

引用

computer Software 2025年第1期42卷 40-53页

作者： Takimoto, Satoshi Moriguchi, Sosuke Watanabe, Takuo Department of Computer Science Tokyo Institute of Technology Institute of Science Tokyo Japan

XStorm, an FRP language for small-scale embedded systems, allows us to concisely describe state-dependent behaviors based on the state transition model. However, when we use different sets of peripheral devices depending on states, device management, such as switching power modes, should be implemented in a driver code in C. This would result in bugs as inconsistency between the state in the XStorm program and that in the driver code cannot be detected. In this research, we extend XStorm’s state hook model to express modes of peripherals that depend on states. By the extension, the language manages modes of peripherals, and thus the inconsistency is statically avoided. © 2025 Japan Society for Software science and technology. All rights reserved.

关键词： C (programming language)

来源：评论

学校读者我要写书评

暂无评论

Enhanced Smart Contract Vulnerability Detection via Graph Neural Networks: Achieving High Accuracy and Efficiency

引用

IEEE Transactions on Software Engineering 2025年第6期51卷 1854-1865页

作者： Xu, Chang Xu, Huaiyu Zhu, Liehuang Shen, Xiaodong Sharif, Kashif Beijing Institute of Technology School of Cyberspace Science and Technology Beijing100811 China Beijing Institute of Technology School of Computer Science and Technology Beijing100811 China

As blockchain technology becomes prevalent, smart contracts have shown significant utility in finance and supply chain management. However, vulnerabilities in smart contracts pose serious threats to blockchain security, leading to substantial economic losses. Therefore, developing effective vulnerability detection solutions is urgent. To address this issue, we propose a method for detecting vulnerabilities in smart contracts using graph neural networks (GNNs) that can identify eight common vulnerabilities. Our method is fully automated, applicable to all Ethereum smart contracts, and does not require expert-defined rules or manually defined features. We extract the Control Flow Graph and Abstract Syntax Graph from the smart contract code, which are then processed by a GNN to generate feature vectors for classification. Experiments on a real Ethereum dataset demonstrate that our method significantly outperforms existing state-of-the-art approaches. For individual detection tasks, the combined source code and bytecode method achieves an average accuracy of 95.78%, with a peak of 99.13%, and an average F1 score of 93.80%. Compared to competitors, our method shows an average improvement of 51.92% in accuracy and 47.21% in F1 score. The bytecode-only method achieves an average accuracy of 94.68% and an F1 score of 92.36%. For multi-class tasks, both methods achieve high accuracies of 91.26% and 87.34%, with F1 scores of 97.42% and 96.43%, respectively. © 1976-2012 IEEE.

关键词： Smart contract

来源：评论

学校读者我要写书评

暂无评论

MH-Net: Multiheaded 3D Hand Pose Estimation Network With 3D Anchorsets and Improved Multiscale Vision Transformer

IEEE Transactions on Intelligent Vehicles

引用

IEEE Transactions on Intelligent Vehicles 2024年第10期9卷 1-12页

作者： Tewolde, Tekie Tsegay Manjotho, Ali Asghar Niu, Zhendong School of Computer Science and Technology Beijing Institute of Technology Beijing China

Accurate 3D hand pose estimation is a challenging computer vision problem primarily because of self-occlusion and viewpoint variations. Existing methods address viewpoint variations by applying data-centric transformations, such as data alignments or generating multiple views, which are prone to data sensitivity, error propagation, and prohibitive computational requirements. We improve the estimation accuracy by mitigating the impact of self-occlusion and viewpoint variations from the network side and propose MH-Net, a novel multiheaded network for accurate 3D hand pose estimation from a depth image. MH-Net comprises three key components. First, a multiscale feature extraction backbone based on an improved multiscale vision transformer (MViTv2) is proposed to extract shift-invariant global features. Second, a 3D anchorset generator is proposed to generate three disjoint sets of 3D anchors that serve two purposes: formulating hand pose estimation as an anchor-to-joint offset estimation and defining three unique viewpoints from a single depth image. Third, three identical regression heads are proposed to regress 3D joint positions based on unique viewpoints defined by their respective anchorsets. Extensive ablation studies have been conducted to investigate the impact of anchorsets, regression heads, and feature extraction backbones. Experiments on three public datasets, ICVL, MSRA, and NYU, show significant improvements over the state-of-the-art. IEEE

关键词： Feature extraction

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：