检索结果-内蒙古大学图书馆

Task-specific Part Discovery for Fine-grained Few-shot Classification

Machine Intelligence Research 2024年第5期21卷 954-965页

作者： Yongxian Wei Xiu-Shen Wei School of Computer Science and Technology Nanjing University of Science and TechnologyNanjing210094China

Localizing discriminative object parts(e.g.,bird head)is crucial for fine-grained classification tasks,especially for the more challenging fine-grained few-shot *** work always relies on the learned object parts in a unified manner,where they attend the same object parts(even with common attention weights)for different few-shot episodic *** this paper,we propose that it should adaptively capture the task-specific object parts that require attention for each few-shot task,since the parts that can distinguish different tasks are naturally *** for a few-shot task,after obtaining part-level deep features,we learn a task-specific part-based dictionary for both aligning and reweighting part features in an ***,part-level categorical prototypes are generated based on the part features of support data,which are later employed by calculating distances to classify query data for *** retain the discriminative ability of the part-level representations(i.e.,part features and part prototypes),we design an optimal transport solution that also utilizes query data in a transductive way to optimize the aforementioned distance calculation for the final *** experiments on five fine-grained benchmarks show the superiority of our method,especially for the 1-shot setting,gaining 0.12%,8.56%and 5.87%improvements over state-of-the-art methods on CUB,Stanford Dogs,and Stanford Cars,respectively.

关键词： Fine-grained image recognition few-shot learning transductive learning visual dictionary part feature discovery

来源：评论

学校读者我要写书评

暂无评论

Optimizing B^(+)-tree for hybrid memory with in-node hotspot cache and eADR awareness

引用

Frontiers of computer science 2024年第5期18卷 133-145页

作者： Peiquan JIN Zhaole CHU Gaocong LIU Yongping LUO Shouhong WAN School of Computer Science and Technology University of Science and Technology of ChinaHefei 230027China

he advance in Non-Volatile Memory(NVM)has changed the traditional *** to DRAM,NVM has the advantages of nonvolatility and large ***,as the read/write speed of NVM is still lower than that of DRAM,building DRAM/NVM-based hybrid memory systems is a feasible way of adding NVM into the current computer *** paper aims to optimize the well-known B^(+)-tree for hybrid *** novelty of this study is ***,we observed that the space utilization of internal nodes in B^(+)-tree is generally below 70%.Inspired by this observation,we propose to maintain hot keys in the free space within internal nodes,yielding a new index named HATree(Hotness-Aware Tree).The new idea of HATree is to use the unused space of the parent of leaf nodes(PLNs)as the hotspot data ***,no extra space is needed,and the in-node hotspot cache can efficiently improve query ***,to further improve the update performance of HATree,we propose to utilize the eADR technology supported by the third-generation Intel Xeon Scalable Processors to enhance HATree with instant log persistence,which results in the new HATree-Log *** conduct extensive experiments on real hybrid memory architecture involving DRAM and Intel Optane Persistent Memory to evaluate the performance of HATree and *** state-of-the-art indices for hybrid memory,namely NBTree,LBTree,and FPTree,are included in the experiments,and the results suggest the efficiency of HATree and HATree-Log.

关键词： hybrid memory B^(+)-tree hotspot in-node cache eADR

来源：评论

学校读者我要写书评

暂无评论

Measuring discrete sensing capability for ISAC via task mutual information

引用

science China(Information sciences) 2025年第5期68卷 287-288页

作者： Fei SHANG Haohua DU Panlong YANG Xin HE Jingjing WANG Xiang-Yang LI School of Computer Science and Technology University of Science and Technology of China School of Cyber Science and Technology Beihang University School of Computer Science Nanjing University of Information Science and Technology School of Computer and Information Anhui Normal University

Thanks to its ubiquity,using radio frequency (RF) signals for sensing has found widespread *** traditional integrated sensing and communication systems,such as joint radar-communication systems,common sensing tasks include target localization and ***,increasingly intelligent systems,such as smart agriculture,lowaltitude economy,and smart healthcare,have demanded more comprehensive and continuous information sensing capabilities to support higher-level *** sensing has the potential to offer both spatial and temporal continuity,meeting the multi-dimensional sensing needs of these intelligent ***,numerous advanced systems have been proposed,expanding the application scope of RF sensing to be more pervasive,including discrete state ubiquitous sensing tasks (such as material identification [1]),and continuous state ubiquitous sensing tasks (such as health monitoring [2]).With the advent of the 6G era,it is anticipated that the sensing potential of RF systems will be further unleashed.

关键词：

来源：评论

学校读者我要写书评

暂无评论

A Video Captioning Method by Semantic Topic-Guided Generation

引用

computers, Materials & Continua 2024年第1期78卷 1071-1093页

作者： Ou Ye Xinli Wei Zhenhua Yu Yan Fu Ying Yang College of Computer Science and Technology Xi’an University of Science and TechnologyXi’an710054China

In the video captioning methods based on an encoder-decoder,limited visual features are extracted by an encoder,and a natural sentence of the video content is generated using a ***,this kind ofmethod is dependent on a single video input source and few visual labels,and there is a problem with semantic alignment between video contents and generated natural sentences,which are not suitable for accurately comprehending and describing the video *** address this issue,this paper proposes a video captioning method by semantic topic-guided ***,a 3D convolutional neural network is utilized to extract the spatiotemporal features of videos during the ***,the semantic topics of video data are extracted using the visual labels retrieved from similar video *** the decoding,a decoder is constructed by combining a novel Enhance-TopK sampling algorithm with a Generative Pre-trained Transformer-2 deep neural network,which decreases the influence of“deviation”in the semantic mapping process between videos and texts by jointly decoding a baseline and semantic topics of video *** this process,the designed Enhance-TopK sampling algorithm can alleviate a long-tail problem by dynamically adjusting the probability distribution of the predicted ***,the experiments are conducted on two publicly used Microsoft Research Video Description andMicrosoft Research-Video to Text *** experimental results demonstrate that the proposed method outperforms several state-of-art ***,the performance indicators Bilingual Evaluation Understudy,Metric for Evaluation of Translation with Explicit Ordering,Recall Oriented Understudy for Gisting Evaluation-longest common subsequence,and Consensus-based Image Description Evaluation of the proposed method are improved by 1.2%,0.1%,0.3%,and 2.4% on the Microsoft Research Video Description dataset,and 0.1%,1.0%,0.1%,and 2.8% on the Microsoft Research-Video to Text dataset

关键词： Video captioning encoder-decoder semantic topic jointly decoding Enhance-TopK sampling

来源：评论

学校读者我要写书评

暂无评论

A Weakly-Supervised Crowd Density Estimation Method Based on Two-Stage Linear Feature Calibration

引用

IEEE/CAA Journal of Automatica Sinica 2024年第4期11卷 965-981页

作者： Yong-Chao Li Rui-Sheng Jia Ying-Xiang Hu Hong-Mei Sun the College of Computer Science and Engineering Shandong University of Science and TechnologyQingdao 266590 the Faculty of Information Science and Engineering Ocean University of ChinaQingdao 266000China the College of Computer Science and Engineering Shandong University of Science and TechnologyQingdao 266590China the College of Computer Science and Engineering Nanjing University of Science and TechnologyNanjing 210000China

In a crowd density estimation dataset,the annotation of crowd locations is an extremely laborious task,and they are not taken into the evaluation *** this paper,we aim to reduce the annotation cost of crowd datasets,and propose a crowd density estimation method based on weakly-supervised learning,in the absence of crowd position supervision information,which directly reduces the number of crowds by using the number of pedestrians in the image as the supervised *** this purpose,we design a new training method,which exploits the correlation between global and local image features by incremental learning to train the ***,we design a parent-child network(PC-Net)focusing on the global and local image respectively,and propose a linear feature calibration structure to train the PC-Net simultaneously,and the child network learns feature transfer factors and feature bias weights,and uses the transfer factors and bias weights to linearly feature calibrate the features extracted from the Parent network,to improve the convergence of the network by using local features hidden in the crowd *** addition,we use the pyramid vision transformer as the backbone of the PC-Net to extract crowd features at different levels,and design a global-local feature loss function(L2).We combine it with a crowd counting loss(LC)to enhance the sensitivity of the network to crowd features during the training process,which effectively improves the accuracy of crowd density *** experimental results show that the PC-Net significantly reduces the gap between fullysupervised and weakly-supervised crowd density estimation,and outperforms the comparison methods on five datasets of Shanghai Tech Part A,ShanghaiTech Part B,UCF_CC_50,UCF_QNRF and JHU-CROWD++.

关键词： Crowd density estimation linear feature calibration vision transformer weakly-supervision learning

来源：评论

学校读者我要写书评

暂无评论

Priority Encoder Based on DNA Strand Displacement

引用

Chinese Journal of Electronics 2024年第6期33卷 1538-1544页

作者： Fang WANG Xinjian ZHANG Xin CHEN Shuying LYU Congzhou CHEN Xiaolong SHI Institution of Computing Science and Technology Guangzhou University Beijing University of Technology School of Computer Science Peking University

The slow development of traditional computing has prompted the search for new materials to replace silicon-based computers. Bio-computers, which use molecules as the basis of computation, are highly parallel and information capable, attracting a lot of attention. In this study, we designed a NAND logic gate based on the DNA strand displacement mechanism. We assembled a molecular calculation model, a 4-wire-2-wire priority encoder logic circuit, by cascading the proposed NAND gates. Different concentrations of input DNA chains were added into the system, resulting in corresponding output, through DNA hybridization and strand displacement. Therefore, it achieved the function of a priority encoder. Simulation results verify the effectiveness and accuracy of the molecular NAND logic gate and the priority coding system presented in this study. The unique point of this proposed circuit is that we cascaded only one kind of logic gate, which provides a beneficial exploration for the subsequent development of complex DNA cascade circuits and the realization of the logical coding function of information.

关键词： Visualization Accuracy Logic circuits Simulation DNA Logic gates Encoding Software Logic Integrated circuit modeling

来源：评论

学校读者我要写书评

暂无评论

Fault-tolerant Quantized Control for Switched Neural Networks with Actuator Faults and Dynamic Output Quantization

引用

IAENG International Journal of Applied Mathematics 2025年第1期55卷 7-15页

作者： Su, Yue Wang, Xinrui Tai, Weipeng Zhou, Jianping School of Computer Science and Technology Anhui University of Technology Ma'anshan243032 China School of Computer Science and Technology Chengdu University of Technology Chengdu610051 China School of Computer Science and Technology Anhui University of Technology Ma'anshan243032 China School of Computer Science and Technology Anhui University of Technology Ma'anshan243032 China

This paper examines fault-tolerant quantized control for neural networks under persistent dwell-time switching, considering the presence of actuator faults and dynamic output quantization. The dynamic scaling factor (DSF) of the quantizer is designed as a piecewise function concerning the output to avoid the possibility of division by zero. To reduce conservatism, the controller is designed to combine the system model with a time scheduler constructed with a minimum time span. A sufficient condition for the asymptotic stability and L2-gain of the closed-loop system is derived using a piecewise Lyapunov functional and decoupling approach. When the condition is satisfied, the needed feedback gains and the parameter range associated with the DSF can be determined by exact mathematical expressions. For comparison, feedback gains that depend only on the system mode are also studied, and the corresponding design method is presented. The numerical simulation results demonstrate the effectiveness of the proposed control scheme. © (2025), (International Association of Engineers). All rights reserved.

关键词： Lyapunov functions

来源：评论

学校读者我要写书评

暂无评论

Pushing the Limits of WiFi-Based Gait Recognition Towards Non-Gait Human Behaviors

引用

IEEE Transactions on Mobile Computing 2025年第7期24卷 6137-6153页

作者： Yan, Dawei Yang, Panlong Shang, Fei Han, Feiyu Yan, Yubo Li, Xiang-Yang University of Science and Technology of China School of Computer Science and Technology Hefei230021 China Nanjing University of Information Science & Technology School of Computer Science Nanjing210044 China

WiFi-based gait recognition technologies have seen significant advancements in recent years. However, most existing approaches rely on a critical assumption: users must walk continuously and maintain a consistent body posture. This poses a substantial challenge when users engage in non-periodic or discontinuous behaviors (e.g., stopping, starting, or turning mid-walk), which can disrupt the extraction of gait-related features and degrade recognition performance. To address this issue, we propose freeGait, a novel approach designed to mitigate the impact of non-gait behaviors in WiFi-based gait recognition systems. Our solution models this problem as domain adaptation, where we learn domain-independent representations to isolate gait features from behavior-dependent noise. We treat human behaviors with labeled user data as source domains and behaviors without user labels as target domains. However, applying domain adaptation directly is challenging due to the ambiguous classification boundaries in the target domains for WiFi signals. To overcome this, we align the posterior distributions between the source and target domains and constrain the conditional distribution within the target domains to enhance gait classification accuracy. Additionally, we implement a data augmentation module to generate data resembling the labeled data, while supervised learning ensures distinctiveness between users. Our experiments, conducted with 20 participants across 3 different scenarios, demonstrate that freeGait can accurately predict data across 15 domains by labeling only a small subset from 6 source domains, achieving up to a 45% improvement in user classification accuracy compared to existing methods. © 2002-2012 IEEE.

关键词： Wi-Fi

来源：评论

学校读者我要写书评

暂无评论

Hyperspectral image restoration using noise gradient and dual priors under mixed noise conditions

引用

CAAI Transactions on Intelligence technology 2025年第1期10卷 72-93页

作者： Hazique Aetesam Suman Kumar Maji V.B.Surya Prasath Computer Science and Engineering Birla Institute of Technology MesraBiharIndia Computer Science and Engineering Indian Institute of Technology PatnaBiharIndia Department of Computer Science University of CincinnatiCincinnatiOhioUSA

Images obtained from hyperspectral sensors provide information about the target area that extends beyond the visible portions of the electromagnetic ***,due to sensor limitations and imperfections during the image acquisition and transmission phases,noise is introduced into the acquired image,which can have a negative impact on downstream analyses such as classification,target tracking,and spectral *** in hyperspectral images(HSI)is modelled as a combination from several sources,including Gaussian/impulse noise,stripes,and *** HSI restoration method for such a mixed noise model is ***,a joint optimisation framework is proposed for recovering hyperspectral data corrupted by mixed Gaussian-impulse noise by estimating both the clean data as well as the sparse/impulse noise ***,a hyper-Laplacian prior is used along both the spatial and spectral dimensions to express sparsity in clean image ***,to model the sparse nature of impulse noise,anℓ_(1)−norm over the impulse noise gradient is *** the proposed methodology employs two distinct priors,the authors refer to it as the hyperspectral dual prior(HySpDualP)*** the best of authors'knowledge,this joint optimisation framework is the first attempt in this *** handle the non-smooth and nonconvex nature of the generalℓ_(p)−norm-based regularisation term,a generalised shrinkage/thresholding(GST)solver is ***,an efficient split-Bregman approach is used to solve the resulting optimisation *** results on synthetic data and real HSI datacube obtained from hyperspectral sensors demonstrate that the authors’proposed model outperforms state-of-the-art methods,both visually and in terms of various image quality assessment metrics.

关键词： hyper-laplacian prior hyperspectral images image restoration mixed noise variational approach

来源：评论

学校读者我要写书评

暂无评论

Transformer-Based Person Re-Identification: A Comprehensive Review

IEEE Transactions on Intelligent Vehicles

引用

IEEE Transactions on Intelligent Vehicles 2024年第7期9卷 1-19页

作者： Sarker, Prodip Kumar Zhao, Qingjie Uddin, Md. Kamal School of Computer Science and Technology Beijing Institute of Technology China Department of Computer Science and Telecommunication Engineering Noakhali Science and Technology University Bangladesh

In the evolving landscape of surveillance and security applications, the task of person re-identification(re-ID) has significant importance, but also presents notable difficulties. This task entails the process of accurately matching and identifying persons across several camera views that do not overlap with one another. This is of utmost importance to video surveillance, public safety, and person-tracking applications. However, vision-related difficulties, such as variations in appearance, occlusions, viewpoint changes, cloth changes, scalability, limited robustness to environmental factors, and lack of generalizations, still hinder the development of reliable person re-ID methods. There are few approaches have been developed based on these difficulties relied on traditional deep-learning techniques. Nevertheless, recent advancements of transformer-based methods, have gained widespread adoption in various domains owing to their unique architectural properties. Recently, few transformer-based person re-ID methods have developed based on these difficulties and achieved good results. To develop reliable solutions for person re-ID, a comprehensive analysis of transformer-based methods is necessary. However, there are few studies that consider transformer-based techniques for further investigation. This review proposes recent literature on transformer-based approaches, examining their effectiveness, advantages, and potential challenges. This review is the first of its kind to provide insights into the revolutionary transformer-based methodologies used to tackle many obstacles in person re-ID, providing a forward-thinking outlook on current research and potentially guiding the creation of viable applications in real-world scenarios. The main objective is to provide a useful resource for academics and practitioners engaged in person re-ID. IEEE

关键词： Cameras

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：