检索结果-内蒙古大学图书馆

Chemical-motif characterization of short-range order with E(3)-equivariant graph neural networks

npj Computational Materials 2024年第1期10卷 983-992页

作者： Killian Sheriff Yifan Cao Rodrigo Freitas Department of Materials Science and Engineering Massachusetts Institute of TechnologyCambridgeMAUSA

Crystalline materials have atomic-scale fluctuations in their chemical composition that modulate various mesoscale *** chemistry–microstructure relationships in such materials requires proper characterization of these chemical ***,current characterization approaches(e.g.,Warren–Cowley parameters)make only partial use of the complete chemical and structural information contained in local chemical *** we introduce a framework based on E(3)-equivariant graph neural networks that is capable of completely identifying chemical motifs in arbitrary crystalline structures with any number of chemical *** approach naturally leads to a proper information-theoretic measure for quantifying chemical short-range order(SRO)in chemically complex materials and a reduced representation of the chemical motif *** framework enables the correlation of any per-atom property with their corresponding local chemical motif,thereby enabling the exploration of structure–property relationships in chemically complex *** the MoTaNbTi high-entropy alloy as a test system,we demonstrate the versatility of this approach by evaluating the lattice strain associated with each chemical motif,and computing the temperature dependence of chemical-fluctuations length scale.

关键词： properties characterization property

来源：评论

学校读者我要写书评

暂无评论

Transformer-Based Person Re-Identification: A Comprehensive Review

IEEE Transactions on Intelligent Vehicles

引用

IEEE Transactions on Intelligent Vehicles 2024年第7期9卷 1-19页

作者： Sarker, Prodip Kumar Zhao, Qingjie Uddin, Md. Kamal School of Computer Science and Technology Beijing Institute of Technology China Department of Computer Science and Telecommunication Engineering Noakhali Science and Technology University Bangladesh

In the evolving landscape of surveillance and security applications, the task of person re-identification(re-ID) has significant importance, but also presents notable difficulties. This task entails the process of accurately matching and identifying persons across several camera views that do not overlap with one another. This is of utmost importance to video surveillance, public safety, and person-tracking applications. However, vision-related difficulties, such as variations in appearance, occlusions, viewpoint changes, cloth changes, scalability, limited robustness to environmental factors, and lack of generalizations, still hinder the development of reliable person re-ID methods. There are few approaches have been developed based on these difficulties relied on traditional deep-learning techniques. Nevertheless, recent advancements of transformer-based methods, have gained widespread adoption in various domains owing to their unique architectural properties. Recently, few transformer-based person re-ID methods have developed based on these difficulties and achieved good results. To develop reliable solutions for person re-ID, a comprehensive analysis of transformer-based methods is necessary. However, there are few studies that consider transformer-based techniques for further investigation. This review proposes recent literature on transformer-based approaches, examining their effectiveness, advantages, and potential challenges. This review is the first of its kind to provide insights into the revolutionary transformer-based methodologies used to tackle many obstacles in person re-ID, providing a forward-thinking outlook on current research and potentially guiding the creation of viable applications in real-world scenarios. The main objective is to provide a useful resource for academics and practitioners engaged in person re-ID. IEEE

关键词： Cameras

来源：评论

学校读者我要写书评

暂无评论

Multi-Task ConvMixer Networks with Triplet Attention for Low-Resource Keyword Spotting

引用

Tsinghua science and technology 2025年第2期30卷 875-893页

作者： Alexander Rogath Kivaisi Qingjie Zhao Yuanbing Zou School of Computer Science and Technology Beijing Institute of TechnologyBeijing 100081China

Customized keyword spotting needs to adapt quickly to small user *** methods primarily solve the problem under moderate noise *** work increases the level of difficulty in detecting keywords by introducing keyword ***,the current solution has been explored on large models with many parameters,making it unsuitable for deployment on small *** applying the current solution to lightweight models with minimal training data,the performance degrades compared to the baseline ***,we propose a light-weight multi-task architecture(<9.0×10^(4)parameters)created from integrating the triplet attention module in the ConvMixer networks and a new auxiliary mixed labeling encoding to address the *** results of our experiment show that the proposed model outperforms similar light-weight models for keyword spotting,with accuracy gains ranging from 0.73%to 2.95%for a clean set and from 2.01%to 3.37%for a mixed set under different scales of training ***,our model shows its robustness in different low-resource language datasets while converging faster.

关键词： KeyWord Spotting(KWS) multi-task learning cross-dimension attention low-resource mixed speech

来源：评论

学校读者我要写书评

暂无评论

Mode Management of Peripherals Based on State Transition Model in FRP Language for Embedded Systems

Computer Software

引用

computer Software 2025年第1期42卷 40-53页

作者： Takimoto, Satoshi Moriguchi, Sosuke Watanabe, Takuo Department of Computer Science Tokyo Institute of Technology Institute of Science Tokyo Japan

XStorm, an FRP language for small-scale embedded systems, allows us to concisely describe state-dependent behaviors based on the state transition model. However, when we use different sets of peripheral devices depending on states, device management, such as switching power modes, should be implemented in a driver code in C. This would result in bugs as inconsistency between the state in the XStorm program and that in the driver code cannot be detected. In this research, we extend XStorm’s state hook model to express modes of peripherals that depend on states. By the extension, the language manages modes of peripherals, and thus the inconsistency is statically avoided. © 2025 Japan Society for Software science and technology. All rights reserved.

关键词： C (programming language)

来源：评论

学校读者我要写书评

暂无评论

MH-Net: Multiheaded 3D Hand Pose Estimation Network With 3D Anchorsets and Improved Multiscale Vision Transformer

IEEE Transactions on Intelligent Vehicles

引用

IEEE Transactions on Intelligent Vehicles 2024年 1-12页

作者： Tewolde, Tekie Tsegay Manjotho, Ali Asghar Niu, Zhendong School of Computer Science and Technology Beijing Institute of Technology Beijing China

Accurate 3D hand pose estimation is a challenging computer vision problem primarily because of self-occlusion and viewpoint variations. Existing methods address viewpoint variations by applying data-centric transformations, such as data alignments or generating multiple views, which are prone to data sensitivity, error propagation, and prohibitive computational requirements. We improve the estimation accuracy by mitigating the impact of self-occlusion and viewpoint variations from the network side and propose MH-Net, a novel multiheaded network for accurate 3D hand pose estimation from a depth image. MH-Net comprises three key components. First, a multiscale feature extraction backbone based on an improved multiscale vision transformer (MViTv2) is proposed to extract shift-invariant global features. Second, a 3D anchorset generator is proposed to generate three disjoint sets of 3D anchors that serve two purposes: formulating hand pose estimation as an anchor-to-joint offset estimation and defining three unique viewpoints from a single depth image. Third, three identical regression heads are proposed to regress 3D joint positions based on unique viewpoints defined by their respective anchorsets. Extensive ablation studies have been conducted to investigate the impact of anchorsets, regression heads, and feature extraction backbones. Experiments on three public datasets, ICVL, MSRA, and NYU, show significant improvements over the state-of-the-art. IEEE

关键词： Feature extraction

来源：评论

学校读者我要写书评

暂无评论

An efficient schedulability analysis based on worst-case interference time for real-time systems

引用

science China(Information sciences) 2024年第9期67卷 88-104页

作者： Hongbiao LIU Mengfei YANG Lei QIAO Xi CHEN Jian GONG School of Computer Science and Technology Xidian University China Academy of Space Technology Beijing Institute of Control Engineering

Real-time systems are widely implemented in the Internet of Things(IoT) and safety-critical systems, both of which have generated enormous social value. Aiming at the classic schedulability analysis problem in real-time systems, we proposed an exact Boolean analysis based on interference(EBAI) for schedulability analysis in real-time systems. EBAI is based on worst-case interference time(WCIT), which considers both the release jitter and blocking time of the task. We improved the efficiency of the three existing tests and provided a comprehensive summary of related research results in the field. Abundant experiments were conducted to compare EBAI with other related results. Our evaluation showed that in certain cases, the runtime gain achieved using our analysis method may exceed 73% compared to the stateof-the-art schedulability test. Furthermore, the benefits obtained from our tests grew with the number of tasks, reaching a level suitable for practical application. EBAI is oriented to the five-tuple real-time task model with stronger expression ability and possesses a low runtime overhead. These characteristics make it applicable in various real-time systems such as spacecraft, autonomous vehicles, industrial robots, and traffic command systems.

关键词： five-tuple real-time task model real-time system spacecraft Internet of Things exact schedulability analysis worst-case interference time

来源：评论

学校读者我要写书评

暂无评论

Bidirectional Transformer with absolute-position aware relative position encoding for encoding sentences

引用

Frontiers of computer science 2023年第1期17卷 63-71页

作者： Le QI Yu ZHANG Ting LIU School of Computer Science and Technology Harbin Institute of TechnologyHarbin 150001China

Transformers have been widely studied in many natural language processing (NLP) tasks, which can capture the dependency from the whole sentence with a high parallelizability thanks to the multi-head attention and the position-wise feed-forward network. However, the above two components of transformers are position-independent, which causes transformers to be weak in modeling sentence structures. Existing studies commonly utilized positional encoding or mask strategies for capturing the structural information of sentences. In this paper, we aim at strengthening the ability of transformers on modeling the linear structure of sentences from three aspects, containing the absolute position of tokens, the relative distance, and the direction between tokens. We propose a novel bidirectional Transformer with absolute-position aware relative position encoding (BiAR-Transformer) that combines the positional encoding and the mask strategy together. We model the relative distance between tokens along with the absolute position of tokens by a novel absolute-position aware relative position encoding. Meanwhile, we apply a bidirectional mask strategy for modeling the direction between tokens. Experimental results on the natural language inference, paraphrase identification, sentiment classification and machine translation tasks show that BiAR-Transformer achieves superior performance than other strong baselines.

关键词： Transformer relative position encoding bidirectional mask strategy sentence encoder

来源：评论

学校读者我要写书评

暂无评论

Combating with extremely noisy samples in weakly supervised slot filling for automatic diagnosis

引用

Frontiers of computer science 2023年第5期17卷 67-73页

作者： Xiaoming SHI Wanxiang CHE School of Computer Science and Technology Harbin Institute of TechnologyHarbin 150001China

Slot filling,to extract entities for specific types of information(slot),is a vitally important modular of dialogue systems for automatic *** responses can be regarded as the weak supervision of patient *** this way,a large amount of weakly labeled data can be obtained from unlabeled diagnosis dialogue,alleviating the problem of costly and time-consuming data ***,weakly labeled data suffers from extremely noisy *** alleviate the problem,we propose a simple and effective Co-WeakTeaching *** method trains two slot filling models *** two models learn from two different weakly labeled data,ensuring learning from two ***,one model utilizes selected weakly labeled data generated by the other,*** model,obtained by the Co-WeakTeaching on weakly labeled data,can be directly tested on testing data or sequentially fine-tuned on a small amount of human-annotated *** results on these two settings illustrate the effectiveness of the method with an increase of 8.03%and 14.74%in micro and macro f1 scores,respectively.

关键词： dialogue system slot filling co-teaching

来源：评论

学校读者我要写书评

暂无评论

Limb movement detection and analysis based on visual recognition of human posture

引用

Discover Artificial Intelligence 2025年第1期5卷 1-12页

作者： Xiao, Zhiguo Wang, Chunxiang Ding, Tianjiao Shen, Xiangfeng Li, Xinyuan Li, Dongni School of Computer Science & Technology Beijing Institute of Technology Beijing100811 China School of Computer Science Technology Changchun University Changchun130022 China

Current motion detection and evaluation technologies face challenges such as limited scalability, imprecise feedback, and lack of personalized guidance. To address these challenges, this research integrated efficient BlazePose technology with pioneering DW_KNN* algorithm, resulting in the remarkable accuracy of 98.2% in action recognition and showcasing outstanding scalability. Furthermore, the established ACLstm time series prediction model could comprehensively analyze historical sports data and associated factors of users. In Rehab dataset, MAE(Mean Absolute Error, MAE) loss was 1.383 for motion count and 0.508 for motion time. This innovative framework delivered precise feedback and tailored guidance for physical exercise and medical rehabilitation. © The Author(s) 2025.

关键词： Time series

来源：评论

学校读者我要写书评

暂无评论

NeurDB: an AI-powered autonomous data system

引用

science China(Information sciences) 2024年第10期67卷 129-150页

作者： Beng Chin OOI Shaofeng CAI Gang CHEN Yanyan SHEN Kian-Lee TAN Yuncheng WU Xiaokui XIAO Naili XING Cong YUE Lingze ZENG Meihui ZHANG Zhanhao ZHAO School of Computing National University of Singapore College of Computer Science and Technology Zhejiang University Department of Computer Science and Engineering Shanghai Jiao Tong University School of Information Renmin University of China School of Computer Science and Technology Beijing Institute of Technology

In the wake of rapid advancements in artificial intelligence(AI), we stand on the brink of a transformative leap in data systems. The imminent fusion of AI and DB(AI×DB) promises a new generation of data systems, which will relieve the burden on end-users across all industry sectors by featuring AI-enhanced functionalities, such as personalized and automated in-database AI-powered analytics, and selfdriving capabilities for improved system performance. In this paper, we explore the evolution of data systems with a focus on deepening the fusion of AI and DB. We present NeurDB, an AI-powered autonomous data system designed to fully embrace AI design in each major system component and provide in-database AI-powered analytics. We outline the conceptual and architectural overview of NeurDB, discuss its design choices and key components, and report its current development and future plan.

关键词： AI$\times$DB in-database AI intelligent data system

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：