Search Results - Inner Mongolia University Library

SPEED:Semantic Prior and Extremely Efficient Dilated Convolution Network for Real-Time Metal Surface Defects Detection

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, vol. 19, no. 12, pp. 11380-11390

Authors: Guo, Bingyang; Wang, Yuting; Zhen, Shi; Yu, Ruiyun; Su, Zhan
Affiliation: Northeastern Univ, Sch Software Engn, Shenyang 110000, Peoples R China

Automatic defect detection on metal surfaces is a vital task for product inspection in industrial assembly lines and production processes. Owing to miscellaneous defect patterns, interclass similarity, intraclass difference, and scarce defect samples, achieving accurate and automatic detection remains a big challenge. Moreover, with the rising demand for production efficiency, real-time detection is increasingly desirable. This article proposes a semantic prior and extremely efficient dilated convolution network, named SPEED, for pixel-wise detection on metal surfaces, which aims to address the aforementioned issues. The architecture of SPEED involves the following: 1) a semantic prior (SP) branch, with a shallow layer and a prior mapping module to capture low-level details; and 2) an extremely efficient dilation (EED) branch, with a lightweight bottleneck to obtain high-level context. Furthermore, an aggregation module is designed to fuse both types of feature representation. Additionally, features from different levels of the bottleneck are fused to improve segmentation performance. Experimental results on three metal surface defect datasets indicate that the proposed method outperforms state-of-the-art approaches in terms of mean intersection over union (mIoU), model parameters, FLOPs, and FPS. More specifically, SPEED achieves 92.34% mIoU on NEU-Seg, 88.65% mIoU on Severstal Strip Steel, and 63.91% mIoU on MT Defect.

Keywords: Dilated convolution; real-time; semantic segmentation; surface defect detection
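
The SPEED record above reports its accuracy as mIoU. As a minimal sketch of how that metric is commonly computed for pixel-wise labels (this is not the authors' evaluation code; the function name `mean_iou`, the flattened toy label maps, and the choice to skip classes absent from both maps are assumptions made for illustration):

```python
from typing import Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int) -> float:
    """Mean intersection over union, averaged over classes that occur
    in either the prediction or the ground truth (an assumed convention)."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:  # skip classes absent from both maps
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy flattened label maps: 0 = background, 1 = defect (hypothetical data)
target = [0, 0, 1, 1, 1, 0]
pred = [0, 1, 1, 1, 0, 0]
print(mean_iou(pred, target, num_classes=2))  # prints 0.5
```

Each class here has an intersection of 2 pixels and a union of 4, so both per-class IoUs are 0.5 and so is their mean.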

Dense Voxel Representation Network for Implicit Scene Completion

IEEE International Conference on Multimedia and Expo (ICME)

Authors: Fan Dai; Yun Zhu; Yaqi Shen; Jin Xie; Jianjun Qian
Affiliations: PCA Lab, Nanjing University of Science and Technology, Nanjing, China; State Key Laboratory for Novel Software Technology & School of Intelligence Science and Technology, Nanjing University, China

ISBN: (digital) 9798350390155

ISBN: (print) 9798350390162

Implicit scene completion aims to learn an implicit representation of dense point clouds from incomplete ones. Since point clouds are disordered and irregular, some implicit scene completion methods learn representations from voxelized point clouds with sparse convolution. Despite achieving promising results, they lack deep exploration of feature learning on empty voxels, which is beneficial for the implicit scene completion task. To address this, we propose a dense voxel representation network for implicit scene completion. First, we design a Bird’s-Eye View (BEV) assisted enhancement module that enhances non-empty voxel features by incorporating the information contained in the learned dense BEV features into them through deformable cross-attention. Second, we construct a feature adaptive completion module that adaptively completes voxel features using deformable self-attention, transferring information from non-empty voxels to empty voxels. Extensive experiments on the SemanticKITTI and SemanticPOSS datasets demonstrate that our method achieves state-of-the-art performance.

Keywords: Point cloud compression; Representation learning; Fuses; Convolution

RoCA: Robust Contrastive One-class Time Series Anomaly Detection with Contaminated Data

arXiv, 2025

Authors: Mou, Xudong; Wang, Rui; Li, Bo; Wo, Tianyu; Sun, Jie; Wang, Hui; Liu, Xudong
Affiliations: School of Computer Science and Engineering, Beihang University, Beijing, China; Zhongguancun Laboratory, Beijing, China; School of Software, Beihang University, Beijing, China

The accumulation of time-series signals and the absence of labels make time-series Anomaly Detection (AD) a self-supervised task of deep learning. Methods based on normality assumptions face the following three limitations: (1) A single assumption can hardly characterize normality as a whole and may lead to some deviation. (2) Some assumptions may go against the principle of AD. (3) Their basic assumption is that the training data is uncontaminated (free of anomalies), which is unrealistic in practice and leads to a decline in robustness. This paper proposes a novel robust approach, RoCA, which is the first to address all three of the above challenges, as far as we are aware. It fuses the separate assumptions of one-class classification and contrastive learning in a single training process to characterize a more complete so-called normality. Additionally, it monitors the training data and computes a carefully designed anomaly score throughout the training process. This score helps identify latent anomalies, which are then used to define the classification boundary, inspired by the concept of outlier exposure. The performance on AIOps datasets improved by 6% compared to when contamination was not considered (COCA). On two large and high-dimensional multivariate datasets, the performance increased by 5% to 10%. RoCA achieves the highest average performance on both univariate and multivariate datasets. The source code is available at https://***/ruiking04/RoCA. Copyright © 2025, The Authors. All rights reserved.

Keywords: Contrastive Learning

A CTR Prediction Model Based on Attention Mechanism and Logarithmic Transformation

2nd International Conference on Digital Signal and Computer Communications, DSCC 2022

Authors: Meng, Lu; Shi, Tian-Wei; Chang, Guang-Ming; Qiang, Jiao-Feng; Cui, Wen-Hua
Affiliation: School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Liaoning, Anshan 114051, China

ISBN: (print) 9781510656734

To address the problems of inadequate feature interaction and untargeted feature combination in click-through rate (CTR) prediction models, we propose a CTR prediction model called SELFM, based on an attention mechanism and a logarithmic transformation structure. The model first incorporates the attention mechanism in the feature embedding stage to distinguish the importance of different features and avoid the effects of invalid features. Then a field-aware factorization machine is used to learn low-order feature interactions. The logarithmic transformation structure converts the power of each feature in a feature combination into a coefficient to be learned and is combined with the hidden layer for higher-order nonlinear feature interactions. The final output layer applies the sigmoid function to obtain the click-through rate prediction. The experimental results show that the AUC and Logloss of the proposed model are better than those of existing click-through rate prediction models, which effectively improves prediction accuracy and enhances the ability of the recommendation system to process data. © 2022 SPIE.

Keywords: Forecasting
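
The abstract above says the logarithmic transformation structure turns the power of each feature in a combination into a coefficient to be learned. The record does not give the exact formulation, so the following is only a sketch of the general identity such a structure can rely on, namely that a product of powers equals the exponential of a weighted sum of logarithms. The function name `log_transform_interaction`, the restriction to strictly positive inputs, and the toy numbers are all assumptions for illustration, not details of SELFM.

```python
import math
from typing import Sequence


def log_transform_interaction(x: Sequence[float], w: Sequence[float]) -> float:
    """Compute prod_i x_i ** w_i as exp(sum_i w_i * ln(x_i)).

    In log space the exponents w_i appear as ordinary linear
    coefficients, which is what makes them learnable by gradient descent.
    Assumes every x_i is strictly positive.
    """
    if len(x) != len(w):
        raise ValueError("x and w must have the same length")
    if any(xi <= 0 for xi in x):
        raise ValueError("inputs must be strictly positive before taking logs")
    weighted_log_sum = sum(wi * math.log(xi) for xi, wi in zip(x, w))
    return math.exp(weighted_log_sum)


# Hypothetical feature values and exponents: 2**2 * 3**1 * 4**0.5
value = log_transform_interaction([2.0, 3.0, 4.0], [2.0, 1.0, 0.5])
print(round(value, 6))
```

Setting an exponent to zero removes that feature from the combination, so a learned set of exponents can express interactions of varying order within one term.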

Recent Advances in Additive Manufacturing Technology: Achievements of the Rapid Manufacturing Center in Huazhong University of Science and Technology

Additive Manufacturing Frontiers, 2024, vol. 3, no. 2, pp. 51-89

Authors: Yusheng Shi; Chunze Yan; Bo Song; Bin Su; Qingsong Wei; Lichao Zhang; Jiamin Wu; Shifeng Wen; Jie Liu; Chao Cai; Shengfu Yu; Chenhui Li; Yan Zhou; Annan Chen; Lei Yang; Peng Chen; Yang Zou; Minkai Tang; Ying Chen; Yunsong Shi; Hongzhi Wu; Lei Zhang; Zhufeng Liu; Haoze Wang; Changshun Wang; Siqi Wu; Guizhou Liu; Zhen Ouyang
Affiliation: State Key Laboratory of Material Processing and Die & Mould Technology, School of Materials Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China

Additive manufacturing (AM) technology enables the creation of a wide variety of assemblies and complex shapes from three-dimensional model data in a bottom-up, layer-by-layer ***, AM has revolutionized the modern manufacturing industry, attracting increasing interest from both academic and industrial *** Rapid Manufacturing Center (RMC) of the School of Materials Science and Engineering at the Huazhong University of Science and Technology (HUST), one of the earliest and most powerful AM research teams in China, has been engaged in AM research since *** to address the “stuck neck” problems of specific high-strength products for AM, the RMC has conducted full-chain research in the aspects of special materials, processes, equipment, and applications for ***, it has formed a multi-disciplinary research team over the past three *** research achievements in the AM field include winning five national awards, more than ten first prizes, and more than ten second prizes at the provincial and ministerial *** RMC was complimented as “the world’s most influential organization in the laser AM field in 2018” by Virtual and Physical Prototyping (an international authoritative magazine of AM). Moreover, their industrialization achievements were evaluated as “having affected countries such as Singapore, South Korea, and the United States” by an international authoritative Wohlers Report on *** this study, we first summarize the representative research achievements of the RMC in the AM *** include the preparation and processing technology of high-performance polymeric, metallic, and ceramic materials for AM; advanced processing technology and software/equipment for AM; and typical AM-fabricated products and their ***, we discuss the latest research achievements in cutting-edge 4D printing in terms of feedstock selection, printing processes, induction strategies, and potential ***, we provide insights into the future di

Keywords: Additive manufacturing; Rapid manufacturing center; High-performance materials; Advanced processing technology software and equipment; High-performance products; Typical applications

TCSE: Trend and cascade based spatiotemporal evolution network to predict online content popularity

MULTIMEDIA TOOLS AND APPLICATIONS, 2023, vol. 82, no. 1, pp. 1459-1475

Authors: Wu, Danke; Tan, Zhenhua; Xia, Zhenche; Ning, Jingyu
Affiliation: Northeastern Univ, Software Coll, Shenyang 110819, Liaoning, Peoples R China

In online social networks (OSNs), popularity prediction uncovers the final size of online content based on the observed cascade, and has become a critical technology for online recommendation, viral marketing, and rumor detection. Recently, representation learning has helped to infer the mapping between the dynamic cascade and the final popularity efficiently, and has become a new research paradigm for popularity prediction. However, those methods are vulnerable to structure disturbance when they lack fine-grained supervision, as only the dynamic cascade is used. Therefore, we propose a novel trend and cascade based spatiotemporal evolution network (TCSE-Net), which preserves the distinguishable structure pattern while eliminating potential noise by aligning and fusing the temporal popularity and cascade. To be specific, we first leverage a Long Short-Term Memory (LSTM) network and a recurrent graph convolutional network (GCN) to learn the trend representation and the corresponding cascade representation, respectively. Meanwhile, we represent each node with its layer, so that the hierarchy is preserved in the cascade representation through the GCN. Then, both trend and cascade representations are aligned in time sequence and selectively assembled by a set of shared parameters for popularity prediction. Extensive experimental results show that our TCSE-Net outperforms state-of-the-art baselines on two real datasets. Related code will be publicly available on https://***/TAN-OpenLab/TCSE-Net.

Keywords: Popularity prediction; Information diffusion; Online social network; GCN; LSTM

Load-Balanced and Length-Minimized Link Scheduling for Multi-Channel TDMA Wireless Mesh Networks

Chinese Journal of Electronics, 2023, vol. 19, no. 4, pp. 733-736

Authors: Junfeng Jin; Baohua Zhao; Hao Zhou
Affiliations: School of Computer Science and Technology, University of Science and Technology of China, Hefei, China; State Key Laboratory of Networking and Switching Technology, Beijing, China; Province Key Laboratory of Software in Computing and Communication, Hefei, China

To minimize the schedule length and guarantee load balance across channels, a Load-balanced and length-minimized link scheduling (LBLM) algorithm is proposed. The LBLM algorithm is a heuristic scheme that assigns time slots for unicast traffic based on each link's weight and hop count in the routing traffic tree. The algorithm thus considers both primary and secondary interference and guarantees proportional fairness. The ns2 simulation results show that in multi-channel TDMA Wireless mesh networks (WMNs), the proposed algorithm has lower complexity, shorter frame length, and better channel balance than other well-known scheduling mechanisms.

Keywords: Wireless communication; Time division multiple access; Schedules; Unicast; Simulation; Wireless mesh networks; Routing

Performance analysis of classification between a particular number and average using the same distance measurements

Multimedia Tools and Applications, 2024, vol. 83, no. 38, pp. 86121-86139

Authors: Avuçlu, Emre
Affiliation: Department of Software Engineering, Faculty of Engineering, Aksaray University, Aksaray, Turkey

Artificial intelligence techniques are used in many areas today to find solutions to different problems. Scientists are trying to solve some problems in people’s daily lives using these techniques, and researchers often use Machine Learning (ML) algorithms to do so. It is important for researchers to have preliminary information about some metrics of machine learning algorithms in their scientific studies. In this study, the k-Nearest Neighbors (k-NN) and Minimum Distance to Means (MDC) ML algorithms, both of which classify according to distance measurements, were analyzed using the same distance measurement methods. k-NN, which measures distance to a specified number of neighbors, and MDC, which measures distance to the class averages, were examined in terms of classification performance. The two algorithms were compared with 5 different distance measurements (Euclidean, Manhattan, Minkowski, Chebyshev, Hellinger) on 2 different datasets (Ecoli and Cardiotocography). For the Ecoli dataset, the highest scores were 100% (train) and 84.85% (test) for the k-NN algorithm, and 73.78% (train) and 76.12% (test) for the MDC algorithm. For the Cardiotocography dataset, the highest scores were 99.94% (train) and 84.52% (test) for the k-NN algorithm, and 69.50% (train) and 61.57% (test) for the MDC algorithm. According to the results of the statistical experimental studies, the k-NN algorithm, which classifies according to a certain number of neighbors, gave better results on both datasets. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

Keywords: Distance measurement
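
The abstract above contrasts k-NN, which votes among a fixed number of nearest training points, with a minimum-distance-to-means classifier, which compares a query against one average vector per class. A minimal sketch of both under an interchangeable distance function follows. It is not the paper's code: the function names, the toy two-class data, and the choice to show only three of the five listed distances (Euclidean, Manhattan, Chebyshev) are assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict
from typing import Callable, Dict, List, Sequence, Tuple

Vector = Sequence[float]
Distance = Callable[[Vector, Vector], float]
Sample = Tuple[Vector, str]


def euclidean(a: Vector, b: Vector) -> float:
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a: Vector, b: Vector) -> float:
    return sum(abs(x - y) for x, y in zip(a, b))


def chebyshev(a: Vector, b: Vector) -> float:
    return max(abs(x - y) for x, y in zip(a, b))


def knn_predict(train: List[Sample], query: Vector, k: int, dist: Distance) -> str:
    """Majority vote among the k training samples closest to the query."""
    nearest = sorted(train, key=lambda item: dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def class_means(train: List[Sample]) -> Dict[str, List[float]]:
    """Per-class average vector, computed column by column."""
    groups: Dict[str, List[Vector]] = defaultdict(list)
    for x, label in train:
        groups[label].append(x)
    return {
        label: [sum(col) / len(rows) for col in zip(*rows)]
        for label, rows in groups.items()
    }


def mdc_predict(means: Dict[str, List[float]], query: Vector, dist: Distance) -> str:
    """Assign the class whose mean vector is closest to the query."""
    return min(means, key=lambda label: dist(means[label], query))


# Hypothetical two-class toy data
train = [
    ((1.0, 1.0), "a"), ((1.5, 2.0), "a"), ((2.0, 1.0), "a"),
    ((6.0, 6.0), "b"), ((7.0, 7.5), "b"), ((6.5, 7.0), "b"),
]
means = class_means(train)
for dist in (euclidean, manhattan, chebyshev):
    print(dist.__name__,
          knn_predict(train, (2.0, 2.0), k=3, dist=dist),
          mdc_predict(means, (6.0, 6.5), dist))
```

Passing the distance as a parameter mirrors the comparison the abstract describes, where the same distance measurements are applied to both classifiers.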

A Shape Expression approach for assessing the quality of Linked Open Data in libraries

SEMANTIC WEB, 2023, vol. 14, no. 2, pp. 159-179

Authors: Candela, Gustavo; Escobar, Pilar; Dolores Saez, Maria; Marco-Such, Manuel
Affiliation: Univ Alicante, Dept Software & Comp Syst, Alicante, Spain

Cultural heritage institutions are exploring Semantic Web technologies to publish and enrich their catalogues. Several initiatives, such as Labs, are based on the creative and innovative reuse of the materials published by cultural heritage institutions. In this way, quality has become a crucial aspect to identify and reuse a dataset for research. In this article, we propose a methodology to create Shape Expressions definitions in order to validate LOD datasets published by libraries. The methodology was then applied to four use cases based on datasets published by relevant institutions. It intends to encourage institutions to use ShEx to validate LOD datasets as well as to promote the reuse of LOD, made openly available by libraries.

Keywords: Linked open data; data quality; libraries; cultural heritage

PVDet: Towards pedestrian and vehicle detection on gigapixel-level images

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, vol. 118

Authors: Mo, Wanghao; Zhang, Wendong; Wei, Hongyang; Cao, Ruyi; Ke, Yan; Luo, Yiwen
Affiliation: Xinjiang Univ, Software Coll, Urumqi 830000, Xinjiang, Peoples R China

Recently, gigapixel photography has developed considerably and is gradually being applied to remote sensing, video surveillance, etc. Gigapixel images have a visible field of view at the square-kilometer level (containing thousands of targets) and up to 100 times scale variation. Within them, the differences in target pose, scale, and occlusion are huge, and most existing target detection algorithms cannot process them directly. To solve these problems, we propose a new multi-target pedestrian and vehicle detector PVDet (Towards Pedestrian and Vehicle Detection on Gigapixel-level images) for gigapixel-level images. First, the DPRNet (Deformable deeP Residual Network) is designed as the backbone network to enlarge the effective receptive field and improve the feature representation of pose-varying and occluded targets. Then, the PAFPN (Path Aggregation Feature Pyramid Network) is adopted to process the multi-scale features extracted by the backbone, boosting the multi-scale target modeling capability and the localization of small targets. Finally, the DyHead module is introduced to enhance the detection head's scale, spatial, and task awareness, further optimizing pedestrian and vehicle classification and localization. Compared with other state-of-the-art methods on the PANDA dataset, the experimental results show that the proposed method improves pedestrian and vehicle detection in gigapixel-level images by 10.4 AP over the baseline, which is better than the existing target detection algorithms. We also conducted experiments on the PASCAL VOC 2012 dataset to further demonstrate the generalization capability and effectiveness of the proposed method.

Keywords: Pedestrian detection; Vehicle detection; Gigapixel-level image; Attention mechanism
