检索结果-内蒙古大学图书馆

Robust license plate detection through auxiliary information and context fusion model 1

2nd Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2019

作者： Wang, Ning Liu, Feng Gan, Zongliang Jiangsu Province Key Lab on Image Processing and Image Communications Nanjing University of Posts and Telecommunications Nanjing210003 China

ISBN: (数字)9783030317263

ISBN: (纸本)9783030317256

License plate detection has wide applications in the intelligent transportation system, while it still remains challenges to improve the robustness under various shooting distance and observation angles. To get better performance, a novel convolutional-neural-network-based method is proposed, which is achieved with auxiliary information and context fusion model. First, the auxiliary information is employed in our framework, which corresponds with resolutions, orientations and shapes of license plates. Specifically, the multiple resolutions are collected through integrating multi-level features of convolution hierarchy. Besides the various scales and ratios, the region proposal network RPN with multi-angle anchors and branching structure is applied to generate proper proposals. Second, an effective context fusion model is designed to fully exploit the hidden correlation between license plates and contextual properties. The local and contextual features are independently learned in the dual pathways, which are later joint to form a powerful representation in subsequent layers. Comprehensive experiments on the publicly available datasets confirm the effectiveness of the proposed method. © Springer Nature Switzerland AG 2019.

关键词： Convolution

来源：评论

学校读者我要写书评

暂无评论

GMRE-iUnet: Isomorphic Unet fusion model for PET and CT lung tumor images

引用

COMPUTERS IN BIOLOGY AND MEDICINE 2023年 166卷 107514-107514页

作者： Zhou, Tao Zhang, Xiangxiang Lu, Huiling Li, Qi Liu, Long Zhou, Huiyu North Minzu Univ Sch Comp Sci & Engn Yinchuan 750021 Peoples R China Ningxia Med Univ Sch Med Informat & Engn Yinchuan 750004 Peoples R China Univ Leicester Sch Comp & Math Sci Leicester LE1 7RH Leics England North Minzu Univ Key Lab Image & Graph Intelligent Processing State Ethn Affairs Commiss Yinchuan 750021 Peoples R China

Lung tumor PET and CT image fusion is a key technology in clinical diagnosis. However, the existing fusion methods are difficult to obtain fused images with high contrast, prominent morphological features, and accurate spatial localization. In this paper, an isomorphic Unet fusion model (GMRE-iUnet) for lung tumor PET and CT images is proposed to address the above problems. The main idea of this network is as following: Firstly, this paper constructs an isomorphic Unet fusion network, which contains two independent multiscale dual encoders Unet, it can capture the features of the lesion region, spatial localization, and enrich the morphological information. Secondly, a Hybrid CNN-Transformer feature extraction module (HCTrans) is constructed to effectively integrate local lesion features and global contextual information. In addition, the residual axial attention feature compensation module (RAAFC) is embedded into the Unet to capture fine-grained information as compensation features, which makes the model focus on local connections in neighboring pixels. Thirdly, a hybrid attentional feature fusion module (HAFF) is designed for multiscale feature information fusion, it aggregates edge information and detail representations using local entropy and Gaussian filtering. Finally, the experiment results on the multimodal lung tumor medical image dataset show that the model in this paper can achieve excellent fusion performance compared with other eight fusion models. In CT mediastinal window images and PET images comparison experiment, AG, EI, QAB/F, SF, SD, and IE indexes are improved by 16.19%, 26%, 3.81%, 1.65%, 3.91% and 8.01%, respectively. GMRE-iUnet can highlight the information and morphological features of the lesion areas and provide practical help for the aided diagnosis of lung tumors.

关键词： Lung tumor Multimodal medical image fusion Isomorphic unet CNN-Transformer Hybrid attention mechanism

来源：评论

学校读者我要写书评

暂无评论

Adaptive Person-Specific Appearance-Based Gaze Estimation 16th

Adaptive Person-Specific Appearance-Based Gaze Estimation

引用

16th International Forum on Digital TV and Wireless Multimedia communication, IFTC 2019

作者： Zheng, Chuanyang Zhou, Jun Sun, Jun Zhao, Lihua Institute of Image Communication and Network Engineering Shanghai Jiao Tong University Shanghai200240 China Shanghai Key Lab of Digital Media Processing and Transmissions Shanghai Jiao Tong University Shanghai200240 China Children’s Hospital of Shanghai Shanghai200062 China

ISBN: (纸本)9789811533402

Non-invasive gaze estimation from only eye images captured by camera is a challenging problem due to various eye shapes, eye structures and image qualities. Recently, CNN network has been applied to directly regress eye image to gaze direction and obtains good performance. However, generic approaches are susceptible to bias and variance highly relating to different individuals. In this paper, we study the person-specific bias when applying generic methods on new person. And we introduce a novel appearance-based deep neural network integrating meta-learning to reduce the person-specific bias. Given only a few person-specific calibration images collected in normal calibration process, our model adapts quickly to test person and predicts more accurate gaze directions. Experiments on public MPIIGaze dataset and Eyediap dataset show our approach has achieved competitive accuracy to current state-of-the-art methods and are able to alleviate person-specific bias problem. © 2020, Springer Nature Singapore Pte Ltd.

关键词： Deep neural networks

来源：评论

学校读者我要写书评

暂无评论

SVM Based Fast CU Partitioning Algorithm for VVC Intra Coding

SVM Based Fast CU Partitioning Algorithm for VVC Intra Codin...

引用

IEEE International Symposium on Circuits and Systems

作者： Guoqing Wu Yan Huang Chen Zhu Li Song Wenjun Zhang Institute of Image Communication and Network Engineering Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University

Recently, Joint Video Experts Team (JVET) has completed the new Versatile Video Coding (H.266/VVC) standard. VVC employs a new block partition structure named quad-tree with nested multi-type tree (QTMT) to improve coding efficiency. However, the new block partition structure increases huge encoding time compared with HEVC for brute-force ratedistortion (RD) optimization. To reduce encoding complexity, we propose a Support Vector Machine (SVM) based fast CU partitioning algorithm for VVC intra coding in this paper which terminates redundant partitions early by predicting the partition of CU using texture information. We trained classifiers for CUs of different sizes to improve accuracy and control the complexity of the classifiers themselves. Different thresholds are set for each classifier to achieve a trade-off between encoding complexity and RD performance. Experimental results show that the proposed method can save encoder time ranging from 30.78% to 63.16% with 1.10% to 2.71% BD-BR increase.

关键词： Support vector machines Prediction algorithms Encoding Distance measurement Partitioning algorithms Classification algorithms Copper

来源：评论

学校读者我要写书评

暂无评论

Infectious Probability Analysis on COVID-19 Spreading with Wireless Edge Networks

arXiv

引用

arXiv 2022年

作者： Li, Xuran Guo, Shuaishuai Dai, Hong-Ning Li, Dengwang Shandong Key Laboratory of Medical Physics and Image Processing School of Physics and Electronics Shandong Normal University Jinan250061 China School of Control Science and Engineering Shandong University Jinan250061 China Shandong Provincial Key Laboratory of Wireless Communication Technologies Jinan China The Department of Computer Science Hong Kong Baptist University Hong Kong

The emergence of infectious disease COVID-19 has challenged and changed the world in an unprecedented manner. The integration of wireless networks with edge computing (namely wireless edge networks) brings opportunities to address this crisis. In this paper, we aim to investigate the prediction of the infectious probability and propose precautionary measures against COVID-19 with the assistance of wireless edge networks. Due to the availability of the recorded detention time and the density of individuals within a wireless edge network, we propose a stochastic geometry-based method to analyze the infectious probability of individuals. The proposed method can well keep the privacy of individuals in the system since it does not require to know the location or trajectory of each individual. Moreover, we also consider three types of mobility models and the static model of individuals. Numerical results show that analytical results well match with simulation results, thereby validating the accuracy of the proposed model. Moreover, numerical results also offer many insightful implications. Thereafter, we also offer a number of countermeasures against the spread of COVID-19 based on wireless edge networks. This study lays the foundation toward predicting the infectious risk in realistic environment and points out directions in mitigating the spread of infectious diseases with the aid of wireless edge networks. Copyright © 2022, The Authors. All rights reserved.

关键词： Stochastic systems

来源：评论

学校读者我要写书评

暂无评论

Optimized efficient attention-based network for facial expressions analysis in neurological health care

引用

Computers in Biology and Medicine 2024年 179卷 108822-108822页

作者： Munsif, Muhammad Sajjad, Muhammad Ullah, Mohib Tarekegn, Adane Nega Cheikh, Faouzi Alaya Tsakanikas, Panagiotis Muhammad, Khan Sejong University Seoul143-747 Korea Republic of Digital Image Processing Lab Department of Computer Science Islamia College Peshawar25000 Pakistan Department of Computer Science Norwegian University for Science and Technology Gjøvik2815 Norway Department of Computer Science Norwegian University for Science and Technology Gjøvik2815 Norway Institute of Communication and Computer Systems National Technical University of Athens Athens15773 Greece Department of Applied Artificial Intelligence School of Convergence College of Computing and Informatics Sungkyunkwan University Seoul03063 Korea Republic of

Facial Expression Analysis (FEA) plays a vital role in diagnosing and treating early-stage neurological disorders (NDs) like Alzheimer's and Parkinson's. Manual FEA is hindered by expertise, time, and training requirements, while automatic methods confront difficulties with real patient data unavailability, high computations, and irrelevant feature extraction. To address these challenges, this paper proposes a novel approach: an efficient, lightweight convolutional block attention module (CBAM) based deep learning network (DLN) to aid doctors in diagnosing ND patients. The method comprises two stages: data collection of real ND patients, and pre-processing, involving face detection and an attention-enhanced DLN for feature extraction and refinement. Extensive experiments with validation on real patient data showcase compelling performance, achieving an accuracy of up to 73.2%. Despite its efficacy, the proposed model is lightweight, occupying only 3MB, making it suitable for deployment on resource-constrained mobile healthcare devices. Moreover, the method exhibits significant advancements over existing FEA approaches, holding tremendous promise in effectively diagnosing and treating ND patients. By accurately recognizing emotions and extracting relevant features, this approach empowers medical professionals in early ND detection and management, overcoming the challenges of manual analysis and heavy models. In conclusion, this research presents a significant leap in FEA, promising to enhance ND diagnosis and *** code and data used in this work are available at: https://***/munsif200/Neurological-Health-Care. © 2024 Elsevier Ltd

关键词： Feature extraction

来源：评论

学校读者我要写书评

暂无评论

A LIGHTWEIGHT SALIENCY PREDICTION MODEL FOR OMNIDIRECTIONAL imageS

A LIGHTWEIGHT SALIENCY PREDICTION MODEL FOR OMNIDIRECTIONAL ...

引用

2021 IEEE International Conference on Multimedia and Expo, ICME 2021

作者： Zhu, Dandan Chen, Yongqing Zhao, Defang Min, Xiongkuo Zhou, Qiangqiang Yu, Shaobo Zhai, Guangtao Yang, Xiaokang MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University China College of Information and Communication Hainan University China School of Software Engineering Tongji University China Institute of Image Communication and Network Engineering Shanghai Jiao Tong University China School of Software Jiangxi Normal University China Information Technology Services East China Normal University China

ISBN: (纸本)9781665438643

At present, most high-performing saliency prediction models for omnidirectional images (ODIs) depend on deeper or wider convolutional neural networks (CNNs), benefiting from their superior feature representation capability but suffering from high computational costs. To address this issue, we propose a novel lightweight saliency prediction model to predict the eye fixations on ODIs. Specifically, our proposed model consists of three modules: a lightweight feature representation module, a supervised attention module, and a dynamic convolution aggregation module. Different from the existing saliency prediction models, our proposed model is the first to introduce the dynamic convolution into the saliency prediction and aggregate multiple parallel convolution kernels dynamically based on their attention. Such a dynamic convolution operation is not only computationally efficient (small kernel size), but also increases the feature representation capability since these convolution kernels are aggregated in a non-linear manner via attention. Experimental results on two benchmark datasets show that our model is lightweight and outperforms other state-of-the-art methods. © 2021 IEEE

关键词： Convolution

来源：评论

学校读者我要写书评

暂无评论

SCSQ-MDD: A Detection Approach for Multi-Style Strokes of Chinese Characters

SSRN

引用

SSRN 2023年

作者： Zhou, Tian Xie, Wu Zhang, Huimin Fan, Yong Guangxi Key Laboratory of Image and Graphic Intelligent Processing Guilin University of Electronic Technology Guilin China School of Computer Science and Information Security Guilin University of Electronic Technology Guilin China Key Lab of Education Blockchain and Intelligent Technology Ministry of Education Guangxi Normal University Guilin541004 China Guangxi Key Lab of Multi-Source Information Mining and Security Guangxi Normal University Guilin541004 China School of Mechanical and Electrical Engineering Guilin University of Electronic Science and Technology Guilin China

In the Chinese character writing task of the robotic arms, the stroke category and position information should be extracted by object detection. The detection algorithms based on predefined anchor frames have difficulty in resolving the differences among many different styles of Chinese character strokes. While the deformable detection transformer (deformable DETR) algorithms without predefined anchor frames result in some invalid sampling points having no contribution to the feature update of the current reference point due to the random sampling of sampling points in the deformable attention module. These processes cause the effectiveness of correlation calculations between reference points in Chinese strokes and their surrounding sampled points is limited. So that the speed of vector learning stroke features in the detection head is reduced. In view of this problem, a new detection method of multi-style strokes of Chinese characters via SCSQ-MDD (Simple Conditional Spatial Query Mask Deformable DETR) is proposed in this paper. Firstly, a mask prediction layer is jointly determined using the shallow feature map of the Chinese character image and the query vector of the transformer encoder, which is used to filter the points with actual contribution and resample the points without contribution, so that the randomness of correlation calculation among reference points is solved. Secondly, by separating the content query and spatial query of the transformer deocder, the content embedding and spatial embedding can be separately focused on when cross-attention computations are performed. Thus the dependence of the prediction task on the content embedding is relaxed and the training process is simplified. Finally, the detection model without predefined anchor frames based on deformable DETR called SCSQ-MDD is constructed using the mask mechanism and the simple conditional spatial query mechanism, and trained and validated on a multi-style Chinese character stroke dataset

关键词： Object detection

来源：评论

学校读者我要写书评

暂无评论

An Adaptive Feature-based Quantization Algorithm for Point Cloud Compression

An Adaptive Feature-based Quantization Algorithm for Point C...

引用

Picture Coding Symposium, PCS

作者： Da Ai Hongying Lu Yurong Yang Ying Liu National Key Lab. of Electronic Information Processing for Applications in Crime Scene Investigation Ministry of Public Security Xi‘an China Center for Image and Information Processing Xi'an University of Posts and Telecommunications Xi'an China The International Joint-Research Center for Wireless Communication and Information Processing Xi‘an University of Posts and Telecommunications Xi‘an China

To reduce over-rasterization distortion caused by global uniform quantization for static surface point cloud, an adaptive quantization coding method based on feature mining is proposed. Combining spatial position and texture feature of point clouds with level of details, the quantization increment is dynamically set according to feature priority, which can reserve the number of effective points to the maximum extent, and reduce the rasterization distortion. Experimental results show that the proposed method can effectively enhance the subjective reconstruction quality of compressed point cloud, gaining better results of rate-distortion optimization.

关键词： Graphics Geometry Three-dimensional displays Quantization (signal) Transform coding Rate-distortion Distortion

来源：评论

学校读者我要写书评

暂无评论

Looking here or there? Gaze Following in 360-Degree images

Looking here or there? Gaze Following in 360-Degree Images

引用

International Conference on Computer Vision (ICCV)

作者： Yunhao Li Wei Shen Zhongpai Gao Yucheng Zhu Guangtao Zhai Guodong Guo Institute of Image Communication and Network Engineering Shanghai Jiao Tong University MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University Baidu

ISBN: (纸本)9781665428132

Gaze following, i.e., detecting the gaze target of a human subject, in 2D images has become an active topic in computer vision. However, it usually suffers from the out of frame issue due to the limited field-of-view (FoV) of 2D images. In this paper, we introduce a novel task, gaze following in 360-degree images which provide an omnidirectional FoV and can alleviate the out of frame issue. We collect the first dataset, "GazeFollow360" 1 , for this task, containing around 10,000 360-degree images with complex gaze behaviors under various scenes. Existing 2D gaze following methods suffer from performance degradation in 360degree images since they may use the assumption that a gaze target is in the 2D gaze sight line. However, this assumption is no longer true for long-distance gaze behaviors in 360-degree images, due to the distortion brought by sphere-to-plane projection. To address this challenge, we propose a 3D sight line guided dual-pathway framework, to detect the gaze target within a local region (here) and from a distant region (there), parallelly. Specifically, the local region is obtained as a 2D cone-shaped field along the 2D projection of the sight line starting at the human subject’s head position, and the distant region is obtained by searching along the sight line in 3D sphere space. Finally, the location of the gaze target is determined by fusing the estimations from both the local region and the distant region. Experimental results show that our method achieves significant improvements over previous 2D gaze following methods on our GazeFollow360 dataset.

关键词： Degradation Computer vision Solid modeling Three-dimensional displays Head Estimation Distortion

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：