检索结果-内蒙古大学图书馆

Strengthening attention: knowledge distillation via cross-layer feature fusion for image classification

international JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL 2024年第2期13卷 23-23页

作者： Zhai, Zhongyi Liang, Jie Cheng, Bo Zhao, Lingzhong Qian, Junyan Guilin Univ Elect Technol Guangxi Key Lab Image & Graph Intelligent Proc Guilin Peoples R China Beijing Univ Posts & Telecommun State Key Lab Networking & Switching Technol Beijing Peoples R China Guangxi Normal Univ Key Lab Educ Blockchain & Intelligent Technol Guilin Peoples R China

Deep learning has achieved great success in computer vision, especially in image classification tasks. How to improve the generalization ability and compactness of deep neural networks has gradually attracted widespread attention from researchers. Knowledge distillation is an effective technique for model compression. It transfers general knowledge from a sophisticated teacher model to a smaller student model. Recently, some studies refine knowledge from feature maps or adopt complex attention mechanisms to better supervise students imitating teachers. However, their methods focus too much on how to improve students' accuracy and largely overlook the associated training costs, which violates the original intention of knowledge distillation to compress the model. To achieve a balance between performance and efficiency, in this paper, we introduce a straightforward and effective distillation method to utilize the deepest feature maps to enhance shallow features. Specifically, our method performs processing only on the original feature maps without an extra assisting network. Moreover, we use cross-layer feature fusion to enhance the attention on shallow feature maps. By visualizing the features of different layers, we demonstrate the importance of the fusion operation in our method. Our experimental results on the CIFAR-100, tinyimageNet and miniimageNet datasets show that our approach outperforms previous methods, especially in the balance between performance and training cost. Further ablative studies verify the effectiveness of the design.

关键词： Deep learning Knowledge distillation image classification Attention

来源：评论

学校读者我要写书评

暂无评论

A Sequential Model of Neural Networks for Low Light image Enhancement 14

A Sequential Model of Neural Networks for Low Light Image En...

引用

14th international conference on Computing Communication and Networking Technologies, ICCCNT 2023

作者： Ashok, Naveen Kumar Swapna, T.R. Amrita Vishwa Vidyapeetham Amrita School of Computing Department of Computer Science and Engineering Coimbatore India

ISBN: (纸本)9798350335095

Enhancing the quality of low light images is a critical area of research, and the recent advancements in this field offer significant potential for enhancing the standard of low light images and their subsequent processing. However, most of these approaches do not consider the case of spatially uneven dark areas with backlit illumination. To address this issue, a pipeline of processes is proposed, where the sky region is segmented first, as most of the source of backlit illumination is a direct result of the presence of sky region. For noise removal and adjustment of exposure without color distortion, conversion of the remaining region from RGB to Luminance Chrominance color space is proposed. The enhancement is done on the Luminance component alone which is then combined with the chroma components and the segmented sky region. Furthermore, the results show that the proposed framework is promising and outperforms the Zero Deep Curve Estimation (DCE) model. © 2023 IEEE.

关键词： image enhancement

来源：评论

学校读者我要写书评

暂无评论

A Specular Reflection Removal Technique in Cervigrams 1

A Specular Reflection Removal Technique in Cervigrams

引用

1st IEEE international conference on Contemporary Computing and Communications, InC4 2023

作者： Mukku, Lalasa Thomas, Jyothi Department of Computer Science and Engineering Bangalore India

ISBN: (纸本)9798350335774

Cancer detection through medical image segmentation and classification is possible owing to the advancement in image processing techniques. Segmentation and classification tasks carried out to predict and classify diseases need to be dependable and precise. Specular reflections are the high-intensity and low-saturation areas that reflect the light from the probing devices that capture the picture of the organ surface. These areas sometimes mimic the features that are key identifying factors for cancers like acetowhite lesions. This review article examines the various methods proposed for removing specular reflections from medical images, especially those captured by colposcopes. The fundamentals of specular reflection removal and its associated challenges are discussed. The paper reviews several prominent approaches for removal of specular reflections proposes a novel method to remove the specular reflections. The comprehensive review can be a strong foundation for researchers looking to decide on appropriate techniques to employ in their respective research approaches. © 2023 IEEE.

关键词： Diseases

来源：评论

学校读者我要写书评

暂无评论

Person Re-Identification Based on Improved Transformer and CNN 5

Person Re-Identification Based on Improved Transformer and C...

引用

5th IEEE international conference on Civil Aviation Safety and Information Technology, ICCASIT 2023

作者： Dai, Yuyun Shanghai University Shanghai China

ISBN: (纸本)9798350310603

Person re-identification is a cross-view pedestrian tracking and retrieval technology, which is of great significance in the field of security monitoring. Due to the different conditions of the shooting scene, there will be a series of problems such as low resolution, perspective occlusion, pose changes, and lighting, which bring many challenges to the application of person re-identification technology. In order to solve the problem of low model recognition accuracy due to the loss of image block information and insufficient expression of pedestrian local features in person re-identification, this paper proposes a person re-identification method based on improved Transformer and CNN. Using the backbone network combined with ResNet and Transformer enhances the ability of pedestrian feature extraction. Through a large number of experiments on the mainstream data sets Market1501 and DukeMTMC-reID, the experimental results show that the person re-identification algorithm based on the improved Transformer and CNN can effectively improve the accuracy of person re-identification © 2023 IEEE.

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

Depth Map Estimation from a Single 2D image 17

Depth Map Estimation from a Single 2D Image

引用

17th international conference on Signal-image Technology and Internet-Based Systems, SITIS 2023

作者： Suarez, Patricia L. Carpio, Dario Sappa, Angel Espol Polytechnic University Guayaquil Ecuador Computer Vision Center Bellaterra Barcelona08193 Spain

ISBN: (纸本)9798350370911

This paper presents an innovative architecture based on a Cycle Generative Adversarial Network (CycleGAN) for the synthesis of high-quality depth maps from monocular images. The proposed architecture leverages a diverse set of loss functions, including cycle consistency, contrastive, identity, and least square losses, to facilitate the generation of depth maps that exhibit realism and high fidelity. A notable feature of the approach is its ability to synthesize depth maps from grayscale images without the need for paired training data. Extensive comparisons with different state-of-the-art methods show the superiority of the proposed approach in both quantitative metrics and visual quality. This work addresses the challenge of depth map synthesis and offers significant advancements in the field. © 2023 IEEE.

关键词： Generative adversarial networks

来源：评论

学校读者我要写书评

暂无评论

Third international conference on Optics and Communication Technology, ICOCT 2023

Third International Conference on Optics and Communication T...

引用

3rd international conference on Optics and Communication Technology, ICOCT 2023

ISBN: (纸本)9781510672529

The proceedings contain 33 papers. The topics discussed include: high precious automatic balanced homodyne detector for quantum information processing based on proportional-integral-derivative;an underwater image enhancement method based on SWIN transformer;image enhancement and edge detection for defect identification using infrared thermal wave radar imaging;DNU-Net for infrared small target detection;DDCU-Net: dual dynamic convolutional U-Net for infrared small-target detection;a broadband deconvolution beamforming acceleration method;telephoto camera calibration based on robust homography matrix;research on surface defect detection of aerospace electronic components based on machine vision;and multi-channel optical module based on PLCC packaging.

关键词：

来源：评论

学校读者我要写书评

暂无评论

An image enhancement algorithm based on multi-scale Retinex theory to improve the images quality of sensors 12

An image enhancement algorithm based on multi-scale Retinex ...

引用

SPIE 12th international Symposium on Multispectral image processing and Pattern Recognition, MIPPR 2023

作者： Li, Yiyao Chen, Zhong Sun, Wenyuan School of Journalism and Communication Central China Normal University Luoyu Road 152 Wuhan China School of Artificial Intelligence and Automation Huazhong University of Science and Technology Luoyu Road 1037 Wuhan China National Key Laboratory of Science and Technology on Multi-spectral Information Processing Luoyu Road 1037 Wuhan China Key Laboratory of Ministry of Education for Image Processing and Intelligent Control Luoyu Road 1037 Wuhan China Central China Normal University Library Luoyu Road 152 Wuhan China

ISBN: (数字)9781510674967

ISBN: (纸本)9781510674950

Nowadays, image recognition plays a pivotal role in acquiring data via sensors. However, the adaptability of traditional algorithms is hindered by the unpredictable nature of open environments, varying sensor quality, and image dimensions. Challenges arise in adverse conditions like inclement weather, low light, and optical distortions. Retinex-based methods have emerged as a viable solution, effectively enhancing images plagued by shadows or poor lighting. Yet, issues surface when images possess saturated colors;the conventional multi-scale Retinex with color restoration risks color inversion. Moreover, during gain compensation, extreme histogram values occupy significant gray level space, obscuring vital image details. This study delves into these challenges and proposes an enhanced multi-scale Retinex algorithm. Our approach substitutes logarithmic functions with tansig functions, eliminating color inversion risks. Additionally, a novel gain compensation method, integrating histogram stretching with Gamma correction, refines image clarity. The algorithm's robustness is evidenced in diverse scenarios, including adverse weather, low light, underwater imaging, and non-uniform lighting. Experimental results validate our method's superiority, surpassing other Retinex-based techniques both qualitatively and quantitatively. This research contributes valuable insights into image enhancement methodologies, fostering advancements in sensor-based data gathering in Smart Spaces. © COPYRIGHT SPIE. Downloading of the abstract is permitted for personal use only.

关键词： image enhancement

来源：评论

学校读者我要写书评

暂无评论

Deep Learning: The Future of Medical image processing

Deep Learning: The Future of Medical Image Processing

引用

2023 international conference on Computational Intelligence and Sustainable Engineering Solution, CISES 2023

作者： Singh, Kamred Udham Pandey, Saroj Kumar Yadav, Dhirendra Prasad Singh, Teekam Kumar, Gaurav Kumar, Ankit School of Computing Graphic Era Hill University Dehradun India Gla University Department of Computer Engineering & Applications UP Mathura India Graphic Era Deemed to Be University Department of Computer Science and Engineering Dehradun India

ISBN: (纸本)9798350323917

Health care is a vital service that is constantly in high demand since everyone needs it. Individuals have higher hopes for advancement in this profession than for receiving the status quo since they would rather be treated better. Sometimes, or maybe more accurately, in many situations, the findings are not clear and the sickness cannot be understood at the first stage from a manual reading of the report. When it comes to viewing medical images, the fact that they must be interpreted by hand inevitably leads to delays and errors. Many deep learning and machine learning approaches can be used to address this issue. Therefore, although machine learning may be thought of as a subset of deep learning, the reverse is not true. Let's talk about how deep-learning models will be used to process medical images. Medical image processing is one area where deep learning models are having a profound effect. Convolutional neural networks (CNNs) and other deep learning techniques have made it feasible to automate the processing of medical pictures and improve diagnostic and treatment accuracy. It is widely used in radiology to examine medical pictures such as X-rays, CT scans, and MRIs. In order to aid radiologists in their diagnostic work, deep learning models may be taught to identify patterns and characteristics in medical pictures that are diagnostic of certain diseases or ailments. © 2023 IEEE.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

image Integrity Checking Using Watermarking in Cloud Computing: A Review 3rd

Image Integrity Checking Using Watermarking in Cloud Computi...

引用

3rd international conference on Computing and Communication Networks, ICCCN 2023

作者： Rani, Jyoti Nath, Rajender Department of Computer Science and Applications Kurukshetra University Kurukshetra Thanesar India

ISBN: (纸本)9789819726707

In today’s era of cloud computing, modification and tampering of digital images on cloud storage have turn out to be easier due to proliferation of digital image processing tools. Consequently, tamper detection and integrity checking of images on remote server emerged as huge concern. image watermarking techniques provide an efficient way to tackle such problems. This paper provides a systematic examination of existing watermarking methods currently in use in cloud computing environment. This paper also presents a comparative analysis of those techniques. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

关键词： Cloud platforms

来源：评论

学校读者我要写书评

暂无评论

Depth Is All You Need for Monocular 3D Detection

Depth Is All You Need for Monocular 3D Detection

引用

IEEE international conference on Robotics and Automation (ICRA)

作者： Park, Dennis Li, Jie Chen, Dian Guizilini, Vitor Gaidon, Adrien Toyota Res Inst Toyota Japan

ISBN: (纸本)9798350323658

A key contributor to recent progress in 3D detection from single images is monocular depth estimation. Existing methods focus on how to leverage depth explicitly, by generating pseudo-pointclouds or providing attention cues for image features. More recent works leverage depth prediction as a pretraining task and fine-tune the depth representation while training it for 3D detection. However, the adaptation is limited in scale by manual labels. In this work, we propose further aligning the depth representation with the target domain in an unsupervised fashion. Our methods leverage commonly available LiDAR or RGB videos during training time to fine-tune the depth representation, which leads to improved 3D detectors. Especially when using RGB videos, we show that our two-stage training by first generating depth pseudo-labels is critical, because of the inconsistency in loss distribution between the two tasks. With either type of reference data, our multi-task learning approach improves over the state of the art on both KITTI and NuScenes, while matching the test-time complexity of its single-task sub-network. Source code and pretrained models are available on https://***/TRI-ML/DD3D.

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：