检索结果-内蒙古大学图书馆

IEEE Region 10 International Conference TENCON

作者： Ei Ei Tun Aktanin Konkitkriengkrai Watchara Ruangsang Supavadee Aramvith Department of Electrical Engineering Faculty of Engineering Chulalongkorn University Bangkok Thailand Department of Electrical Engineering Multimedia Data Analytics and Processing Research Unit Faculty of Engineering Chulalongkorn University Bangkok Thailand

Image compression is a topic of significant interest as it reduces file sizes in stored data. In this paper, we propose a model that achieves multiple levels of compression, thereby minimizing the storage space required for images, which typically consume substantial amounts of data due to their size and resolution. We combine an image downscaling and upscaling model with an image compression model. By leveraging convolutional techniques to identify image features, we can effectively reduce the size of the image through downscaling and subsequently upscaling it. Additionally, we employ entropy image compression and arithmetic encoding to compress and reconstruct the image while preserving its lossless data. Through experimentation with the Kodak dataset, we observed that our proposed model achieved a compression rate of 96.92%, significantly reducing the data needed for file storage. Moreover, our reconstructed images attained a standardized measure with a signal-to-noise ratio of 33.10 dB and a structural similarity of 0.9219. Notably, the perceptual quality of the images, including intricate details, remained intact to the human eye.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Enhancing Visual Fidelity in Unpaired Night-to-Day Image Translation through a Perceptual Quality Focused Training Objective

Enhancing Visual Fidelity in Unpaired Night-to-Day Image Tra...

引用

Artificial Intelligence (SLAAI-ICAI), SLAAI International Conference on

作者： H.K.I.S. Lakmal Maheshi B. Dissanayake Supavadee Aramvith Dept. of Electrical and Electronic Engineering Faculty of Engineering University of Peradeniya Peradeniya Sri Lanka Dept. of Electrical Engineering Multimedia Data Analytics and Processing Research Unit Chulalongkorn University Bangkok Thailand

Nighttime driving poses visibility challenges, but image translation methods can help by transforming night images into day-like scenes. The Cycle-GAN is a versatile unpaired image translation model which can easily be adept to night-to-day image translation tasks. However, it generates unnatural and unrealistic outcomes in these cases. This study is focused on addressing this with a novel training strategy for the Cycle-GAN, employing a tailored training objective that incorporates perceptual quality optimization. This training objective aims to boost the naturalness and perceptual quality of the generated images by preserving high-level image features. The optimization process involves minimizing Euclidean distances between synthesized and target image feature maps, which are derived from the pre-trained VGG19 network. Experimental findings attest to the effectiveness of this method, revealing noteworthy improvements of 8% in Inception Score, 2% in NIQE Score, and 13% in BRISQUE Score.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Combined Channel and Spatial Attention-based Stereo Endoscopic Image Super-Resolution

arXiv

引用

arXiv 2023年

作者： Hayat, Mansoor Armvith, Supavadee Achakulvisut, Titipat Department of Electrical Engineering Chulalongkorn University Bangkok Thailand Multimedia Data Analytics and Processing Research Unit Department of Electrical Engineering Chulalongkorn University Bangkok Thailand Department of Biomedical Engineering Mahidol University Bangkok Thailand

Stereo Imaging technology integration into medical diagnostics and surgeries brings a great revolution in the field of medical sciences. Now, surgeons and physicians have better insight into the anatomy of patients' organs. Like other technologies, stereo cameras have limitations, e.g., low resolution (LR) and blurry output images. Currently, most of the proposed techniques for super-resolution focus on developing complex blocks and complicated loss functions, which cause high system complexity. We proposed a combined channel and spatial attention block to extract features incorporated with a specific but very strong parallax attention module (PAM) for endoscopic image super-resolution. The proposed model is trained using the da Vinci dataset on scales 2 and 4. Our proposed model has improved PSNR up to 2.12 dB for scale 2 and 1.29 dB for scale 4, while SSIM is improved by 0.03 for scale 2 and 0.0008 for scale 4. By incorporating this method, diagnosis and treatment for endoscopic images can be more accurate and effective. Copyright © 2023, The Authors. All rights reserved.

关键词： Geometrical optics

来源：评论

学校读者我要写书评

暂无评论

Combined Channel and Spatial Attention-Based Stereo Endoscopic Image Super-Resolution

Combined Channel and Spatial Attention-Based Stereo Endoscop...

引用

IEEE Region 10 International Conference TENCON

作者： Mansoor Hayat Supavadee Armvith Titipat Achakulvisut Department of Electrical Engineering Chulalongkorn University Bangkok Thailand Department of Electrical Engineering Multimedia Data Analytics and Processing Research Unit Chulalongkorn University Bangkok Thailand Department of Biomedical Engineering Mahidol University Bangkok Thailand

关键词：

来源：评论

学校读者我要写书评

暂无评论

Enhanced Cross-Modality MRI Segmentation Using Dilated Convolutions and Multi-Scale Gradient Map

Enhanced Cross-Modality MRI Segmentation Using Dilated Convo...

引用

International Computer Science and Engineering Conference (ICSEC)

作者： Ghulam Murtaza Charnchai Pluempitiwiriyawej Somkiat Wangsiripitak Mohammad Jawad Fareed Mudassar Khalid Department of Electrical Engineering Chulalongkorn University Bangkok Thailand Department of Electrical Engineering Multimedia Data Analytics and Processing Research Unit Chulalongkorn University Bangkok Thailand Machine Intelligence and Vision Laboratory School of Information Technology King Mongkut’s Insittute of Technology Ladkrabang Thailand

ISBN: (数字)9798350366860

ISBN: (纸本)9798350366877

In recent years, convolutional neural networks have significantly advanced image segmentation, particularly for brain images, where important edge features are automatically found. However, accurate segmentation of tumors in a brain remains a challenge across different magnetic resonance modalities, like T1, T2, T1ce, and FLAIR. Using a simple gradient map as an input to the neural networks is not effective due to variations in cross-modality image characteristics. To address this issue, we introduced multi-scale gradient maps that incorporate Holistically Nested Edge Detection (HED) and dilated convolutions into the UNet model. The HED model captures detailed gradient information, enhancing structural feature identification across modalities, while dilated convolutions expand the UNet receptive field for better contextual understanding without increasing parameters. Our method was trained and evaluated on the BraTS2018 dataset. The experimental results demonstrate significant improvements in segmentation accuracy and robustness. Specifically, our method achieved a Dice Similarity Coefficient (DSC) of 0.6902 for T2 to T1ce, 0.6858 for T2 to T1, 0.4329 for FLAIR to T1, and 0.6004 for FLAIR to T1ce, outperforming previous state-of-the-art methods. This demonstrates the effectiveness of our approach in enhancing segmentation performance across different MR image modalities.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Estimation of Eucalyptus DBH from UAV-LiDAR data Utilizing Advanced Point Cloud processing Techniques

Estimation of Eucalyptus DBH from UAV-LiDAR Data Utilizing A...

引用

International Conference on Business and Industrial Research (ICBIR)

作者： Peerapong Dangrungroj Monton Udompitaksook Natapong Intarasuk Charnchai Pluempitiwiriyawej Teerapol Silawan Department of Electrical Engineering Communications and Information Engineering (CIE) Laboratory Chulalongkorn University Bangkok Thailand Sky Visual Imaging Venture Co. Ltd Bangkok Thailand Department of Electrical Engineering Multimedia Data Analytics and Processing Research Unit Chulalongkorn University Bangkok Thailand

ISBN: (数字)9798350383027

ISBN: (纸本)9798350383034

The Diameter at Breast Height (DBH) measurements are essential for forest management and carbon absorption estimation in environmental challenges. Traditional DBH measurements require more precision and efficiency, especially in inaccessible forests. Existing methods, including manual measurement or remote sensing, struggle with precision in low-intensity point cloud areas. Our technique solves these limitations by analyzing the entire tree stem by the process, including convex hull construction and advanced noise reduction using real-world datasets. It has reduced the Root Mean Squared Error (RMSE) to 0.020, or 92.25% improvement, and the bias has decreased to -0.008. These enhancements highlight the accuracy and reliability of our DBH estimation technique.

关键词： Point cloud compression Adaptation models Convex hulls Accuracy Noise reduction Estimation Forestry Vegetation Breast Remote sensing

来源：评论

学校读者我要写书评

暂无评论

Senext: Squeeze-and-Excitationnext for Single Image Super-Resolution

SSRN

引用

SSRN 2022年

作者： Muhammad, Wazir Aramvith, Supavadee Onoye, Takao Department of Electrical Engineering Faculty of Engineering Chulalongkorn University Bangkok10330 Thailand Multimedia Data Analytics and Processing Unit Department of Electrical Engineering Faculty of Engineering Chulalongkorn University Bangkok10330 Thailand Graduate School of Information Science and Technology Osaka University 1-5 Yamadaoka Suita565-0871 Japan

Recent research on single image super-resolution (SISR) using deep convolutional neural networks (CNNs) has shown significant development in the area computer vision-based tasks specially image and video processing. SISR seeks to reconstruct a visibly appealing high-quality / high-resolution (HR) output image from a low-quality / low-resolution (LR) input image as its primary goal. However, most existing CNN-based image super-resolution (SR) frameworks often use a deeper and broader network architecture that requires a sizeable computational resource, risk of overfitting, increases computational complexity, and more memory consumption, as well as takes more processing time during the evaluations. To resolve these problems, we propose a Squeeze-and-ExcitationNext for Single Image Super-Resolution concept named as SENext. In detail, the squeeze-and-excitation blocks (SEB) are used in our network architecture to reduce the computational cost and adopt the channel-wise feature mappings to adaptively recalibrate the features. Furthermore, local, sub-local and global skip connections are employed between each SEB to enable the feature reusability and stabilize training convergence smoothly. Instead of hand-designed bicubic upsampling at pre-processing step, we perform post-upsampling at the later end to reconstruct the high-resolution (HR) image. Extensive quantitative and qualitative experiments are performed on the benchmark test dataset, including Set5, Set14, BSDS100, Urban100, and Manga109. These experimental evaluations validate the superiority of the SENext over other deep CNN image SR methods in terms of PSNR/SSIM, FLOPs, Number of parameters, processing speed, and visually pleasing effect. © 2022, The Authors. All rights reserved.

关键词： Network architecture

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：