Search Results - Inner Mongolia University Library

13th International Conference on Innovations in Bio-Inspired Computing and Applications, IBICA 2022, and 12th World Congress on Information and Communication Technologies, WICT 2022

Authors: Kumar, C S Ayush Maharana, Advaith Das Krishnan, Srinath Murali Hanuma, Sannidhi Sri Sai Lal, G. Jyothish Ravi, Vinayakumar Amrita School of Engineering Coimbatore Amrita Vishwa Vidyapeetham Coimbatore India Center for Artificial Intelligence Prince Mohammad Bin Fahd University Khobar Saudi Arabia

ISBN: (print) 9783031274985

The importance of speech emotion recognition has increased with the adoption of intelligent conversational assistant services. Emotion recognition and analysis can improve communication between humans and machines. We propose applying attention-based deep learning techniques to process and recognize speech emotions. In this paper we examine two major approaches, CNN-LSTM and Mel spectrogram-Vision Transformer based models, and compare them with existing benchmarks. The experimental results support the feature extraction strategy of deep learning based approaches, which eliminates the need to handpick features for the traditional machine learning (ML) classifiers found in the current literature. A comparative study of CNN-LSTM and Vision Transformers (ViT) is established from the experimental results. The two models performed similarly, with CNN-LSTM giving an accuracy of 88.50% compared with 85.36% for ViT, surpassing the existing benchmarks and opening scope for the study of attention and image processing based learning for speech emotion recognition. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

Keywords: image processing

Recovering Image Information from Speckle Noise by Image Processing

6th International Conference on Machine Vision and Applications, ICMVA 2023

Authors: Nie, Jianlin Hanson, Steen G. Takeda, Mitsuo Wang, Wei Xi'An Technological University Shaanxi Xi'an China DTU Fotonik Department of Photonics Engineering Technical University of Denmark Roskilde DK-4000 Denmark Utsunomiya University Utsunomiya Tochigi Japan School of Engineering and Physical Sciences Heriot-Watt University Edinburgh EH14 4AS United Kingdom

ISBN: (print) 9781450399531

As a kind of noise, speckle seriously degrades the imaging quality of optical imaging systems. However, a speckle image carries a large amount of information related to the physical characteristics of the object surface, which can be used as a basis for identifying and judging hidden objects. In this paper, speckle noise removal in optical imaging is studied. The squared moduli of the spectra of short-exposure speckle images are averaged to recover the amplitude information, while the cross-spectrum function is used to recover the phase information. We use this method to process the images. A simulation analysis is then carried out by varying two aspects: the number of stacked frames and the object. The results show that this method can recover the feature information from the speckle image, thus verifying the feasibility of the method. © 2023 ACM.

Keywords: image processing
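
The two averages named in the abstract above can be sketched with NumPy as follows. This is only an illustration under stated assumptions, not the authors' implementation: the function names, the `(n, H, W)` frame layout, the one-pixel frequency shift, and the conjugation convention of the cross-spectrum are all choices made here.

```python
import numpy as np

def average_power_spectrum(frames):
    """Mean of |FFT|^2 over a stack of short-exposure frames.

    frames: array of shape (n, H, W). The result carries amplitude
    (modulus) information only; phase is lost in the squared modulus.
    """
    spectra = np.fft.fft2(frames, axes=(-2, -1))
    return np.mean(np.abs(spectra) ** 2, axis=0)

def average_cross_spectrum(frames, shift=(0, 1)):
    """Mean of F(u) * conj(F(u + shift)) over the frame stack.

    The phase of the result encodes phase differences between
    neighbouring spatial frequencies (sign depends on this convention).
    """
    spectra = np.fft.fft2(frames, axes=(-2, -1))
    # np.roll with a negative shift makes shifted[u] == spectra[u + shift].
    shifted = np.roll(spectra, shift=(-shift[0], -shift[1]), axis=(-2, -1))
    return np.mean(spectra * np.conj(shifted), axis=0)
```

With a zero shift the cross-spectrum reduces to the power spectrum, which is a convenient sanity check before integrating phase differences.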

Multi-View Image Rectification for UAV-Captured Image Sequences

2023 IEEE International Conference on Visual Communications and Image Processing, VCIP 2023

Authors: Zou, Shun Jin, Xin Fan, Yihui Tsinghua University Shenzhen International Graduate School Shenzhen China

ISBN: (print) 9798350359855

The image sequences captured by Unmanned Aerial Vehicles (UAVs) can be applied to many computer vision tasks. However, due to the instability of UAV flight, the captured image sequences deviate from the preset trajectory and pose, which reduces the quality of subsequent applications such as panoramic image stitching. In this paper, a novel method is proposed to rectify UAV-captured image sequences by transforming the images to a regular trajectory with a uniform pose. First, to minimize the total transformation deviation, a virtual regular camera trajectory is derived by minimizing the global error of coordinates between the actual and virtual camera trajectories. Then, a camera-pose-relevant local homography is proposed, which inserts the camera pose into the local homography to transform the images to the derived virtual trajectory with a uniform pose and to correct translation parallax. The experimental results demonstrate the effectiveness of the proposed rectification algorithm at both the theoretical and application levels. © 2023 IEEE.

Keywords: Cameras
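
The idea of deriving a virtual trajectory by minimizing the coordinate error against the actual one can be illustrated with an ordinary least-squares fit. The abstract does not specify the trajectory model, so the sketch below assumes the simplest case: a straight path with evenly spaced camera centers. The function name and the `(n, 3)` input layout are hypothetical.

```python
import numpy as np

def fit_regular_trajectory(centers):
    """Fit evenly spaced points on a straight line to camera centers.

    centers: (n, 3) array of actual camera positions, in capture order.
    Model (an assumption): virtual_i = start + i * step.
    Returns (start, step, virtual), where virtual has shape (n, 3).
    """
    centers = np.asarray(centers, dtype=float)
    n = len(centers)
    # Design matrix: one column for the start point, one for the frame index.
    design = np.column_stack([np.ones(n), np.arange(n, dtype=float)])
    # Least squares minimizes the summed squared coordinate error.
    coef, *_ = np.linalg.lstsq(design, centers, rcond=None)
    start, step = coef
    return start, step, design @ coef
```

The residual `centers - virtual` then gives the per-frame translation that a subsequent image warp would have to account for.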

Assessment of 3D MRI Image Segmentation and Classification for Brain Tumor Detection Using ConvLSTM

5th IEEE International Conference on Cybernetics, Cognition and Machine Learning Applications, ICCCMLA 2023

Authors: Raju, K Srujan Arvind, Sudha Chegoni, Ramesh Naryana, V.A. Vivekananda, A. Kishore Babu, Ch Raja CMR Technical Campus Department of CSE Telangana Hyderabad India CMR Technical Campus Department of ECE Telangana Hyderabad India CMR Technical Campus Department of IT Telangana Hyderabad India CMR College of Engineering & Technology Department of CSE Telangana Hyderabad India

ISBN: (print) 9798350338287

The human brain serves as the principal controller of the human body. Brain tumors result from abnormal cell division and proliferation, and their development can lead to brain cancer. The use of computer vision in diagnostic procedures has the potential to reduce human error in judgment, and the incorporation of new technology in healthcare is seen as a way to improve human decision-making in diagnosis. Magnetic Resonance Imaging (MRI) is considered comparatively more dependable and safer than other diagnostic imaging techniques. To identify brain tumors on BraTS, we propose a method using Convolutional Long Short-Term Memory (ConvLSTM) on segmented anomalous portions of 3D MRI brain images in Matlab. A simple graphical user interface for finding brain tumors is also created in Matlab. This work sought to identify the precise tumor site by first classifying the findings from various brain images into three categories: normal, benign, and ***- Deep Learning, Pytorch, Neural Network, Artificial Intelligence, Natural Language processing, Tkinter. © 2023 IEEE.

Keywords: Magnetic resonance imaging

Vision Transformer Based Vision Enhancement for Visually Impaired Individuals

7th International Conference on Circuit Power and Computing Technologies, ICCPCT 2024

Authors: Mohammed Ovaiz, A. Yogaraj, A. Rani, K.S. Madhan Kumar, V. Mohammed Kabir, M. Veeramuthu, K. Vel Tech High Tech Dr Rangarajan Dr Sakunthala Engineering College Department of Electronics and Communication Engineering Chennai India

ISBN: (digital) 9798350372816

ISBN: (print) 9798350372816

The goal of visual implants is to create artificial vision that can partially restore function. Using 60 microelectrodes implanted in the retina, such an implant can enhance the quality of life of visually challenged individuals by allowing them to perceive light, even after years of darkness. The artificial vision made possible by current visual system stimulators has very poor resolution because of their small number of microelectrodes. Numerous researchers have sought to enhance the artificial vision produced by low-resolution implants through machine learning and image processing techniques. Because phosphene images have low resolution, users report dissatisfaction with the Retinal Prosthesis System. This underscores the need for targeted research aimed at improving visual clarity and overall user satisfaction. This research proposes simulating artificial vision in which the visually impaired user receives information synthesized by the system through a low-resolution image provided by a visual implant. Using a Vision Transformer, the technique gathers useful data about people in the immediate vicinity of the visually impaired person, including their number, familiarity, gender, approximate ages, facial emotions, nearby items, and approximate distances. The information obtained from the camera frames of the user's glasses is used to create signals that are then sent to a visual stimulator, offering a potentially effective way to improve the visual experience of those who are visually impaired. To facilitate economical real-time implementation in an independent portable system, the algorithm that best suits each feature is chosen based on its accuracy and time complexity. The proposed approach uses audio to provide crucial information about those in close proximity to a visually impaired person, enabling them to converse with others more comfortably. This paper can thus be taken into consideration for some next-generation v

Keywords: Ophthalmology
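
To give a feel for how little resolution 60 electrodes provide, the sketch below reduces a grayscale camera frame to one brightness value per electrode by block averaging. It is an illustration only, not the method of the paper: the 6 x 10 grid layout, the function name, and the min-max normalization are assumptions introduced here.

```python
import numpy as np

def simulate_phosphenes(image, grid=(6, 10)):
    """Reduce a grayscale image to one value per electrode.

    image: 2-D array (H, W) with H >= grid rows and W >= grid columns.
    grid:  assumed electrode layout; 6 x 10 gives 60 electrodes.
    Returns an array of shape `grid` with values scaled to [0, 1].
    """
    image = np.asarray(image, dtype=float)
    rows, cols = grid
    h, w = image.shape
    # Crop so the image divides evenly into grid cells.
    h_c, w_c = h - h % rows, w - w % cols
    cells = image[:h_c, :w_c].reshape(rows, h_c // rows, cols, w_c // cols)
    means = cells.mean(axis=(1, 3))
    lo, hi = means.min(), means.max()
    if hi == lo:
        # A flat image carries no contrast to display.
        return np.zeros_like(means)
    return (means - lo) / (hi - lo)
```

Rendering each value as a blurred dot would come closer to published phosphene simulations, but the grid alone already shows why scene details such as faces need to be summarized by a separate recognition stage.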

Visibility Enhancement of Sand-Dust Weather Images Based on Color Compensation and Color Correction

7th Edition Global Conference on Wireless and Optical Technologies, GCWOT 2024

Authors: Kashif Masood, Muhammad Khawaja Otero, Pablo Nava, Enrique University of Malaga Spain

ISBN: (print) 9798331534271

Sand-dust weather causes low contrast and color distortion in outdoor shots, which has a significant impact on outdoor vision applications, particularly autonomous cars. The autonomous car system considered here analyses still images (photos) offline, and by processing these images offline the proposed study aims to improve visibility in dusty conditions. An effective strategy for enhancing sand-dust photos is provided to mitigate the color cast and low contrast caused by weather with sand and dust. Color compensation and color correction methods are used to enhance the sand-dust images. Prior to white balancing, a color compensation method is used to fix the initial color cast that degraded the dusty image. In the proposed method, the color compensation algorithm uses the abundant yellow channel information produced by sand-dust scattering to compensate for the blue and green channel information. While color compensation removes the sand-dust cast, it also reduces image detail. The color correction algorithm then converts the obtained image from the RGB to the HSV color space, and new color divergence is avoided by using CLAHE to raise the V component. To preserve edges and minimize noise, a Laplacian filter is applied before the final image is converted back to RGB. Finally, the proposed method exhibits a 1% higher mean value, a 1.13% higher standard deviation, and a 1% higher entropy compared with the existing methods. © 2024 IEEE.

Keywords: Color image processing
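
The compensate-then-white-balance order described in the abstract can be sketched in NumPy. The abstract does not give the compensation formula, so the sketch substitutes a generic mean-difference channel compensation (the weak channel is boosted using the strong red channel) followed by gray-world white balancing. Both choices, the function names, and the `alpha` parameter are assumptions; the HSV, CLAHE, and Laplacian steps are not reproduced here.

```python
import numpy as np

def compensate_channel(weak, strong, alpha=1.0):
    """Boost an attenuated channel using a well-preserved one.

    Both inputs are 2-D arrays in [0, 1]. The boost grows with the gap
    between the channel means and is larger where `weak` is dark and
    `strong` is bright. This formula is an assumed, generic form.
    """
    gain = alpha * (strong.mean() - weak.mean())
    return np.clip(weak + gain * (1.0 - weak) * strong, 0.0, 1.0)

def gray_world(rgb):
    """Scale each channel so all channel means match their average."""
    means = rgb.reshape(-1, 3).mean(axis=0)
    return np.clip(rgb * (means.mean() / (means + 1e-8)), 0.0, 1.0)

def correct_sand_dust(rgb):
    """Compensate green and blue from red, then white-balance.

    rgb: float array of shape (H, W, 3) with values in [0, 1].
    """
    r, g, b = rgb[..., 0], rgb[..., 1], rgb[..., 2]
    g2 = compensate_channel(g, r)
    b2 = compensate_channel(b, r)
    return gray_world(np.stack([r, g2, b2], axis=-1))
```

Compensating before white balancing matters because gray-world scaling alone would amplify noise in a nearly empty blue channel.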

Photonic signal processor based on a Kerr microcomb for real-time video image processing

COMMUNICATIONS ENGINEERING, 2023, Vol. 2, No. 1, p. 94

Authors: Tan, Mengxi Xu, Xingyuan Boes, Andreas Corcoran, Bill Nguyen, Thach G. Chu, Sai T. Little, Brent E. Morandotti, Roberto Wu, Jiayang Mitchell, Arnan Moss, David J. Beihang Univ Sch Elect & Informat Engn Beijing 100191 Peoples R China Swinburne Univ Technol Opt Sci Ctr Hawthorn Vic 3122 Australia RMIT Univ Sch Engn Melbourne Vic 3001 Australia Beijing Univ Posts & Telecommun State Key Lab Informat Photon & Opt Commun Beijing 100876 Peoples R China Univ Adelaide Inst Photon & Adv Sensing IPAS Adelaide SA 5005 Australia Univ Adelaide Sch Elect & Elect Engn Adelaide SA 5005 Australia Monash Univ Dept Elect & Comp Syst Engn Clayton Vic 3168 Australia City Univ Hong Kong Dept Phys & Mat Sci Tat Chee Ave Hong Kong Peoples R China Chinese Acad Sci Xian Inst Opt & Precis Mech Xian Peoples R China INRS Energie Materiaux & Telecommun 1650 Blvd Lionel Boulet Varennes J3X 1S2 PQ Canada

Signal processing has become central to many fields, from coherent optical telecommunications, where it is used to compensate signal impairments, to video image processing. Image processing is particularly important for observational astronomy, medical diagnosis, autonomous driving, big data and artificial intelligence. For these applications, signal processing has traditionally been performed mainly electronically. However, these, as well as new applications, particularly those involving real-time video image processing, are creating unprecedented demand for ultrahigh performance, including high bandwidth and reduced energy consumption. Here, we demonstrate a photonic signal processor operating at 17 Terabits/s and use it to process video image signals in real time. The system processes 400,000 video signals concurrently, performing 34 functions simultaneously that are key to object edge detection, edge enhancement and motion blur. Compared with spatial-light devices used for image processing, our system is not only ultra-high speed but also highly reconfigurable and programmable, able to perform many different functions without any change to the physical hardware. Our approach is based on an integrated Kerr soliton crystal microcomb, and opens up new avenues for ultrafast robotic vision and machine learning.

Keywords: Frequency combs; Solitons

Regional Transformer for Image Super-Resolution

7th International Conference on Machine Vision and Information Technology, CMVIT 2023

Authors: Yang, Sen Yang, Jiahong Xu, Dahong Li, Xi Hunan Normal University China Hunan Normal University Key Laboratory of Sports Intelligence Research China

ISBN: (print) 9781665464857

In image super-resolution models, a large receptive field can provide more valuable features, so the Transformer, with its strong information interaction ability, has achieved excellent results in image super-resolution applications. However, once the receptive field reaches a certain critical size, the restoration performance of the super-resolution algorithm also plateaus, which indicates that unconditionally enlarging the receptive field does not keep improving restoration performance. At the same time, the larger the receptive field, the more data the model needs to process, which seriously increases the computational complexity of the algorithm. To exchange information over a wider range more effectively, this paper designs a new Transformer-based super-resolution network, the Regional Transformer. The key element of the newly designed network structure is the Region Block (RB) with the Boundary Restriction (BR) mechanism. In addition, the paper designs a Boundary Restriction based on coarse-to-fine pipes. A large number of experiments on multiple datasets show that the network structure designed in this paper brings a significant improvement in performance. © 2023 IEEE.

Keywords: Restoration

Development of a fusion technique and an algorithm for merging images recorded in the IR and visible spectrum in dust and fog

Conference on Electro-Optical and Infrared Systems - Technology and Applications XIX

Authors: Semenishchev, Evgeny Zelensky, Aleksandr Alepko, Andrey Zhdanova, Marina Voronin, Viacheslav Ilyukhin, Yury Tula State Univ TulSU Lab Cognit Technol & Simulat Syst 92 Sq Lenina Tula 300012 Tula Russia Moscow State Tech Univ STANKIN Ctr Cognit Technol & Machine Vis 1a Vadkovsky Moscow 127055 Russia

ISBN: (print) 9781510655461

The article proposes a fusion technique and an algorithm for combining images recorded in the IR and visible spectrum, in relation to the problem of processing products by robotic complexes in dust and fog. Primary data processing is based on multi-criteria processing with complex data analysis and cross-change of the filtration coefficient for different types of data. The search for base points is based on reducing the range of clusters (image simplification) and searching for transition boundaries by determining the slope of the function in local areas. To evaluate effectiveness, pairs of test images obtained by sensors with a resolution of 1024x768 (8 bit, color image, visible range) and 640x480 (8 bit, color, IR image) are used. Images of simple shapes are used as the analyzed objects.

Keywords: image fusion machine vision preprocessing IR noise robotic complexes

Task-Attentive Transformer Architecture for Continual Learning of Vision-and-Language Tasks Using Knowledge Distillation

Conference on Empirical Methods in Natural Language Processing (EMNLP)

Authors: Cai, Yuliang Thomason, Jesse Rostami, Mohammad Univ Southern Calif Los Angeles CA 90007 USA

ISBN: (print) 9798891760615

The size and the computational load of fine-tuning large-scale pre-trained neural networks are becoming two major obstacles to adopting machine learning in many applications. Continual learning (CL) can serve as a remedy by enabling knowledge transfer across sequentially arriving tasks. However, existing CL algorithms primarily consider learning unimodal vision-only or language-only tasks. We develop a transformer-based CL architecture for learning multimodal vision-and-language (VaL) tasks based on dynamic model expansion and knowledge distillation. Additional parameters are used to specialize the network for each task. Our approach, Task Attentive Multimodal Continual Learning (TAM-CL), enables sharing information between the tasks while addressing catastrophic forgetting. Our approach is scalable, requiring little memory and time overhead. TAM-CL reaches SOTA performance on challenging multimodal tasks. The code is publicly available on https://***/YuliangCai2022/***.

Keywords: Distillation
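
Knowledge distillation, which the abstract above uses against catastrophic forgetting, is commonly implemented as a KL divergence between temperature-softened outputs of a frozen earlier model (teacher) and the model being trained (student). The NumPy sketch below shows that generic formulation only. It is not taken from TAM-CL, whose exact loss the abstract does not state, and the function names and default temperature are assumptions.

```python
import numpy as np

def softmax(logits, temperature=1.0):
    """Numerically stable softmax over the last axis."""
    z = np.asarray(logits, dtype=float) / temperature
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, batch-averaged.

    The T^2 factor keeps the gradient scale comparable across
    temperatures, as is customary for this loss.
    """
    p_teacher = softmax(teacher_logits, temperature)
    p_student = softmax(student_logits, temperature)
    eps = 1e-12  # guards against log(0)
    kl = np.sum(
        p_teacher * (np.log(p_teacher + eps) - np.log(p_student + eps)),
        axis=-1,
    )
    return float(temperature ** 2 * kl.mean())
```

In a continual-learning setting this term would typically be added to the new task's loss, with the teacher being the model as it stood before the new task arrived.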
