This research paper presents a comparative study on various machine learning algorithms for sign language detection. The objective of this study is to find the sign language identification method that is most accurate...
详细信息
imageprocessing has become a central topic in the era of big data, particularly within computer vision, due to the growing volume and diverse resolutions of images. Low-resolution images introduce uncertainty, unders...
详细信息
Convolutional neuralnetworks (CNNs) for imageprocessing tend to focus on localized texture patterns, commonly referred to as texture bias. While most of the previous works in the literature focus on the task of imag...
详细信息
The wavelet transform has emerged as a powerful tool in deciphering structural information within images. And now, the latest research suggests that combining the prowess of wavelet transform with neuralnetworks can ...
详细信息
ISBN:
(纸本)1577358872
The wavelet transform has emerged as a powerful tool in deciphering structural information within images. And now, the latest research suggests that combining the prowess of wavelet transform with neuralnetworks can lead to unparalleled image deraining results. By harnessing the strengths of both the spatial domain and frequency space, this innovative approach is poised to revolutionize the field of imageprocessing. The fascinating challenge of developing a comprehensive framework that takes into account the intrinsic frequency property and the correlation between rain residue and background is yet to be fully explored. In this work, we propose to investigate the potential relationships among rainfree and residue components at the frequency domain, forming a frequency mutual revision network (FMRNet) for image deraining. Specifically, we explore the mutual representation of rain residue and background components at frequency domain, so as to better separate the rain layer from clean background while preserving structural textures of the degraded images. Meanwhile, the rain distribution prediction from the low-frequency coefficient, which can be seen as the degradation prior is used to refine the separation of rain residue and background components. Inversely, the updated rain residue is used to benefit the low-frequency rain distribution prediction, forming the multi-layer mutual learning. Extensive experiments demonstrate that our proposed FMRNet delivers significant performance gains for seven datasets on image deraining task, surpassing the state-of-the-art method ELFormer by 1.14 dB in PSNR on the Rain100L dataset, while with similar computation cost. Code and retrained models are available at https://***/kuijiang94/FMRNet.
Differential counting of white blood cells (WBCs) in bone marrow using artificial intelligence (AI)-based models, such as convolutional neural network (CNN) and its various variants, can help physicians to efficiently...
详细信息
Differential counting of white blood cells (WBCs) in bone marrow using artificial intelligence (AI)-based models, such as convolutional neural network (CNN) and its various variants, can help physicians to efficiently diagnose many critical diseases such as leukaemia, AIDS and cancers. In this work, we implement a deep transfer learning to several CNN models to examine their effectiveness on automatically classifying WBCs into lymphocytes and non-lymphocytes groups. Our results show that transfer learning can enhance the training of the model and improve the model performance. We also discover that using image masking to remove irrelevant image pixels can further increase the accuracy of the model predictions. Moreover, we assess the impact of three data augmentation techniques to address the imbalance in the data set, which commonly occurs in many biological applications. Our results show that all the three examined data augmentation methods improve the classification results on both training and testing data sets. Altogether, we demonstrate that deep neuralnetworks, when combined with transfer learning and imaging processing techniques, can serve as a powerful tool to conduct automatic differential counting of WBCs, and thus facilitate the diagnosis of the WBC-related disorders, monitor the disease progression and improve the effectiveness of therapeutics.
artificialneural Network (ANN) has been used extensively and constantly developed. The combination of wavelet transform theory and the neural network has become an important branch to explore the optimization of neur...
详细信息
ISBN:
(纸本)9783031310652;9783031310669
artificialneural Network (ANN) has been used extensively and constantly developed. The combination of wavelet transform theory and the neural network has become an important branch to explore the optimization of neural network structure, and Wavelet neural Network (WNN), a special network structure, was born. This paper reviews WNN's development and summarizes the system structure and algorithm implementation and presents derivative models and cutting-edge applications with obvious characteristics. The sorting and analysis of the above contents show that the combination of wavelet theory and neural network algorithm can make the network model have the advantages of fast convergence speed and high model accuracy, and has a rapid development trend in many fields such as audio signal and imageprocessing. The work of this paper is intended to provide a reference for potential applications based on WNN and new network model design ideas.
The prevalence and pervasiveness of artificial intelligence (AI) with medical images in veterinary and human medicine is rapidly increasing. This article provides essential definitions of AI with medical images with a...
详细信息
The prevalence and pervasiveness of artificial intelligence (AI) with medical images in veterinary and human medicine is rapidly increasing. This article provides essential definitions of AI with medical images with a focus on veterinary radiology. Machine learning methods common in medical image analysis are compared, and a detailed description of convolutional neuralnetworks commonly used in deep learning classification and regression models is provided. A brief introduction to natural language processing (NLP) and its utility in machine learning is also provided. NLP can economize the creation of "truth-data" needed when training AI systems for both diagnostic radiology and radiation oncology applications. The goal of this publication is to provide veterinarians, veterinary radiologists, and radiation oncologists the necessary background needed to understand and comprehend AI-focused research projects and publications.
Agriculture is essential for human civilization, contributing to the economy and ensuring food supply. However, plant diseases can hinder growth and reduce crop yield. It is crucial to identify and categorize these di...
详细信息
Recommender systems are widely used artificial intelligence technologies that provide personalized recommendations to users from massive amounts of data. In the era of the Internet, recommender systems have become ess...
详细信息
ISBN:
(数字)9798350350920
ISBN:
(纸本)9798350350920
Recommender systems are widely used artificial intelligence technologies that provide personalized recommendations to users from massive amounts of data. In the era of the Internet, recommender systems have become essential components in e-commerce, social media, news media, audio-video entertainment, and other fields. However, traditional recommender systems often rely on user's historical behavior or related attribute information for recommendations, which may lead to issues such as "over-recommendation" or "inefficient recommendation" due to their limited ability to uncover the latent connections between users and *** address these issues, many deep learning-based recommendation models have emerged in recent years, such as DeepFM, NCF, etc,which have achieved significant improvements in recommendation performance. RippleNet and KGCN are two popular recommendation models among them. The RippleNet model employs graph neuralnetworks to explore the interaction relationships between users and items as the basis for recommendations. On the other hand, the KGCN model utilizes knowledge graphs to better understand the semantic relationships between ***, both models have their respective limitations. For instance, RippleNet only focuses on user representation while neglecting item representation, whereas KGCN overlooks the shortcomings in user representation. To further enhance recommendation performance, this paper proposes a new RNKN recommendation model that combines the strengths of RippleNet and KGCN, paying attention to both user and item representations to better uncover the latent connections between them. And apply the model to three datasets: MovieLens-1M (movies), Book Crossing (books) and *** (music). Compared with RippleNet and KGCN, the AUC index of RNKN on the MovieLens-1M data set has increased by 0.4% and 1.4% respectively;the ACC index has increased by 0.45% and 1.2%;compared with the AUC index of RNKN on the Book-Crossing data set Ri
The multimodal fusion of infrared-visible images in a high-quality way allows for the preservation of the respective advantages offered by each modality. However, existing methods encounter the challenge of high redun...
详细信息
ISBN:
(纸本)9789819756773;9789819756780
The multimodal fusion of infrared-visible images in a high-quality way allows for the preservation of the respective advantages offered by each modality. However, existing methods encounter the challenge of high redundancy in local information within early neuralnetworks. Specifically, excessive feature extraction of infrared information can cause the retention of excessive noise in the fused image, thereby obscuring its clarity. To solve this problem, we introduce the concept of super-token attention into an improved auto-encoder fusion network for better global modeling by reducing the number of tokens in the self-attention-mechanism. Specifically, we first use STA blocks as shared encoders to extract shallow features from different modalities. Next, we employ the CNN-Attention extractor to extract deeper features from various modalities using a two-branch approach. Extensive experiments have confirmed that the proposed network achieves state-of-the-art fusion performance across multiple metrics. Furthermore, our approach exhibits strong transferability to the field of medical imageprocessing.
暂无评论