Integrated on-board EV battery charger which exploits the EV drivetrain elements into the charging process, is considered as a solution to reduce the cost of EVs. Whether in charger mode or propulsion mode, the therma...
详细信息
Handwritten Chinese character recognition has achieved high accuracy using deep neural networks (DNNs), but the structural recognition (which offers structural interpretation, e.g., stroke and radical composition) is ...
详细信息
ISBN:
(纸本)9789819988495;9789819988501
Handwritten Chinese character recognition has achieved high accuracy using deep neural networks (DNNs), but the structural recognition (which offers structural interpretation, e.g., stroke and radical composition) is still a challenge. Existing DNNs treat character image as a whole and perform classification end-to-end without perception of the structure. They need a large amount of training samples to guarantee high generalization accuracy. In this paper, we propose a method for structural recognition of handwritten Chinese characters based on a modified part capsule auto-encoder (PCAE), which explicitly considers the hierarchical part-whole relationship of characters, and leverages extracted structural information for character recognition. Our PCAE is improved based on stacked capsule auto-encoder (SCAE) so as to better extract strokes and perform classification. By themodified PCAE, the character image is firstly decomposed into primitives (stroke segments), with their shape and pose information decoupled. The transformed primitives are aggregated into higherlevel parts (strokes) guided by prior knowledge extracted from writing rules. This process enhances interpretability and improves the discrimination ability of features. Experimental results on a large dataset demonstrate the effectiveness of our method in both Chinese character recognition and stroke extraction tasks.
Cloud computing"is a computer model that provides end users with quantifiable, scalable, and on-demand services. These days, almost every organization uses computer technology extensively for infrastructure, cost...
详细信息
In educational institutions, attendance used to be manually recorded on attendance sheets. This method doesn39;t work since substitute pupils can easily use them. This study introduces a face recognition-based autom...
详细信息
As cellular-vehicle-to-everything (C-V2X) networks gain importance within emerging intelligent transportation systems, providing effective, secure network management is crucial. This survey paper focuses on the opport...
详细信息
Medical Visual Question Answering is an emerging field which uses artificial intelligence to effectively combine the multimodal data and generate output. Users interact with the visual data through text to generate an...
详细信息
In this paper, a sequence-to-sequence model is proposed to deal with continuous emphasis levels, and machine translation and emphasis translation are combined into a spoken English situational translation system. In t...
详细信息
Anomaly detection using audio signals from industrial machines in the manufacturing industry has gained broad interest over the last few years. For example, predictive maintenance solutions utilize raw analog signals ...
详细信息
ISBN:
(纸本)9783031686160;9783031686177
Anomaly detection using audio signals from industrial machines in the manufacturing industry has gained broad interest over the last few years. For example, predictive maintenance solutions utilize raw analog signals to identify trends and patterns. In a few scenarios, an engineer working in a factory setting can tell when a machine is behaving abnormally just by hearing unexpected sounds (e.g., the loudness of sound) that are well within the human perceivable frequency range (20 Hz - lowest pitch to 20 kHz - highest pitch) which are typically concentrated in a narrow range of frequencies and amplitudes. The human perception of the amplitude of a sound is its loudness. However, the audio signal in its raw form is not always the best representation of the important features (e.g., frequencies, amplitude, peaks). Additionally, the machine learning applications which rely on using traditional digital signal processing techniques (e.g., digital signal processors, chips) have a lot of dependency on subject matter experts to tune the system for a better performance. Thus, we investigate how the digital transformation of waveform signals from microphone sensors (e.g., Audio recordings of industrial pumps, valves, slide rails) into Spectrograms (A spectrogram is a voiceprint of a signal which expresses an audio signal as an image using different colors to indicate the amplitude or strength of each frequency.) can help to monitor machine health (e.g., anomaly classification). In the pre-processing phase, raw audio signals (.WAV format) from each machine are converted to Mel Spectrogram images using short-term Fourier transformation. Then, comparative study of image classification techniques using deep convolutional neural networks (CNN) with and without data augmentation, is conducted to classify images as normal or abnormal. The approach is evaluated using Malfunctioning industrial machine investigation and inspection dataset [1-3] (MIMII dataset). Results show that the neu
The proceedings contain 27 papers. The topics discussed include: CLARITY AI: a comprehensive checklist integrating established frameworks for enhanced research quality in medical AI studies;assessing the validity of a...
The proceedings contain 27 papers. The topics discussed include: CLARITY AI: a comprehensive checklist integrating established frameworks for enhanced research quality in medical AI studies;assessing the validity of a functional status knowledge graph in a large-scale living lab;to heart via liver: a study on prognostic stratification of heart disease in MASLD patients using machine learning models;strategic optimization of blood allocation in blood banks for enhanced resource utilization;features selection through autoencoder filtering and Deepshap: an iterative algorithm;an adaptive version of the metropolis adjusted Langevin algorithm for survival prediction in a high-dimensional framework;Bayesian networks in medicine: presenting query response uncertainty for decision support;and reinforcement learning and fuzzy logic modelling for personalized dynamic treatment.
Application of machine learning in the analysis of medical data can be said to be one of the current transformations happening within the healthcare fraternity. This paper highlights several ways in which various mach...
详细信息
暂无评论