Deep neural networks (DNNs) currently constitute the best-performing artificial vision systems. However, humans are still better at recognizing many characters, especially distorted, ornamental, or calligraphic charac...
详细信息
Deep neural networks (DNNs) currently constitute the best-performing artificial vision systems. However, humans are still better at recognizing many characters, especially distorted, ornamental, or calligraphic characters compared with the highly sophisticated recognitionmodels. Understanding themechanism of character recognition by humans may give some cues for building better recognition models. However, the appropriate methodological approach to using these cues has not been much explored for developing character recognition models. Therefore, this paper tries to understand the process of character recognition by humans and DNNs by generating visual explanations for their respective decisions. We have used eye tracking to assay the spatial distribution of information hotspots for humans via fixation maps. We have proposed a gradient-based method for visualizing the reasoning behind the model's decision through visualization maps and have proved that our method is better than the other class activation mapping methods. Qualitative comparison between visualization maps and fixation maps reveals that both model and humans focus on similar regions in character in the case of correctly classified characters. However, when the focused regions are different for humans and model, the characters are typically misclassified by the latter. Hence, we propose to use the fixation maps as a supervisory input to train the model that ultimately results in improved recognition performance and better generalization. As the proposedmodel gives some insights about the reasoning behind its decision, it can find applications in fields, such as surveillance and medical applications, where explainability helps to determine system fidelity. Impact Statement-Humans and DNNs rely on selective information uptake while classifying a character. This information selection strategy can be understood by visualizing the important, informative character regions that ultimately govern the decision o
Researchers emphasize the importance of hardware accelerators for mathematical morphology. If there are any issues, the hardware architecture may need to be redesigned. Thus, we propose a novel, reconfigurable hardwar...
详细信息
Indoor positioning systems (IPS) are critical for enabling accurate navigation, asset tracking, and emergency response in indoor environments where GPS fails. This study evaluates five machine learning algorithms-K-Ne...
详细信息
ISBN:
(数字)9798331520403
ISBN:
(纸本)9798331520410
Indoor positioning systems (IPS) are critical for enabling accurate navigation, asset tracking, and emergency response in indoor environments where GPS fails. This study evaluates five machine learning algorithms-K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Multi-Layer Perceptron (MLP), Decision Tree, and Random Forest-using two benchmark datasets, UJIIndoorLoc and UTSIndoorLoc. The results reveal that SVM achieves the highest accuracy, with 81.3 % on UJIIndoorLoc and 92 % on UTSIndoorLoc, followed by MLP with $\mathbf{7 8. 8 \%}$ and 90.9 %, respectively. Random Forest provides stable performance, with 77.6 % and 86.08 %, while KNN reaches 75 % and 89.7 %, performing well in structured environments. Decision Tree shows the lowest accuracy, 71.2 % and 76.54 %, highlighting its limitations with complex data. The UTSIndoorLoc dataset consistently yields higher accuracies, demonstrating its structured signal distribution. These findings underscore SVM and MLP as optimal algorithms for Wi-Fi fingerprinting IPS, offering robust and scalable solutions for indoor localization.
In recent years, Artificial Intelligence (AI) has gained increasing popularity in the area of art creation, by demonstrating its great potential. Research in this topic has developed AI systems able to generate creati...
详细信息
The increasing presence of digital evidence in legal, criminal, and civil cases requires adaptations to traditional chain-of-custody processes to ensure the integrity of this evidence. This work proposes the use of bl...
详细信息
作者:
Yau, Yeu-TorngDepartment of Ph.D. Program
Prospective Technology of Electrical Engineering and Computer Science National Chin-Yi University of Technology Taichung No.57 Sec. 2 Zhongshan Rd. Taiping Dist Taichung41170 Taiwan
To provide a hold-up time function in DC-DC supplies for cell site stations or data centers, using a boost converter with a bulk output capacitor as a front-end converter stage is a simple and highly cost-effective so...
详细信息
As COVID-19 has brought a major disruption in educational system, online learning has replaced the traditional classroom learning during the pandemic in many parts of the world. One of the inferior point of online lea...
详细信息
Financial technology (FinTech) has drawn much attention among investors and companies. While conventional stock analysis in FinTech targets at predicting stock prices, less effort is made for profitable stock recommen...
详细信息
Observing and filming a group of moving actors with a team of aerial robots is a challenging problem that combines elements of multi-robot coordination, coverage, and view planning. A single camera may observe multipl...
详细信息
Based on WHO's data, breast cancer is one of the most deadly diseases that has claimed many victims, especially women. This disease begins with the presence of an undetected and eventually turns into malignant (ca...
详细信息
暂无评论