The increasing number of electronic transactions on the Internet has given rise to the design of recommendation systems. The main objective of these systems is to give recommendations to the users about the items (i.e...
详细信息
Recently,to build a smart factory,research has been conducted to perform fault diagnosis and defect detection based on vibration and noise signals generated when a mechanical system is driven using deep-learning techn...
详细信息
Recently,to build a smart factory,research has been conducted to perform fault diagnosis and defect detection based on vibration and noise signals generated when a mechanical system is driven using deep-learning technology,a field of artificial *** of the related studies apply various audio-feature extraction techniques to one-dimensional raw data to extract sound-specific features and then classify the sound by using the derived spectral image as a training ***,compared to numerical raw data,learning based on image data has the disadvantage that creating a training dataset is very ***,we devised a two-step data preprocessing method that efficiently detects machine anomalies in numerical raw *** the first preprocessing process,sound signal information is analyzed to extract features,and in the second preprocessing process,data filtering is performed by applying the proposed *** efficient dataset was built formodel learning through a total of two steps of data *** addition,both showed excellent performance in the training accuracy of the model that entered each dataset,but it can be seen that the time required to build the dataset was 203 s compared to 39 s,which is about 5.2 times than when building the image dataset.
Cancer remains the leading cause of death worldwide, significantly impacting individuals and healthcare systems alike. In recent decades, skin cancer has surged in prevalence compared to other major cancer types. Vari...
详细信息
Large models have recently played a dominant role in natural language processing and multimodal vision-language learning. However, their effectiveness in text-related visual tasks remains relatively unexplored. In thi...
详细信息
Large models have recently played a dominant role in natural language processing and multimodal vision-language learning. However, their effectiveness in text-related visual tasks remains relatively unexplored. In this paper, we conducted a comprehensive evaluation of large multimodal models, such as GPT4V and Gemini, in various text-related visual tasks including text recognition, scene text-centric visual question answering(VQA), document-oriented VQA, key information extraction(KIE), and handwritten mathematical expression recognition(HMER). To facilitate the assessment of optical character recognition(OCR) capabilities in large multimodal models, we propose OCRBench, a comprehensive evaluation benchmark. OCRBench contains 29 datasets, making it the most comprehensive OCR evaluation benchmark available. Furthermore, our study reveals both the strengths and weaknesses of these models, particularly in handling multilingual text, handwritten text, non-semantic text, and mathematical expression *** importantly, the baseline results presented in this study could provide a foundational framework for the conception and assessment of innovative strategies targeted at enhancing zero-shot multimodal *** evaluation pipeline and benchmark are available at https://***/Yuliang-Liu/Multimodal OCR.
Fractional order algorithms have shown promising results in various signal processing applications due to their ability to improve performance without significantly increasing *** goal of this work is to inves-tigate ...
详细信息
Fractional order algorithms have shown promising results in various signal processing applications due to their ability to improve performance without significantly increasing *** goal of this work is to inves-tigate the use of fractional order algorithm in the field of adaptive beam-forming,with a focus on improving performance while keeping complexity *** effectiveness of the algorithm will be studied and evaluated in this *** this paper,a fractional order least mean square(FLMS)algorithm is proposed for adaptive beamforming in wireless applications for effective utilization of *** algorithm aims to improve upon existing beam-forming algorithms,which are inefficient in performance,by offering faster convergence,better accuracy,and comparable computational *** FLMS algorithm uses fractional order gradient in addition to the standard ordered gradient in weight *** derivation of the algorithm is provided and supported by mathematical convergence *** is evaluated through simulations using mean square error(MSE)minimization as a metric and compared with the standard LMS algorithm for various *** results,obtained through Matlab simulations,show that the FLMS algorithm outperforms the standard LMS in terms of convergence speed,beampattern accuracy and scatter *** outperforms LMS in terms of convergence speed by 34%.From this,it can be concluded that FLMS is a better candidate for adaptive beamforming and other signal processing applications.
Many datasets in real life are complex and dynamic, that is, their key densities are varied over the whole key space and their key distributions change over time. It is challenging for an index structure to efficientl...
详细信息
This paper addresses the underexplored landscape of chaotic functions in steganography, existing literature when examined under PRISMA-ScR framework it was realized that most of the studies predominantly focuses on ut...
详细信息
Photovoltaic arrays receive varying levels of solar radiation due to factors such as shadows created by clouds, surrounding buildings, and other obstructions. Therefore, an effective Maximum Power Point Tracking (MPPT...
详细信息
Malware attacks on Windows machines pose significant cybersecurity threats,necessitating effective detection and prevention *** machine learning classifiers have emerged as promising tools for malware ***,there remain...
详细信息
Malware attacks on Windows machines pose significant cybersecurity threats,necessitating effective detection and prevention *** machine learning classifiers have emerged as promising tools for malware ***,there remains a need for comprehensive studies that compare the performance of different classifiers specifically for Windows malware *** this gap can provide valuable insights for enhancing cybersecurity *** numerous studies have explored malware detection using machine learning techniques,there is a lack of systematic comparison of supervised classifiers for Windows malware *** the relative effectiveness of these classifiers can inform the selection of optimal detection methods and improve overall security *** study aims to bridge the research gap by conducting a comparative analysis of supervised machine learning classifiers for detecting malware on Windows *** objectives include Investigating the performance of various classifiers,such as Gaussian Naïve Bayes,K Nearest Neighbors(KNN),Stochastic Gradient Descent Classifier(SGDC),and Decision Tree,in detecting Windows *** the accuracy,efficiency,and suitability of each classifier for real-world malware detection *** the strengths and limitations of different classifiers to provide insights for cybersecurity practitioners and *** recommendations for selecting the most effective classifier for Windows malware detection based on empirical *** study employs a structured methodology consisting of several phases:exploratory data analysis,data preprocessing,model training,and *** data analysis involves understanding the dataset’s characteristics and identifying preprocessing *** preprocessing includes cleaning,feature encoding,dimensionality reduction,and optimization to prepare the data for *** training utilizes various
Tear film,the outermost layer of the eye,is a complex and dynamic structure responsible for tear *** tear film lipid layer is a vital component of the tear film that provides a smooth optical surface for the cornea an...
详细信息
Tear film,the outermost layer of the eye,is a complex and dynamic structure responsible for tear *** tear film lipid layer is a vital component of the tear film that provides a smooth optical surface for the cornea and wetting the ocular *** eye syndrome(DES)is a symptomatic disease caused by reduced tear production,poor tear quality,or excessive *** diagnosis is a difficult task due to its multifactorial *** of several clinical tests available,the evaluation of the interference patterns of the tear film lipid layer forms a potential tool for DES *** instrument known as Tearscope Plus allows the rapid assessment of the lipid layer.A grading scale composed of five categories is used to classify lipid layer *** reported work proposes the design of an automatic system employing light weight convolutional neural networks(CNN)and nature inspired optimization techniques to assess the tear film lipid layer patterns by interpreting the images acquired with the Tearscope *** designed framework achieves promising results compared with the existing state-of-the-art techniques.
暂无评论