In today's digital age, attendance systems utilizing facial recognition are essential in schools, universities, companies, etc. One feature of the human body that might help identify a person is the face. Using a ...
详细信息
The application of Machine Learning (ML)-based Intrusion Detection System (IDS) has been widely used. The advantage of ML-based IDS is that it can detect intrusions in the network. However, in its application, there a...
详细信息
We study the problem of a local bandwidth recovery for nonstationary stochastic signals when the measured information is given in terms of level crossings. We propose a kernel estimate of the local bandwidth from samp...
详细信息
In the dynamic landscape of online knowledge exchange, platforms such as Quora have become vital arenas for diverse discussions and information dissemination. However, distinguishing sincere from insincere questions p...
In the dynamic landscape of online knowledge exchange, platforms such as Quora have become vital arenas for diverse discussions and information dissemination. However, distinguishing sincere from insincere questions poses a challenge, impacting user experience and content credibility. The research seeks to utilize BERT-based frameworks to classify Quora questions, with a specific focus on discerning between sincere and insincere inquiries. The study underscores the significance of automated insincere content identification in fostering a trustworthy and constructive online environment on Quora, thereby contributing to the improvement of online interaction quality. The research delves into a comprehensive methodology, including data preprocessing, model training, fine-tuning, and performance evaluation. Extensive experimentation demonstrates that the proposed BERT-based approach achieves an impressive F1-score of 96.1%, outperforming existing methods. The contribution of the study lies in its potential to enhance online interactions by automating the identification of insincere content, fostering a conducive environment for meaningful conversations.
Despite the significant progress in few-shot 2D image classification, few-shot 3D point cloud classification remains relatively under-explored, particularly in addressing the challenges posed by missing points in 3D p...
详细信息
ISBN:
(数字)9798350365474
ISBN:
(纸本)9798350365481
Despite the significant progress in few-shot 2D image classification, few-shot 3D point cloud classification remains relatively under-explored, particularly in addressing the challenges posed by missing points in 3D point clouds. Most existing methods for few-shot 3D point cloud classification are point-based, and thus, highly sensitive to missing points. Despite recent attempts, such as ViewNet, which introduce projection-based backbones to increase robustness against missing points, the reliance on max pooling, to extract information from multiple images simultaneously, makes them prone to information loss. To address these limitations, we introduce dept.Voting, a novel projection-based approach, for few-shot 3D point cloud classification. Instead of extracting features from multiple projection images simultaneously, dept.Voting captures features from pairs of projection images (obtained from opposite view angles) separately, enhancing the extraction of more comprehensive information. These features are sent to multiple few-shot heads, which share parameters. To further refine predictions, dept.Voting incorporates a voting mechanism, allowing contribution and incorporating information from different pairs. We conduct extensive experiments on three datasets, namely ModelNet40, ModelNet40-C, and ScanObjectNN, along with cross-validation. Our proposed method consistently outperforms the state-of-the-art baselines on all datasets in terms of average accuracy with even higher margins on the challenging ScanObjectNN dataset.
Real-world data is often imbalanced, such that the number of training instances varies by class. Data augmentation (DA) of under-represented classes is commonly used to improve model generalization in the face of clas...
详细信息
ISBN:
(数字)9798350364941
ISBN:
(纸本)9798350364958
Real-world data is often imbalanced, such that the number of training instances varies by class. Data augmentation (DA) of under-represented classes is commonly used to improve model generalization in the face of class imbalance. Despite its ubiquity, the impact of data augmentation on machine learning (ML) models is not clearly understood. Here, we undertake a holistic examination of the effect of DA on under-represented classes. Unlike other studies, which focus on a single ML model type, we examine three different classifier families: convolutional neural networks, support vector machines, and logistic regression models; five different DA techniques and two different data modalities - image and tabular. Our research indicates that DA, when applied to imbalanced data, produces substantial changes in model weights, support vectors and front-end feature selection. These changes occur with respect to all classes, not just the ones that DA is applied to. Further, our empirical analysis shows that data augmentation's positive influence on generalization does not necessarily occur as a result of reducing weight norms. Rather, weight and support vector specialization play important roles in generalization. The specialization process may be a form of memorization that is spawned by variances introduced by augmented data. We investigate the seeming contradiction between improved generalization versus weight and support vector specialization.
The research illustrates use of ML algos in the field of housing price prediction. The models have been analyzed on real datasets downloaded from Kaggle created by Amitabha Chakraborty. We know that the source on the ...
详细信息
An essential part of computer vision, lip reading, has grown significantly and is now used in autonomous driving, public safety, and hearing-impaired communication. This work provides an in-dept. analysis of both conv...
An essential part of computer vision, lip reading, has grown significantly and is now used in autonomous driving, public safety, and hearing-impaired communication. This work provides an in-dept. analysis of both conventional and deep learning-based lip reading systems, examining their development over time, difficulties encountered, and critical role in human communication. The suggested LipNet model, which leverages the complementary abilities of Long Short-Term Memory networks (LSTMs) for temporal modelling and Convolutional Neural Networks (CNNs) for spatial feature extraction, is a central focus of this research. When LipNet is tested using the GRID Corpus dataset, it outperforms baseline models in terms of performance. Saliency maps and viseme analysis highlight LipNet's ability to capture phonologically relevant areas in lip motions, providing insight into the network's attention processes. Apart from examining the importance of datasets such as OuluVs2and GRID Corpus, this study presents GhostNet, a lightweight model that tackles the requirement for effective network designs. The testing results highlight LipNet's competitive Character Error Rate (CER) and Word Error Rate (WER) in comparison to baseline models, proving that it is the best method for real-world use. The research ends with a summary of potential future developments in lip reading technology, such as modal integration, realtime applications, personalisation, ethical issues, and the goal of creating a communication environment that is more inclusive. This study highlights the effectiveness of LipNet and paves the way for future developments in the ever-evolving area of lip reading by examining a variety of methodologies.
Water bodies are essential component of Earth’s surface and accurate water region mapping is crucial for disaster management and environmental monitoring. Existing methods to map water bodies relies on traditional mo...
详细信息
ISBN:
(数字)9798350369175
ISBN:
(纸本)9798350369182
Water bodies are essential component of Earth’s surface and accurate water region mapping is crucial for disaster management and environmental monitoring. Existing methods to map water bodies relies on traditional modelling methods that need extensive manual calibration struggle with scalability and lack advanced pattern recognition. This paper evaluates the effectiveness of deep learning-based semantic segmentation models, DeepLabV3+ and U-Net, using aerial images resized to $\mathbf{2 5 6 x} 256$ pixels in mapping water regions in aerial images. DeepLabV3+, with a ResNet50 backbone, and U-Net are trained on a comprehensive flood-affected dataset with preprocessing techniques such as resizing and augmentation. Model performance is assessed using metrics like IoU, F1 score, accuracy, precision, and recall. The results indicate that DeepLabV3+ outperforms U-Net, showing promise for automating water region mapping and improving flood surveys and urban planning.
Peephole optimization of quantum circuits provides a method of leveraging standard circuit synthesis approaches into scalable quantum circuit optimization. One application of this technique partitions an entire circui...
详细信息
暂无评论