The overwhelming growth and popularity of web motivated the cyber attackers to develop fraudulent web sites and execute numerous attacking strategies to trick the user revealing sensitive information, installing malwa...
详细信息
The overwhelming growth and popularity of web motivated the cyber attackers to develop fraudulent web sites and execute numerous attacking strategies to trick the user revealing sensitive information, installing malware automatically through drive-by-download attack, stealing the identity and money etc. Most of the attacking strategies are spreading through compromised URL (Uniform Resource Locator) and it greatly influences the internet performance. Blacklisting approach is adopted to mitigate these issues, but the demerits of this approach is unable to identify the zero-day attacking patterns and is computationally expensive as it has to search the URLs from a large pool of database. So, to plan an effective detection framework for identifying suspicious web sites is a tedious task. To overcome these issues, we have proposed a suspicious URL detection technique for selecting the most influential significant features for classifying the URL as safe or malignant with the help of multivariate filter-based feature selection technique (MFBFST) of machinelearning. The redundant features are eliminated by correlation feature selection technique and the significance of relevant attributes are tested with statistical t-test to obtain the most significant features that has more impact on the prediction of malicious websites. The relevant features obtained from MFBFST are used for evaluating the machinelearning algorithms like Bagging, Adaboost, GBoost and kNN (k Nearest Neighbour) to create an efficient model for predicting the malicious web sites efficiently. To assess the effectiveness of MFBFST in the enhancement of prediction of classification results we have evaluated the classifier with and without considering the FST also tested the scalability issues by considering two publicly available datasets. Our implementation results demonstrated that with utilizing the CFS, the machinelearning algorithms accomplished the highest classification accuracy of 97% in dataset I
We study the problem of (learning) algorithm comparison, where the goal is to find differences between models trained with two different learning algorithms. We begin by formalizing this goal as one of finding disting...
详细信息
We study the problem of (learning) algorithm comparison, where the goal is to find differences between models trained with two different learning algorithms. We begin by formalizing this goal as one of finding distinguishing feature transformations, i.e., input transformations that change the predictions of models trained with one learning algorithm but not the other. We then present MODELDIFF, a framework that leverages data-models (Ilyas et al., 2022) to compare learning algorithms based on how they use training data. We demonstrate MODELDIFF through three case studies, comparing models trained with/without data augmentation, with/without pre-training, and with different SGD hyperparameters. Our code is available at https://***/MadryLab/modeldiff.
Good health and well being is one of the most essential targets of the Sustainable Development Goals (SDGs). This paper primarily focuses on Preventive and Diagnostic care of Women Health because even today, women are...
详细信息
The oil and gas industry faces several challenges associated with managing massive datasets and extracting relevant information. The machinelearning tools have proven to be significantly valuable for analysing comple...
详细信息
The oil and gas industry faces several challenges associated with managing massive datasets and extracting relevant information. The machinelearning tools have proven to be significantly valuable for analysing complex, heterogeneous data and produce quicker and more reliable outcomes even on large-scales. machinelearning and data mining tools have been applied in several aspects of the upstream oil and gas industry, such as exploration, drilling, reservoir engineering, and production forecasting. This review has been explicitly focused on machinelearning and data mining implementations in reservoir engineering, including reservoir characterisation and performance prediction, well test analysis, well logging and formation evaluation, and enhanced oil recovery operations. The commonly used statistical measures for classification and regression models have been discussed as well. The observations from the review have led to suitable suggestions that shall enrich the research in this area. [Received: July 29, 2021;Accepted: October 30, 2021]
Breast cancer is among the most prevalent cancers in women and one of the highest reason for women’s fatality rates. Most of the works in breast cancer detection are done either using deep learning models or heavily ...
详细信息
Transformers have demonstrated effectiveness in in-context solving data-fitting problems from various (latent) models, as reported by Garg et al. (2022). However, the absence of an inherent iterative structure in the ...
详细信息
Software products now have more users than ever. This means more people to please, more use-cases to consider, and more requirements to fulfill. These users can then write feedback on software in any number of public ...
详细信息
ISBN:
(纸本)9781665495981
Software products now have more users than ever. This means more people to please, more use-cases to consider, and more requirements to fulfill. These users can then write feedback on software in any number of public or private online repositories. Many tools have been proposed for classifying, embedding, clustering, and characterizing this feedback in aid of generating requirements from it. I am investigating which techniques and machinelearning models are most appropriate for enabling these analyses across multiple feedback platforms and data domains.
The goal was to facilitate quick development and revitalisation of rural areas by utilising IoT knowledge to accomplish smart agriculture against the backdrop of big data. Depleted soil fertility, increased pest attac...
详细信息
We propose a few-shot learning approach that aligns visual and semantic features in an embedding feature space to alleviate the shortage of training (or reference) data in remote sensing scene classification (RSSC). S...
详细信息
ISBN:
(纸本)9798400709234
We propose a few-shot learning approach that aligns visual and semantic features in an embedding feature space to alleviate the shortage of training (or reference) data in remote sensing scene classification (RSSC). Specifically, the self-supervised learning is first employed to improve the expressive ability of the learned feature, which could effectively enhance the features' generalizability. Meanwhile, we align the image feature and its corresponding class-semantic feature, which is obtained by feeding the class name to a language model such as BERT, to increase the image feature's discriminability. By systematically integrating the self-supervised learning and visual-semantic alignment with the backbone network, our approach could achieve image features with good generalizability and discriminability. Experiments on UCMerced LandUse, NWPU-RESISC45, and AID benchmarks validate the feasibility of our approach and verify its improved few-shot classification performance in RSSC.
Digital transformation of enterprises, as one of the core issues in modern enterprise management, has become an important scenario for computer applications. This article empirically analyzes how digital transformatio...
详细信息
暂无评论