In today's world, cyber-attacks are on the rise, and PDF files are commonly used as a means of attack. One common type of attack through PDF files is the covert embedding of dangerous malware and tricking users in...
详细信息
As artificial intelligence (AI) transitions from research to deployment, creating the appropriate datasets and data pipelines to develop and evaluate AI models is increasingly the biggest challenge. Automated AI model...
详细信息
As artificial intelligence (AI) transitions from research to deployment, creating the appropriate datasets and data pipelines to develop and evaluate AI models is increasingly the biggest challenge. Automated AI model builders that are publicly available can now achieve top performance in many applications. In contrast, the design and sculpting of the data used to develop AI often rely on bespoke manual work, and they critically affect the trustworthiness of the model. This Perspective discusses key considerations for each stage of the data-for-AI pipeline—starting from data design to data sculpting (for example, cleaning, valuation and annotation) and data evaluation—to make AI more reliable. We highlight technical advances that help to make the data-for-AI pipeline more scalable and rigorous. Furthermore, we discuss how recent data regulations and policies can impact AI.
An accurate predictive model of temperature and humidity plays a vital role in many industrial processes that utilize a closed space such as in agriculture and building management. With the exceptional performance of ...
详细信息
Exams are an important component of any educational program, including online education. In any test, there is a possibility of cheating, so its detection and prevention is important. This study aims to conduct an in-...
详细信息
Feature attribution, the ability to localize regions of the input data that are relevant for classification, is an important capability for ML models in scientific and biomedical domains. Current methods for feature a...
详细信息
Feature attribution, the ability to localize regions of the input data that are relevant for classification, is an important capability for ML models in scientific and biomedical domains. Current methods for feature attribution, which rely on "explaining" the predictions of end-to-end classifiers, suffer from imprecise feature localization and are inadequate for use with small sample sizes and high-dimensional datasets due to computational challenges. We introduce prospector heads, an efficient and interpretable alternative to explanation-based attribution methods that can be applied to any encoder and any data modality. Prospector heads generalize across modalities through experiments on sequences (text), images (pathology), and graphs (protein structures), outperforming baseline attribution methods by up to 26.3 points in mean localization AUPRC. We also demonstrate how prospector heads enable improved interpretation and discovery of class-specific patterns in input data. Through their high performance, flexibility, and generalizability, prospectors provide a framework for improving trust and transparency for ML models in complex domains. Copyright 2024 by the author(s)
Malaria is a lethal disease responsible for thousands of deaths worldwide every *** methods of malaria diagnosis are timeconsuming that require a great deal of human expertise and *** automated diagnosis of diseases i...
详细信息
Malaria is a lethal disease responsible for thousands of deaths worldwide every *** methods of malaria diagnosis are timeconsuming that require a great deal of human expertise and *** automated diagnosis of diseases is progressively becoming *** deep learning models show high performance in the medical field,it demands a large volume of data for training which is hard to acquire for medical ***,labeling of medical images can be done with the help of medical experts *** recent studies have utilized deep learning models to develop efficient malaria diagnostic system,which showed promising ***,the most common problem with these models is that they need a large amount of data for *** paper presents a computer-aided malaria diagnosis system that combines a semi-supervised generative adversarial network and transfer *** proposed model is trained in a semi-supervised manner and requires less training data than conventional deep learning *** of the proposed model is evaluated on a publicly available dataset of blood smear images(with malariainfected and normal class)and achieved a classification accuracy of 96.6%.
Internet of Things (IoT) devices are small, low-power devices used for detecting and processing data remotely through the internet. These devices have increasingly been integrated into our daily lives, both in the dig...
详细信息
Deep learning and big data analysis are among the most important research topics in the fields of biomedical applications and digital healthcare. With the fast development of artificial intelligence (AI) and Internets...
详细信息
Can we identify the weights of a neural network by probing its input-output mapping? At first glance, this problem seems to have many solutions because of permutation, overparameterisation and activation function symm...
详细信息
Can we identify the weights of a neural network by probing its input-output mapping? At first glance, this problem seems to have many solutions because of permutation, overparameterisation and activation function symmetries. Yet, we show that the incoming weight vector of each neuron is identifiable up to sign or scaling, depending on the activation function. Our novel method 'Expand-and-Cluster' can identify layer sizes and weights of a target network for all commonly used activation functions. Expand- and-Cluster consists of two phases: (i) to relax the non-convex optimisation problem, we train multiple overparameterised student networks to best imitate the target function;(ii) to reverse engineer the target network's weights, we employ an ad-hoc clustering procedure that reveals the learnt weight vectors shared between students - these correspond to the target weight vectors. We demonstrate successful weights and size recovery of trained shallow and deep networks with less than 10% overhead in the layer size and describe an 'ease-of-identifiability' axis by analysing 150 synthetic problems of variable difficulty. Copyright 2024 by the author(s)
In this paper, we investigate an accurate synchronization between a physical network and its digital network twin (DNT) that is a virtual representation of the physical network. The considered network includes a physi...
详细信息
ISBN:
(纸本)9798350351255
In this paper, we investigate an accurate synchronization between a physical network and its digital network twin (DNT) that is a virtual representation of the physical network. The considered network includes a physical network where a base station (BS) serves a set of users, and a DNT that evolves with the status of both DNT and the physical network. The BS must use its limited spectrum resources to serve the users, as well as transmit the physical network information to the cloud server for DNT synchronization. Since the DNT can predict the physical network status, the BS may not need to transmit physical network information to the server at each time slot thus saving spectrum resources to serve users. However, if the BS does not transmit physical information to the DNT over a long period of time, the DNT may not be able to represent the physical network accurately. To this end, the BS must determine whether to send physical network information to the server to update DNT and the spectrum resources used for physical network information transmission and serving users. We formulate this resources allocation problem as an optimization problem aiming to maximize the sum of data rates of all users, while minimizing the gap between the states of the physical network and the DNT. The formulated problem is challenging to solve by conventional optimization methods, since the BS may not be able to know the future status of the DNT. To solve this problem, we design a gate recurrent unit (GRU) and soft action-critic (SAC) based algorithm. The GRU enables the DNT to predict its future states by using historical state data, and updating the DNT when the BS does not transmit physical network information. The SAC based algorithm enables the BS to learn the relationship between the physical network information transmission and the future status estimation accuracy of the DNT thus determining whether to transmit physical network information to the cloud server, ensuring an accurac
暂无评论