Federated learning is a new type of distributed learning framework that allows multiple participants to share training results without revealing their data *** data privacy becomes more important,it becomes difficult ...
详细信息
Federated learning is a new type of distributed learning framework that allows multiple participants to share training results without revealing their data *** data privacy becomes more important,it becomes difficult to collect data from multiple data owners to make machinelearning predictions due to the lack of data *** is forced to be stored independently between companies,creating“data silos”.With the goal of safeguarding data privacy and security,the federated learning framework greatly expands the amount of training data,effectively improving the shortcomings of traditional machinelearning and deep learning,and bringing AI algorithms closer to our *** the context of the current internationaldata security issues,federated learning is developing rapidly and has gradually moved from the theoretical to the applied *** paper first introduces the federated learning framework,analyzes its advantages,reviews the results of federated learning applications in industries such as communication and healthcare,then analyzes the pitfalls of federated learning and discusses the security issues that should be considered in applications,and finally looks into the future of federated learning and the application layer.
The proceedings contain 46 papers. The special focus in this conference is on machinelearning, Optimization, and datascience. The topics include: Adaptive dimensionality reduction in multiobjective optimization with...
ISBN:
(纸本)9783030137083
The proceedings contain 46 papers. The special focus in this conference is on machinelearning, Optimization, and datascience. The topics include: Adaptive dimensionality reduction in multiobjective optimization with multiextremal criteria;REFINE: Representation learning from diffusion events;augmented design-space exploration by nonlinear dimensionality reduction methods;classification and survival prediction in diffuse large B-cell lymphoma by gene expression profiling;learning consistent tree-augmented dynamic bayesian networks;designing ships using constrained multi-objective efficient global optimization;a new approach to measuring distances in dense graphs;ant colony optimization for markov blanket-based feature selection. Application for precision medicine;Average performance analysis of the stochastic gradient method for online PCA;simple learning with a teacher via biased regularized least squares;Improving traditional dual ascent algorithm for the uncapacitated multiple allocation hub location problem: A RAMP approach;supervised learning approach for surface-mount device production;crawling in Rogue’s dungeons with (partitioned) A3C;decision of neural networks hyperparameters with a population-based algorithm;strong duality of the kantorovich-rubinstein mass transshipment problem in metric spaces;evolutionary construction of convolutional neural networks;improving clinical subjects clustering by learning and optimizing feature weights;a framework to automatically extract funding information from text;Speeding up budgeted stochastic gradient descent SVM training with precomputed golden section search;an unsupervised learning classifier with competitive error performance;feature based multivariate data imputation;A GRASP/VND heuristic for the max cut-clique problem;a modelica-based simulation method for black-box optimal control problems with level-set dynamic programming;a clonal selection algorithm for multiobjective energy reduction multi-depot vehicle r
machinelearning applications are rapidly adopted by industry leaders in any field. The growth of investment in AI-driven solutions,including the emerging field of General AI (GenAI), has created new challenges in man...
详细信息
ISBN:
(纸本)9798400704901
machinelearning applications are rapidly adopted by industry leaders in any field. The growth of investment in AI-driven solutions,including the emerging field of General AI (GenAI), has created new challenges in managing datascience and ML resources, people and projects as a whole. The discipline of managing appliedmachinelearning teams, requires a healthy mix between agile product development tool-set and a long term research oriented mindset. The abilities of investing in deep research while at the same time connecting the outcomes to significant business results create a large knowledge based on management methods and best practices in the field. The Third KDD Workshop on appliedmachinelearning Management brings together applied research managers from various fields to share methodologies and case-studies on management of ML teams, products, and projects, achieving business impact with advanced AI-methods.
This research paper aims to investigate the impact of using the Synthetic Minority Over-Sampling Technique (SMOTE) on the performance of several machinelearning algorithms on imbalanced dataset. Imbalanced datasets a...
详细信息
This research paper aims to investigate the impact of using the Synthetic Minority Over-Sampling Technique (SMOTE) on the performance of several machinelearning algorithms on imbalanced dataset. Imbalanced datasets are a common problem in many real-world applications, where one class is much more prevalent than the other class. This imbalance can lead to biased models, where the majority class dominates the model's predictions, and the minority class is often misclassified. To address this problem, we applied the SMOTE algorithm to generate synthetic data for the minority class. We evaluated the performance of several popular machinelearning algorithms including logistic regression, decision trees, ensemble learning, support vector machines, Neural networks and Auto ML approach on both the original imbalanced dataset and the SMOTE-augmented dataset. The experimental results demonstrate that using SMOTE significantly improves the accuracy of the machinelearning algorithms on imbalanced datasets. In conclusion, our research highlights the importance of considering the impact of imbalanced datasets on machinelearning algorithm's performance and demonstrates the effectiveness of SMOTE in addressing this issue. Our results can be useful to practitioners working on imbalanced datasets to choose an appropriate machine-learning algorithm and to decide whether to use SMOTE to improve their model's performance.
This work proposes a tool for the implementation of an educational data mining model that applies automated machinelearning and machinelearning interpretability. Starting from the selection between different types o...
详细信息
ISBN:
(纸本)9789811963469;9789811963476
This work proposes a tool for the implementation of an educational data mining model that applies automated machinelearning and machinelearning interpretability. Starting from the selection between different types of educational problems, the tool: allows semi-automatically building the data set, obtaining an optimized machinelearning model using automated machinelearning and enabling the explanation of results with machinelearning interpretability methods. The proposal allows university institutions to draw conclusions on complex problems, requiring a minimum number of experts in datascience and providing a framework for both end users and legal entities to inform themselves about results.
Forecasting the emergence of a dominant design in advance is important because the emergence of the dominant design can provide useful information about the external environment for the product launch. Although the em...
详细信息
Forecasting the emergence of a dominant design in advance is important because the emergence of the dominant design can provide useful information about the external environment for the product launch. Although the emergence of the dominant design can only be determined as a result of the introduction of the product into the market, it may be possible to predict the emergence of the dominant design in advance by applying a solution based on patent analysis. In the newly proposed technique of separating patents, we can capture changes in the state of technological innovation and analyze the emergence of the dominant design, but there is a problem that it requires processing of large amounts of patent data, and that the processing involves subjective judgments by experts. This study focuses on analyzing technological innovation trends using an approach that separates product patents from process patents, investigates whether this approach can be applied to machinelearning, and aims to develop a learning model that automatically classifies patents. We applied text mining to patent information to create structured data sets and compared nine different machinelearning classification algorithms with and without dimensionality reduction. The approach was effectively applied to machinelearning, and the Random Forest, AdaBoost and Support Vector machine models achieved high classification performance of over 95%. By developing these learning models, it is possible to objectively forecast the emergence of a dominant design with high accuracy.
machinelearning methods, represented by deep learning, are rapidly changing and enhancing the process and results of the Numerical Weather Prediction (NWP) model. To better explore the application potential of the ma...
详细信息
ISBN:
(纸本)9798350385113;9798350385106
machinelearning methods, represented by deep learning, are rapidly changing and enhancing the process and results of the Numerical Weather Prediction (NWP) model. To better explore the application potential of the machinelearning method in the Microphysical Parameterization Schemes (MPS) for NWP model, this study introduces a deep learning based cloud microphysical processes scheme. The scheme includes two steps: data processing and machinelearning. The data process step in our scheme is to prepare data for the machinelearning method, where the data is extracted and refined from NWP outputs. Subsequent to this process, a one-dimensional dense convolutional neural network (1DD-CNN) is applied to rigorously analyze the data and boost the predictive precision and computational speed of cloud microphysical processes. Empirical results validate the scheme's efficacy in advancing meteorological predictive capabilities.
The proceedings contain 20 papers. The special focus in this conference is on appliedmachinelearning and data Analytics. The topics include: RU-Net: A Novel Approach for Gastro-Intestinal Tract Image Segmentati...
ISBN:
(纸本)9783031342219
The proceedings contain 20 papers. The special focus in this conference is on appliedmachinelearning and data Analytics. The topics include: RU-Net: A Novel Approach for Gastro-Intestinal Tract Image Segmentation Using Convolutional Neural Network;a Credit Card Fraud Detection Model Using machinelearning Methods with a Hybrid of Undersampling and Oversampling for Handling Imbalanced datasets for High Scores;Implementation of YOLOv7 for Pest Detection;online Grocery Shopping: - Key Factors to Understand Shopping Behavior from data Analytics Perspective;a Novel Approach: Semantic Web Enabled Subject Searching for Union Catalogue;how People in South America is Facing Monkeypox Outbreak?;multilevel Classification of Satellite Images Using Pretrained AlexNet Architecture;handwriting Recognition for Predicting Gender and Handedness Using Deep learning;retrieval of Weighted Lexicons Based on Supervised learning Method;univariate Feature Fitness Measures for Classification Problems: An Empirical Assessment;performance Evaluation of Smart Flower Optimization Algorithm Over Industrial Non-convex Constrained Optimization Problems;securing Advanced Metering Infrastructure Using Blockchain for Effective Energy Trading;keratoconus Classification Using Feature Selection and machinelearning Approach;Semantic Segmentation of the Lung to Examine the Effect of COVID-19 Using UNET Model;SMDKGG: A Socially Aware Metadata Driven Knowledge Graph Generation for Disaster Tweets;Brain MRI Image Classification Using Deep learning;a Real-Time Face Recognition Attendance Using machinelearning;towards Abalone Differentiation Through machinelearning.
Continual learning in machinelearning systems requires models to adapt and evolve based on new data and experiences. However, this dynamic nature also introduces a vulnerability to data poisoning attacks, wheremalici...
详细信息
Continual learning in machinelearning systems requires models to adapt and evolve based on new data and experiences. However, this dynamic nature also introduces a vulnerability to data poisoning attacks, wheremaliciously crafted input can lead to misleading model updates. In this research, we propose a novel approach utilizing theEdDSAencryption system to safeguard the integrity of data streams in continual learning scenarios. By leveraging EdDSA, we establish a robust defense against data poisoning attempts, maintaining the model's trustworthiness and performance over time. Through extensive experimentation on diverse datasets and continual learning scenarios, we demonstrate the efficacy of our proposed approach. The results indicate a significant reduction in susceptibility to data poisoning attacks, even in the presence of sophisticated adversaries.
This paper presents machinelearning methods for health assessment of power transformer based on sweep frequency response analysis. The paper presents an overview of monitoring and diagnostics based on statistical Swe...
详细信息
This paper presents machinelearning methods for health assessment of power transformer based on sweep frequency response analysis. The paper presents an overview of monitoring and diagnostics based on statistical Sweep Frequency Response Analysis (SFRA) based indicators that are used to evaluate the state of the power transformer. Experimental data obtained from power transformers with internal short-circuit faults is used as a database for applying machinelearning. machinelearning is implemented to achieve more precise asset management and condition-based maintenance. Unsupervised machinelearning was applied through the k-means cluster method for classifying and dividing the examined power transformer state into groups with similar state and probability of failure. Artificial neural network (ANN) and Adaptive Neuro Fuzzy Inference System (ANFIS) as part of supervised machinelearning are created in order to detect fault severity in tested power transformers of different lifetime. The presented machinelearning methods can be used to improve health assessment of power transformers.
暂无评论