Based on the text data of authoritative skin care community, this study analyzes the user39;s needs and summarizes six topics on the premise that it has the option of repurchase intention. We need to calculate the e...
详细信息
The escalation of cybercrime in recent years, moves on by the proliferation of electronic devices, poses a significant threat as users engage in diverse online activities, including financial transactions. Cybercrimin...
详细信息
ISBN:
(纸本)9798350372977;9798350372984
The escalation of cybercrime in recent years, moves on by the proliferation of electronic devices, poses a significant threat as users engage in diverse online activities, including financial transactions. Cybercriminals exploit phishing information and manipulate cardholders' account balances to illicitly appropriate funds, presenting challenges in traceability. This study addresses detecting fraudulent banking transactions within customers' account data, characterized by highly imbalanced fraud records. Employing diverse machinelearning algorithms, particularly focusing on tree-based approaches, this research reveals that LightGBM and XGBoost exhibit superior performance, with LightGBM demonstrating notable supremacy. The resultant evaluation metric, the Area Under the Curve (AUC), attains a commendable value of 0.94649, underscoring the efficacy of the proposed machinelearning models in cybersecurity measures against fraudulent financial activities.
In the dynamic era of online education, the pursuit of a personalized and effective learning experience is paramount. A transformative approach in online education by integrating Multimodal data Mining and data Synthe...
详细信息
The proliferation of machinelearning (ML) has brought unprecedented advancements in technology, but it has also raised concerns about its environmental impact, particularly concerning carbon emissions. To address the...
详细信息
ISBN:
(纸本)9798400705915
The proliferation of machinelearning (ML) has brought unprecedented advancements in technology, but it has also raised concerns about its environmental impact, particularly concerning carbon emissions. To address the imperative of environmentally responsible ML, we present in this paper a novel ML pipeline, named CEMAI, designed to monitor and analyze carbon emissions across the entire lifecycle of ML model development, from data preparation to training and deployment. Our endeavor involves an exhaustive evaluation process underpinned by three industrial case studies. These case studies are structured around the application of ML models to predict tool wear, estimate remaining useful lifetimes, and detect anomalies in the Industrial Internet of Things (IIoT). Leveraging sensor data originating from CNC machining and broaching operations, our research shows empirically the efficacy of carbon emissions as a dependable metric guiding the configuration of an ML development process. The essence of our approach lies in striking a balance between superior performance and minimal carbon emissions. Our findings reveal the potential to optimize pipeline configurations for ML models in a manner that not only enhances performance but also drastically reduces carbon emissions, thereby underlining the significance of adopting ecologically responsible engineering practices.
This study explores the impact of industry-education integration on college students’ employment rate using machinelearning models. The original data was preprocessed through feature engineering, and models such as ...
详细信息
datasets and models are two key artifacts in machinelearning (ML) applications. Although there exist tools to support dataset and model developers in managing ML artifacts, little is known about how these datasets an...
详细信息
ISBN:
(纸本)9798400705915
datasets and models are two key artifacts in machinelearning (ML) applications. Although there exist tools to support dataset and model developers in managing ML artifacts, little is known about how these datasets and models are integrated into ML applications. In this paper, we study how datasets and models in ML applications are managed. In particular, we focus on how these artifacts are stored and versioned alongside the applications. After analyzing 93 repositories, we identified the most common storage location to store datasets and models is the file system, which causes availability issues. Notably, large data and model files, exceeding approximately 60 MB, are stored exclusively in remote storage and downloaded as needed. Most of the datasets and models lack proper integration with the version control system, posing potential traceability and reproducibility issues. Additionally, although datasets and models are likely to evolve during the application development, they are rarely updated in application repositories.
To solve the problem of automatic mapping of single line diagrams in distribution networks, a large amount of manual adjustment is required to meet practical requirements. This article is based on automatic mapping of...
详细信息
In recent years, there has been a notable surge in exploring statistical methodologies and artificial intelligence (AI) approaches, such as machinelearning and deep learning, across various domains, including enginee...
详细信息
ISBN:
(纸本)9798350372977;9798350372984
In recent years, there has been a notable surge in exploring statistical methodologies and artificial intelligence (AI) approaches, such as machinelearning and deep learning, across various domains, including engineering. These data-driven techniques hold promise for delivering faster and more accurate predictions, representing a significant avenue for progress. Building upon prior research, this study extends the scope by incorporating additional features and utilizing a distinct dataset for forecasting the average heated bridge deck surface temperature and the outlet fluid temperature from the hydronic heating loops. Leveraging machinelearning algorithms on data collected from a bridge de-icing project in Texas, we examine the effectiveness of Multiple Linear Regression (MLR) and Support Vector Regression (SVR). Through rigorous comparison with field data, we validate the robustness of these methodologies in temperature forecasting tasks, noting high accuracy across all algorithms. Notably, while both MLR and SVR demonstrate commendable performance, MLR marginally outperforms SVR, achieving an R-2 value of 0.79.
Using big data theory to analyze athletes39; performance data, this paper finds out the changing characteristics of athletes39; performance. Using the principle of big data, we find the performance characteristics...
详细信息
This project introduces the creation of a productivity application specifically designed to cater to the requirements of individuals impacted by Attention Deficit Hyperactivity Disorder (ADHD). Leveraging machine lear...
详细信息
暂无评论