Cyberbullying (CB) is a global dilemma that is growing rapidly to affect more individuals including minors. The devastating consequences of CB indicate a pressing necessity to regulate unethical or illegal users' ...
详细信息
ISBN:
(纸本)9798400700446
Cyberbullying (CB) is a global dilemma that is growing rapidly to affect more individuals including minors. The devastating consequences of CB indicate a pressing necessity to regulate unethical or illegal users' online behaviors. A remarkable number of researchers attempted to harness the potential of machinelearning to detect and prevent such harmful behaviors, however, the existing studies targeting Arabic-based content are still emerging. Therefore, this paper provides a comprehensive review of the published empirical studies in CB detection in Arabic-based content with an emphasis on the adapted methodologies, gaps, and challenges. We hope this work would support researchers in the area of CB-detection to foster a safe online environment and protect against any harmful consequences of CB among users.
machinelearning system analysis requires different approaches for each different task and domain. Selecting a proper set of analytic models can be challenging for a specific problem. This paper discusses the extensib...
详细信息
ISBN:
(纸本)9798350301137
machinelearning system analysis requires different approaches for each different task and domain. Selecting a proper set of analytic models can be challenging for a specific problem. This paper discusses the extensibility of the Multi-View Modeling Framework for ML Systems approach using process mapping and extensible metamodel. We conducted a case study to evaluate the feasibility of such extensibility by extending the approach to facilitate an activity-driven analysis for an optical character recognition system. Based on the result of the case study, we found that Multi-View Modeling Framework for ML Systems is likely to be extensible.
Communication is one of the most demanding activities in software development. The effectiveness of communication can be measured by analyzing three interpersonal communication dimensions: Active Discussion, Creative ...
详细信息
ISBN:
(纸本)9798350324983
Communication is one of the most demanding activities in software development. The effectiveness of communication can be measured by analyzing three interpersonal communication dimensions: Active Discussion, Creative conflict, and Conversation Management. Previous work relied on manually labeling the communication dimensions to analyze the effectiveness of software design discussions, a process that is time-consuming and not applicable to real-time use. In this study, natural language processing and supervised machinelearning are used to create COMET, a tool for automatic classification and assessment of the effectiveness of interpersonal communication during software design meetings. To determine the optimal classification approach, nine different classifiers are examined. The classifier model that performed the best is Random Forest which managed to achieve 93.66% accuracy, 93.76% precision, and 93.63% recall when trained and tested with a stratified 10-fold cross-validation technique.
- The development of the group based on the machinelearning process is being generated with the support of software analysis. Development of the ideas is being generated and proper predictions about the research topi...
详细信息
The use of a finer-grained classroom language analysis coding form facilitates a more in-depth analysis of teachers' classroom language behavior. This study uses a finer-grained T-coding system to encode texts con...
详细信息
Defects are largely inevitable in the software development life cycle. Since we cannot avoid them during the development process, we can only desire to fight back with our limited resources in terms of time and moneta...
详细信息
Detection of novel game cheating tools is critical for ensuring fair online play. Such cheating tools are visual-based and effectively avoid detection because they do not change the data of game software. With the dev...
详细信息
software Maintenance is of utmost importance for any industry. So, to predict the value of software maintenance beforehand also becomes very important, hence many software maintenance prediction algorithms have been d...
详细信息
machinelearning is facing a 'reproducibility crisis' where a significant number of works report failures when attempting to reproduce previously published results. We evaluate the sources of reproducibility f...
详细信息
ISBN:
(纸本)9798350301137
machinelearning is facing a 'reproducibility crisis' where a significant number of works report failures when attempting to reproduce previously published results. We evaluate the sources of reproducibility failures using a meta-analysis of 142 replication studies from ReScience C and 204 code repositories. We find that missing experiment details such as hyperparameters are potential causes of unreproducibility. We experimentally show the bias of different hyperparameter selection strategies and conclude that consolidated artifacts with a unified framework can help support reproducibility.
There has been an increasing interest in enhancing the fairness of machinelearning (ML). Despite the growing number of fairness-improving methods, we lack a systematic understanding of the trade-offs among factors co...
详细信息
ISBN:
(纸本)9798350329964
There has been an increasing interest in enhancing the fairness of machinelearning (ML). Despite the growing number of fairness-improving methods, we lack a systematic understanding of the trade-offs among factors considered in the ML pipeline when fairness-improving methods are applied. This understanding is essential for developers to make informed decisions regarding the provision of fair ML services. Nonetheless, it is extremely difficult to analyze the trade-offs when there are multiple fairness parameters and other crucial metrics involved, coupled, and even in conflict with one another. This paper uses causality analysis as a principled method for analyzing trade-offs between fairness parameters and other crucial metrics in ML pipelines. To practically and effectively conduct causality analysis, we propose a set of domain-specific optimizations to facilitate accurate causal discovery and a unified, novel interface for trade-off analysis based on well-established causal inference methods. We conduct a comprehensive empirical study using three real-world datasets on a collection of widely-used fairness-improving techniques. Our study obtains actionable suggestions for users and developers of fair ML. We further demonstrate the versatile usage of our approach in selecting the optimal fairness-improving method, paving the way for more ethical and socially responsible AI technologies.
暂无评论