In very recent years more attention has been placed on probing the role of pre-training data in Large Language Models (LLMs) downstream behaviour. Despite the importance, there is no public tool that supports such ana...
The three common genetic models(or modes of inheritance)in association analysis are the dominant,additive,and recessive *** is known that the Cochran-Armitage trend test(CATT)which correctly incorporates information f...
详细信息
The three common genetic models(or modes of inheritance)in association analysis are the dominant,additive,and recessive *** is known that the Cochran-Armitage trend test(CATT)which correctly incorporates information from genetic models,is more powerful than the commonly used Pearson’s chi-square ***,the true genetic model is usually unknown in practice,and the power of the CAT test could be substantially reduced with a wrongly specified genetic *** achieve a power that is close to that of a correctly specified CAT test,it is natural to apply trend tests under different possible genetic models and to report the most significant test *** results in a MAX-type testing procedure,and it was found that this test is usually more powerful than the Pearson’s chi-square *** the signi-ficance(i.e.,p value)of the MAX-type test can be accessed by either large sample approximation or permutation methods,requirements for sample size or simulation replicates are demanding with respect to accuracy and *** paper proposes an approach to calculate the exact p values of MAX-type tests based on the combinatorial counting *** simulation results show that the exact method is more accurate than the large sample approximation methods and more computationally efficient than the permutation method,and our method can be readily applied to genome-wide association studies(GWASs).The proposed methodis built in an R package,MaXact,which is available at the https://***/Myuan 2019/MaXact/.
We consider the Sherrington-Kirkpatrick model with no external field and inverse temperature β 8. Our result follows by establishing an approximate formula for the covariance matrix which we obtain by differentiating...
详细信息
The growth of the commodification of music in the present age has made royalties allocation in an efficient, straight-forward manner to the stakeholders, in general, a complex issue. To address these challenges, this ...
详细信息
The creation of digital content and the easy accessibility of information have led to a surge in academic and textual plagiarism. Plagiarism detection in multiple languages is essential to maintain the integrity of ac...
The creation of digital content and the easy accessibility of information have led to a surge in academic and textual plagiarism. Plagiarism detection in multiple languages is essential to maintain the integrity of academic and literary works. In the context of the Urdu language, there is a growing need for effective plagiarism detection methods that are tailored to its unique linguistic characteristics. Existing Urdu plagiarism detection tools often rely on external sources or lack robustness in handling intrinsic forms of plagiarism, where the copied content is slightly modified or paraphrased. This research aims to bridge this gap by developing an intrinsic plagiarism detection system for the Urdu language, using a combination of machine learning, ensemble learning and Multi-Layer Perceptron (MLP). Furthermore, to train and evaluate our plagiarism detection models, we manually curate a corpus comprising a substantial collection of 1807 documents in Urdu. This corpus forms the foundation of our research, enabling us to develop and fine-tune our detection algorithms to effectively identify instances of intrinsic plagiarism in Urdu text. To comprehensively assess the unique stylistic fingerprints of documents, we employ a diverse set of word based stylometry features. This multifaceted approach enhances our ability to pinpoint instances of plagiarism in a robust manner. This research contributes to the ongoing efforts to combat plagiarism and uphold the integrity of written content, particularly in the context of the Urdu language, while also showcasing the effectiveness of different word based stylometry features in addressing this critical issue.
The use of smartphone-based Serious Games in mental health care is an emerging and promising research field. Combining the intrinsic characteristics of games (e.g., interactiveness, immersiveness, playfulness, user-ta...
详细信息
The assessment of power quality is vital for evaluating the existing state of the electricity supply, pinpointing its deficiencies, and guiding enhancements to guarantee a stable and consistent power supply, particula...
详细信息
Global interest in complementary and alternative medicine has increased in recent years, with Kampo medicine in Japan gaining greater trust and use. Detailed patient interviews are essential in Kampo medicine, as the ...
详细信息
Machine learning systems have gained tremendous attention in solving complex tasks. For instance, image classification using machine learning has multiple applications in the medical field, where different imaging mod...
Machine learning systems have gained tremendous attention in solving complex tasks. For instance, image classification using machine learning has multiple applications in the medical field, where different imaging modalities are used for diagnosing, detecting, and classifying diseases. This study aims to analyze different machine-learning approaches for detecting knee osteoarthritis using X-rays. We used different machine learning algorithms to classify X-ray images of the knee to various levels of osteoarthritis severity. We have solved the problem as a binary classification task, multi-class classification, and ordinal regression task. The Histogram of Oriented Gradients and Haralick Features are used for feature extraction in the binary classification task. For Binary Classification, six experiments are performed using different machine learning classifiers. Logistic regression has a maximum accuracy of 84.50%, F measure of 81%, AUC of 0.84, Mathew’s Correlation Coefficient of 0.69, and Cohen’s Kappa 0.66. 10-Fold cross-validation also reveals that the models’ performance is consistent across all folds. Furthermore, texture and shape features are extracted for ordinal classification using the histogram of oriented gradients and dense DAISY descriptors. Finally, separate experiments are performed, and results for both descriptors are compared. Our implemented approaches have shown state-of-the-art results.
The prediction of heart disease is crucial for effective prevention and treatment. However, extracting clinical infor-mation such as CAD, smoking, hypertension, hyperlipidemia, obesity, and family history of CAD from ...
详细信息
ISBN:
(数字)9798331510503
ISBN:
(纸本)9798331510510
The prediction of heart disease is crucial for effective prevention and treatment. However, extracting clinical infor-mation such as CAD, smoking, hypertension, hyperlipidemia, obesity, and family history of CAD from unstructured electronic health records (EHRs) poses significant challenges to clinicians. This research introduces a novel approach that leverages an ensemble of transfer learning algorithms combined with a multi-head attention mechanism to automatically extract heart disease risk factors from EHRs. Various deep learning models, including BERT, BioBERT, BioClinical BERT, RoBERTa, and XLNet, were initially trained on medical data sets and subsequently fine-tuned on the i2b2 clinical data set. Individual models delivered strong results, with RoBERTa achieving the highest accuracy of 95. 27% and an F1 score of 94. 94%. BioBERT, BioClinical BERT, XLNet, and BERT also performed well, with precision ranging from 94. 73% to 95. 03%. However, the proposed ensemble model with multi-head attention outperformed all, achieving an accuracy of 96.35% and the F1-score of 95.76%. These findings highlight the superior ability of the ensemble model to capture complex inter-dependencies between heart disease risk factors, making it a robust tool for clinical prediction.
暂无评论