The agricultural sector is one of India's most important endeavors and is critical to the country's economic development. Agriculture is one of the most important things that contributes to ...
The Berry-Esseen bound provides an upper bound on the Kolmogorov distance between a random variable and the normal distribution. In this paper, we establish Berry-Esseen bounds with optimal rates for self-normalized sums of locally dependent random variables, assuming only a second-moment condition. Our proof leverages Stein's method and introduces a novel randomized concentration inequality, which may also be of independent interest for other applications. Our main results are applied to self-normalized sums of m-dependent random variables and graph dependency models.
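For readers who want the quantities named in this abstract written out, a sketch in standard notation follows. The symbols W, Z, X_i, S_n, and V_n are introduced here for illustration and do not come from the abstract. The self-normalized sum is shown in its classical form for independent summands, which is an assumption; the normalizer used in the paper under local dependence may differ.

```latex
% Kolmogorov distance between a random variable W and the standard normal law
d_K(W, Z) = \sup_{x \in \mathbb{R}} \bigl| P(W \le x) - \Phi(x) \bigr|,
\qquad Z \sim N(0, 1)

% Classical self-normalized sum (independent summands; illustrative only)
W_n = \frac{S_n}{V_n}, \qquad
S_n = \sum_{i=1}^{n} X_i, \qquad
V_n^2 = \sum_{i=1}^{n} X_i^2
```

A Berry-Esseen bound for W_n is then an explicit upper bound on d_K(W_n, Z).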
With accelerating urbanization, the mismatch between the supply of and demand for urban public transportation resources is becoming increasingly prominent, resulting in increasingly serious problems s...
Twitter is a powerful platform for communication and information sharing but is also susceptible to spreading false information. This false information has adverse consequences for society and can significantly impact...
Detecting plagiarism in documents is a well-established task in natural language processing (NLP). Broadly, plagiarism detection is categorized into two types: (1) intrinsic, which checks whether the whole document or all of its passages were written by a single author; and (2) extrinsic, in which a suspicious document is compared with a given set of source documents to find sentences or phrases that appear in both. This study addresses the challenge of intrinsic plagiarism detection in Urdu texts, a language with limited resources for comprehensive language models. Acknowledging the absence of sophisticated large language models (LLMs) tailored for the Urdu language, this study explores the application of various machine learning, deep learning, and language models in a novel framework. A set of 43 stylometry features at six granularity levels was curated, capturing linguistic patterns indicative of plagiarism. The selected models include traditional machine learning approaches (logistic regression, decision trees, SVM, KNN, Naive Bayes, gradient boosting, and a voting classifier), deep learning approaches (GRU, BiLSTM, CNN, LSTM, and MLP), and large language models (BERT and GPT-2). This research systematically categorizes these features and evaluates their effectiveness, addressing the inherent challenges posed by the limited availability of Urdu-specific language models. Two distinct experiments were conducted to evaluate the impact of the proposed features on classification accuracy. In experiment one, the entire dataset was used for classification into intrinsically plagiarized and non-plagiarized documents. Experiment two categorized the dataset into three types based on topics: moral lessons, national celebrities, and national events. Both experiments were evaluated through fivefold cross-validation. The results show that the random forest classifier achieved an ex
In the digital era, cyberbullying is a growing concern that impacts the well-being of its victims. The rise of cyberbullying among social media users necessitates robust detection solutions. One of these solutions is ...
Accurate monitoring of urban waterlogging contributes to the city’s normal operation and the safety of residents’ daily life. However, due to feedback delays or high costs, existing methods make large-scale, fine-grained waterlogging monitoring impossible. A common method is to forecast the city’s global waterlogging status using its partial waterlogging data. This method has two challenges: first, existing predictive algorithms are driven by either knowledge or data alone; and second, the partial waterlogging data is not collected selectively, resulting in poor [...]. To overcome the aforementioned challenges, this paper proposes a framework for large-scale and fine-grained spatiotemporal waterlogging monitoring based on the opportunistic sensing of limited bus routes. This framework follows Sparse Crowdsensing and mainly comprises a pair of iterative components, a predictor and a selector. The predictor uses the collected waterlogging status and the predicted status of the uncollected area to train the graph convolutional neural network. It combines both knowledge-driven and data-driven approaches and can be used to forecast waterlogging status in all regions for the upcoming [...]. The selector consists of a two-stage selection procedure that can select valuable bus routes while satisfying budget constraints. The experimental results on real waterlogging data and bus routes in Shenzhen show that the proposed framework could easily perform urban waterlogging monitoring with low cost, high accuracy, wide coverage, and fine granularity.
Autism spectrum disorder (ASD) affects 1 in 100 children globally. Early detection and intervention can enhance quality of life for individuals diagnosed with ASD. This research uses the support vector machine-recursive feature elimination (SVM-RFE) method for ASD classification with the phenotypic and Automated Anatomical Labeling (AAL) Brain Atlas datasets of the Autism Brain Imaging Data Exchange preprocessed dataset. The functional connectivity matrix (FCM) is computed for the AAL data, generating 6670 features representing pair-wise brain region activity. The SVM-RFE feature selection method was applied five times to the FCM data, determining the optimal number of features to be 750 for the best-performing support vector machine (SVM) model, corresponding to a dimensionality reduction of 88.76%. Pertinent phenotypic data features were manually selected and processed. Subsequently, five experiments were conducted, each representing a different combination of the features used for training and testing the linear SVM, deep neural network, one-dimensional convolutional neural network, and random forest machine learning models. These models were fine-tuned using grid search cross-validation (CV) and evaluated on various metrics using 5-fold CV. The most relevant brain regions from the optimal feature set were identified by ranking the SVM-RFE feature weights. The SVM-RFE approach achieved a state-of-the-art accuracy of 90.33% on the linear SVM model using the Data Processing Assistant for Resting-State Functional Magnetic Resonance Imaging pipeline. The SVM model’s ability to rank the features by importance provides clarity into the factors contributing to the diagnosis. The thalamus right, rectus right, and temporal middle left AAL brain regions, among others, were identified as having the highest number of connections to other brain regions. These results highlight the importance of using traditional ML models fo
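The two figures quoted in this abstract, 6670 pair-wise features and an 88.76% reduction, can be checked with a short calculation. The sketch below assumes the 116-region version of the AAL atlas, which the abstract does not state. The function names are hypothetical and are not taken from the paper.

```python
# Sanity check of the feature counts quoted in the abstract.
# Assumption (not stated in the abstract): the AAL atlas has 116 regions.
N_REGIONS = 116
KEPT_FEATURES = 750  # optimal feature count reported in the abstract


def pairwise_feature_count(n_regions: int) -> int:
    """Number of unique region pairs, i.e. the strict upper triangle
    of a symmetric n_regions x n_regions connectivity matrix."""
    return n_regions * (n_regions - 1) // 2


def reduction_percent(total: int, kept: int) -> float:
    """Percentage of features removed when keeping `kept` out of `total`."""
    return 100.0 * (1 - kept / total)


total_features = pairwise_feature_count(N_REGIONS)
reduction = reduction_percent(total_features, KEPT_FEATURES)
print(total_features, round(reduction, 2))
```

Under that assumption, the pair count and the reduction percentage both match the values reported in the abstract.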
Age-related Macular Degeneration (AMD) is the most common eye disease that causes visual impairment in elderly people. AMD is commonly detected by Spectral Domain Optical Coherence Tomography (SD-OCT) for diagnosis ...
Recommender systems, which are essential for providing users with recommendations, have improved user engagement. Matrix factorization (MF), one of the traditional approaches, has shown promise in capturin...