In this work we investigate the generalization performance of random feature ridge regression (RFRR). Our main contribution is a general deterministic equivalent for the test error of RFRR. Specifically, under a certa...
ISBN:
(纸本)9798331314385
In this work we investigate the generalization performance of random feature ridge regression (RFRR). Our main contribution is a general deterministic equivalent for the test error of RFRR. Specifically, under a certain concentration property, we show that the test error is well approximated by a closed-form expression that only depends on the feature map eigenvalues. Notably, our approximation guarantee is non-asymptotic, multiplicative, and independent of the feature map dimension— allowing for infinite-dimensional features. We expect this deterministic equivalent to hold broadly beyond our theoretical analysis, and we empirically validate its predictions on various real and synthetic datasets. As an application, we derive sharp excess error rates under standard power-law assumptions of the spectrum and target decay. In particular, we provide a tight result for the smallest number of features achieving optimal minimax error rate.
Pairwise dot product-based self-attention is key to the success of transformers which achieve state-of-the-art performance across a variety of applications in language and vision, but are costly to compute. It has bee...
详细信息
In today's world the internet has ingrained itself deeply into our lives, and web searching has become essential for people of all ages, locations, and occupations. However, due to the rise in internet usage, ther...
详细信息
ISBN:
(数字)9798350389609
ISBN:
(纸本)9798350389616
In today's world the internet has ingrained itself deeply into our lives, and web searching has become essential for people of all ages, locations, and occupations. However, due to the rise in internet usage, there has been a rise in spoofing attacks through malicious websites. Online purchases, reservations, recharges, and various other transactions are now commonly conducted online. Internet users are increasing; it has become crucial to develop automatic URL detection systems to protect users. This proposed research aims to compare different categorization models like Decision Tree, Random Forest and Logistic Regression to determine which one achieves finest accuracy in distinguishing among genuine websites and False websites. Objective is to identify attacking sites effectively and determine the artificial intelligence algorithm that provides the best accuracy for this use.
Semi-supervised learning (SSL) offers an effective approach by leveraging unlabeled data to alleviate the excessive reliance on labeled data. Despite demonstrating promising performance, the issue of selecting the opt...
详细信息
Ensembling has a long history in statistical data analysis, with many impactful applications. However, in many modern machine learning settings, the benefits of ensembling are less ubiquitous and less obvious. We stud...
Ensembling has a long history in statistical data analysis, with many impactful applications. However, in many modern machine learning settings, the benefits of ensembling are less ubiquitous and less obvious. We study, both theoretically and empirically, the fundamental question of when ensembling yields significant performance improvements in classification tasks. Theoretically, we prove new results relating the ensemble improvement rate (a measure of how much ensembling decreases the error rate versus a single model, on a relative scale) to the disagreement-error ratio. We show that ensembling improves performance significantly whenever the disagreement rate is large relative to the average error rate; and that, conversely, one classifier is often enough whenever the disagreement rate is low relative to the average error rate. On the way to proving these results, we derive, under a mild condition called competence, improved upper and lower bounds on the average test error rate of the majority vote classifier. To complement this theory, we study ensembling empirically in a variety of settings, verifying the predictions made by our theory, and identifying practical scenarios where ensembling does and does not result in large performance improvements. Perhaps most notably, we demonstrate a distinct difference in behavior between interpolating models (popular in current practice) and non-interpolating models (such as tree-based methods, where ensembling is popular), demonstrating that ensembling helps considerably more in the latter case than in the former.
Cost optimization is a common problem encountered in the design of telescopes. This paper comprehensively discusses various radio telescope designs worldwide, focusing on their design and utilities. It contextualizes ...
详细信息
In the event of a disaster, social media is often used to draw attention to affected areas and distressed people. The massive population and diversity in Indian languages warrant a novel real-time, big-data solution t...
详细信息
To alleviate the big data difficulties that have created a potential problem for many Internet users, it is necessary to filter, rank, and efficiently communicate the relevant information on the Web, where the diversi...
详细信息
The forward-forward algorithm [Hinton, 2022] presents a new method of training neural networks by updating weights during an inference, performing parameter updates for each layer individually. This immediately reduce...
详细信息
Early diagnosis of osteonecrosis of the femoral head (ONFH) can inhibit the progression and improve femoral head preservation. The radiograph difference between early ONFH and healthy ones is not apparent to the naked...
详细信息
暂无评论