In today's connected world, where users have millions of choices on online platforms, recommendation systems play an important role in personalizing experiences. Traditionally, recommendation systems are created b...
详细信息
Efficient transfer learning algorithms are key to the success of foundation models on diverse downstream tasks even with limited data. Recent works of Basu et al. (2023) and Kaba et al. (2022) propose group averaging ...
详细信息
ISBN:
(纸本)9781713899921
Efficient transfer learning algorithms are key to the success of foundation models on diverse downstream tasks even with limited data. Recent works of Basu et al. (2023) and Kaba et al. (2022) propose group averaging (equitune) and optimization-based methods, respectively, over features from group-transformed inputs to obtain equivariant outputs from non-equivariant neural networks. While Kaba et al. (2022) are only concerned with training from scratch, we find that equitune performs poorly on equivariant zero-shot tasks despite good finetuning results. We hypothesize that this is because pretrained models provide better quality features for certain transformations than others and simply averaging them is deleterious. Hence, we propose lambda-equitune that averages the features using importance weights, lambda s. These weights are learned directly from the data using a small neural network, leading to excellent zero-shot and finetuned results that outperform equitune. Further, we prove that lambda-equitune is equivariant and a universal approximator of equivariant functions. Additionally, we show that the method of Kaba et al. (2022) used with appropriate loss functions, which we call equizero, also gives excellent zero-shot and finetuned performance. Both equitune and equizero are special cases of lambda-equitune. To show the simplicity and generality of our method, we validate on a wide range of diverse applications and models such as 1) image classification using CLIP, 2) deep Q-learning, 3) fairness in natural language generation (NLG), 4) compositional generalization in languages, and 5) image classification using pretrained CNNs such as Resnet and Alexnet.
visual navigation and shape reconstruction are of paramount importance for the safety and success of asteroid missions. This paper proposes a new method using silhouette and pixel area measurements to simultaneously e...
详细信息
The proliferation of surface waste in water bodies poses significant environmental and ecological challenges. Traditional methods of waste detection are often labor-intensive and limited in scope. This paper presents ...
详细信息
ISBN:
(纸本)9798350352900;9798350352894
The proliferation of surface waste in water bodies poses significant environmental and ecological challenges. Traditional methods of waste detection are often labor-intensive and limited in scope. This paper presents a novel approach to surface waste detection using artificial intelligence (AI) and advanced imaging technologies. Leveraging cutting-edge techniques such as deep learning algorithms, high-resolution satellite imagery, and real-time data processing, our system offers an automated solution for identifying and monitoring waste in water bodies. We developed a robust AI model trained on diverse datasets, including satellite and drone-captured images, to detect various types of surface waste with high accuracy. The system integrates real-time processing capabilities to provide timely alerts and actionable insights for environmental management. Evaluation results demonstrate that our approach significantly improves detection accuracy and operational efficiency compared to conventional methods. This research contributes to the advancement of smart environmental monitoring systems and offers a scalable solution for mitigating the impact of surface waste on aquatic ecosystems.
As a major breakthrough in artificial intelligence and deep learning, Convolutional Neural Networks have achieved an impressive success in solving many problems in several fields including computer vision and image pr...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
As a major breakthrough in artificial intelligence and deep learning, Convolutional Neural Networks have achieved an impressive success in solving many problems in several fields including computer vision and imageprocessing. Real-time performance, robustness of algorithms and fast training processes remain open problems in these contexts. In addition object recognition and detection are challenging tasks for resource-constrained embedded systems, commonly used in the industrial sector. To overcome these issues, we propose a dimensionality reduction framework based on Proper Orthogonal Decomposition, a classical model order reduction technique, in order to gain a reduction in the number of hyperparameters of the net. We have applied such framework to SSD300 architecture using PASCAL VOC dataset, demonstrating a reduction of the network dimension and a remarkable speedup in the fine-tuning of the network in a transfer learning context.
Super-resolution has advanced significantly in the last 20 years, particularly with the application of deep learning methods. One of the most important imageprocessing methods for boosting an image's resolut...
详细信息
Smart homes are to be protected from fire hazards which is a crucial safety concern. Existing ways of detecting fire is time consuming, hence causing maximum injuries and financial loss, so we have come up with an eff...
详细信息
The objective of physics-based differentiable rendering (PBDR) is to propagate gradients between scene parameters and the intensities of image pixels in a manner that is physically correct. The gradients obtained can ...
详细信息
ISBN:
(数字)9781510688780
ISBN:
(纸本)9781510688773
The objective of physics-based differentiable rendering (PBDR) is to propagate gradients between scene parameters and the intensities of image pixels in a manner that is physically correct. The gradients obtained can be applied in optimization algorithms for the reconstruction of 3D geometry or materials, or they can be further propagated into neural network to learn neural representations of the scene. However, applying automatic differentiation techniques directly to the primary rendering process will result in biased gradients, as the rendering integral contains moving high-dimensional discontinuities. Based on how these discontinuities are managed-either implicitly or explicitly-existing PBDR methods can be categorized into two groups: reparameterization methods and boundary sampling methods. Boundary sampling methods need to construct paths that have one segment tangent to the geometry being differentiated in order to estimate a boundary integral to address the discontinuities explicitly. Such paths are usually constructed by sampling the tangent segment first and then extending it to complete the paths for subsequent processing. Fortunately, the number of dimensions in the space composed of such tangent segments is only three. In scenes comprised solely of triangle meshes, the first dimension is used to parameterize all the edges on the mesh, which determines a point on the tangent segment. The remaining two dimensions are used to parameterize the direction of the tangent segment. However, state-of-the-art boundary sampling methods parameterize the first dimension uniformly, which is inefficient because only a small portion of the edges contributes to the boundary integral, resulting in wasted parameter space. In this paper, we parameterize the first dimension by considering both edge length and contributions, thereby allocating more parameter space to important edges. Experiments demonstrate that our methods achieve lower variance gradients in the forward dif
Chest radiography allows a detailed inspection of a patient's thorax via an imaging modality, but requires specialized training for proper interpretation. With the advent of high performance general purpose image ...
详细信息
In electronic factors manufacturing, icing product quality through disfigurement discovery is pivotal for maintaining trustability and performance norms. This exploration investigates the operation of deep literacy wa...
详细信息
暂无评论