para>Defects in pipeline welds are fatal for pipelines, considering that weld negatives need to be electronically preserved due to high preservation costs, easy damage, etc., and that most of the weld defects are j...
详细信息
Artificial intelligence (AI) systems are increasingly being used not only to classify and analyze but also to generate images and text. As recent work on the content produced by text and image Generative AIs has shown...
详细信息
Artificial intelligence (AI) systems are increasingly being used not only to classify and analyze but also to generate images and text. As recent work on the content produced by text and image Generative AIs has shown (e.g., Cheong et al., 2024, Acerbi & Stubbersfield, 2023), there is a risk that harms of representation and bias, already documented in prior AI and natural language processing (NLP) algorithms may also be present in generative models. These harms relate to protected categories such as gender, race, age, and religion. There are several kinds of harms of representation to consider in this context, including stereotyping, lack of recognition, denigration, under-representation, and many others (Crawford in Soundings 41:45-55, 2009;in: Barocas et al., SIGCIS conference, 2017). Whereas the bulk of researchers' attention thus far has been given to stereotyping and denigration, in this study we examine 'exnomination', as conceived by Roland Barthes (1972), of religious groups. Our case study is DALL-E, a tool that generates images from natural language prompts. Using DALL-E mini, we generate images from generic prompts such as "religious person." We then examine whether the generated images are recognizably members of a nominated group. Thus, we assess whether the generated images normalize some religions while neglecting others. We hypothesize that Christianity will be recognizably represented more frequently than other religious groups. Our results partially support this hypothesis but introduce further complexities, which we then explore.
The proceedings contain 134 papers. The topics discussed include: an adaptive storage switching algorithm for fault-tolerant network attached storage systems;Covid-19 prediction using machine learning algorithms;energ...
ISBN:
(纸本)9798350387933
The proceedings contain 134 papers. The topics discussed include: an adaptive storage switching algorithm for fault-tolerant network attached storage systems;Covid-19 prediction using machine learning algorithms;energy management of hybrid electric vehicles using cascaded fuzzy logic controller;dynamic lane management with IoT for real-time lane configuration and traffic flow;a closer look at sclera: emerging trends in biometric security;cognitive vision companion: an ai-enhanced support system for the visually impaired;advances in medical imageprocessing for liver tumor recognition: a comprehensive survey;a gradient boosting algorithm to predict energy consumption for home applications;and review on text classification using improved deep learning models.
The computer aided diagnosis systems from radiological images has been of interest to researchers mostly for detection of bone fracture or dislocation. The accuracy highly depends on bone segmentation. Any improvement...
详细信息
ISBN:
(数字)9788362065424
ISBN:
(纸本)9788362065424
The computer aided diagnosis systems from radiological images has been of interest to researchers mostly for detection of bone fracture or dislocation. The accuracy highly depends on bone segmentation. Any improvement of such systems, particularly for noisy X-ray images, is very valuable. Classical image segmentation depending on image homogeneity are time consuming and require pixel-wise labelling. On the other hand, saliency map based approaches fail to detect the region around the fracture or segment the dislocated bones. In our research we have used transfer learning to train the faster regional convolutional neural network (FCNN) alongside distance regularized level set evolution (DRLSE) to have accurate bone segmentation without any pixel-wise labelling enabling segmentation of the region around the fracture and dislocated bones. We applied the proposed method to a number of hand X-ray images and achieved accuracy values of 95% and average precision-recall of 0.96.
作者:
Menconero, SofiaDepartment of History
Representation and Restoration of Architecture Sapienza University of Rome Piazza Borghese 9 Rome00186 Italy
Piranesi printed the 16 etchings of the Carceri d’invenzione in 1761. This version, which is more widespread and better known, derives from the reworking of matrices that the Venetian engraver produced in 1749–50. T...
详细信息
Communicating online without fearing third-party interventions is becoming a challenge in the modern world. Especially the sectors like the military, and government organizations or private companies sharing sensitive...
详细信息
Surface water resource identification is one of the main techniques used in remote sensing image analysis. This is necessary to stop calamities like floods and droughts. Feature selection based on prior information an...
详细信息
Total p-norm Variation (TpV) is a well-established technique in imageprocessing, used to denoise and preserve edges. However, the related non-convex minimization is still a challenging task in optimization, both for ...
详细信息
To article proposes an approach to improve the quality of data used in various processes based on machine vision systems. The paper proposes a combined approach to applying the method of multi-criteria processing base...
详细信息
ISBN:
(数字)9781510662117
ISBN:
(纸本)9781510662100;9781510662117
To article proposes an approach to improve the quality of data used in various processes based on machine vision systems. The paper proposes a combined approach to applying the method of multi-criteria processing based on the use of a combined criterion in order to implement an edge detector, smoothing and separation areas of the background / object in the image. The application of the method allows eliminating the noise caused by external factors (such as dust and water suspension on the lens or space). The generated data make it possible to form an adaptive criterion for changing the correction parameters for a non-linear change in color balance in areas of increased detail or selected masks of changes blocks. The proposed algorithms make it possible to increase the visibility of small elements, reduce the noise component, while maintaining the boundaries of objects, increase the accuracy of selecting the boundaries of objects and the visual quality of data. As test data used to evaluate the effectiveness, nature data and expert evaluation results for test images obtained by a machine vision system with a sensor with a resolution of 1024x768 (8-bit, color image, visible range) are used. images of simple shapes are used as analyzed objects.
A vital and rapidly growing application, remote sensing offers vast yet sparsely labeled, spatially aligned multimodal data;this makes self-supervised learning algorithms invaluable. We present CROMA: a framework that...
详细信息
ISBN:
(纸本)9781713899921
A vital and rapidly growing application, remote sensing offers vast yet sparsely labeled, spatially aligned multimodal data;this makes self-supervised learning algorithms invaluable. We present CROMA: a framework that combines contrastive and reconstruction self-supervised objectives to learn rich unimodal and multimodal representations. Our method separately encodes masked-out multispectral optical and synthetic aperture radar samples-aligned in space and time-and performs cross-modal contrastive learning. Another encoder fuses these sensors, producing joint multimodal encodings that are used to predict the masked patches via a lightweight decoder. We show that these objectives are complementary when leveraged on spatially aligned multimodal data. We also introduce X- and 2D-ALiBi, which spatially biases our cross- and self-attention matrices. These strategies improve representations and allow our models to effectively extrapolate to images up to 17.6x larger at test-time. CROMA outperforms the current SoTA multispectral model, evaluated on: four classification benchmarks-finetuning (*** arrow 1.8%), linear (*** arrow 2.4%) and nonlinear (*** arrow 1.4%) probing, kNN classification (*** arrow 3.5%), and K-means clustering (*** arrow 8.4%);and three segmentation benchmarks (*** arrow 6.4%). CROMA's rich, optionally multimodal representations can be widely leveraged across remote sensing applications.
暂无评论