Zero-shot learning methods rely on fixed visual and semantic embeddings, extracted from independent vision and language models, both pre-trained for other large-scale tasks. This is a weakness of current zero-shot lea...
详细信息
ISBN:
(数字)9781665487399
ISBN:
(纸本)9781665487399
Zero-shot learning methods rely on fixed visual and semantic embeddings, extracted from independent vision and language models, both pre-trained for other large-scale tasks. This is a weakness of current zero-shot learning frameworks as such disjoint embeddings fail to adequately associate visual and textual information to their shared semantic content. Therefore, we propose to learn semantically grounded and enriched visual information by computing a joint image and text model with a two-stream network on a proxy task. To improve this alignment between image and textual representations, provided by attributes, we leverage ancillary captions to provide grounded semantic information. Our method, dubbed joint embeddings for zero-shot learning is evaluated on several benchmark datasets, improving the performance of existing state-of-the-art methods in both standard (+1.6% on aPY, +2.6% on FLO) and generalized (+2.1% on AWA2, +2.2% on CUB) zero-shot recognition.
Wearable gadgets are becoming a major constituent of our society due to a wide range of applications like financial transactions, unlocking automobiles, tracking health and fitness, and many more. Personal data is usu...
详细信息
Aiming at the application of Orbital Angular Momentum (OAM) in the field of optical communication, this paper proposes an OAM patternrecognition method based on ResNet-18 transfer learning. The OAM mode is a light wa...
详细信息
Image inpainting has achieved great advances by simultaneously leveraging image structure and texture features. However, due to lack of effective multi-feature fusion techniques, existing image inpainting methods stil...
详细信息
ISBN:
(数字)9781665490627
ISBN:
(纸本)9781665490627
Image inpainting has achieved great advances by simultaneously leveraging image structure and texture features. However, due to lack of effective multi-feature fusion techniques, existing image inpainting methods still show limited improvement. In this paper, we design a deep multi-feature co-learning network for image inpainting, which includes soft-gating Dual Feature Fusion (SDFF) and Bilateral Propagation Feature Aggregation (BPFA) modules. To be specific, we first use two branches to learn structure features and texture features separately. Then the proposed SDFF module integrates structure features into texture features, and meanwhile uses texture features as an auxiliary in generating structure features. Such a co-learning strategy makes the structure and texture features more consistent. Next, the proposed BPFA module enhances the connection from local feature to overall consistency by co-learning contextual attention, channel-wise information and feature space, which can further refine the generated structures and textures. Finally, extensive experiments are performed on benchmark datasets, including CelebA, Places2, and Paris StreetView. Experimental results demonstrate the superiority of the proposed method over the state-of-the-art. The source codes are available at https://***/GZHU-DVL/MFCL-Inpainting.
A Delaunay graph method is a proposed solution to the matching issue in facial recognition. The techniques for feature matching and feature detection are scale-invariant feature transform (SIFT) and speeded-up robust ...
详细信息
The proceedings contain 20 papers. The special focus in this conference is on Data Analytics and Learning. The topics include: Nitrogen Deficiency and Yield Estimation in Paddy Field;medical Image Compression Using Hu...
ISBN:
(纸本)9789819963454
The proceedings contain 20 papers. The special focus in this conference is on Data Analytics and Learning. The topics include: Nitrogen Deficiency and Yield Estimation in Paddy Field;medical Image Compression Using Huffman Coding for Tiff Images;Efficient Wavelet Based Denoising Technique Combined with Features of Cyclespinning and BM3D for Grayscale and Color Images;PBRAMEC: Prioritized Buffer Based Resource Allocation for Mobile Edge computing Devices;surface Water Quality Analysis Using IoT;children Facial Growth pattern Analysis Using Deep Convolutional Neural Networks;classification of Forged Logo Images;detection, Classification and Counting of Moving Vehicles from Videos;two-Stage Word Spotting Scheme for Historical Handwritten Devanagari Documents;face recognition Using Sketch Images;3D Object Detection in Point Cloud Using Key Point Detection Network;IRIS and Face-Based Multimodal Biometrics Systems;a Survey on the Detection of Diseases in Plants Using the Computer Vision-Based Model;an Approach to Conserve Wildlife Habitat by Predicting Forest Fire Using Machine Learning Technique;A Method to Detect Phishing Websites Using Distinctive URL Characteristics by Employing Machine Learning Technique;aquaculture Monitoring System: A Prescriptive Model;machine Learning-Based patternrecognition Models for Image recognition and Classification;a Review of Silk Farming Automation Using Artificial Intelligence, Machine Learning, and Cloud-Based Solutions.
Cyber-attacks on Industrial Control Systems (ICS) present critical risks to operational stability, public safety, and national security. As industrial networks become more integrated and interconnected, their suscepti...
详细信息
This paper represents survey of various signal processing advancement and supported softcomputing techniques for detecting islanding disturbances that occur in distributed generation (DG) based power system which is ...
详细信息
The new age power system is highly subjected to PQ disturbances that require proper attention and address. The research in this field is mainly categorized into different parts such as mathematical modeling, basic PQ ...
详细信息
The new age power system is highly subjected to PQ disturbances that require proper attention and address. The research in this field is mainly categorized into different parts such as mathematical modeling, basic PQ principles, standards, impact and solutions, sources, and analysis. There are several underlying causes behind the occurrence of the PQ disturbance. Therefore, it is important to address the exact underlying cause for proper mitigation of the PQ disturbance. There are several methods available in the literature, which concentrated on to detection and classification of power quality events rather than the root cause of the PQ events. An effective method for root cause identification of PQ events is the need of the day. This article covers a broad review of signal processing and softcomputing techniques used for the detection & recognition of the underlying cause of it. This will help the researcher, engineers, designers working in the field of detection, recognition, and monitoring of power quality. The comparative study of existing methods used in the literature is tabulated. The major concerns and obstacles in categorizing the recognition of power quality disturbances are thoroughly examined and discussed. The potential for new researchers in the field of power quality disturbance detection and recognition of underlying causes is further explored in this review. Copyright (c) 2022 Elsevier Ltd. All rights reserved. Selection and peer-review under responsibility of the scientific committee of the internationalconference on Artificial Intelligence & Energy Systems.
The traditional method relies on historical data and simple statistical model to predict the number of participants in rural cultural square activities, which cannot fully take into account various complex factors and...
详细信息
暂无评论