Aiming at the current problem of unsatisfactory vehicle detection in complex scenes, an improved vehicle target detection network model is proposed. First, Res2Net residual network is fused in SCP, and the CSP_R struc...
详细信息
With the improvement of information technology, service robots are becoming more and more deeply involved in our work and life, and provide a wider variety of services. How to make robots communicate with humans more ...
详细信息
The next recognized development direction of large language models (LLMs) is to integrate and enhance multimodal capability. Although current multimodal large language models (MLLMs) have achieved impressive performan...
详细信息
The proceedings contain 69 papers. The topics discussed include: predicting mushroom edibility with effective classification and efficient feature selection techniques;performance enhancement of conventional design of...
ISBN:
(纸本)9798350346435
The proceedings contain 69 papers. The topics discussed include: predicting mushroom edibility with effective classification and efficient feature selection techniques;performance enhancement of conventional design of 4-bit carry look-ahead adder;two-bit magnitude comparator design using gate diffusion input technique and static CMOS logic;a comprehensive study of camouflaged object detection using deep learning;fuzzy logic-based design optimization and economic planning of a microgrid for a residential community in Bangladesh;performance analysis of the AVR using an artificial neural network and genetic algorithm optimization technique;electrical impedance measurement technique to determine the impedance of a volume conductor with embedded object;design and implementation of embedded sensor network for an automated radio telescope;and development of a facial recognition pantograph drawing robot.
Adapting pre-trained models to new tasks can exhibit varying effectiveness across datasets. Visual prompting, a state-of-the-art parameter-efficient transfer learning method, can significantly improve the performance ...
详细信息
Cognitive impairment detection through spontaneous speech is a promising avenue for early diagnosis of Alzheimer's disease (AD) and mild cognitive impairment (MCI), where timely intervention can significantly impr...
详细信息
Automatic fruit detection has greatly reduced labor costs and crop damage rates, contributing to the progress of agricultural modernization. It involves real-time assessment of the surrounding environment and recognit...
详细信息
Face photo-sketch synthesis involves transforming photos into sketches and vice versa. A well-transformed image should preserve its original identity characteristics and naturalness. However, identity preservation rem...
详细信息
ISBN:
(纸本)9781728198354
Face photo-sketch synthesis involves transforming photos into sketches and vice versa. A well-transformed image should preserve its original identity characteristics and naturalness. However, identity preservation remains a challenge because of the large discrepancy between the photo and sketch domains. To this end, we propose a novel face photo-sketch synthesis framework that uses domain-invariant feature embedding (DIFE). The DIFE framework generates images assuming the domain-invariant feature of an image pair for the same person to be the identity information. A joint feature embedding module considers latent features from two different domains as input and transfers them into the domain-invariant latent space. Subsequently, a semantic-aware decoder completes the desired image guided by multiscale facial parsing masks. Experimental results demonstrate that the DIFE method outperforms state-of-the-art approaches visually and perceptually.
GNSS is extensively employed for applications requiring high reliability. However, GNSS inherently encompasses varying error factors and is susceptible to malicious attacks such as spoofing. To ensure stable GNSS util...
详细信息
Recently, spatial and temporal inconsistencies have been shown to effectively enhance the generalization performance of face forgery detection, as common forgery strategies create inconsistencies among face regions an...
详细信息
暂无评论