Recent text-to-image generative models have demonstrated remarkable abilities in generating realistic images. Despite their great success, these models struggle to generate high-fidelity images with prompts oriented t...
详细信息
Recent text-to-image generative models have demonstrated remarkable abilities in generating realistic images. Despite their great success, these models struggle to generate high-fidelity images with prompts oriented toward human-object interaction (HOI). The difficulty in HOI generation arises from two aspects. Firstly, the complexity and diversity of human poses challenge plausible human generation. Furthermore, untrustworthy generation of interaction boundary regions may lead to deficiency in HOI semantics. To tackle the problems, we propose a Semantic-Aware HOI generation framework SA-HOI. It utilizes human pose quality and interaction boundary region information as guidance for denoising process, thereby encouraging refinement in these regions to produce more reasonable HOI images. Based on it, we establish an iterative inversion and image refinement pipeline to continually enhance generation quality. Further, we introduce a comprehensive benchmark for HOI generation, which comprises a dataset involving diverse and fine-grained HOI categories, along with multiple custom-tailored evaluation metrics for HOI generation. Experiments demonstrate that our method significantly improves generation quality under both HOI-specific and conventional image evaluation metrics. The code is available at https://***/XZPKU/***. Copyright 2024 by the author(s)
Federated Learning (FL) is an emerging privacy-preserving distributed computing paradigm that enables numerous clients to collaboratively train machine learning models without the need for transmitting the private dat...
详细信息
With the emergence of the Metaverse concept, the rendering and transmission of 3D virtual scenes demand high-bandwidth, high-quality real-time rendering technology, as well as ultra-reliable low-latency communication ...
详细信息
Ground elevation estimation is vital for numerous applications in autonomous vehicles and intelligent robotics including three-dimensional object detection,navigable space detection,point cloud matching for localizat...
详细信息
Ground elevation estimation is vital for numerous applications in autonomous vehicles and intelligent robotics including three-dimensional object detection,navigable space detection,point cloud matching for localization,and registration for ***,most works regard the ground as a plane without height information,which causes inaccurate manipulation in these *** this work,we propose GeeNet,a novel end-to-end,lightweight method that completes the ground in nearly real time and simultaneously estimates the ground elevation in a grid-based *** leverages the mixing of two-and three-dimensional convolutions to preserve a lightweight architecture to regress ground elevation information for each cell of the *** the first time,GeeNet has fulfilled ground elevation estimation from semantic scene *** use the SemanticKITTI and SemanticPOSS datasets to validate the proposed GeeNet,demonstrating the qualitative and quantitative performances of GeeNet on ground elevation estimation and semantic scene completion of the point ***,the crossdataset generalization capability of GeeNet is experimentally *** achieves state-of-the-art performance in terms of point cloud completion and ground elevation estimation,with a runtime of 0.88 ms.
Electroencephalogram (EEG) analysis is a critical tool for diagnosing various neurological disorders. Intelligent EEG models facilitate the analysis and diagnosis of these conditions. However, the development of such ...
详细信息
Index access is one of the dominant performance factors in transactional database systems. Many systems use a B+tree or one of its variants to handle point and range operations. This access pattern has room for perfor...
详细信息
Since the advent of cryptocurrencies such as Bitcoin, blockchain, as their underlying technologies,has drawn a massive amount of attention from both academia and the industry. This ever-evolving technology inherits t...
详细信息
Since the advent of cryptocurrencies such as Bitcoin, blockchain, as their underlying technologies,has drawn a massive amount of attention from both academia and the industry. This ever-evolving technology inherits the “genes” of distributed systems, offering significant advantages of immutability, transparency,auditability, and tamper-resistance. These benefits help blockchain re-establish public confidence, and hold the significant promise of reliable information sharing and value transfer. Therefore, blockchain has become the foundation of crucial strategic deployments in countries across the world, and the fundamental basis for building the next generation Web 3.0 — “Internet of value”. In this article, we will start with unraveling the essential ingredients of blockchain technology, and showing the characteristics of each of these ingredients in the context of distributed systems. We will then present the core technical challenges that need to be addressed prior to unleashing its full potential, including its performance, scalability, and cross-chain interoperability. Finally, we will introduce the recent developments of blockchain systems, and discuss the future trends of the blockchain ecosystem.
The images captured under extreme lighting conditions can exhibit severe image degradation, which significantly impacts the performance of downstream visual tasks. Existing deep learning-based approaches for low-light...
详细信息
Software system developments generally involve writing codes. As code reduction is not considered, with accelerated software development, the number of code increases, which in turn increases the system management loa...
详细信息
Agriculture performs an critical position in India's economic system. Early detection of plant illnesses is critical to save you crop damage and similarly spread of diseases. Most plants, along with apple, tomato,...
详细信息
暂无评论