Very recently, a memory-efficient version (called MeZO) of simultaneous perturbation stochastic approximation (SPSA), a well-established zeroth-order optimizer from the automatic control community, has shown competi...
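As a rough illustration of the SPSA idea mentioned above, the following minimal sketch estimates a gradient from two loss evaluations along one random direction. The function names, the toy quadratic loss, the step size, and the use of a seed to regenerate the perturbation are all assumptions made for illustration; this is not the MeZO implementation.

```python
import random

def spsa_gradient(loss, theta, eps=1e-3, seed=0):
    """Two-point SPSA estimate of the gradient of `loss` at `theta` (hypothetical helper)."""
    rng = random.Random(seed)
    # Random +1/-1 perturbation, one entry per parameter. Because it is
    # derived from the seed, it could be regenerated instead of stored.
    z = [rng.choice((-1.0, 1.0)) for _ in theta]
    loss_plus = loss([t + eps * zi for t, zi in zip(theta, z)])
    loss_minus = loss([t - eps * zi for t, zi in zip(theta, z)])
    scale = (loss_plus - loss_minus) / (2.0 * eps)
    # Classical SPSA divides by z_i; for +1/-1 entries this equals multiplying.
    return [scale * zi for zi in z]

def quadratic(theta):
    """Toy loss (an assumption for this sketch): squared Euclidean norm."""
    return sum(t * t for t in theta)

theta = [1.0, -2.0, 0.5]
initial_loss = quadratic(theta)
for step in range(200):
    grad = spsa_gradient(quadratic, theta, seed=step)
    theta = [t - 0.05 * g for t, g in zip(theta, grad)]
final_loss = quadratic(theta)
```

Only forward evaluations of the loss are needed, which is why such estimators avoid storing backpropagation state.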
ISBN: (print) 9789819755875; 9789819755882
Feature fusion techniques are a critical research topic in computer vision and play an extensive role in downstream tasks that require rich object representations, such as image classification, semantic segmentation, and object detection. The Feature Pyramid Network (FPN), proposed in 2017 for object detection, emerged as a groundbreaking work among feature fusion methods. Subsequently, a variety of powerful and innovative feature fusion architectures have been proposed, bringing significant performance gains in object detection tasks. This paper takes an in-depth look at feature fusion technologies, starting from the domain of object detection. It systematically categorizes existing methods into two main classes based on the intrinsic properties of their fusion structures: simple topological fusion structures and complex topological fusion structures. The paper analyzes and summarizes the mechanisms of these two types of structures, introduces the evaluation datasets and evaluation metrics, and provides a collation of open-source code for mainstream feature fusion architectures. Finally, through this systematic review, the paper summarizes the challenges faced by feature fusion methods and provides an outlook on future development trends in this area.
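To make the notion of a simple fusion structure concrete, here is a minimal sketch of an FPN-style top-down pathway: each coarser map is upsampled by a factor of two and added element-wise to the next finer map. It is a simplification under stated assumptions: single-channel maps stored as nested lists, no learned lateral or smoothing convolutions, and hypothetical function names.

```python
def upsample2x(fmap):
    """Nearest-neighbour 2x upsampling of a 2D list (hypothetical helper)."""
    out = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]  # repeat each column twice
        out.append(wide)
        out.append(list(wide))                     # repeat each row twice
    return out

def top_down_fuse(levels):
    """Fuse maps ordered fine-to-coarse; each level is half the size of the previous one."""
    fused = [levels[-1]]  # the coarsest level is passed through unchanged
    for lateral in reversed(levels[:-1]):
        up = upsample2x(fused[0])
        merged = [[a + b for a, b in zip(r1, r2)] for r1, r2 in zip(lateral, up)]
        fused.insert(0, merged)
    return fused

# Toy backbone outputs (assumed values): 4x4, 2x2 and 1x1 maps.
c3 = [[1.0] * 4 for _ in range(4)]
c4 = [[10.0] * 2 for _ in range(2)]
c5 = [[100.0]]
p3, p4, p5 = top_down_fuse([c3, c4, c5])
```

In this toy run, every cell of the finest output carries contributions from all three levels, which is the effect the top-down pathway is meant to achieve.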
Logical proportions are a type of propositional connector that involves four variables, expressed as a formula that encodes the conjunction of two equivalences. These equivalences refer to indicators of similarity or ...
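As an illustration of a formula built from the conjunction of two equivalences, the sketch below encodes the analogical proportion "a is to b as c is to d", one commonly cited logical proportion, and enumerates the valuations that satisfy it. The choice of this particular proportion and the function name are assumptions for illustration; the abstract itself is truncated and may focus on other cases.

```python
from itertools import product

def analogical_proportion(a, b, c, d):
    """True when a differs from b exactly as c differs from d (illustrative encoding)."""
    # First equivalence: "a holds but not b" iff "c holds but not d".
    first = (a and not b) == (c and not d)
    # Second equivalence: "b holds but not a" iff "d holds but not c".
    second = (not a and b) == (not c and d)
    return first and second

# Enumerate all 16 Boolean valuations and keep the satisfying ones.
valid = [bits for bits in product([False, True], repeat=4)
         if analogical_proportion(*bits)]
```

Under this encoding, 6 of the 16 valuations satisfy the proportion, for example (False, True, False, True), while (False, True, True, False) does not.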
Image fusion aims to integrate complementary information from different images to provide richer scene details. However, in real-world scenarios, low-light illumination conditions not only affect the brightness, contr...
With the advent of the big data era, there is a growing demand for users to store data in the cloud. To enhance the availability and privacy of data in the cloud and mitigate risks such as vendor lock-in, distributed mult...
The HERitage sMart social mEdia aSsistant project offers innovative services enabling contextualized, multi-perspective, and cross-cultural explorations of the rich and varied cultural heritage of a territory. The pro...
A combinatorial algorithm is presented here which partitions a given orthogonal polyhedron P (genus zero and non-self-intersecting) into an approximately minimum number of cuboids in O(n log n) time, where n is ...
Improving algorithmic literacy empowers people to engage with algorithm-driven products across myriad applications with material impact. However, there remains a shortage of interventions aimed at nurturing algorithmi...
With the rapid development of artificial intelligence (AI) in recent years, generative AI, represented by ChatGPT, Gemini, and Sora, has demonstrated powerful capabilities in many fields. Such AI tools are "pen...
In our paper, we propose the Adaptive Attention-based Generative Adversarial Network (AAGAN) for text-to-image generation. The model combines multi-layer GANs and Adaptive Attention Mechanisms to control the f...