This study seeks to investigate the utilization of neural networks for the generation of high-quality image interpolation. This introduction endeavors to deliver a comprehensive exploration of the foundational princip...
详细信息
In the domain of hyperspectral imaging classification (HIC), the challenge of limited labeled training samples, commonly referred to as the Hughes phenomenon, poses a significant obstacle. As hyperspectral datasets ca...
详细信息
In winter scenes, the degradation of images taken under snow can be pretty complex, where the spatial distribution of snowy degradation varies from image to image. Recent methods adopt deep neural networks to recover ...
详细信息
ISBN:
(纸本)9783031263125;9783031263132
In winter scenes, the degradation of images taken under snow can be pretty complex, where the spatial distribution of snowy degradation varies from image to image. Recent methods adopt deep neural networks to recover clean scenes from snowy images directly. However, due to the paradox caused by the variation of complex snowy degradation, achieving reliable high-Definition image desnowing performance in real time is a considerable challenge. We develop a novel Efficient Pyramid Network with asymmetrical encoder-decoder architecture for realtime HD image desnowing. The general idea of our proposed network is to utilize the multi-scale feature flow fully and implicitly to mine clean cues from features. Compared with previous state-of-the-art desnowing methods, our approach achieves a better complexity-performance trade-off and effectively handles the processing difficulties of HD and Ultra-HD images. The extensive experiments on three large-scale image desnowing datasets demonstrate that our method surpasses all state-of-the-art approaches by a largemargin both quantitatively and qualitatively, boosting the PSNR metric from 31.76 dB to 34.10 dB on the CSD test dataset and from 28.29 dB to 30.87 dB on the SRRS test dataset. The source code is available at https://***/Owen718/Towards-Real time-highDefinition-Image-Snow-Removal-Efficient-Pyramid-Network.
With the rapid development of internet applications such as cloud computing and streaming media, people put higher demands on data center (DC) switching capabilities. Technology limitations of the backplane cause bott...
详细信息
ISBN:
(纸本)9781665406079
With the rapid development of internet applications such as cloud computing and streaming media, people put higher demands on data center (DC) switching capabilities. Technology limitations of the backplane cause bottlenecks in its performance, which limits the growth rate of the communication ability of DC. The technology of waveguides based on polymer materials and waveguide connectors on the backplane has become an essential issue in the last few years. Architectures of optical interconnection remain to be explored. We propose a strictly non-blocking Clos-based optical architecture, which uses densely integrated optical waveguides to connect optical chips and achieve on-board networking. In addition, we design a Ring-Clos architecture to reduce the packet loss rate. The simulation results indicate that our design has good performance on throughput, packet loss rate, and latency.
Large-scale single-stream pre-training has shown dramatic performance in image-text retrieval. Regrettably, it faces low inference efficiency due to heavy attention layers. Recently, two-stream methods like CUP and AL...
详细信息
ISBN:
(数字)9781665469463
ISBN:
(纸本)9781665469463
Large-scale single-stream pre-training has shown dramatic performance in image-text retrieval. Regrettably, it faces low inference efficiency due to heavy attention layers. Recently, two-stream methods like CUP and ALIGN with high inference efficiency have also shown promising performance, however;they only consider instance-level alignment between the two streams (thus there is still room for improvement). To overcome these limitations, we propose a novel Collaborative Two-Stream vision-language pre-training model termed COTS for image-text retrieval by enhancing cross-modal interaction. In addition to instance-level alignment via momentum contrastive learning, we leverage two extra levels of cross-modal interactions in our COTS: (1) Token-level interaction - a masked vision-language modeling (MVLM) learning objective is devised without using a cross-stream network module, where variational autoencoder is imposed on the visual encoder to generate visual tokens for each image. (2) Task-level interaction - a KL-alignment learning objective is devised between text-to-image and image-to-text retrieval tasks, where the probability distribution per task is computed with the negative queues in momentum contrastive learning. Under a fair comparison setting, our COTS achieves the highest performance among all two-stream methods and comparable performance (but with 10,800x faster in inference) ***. the latest single-stream methods. Importantly, our COTS is also applicable to text-to-video retrieval, yielding new state-of-the-art on the widely-used MSR-VTT dataset.
Knowledge-enhanced pre-trained language models (KEPLMs) leverage relation triples from knowledge graphs (KGs) and integrate these external data sources into language models via self-supervised learning. Previous works...
详细信息
Intracerebral hemorrhage (ICH) is the most fatal subtype of stroke and is characterized by a high incidence of disability. Accurate segmentation of the ICH region and prognosis prediction are critically important for ...
详细信息
With the rapid development of industrial automation, higher requirements are put forward for reliable and deterministic communication in industrial networks. And time-sensitive networking (TSN) is a promising technolo...
详细信息
This research proposes a design method to reduce shaft voltage of the propulsion motors based on the motor configuration considering electromagnetic performances. First, design variables that affect parasitic capacita...
详细信息
program synthesis is the process of generating a computerprogram following a set of specifications, which can be a high-level description of the problem and/or a set of input-output examples. The synthesis can be mod...
详细信息
ISBN:
(纸本)9798400701191
program synthesis is the process of generating a computerprogram following a set of specifications, which can be a high-level description of the problem and/or a set of input-output examples. The synthesis can be modeled as a search problem in which the search space is the set of all the programs valid under a grammar. As the search space is vast, brute force is usually not viable and search heuristics, such as genetic programming, also have difficulty navigating it without any guidance. In this paper we present HOTGP, a new genetic programming algorithm that synthesizes pure, typed, and functional programs. HOTGP leverages the knowledge provided by the rich data-types associated with the specification and the built-in grammar to constrain the search space and improve the performance of the synthesis. The grammar is based on Haskell's standard base library (the synthesized code can be directly compiled using any standard Haskell compiler) and includes support for higher-order functions,..-functions, and parametric polymorphism. Experimental results show that, when compared to 6 stateof-the-art algorithms using a standard set of benchmarks, HOTGP is competitive and capable of synthesizing the correct programs more frequently than any other of the evaluated algorithms.
暂无评论