Position information is critical for Vision Transformers (VTs) due to the permutation-invariance of self-attention operations. A typical way to introduce position information is adding the absolute Position Embedding ...
Position information is critical for Vision Transformers (VTs) due to the permutation-invariance of self-attention operations. A typical way to introduce position information is adding the absolute Position Embedding (PE) to patch embedding before entering VTs. However, this approach operates the same Layer Normalization (LN) to token embedding and PE, and delivers the same PE to each layer. This results in restricted and monotonic PE across layers, as the shared LN affine parameters are not dedicated to PE, and the PE cannot be adjusted on a per-layer basis. To overcome these limitations, we propose using two independent LNs for token embeddings and PE in each layer, and progressively delivering PE across layers. By implementing this approach, VTs will receive layer-adaptive and hierarchical PE. We name our method as Layer-adaptive Position Embedding, abbreviated as LaPE, which is simple, effective, and robust. Extensive experiments on image classification, object detection, and semantic segmentation demonstrate that LaPE significantly outperforms the default PE method. For example, LaPE improves +1.06% for CCT on CIFAR100, +1.57% for DeiT-Ti on ImageNet-1K, +0.7 box AP and +0.5 mask AP for ViT-Adapter-Ti on COCO, and +1.37 mIoU for tiny Segmenter on ADE20K. This is remarkable considering LaPE only increases negligible parameters, memory, and computational cost.
Exceptional points (EPs) have been extensively explored in mechanical, acoustic, plasmonic, and photonic systems. However, little is known about the role of EPs in tailoring the dynamic tunability of optical devices. ...
详细信息
We have successfully demonstrated, for the first time, an innovative back-end-of-line (BEOL) compatible electro-optic modulator and memory (EOMM) based on Lithium Niobate on Insulator (LNOI) micro-ring resonator (MRR)...
We have successfully demonstrated, for the first time, an innovative back-end-of-line (BEOL) compatible electro-optic modulator and memory (EOMM) based on Lithium Niobate on Insulator (LNOI) micro-ring resonator (MRR) integrated with Ferroelectric Hafnium Zirconate Hf 0.5 Zr 0.5 O (HZO) non-volatile analog memory. High non-volatile memory and modulation performances are both achieved in a single compact device, exhibiting high extinction ratio of 13.3 dB, excellent efficiency of 66pm/V, stable nine-state switching, record-high endurance exceeding 10 9 cycles. This is accomplished by utilizing Pockels effect in LNOI, induced by electric-field effect from remnant HZO ferroelectric polarization. We studied the system implementation of reconfigurable chiplet-interposer photonic interconnect, enabled by the EOMM and EOMM with hybrid thermal-optical modulation. Our model shows a potential 70% energy efficiency improvement over conventional electrical interposer interconnect. We have also tested the integration of the EOMM with POET technologies’ 400G Tx/Rx optical interposer chip and studied a limited scale demonstration of the EOMM device.
Suitably tailored forms of spatiotemporal modulation in electronic circuit networks have been recently employed to overcome fundamental challenges in modern electronic systems, including breaking reciprocity, squeezin...
详细信息
Suitably tailored forms of spatiotemporal modulation in electronic circuit networks have been recently employed to overcome fundamental challenges in modern electronic systems, including breaking reciprocity, squeezing the footprint of high-Q resonators, and overcoming the delay-bandwidth limit. Rotating patterns of temporal modulations have been used to synthesize angular momentum, which replaces magnetic bias to break reciprocity in integrated circuits. However, this approach is limited by trade-offs between modulation speed, footprint, and bandwidth of operation. Rotating switching patterns in commutated capacitor networks also enables compact filters and quasielectrostatic wave propagation, overcoming the delay-bandwidth limit. In this paper, we combine these mechanisms in an integrated-circuit ring that synthetically rotates in two dimensions, realizing an effective helicoidal motion that provides ultrabroadband quasielectrostatic nonreciprocal responses fitting within a theoretically infinitesimal size. We also analyze the impact of modulation signal noise on time-modulated nonreciprocal components and unveil the role of a dynamic noise mechanism based on which the noise level increases in the presence of a strong signal passing through the component, along with methods to mitigate this effect. We experimentally verify these principles in a three-port integrated circulator based on a 65-nm CMOS process that operates from dc to 1 GHz with a miniaturization factor of 2 × 106.
Visible light communication transmits data through light-emitting diodes. In this work a commercial lamp consisting of 7 white LEDs is employed in the demonstration of data transmission, obeying the recommended lighti...
详细信息
ISBN:
(纸本)9781943580910
Visible light communication transmits data through light-emitting diodes. In this work a commercial lamp consisting of 7 white LEDs is employed in the demonstration of data transmission, obeying the recommended lighting requirement for an indoor scenario.
Summary: Named entity recognition (NER) is a fundamental part of extracting information from documents in biomedical applications. A notable advantage of NER is its consistency in extracting biomedical entities in a d...
详细信息
In this research, we have been developed teaching materials for electromagnetics that demonstrate theories using real-world actual applications (capacitors, coils, piezoelectric elements, Wireless Power Transfer(WPT),...
详细信息
This paper examines the reproducibility of massive information analytics under particular factors. The paper proposes the “performing Scalable Inference” technique to cope with scalability troubles and to exploit cu...
This paper examines the reproducibility of massive information analytics under particular factors. The paper proposes the “performing Scalable Inference” technique to cope with scalability troubles and to exploit current big statistics platforms for efficient computing and statistics garage of the statistics. In particular, the paper describes how to perform leak-free, parallelizable visible analytics over massive datasets using present extensive records analytics frameworks such as Apache Flink. This method presents an automated manner to execute analytics that preserves reproducibility and the ability to make adjustments without re-running the entire technique. The paper also demonstrates how these analytics may help several real-world use instances, explore affected person cohorts for studies, and develop stratified patient cohorts for hospital therapy. In the end, the paper observes how the proposed method may be exercised within the real world. Actively scalable inference for massive information analytics is pivotal in optimizing decision-making and allocation of assets. Typically, such inferences are made based on information accumulated from numerous sources, databases, unstructured data, and different digital sources. So one can ensure scalability, a complete cloud-primarily based platform has to be hired. This solution will permit the ***, deploying the essential records series and evaluation algorithms are prime here. It could permit the platform to recognize the styles inside the statistics and discover any ability correlations or traits. Additionally, predictive analytics and system mastering strategies may be incorporated to provide insights into the results of the information. In the long run, by leveraging those techniques, the platform can draw efficient inferences and appropriately compare situations in an agile and green way..
Recent advances in robust semi-supervised learning (SSL) typically filter out-of-distribution (OOD) information at the sample level. We argue that an overlooked problem of robust SSL is its corrupted information on se...
Recent advances in robust semi-supervised learning (SSL) typically filter out-of-distribution (OOD) information at the sample level. We argue that an overlooked problem of robust SSL is its corrupted information on semantic level, practically limiting the development of the field. In this paper, we take an initial step to explore and propose a unified framework termed OOD Semantic Pruning (OSP), which aims at pruning OOD semantics out from in-distribution (ID) features. Specifically, (i) we propose an aliasing OOD matching module to pair each ID sample with an OOD sample with semantic overlap. (ii) We design a soft orthogonality regularization, which first transforms each ID feature by suppressing its semantic component that is collinear with paired OOD sample. It then forces the predictions before and after soft orthogonality decomposition to be consistent. Being practically simple, our method shows a strong performance in OOD detection and ID classification on challenging benchmarks. In particular, OSP surpasses the previous state-of-the-art by 13.7% on accuracy for ID classification and 5.9% on AUROC for OOD detection on TinyImageNet dataset. The source codes are publicly available at https://***/rain305f/OSP.
Circular synthetic aperture sonars (CSAS) capture multiple observations of a scene to reconstruct high-resolution images. We can characterize resolution by modeling CSAS imaging as the convolution between a scene'...
详细信息
暂无评论