Successful detection of Out-of-Distribution (OoD) data is becoming increasingly important to ensure safe deployment of neural networks. One of the main challenges in OoD detection is that neural networks output overco...
Successful detection of Out-of-Distribution (OoD) data is becoming increasingly important to ensure safe deployment of neural networks. One of the main challenges in OoD detection is that neural networks output overconfident predictions on OoD data, make it difficult to determine OoD-ness of data solely based on their predictions. Outlier exposure addresses this issue by introducing an additional loss that encourages low-confidence predictions on OoD data during training. While outlier exposure has shown promising potential in improving OoD detection performance, all previous studies on outlier exposure have been limited to utilizing visual outliers. Drawing inspiration from the recent advancements in vision-language pre-training, this paper venture out to the uncharted territory of textual outlier exposure. First, we uncover the benefits of using textual outliers by replacing real or virtual outliers in the image-domain with textual equivalents. Then, we propose various ways of generating preferable textual outliers. Our extensive experiments demonstrate that generated textual outliers achieve competitive performance on large-scale OoD and hard OoD benchmarks. Furthermore, we conduct empirical analyses of textual outliers to provide primary criteria for designing advantageous textual outliers: near-distribution, descriptiveness, and inclusion of visual semantics. Code is available at https://***/wiarae/TOE
This paper presents a novel algorithm for reachability analysis of nonlinear discrete-time systems. The proposed method combines constrained zonotopes (CZs) with polyhedral relaxations of factorable representations of...
详细信息
The prolonged waiting time at supermarket checkout lines poses a significant challenge to the shopping experience, impacting customer satisfaction and operational efficiency. This paper presents a prototype that addre...
The prolonged waiting time at supermarket checkout lines poses a significant challenge to the shopping experience, impacting customer satisfaction and operational efficiency. This paper presents a prototype that addresses this issue by utilizing computer vision and deep learning. The model, incorporating convolutional neural networks such as YOLO v4 tiny and YOLO v5 small, along with tools like OpenCV and Roboflow for dataset management, achieves a remarkable 98% mean average precision for two-class detection. It efficiently detects, classifies, tracks, and counts items on a mobile supermarket conveyor belt. Additionally, we introduce a versatile framework designed for seamless integration into real-world applications. It comprises a customizable monitoring application and simulator that facilitates synthetic image data generation. Managing diverse items in a supermarket presents a major challenge for data gathering, labeling, and training. In that sense, the importance of customizable monitoring and simulation tools is highlighted, emphasizing their practical role. Our findings demonstrate the feasibility of maintaining a minimal 0% to 2.85% precision tradeoff while using half of the data as synthetic for two-class detection, indicating potential practicality in supermarkets with proper scaling. In summary, this study brings tangible benefits to both customers and retailers, offering a potential to streamline, speed up, and cut costs in the supermarket checkout process.
Image classification has been instrumental in the interpretation and labeling of images in the field of remote sensing, computer vision, and in robotics applications. Machine learning and artificial intelligence algor...
详细信息
ISBN:
(数字)9798350391084
ISBN:
(纸本)9798350391091
Image classification has been instrumental in the interpretation and labeling of images in the field of remote sensing, computer vision, and in robotics applications. Machine learning and artificial intelligence algorithms, particularly artificial neural networks, are extensively utilized for this purpose. In this work we propose the Expanded Latent Space Autoencoder (ELSA) with a case use application to classify land cover data. The main idea on the ELSA network structure is to utilize the latent spaces of multiple internal autoencoders in order to create an expanded latent space. This expanded latent space extracts more information from the input data, and serves as input features for a more simpler classifier network. In order to evaluate the proposed network's ability to extract features and classify complex and multispectral images we employed it to the EuroSAT dataset. The results demonstrate a remarkable capacity for feature extraction using the ELSA network, with lower complexity, trained with a reduced number of images. The classifier network achieved a final accuracy of 98.7%, matching or exceeding the performance of more complex state-of-the-art models.
Efficient quantum repeaters are needed to combat photon losses in fibers in future quantum networks. Single atom coupled with photonic cavity offer a great platform for photon-atom gate. Here I propose a quantum repea...
详细信息
Reconfigurable Intelligent Surface (RIS) is a transformative technology using the passive beamforming capabilities to create the program.able wireless environments. However, the control overhead is a challenging issue...
详细信息
In this paper, we consider the problem of characterizing a robust global dependence between two brain regions where each region may contain several voxels or channels. This work is driven by experiments to investigate...
详细信息
Sequence modeling faces challenges in capturing long-range dependencies across diverse tasks. Recent linear and transformer-based forecasters have shown superior performance in time series forecasting. However, they a...
ISBN:
(纸本)9798331314385
Sequence modeling faces challenges in capturing long-range dependencies across diverse tasks. Recent linear and transformer-based forecasters have shown superior performance in time series forecasting. However, they are constrained by their inherent inability to effectively address long-range dependencies in time series data, primarily due to using fixed-size inputs for prediction. Furthermore, they typically sacrifice essential temporal correlation among consecutive training samples by shuffling them into mini-batches. To overcome these limitations, we introduce a fast and effective Spectral Attention mechanism, which preserves temporal correlations among samples and facilitates the handling of long-range information while maintaining the base model structure. Spectral Attention preserves long-period trends through a low-pass filter and facilitates gradient to flow between samples. Spectral Attention can be seamlessly integrated into most sequence models, allowing models with fixed-sized look-back windows to capture longrange dependencies over thousands of steps. Through extensive experiments on 11 real-world time series datasets using 7 recent forecasting models, we consistently demonstrate the efficacy of our Spectral Attention mechanism, achieving state-of-the-art results.
Generalized-linear dynamical models (GLDMs) remain a widely-used framework within neuroscience for modeling time-series data, such as neural spiking activity or categorical decision outcomes. Whereas the standard usag...
ISBN:
(纸本)9798331314385
Generalized-linear dynamical models (GLDMs) remain a widely-used framework within neuroscience for modeling time-series data, such as neural spiking activity or categorical decision outcomes. Whereas the standard usage of GLDMs is to model a single data source, certain applications require jointly modeling two generalized-linear time-series sources while also dissociating their shared and private dynamics. Most existing GLDM variants and their associated learning algorithms do not support this capability. Here we address this challenge by developing a multi-step analytical subspace identification algorithm for learning a GLDM that explicitly models shared vs. private dynamics within two generalized-linear time-series. In simulations, we demonstrate our algorithm's ability to dissociate and model the dynamics within two time-series sources while being agnostic to their respective observation distributions. In neural data, we consider two specific applications of our algorithm for modeling discrete population spiking activity with respect to a secondary time-series. In both synthetic and real data, GLDMs learned with our algorithm more accurately decoded one time-series from the other using lower-dimensional latent states, as compared to models identified using existing GLDM learning algorithms.
One of the problems causing low quality of essential oils is the high content of solvent resulting from traditional extraction process employing low purity metallic components. This extracted oil therefore requires fu...
详细信息
暂无评论