We propose a conceptually simple and thus fast multi-object tracking (MOT) model that does not require any attached modules, such as the Kalman filter, Hungarian algorithm, transformer blocks, or graph networks. Conve...
详细信息
ISBN:
(纸本)9781728198354
We propose a conceptually simple and thus fast multi-object tracking (MOT) model that does not require any attached modules, such as the Kalman filter, Hungarian algorithm, transformer blocks, or graph networks. Conventional MOT models are built upon the multi-step modules listed above, and thus the computational cost is high. Our proposed end-to-end MOT model, TicrossNet, is composed of a base detector and a cross-attention module only. As a result, the overhead of tracking does not increase significantly even when the number of instances (N-t) increases. We show that TicrossNet runs in real-time;specifically, it achieves 32.6 FPS on MOT17 and 31.0 FPS on MOT20 (Tesla V100), which includes as many as >100 instances per frame. We also demonstrate that TicrossNet is robust to N-t;thus, it does not have to change the size of the base detector, depending on N-t, as is often done by other models for real-timeprocessing.
Modern wafer inspection systems in Integrated Circuit (IC) manufacturing utilize deep neural networks. The training of such networks requires the availability of a very large number of defective or faulty die patterns...
详细信息
ISBN:
(纸本)9781510673878;9781510673861
Modern wafer inspection systems in Integrated Circuit (IC) manufacturing utilize deep neural networks. The training of such networks requires the availability of a very large number of defective or faulty die patterns on a wafer called wafer maps. The number of defective wafer maps on a production line is often limited. In order to have a very large number of defective wafer maps for the training of deep neural networks, generative models can be utilized to generate realistic synthesized defective wafer maps. This paper compares the following three generative models that are commonly used for generating synthesized images: Generative Adversarial Network (GAN), Variational Auto-Encoder (VAE), and CycleGAN which is a variant of GAN. The comparison is carried out based on the public domain wafer map dataset WM-811K. The quality aspect of the generated wafer map images is evaluated by computing the five metrics of peak signal-to-noise ratio (PSNR), structural similarity index measure (SSIM), inception score (IS), Frechet inception distance (FID), and kernel inception distance (KID). Furthermore, the computational efficiency of these generative networks is examined in terms of their deployment in a real-time inspection system.
With the rapid development of video-on-demand (VOD) and real-time streaming video technologies, the accurate objective assessment of streaming video Quality of Experience (QoE) has become a focal point for optimizing ...
详细信息
ISBN:
(纸本)9798331529543;9798331529550
With the rapid development of video-on-demand (VOD) and real-time streaming video technologies, the accurate objective assessment of streaming video Quality of Experience (QoE) has become a focal point for optimizing streaming-related technologies. However, due to the inherent transmission distortions caused by poor Quality of Service (QoS) conditions in streaming videos, such as intermittent stalling, rebuffering, and drastic changes in video sharpness due to bitrate fluctuations, evaluating streaming video QoE presents numerous challenges. This paper introduces a large and diverse in-the-wild streaming video QoE evaluation dataset - the SJLIVE-1k dataset. This work addresses the limitations of corresponding datasets, which lack in-the-wild video sequences under real network conditions and whose amount of video content is insufficient. Furthermore, we propose an end-to-end objective QoE evaluation strategy that extracts video content and QoS features from the video itself without using any extra information. By implementing self-supervised contrastive learning as the "reminder" to bridge the gap between the different types of features, our approach achieves state-of-the-art results across three datasets. Our proposed dataset will be released to facilitate further research.
The field of imageprocessing is playing a vital role in making technological changes those results in realtime applications. image scaling is one of such fundamental method that helps to resolve storage issue and al...
详细信息
A blurred image is an image that has undergone a blurring or smoothing effect, resulting in a loss of sharpness and clarity. Blurring is a technique used in imageprocessing to reduce noise, remove unwanted details, o...
详细信息
Childhood leukaemia demands meticulous blood cell analysis for diagnosis, focusing on morphological irregularities like asymmetry and abnormal cell counts. Traditional manual diagnosis via microscopic blood smear imag...
详细信息
ISBN:
(纸本)9781510673199;9781510673182
Childhood leukaemia demands meticulous blood cell analysis for diagnosis, focusing on morphological irregularities like asymmetry and abnormal cell counts. Traditional manual diagnosis via microscopic blood smear images suffers from limitations: reduced reliability, time intensiveness, and observer variability. Automated methods using Computer-Aided Diagnostic (CAD) systems address these challenges. Integrating real-timeimage preprocessing and segmentation ensures swift operation, reducing the CAD system's processingtime and enhancing its overall effectiveness, enabling timely medical intervention and better patient outcomes. This study aims to simplify the overall algorithmic complexity of other state-of-the-art techniques using preprocessing steps, including bilateral filtering and Contrast-Limited Adaptive Histogram Equalization (CLAHE), alongside the segmentation stage involving morphological operations and the watershed algorithm. Therefore, an edge-based approach is proposed to enhance the segmentation mask as these operations aggregate time complexity. This work also describes a parallel implementation utilizing OpenMP and CUDA of the proposed method, evaluating its performance using Intersection over Union (IoU) and F1-score metrics along with computing time and algorithmic complexity. Furthermore, computing time is compared using two x86-64 and two ARM architectures, each with a different number of available CPU cores, to evaluate the behaviour of the proposed approach in different conditions. The study highlights the benefits of parallel processing in enhancing the efficiency and accuracy of White Blood Cell (WBC) Segmentation. It reports that the proposed approach has an average IoU of 0.9439 and an F1-score of 0.9697 with an imageprocessing rate higher than 60 images per second.
We present a real-time system for vehicle detection and classification in road intersections, incorporating imageprocessing techniques. This system estimates the traffic flow at a specific point, as it is capable of ...
详细信息
ISBN:
(纸本)9781510673199;9781510673182
We present a real-time system for vehicle detection and classification in road intersections, incorporating imageprocessing techniques. This system estimates the traffic flow at a specific point, as it is capable of recognizing the trajectories of different vehicles at an intersection, inferring whether they leave or enter the city. It is designed to be integrated into a high-fidelity digital twin, aiding in estimating environmental traffic pollutants. Since Computational Fluid Dynamics (CFD) use estimators like average or aggregate measurements, we use more accurate methods to estimate pollution. The implications of our study are significant for urban planning and traffic management. It allows for immediate decisions and informs long-term infrastructure planning by providing a deep understanding of intersection dynamics. Our research offers a comprehensive perspective on traffic analysis, introducing data-driven traffic management strategies for efficient urban mobility. The code developed for this purpose can be found in https://***/capo- urjc/TrackingSORT
The article discusses the challenges of real-time data processing and analyzes various methods used to solve them, with a focus on imageprocessing. It points out the limitations of existing methods and argues for the...
详细信息
ISBN:
(纸本)9781510672895;9781510672888
The article discusses the challenges of real-time data processing and analyzes various methods used to solve them, with a focus on imageprocessing. It points out the limitations of existing methods and argues for the need to use more effective and modern technologies, proposing parallel- hierarchical networks as a promising solution. The article provides a detailed description of the structural-functional model of this type of network, which involves cyclically transforming the input data matrix using a "common part" criterion and an array evolution operator until a set of individual elements is formed. The proposed model is expected to improve real-timeimage recognition and can potentially be applied to other fields by using the "common part" criterion.
This real-time 4K image super-resolution (SR) is an intensive task that demands substantial computational resources, typically allocated to GPUs or ASICs. In contrast, most high-quality video transcoding workloads are...
详细信息
ISBN:
(纸本)9798350389166;9798350389173
This real-time 4K image super-resolution (SR) is an intensive task that demands substantial computational resources, typically allocated to GPUs or ASICs. In contrast, most high-quality video transcoding workloads are executed on CPUs. This disparity necessitates the use of different hardware platforms for super-resolution and transcoding, resulting in increased complexity and elevated costs. With the advent of the new AI instructions, AMX, on the 5th generation Xeon platform, a single-socket CPU is now capable of delivering over 200 TOPs of INT8 computational power. Despite this advancement, the current quantization algorithms for SR models, including Quantization-Aware Training (QAT) and Post-Training Quantization (PTQ), fall short of expectations. In this paper, we introduce an innovative hybrid algorithm tailored for the quantization of image super-resolution models, designed to strike an optimal balance between processing speed and image quality.
This work offers a thorough method for real-time dehazing of drone-captured images by different filtering techniques with post-processing improvements. Enhancing visibility and picture clarity in hazy situations is th...
详细信息
暂无评论