Monocular 3D object detection is a crucial topic in autonomous driving and Intelligent transportation systems (ITS). Most existing methods are evaluated on clean datasets but exhibit arresting performance degradation ...
详细信息
It is well known that videoimages captured by in-vehicle cameras and surveillance cameras are degraded due to noise introduced by their shooting environments. In this paper, we remove such noise using a learning meth...
详细信息
Circulatory system abnormalities might be an indicator of diseases or tissue damage. Early detection of vascular abnormalities might have an important role during treatment and also raise the patient's awareness. ...
详细信息
Circulatory system abnormalities might be an indicator of diseases or tissue damage. Early detection of vascular abnormalities might have an important role during treatment and also raise the patient's awareness. Current detection methods for vascular imaging are high-cost, invasive, and mostly radiation-based. In this study, a low-cost and portable microcomputer-based tool has been developed as a Near-Infrared (NIR) superficial vascular imaging device. The device uses NIR Light-Emitting Diode (LED) light at 850 nm along with other electronic and optical components. It operates as a non-contact and safe infrared (IR) imaging method in real-time. image and video analysis are carried out using OpenCV (Open-Source Computer Vision), a library of programming functions mainly used in computer vision. Various tests were carried out to optimize the imaging system and set up a suitable external environment. To test the performance of the device, the images taken from three diabetic volunteers, who are expected to have abnormalities in the vascular structure due to the possibility of deformation caused by high glucose levels in the blood, were compared with the images taken from two non-diabetic volunteers. As a result, tortuosity was observed successfully in the superficial vascular structures, where the results need to be interpreted by the medical experts in the field to understand the underlying reasons. Although this study is an engineering study and does not have an intention to diagnose any diseases, the developed system here might assist healthcare personnel in early diagnosis and treatment follow-up for vascular structures and may enable further opportunities.
With the increasing demand for high-resolution video and real-timeprocessing, the limited efficiency of video-denoising algorithms has become a critical factor. This paper proposes a spatio-temporal video denoising c...
详细信息
With the increasing demand for high-resolution video and real-timeprocessing, the limited efficiency of video-denoising algorithms has become a critical factor. This paper proposes a spatio-temporal video denoising co-processor to suppress an image sequence's spatial and temporal noise. Temporal denoising is achieved by merging the current and previous frames at the pixel level in which the current frame is processed by a spatial filter. After exploiting noise estimation and motion detection, the Wiener filter calculates the merge ratio. Rather than buffering the entire previous frame, the JPEG-like codec can dynamically adjust the compression ratio through a predefined quantization table to satisfy the designed on-chip storage. The experimental results demonstrate that the spatio-temporal denoising co-processor can effectively eliminate the fluctuation of the grayscale value of the noise in videos. Simultaneously, the adaptive codec can reduce the storage space consumption for the frame buffer by at least 80% of the original size. To the best of our knowledge, this is the first fully integrated spatio-temporal denoising co-processor without any external memory. Additionally, the grayscale, RGB, and RAW versions of the co-processor are also implemented on the Stratix V FPGA platform and synthesized in 28nm CMOS technology.
video-based Dynamic Mesh Coding (V-DMC) is an emerging standard for dynamic mesh compression, where the original meshes are decimated into simplified meshes called base meshes. This paper introduces a novel SKIP type ...
详细信息
ISBN:
(纸本)9798331529543;9798331529550
video-based Dynamic Mesh Coding (V-DMC) is an emerging standard for dynamic mesh compression, where the original meshes are decimated into simplified meshes called base meshes. This paper introduces a novel SKIP type for base mesh coding in V-DMC, complementing the existing INTRA and INTER types. When the SKIP type is used in base mesh coding, it directly copies the reconstructed base mesh from the reference frame, eliminating the need for additional data coding. Thus, the reconstructed base mesh in the current frame is identical to that in the reference frame. This significantly reduces the bit rate and decoding time for base meshes. Additionally, this paper employs a Lagrangian cost function using a linear model for bit estimation of INTRA type and L1 norms for distortion approximation of SKIP type to enable the encoder to select the best type for base meshes. Experimental results demonstrate superior BD-rate performance and significantly reduced decoding time for base meshes using the SKIP type, particularly in sequences with minimal object movements.
Breast cancer stands as the foremost cause of cancer-related deaths among women worldwide. The prompt and accurate detection of breast lesions through ultrasound videos plays a crucial role in early diagnosis. However...
详细信息
ISBN:
(纸本)9789819985579;9789819985586
Breast cancer stands as the foremost cause of cancer-related deaths among women worldwide. The prompt and accurate detection of breast lesions through ultrasound videos plays a crucial role in early diagnosis. However, existing ultrasound video lesion detectors often rely on multiple adjacent frames or non-local temporal fusion strategies to enhance performance, consequently compromising their detection speed. This study presents a simple yet effective network called the Space time Feature Aggregation Network (STA-Net). Its main purpose is to efficiently identify lesions in ultrasound videos. By leveraging a temporally shift-based space-time aggregation module, STA-Net achieves impressive real-timeprocessing speeds of 54 frames per second on a single GeForce RTX 3090 GPU. Furthermore, it maintains a remarkable accuracy level of 38.7 mean average precision (mAP). Through extensive experimentation on the BUV dataset, our network surpasses existing state-of-the-art methods both quantitatively and qualitatively. These promising results solidify the effectiveness and superiority of our proposed STA-Net in ultrasound video lesion detection.
In the article describe the methods of increasing the contrast of images in video information systems, the main compression of the video stream is provided by eliminating inter-frame redundancy using motion compensati...
详细信息
A parking system refers to the infrastructure and technology used to manage and facilitate the parking of vehicles in a specific area. These systems are designed to streamline the process of finding parking spaces, en...
详细信息
This paper proposes a generalized self-cueing real-time attention scheduling framework for DNN-based visual machine perception pipelines on resource-limited embedded platforms. Self-cueing means we identify subframe-l...
详细信息
This paper proposes a generalized self-cueing real-time attention scheduling framework for DNN-based visual machine perception pipelines on resource-limited embedded platforms. Self-cueing means we identify subframe-level regions of interest in a scene internally by exploiting temporal correlations among successive video frames as opposed to externally via a cueing sensor. One limitation of our original self-cueing-and-inspection strategy (Liu et al. in Proceedings of the 28th IEEE real-time and embedded technology and applications symposium (RTAS), 2022b) lies in its lack of computational efficiency under high workloads, like busy traffic scenarios where a large number of objects are identified and separately inspected. We extend the conference publication by integrating image resizing with intermittent inspection and task batching in attention scheduling. The extension enhances the original algorithm by accelerating the processing of large objects by reducing their resolution at the cost of only a negligible degradation in accuracy, thereby achieving a higher overall object inspection throughput. After extracting partial regions around objects of interest, using an optical flow-based tracking algorithm, we allocate computation resources (i.e. DNN inspection) to them in a criticality-aware manner using a generalized batched proportional balancing algorithm (GBPB), to minimize a concept of generalized system uncertainty. It saves computational resources by inspecting low-priority regions intermittently at low frequencies and inspecting large objects at low resolutions. We implement the system on an NVIDIA Jetson Xavier platform and extensively evaluate its performance using a real-world driving dataset from Waymo. The proposed GBPB algorithm consistently outperforms the previous BPB algorithm that only uses intermittent inspection and a set of baselines. The performance gain of GBPB is larger in facing more significant resource constraints (i.e., lower sampling inte
With the public Internet becoming more unstable due to issues like network congestion and line outages, ensuring the quality of large-scale real-time communications is becoming a significant concern. The real-time str...
详细信息
暂无评论