Reference-free low-light image enhancement methods only employ low-light images during training, thereby significantly alleviating the over-reliance on obtaining paired or unpaired datasets. Existing reference-free lo...
详细信息
Reference-free low-light image enhancement methods only employ low-light images during training, thereby significantly alleviating the over-reliance on obtaining paired or unpaired datasets. Existing reference-free low-light image enhancement approaches still struggle to strike a balance between enhancing vivid color and suppressing noise in low-light images. To mitigate such issues, we propose a novel deep learning-based reference-free method that contains two phases, separating the low-light image enhancement into decomposition and refinement problems. In the decomposition phase, we present a value channel prior based on histogram equalization on HSv color space, termed as v-HE prior. Inspired by retinex theory, v-HE prior guides the decomposition network (Dec-Net) to estimate the reflectance component of the value channel. To further refine the pre-enhanced result, we construct a structure-aware loss to guide the refinement network (Ref-Net) in the refinement phase. We conduct extensive experiments to verify the effectiveness of the proposed method, qualitatively and quantitatively. Compared with other reference-free algorithms, our approach effectively addresses the challenges of low-light image enhancement and significantly improves image quality.
Subject of study. This study investigates the influence of optical-system parameters on the error in determining the orientation and position of fiducial markers. Aim of study. This study determines the dependencies o...
详细信息
Subject of study. This study investigates the influence of optical-system parameters on the error in determining the orientation and position of fiducial markers. Aim of study. This study determines the dependencies of absolute error in position and orientation on various influencing factors. Method. The error in a machine-vision system is assessed based on fiducial markers using computer-image modeling in the Unity 3D graphics system. Main results. Over 100,000 images of AprilTag markers in different positions and orientations were synthesized and processed during the simulation. The results of this simulation yielded the dependencies of absolute position and orientation errors on the distance between the camera and marker, the rotation angle of the marker, and the focal lengths of the camera. Practical significance. The obtained results may be utilized to optimize the placement of markers on the platform, select the optimal video camera positions and lens focal lengths, and implement adjustments in the image-processing algorithm. These changes can improve measurement accuracy in systems used for developing orientation algorithms for microsatellites. (c) 2024 Optica Publishing Group
The rapid growth of Internet of vehicle (Iov) devices and Artificial Intelligence (AI) applications has accelerated the adoption of Cloud and Edge Computing. The advent of sixth-generation mobile communication technol...
详细信息
The rapid growth of Internet of vehicle (Iov) devices and Artificial Intelligence (AI) applications has accelerated the adoption of Cloud and Edge Computing. The advent of sixth-generation mobile communication technology (6G) further facilitates the deployment of Cloud-Edge collaborative computing in large-scale Intelligent Transportation systems (ITS). Effective ITS must efficiently handle both latency-sensitive tasks (e.g., obstacle detection, traffic signal recognition) and computationally intensive tasks (e.g., path optimization, traffic flow prediction). However, existing Cloud-Edge collaborative frameworks struggle to accurately classify diverse workloads and provide efficient low-latency processing, leading to energy inefficiencies and task failures. To address these challenges, this paper introduces a Deep Learning-based Cloud-Edge Collaboration Framework (CECF) designed to optimize energy conservation in Cloud and Edge environments. CECF employs a DNN-based classifier to categorize workloads for processing in the Cloud or Edge. The classified tasks are managed by a dedicated Cloud scheduler (DSGA) and an Edge scheduler (EA-DFPSO), respectively. To enhance scheduling efficiency for highly variable Cloud tasks, DSGA incorporates a novel self-adaptive mutation algorithm and a random point fixed distance crossover method. Extensive evaluations using real-world workload traces demonstrate that CECF achieves up to a 8.5% improvement in system reliability and reduces energy consumption by 35.88% compared to baseline approaches.
This paper considers a scientific school of synthesis of samples and creation of datasets, which is a part of the family of scientific schools associated with imageprocessing and analysis, originating from the work o...
详细信息
This paper considers a scientific school of synthesis of samples and creation of datasets, which is a part of the family of scientific schools associated with imageprocessing and analysis, originating from the work of a team led by Prof. v.L. Arlazarov in the 1970s. As part of the work of the school, the researchers have obtained important fundamental and applied results as well as set new research tasks. Over the years of the school's existence the scientific team has developed several algorithms and systems for the synthesis and augmentation of image samples. Moreover, they have created and published more than ten open annotated image datasets, including the unique MIDv dataset family that contains synthesized images of identity documents and is the first in the world to allow a full open comparison of recognition systems for such documents.
imageprocessing using neural networks act as a tool to speed up predictions for users, specifically on large-scale image samples. To guarantee the clean data for training accuracy, various deep learning-based adversa...
详细信息
imageprocessing using neural networks act as a tool to speed up predictions for users, specifically on large-scale image samples. To guarantee the clean data for training accuracy, various deep learning-based adversarial attack detection techniques have been proposed. These crisp set-based detection methods directly determine whether an image is clean or attacked, while, calculating the loss is nondifferentiable and hinders training through normal back-propagation. Motivated by the recent success in fuzzy systems, in this work, we present an attack detection method to further improve detection performance, which is suitable for any pretrained neural network classifier. Subsequently, the fuzzification network is used to obtain feature maps to produce fuzzy sets of difference degree between clean and attacked images. The fuzzy rules control the intelligence that determines the detection boundaries. Different from previous fuzzy systems, we propose a fuzzy mean-intelligence mechanism with new support and confidence functions to improve fuzzy rule's quality. In the defuzzification layer, the fuzzy prediction from the intelligence is mapped back into the crisp model predictions for images. The loss between the prediction and label controls the rules to train the fuzzy detector. We show that the fuzzy rule-based network learns rich feature information than binary outputs and offer to obtain an overall performance gain. Experiment results show that compared to various benchmark fuzzy systems and adversarial attack detection methods, our fuzzy detector achieves better detection performance over a wide range of images.
This paper is devoted to a review of the achievements of the Moscow scientific school of image recognition, formed under the leadership of Professor vladimir L'vovich Arlazarov, in the field of development and app...
详细信息
This paper is devoted to a review of the achievements of the Moscow scientific school of image recognition, formed under the leadership of Professor vladimir L'vovich Arlazarov, in the field of development and application of the viola-Jones method. One of the main areas of research at the school is the development of computationally efficient recognition algorithms, which requires a deep understanding of the problem and a wide expertise in the field of existing classical algorithms. Such classic method as the viola-Jones method became an essential tool to solve a wide range of image recognition problems. This paper provides an overview of the modifications of the original method developed by the scientific school and describes in detail the experience of solving many different practical problems that arise in the development of modern energy-efficient image recognition systems.
作者:
Li, JingmengWei, HuiFudan Univ
Sch Comp Sci Lab Algorithms Cognit Models Shanghai 200438 Peoples R China Fudan Univ
Innovat Ctr Callig & Painting Creat Technol Sch Comp Sci Lab Algorithms Cognit ModelsMCT Shanghai 200438 Peoples R China
Perceptual edge grouping is a technique for organizing the cluttered edge pixels into meaningful structures and further serves high-level vision tasks, which has long been a basic and critical task in computer vision....
详细信息
Perceptual edge grouping is a technique for organizing the cluttered edge pixels into meaningful structures and further serves high-level vision tasks, which has long been a basic and critical task in computer vision. Existing methods usually have a poor performance when coping with the junctions caused by occlusion and noise in natural images. In this paper, we present GPGrouper, a perceptual edge grouping model based on gestalt theory and the primary visual cortex (v1). Different from the existing methods, GPGrouper leverages the edge representation and grouping matrix (ERGM), a functional structure inspired by v1 mechanisms, to represent edges in a way that can effectively reduce grouping errors caused by occlusion between objects. ERGM is trained with natural image contours and further provides a priori guidance for the construction of the edge connection graph (ECG) that is useful to minimize the impact of noise on grouping. In the experiment, we compared GPGrouper and the state-of-the-art (SOTA) method of perceptual grouping on the visual psychology pathfinder challenge. The results demonstrate that GPGrouper outperforms the SOTA method in grouping performance. Furthermore, in the grouping experiments involving line segments with varying lengths detected by the Line Segment Detector (LSD), as well as those involving superpixel segmentation results with significant levels of interfering noise using the SLIC algorithm, GPGrouper was superior to the existing methods in terms of grouping effect and robustness. Moreover, the results of applying the grouping results to the vision tasks objectness demonstrate that GPGrouper can contribute significantly to high-level visual tasks.
This paper presents an innovative way of image compression using Field-Programmable Gate Array (FPGA) implementation of the Integer Wavelet Transform (IWT) and Discrete Wavelet Transform (DWT) algorithms. For situatio...
详细信息
A definition of neuroiconics is proposed as a branch of science at the intersection of human and animal physiology and iconics that studies neurophysiological processes and algorithms for processingvideo information ...
详细信息
A definition of neuroiconics is proposed as a branch of science at the intersection of human and animal physiology and iconics that studies neurophysiological processes and algorithms for processingvideo information and evaluates the possibility of using these algorithms in technical systems. (c) 2022 Optica Publishing Group
Since the preceding decade, there has been a great deal of interest in forecasting landslides using remote-sensing images. Early detection of possible landslide zones will help to save lives and money. However, this a...
详细信息
暂无评论