Convolutional neural networks (CNNs) have achieved considerable success across a spectrum of computervision tasks, with applications ranging from healthcare to automated driving. Recent literature has also explored i...
详细信息
ISBN:
(纸本)9798400710810
Convolutional neural networks (CNNs) have achieved considerable success across a spectrum of computervision tasks, with applications ranging from healthcare to automated driving. Recent literature has also explored its potential utility in trading and risk management within the finance industry. Despite their versatility, CNNs are substantially constrained by their data-hungry nature. The lack of well-labeled image datasets poses a major challenge to the widespread adoption of CNNs in financial machine learning research across academia and industry. To address these concerns, this work presents Generative-CNN, a novel approach that utilizes a generative adversarial network (GAN) to synthetically generate images to enhance the performance of a CNN with applications in trading.
We present a novel boundary-aware loss term for semantic segmentation using an inverse-transformation network, which efficiently learns the degree of parametric transformations between estimated and target boundaries....
详细信息
ISBN:
(纸本)9781665445092
We present a novel boundary-aware loss term for semantic segmentation using an inverse-transformation network, which efficiently learns the degree of parametric transformations between estimated and target boundaries. This plug-in loss term complements the cross-entropy loss in capturing boundary transformations and allows consistent and significant performance improvement on segmentation backbone models without increasing their size and computational complexity. We analyze the quantitative and qualitative effects of our loss function on three indoor and outdoor segmentation benchmarks, including Cityscapes, NYU-Depth-v2, and PASCAL, integrating it into the training phase of several backbone networks in both single-task and multi-task settings. Our extensive experiments show that the proposed method consistently outperforms baselines, and even sets the new state-of-the-art on two datasets.
A recurrent structure is a popular framework choice for the task of video super-resolution. The state-of-the-art method BasicVSR adopts bidirectional propagation with feature alignment to effectively exploit informati...
详细信息
The loss of a pet has a significant emotional impact on the owner and the pet. This sad experience generates worrying in the owner, while the pet gets stressed with fear and loneliness. Supporting owners to find their...
详细信息
This paper presents an AI-based classroom monitoring system utilizing computervision and machine learning. The system employs Multi-Task Cascaded Convolutional Neural Networks (MTCNN) for face detection, FaceNet for ...
详细信息
ISBN:
(纸本)9798350308266;9798350308259
This paper presents an AI-based classroom monitoring system utilizing computervision and machine learning. The system employs Multi-Task Cascaded Convolutional Neural Networks (MTCNN) for face detection, FaceNet for facial recognition, and a CNN model trained on the Facial Expression recognition (FER) 2013 dataset for emotional analysis. A novel heterogeneous approach combines Field Programmable Gate Arrays (FPGA) and a central processor to overcome the limitations of BRAM and complex computation constraints. Evaluation in real-world classrooms yielded a promising 70% accuracy in emotion detection, marking a significant stride in the field. This research not only advances AI-based monitoring systems but also indicates potential applications in surveillance and security.
The integration of facial recognition technologies with the monitoring of vital signs, especially blood pressure (BP) and oxygen saturation (SpO2) measurements, has emerged as a promising approach in healthcare qualit...
详细信息
underwater object detection and recognition are critical technologies in marine science and engineering, enabling advancements in environmental monitoring, resource exploration, and autonomous underwater vehicles. The...
详细信息
Removing text from images is a challenging task, particularly when dealing with fuzzy borders, shading, and embossing. This process is essential but can be difficult to achieve using existing solutions, which may not ...
详细信息
There have been many successful implementations of neural style transfer in recent years. In most of these works, the stylization process is confined to the pixel domain. However, we argue that this representation is ...
详细信息
ISBN:
(纸本)9781665445092
There have been many successful implementations of neural style transfer in recent years. In most of these works, the stylization process is confined to the pixel domain. However, we argue that this representation is unnatural because paintings usually consist of brushstrokes rather than pixels. We propose a method to stylize images by optimizing parameterized brushstrokes instead of pixels and further introduce a simple differentiable rendering mechanism. Our approach significantly improves visual quality and enables additional control over the stylization process such as controlling the flow of brushstrokes through user input. We provide qualitative and quantitative evaluations that show the efficacy of the proposed parameterized representation. Code is available at https://***/CompVis/brushstroke-parameterizedstyle-transfer.
Climate change is a pressing issue that is currently affecting and will affect every part of our lives. It's becoming incredibly vital we, as a society, address the climate crisis as a universal effort, including ...
详细信息
ISBN:
(纸本)9781665448994
Climate change is a pressing issue that is currently affecting and will affect every part of our lives. It's becoming incredibly vital we, as a society, address the climate crisis as a universal effort, including those in the computervision (CV) community. In this work, we analyze the total cost of CO2 emissions by breaking it into (1) the architecture creation cost and (2) the life-time evaluation cost. We show that over time, these costs are non-negligible and are having a direct impact on our future. Importantly, we conduct an ethical analysis of how the CV-community is unintentionally overlooking its own ethical AI principles by emitting this level of CO2. To address these concerns, we propose adding "enforcement" as a pillar of ethical AI and provide some recommendations for how architecture designers and broader CV community can curb the climate crisis.
暂无评论