The fields of computervision and imageprocessing, which deal with the analysis and modification of visual data like pictures and videos, are closely related. While computervision focuses on tasks like object recogn...
详细信息
The paper seeks to reflect on ways to create an effective regulation by understanding the aspects of random reward mechanisms in video games, also known as loot boxes. Players acquire digital assets of a game through ...
详细信息
Diabetic retinopathy is a common complication of diabetes that can lead to vision loss. Accurate detection of diabetic retinopathy is crucial for early intervention and treatment. This paper proposes a diabetic retino...
详细信息
This research evaluated the user experience (UX) on the top three most visited e-commerce websites in Malaysia, and identified common and unique issues which affected user experience while interacting with such sites....
详细信息
The proceedings contain 36 papers. The topics discussed include: simple measurement of user response;is gesture-based interaction equally viable in manual and autonomous driving?;visual analytics for the marriage netw...
ISBN:
(纸本)9789898704498
The proceedings contain 36 papers. The topics discussed include: simple measurement of user response;is gesture-based interaction equally viable in manual and autonomous driving?;visual analytics for the marriage network in the Goryeo Dynasty, Korea;gradient pattern analysis applied for computervision in medical ultrasound diagnosis;prototyping visualizations as a support for selecting representative models of petroleum reservoirs;how to teach programming with electronic games;a mobile conducting app with a switching beat-following algorithm;an augmented reality game for iOS smartphones;a gamified application for mitigating plastic pollution in coastal areas;and the impact of perceived challenge on narrative immersion in RPG video games: a preliminary study.
Interactive image segmentation aims to segment the target from the background with the manual guidance, which takes as input multimodal data such as images, clicks, scribbles, polygons, and bounding boxes. Recently, v...
ISBN:
(纸本)9798350307443
Interactive image segmentation aims to segment the target from the background with the manual guidance, which takes as input multimodal data such as images, clicks, scribbles, polygons, and bounding boxes. Recently, vision transformers have achieved a great success in several downstream visual tasks, and a few efforts have been made to bring this powerful architecture to interactive segmentation task. However, the previous works neglect the relations between two modalities and directly mock the way of processing purely visual information with self-attentions. In this paper, we propose a simple yet effective network for click-based interactive segmentation with cross-modality vision transformers. Cross-modality transformers exploit mutual information to better guide the learning process. The experiments on several benchmarks show that the proposed method achieves superior performance in comparison to the previous state-of-the-art models. In addition, the stability of our method in term of avoiding failure cases shows its potential to be a practical annotation tool. The code and pretrained models will be released under https: //***/lik1996/iCMFormer.
With the rapid advances of deep learning-based computervision (CV) technology, digital images are increasingly consumed, not by humans, but by downstream CV algorithms. However, capturing high-fidelity and high-resol...
详细信息
ISBN:
(纸本)9798400700958
With the rapid advances of deep learning-based computervision (CV) technology, digital images are increasingly consumed, not by humans, but by downstream CV algorithms. However, capturing high-fidelity and high-resolution images is energy-intensive. It not only dominates the energy consumption of the sensor itself (i.e. in low-power edge devices), but also contributes to significant memory burdens and performance bottlenecks in the later storage, processing, and communication stages. In this paper, we systematically explore a new paradigm of in-sensor processing, termed "learned compressive acquisition" (LeCA). Targeting machine vision applications on the edge, the LeCA framework exploits the joint learning of a sensor autoencoder structure with the downstream CV algorithms to effectively compress the original image into low-dimensional features with adaptive bit depth. We employ column-parallel analog-domain processing directly inside the image sensor to perform the compressive encoding of the raw image, resulting in meaningful hardware savings, and energy efficiency improvements. Evaluated within a modern machine visionprocessing pipeline, LeCA achieves 4x, 6x, and 8x compression ratios prior to any digital compression, with minimal accuracy loss of 0.97%, 0.98%, and 2.01% on imageNet, outperforming existing methods. Compared with the conventional full-resolution image sensor and the state-of-the-art compressive sensing sensor, our LeCA sensor is 6.3x and 2.2x more energy-efficient while reaching a 2x higher compression ratio.
The encryption of images is an essential component of ensuring data security in the digital age. Delving into chaotic mappings, our study unveils their robust potential for image encryption. In this paper, we propose ...
详细信息
Zero-shot learning is a popular strategy for low-light image enhancement, as it allows convolutional neural networks (CNNs) to be trained without paired data. However, existing zero-shot learning methods often lead to...
详细信息
At present, the application of industrial robots combined with visual systems to achieve dynamic grasping materials of the belt is becoming increasingly widespread. Unlike the behavior of industrial robots grabbing af...
详细信息
At present, the application of industrial robots combined with visual systems to achieve dynamic grasping materials of the belt is becoming increasingly widespread. Unlike the behavior of industrial robots grabbing after the belt stops, industrial robots dynamically tracking and grabbing materials on the belt can greatly improve production efficiency. For vision, processing distorted material images during high-speed movement and to obtain accurate coordinate points of materials is a key task to improve the accuracy of the belt tracking applications with industrial robot. Developing imageprocessing algorithms based on MATLAB can, on the one hand, utilize existing software and hardware interface functions to improve development efficiency;On the other hand, autonomous and controllable imageprocessing algorithms can be developed based on application requirements to maximize system accuracy.
暂无评论