High-performance Activity Recognition models from video data are difficult to train and deploy efficiently. We measure efficiency in performance, model size, and run-time;during training and inference. Researchers hav...
详细信息
ISBN:
(数字)9781510634107
ISBN:
(纸本)9781510634107
High-performance Activity Recognition models from video data are difficult to train and deploy efficiently. We measure efficiency in performance, model size, and run-time;during training and inference. Researchers have demonstrated that 3D convolutions capture the space-time dynamics well [13]. The challenge is that 3D convolutions are computationally intensive. [8] Propose the Temporal Shift Module (TSM) for train-efficiency, and [5] proposes DeepCompression for inference-efficiency. TSM is a simple yet effective way to gain near 3D convolution performance at 2D convolution computation cost. We apply these efficiency techniques to a newly labeled activity recognition data set through transfer learning. Our labeling strategy is meant to create highly temporal activity. We benchmark against a 2D ResNet50 backbone trained on individual frames, and a multilayer 3DCNN on multi-frame short videos. Our contributions are: 1. A new highly temporal activity recognition dataset based on egoHands [1]. 2. results that show a 3D backbone on videos outperforms a 2D one on frames. 3. With TSM we achieve 5x train efficiency in run-time with negligible performance loss. 4. With Quantization alone we achieve 10x inference efficiency in model size with negligible performance loss.
Motion segmentation has applications in, amongst others, robotics, traffic monitoring, sports analysis, inspection, video surveillance, compression, and video indexing. However, the performance of most methods is limi...
详细信息
ISBN:
(数字)9781728195209
ISBN:
(纸本)9781728195216
Motion segmentation has applications in, amongst others, robotics, traffic monitoring, sports analysis, inspection, video surveillance, compression, and video indexing. However, the performance of most methods is limited compared to human capabilities. Based on extensive literature the following challenges remain: occlusions, temporary stopping, missing data, and segmenting multiple objects. In this paper, several popular and state-of-the-art methods were reviewed, with the focus on the most important attributes. These methods were classified according to the main approach taken, namely image Difference, Optical Flow, Wavelet, Statistical, Layers, Manifold Clustering, Template Matching, and Deep Learning. The investigated methods are compared and major research challenges are highlighted. Based on the review, improvements are identified as a basis for future research.
Fish-eye lenses are used more and more widely in the field of photography andcomputervision nowadays. However, images captured with these lenses suffer from high barrel-type spatial distortion, so severely errors ar...
详细信息
ISBN:
(纸本)9781450372800
Fish-eye lenses are used more and more widely in the field of photography andcomputervision nowadays. However, images captured with these lenses suffer from high barrel-type spatial distortion, so severely errors are brought into the camera calibration, feature extraction, parameter calculation and following analyses. This paper proposes a new and effective method to correct the high distortion. With the improved model in this paper, the degree of correction changes in accordance with the distance from the center pixel location. And the inverse mapping from corrected images space to the distorted images space based on the cubic spline interpolation method is taken to eliminate the vacant pixels in the corrected images caused by forwarding mapping. The final correction results are satisfactory by using this improved method.
Artificial Intelligence has become the new powerhouse of data analytics in this technological era. With advent of different Machine Learning andcomputervision algorithms, applying them in data analytics has become a...
详细信息
In this paper, we propose an asynchronous paradigm for controlling a car using steady-state visual evoked potential (SSVEP)-based brain-computer interface (BCI) and conduct experimental tests on real car outside the l...
详细信息
ISBN:
(数字)9781510634107
ISBN:
(纸本)9781510634107
In this paper, we propose an asynchronous paradigm for controlling a car using steady-state visual evoked potential (SSVEP)-based brain-computer interface (BCI) and conduct experimental tests on real car outside the laboratory. The paradigm uses six stimulation frequencies to classify targets by canonical correlation analysis (CCA) method and generates multi-task vehicle control strategies, including left and right turn signals, wipers, horns, doors and hazard lights. Four healthy volunteers participated in the online car control experiment, and the average correct rate reached 88.43%. Subject S1 showed the most satisfactory BCI-based performance, and its true positive rate and false positive rate were in line with expectations. The research shows the feasibility and effectiveness of the paradigm in automotive control applications, which lays the foundation for future research and development of related brain-controlled automotive technologies, thereby helping individuals with mobility impairments to provide supplements or alternatives, and can also provide an auxiliary vehicle driving strategy for healthy people.
image enhancement is the preprocessing task in digital imageprocessing. It helps to improve the appearance or perception of the image so that the image can be used for analytics and human visual system. image enhance...
详细信息
Stereo matching is one of the most dynamic fields in computervision. Though its relevant research has already stepped into a mature stage, there are still certain challenges to obtain real-time and high-precision dis...
详细信息
This article presents a neural network and machine vision-based approach to classify the vegetables as normal or affected. The farmers will have great difficulty if there is a change from one disease control to anothe...
详细信息
Action recognition is a leading research topic in the field of computervision. This paper proposes an effective method for action recognition task based on the skeleton data. Four features are proposed based on the j...
详细信息
Disease detection in crops and plants is essential for production of good and improved quality of food, life and a stable agricultural economy. It becomes tedious and time consuming to observe the infected parts of pl...
详细信息
暂无评论