Scene text detection plays an important role in computervision and patternrecognition field in recent years due to extract the accurate and rich text information. At present, component-based methods have become the ...
详细信息
ISBN:
(纸本)9781728114101
Scene text detection plays an important role in computervision and patternrecognition field in recent years due to extract the accurate and rich text information. At present, component-based methods have become the trend, and there are still some challenges because of illumination, blur and difficult background. In this paper, an orientation-correction detection method for scene text based on Spatial Pyramid Pooling Convolutional Neural Networks, SPP-CNN, is proposed. Firstly, an enhanced multi-channel MSER model, which is constructed from R, G, B, H, S, V and grey channels of manually blurred and sharpened images, is built. then, the manually-designed features are embedded in SPP-CNN as the effective text feature detector and classifier. Finally, a twolayer text grouping algorithm is achieved which can handle slightly-slanted text. Experiments on ICDAR 2011 show the f-measure has improved to 84%, the precision and f-measure of ICDAR 2013 have reached 87% and 85% respectively.
Remote sensing image registration technology has important significance in the field of image processing. the registration technology of SAR images has been a hot research in this field, and it is also a challenge. Ai...
详细信息
Aiming at the aircraft target in visible light remote sensing image, this paper proposes a false alarm removal method for target detection under small sample training conditions. First, use data enhancement methods fo...
详细信息
Planes commonly exist in a human-made scene and are useful for robust localization. In this paper, we propose a novel monocular visual-inertial odometry system which leverages multi-plane priors. A novel visual-inerti...
详细信息
Withthe growing cosmopolitan culture of modern cities, the need of robust Multi-Lingual scene Text (MLT) detection and recognition systems has never been more immense. Withthe goal to systematically benchmark and pu...
详细信息
Binocular depth estimation is a hot research topic in computervision. Traditional methods need high precision camera calibration and key point matching, but the results are not ideal. In this paper, we introduce an a...
详细信息
We present the DeepScores dataset withthe goal of advancing the state-of-the-art in small object recognition by placing the question of object recognition in the context of scene understanding. DeepScores contains hi...
详细信息
ISBN:
(纸本)9781538637883
We present the DeepScores dataset withthe goal of advancing the state-of-the-art in small object recognition by placing the question of object recognition in the context of scene understanding. DeepScores contains high quality images of musical scores, partitioned into 300;000 sheets of written music that contain symbols of different shapes and sizes. With close to a hundred million small objects, this makes our dataset not only unique, but also the largest public dataset. DeepScores comes with ground truth for object classification, detection and semantic segmentation. DeepScores thus poses a relevant challenge for computervision in general, and optical music recognition (OMR) research in particular. We present a detailed statistical analysis of the dataset, comparing it with other computervision datasets like PASCAL VOC, SUN, SVHN, ImageNet, MS-COCO, as well as with other OMR datasets. Finally, we provide baseline performances for object classification, intuition for the inherent difficulty that DeepScores poses to state-of-the-art object detectors like YOLO or R-CNN, and give pointers to future research based on this dataset.
Head detection and localization are one of the most investigated and demanding tasks of the computervision community. these are also a key element for many disciplines, like Human computer Interaction, Human Behavior...
详细信息
ISBN:
(纸本)9781538637883
Head detection and localization are one of the most investigated and demanding tasks of the computervision community. these are also a key element for many disciplines, like Human computer Interaction, Human Behavior Understanding, Face Analysis and Video Surveillance. In last decades, many efforts have been conducted to develop accurate and reliable head or face detectors on standard RGB images, but only few solutions concern other types of images, such as depth maps. In this paper, we propose a novel method for head detection on depth images, based on a deep learning approach. In particular, the presented system overcomes the classic sliding-window approach, that is often the main computational bottleneck of many object detectors, through a Fully Convolutional Network. Two public datasets, namely Pandora and Watch-n-Patch, are exploited to train and test the proposed network. Experimental results confirm the effectiveness of the method, that is able to exceed all the state-of-art works based on depth images and to run with real time performance.
the proceedings contain 103 papers. the topics discussed include: parking gate control based on mobile application;traffic analysis using decentralized social Internet of things;faster image compression technique base...
ISBN:
(纸本)9781538651612
the proceedings contain 103 papers. the topics discussed include: parking gate control based on mobile application;traffic analysis using decentralized social Internet of things;faster image compression technique based on LZW algorithm using GPU parallel processing;low latency image processing of transportation system using parallel processing co-incident multithreading (PPcM);neural network bayesian regularization backpropagation to solve inverse kinematics on planar manipulator;a convolutional autoencoder for detecting tumors in double contrast X-ray images;frontal gait recognition from incomplete RGB-D streams using gait cycle analysis;and understanding real time traffic characteristics of urban zones using GPS data: a computational study on Dhaka City
Deep convolution neural network (CNN) is one of the most popular Deep neural networks (DNN). It has won state-of-the-art performance in many computervision tasks. the most used method to train DNN is Gradient descent...
详细信息
暂无评论