the construction of multi-scale image pyramids is used in state-of-the-art methods that perform robust object recognition, such as SIFT and SURF. However, building such image pyramids is computationally expensive, esp...
详细信息
ISBN:
(纸本)9789897580000
the construction of multi-scale image pyramids is used in state-of-the-art methods that perform robust object recognition, such as SIFT and SURF. However, building such image pyramids is computationally expensive, especially when implementations in embedded systems with limited computing resources are considered. therefore, the use of alternative less expensive approaches are necessary if near real-time operation is desired. Previous work has reported that using binomial filters to construct half-octave multi-scale pyramids consumes only 1=4 of the processing time of the Gaussian pyramid originally used in the SIFT framework. Here we investigate how interest points detected using the binomial approach behave when compared to the Gaussian approach, focusing on repeatability. Experimental results show that in average up to 86% of interest points detected withthe original SIFT pyramid building scheme are also detected when using the binomial method, despite of large gains in processing time. When rotation of image features is considered, experimental results demonstrate that slightly superior repeatability of interest points is achieved using the binomial pyramid.
the proceedings contain 34 papers. the topics discussed include: human sign recognition for robot manipulation;fuzzy sets for human fall pattern recognition;vision system for 3D reconstruction with telecentric lens;a ...
ISBN:
(纸本)9783642311482
the proceedings contain 34 papers. the topics discussed include: human sign recognition for robot manipulation;fuzzy sets for human fall pattern recognition;vision system for 3D reconstruction with telecentric lens;a tool for hand-sign recognition;improving the multiple alignments strategy for fingerprint verification;breaking reCAPTCHAs with unpredictable collapse: heuristic character segmentation and recognition;using short-range interactions and simulated genetic strategy to improve the protein contact map prediction;associative model for solving the wall-following problem;the list of clusters revisited;a heuristically perturbation of dataset to achieve a diverse ensemble of classifiers;compact and efficient permutations for proximity searching;and NURBS parameterization: a new method of parameterization using the correlation relationship between nodes.
the problem of curve detection decomposes naturally into two stages: (i) inferring the (discrete) trace and tangent to the curve;and (ii) finding integrals through the resultant tangent field. Orientation selection is...
详细信息
the ability to perceive the environment accurately is a core requirement for autonomous navigation. In the past, researchers and practitioners have explored a broad spectrum of sensors that can be used to detect obsta...
详细信息
ISBN:
(纸本)9781728152370
the ability to perceive the environment accurately is a core requirement for autonomous navigation. In the past, researchers and practitioners have explored a broad spectrum of sensors that can be used to detect obstacles or to recognize navigation targets. Due to their low hardware cost and high fidelity, stereo camera systems are often considered to be a particularly versatile sensing technology. Consequently, there has been a lot of work on integrating them into mobile robots. However, the existing literature focuses on presenting the concepts and algorithms used to implement the desired robot functions on top of a given camera setup. As a result, the rationale and impact of choosing this camera setup are usually neither discussed nor described. thus, when designing the stereo camera system for a mobile robot, there is not much general guidance beyond isolated setups that worked for a specific robot. To close the gap, this paper studies the impact of the physical setup of a stereo camera system in indoor environments. To do this, we present the results of an experimental analysis in which we use a given software setup to estimate the distance to an object while systematically changing the camera setup. thereby, we vary the three main parameters of the physical camera setup, namely the angle and distance between the cameras as well as the field of view. Based on the results, we derive several guidelines on how to choose the parameters for an application.
the emergence of cheap structured light sensors, like the Kinect, opened the door to an increased interest in all matters related to the processing of 3D visual data. Applications for these technologies are abundant, ...
详细信息
Epilepsy is a chronic neurological disease, and locating the lesions precisely is crucial to the success of epilepsy surgery. the high-frequency oscillations (HFOs) in magnetoencephalography (MEG) of epileptic patient...
详细信息
Image fusion is the task of generating images from the same scene with different sensors. Images obtained by different sensors have different expressions under the image target semantics. Good semantic aware images ca...
详细信息
Untangling ropes, wires, and cables is a challenging task for robots due to the high-dimensional configuration space, visual homogeneity, self-occlusions, and complex dynamics. We consider dense (tight) knots that lac...
详细信息
Untangling ropes, wires, and cables is a challenging task for robots due to the high-dimensional configuration space, visual homogeneity, self-occlusions, and complex dynamics. We consider dense (tight) knots that lack space between self-intersections and present an iterative approach that uses learned geometric structure in configurations. We instantiate this into an algorithm, HULK: Hierarchical Untangling from Learned Keypoints, which combines learning-based perception with a geometric planner into a policy that guides a bilateral robot to untangle knots. To evaluate the policy, we perform experiments both in a novel simulation environment modelling cables with varied knot types and textures and in a physical system using the da Vinci surgical robot. We find that HULK is able to untangle cables with dense figure-eight and overhand knots and generalize to varied textures and appearances. We compare two variants of HULK to three baselines and observe that HULK achieves 43.3% higher success rates on a physical system compared to the next best baseline. HULK successfully untangles a cable from a dense initial configuration containing up to two overhand and figure-eight knots in 97.9% of 378 simulation experiments with an average of 12.1 actions per trial. In physical experiments, HULK achieves 61.7% untangling success, averaging 8.48 actions per trial. Supplementary material, code, and videos can be found at https://***/y3a88ycu
robots, endowed withcomputervision function by on-board cameras and embedded systems, have become popular in the modem power industry. In order to realize automatic image analysis at the data acquisition part, a fas...
详细信息
ISBN:
(纸本)9781728143903
robots, endowed withcomputervision function by on-board cameras and embedded systems, have become popular in the modem power industry. In order to realize automatic image analysis at the data acquisition part, a fast detection algorithm especially for small target based on YOLOv3 was proposed. It is proposed to up sample by 2 X feature map which is down sampled by 8 X of the previous network, and concatenate it withthe output of the second Res block unit, and the last Res block unit of the previous network was abandoned. To obtain more features of the small target, add two Resnet units in the second Res block unit of Darknet53 in YOLO V3 network structure. Aiming at deploying the algorithm to the hardware, model pruning was applied to decrease model size and improve detection speed. the experimental results demonstrate that the mAP (Mean Average Precision) reaches 0.85, and the proposed algorithm is capable of real-time detection as faster than 15 frames per second (fps).
暂无评论