Segmenting foreground object from a video is a challenging task because of large deformations of objects, occlusions, and background clutter. In this paper, we propose a frame-by-frame but computationally efficient ap...
详细信息
A tone mapping operator converts High Dynamic Range (HDR) images to Low Dynamic Range (LDR) images, which can be seen on LDR displays. There has been a lot of research done in the direction of an optimal Tone Mapping ...
详细信息
Medical image quality assessment (MIQA) is highly related to content interpretation and disease diagnosis in medical community. However, a few metrics have been developed. On the contrary, massive models have been des...
详细信息
ISBN:
(数字)9781510622005
ISBN:
(纸本)9781510622005
Medical image quality assessment (MIQA) is highly related to content interpretation and disease diagnosis in medical community. However, a few metrics have been developed. On the contrary, massive models have been designed for natural image quality assessment (NIQA) in the field of computervision. Connect both sides of MIQA and NIQA is useful and challenging. This study explores signal-to-noise ratio (SNR) as the intermediate metric to bridge the gap between MIQA and NIQA and consequently, models for NIQA can be employed or modified for MIQA applications. A number of 411 images from 4 magnetic resonance (MR) imaging sequences are collected. First, the consistency of SNR in MIQA is validated which involves inter-rater and intra-rater (inter-session) reliability analysis. Then, 4 NIQA models (BIQI, BLIINDS-II, BRISQUE and NIQE) are evaluated on these MR images. After that, the correlation between SNR values and NIQA results are analyzed. Statistical analysis indicates that SNR measurement shows reliability regard to different raters in each sequence. Moreover, BLIINDS-II and BRISQUE have the potential for automated MIQA tasks. This study attempts to use SNR bridging the gap between MIQA and NIQA, and a large-scale experiment should be further conducted to verify the conclusion.
We present a non-intrusive automated system to translate human postures into Labanotation, a graphical notation for human postures and movements. The system uses Kinect to capture the human postures, identifies the po...
详细信息
Optimum bit-allocation between texture video and depth map in 3D video results in better virtual view quality. To incorporate this, rate distortion optimization (RDO) property is used. The RDO in 3D video implies mini...
详细信息
We forget action steps and perform some unwanted action movements as amateur performers during our daily exercise routine, dance performances, etc. To improve our proficiency, it is important that we get a feedback on...
详细信息
We propose a deep learning based technique to classify actions based on Long Short Term Memory (LSTM) networks. The proposed scheme first learns spatial temporal features from the video, using an extension of the Conv...
详细信息
We show that finetuning pretrained CNNs entirely on synthetic images is an effective strategy to achieve transfer learning. We apply this strategy for detecting packaged food products clustered in refrigerator scenes....
详细信息
We propose an end-to-end real time framework to generate high resolution graphics grade textured 3D map of urban environment. The generated detailed map finds its application in the precise localization and navigation...
详细信息
ISBN:
(纸本)9781538680940
We propose an end-to-end real time framework to generate high resolution graphics grade textured 3D map of urban environment. The generated detailed map finds its application in the precise localization and navigation of autonomous vehicles. It can also serve as a virtual test bed for various vision and planning algorithms as well as a background map in the computer games. In this paper, we focus on two important issues: (i) incrementally generating a map with coherent 3D surface, in real time and (ii) preserving the quality of color texture. To handle the above issues, firstly, we perform a pose-refinement procedure which leverages camera image information, Delaunay triangulation and existing scan matching techniques to produce high resolution 3D map from the sparse input LIDAR scan. This 3D map is then texturized and accumulated by using a novel technique of ray-filtering which handles occlusion and inconsistencies in pose-refinement. Further, inspired by human fovea, we introduce foveal-processing which significantly reduces the computation time and also assists ray-filtering to maintain consistency in color texture and coherency in 3D surface of the output map. Moreover, we also introduce texture error (TE) and mean texture mapping error (MTME), which provides quantitative measure of texturing and overall quality of the textured maps.
暂无评论