In this paper, we proposed an improved coarse to fine improved algorithm to enhance the accuracy of facial key landmark points locating. Based on the analysis of PCA, the proposed algorithm redesigns the parameter upd...
In this paper, we proposed an improved coarse to fine improved algorithm to enhance the accuracy of facial key landmark points locating. Based on the analysis of PCA, the proposed algorithm redesigns the parameter update rule through adding a monotonically decreasing inert factor function to the traditional ASM iterations (D-ASM). The new rule could update parameters at a finer process. Besides, we compare the performances of different types of inert factor functions and select the suitable one. Furthermore, we further design a classifier-based algorithm for the more accurate locating of 2D key corner points. Finally, local D-ASM is constructed and the inner landmarks are further fitting with corner points fixed. Experimental results on various faces demonstrate the effectiveness and rationality of our proposed algorithm.
In this paper, a novel unified channel model framework is proposed for cooperative multiple-input multiple-output (MIMO) wireless channels. The proposed model framework is generic and adaptable to multiple cooperative...
详细信息
In this paper, a novel unified channel model framework is proposed for cooperative multiple-input multiple-output (MIMO) wireless channels. The proposed model framework is generic and adaptable to multiple cooperative MIMO scenarios by simply adjusting key model parameters. Based on the proposed model framework and using a typical cooperative MIMO communication environment as an example, we derive a novel geometry-based stochastic model (GBSM) applicable to multiple wireless propagation scenarios. The proposed GBSM is the first cooperative MIMO channel model that has the ability to investigate the impact of the local scattering density (LSD) on channel characteristics. From the derived GBSM, the corresponding multi-link spatial correlation functions are derived and numerically analyzed in detail.
This paper is a survey of visualization and analysis techniques for medical 3D images for researchers and students in computer science. Publications from this decade with internationally high evaluation are reviewed, ...
详细信息
In this research, the authors evaluate the degree to which dancers copy or follow the techniques of a master, or the degree of proficiency, by analyzing movements in traditional Japanese dance. The data used consist o...
详细信息
This paper proposes a detection method of moving ships from the navigational image sequence that was taken with cameras installed on the bridge of the ship. The image is influenced by roll and pitch of the ship. There...
详细信息
This paper proposes a detection method of moving ships from the navigational image sequence that was taken with cameras installed on the bridge of the ship. The image is influenced by roll and pitch of the ship. Therefore, usual technique of imageprocessing cannot be used such as Finite Difference method. Moreover, as many sea waves appear in the images, the image cannot be dealt without rejecting sea waves. The technique in this paper proposes how to get rid of the influence of roll, pitch and sea waves. After that, we detect the ships. An interval between each of the frames is one-thirty second in this experiment. The size of the images is 640 pixels in width and 480 pixels in height. At first, the frames are segmented into about 5000 regions using brightness value of the image. About 100 regions were estimated with the ship. Each region was matched with another two frames, which passed 0.33 second and 0.66 second. 0.33 second corresponds to 10 frames. The movement of the ship is less than 50 pixels in this interval. The SSDA method is used for matching processing to be more efficiently. Both the coordinate points of an original region and the matched one are recorded on the table. It is used to calculate the deflection, the speed and the direction of the moving vector of each region. The ships can be detected with these parameters. The detected ships can be displayed clearly as a result of processing. We have taken a lot of video images in Tokyo Bay and Tokyo Port. It becomes possible to show that the ships can be detected from the video images.
作者:
宋红石峰Department of Computer Science and Engineering
Beijing Institute of Technology Beijing 100081 China Department of Computer Science and Engineering
Beijing Institute of Technology Beijing 100081 Chinaecurity access control systems and automatic video surveillance systems are becoming increasingly important recently and detecting human faces is one of the indispensable processes. In this paper an approach is presented to detect faces in video surveillance. Firstly both the skin-color and motion components are applied to extract skin-like regions. The skin-color segmentation algorithm is based on the BPNN (back-error-propagation neural network) and the motion component is obtained with frame difference algorithm. Secondly the image is clustered into separated face candidates by using the region growing technique. Finally the face candidates are further verified by the rule-based algorithm. Experiment results demonstrate that both the accuracy and processing speed are very promising and the approach can be applied for the practical use.
Security access control systems and automatic video surveillance systems are becoming increasingly important recently,and detecting human faces is one of the indispensable *** this paper,an approach is presented to de...
详细信息
Security access control systems and automatic video surveillance systems are becoming increasingly important recently,and detecting human faces is one of the indispensable *** this paper,an approach is presented to detect faces in video ***,both the skin-color and motion components are applied to extract skin-like *** skin-color segmentation algorithm is based on the BPNN (back-error-propagation neural network) and the motion component is obtained with frame difference ***,the image is clustered into separated face candidates by using the region growing ***,the face candidates are further verified by the rule-based *** results demonstrate that both the accuracy and processing speed are very promising and the approach can be applied for the practical use.
We demonstrate a method for evaluating the quality of color scanning-filter sets that use sinusoidal spectral power distributions (sine SPDs) instead of physical test targets. Filter quality is quantified as the mean ...
详细信息
We demonstrate a method for evaluating the quality of color scanning-filter sets that use sinusoidal spectral power distributions (sine SPDs) instead of physical test targets. Filter quality is quantified as the mean square error of the filter sets' responses to the fundamental metamers of sine SPDs having varying frequency and phase, relative to a perfect filter set. Filter quality is also depicted graphically by plotting filter input versus output in CIELAB color space, and by plotting the magnitude of the filters' CIELAB color vector response to sine SPDs. The advantages of this approach to scanning-filter evaluation are discussed.
Current methods for image-text retrieval commonly propose various fusion modules to achieve robust visual-textual alignment, primarily relying on in-batch learning to guide the matching process. Some follow-up methods...
详细信息
Current methods for image-text retrieval commonly propose various fusion modules to achieve robust visual-textual alignment, primarily relying on in-batch learning to guide the matching process. Some follow-up methods seek to enlarge the number of negative samples to boost image-text contrastive learning. However, these methods often face challenges posed by semantic-consistent negatives, i.e., negatives samples that share correspondence with the ground truth, leading to confusion in learning cross-modal semantics. To address this issue, we propose a novel Retrieve with Authentic negative repository Learning (ReAL) method, which constructs a specific Authentic Negative Repository filled with valuable negative sample pairs. By introducing a Unique Negative Filter with a Discriminative Triplet Ranking Loss, ReAL effectively filters out the semantic-consistent negatives through similarity distribution analysis and threshold learning. Moreover, existing fusion paradigms suffer from intricate use of fine-grained representations from word- and region-level instances to progressively refine the fused embedding. In this paper, we propose a lightweight Cluster Refinement Module to exploit cross-modal semantics in a 1-way-1-out paradigm. Each visual-textual alignment can spontaneously uncover correlations with adjacent alignments through aggregation and re-allocation, without the need for a redundant and cost-inefficient refinement stage. Furthermore, ReAL employs dual momentum encoders with two memory banks, expanding the selection range of the Authentic Negative Repository to include a broader set of negatives. Extensive experiments conducted on Flickr30K, MS-COCO, and the augmented Flickr30K (with more hard negatives) demonstrate the superiority and robustness of ReAL, while also showcasing its significantly reduced inference time compared to other competitive baselines.
暂无评论