ISBN: (print) 9781665405836
Machine Reading Comprehension (MRC), a family of tasks that test a model's ability to understand natural language, has received considerable attention in Natural Language Processing (NLP). Most existing work addresses MRC tasks by exploiting the expressive capability of neural networks, and some of it has achieved impressive performance. Despite the rapid iteration of models, little work has focused on the output layer and the method used to predict the answer span, also known as span extraction. In this paper, we focus on span extraction in the Question Answering (QA) task. A cross-sectional comparison of widely used span extraction methods is presented, with their strengths and weaknesses noted in detail. Furthermore, inspired by Faster R-CNN, we propose a new span extraction method. Experimental results show that the proposed method outperforms existing span extraction methods on both English and Chinese MRC tasks.
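For orientation, one widely used span extraction baseline scores every token as a possible start and as a possible end, then picks the best valid pair. The sketch below illustrates that baseline only; it is not the Faster R-CNN-inspired method the abstract proposes, and the function name `best_span` and the `max_len` cap are assumptions made for illustration.

```python
from typing import List, Tuple


def best_span(start_scores: List[float], end_scores: List[float],
              max_len: int = 30) -> Tuple[int, int, float]:
    """Return (start, end, score) maximizing start_scores[i] + end_scores[j]
    subject to i <= j < i + max_len."""
    if len(start_scores) != len(end_scores) or not start_scores:
        raise ValueError("score lists must be non-empty and of equal length")
    best = (0, 0, start_scores[0] + end_scores[0])
    for i, s in enumerate(start_scores):
        # Only consider end positions at or after the start, within max_len tokens.
        for j in range(i, min(i + max_len, len(end_scores))):
            score = s + end_scores[j]
            if score > best[2]:
                best = (i, j, score)
    return best
```

Taking the two argmax positions independently can return an end that precedes the start; searching over valid pairs jointly avoids that failure mode, at the cost of a nested loop bounded by `max_len`.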
Face gender recognition is a challenging problem in the traditional field of pattern recognition. In this paper, we propose a deep learning model that jointly learns high-level and low-level features of the human face to address this problem. Our deep neural networks apply convolution and subsampling to extract local and abstract features of the face and, at the same time, reconstruct the raw input images to learn global features as supplementary information. We also add a trainable weight to the networks when combining the two kinds of features for the final gender classification. Experimental results show that our method achieves the highest accuracy among the compared methods when tested on the mixed face dataset. Further, in the generalization test, the average classification rate of our method on 3 public datasets is 5% higher than that of the joint Local Binary Pattern (LBP) and Support Vector Machine (SVM) method, and nearly 1% higher than that of the SVM with face pixels method. This indicates that our method outperforms the traditional methods in both learning ability and generalization ability.
In the conventional bag of visual words (BoW) based image representation, a single visual word is not discriminative enough and the spatial contextual information among local image features is ignored. In this paper, descriptive local feature groups are proposed to address these two problems. First, local image features are refined by slightly transforming the original image. They are then clustered and represented by visual words. Second, candidate local feature groups are generated by searching the neighbors of every local image feature. This kind of grouping shows more discriminative power than a single feature, and the local spatial context can be captured. Third, we obtain the groups that are more descriptive of the object category by defining a significance score and selecting the groups with high scores. Finally, the high-order descriptive local feature groups are integrated into the vector-based object categorization framework by a feature reweighting strategy. Experimental results on Scene-15 and Caltech 101 demonstrate the superior performance of our method.
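The second step, generating candidate groups from each feature's neighbors, can be sketched as follows. This is a minimal illustration under stated assumptions: features are given as `(x, y, visual_word_id)` tuples, neighbors are the `k` nearest by Euclidean distance, and a group is the sorted tuple of word IDs. None of these choices, nor the name `candidate_groups`, comes from the abstract.

```python
from typing import List, Tuple

Feature = Tuple[float, float, int]  # (x, y, visual_word_id), an assumed layout


def candidate_groups(features: List[Feature], k: int = 2) -> List[Tuple[int, ...]]:
    """For each feature, group its visual word with those of its k nearest neighbors."""
    groups = []
    for idx, (x, y, word) in enumerate(features):
        # Squared Euclidean distance to every other feature, paired with its word ID.
        others = [
            ((fx - x) ** 2 + (fy - y) ** 2, fword)
            for j, (fx, fy, fword) in enumerate(features) if j != idx
        ]
        others.sort(key=lambda t: t[0])  # stable sort keeps input order on ties
        neighbours = [w for _, w in others[:k]]
        # Sorting the word IDs makes the group independent of neighbor order.
        groups.append(tuple(sorted([word] + neighbours)))
    return groups
```

Counting how often each group occurs per category would be one possible input to a significance score, though the abstract does not say how its score is defined.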
Fundamental research on fitness design in logistics sorting systems aims to propose a standardized method for taking human factors into account. Taking the Pu Tian Sorting System as an example, this article analyzes four aspects: man-machine work criterion fitness, man-machine work rhythm fitness, man-machine working pattern fitness, and man-machine work mood fitness. On this basis, we derive a reasonable man-machine function assignment and guidance for the development and design of logistics equipment, so as to obtain the highest comprehensive efficiency of the system.
ISBN: (print) 9781479989386
Convolution operations are widely used in many important application domains, such as deep learning and computer vision, in which convolution is always the most time-consuming part. High computational throughput and memory bandwidth make many-core architectures promising targets for accelerating these applications. In this paper, we implement and optimize different convolution operations, including 1D convolution, 2D convolution, and multi-channel 2D convolution executed in mini-batch mode, on both GPU and Intel MIC many-core architectures. We find that the performance bottleneck of 1D and 2D convolutions lies in registers rather than in local memory or the L1/L2 cache, and therefore register tiling is used to improve performance. In addition, we present a novel solution for multi-channel 2D convolution, in which convolution is conducted on images directly instead of being translated to matrix multiplication, and the data reuse of the algorithm is fully exploited. We further summarize the autotuning parameters for multi-channel 2D convolution and prune the search space based on heuristics. The experimental results show that, for large filter sizes, our solution achieves up to 33% performance improvement over cuDNN-v2 and up to 28% over a clBLAS-based implementation, on GTX TITAN and AMD W8000 respectively. On Intel MIC, our solution reaches up to 25% of the theoretical peak performance.
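The loop structure behind register tiling can be illustrated in a few lines. The sketch below is not the paper's GPU or MIC kernel: Python has no registers, so the four local accumulators only stand in for them, and the tile size of 4, the name `conv1d_tiled`, and the "valid" output range are assumptions. It computes the unflipped (cross-correlation) form that deep learning frameworks usually call convolution.

```python
from typing import List


def conv1d_tiled(signal: List[float], kernel: List[float]) -> List[float]:
    """'Valid' 1D correlation that computes four outputs per outer iteration."""
    n_out = len(signal) - len(kernel) + 1
    if n_out <= 0:
        return []
    out = [0.0] * n_out
    tile = 4
    full = n_out - n_out % tile  # number of outputs covered by whole tiles
    for base in range(0, full, tile):
        # Four accumulators stand in for registers holding partial sums.
        a0 = a1 = a2 = a3 = 0.0
        for t, w in enumerate(kernel):
            # Each kernel weight is read once and reused for four outputs.
            a0 += w * signal[base + t]
            a1 += w * signal[base + t + 1]
            a2 += w * signal[base + t + 2]
            a3 += w * signal[base + t + 3]
        out[base:base + 4] = [a0, a1, a2, a3]
    # Remainder outputs that do not fill a whole tile.
    for i in range(full, n_out):
        out[i] = sum(w * signal[i + t] for t, w in enumerate(kernel))
    return out
```

In a compiled kernel the same structure lets each loaded weight and each loaded input element feed several multiply-adds before it is evicted, which is the kind of reuse that matters when registers are the bottleneck.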
In pig production, food conversion ratio and profit can be evaluated by real-time detection of pig live *** pig weight detections usually require direct contact with pigs, which are limited by its low efficiency and result in a lot of stresses even to *** non-contact detection of pig body weight has become a challenge in pig production for *** image analysis and machine vision method enable the real-time estimation of pig live weight by detecting pig critical body dimensions without any *** article elucidated the advantages and limitations of each detection method of pig body weight by comparing the system framework and estimation *** research trends of contactless pig weight estimation were analyzed as well.
This paper presents a novel LCD screen based photometric stereo method. The method is implemented with only an ordinary laptop. The computer screen is used as the light source and partitioned into different regions to simulate various lighting conditions, and the embedded camera is used for image capture. With a derivation and proof, we show that a circular partition scheme obtains optimal lighting effects. Using the proposed lighting direction calibration, a traditional photometric stereo algorithm is applied for 3D reconstruction. Experiments are conducted with a real object and compared with conventional means to demonstrate the method's feasibility and reconstruction accuracy.
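The traditional photometric stereo step mentioned above can be sketched for a single pixel. The sketch assumes a Lambertian surface, exactly three distant directional lights with known unit directions, and no shadows, so each intensity satisfies I = l · (albedo × n). The screen-region lighting and calibration described in the abstract are not modeled, and the names `det3` and `solve_normal` are illustrative.

```python
import math
from typing import Sequence, Tuple

Vec3 = Tuple[float, float, float]


def det3(m: Sequence[Sequence[float]]) -> float:
    """Determinant of a 3x3 matrix given as three rows."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def solve_normal(lights: Sequence[Vec3],
                 intensities: Sequence[float]) -> Tuple[Vec3, float]:
    """Solve L g = I for g = albedo * normal; return (unit normal, albedo)."""
    d = det3(lights)
    if abs(d) < 1e-12:
        raise ValueError("light directions must be linearly independent")
    g = []
    for k in range(3):
        # Cramer's rule: replace column k of the light matrix with the intensities.
        replaced = [[intensities[r] if c == k else lights[r][c] for c in range(3)]
                    for r in range(3)]
        g.append(det3(replaced) / d)
    albedo = math.sqrt(sum(v * v for v in g))
    if albedo == 0.0:
        raise ValueError("pixel is black under all lights; normal is undefined")
    normal = (g[0] / albedo, g[1] / albedo, g[2] / albedo)
    return normal, albedo
```

With more than three lighting conditions, the same system is usually solved in the least-squares sense per pixel, which is more robust to noise than the exact three-light solve shown here.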
Color coding is an important research topic in spatially encoded structured light sensing (SLS). In this study, we propose a novel graphical model based approach for the color pattern decoding task. For efficient color labeling, the color pattern is first decomposed into separate binary pattern images. With the labeled pattern elements, a unified probabilistic graphical framework is constructed to represent the pseudorandom pattern as a clique tree structure. The model contains two parts: a Conditional Random Field (CRF) is used to represent the dependencies between these local decisions, and a Bayesian network (BN) is applied to represent the effect of background colors. Experiments on a colorful target demonstrate the feasibility of the approach, and 3D models reconstructed from the decoding results are provided to show its robustness.
The digital logic rewiring technique has been shown to be one of the most powerful logic transformation methods, able to further improve already excellent results on many EDA problems, ranging from logic minimization and partitioning to FPGA technology mapping and final routing. Previous studies show that GBAW, a graph-based rewiring engine, outperforms ATPG-based rewiring tools with a 50 times faster runtime while covering nearly half of the target wires in the circuit. This paper presents several new extensions to GBAW, including coverage of arbitrary gate sizes, to improve its rewiring power. Experimental results on MCNC benchmark circuits show that, compared to the previous GBAW, the new version covers 12% more target wires and provides 1.5 times more alternative wires while running over 100 times faster than its ATPG-based counterpart. For problems that require only a good-enough and very quick solution, this new rewiring technique may serve as a useful alternative.
Image stylization refers to the process of transforming an input image into a new one, while retaining its original content but in different styles. However, most existing works only support single-modal guidance, whi...