Convolutional neural network (CNN) has achieved great success in many vision tasks. A key to this success is its ability to powerful automatically learns both high-level and low-level features. In general, low-level f...
详细信息
ISBN:
(纸本)9781479958306
Convolutional neural network (CNN) has achieved great success in many vision tasks. A key to this success is its ability to powerful automatically learns both high-level and low-level features. In general, low-level features have a small size of receptive fields and appear multiple times in different locations of objects, while high-level semantic features have a relatively large size of receptive fields and only appear once in a specific location of objects. However, traditional CNN treats these two kinds of features in the same manner, i.e., learning them by the convolution operation, which can be approximately considered as cumulating the probabilities that a feature appears in different locations. this strategy is reasonable for low-level features but not for high-level semantic ones, especially in the case of pedestrian detection, where a local feature can be shared by different locations but a semantic part, e.g., a head, only appears once for a human. To jointly model the spatial structure and appearance of high-level semantic features, we propose a new module to learn spatially weighted max pooling in CNN. the proposed method is evaluated on several pedestrian detection databases and the experimental results show that it achieves much better performance than traditional CNN.
Dog classification has a wide range of applications in computervision and zoology. However, traditional deep learning methods face challenges in classifying chinese native dog breeds due to the limitations and unique...
详细信息
ISBN:
(数字)9798350385625
ISBN:
(纸本)9798350385632
Dog classification has a wide range of applications in computervision and zoology. However, traditional deep learning methods face challenges in classifying chinese native dog breeds due to the limitations and uniqueness of sample data. this paper proposes a classification method for domestic local dog breeds based on an improved ResNet (Residual Network). the model employs ResNet50 as the backbone network. By adjusting the fully connected layer to the Sigmoid output function and the BCELoss loss function, the misclassification probability of unseen categories is effectively reduced. Initially, crawler technology was utilized to obtain image data for nine types of native dogs, which were then preprocessed. Subsequently, these nine dog classification tasks were categorized into two groups. For each category, an improved ResNet was trained independently to obtain two distinct models. Finally, the two models were tested using multithreaded parallel techniques to compare their outputs and determine the most suitable dog breed classification result. Results demonstrate that the proposed algorithm performs well in the chinese native dog breed classification task. Compared withthe unimproved model without parallel technology and the model using parallel technology but without improvement, the accuracy improved from 93.8% and 94.5% to 96.4%. Additionally, precision, recall, and F1-score reached 96.7%, 96.4%, and 96.5%, respectively. these findings fully confirm the effectiveness of the algorithm. this study provides a robust solution to promote the protection and research of indigenous dog breeds in China.
A novel structure for a charge pump circuit is proposed,in which the current follow technology is used to make perfect current matching characteristics,and two differential inverters are implanted to increase the spee...
详细信息
A novel structure for a charge pump circuit is proposed,in which the current follow technology is used to make perfect current matching characteristics,and two differential inverters are implanted to increase the speed of charge *** results,with 1silicon 0.25um 2.5V CMOS mixed signal process,show the good current matching characteristics regardless of the charge pump output voltages.
A much extended graph-based alternative wiring (GBAW) scheme to identify alternative wires in multilevel logic with promising results is presented. By modeling subsets of circuits as minimal graphs and applying purely...
详细信息
ISBN:
(纸本)0780365429
A much extended graph-based alternative wiring (GBAW) scheme to identify alternative wires in multilevel logic with promising results is presented. By modeling subsets of circuits as minimal graphs and applying purely graph-based local pattern search technique, we have found more than 40 graph patterns which contain alternative wires within 2-edge distance from the target wire. Applying proper grouping technique for the similar patterns, the complexity of our rewiring technique can be reduced. Experimental results on MCNC benchmarks show that our technique is much faster than the ATPG-based technique RAMBO with competitive number of alternative wires found. Withthis augmented pattern family of alternative wires, we are able to find 30% more alternative wires compared to RAMBO with75 times speedup on average. We applied GBAW in logic minimization as a perturbation engine and simplify the target circuit by SIS algebraic operations. Results show a further reduction of 11.1% in literal count compared to applying algebraic operations alone.
Recent work in monocular pedestrian detection is trying to improve the execution time while keeping the accuracy as high as possible.A popular and successful approach for monocular intensity pedestrian detection is ba...
Recent work in monocular pedestrian detection is trying to improve the execution time while keeping the accuracy as high as possible.A popular and successful approach for monocular intensity pedestrian detection is based on the approximation(instead of computation) of image features for multiple scales based on the features computed on set of predefined *** port this idea to the infrared *** contributions reside in the combination of four channel features,namely infrared,histogram of gradient orientations,normalized gradient magnitude and local binary patterns withthe objective of detecting pedestrians for night vision applications dealing with far infrared *** scale feature computation is done by feature *** contribution is the study of different formulations for Local Binary patterns like uniform patterns and rotation invariant patterns and their effect on detection *** detection speed is also boosted by the aid of a fast morphological based region of interest *** vary the number of approximated scales per octave and study the impact on execution time and accuracy.A reasonable result hits a speed of 18 fps with a log average miss rate of 39%.
Object tracking plays a crucial role in computervision and finds extensive applications in various areas such as video surveillance, autonomous driving, and intelligent security systems. However, existing target trac...
详细信息
ISBN:
(数字)9798350350760
ISBN:
(纸本)9798350350777
Object tracking plays a crucial role in computervision and finds extensive applications in various areas such as video surveillance, autonomous driving, and intelligent security systems. However, existing target tracking methods is deficient in handling information redundancy, and extracting key information, which affects the accuracy and robustness of tracking. In this paper, we propose a new method of target tracking that combines the information bottleneck theory, which optimizes the information transmission path and extracts key information related to the target tracking task by introducing an information bottleneck module in the tracking model. the information bottleneck approach can be used to extract robust feature representations that help the tracking algorithm remain stable and accurate in the face of target changes, occlusions and noise. the method is combined with Visual Transformer (ViT) to enhance the model's ability to adapt to targets in complex scenes. Experimental findings indicate that the proposed method delivers outstanding performance across multiple public datasets. With significant improvement in accuracy and robustness compared to existing methods, the research in this paper provides new ideas and methods to improve the performance of target tracking algorithms.
the insertion of dummy metals is necessary to reduce the pattern-dependent variations of the dielectric thickness in the CMP *** makes conventional tools of capacitance extraction exhibit prohibitive calculation *** p...
详细信息
the insertion of dummy metals is necessary to reduce the pattern-dependent variations of the dielectric thickness in the CMP *** makes conventional tools of capacitance extraction exhibit prohibitive calculation *** paper presents an efficient method for 3-D capacitance extraction with taking the floating dummies into *** on the QMM-accelerated BEM,our method inherits high computational speed while considering the floating conditions and using a new *** some typical structures including floating dummies,our method shows high accuracy and over 1000x speed-up over Raphael,about 10x speed-up over the method in Ref[5].
the driving intelligence of the virtual intelligent vehicle in the driving simulation system is crucial to the system's sense of reality and the reliability of the experiment. this paper focuses on the features an...
详细信息
the driving intelligence of the virtual intelligent vehicle in the driving simulation system is crucial to the system's sense of reality and the reliability of the experiment. this paper focuses on the features and functionality of the virtual intelligent vehicle in driving simulation system. Intelligent vehicle should first distinguish the roadway network in order to drive correctly along a certain path. then the intelligent vehicle should be able to obtain the traffic information efficiently and to execute the decision-making process to response to the changing driving environment. this paper describes a method to construct the virtual intelligent vehicle using database technology and the collision detection algorithm. the experiment result illustrates that the virtual intelligent vehicles in virtual traffic environment can simulate the human vision genuinely and exhibit human-like driving behavior. the virtual intelligent vehicle can still keep the correctness and the efficiency in a comparatively complex traffic environment, also the stability of the system is satisfied.
Based on the digital image processing technology, this study applied the mathematic morphological character extracting software to get exterior shape feature from skull of three rodent species (Meriones unguiculatus, ...
详细信息
Based on the digital image processing technology, this study applied the mathematic morphological character extracting software to get exterior shape feature from skull of three rodent species (Meriones unguiculatus, Microtus brandti and Rattus norvegicus). Via SPSS 13.0 for Windows, the data was explored for automatic recognition. 22 mathematic morphological characters were extracted and the results of univariate two-factor analysis of variance showed that all the features can provide a reliable foundation for a rapid automatic recognition, though failed to distinguish gender. Stepwise discriminatory method was applied and 9 parameters were selected for rodent identification: short axis(X1), perimeter(X2), eccentricity(X3), sphericity(X4), bump area(X5), paraxial area of enclosing rectangle(X6), hu1(X7), hu2(X8), hu3(X9). Withthe selected parameters, two standardized canonical discriminant functions were found: Y1 = 8.014X1 - 3.585X2 + 9.682X3 - 2.504X4 + 7.823X5 - 10.948X6 - 0.896X7 + 12.471X8 - 0.781X9; Y2 = 2.593X1 + 0.242X2 + 6.323X3 + 0.509X4 - 1.219X5 - 2.898X6 - 4.226X7 + 7.674X8 + 0.785X9. these two discriminant functions worked well for identification, the accuracy rate was 100%.
暂无评论