This document introduces the 1stinternational Virtual conference on Visual pattern Extraction and recognition for Cultural Heritage Understanding (VIPERC 2022), a premier forum for presenting the state-of-the-art, ne...
详细信息
The training of the text-to-image task relies on a large number of image-text pairs, and the problems of insufficient sample size of the dataset and inprominent image subject content will greatly affect the overall pe...
详细信息
ISBN:
(纸本)9798400707032
The training of the text-to-image task relies on a large number of image-text pairs, and the problems of insufficient sample size of the dataset and inprominent image subject content will greatly affect the overall performance of the model. In addition, most of the existing methods are stacked multi-stage generation structure, and the quality of generated images is difficult to meet the actual application requirements. Aiming at the above problems, this paper enhances the dataset and designs a single-stage text generation image model based on styleGAN. Based on the analysis of global semantic features and semantic diversity expressions, this paper carries out adaptive perturbation text data enhancement, and designs an image cropping method that fuses boundary information, and adopts random cropping and rotation to complete image data enhancement. In the generator, the conditional style construction network fuses global semantic information and hidden space vectors to obtain conditional style control vectors that control different visual features of the image. The generation network adopts a progressive generation structure, accepts the control of conditional style in the way of AdaIN, and adds random noise to each layer at the same time, which improves the authenticity of the image and enriches the details. At the same time, the semantic inner product is added to the discriminator, and the loss function of cyclic consistency loss and visual semantic matching loss are introduced to further improve the semantic consistency of graphics and texts. The IS, FID and R-precision values of this model on the CUB-200-2011 dataset reached 4.96, 14.11 and 75.20%, respectively.
China is vigorously developing electric vehicles, and the penetration rate of new energy vehicles in China has exceeded 10%. The penetration rates in the United states, Europe, and other regions are also growing, lead...
详细信息
ISBN:
(纸本)9798400707032
China is vigorously developing electric vehicles, and the penetration rate of new energy vehicles in China has exceeded 10%. The penetration rates in the United states, Europe, and other regions are also growing, leading to a high-speed increase in the demand for charging piles. Although the number of public charging piles has been increasing year by year, big data shows that the utilization rate of public charging piles is less than 15%. Therefore, how to guide users to avoid peak electricity consumption periods without compromising the convenience of charging for electric vehicle users is key to solving the problem. Finding an appropriate scheduling strategy is the core of solving this dilemma. For personalized recommendation algorithms for charging piles, we propose a Top-N recommendation algorithm for charging piles based on a neural collaborative filtering framework that combines multiple feature fusion methods. This algorithm uses concatenation and outer product operations to construct interaction relationships between users and items, respectively. Then, two learners are used to learn their interactive features, and finally, the learned prediction vectors are merged through concatenation. This approach not only fully excavates the potential information of embedded vectors but also has nonlinear feature fitting capabilities, thereby better improving the model's recommendation performance.
Breech face impressions are marks left by the impact of a bullet casing against the breech face due to the reactive force of the gunpowder explosion, which is essential evidence linking firearms to fired bullet casing...
详细信息
ISBN:
(纸本)9798400707032
Breech face impressions are marks left by the impact of a bullet casing against the breech face due to the reactive force of the gunpowder explosion, which is essential evidence linking firearms to fired bullet casings. Aiming at the problem that traditional methods exhibit a high error rate and cannot be applied to both granular and striated impressions, a learnable image matcher designed on the core principle of generalization (OmniGlue) is introduced for the automatic comparison of breech face impressions. The method employs an attention mechanism to propagate information across the built graphs, removes irrelevant key points and separates the local feature descriptors from the position information of key points during information propagation, thereby reducing the dependence of the feature descriptors on position information. Position information is not merged into the local descriptors used for matching, enhancing the generalization of these descriptors and improving the model's performance across different domains. The method is validated using the Fadul and Hamby datasets provided by the National Institute of standards and Technology (NIst). The results show that the method is effective for both granular and striated marks, exhibiting a lower identification error rate and superior performance compared to traditional methods.
Support Vector machine(SVM) is one of the most efficient machinelearning algorithms, which is mostly used for patternrecognition since its introduction in 1990s. SVMs vast variety of usage, such as face and speech r...
详细信息
ISBN:
(纸本)9781479945559
Support Vector machine(SVM) is one of the most efficient machinelearning algorithms, which is mostly used for patternrecognition since its introduction in 1990s. SVMs vast variety of usage, such as face and speech recognition, face detection and imagerecognition has turned it into a very useful algorithm. This has also been applied to many pattern classification problems such as imagerecognition, speech recognition, text categorization, face detection, and faulty card detection. statistics was collected from journals and electronic sources published in the period of 2000 to 2013. patternrecognition aims to classify data based on either a priori knowledge or statistical information extracted from raw data, which is a powerful tool in data separation in many disciplines. The Support Vector machine ( SVM) is a kind of algorithms in biometrics. It is a statistics technical and used orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables.
This paper presents a new inference algorithm for Active learning Method (ALM). ALM is a pattern-based algorithm for soft computing which uses the Ink Drop Spread (IDS) algorithm as its main engine for feature extract...
详细信息
ISBN:
(纸本)9781467362061;9781467362047
This paper presents a new inference algorithm for Active learning Method (ALM). ALM is a pattern-based algorithm for soft computing which uses the Ink Drop Spread (IDS) algorithm as its main engine for feature extraction. In this paper a fuzzy number is extracted from each IDS plane rather than the narrow path and spread as in previous approaches. In order to compare performance of the proposed algorithm with the original one, two functions which are widely used in literature are modeled as the benchmark. Simulation results show that the proposed algorithm is as effective as previous one in the modeling task.
In recent years, with the continuous development of multispectral industrial image technology, multispectral industrial image data has been greatly improved in terms of spectrum and spatial resolution, which provides ...
详细信息
This study reports a disease symptom classification algorithm using a proposed patternrecognition approach to individually localize early and late blight visual disease symptoms. The algorithm uses the pathological a...
详细信息
ISBN:
(纸本)9781728133775
This study reports a disease symptom classification algorithm using a proposed patternrecognition approach to individually localize early and late blight visual disease symptoms. The algorithm uses the pathological analogy hierarchy of the diseases to produce a novel homogeneous pattern localization, more informative to extract features that would be utilized for a machinelearning system to classify the two diseases in digital photographs of vegetable plants. One of the most significant advantages of the proposed pattern analysis is localizing symptomatic and necrotic regions based on pathological disease analogy using soft computing, with which the pattern of each disease manifestation along the leaf surface can be tracked and quantified for characterization. In the 1st phase of the experiment, individual symptomatic (R-S), necrotic (R-N), and blurred (R-B, in-between healthy and symptomatic) regions were identified, segmented, and quantified. The 2nd phase focuses on the extraction of pattern features for classification and severity estimation with a machinelearning classifier. The obtained results are encouraging, successfully localizing and quantifying individual disease lesions. This also indicates the enhanced applicability of the proposed approach discriminating the two diseases based on their dissimilarity. It is also envisaged that the algorithm can be extended to other plant disease symptoms. Moreover, it provides opportunities for early identification and detection of subtle changes in plant growth, disease stage, and severity estimation to assisting crop diagnostics in precision agriculture.
One of the problems in imageprocessing is finding an appropriate threshold in order to convert an image to a binary one. In this paper we introduce a new method for image thresholding. We use reinforcement learning a...
详细信息
ISBN:
(纸本)0769521274
One of the problems in imageprocessing is finding an appropriate threshold in order to convert an image to a binary one. In this paper we introduce a new method for image thresholding. We use reinforcement learning as an effective way to find the optimal threshold. Q (A) is implemented as a learning algorithm to achieve more accurate results. The reinforcement agent uses objective rewards to explore/exploit the solution space. It means that there is not any experienced operator involved and the reward and punishment function must be defined for the agent. The results show that this method works successfully and can be trained for any particular application.
This book constitutes the refereed proceedings of the 4th internationalconference on patternrecognition and machine Intelligence, PReMI 2011, held in Moscow, Russia in June/July 2011. The 65 revised papers presented...
ISBN:
(数字)9783642217869
ISBN:
(纸本)9783642217852
This book constitutes the refereed proceedings of the 4th internationalconference on patternrecognition and machine Intelligence, PReMI 2011, held in Moscow, Russia in June/July 2011. The 65 revised papers presented together with 5 invited talks were carefully reviewed and selected from 140 submissions. The papers are organized in topical sections on patternrecognition and machinelearning; image analysis; image and video information retrieval; natural language processing and text and data mining; watermarking, steganography and biometrics; soft computing and applications; clustering and network analysis; bio and chemo analysis; and document imageprocessing.
暂无评论