ISBN: 9781450304931 (print)
We have developed a framework for normalizing product attributes from Web pages collected from different Web sites without the need for labeled training examples. It can deal with pages composed of different layout formats and content in an unsupervised manner. As a result, it can handle a variety of domains with minimal effort. Our model is based on a generative probabilistic graphical model incorporating Hidden Markov Models (HMMs) that considers both attribute names and attribute values to extract and normalize text fragments from Web pages in a unified manner. A Dirichlet process is employed to handle the unbounded number of attributes in a domain. An unsupervised inference method is proposed to predict the unobservable variables. We have also developed a method to automatically construct a domain ontology from the normalized product attributes output by inference on the graphical model. We have conducted extensive experiments on product Web pages collected from real-world Web sites in three different domains, and compared against existing work, to demonstrate the effectiveness of our framework. Copyright 2011 ACM.
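As a rough illustration of why a Dirichlet process suits an unbounded number of attributes, the sketch below samples cluster assignments from a Chinese restaurant process, one standard constructive view of the Dirichlet process. It is not the authors' model or inference method: the function name `crp_assignments`, the concentration parameter `alpha`, and the idea of treating each text fragment as an "item" are assumptions made for this example only.

```python
import random


def crp_assignments(n_items, alpha, seed=0):
    """Sample cluster labels for n_items from a Chinese restaurant process.

    Hypothetical helper for illustration; alpha > 0 is the concentration
    parameter (larger alpha tends to create more clusters).
    """
    rng = random.Random(seed)
    counts = []       # counts[k] = number of items currently in cluster k
    assignments = []  # assignments[i] = cluster label of item i
    for _ in range(n_items):
        # An existing cluster k is chosen with weight counts[k];
        # a brand-new cluster is chosen with weight alpha.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)   # open a new cluster
        else:
            counts[k] += 1     # join an existing cluster
        assignments.append(k)
    return assignments, counts


if __name__ == "__main__":
    labels, sizes = crp_assignments(n_items=200, alpha=1.0)
    print("number of clusters:", len(sizes))
    print("cluster sizes:", sizes)
```

The number of clusters is not fixed in advance; it grows with the data, which is the property the abstract relies on when the set of attributes in a domain is not known beforehand.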
Machine learning (ML) has taken the world by storm with its prevalent applications in automating ordinary tasks and using turbulent insights throughout scientific research and design *** is a massive area within artificial intelligence (AI) that focuses on obtaining valuable information out of data, explaining why ML has often been related to statistics and data *** advanced meta-heuristic optimization algorithm is proposed in this work for the optimization problem of antenna architecture *** algorithm is designed as a hybrid of the Sine Cosine Algorithm (SCA) and the Grey Wolf Optimizer (GWO) to train a neural network-based Multilayer Perceptron (MLP). The proposed optimization algorithm is a practical, versatile, and trustworthy platform to recognize the design parameters in an optimal way for an endorsement double T-shaped monopole *** proposed algorithm likewise shows a comparative and statistical analysis by different curves in addition to the ANOVA and *** offers the superiority and validation stability evaluation of the predicted results to verify the procedures' accuracy.
Sensor deployment is one of the important problems in Wireless Sensor Networks (WSNs). Deployment affects the coverage of the network, especially in critical applications such as radiation detection. The contributions...
This article develops a new controller design approach to stabilize system states onto the equilibrium at an arbitrarily selected time instant irrespective of the initial system states and parameters. By the stabiliza...
ISBN: 9783642244766 (print)
Generative models for text data are based on the idea that a document can be modeled as a mixture of topics, each of which is represented as a probability distribution over the terms. Such models have traditionally assumed that a document is an indivisible unit for the generative process, which may not be appropriate for documents with an explicit multi-topic structure. This paper presents a generative model that exploits a given decomposition of documents into smaller, topically cohesive text blocks (segments). A new variable is introduced to model the within-document segments: using this variable at the document level, word generation is related not only to the topics but also to the segments, while the topic latent variable is directly associated with the segments rather than with the document as a whole. Experimental results have shown that, compared to existing generative models, our proposed model provides better language-modeling perplexity and better support for effective clustering of documents.
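For readers unfamiliar with the perplexity measure mentioned above, here is a minimal sketch of the standard definition: the exponential of the negative mean per-token log-likelihood on held-out text, where lower is better. The function name `perplexity` and the toy inputs are assumptions for illustration; the log-probabilities would in practice come from whichever topic model is being evaluated, and nothing here reproduces the paper's experiments.

```python
import math


def perplexity(token_log_probs):
    """Return exp(-mean log-likelihood) for a list of natural-log token probabilities.

    Hypothetical helper: token_log_probs[i] is log p(w_i) under some model.
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_log_lik = sum(token_log_probs) / len(token_log_probs)
    return math.exp(-mean_log_lik)


if __name__ == "__main__":
    # A model that spreads probability uniformly over a 50-word vocabulary
    # assigns every token probability 1/50.
    uniform = [math.log(1 / 50)] * 10
    # A sharper (made-up) model that assigns each held-out token probability 1/10.
    sharper = [math.log(1 / 10)] * 10
    print(perplexity(uniform), perplexity(sharper))
```

A uniform model over a vocabulary of size V has perplexity V, so the measure can be read as the effective number of equally likely word choices the model faces per token.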
Stochastic jump phenomena in the random responses of a Duffing oscillator subjected to narrow band excitation are investigated. The stochastic jump phenomena correspond to the existence of multiple stationary response...
Electroencephalograph (EEG) recordings during right and left motor imagery can be used to move a cursor to a target on a computer screen. Such an EEG-based brain-computer interface (BCI) can provide a new communicati...
During the COVID-19 epidemic, people of all ages and from all walks of life around the world have inevitably become familiar with, and almost dependent on, the digital tools of the age and the opportunities they offer. A cha...
Malware is a malicious software program that performs multiple cyberattacks on the Internet, involving fraud, scams, nation-state cyberwar, and *** malicious software programs come under different classifications, namely Trojans, viruses, spyware, worms, ransomware, rootkits, botnet malware, *** is a kind of malware that holds the victim's data hostage by encrypting the information on the user's computer to make it inaccessible, decrypting it only after the user pays a ransom of a sum of *** prevent detection, various forms of ransomware utilize more than one mechanism in their attack flow in conjunction with Machine Learning (ML) *** study focuses on designing a Learning-Based Artificial Algae Algorithm with Optimal Machine Learning Enabled Malware Detection (LBAAA-OMLMD) approach in computer *** presented LBAAA-OMLMD model mainly aims to detect and classify the existence of ransomware and goodware in the *** accomplish this, the LBAAA-OMLMD model initially derives a Learning-Based Artificial Algae Algorithm based Feature Selection (LBAAA-FS) model to reduce the curse of dimensionality ***, the Flower Pollination Algorithm (FPA) with Echo State Network (ESN) Classification model is *** FPA model helps to appropriately adjust the parameters related to the ESN model to accomplish enhanced classifier *** experimental validation of the LBAAA-OMLMD model is tested using a benchmark dataset, and the outcomes are inspected in distinct *** comprehensive comparative examination demonstrated the superiority of the LBAAA-OMLMD model over recent algorithms.
The problem of estimating the thermal state of an induction cooking system is analyzed. The first step is to model the thermal system composed of the cooktop, the pot, and the pot content. Then, by relying on the form...