Sparse coding is an unsupervised learning algorithm that learns a succinct high-level representation of the inputs given only unlabeled data;it represents each input as a sparse linear combination of a set of basis fu...
详细信息
ISBN:
(纸本)0974903930
Sparse coding is an unsupervised learning algorithm that learns a succinct high-level representation of the inputs given only unlabeled data;it represents each input as a sparse linear combination of a set of basis functions. Originally applied to modeling the human visual cortex, sparse coding has also been shown to be useful for self-taught learning, in which the goal is to solve a supervised classification task given access to additional unlabeled data drawn from different classes than that in the supervised learning problem. Shift-invariant sparse coding (SISC) is an extension of sparse coding which reconstructs a (usually time-series) input using all of the basis functions in all possible shifts. In this paper, we present an efficient algorithm for learning SISC bases. Our method is based on iteratively solving two large convex optimization problems: The first, which computes the linear coefficients, is an L1-regularized linear least squares problem with potentially hundreds of thousands of variables. Existing methods typically use a heuristic to select a small subset of the variables to optimize, but we present a way to efficiently compute the exact solution. The second, which solves for bases, is a constrained linear least squares problem. By optimizing over complex-valued variables in the Fourier domain, we reduce the coupling between the different variables, allowing the problem to be solved efficiently. We show that SISC's learned high-level representations of speech and music provide useful features for classification tasks within those domains. When applied to classification, under certain conditions the learned features outperform state of the art spectral and cep-stral features.
An inverse method using strain values obtained at finite locations is used to predict coefficients of an unknown continuous static load function applied to a composite sandwich plate. The strain values obtained from a...
详细信息
A (page or web) snippet is document excerpts allowing a user to understand if a document is indeed relevant without accessing it. This paper proposes an effective snippet generation method. The pseudo relevance feedba...
详细信息
ISBN:
(纸本)1595935975
A (page or web) snippet is document excerpts allowing a user to understand if a document is indeed relevant without accessing it. This paper proposes an effective snippet generation method. The pseudo relevance feedback technique and text summarization techniques are applied to salient sentences extraction for generating good quality snippets. In the experimental results, the proposed method showed much better performance than other methods including Google and Naver.
Finding a boat in wind and current is an age old problem that puzzled many people throughout history. It is only until recently a simple theory is developed to allow efficient and accurate prediction of the boat's...
详细信息
ISBN:
(纸本)1934272256
Finding a boat in wind and current is an age old problem that puzzled many people throughout history. It is only until recently a simple theory is developed to allow efficient and accurate prediction of the boat's drift trajectory. Such theory could be developed using simple physics and algebra. In this paper, we will describe the derivation of the theory that would give the boat's trajectory in terms of different types of boat condition under given wind field and current field. We will show that the relationship is versatile, and could be used for practical application. We will further implement the results in computer animation setting so that the results can be obtained easily for a given application. The paper demonstrates the power of mathematics and technology to solve problem of boat drift of search and rescue.
This paper explores what kind of user simulation model is suitable for developing a training corpus for using Markov Decision Processes (MDPs) to automatically learn dialog strategies. Our results suggest that with sp...
详细信息
In large-scale disaster events, infrastructure owners are faced with many challenges in deciding the allocation ofresources for preparation and response actions. This decision process involves building situation aware...
详细信息
Gene expression analysis techniques identify important genes that predict specified outcomes based on sample characteristics. Given the small sample sizes common to these studies and the large dimensionality of the da...
详细信息
Empirical spoken dialog research often involves the collection and analysis of a dialog corpus. However, it is not well understood whether and how a corpus of dialogs collected using recruited subjects differs from a ...
The vast growth in digital content generated by individuals marks a new social trend known as “Generation C” ( ***/trends/GENERATION_*** ). Personal content ranges from informal to formal, and includes scholarly pap...
The vast growth in digital content generated by individuals marks a new social trend known as “Generation C” ( ***/trends/GENERATION_*** ). Personal content ranges from informal to formal, and includes scholarly papers, blogs, genealogical records, personal webpages, photo albums, family videos, music collections, power point presentations, bookmarks, personal correspondence, articles, computerprograms, audio recordings (e.g. research interviews), spaces in collaborative systems, and personal digital libraries. Digital storage available for personal use continues to increase dramatically in capacity while declining in cost. Panelists will address challenges in the design of personal digital collections such as gathering, organizing, preserving, segmenting, accessing, and using digital content. Panelists are leaders in their field, representing an array of perspectives.
暂无评论