Implicit relevance feedback (IRF) is the process by which a search system unobtrusively gathers evidence on searcher interests from their interaction with the system. IRF is a new method of gathering information on us...
详细信息
Software and product designers use card sorting to understand item groups and relationships. In the usability community, a common method of formal statistical analysis for open card sort data is hierarchical cluster a...
Software and product designers use card sorting to understand item groups and relationships. In the usability community, a common method of formal statistical analysis for open card sort data is hierarchical cluster analysis, which results in a tree of the items sorted into distinct, nested clusters. Hierarchical cluster analysis is appropriate for highly structured settings, like software menus. However, many situations call for softer clusters, such as designing websites where multiple pages link to the same target page. Factor analysis summarizes the categories created in card sorts and generates clusters that can overlap. This paper explains how to prepare card sort data for statistical analysis, describes the results of factor analysis and how to interpret them, and discusses when hierarchical cluster analysis and factor analysis are appropriate.
This paper examines MPEG-4 coding efficiency for interlaced and progressively scanned video data recorded during remote patient monitoring. Experiments are performed using both interlaced and progressive coding method...
详细信息
Dance is believed to be important in the courtship of a variety of species, including humans, but nothing is known about what dance reveals about the underlying phenotypic - or genotypic - quality of the dancer1-6. On...
详细信息
Dynapad is a development environment designed to support research prototyping of multiscale workspaces. In this paper we describe applications designed to facilitate visual access to and spatial organization of digita...
详细信息
Dynapad is a development environment designed to support research prototyping of multiscale workspaces. In this paper we describe applications designed to facilitate visual access to and spatial organization of digital photo collections and personal libraries of PDF documents. The research objective is to explore a generalization of the notion of a "pile" as a foundation for a versatile suite of tools to provide unobtrusive assistance for organizing collections and other sensemaking activities. We detail the architecture underlying the applications, explain how it supports diverse functionality and interaction styles, and abstract a set of principles for designing spatial tools.
Authoring tools routinely include a timeline representation to allow the author to specify the sequence of animations and interactions. However, traditional static timelines are best suited for static, linear sequence...
详细信息
ISBN:
(纸本)1595930027
Authoring tools routinely include a timeline representation to allow the author to specify the sequence of animations and interactions. However, traditional static timelines are best suited for static, linear sequences (such MIDI sequencers) and do not lend themselves to interactive content. This forces authors to supplement their timelines with scripted actions which are not represented. Timelines also force frameaccuracy on the author, which interferes with rapid exploration of different designs. We present a redesign of the timeline in which users can specify the relative ordering and causality of events without specifying exact times or durations. This effectively enables users to "work rough" in time. We then implement a prototype and perform a user study to investigate its efficiency.
Recent work on intelligent tutoring systems has used Bayesian networks to model students' acquisition of skills. In many cases, researchers have hand-coded the parameters of the networks, arguing that the conditio...
详细信息
A community computer network facilitates civic participation by providing pervasive local resources online and by connecting people to local communication and discussion channels, public and non-profit organization le...
详细信息
A lot of researches on classifiers, which can perform well with a given set of feature vectors, have been done. However, researches on feature vectors, which extract better feature vectors automatically, have not been...
详细信息
Robust and accurate cancer classification is critical in cancer treatment. Gene expression profiling is expected to enable us to diagnose tumors precisely and systematically. However, the classification task in this c...
详细信息
ISBN:
(纸本)0769523447
Robust and accurate cancer classification is critical in cancer treatment. Gene expression profiling is expected to enable us to diagnose tumors precisely and systematically. However, the classification task in this context is very challenging because of the curse of dimensionality and the small sample size problem. In this paper, we propose a novel method to solve these two problems. Our method is able to map gene expression data into a very low dimensional space and thus meets the recommended samples to features per class ratio. As a result, it can be used to classify new samples robustly with low and trustable (estimated) error rates. The method is based on linear discriminant analysis (LDA). However, the conventional LDA requires that the within-class scatter matrix S/sub w/ be nonsingular. Unfortunately, S/sub w/ is always singular in the case of cancer classification due to the small sample size problem. To overcome this problem, we develop a generalized linear discriminant analysis (GLDA) that is a general, direct, and complete solution to optimize Fisher's criterion. GLDA is mathematically well-founded and coincides with the conventional LDA when S/sub w/ is nonsingular. Different from the conventional LDA, GLDA does not assume the nonsingularity of S/sub w/, and thus naturally solves the small sample size problem. To accommodate the high dimensionality of scatter matrices, a fast algorithm of GLDA is also developed. Our extensive experiments on seven public cancer datasets show that the method performs well. Especially on some difficult instances that have very small samples to genes per class ratios, our method achieves much higher accuracies than widely used classification methods such as support vector machines, random forests, etc.
暂无评论