ISBN: 0769525466 (print)
Documentation is an integral part of a software system. It contains the information that is necessary to effectively and successfully develop, use, and maintain a system. In practice, however, the creation of appropriate documentation is largely neglected. This paper investigates the reasons for this neglect, presents view-based software documentation, our approach to improve the current situation, and reports on empirical evidence in support of the presented approach. Because the quality of documentation depends on its usage, view-based software documentation exploits existing software modeling techniques to provide all users of documentation with the documentation they require for performing their tasks. View-based software documentation has been empirically validated in a series of experiments and case studies that showed that the approach improves the completeness, correctness, and usefulness of produced and maintained documentation. One of the experiments is described here in more detail.
ISBN: 0780386949 (print)
In multicluster systems, and more generally in grids, jobs may require co-allocation, i.e., the simultaneous allocation of resources such as processors and input files in multiple clusters. While such jobs may have reduced runtimes because they have access to more resources, waiting for processors in multiple clusters and for the input files to become available in the right locations may introduce inefficiencies. In previous work we studied only processor co-allocation, through simulations. Here, we extend this work with an analysis of the performance, in a real testbed, of our prototype Processor and Data Co-Allocator with the Close-to-Files (CF) job-placement algorithm. CF tries to place job components on clusters that have enough idle processors and are close to the sites where the input files reside. We present a comparison of the performance of CF and the Worst-Fit job-placement algorithm, with and without file replication, achieved with our prototype. Our most important findings are that CF with replication works best, and that the utilization in our testbed can be driven to about 80%.
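The two placement policies named in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the single `distance_to_files` table (one cost per cluster, standing in for "closeness" to the input files), and the one-component-at-a-time greedy loop are all assumptions made for illustration. Worst-Fit is taken here in its usual sense of picking the cluster with the most idle processors.

```python
def place_worst_fit(components, idle):
    """Place each component on the cluster with the most idle processors.

    components: list of processor counts, one per job component.
    idle: dict mapping cluster name -> idle processors (hypothetical data).
    Returns a list of cluster names, or None if some component does not fit.
    """
    idle = dict(idle)  # work on a copy so the caller's view is unchanged
    placement = []
    for size in components:
        cluster = max(idle, key=lambda c: idle[c])
        if idle[cluster] < size:
            return None
        idle[cluster] -= size
        placement.append(cluster)
    return placement


def place_close_to_files(components, idle, distance_to_files):
    """Place each component on the closest cluster that still has room.

    distance_to_files: dict mapping cluster name -> assumed cost of reaching
    the input files from that cluster (0 means the files are local).
    """
    idle = dict(idle)
    placement = []
    for size in components:
        candidates = [c for c in idle if idle[c] >= size]
        if not candidates:
            return None
        cluster = min(candidates, key=lambda c: distance_to_files[c])
        idle[cluster] -= size
        placement.append(cluster)
    return placement


idle = {"A": 10, "B": 16, "C": 4}
distance = {"A": 2, "B": 1, "C": 0}  # hypothetical: the files reside at C
print(place_worst_fit([4, 4, 4], idle))
print(place_close_to_files([4, 4, 4], idle, distance))
```

In this toy setting Worst-Fit spreads load by idle capacity alone, while the CF-style policy fills the cluster holding the files first and then moves outward.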
ISBN: 9780769528601 (print)
Software development relies heavily on reusable libraries and software components. For correct use of the provided API, proper documentation is needed. API usage is often demonstrated by giving example applications and code samples. In this paper we propose an approach for mining such usage scenarios from run-time communication between sample applications and the API. This is done automatically by first monitoring the API usage of sample applications, then filtering the generated traces, and finally synthesizing the usage scenarios and presenting them as well-formed UML2 sequence diagrams. Such usage scenarios support the software engineer in comprehending the usage of the API. With proper tool support they can also be used for validating other applications' API usage and for generating code for a new application using the same API.
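The monitor, filter, and synthesize steps described in this abstract can be sketched roughly as follows. This is only an illustration under assumptions: the `Recorder` proxy, the `filter_trace` and `render_sequence` helpers, the fixed caller name "App", and the plain-text arrow notation are all hypothetical and are not the paper's tooling or real UML2 output.

```python
class Recorder:
    """Wraps an object and logs every method call made through the wrapper."""

    def __init__(self, target, name, trace):
        self._target = target
        self._name = name
        self._trace = trace

    def __getattr__(self, attr):
        # Only reached for attributes not found on the Recorder itself.
        value = getattr(self._target, attr)
        if not callable(value):
            return value

        def wrapper(*args, **kwargs):
            # Step 1 (monitoring): record caller, callee, and method name.
            self._trace.append(("App", self._name, attr))
            return value(*args, **kwargs)

        return wrapper


def filter_trace(trace, ignored_methods):
    """Step 2 (filtering): drop calls considered noise for the scenario."""
    return [event for event in trace if event[2] not in ignored_methods]


def render_sequence(trace):
    """Step 3 (synthesis): emit one text line per message, in call order."""
    return "\n".join(
        f"{caller} -> {callee}: {method}()" for caller, callee, method in trace
    )


trace = []
api = Recorder([], "List", trace)  # a built-in list stands in for "the API"
api.append(1)
api.count(1)
api.append(2)
print(render_sequence(filter_trace(trace, {"count"})))
```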
ISBN: 3540454853 (print)
Ordering information is a critical task for multi-document summarization (MDS) because it heavily influences the coherence of the generated summary. In this paper, we propose a hybrid model for sentence ordering in extractive multi-document summarization that combines four relations between sentences: the chronological, positional, topical, and dependent relations. This model regards each sentence as a vertex and each combined relation as an edge of a directed graph, on which an approximately optimal ordering can be generated with PageRank analysis. Evaluation of our hybrid model shows a significant improvement in ordering over strategies that omit some of the relations, and the results also indicate that the hybrid model is robust for articles of different genres.
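A rough sketch of ranking sentences on a weighted directed graph with PageRank is given below. The abstract does not say how the four relations are combined or how an ordering is read off the scores, so everything specific here is an assumption: `precede[i][j]` is a hypothetical combined strength of evidence that sentence i should come before sentence j, each later sentence "votes" for the sentences that should precede it, and the output order is simply descending score.

```python
def pagerank(weights, damping=0.85, iterations=50):
    """Power-iteration PageRank; weights[i][j] >= 0 is the edge i -> j."""
    n = len(weights)
    rank = [1.0 / n] * n
    out_sum = [sum(row) for row in weights]
    for _ in range(iterations):
        new_rank = []
        for j in range(n):
            incoming = 0.0
            for i in range(n):
                if out_sum[i] > 0:
                    incoming += rank[i] * weights[i][j] / out_sum[i]
                else:
                    # Dangling vertex: spread its rank evenly.
                    incoming += rank[i] / n
            new_rank.append((1 - damping) / n + damping * incoming)
        rank = new_rank
    return rank


def order_sentences(precede):
    """Return sentence indices, earliest first (assumed reading of scores)."""
    n = len(precede)
    # Edge j -> i carries precede[i][j]: sentence j endorses i coming earlier.
    votes = [[precede[i][j] for i in range(n)] for j in range(n)]
    scores = pagerank(votes)
    return sorted(range(n), key=lambda s: scores[s], reverse=True)


# Hypothetical relation strengths: 0 before 1, 1 before 2, 0 before 2.
precede = [
    [0.0, 1.0, 1.0],
    [0.0, 0.0, 1.0],
    [0.0, 0.0, 0.0],
]
print(order_sentences(precede))
```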
ISBN: 0780388690 (print)
We report on experiments in adapting document categorization techniques to provide for implementation in high-speed hardware.(1,2) Because resources are scarce, it is important to have a small set of robust and maximally informative variables over which learning can occur. We generate variables using information-theoretic clustering. The resulting performance is on par with general-purpose computing implementations which are able to take advantage of large amounts of time and memory. We conclude that custom high-speed hardware for document categorization can be made very accurate. We also believe that some of the strengths of information-theoretic data analysis techniques are brought out.
ISBN: 0769526918 (print)
This paper investigates commonalities and differences between Agile and documentation-driven approaches in managing uncertainty in requirements gathering. The research method is a survey conducted by interviewing sixteen project managers of Italian software companies, eight using Agile methods and eight using documentation-driven methods. The results show that Agile and documentation-driven companies view the problem of changing requirements and the related uncertainty differently; as a result, they manage requirements gathering and the relationship with the customer differently.
ISBN: 9783540735984 (print)
Living guidelines are documents presenting up-to-date and state-of-the-art knowledge to practitioners. For guidelines to be implemented with computer support, they first have to be formalized in a computer-interpretable form. Due to the complexity of such formats, the formalization process is challenging, burdensome, and time-consuming. The LASSIE methodology supports this task by formalizing guidelines in several steps from the textual form to the guideline representation language Asbru using a document-centric approach. LASSIE uses Information Extraction techniques to accomplish these steps semi-automatically. We apply LASSIE to support the implementation of living guidelines. Based on a living guideline published by the Scottish Intercollegiate Guidelines Network (SIGN), we show that adaptations of previously formalized guidelines can be accomplished easily and quickly. With this new approach, only new and changed text parts have to be modeled. Furthermore, models can be inherited from previously modeled guideline versions that were added by domain experts.
ISBN: 076952611X (print)
In today's fast-paced business world, where competition and technology are at their zenith, software development companies need to improve their quality standards while also reducing operating costs. Various developments are under way to achieve these challenging objectives. In the recent past, agile methodologies have emerged as one of the most efficient approaches in software development. This is especially true of eXtreme Programming (XP), which incorporates the test-first approach; recent research shows that Test Driven Development (TDD) emerged from this concept. TDD is based on first formalizing a requirement as a test and then writing code that passes the test. Our research provides a mechanism to reduce the cost of testing, which is mainly due to troublesome tests that fail again and again. We used TDD, analyzed the problem, and proposed a workable solution. Test Driven Development is a technique that encourages less documentation than traditional methods, which causes many difficulties for developers. To reduce the burden on developers, we propose some documentation steps.
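The test-first cycle mentioned in this abstract (formalize a requirement as a test, then write code that passes it) can be shown with a tiny example. The requirement, the `apply_discount` name, and the 10% rule are invented purely for illustration and do not come from the paper.

```python
# Step 1: formalize the (hypothetical) requirement as a test.
# "Orders of 100.00 or more get 10% off; smaller orders pay full price."
# Amounts are integer cents to avoid floating-point rounding in the checks.
def test_apply_discount():
    assert apply_discount(10000) == 9000   # exactly at the threshold
    assert apply_discount(9999) == 9999    # just below the threshold
    assert apply_discount(25000) == 22500  # well above the threshold


# Step 2: write just enough code to make the test pass.
def apply_discount(total_cents):
    """Return the amount due in cents after the assumed discount rule."""
    if total_cents >= 10000:
        return total_cents - total_cents // 10
    return total_cents


# Step 3: run the test; it raises AssertionError if the requirement is unmet.
test_apply_discount()
print("requirement satisfied")
```

Written this way, the test doubles as a small piece of executable documentation of the requirement.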
ISBN: 9780769527017 (print)
Peculiar data are objects that are relatively few in number and significantly different from the other objects in a data set. In this paper we propose the PDD framework for detecting multiple categories of peculiar data. This framework provides an extensible set of perspectives for viewing data, currently including viewing data as a set of records, attributes, frequencies, intervals, sequences, or sequences of changes. By using these six views of the data, multiple categories of peculiar data can be detected to reveal different aspects of the data. For each view, the framework provides an extensible set of peculiarity measures to detect outliers and other kinds of peculiar data. The PDD framework has been implemented for Oracle and Access. Experiments are reported for data sets concerning Regina weather and NHL hockey.
ISBN: 0780370805 (print)
This paper proposes a new mechanism for document similarity search that uses an indexing structure called signature tables. The signature-table mechanism was originally invented for similarity search over market basket data, and in this paper we apply it to document data. Since the characteristics of document data are clearly different from those of market basket data, the performance of similarity search is not satisfactory when the mechanism is naively applied to document data. In this paper, we explain why the naive application decreases efficiency, and we propose some techniques for improving performance. The results of simulations using a real document data set show that the proposed mechanism achieves good performance.