In the past, the field of CSCW has concentrated on providing support for the cooperative and coordination aspects of human work with the help of networked computer systems. The dominant issues now are the integration ...
详细信息
In the past, the field of CSCW has concentrated on providing support for the cooperative and coordination aspects of human work with the help of networked computer systems. The dominant issues now are the integration of different existing tools into comprehensive CSCW systems and the interoperation of heterogeneous CSCW systems. Work platforms based on services are providing new impulses for this integration: considerable contributions to integration can be made by standardizing the component parts offered as services in a globally networked system. The resulting increased reusability cuts costs. The paper presents a concept for a work platform for groupware applications based on services within a distributed system.
The development of binary classifiers to identify highly error-prone or high maintenance cost components is increasing in the softwareengineering quality modeling literature and in practice. One approach for evaluati...
详细信息
The development of binary classifiers to identify highly error-prone or high maintenance cost components is increasing in the softwareengineering quality modeling literature and in practice. One approach for evaluating these classifiers is to determine their ability to predict the classes of unseen cases, i.e., predictive validity. A chi-square statistical test has been frequently used to evaluate predictive validity. We illustrate that this test has a number of disadvantages. The disadvantages include a difficulty in using the results of the test to determine whether a classifier is a good predictor, demonstrated through a number of examples, and a rather conservative Type I error rate, demonstrated through a Monte Carlo simulation. We present an alternative test that has been used in the social sciences for evaluating agreement with a "gold standard". The use of this alternative test is illustrated in practice by developing a classification model to predict maintenance effort for an object oriented system, and evaluating its predictive validity on data from a second object-oriented system in the same environment.
In order to control inspections, the number of remaining defects in software artifacts after their inspection should be estimated. This would allow, for example, deciding whether a reinspection of supposedly faulty ar...
详细信息
In order to control inspections, the number of remaining defects in software artifacts after their inspection should be estimated. This would allow, for example, deciding whether a reinspection of supposedly faulty artifacts is necessary. Several studies in softwareengineering have considered capture-recapture models for performing such estimations. These models were initially developed for estimating animal abundance in wildlife research. In addition to these models, researchers in softwareengineering have recently proposed an alternative approach, namely the detection profile method (DPM), that makes less restrictive assumptions than some capture-recapture models and that show promise in terms of estimation accuracy. The authors investigate how to select between these two approaches for defect content estimation. As a result of this investigation they present a selection procedure taking into account the strength and weaknesses of the two methods. A weakness known for capture-recapture models is that they tend to provide extreme under/over estimation. The existence of such extreme outliers can discourage their use because their consequences in terms of wasted effort or defect slippage can be substantial, and therefore it is not clear,whether a particular estimate can be trusted. The evaluation of the selection procedure with actual inspection data indicates that this selection procedure provides the same accuracy as capture-recapture models alone and DPM alone, and most importantly does not exhibit extreme over/under estimation.
Current cost estimation techniques have a number of drawbacks. For example, developing algorithmic models requires extensive past project data. Also, off-the-shelf models have been found to be difficult to calibrate b...
详细信息
Current cost estimation techniques have a number of drawbacks. For example, developing algorithmic models requires extensive past project data. Also, off-the-shelf models have been found to be difficult to calibrate but inaccurate without calibration. Informal approaches based on experienced estimators depend on estimators' availability and are not easily repeatable, as well as not being much more accurate than algorithmic techniques. We present a method for cost estimation that combines aspects of algorithmic and experiential approaches (referred to as COBRA, COst estimation, Benchmarking, and Risk Assessment). We find through a case study that cost estimates using COBRA show an average ARE of 0.09. Although we do not have the room to describe the benchmarking and risk assessment parts, the reader will find detailed information in (Briand et al., 1997).
The paper contributes a controlled experiment to characterize the effects of code inspection and structural testing on software quality. Twenty subjects performed sequentially code inspection and structural testing us...
详细信息
The paper contributes a controlled experiment to characterize the effects of code inspection and structural testing on software quality. Twenty subjects performed sequentially code inspection and structural testing using different coverage values as test criteria on a C-code module. The results of this experiment show that inspection significantly outperforms the defect detection effectiveness of structural testing. Furthermore, the experimental results indicate little evidence to support the hypothesis that structural testing detects different defects, that is, defects of a particular class, that were missed by inspection and vice versa. These findings suggest that inspection and structural testing do not complement each other well. Since 39 percent (on average) of the defects were not detected at all, it might be more valuable to apply inspection, together with other testing techniques, such as boundary value analysis, to achieve a better defect coverage. We are aware that a single experiment has many limitations and often does not provide conclusive evidence. Hence, we consider this experiment a starting point and encourage other researchers to investigate the optimal mix of defect detection techniques.
Counts of defects found during the various defect defection activities in software projects and their classification provide a basis for product quality evaluation and process improvement. However, since defect classi...
详细信息
Counts of defects found during the various defect defection activities in software projects and their classification provide a basis for product quality evaluation and process improvement. However, since defect classifications are subjective, it is necessary to ensure that they are repeatable (i.e., that the classification is not dependent on the individual). We evaluate a slight adaptation of a commonly used defect classification scheme that has been applied in IBM's Orthogonal Defect Classification work, and in the SEI's Personal software Process. The evaluation utilizes the Kappa statistic. We use defect data from code inspections conducted during a development project. Our results indicate that the classification scheme is in general repeatable. We further evaluate classes of defects to find out if confusion between some categories is more common, and suggest a potential improvement to the scheme.
COTS-based development raises a number of issues that are usually not encountered in more classical software development settings. For instance, how to select appropriate COTS? (e.g., middleware for distributed applic...
详细信息
COTS-based development raises a number of issues that are usually not encountered in more classical software development settings. For instance, how to select appropriate COTS? (e.g., middleware for distributed applications, GUI builders). How to integrate COTS selection and evaluation in the requirements engineering and design phases? In this position paper, we address a number of questions regarding COTS evaluation and selection. We first try to better state why this problem is an important one, and why it is likely to become pervasive in the near future. Then, we attempt to clearly identify the difficulties associated with evaluating and selecting COTS. A number of solutions that may alleviate these problems are then discussed. Last, we will list a number of open research questions.
ISO/IEC 15504 is an emerging international standard for software process assessment. It has undergone a major change in the rating scale used to measure the capability of processes. The objective of this paper is to p...
详细信息
ISO/IEC 15504 is an emerging international standard for software process assessment. It has undergone a major change in the rating scale used to measure the capability of processes. The objective of this paper is to present a follow up evaluation of the internal consistency of this process capability scale. Internal consistency is a form of reliability of a subjective measurement instrument. A previous study evaluated the internal consistency of the first version of the ISO/IEC 15504 document set (also known as SPICE version 1). In the current study we evaluate the internal consistency of the second version (also known as ISO/IEC PDTR 15504). Our results indicate that the internal consistency of the capability dimension did not deteriorate, and that it is still sufficiently high for practical purposes. Furthermore, we identify that the capability scale has two dimensions that we termed "Process Implementation" and "Quantitative Process Management".
Little is known about requirements engineering practices in small and medium enterprises (SMEs). This paper summarises the results of a workshop on requirements engineering held with practitioners from 10 SMEs. The cu...
Little is known about requirements engineering practices in small and medium enterprises (SMEs). This paper summarises the results of a workshop on requirements engineering held with practitioners from 10 SMEs. The current state-of-the-practice, as reported by the practitioners, differs significantly, and so do their individual problems due to contextual issues (e.g., in-house vs. contract development, type of product). The participants were presented with a set of current requirements engineering principles, techniques, methods, and tools. Important concepts were motivated by small case studies and experiments which we employed as a vehicle for technology transfer. The design of these and their results are described, as well as the practitioner’s rating of the techniques and methods.
暂无评论