SDC is currently engaged in producing a 舠JOVIAL舡 compiler for each of its computers. The compilers are being produced by a bootstrapping method similar to that described in a report on UNCOL [1]. In discussing the s...
详细信息
automatic coding of short text responses opens new doors in assessment. We implemented and integrated baseline methods of natural language processing and statistical modelling by means of software components that are ...
详细信息
automatic coding of short text responses opens new doors in assessment. We implemented and integrated baseline methods of natural language processing and statistical modelling by means of software components that are available under open licenses. The accuracy of automatic text coding is demonstrated by using data collected in the Programme for International Student Assessment (PISA) 2012 in Germany. Free text responses of 10 items with n = 41, 990 responses in total were analyzed. We further examined the effect of different methods, parameter values, and sample sizes on performance of the implemented system. The system reached fair to good up to excellent agreement with human codings (.458 <= kappa <= .959): Especially items that are solved by naming specific semantic concepts appeared properly coded. The system performed equally well with n >= 1, 661 and somewhat poorer but still acceptable down to n = 249. Based on our findings, we discuss potential innovations for assessment that are enabled by automatic coding of short text responses.
Smart contract applications based on Ethereum blockchain have been widely used in many fields. They are developed by professional developers using specialized programming languages like solidity. It requires high requ...
详细信息
Smart contract applications based on Ethereum blockchain have been widely used in many fields. They are developed by professional developers using specialized programming languages like solidity. It requires high requirements on knowledge of the specialized field and the proficiency in contract programming Thus, it is hard for normal users to design a usable smart contract based on their own demands. Most current studies about smart contracts focus on the security of coding while lack of friendly tools for users to design the specialized templates of contracts coding. This paper provides a visual and user-defined smart contract designing systems. It makes the development of domain-specific smart contracts simpler and visualization for contract users. The system implements the domain-specific features extraction about the crawled data sets of smart contract programs by TF-IDF and K-means++ clustering algorithm. Then, it achieves the automatic generation of unified basic function codes by Char-RNN (improved by LSTM) based on the domain-specific features. The system adopts Google Blockly and links the generated codes with UI controls. Finally, it provides a set of specialized templates of basic functions for users to design smart contracts by the friendly interface. It reduces the difficulty and costs of contract programming. The paper offers a case study to design contracts by users. The designed contracts were validated on the existing system to implement the food trading and traders' credit evaluation. The experimental results show that the designed smart contracts achieve good integration with the existing system and they can be deployed and compiled successfully.
automatic approaches to coding party manifestos and other political texts have become more widespread. This research note addresses the question to what extent the source language of a text affects the results. To do ...
详细信息
automatic approaches to coding party manifestos and other political texts have become more widespread. This research note addresses the question to what extent the source language of a text affects the results. To do so, Swiss manifestos in German and French are coded automatically, comparing a keyword-based dictionary approach and Wordscores. Because of language differences, both stemming and particularly stop words are important to obtain comparable results for Wordscores. If both are used, the predicted scores are almost identical in both languages. With the right preparations, the challenge of language differences can thus be overcome.
An automated method for coding hospital patient records is described, which uses a combination of simple computer programs and a structured dictionary. The system accepts unrestricted medical language in French as inp...
详细信息
An automated method for coding hospital patient records is described, which uses a combination of simple computer programs and a structured dictionary. The system accepts unrestricted medical language in French as input, and generates code combinations which represent the semantic values of the original statements. The coding structure is the trial version of the Systematized Nomenclature of Medicine (SNOMED). A prototype system has been developed from a 20,000 word corpus of obstetrical and gynecological data, recorded from patient records written by French-speaking Quebec physicians. Preliminary test results on a small sample show that the system generates accurate codes and code combinations for 97.9% of the data tested. The programs were written in Fortran IV and implemented on a DEC PDP-9 computer. The dictionary keys are word segments and the entries contain operators, word segments and SNOMED codes. The dictionary structure permits a single entry to generate multiple SNOMED codes and to handle the types of linguistic paraphrase found in hospital records. The SNOMED codes and code combinations constitute a metalanguage which provides the means for data compression and uniform representation of natural language medical data.
We propose and empirically evaluate a theoretical framework of how to use coding guides for automatic coding (scoring) and how, in turn, automatic coding can enhance the use of coding guides. We adopted a recently des...
详细信息
ISBN:
(纸本)9781467384933
We propose and empirically evaluate a theoretical framework of how to use coding guides for automatic coding (scoring) and how, in turn, automatic coding can enhance the use of coding guides. We adopted a recently described baseline approach to automatically classify responses. Well-established coding guides from PISA, comprising reference responses, and its German sample from 2012 were used for evaluation. Ten items with 41,990 responses at total were analyzed. Results showed that (1) responses close to the cluster centroid constitute prototypes, (2) automatic coding can improve coding guides, while (3) the proposed procedure leads to unreliable accuracy for small numbers of clusters but promising agreement to human coding for higher numbers. Further analyses are still to be done to find the optimal balance of the implied coding effort and model accuracy.
Digestive endoscopies, along with all medical procedures in France are coded with the CCAM. This task is done by the physicians, is time-consuming and requires a good knowledge of the terminology besides a medical kno...
详细信息
ISBN:
(纸本)9781614999218;9781614999201
Digestive endoscopies, along with all medical procedures in France are coded with the CCAM. This task is done by the physicians, is time-consuming and requires a good knowledge of the terminology besides a medical knowledge. This method offers an automatic coding of endoscopic procedures from free-text reports. Thanks to a supervised learning method, the reports are coded with an average precision and recall of 0.92 on a 1639 texts corpus.
An algorithm for automatic coding of pathology reports using a multiaxial codification (ADICAP) is described and evaluated. It extracts "significant words" or expressions from a corpus and records the statis...
详细信息
ISBN:
(纸本)9781586038649
An algorithm for automatic coding of pathology reports using a multiaxial codification (ADICAP) is described and evaluated. It extracts "significant words" or expressions from a corpus and records the statistical relationships between them and the modalities of the different axes. Different weighting functions are evaluated. With the best settings, in more than two cases out of three the correct modality was found among the top 5 list of candidates, except for the "organ" axis. Several ways of improvement are discussed especially regarding the poor results on the "organ" axis. Perspectives of a two stages assembling algorithm completing this first step are proposed.
暂无评论