This study describes how to develop a machine learning system for the translation of Indian languages (Hindi, Gujarati, and Punjabi) using a form of deep neural network (DNN). We are employing a form of RNN called lon...
详细信息
An evidence file is a bit-stream copy of any digital storage media or a hard disk partition. Retrieving files from these evidence files without the intervention of file system is quite challenging as the storage locat...
详细信息
With the rapid and recent development of Internet of things (IoT), Bigdata, the most fundamental challenge is to explore the large volume of data from heterogeneous data sources (logs, audio, video, reports, news, soc...
详细信息
In this paper, we present a unimodular loop transformation called rotation as a simple, systematic and uniform method for partitioning the iteration spaces of doubly nested loops for execution on distributed memory mu...
详细信息
For developing and using various features provided by the internet by all the sections of the society, it is important to make the technology accessible irrespective of the language. So, the translation between the la...
详细信息
The whole world is overloaded via innumerable data like scientific data, financial data, mathematical data, geographic data, employment data, and other various kinds of data. Manually analyzing or summarizing this dat...
详细信息
In traditional crowdsourcing, workers are expected to provide independent answers to tasks so as to ensure the diversity of answers. However, recent studies show that the crowd is not a collection of independent worke...
详细信息
In traditional crowdsourcing, workers are expected to provide independent answers to tasks so as to ensure the diversity of answers. However, recent studies show that the crowd is not a collection of independent workers, but instead that workers communicate and collaborate with each other. To pursue more rewards with little effort, some workers may collude to provide repeated answers, which will damage the quality of the aggregated results. Nonetheless, there are few efforts considering the negative impact of collusion on result inference in crowdsourcing. In this paper, we are specially concerned with the Collusion-Proof result inference problem for general crowdsourcing tasks in public platforms. To that end, we design a metric, the worker performance change rate, to identify the colluded answers by computing the difference of the mean worker performance before and after removing the repeated answers. Then we incorporate the collusion detection result into existing result inference methods to guarantee the quality of the aggregated results even with the occurrence of collusion behaviors. With real-world and synthetic datasets, we conducted an extensive set of evaluations of our approach. The experimental results demonstrate the superiority of our approach in comparison with the state-of-the-art methods.
In this paper we present ALDIMS, a language that combines the expressibility of general functional (MIMD) parallelism with compact expressibility of data (SPMD) parallelism. It uses distributed data structures for spe...
A Part of Speech classifier is an important tool that is used to develop many NLP tasks. In this paper we described Deep Neural Network based architecture, that given a sentence, outputs a part-of-speech tag sequence....
详细信息
暂无评论