The proceedings contain 27 papers. The topics discussed include: a computational evaluation of optimization solvers for CTA;flexible rounding based on consistent post-tabular stochastic noise;an investigation of model...
ISBN:
(纸本)9783642336263
The proceedings contain 27 papers. The topics discussed include: a computational evaluation of optimization solvers for CTA;flexible rounding based on consistent post-tabular stochastic noise;an investigation of model-based microdata masking for magnitude tabular data release;clustering-based categorical data protection;anonymization methods for taxonomic microdata;hybrid microdata via model-based clustering;logistic regression with variables subject to post randomization method;valid statistical inference on automatically matched files;generating useful test data for complex linked employer-employee datasets;when excessive perturbation goes wrong and why IPUMS-international relies instead on sampling, suppression, swapping, and other minimally harmful methods to protect privacy of census microdata;achieving comparability of earnings;and designing multiple releases from the small and medium enterprises survey.
This paper considers a scenario where two parties having private databases wish to cooperate by computing a data mining algorithm on the union of their databases without revealing any unnecessary information. In parti...
详细信息
We prove that, with respect to a database query response privacy mechanism employing output perturbation with i.i.d. random noise addition, an adversary can, allowed a sufficiently large number of queries, exactly det...
详细信息
We develop a statistical process for determining a confidence set for an unknown bipartite matching. It requires only modest assumptions on the nature of the distribution of the data. The confidence set involves a set...
详细信息
In this paper, we evaluate empirically the quality of statistical inference from differentially-private synthetic contingency tables. We compare three methods: histogram perturbation, the Dirichlet-Multinomial synthes...
详细信息
We develop the core of a method for solving the data archive and curation problem that confronts the custodians of restricted-access research data and the scientific users of such data. Our solution recognizes the dua...
详细信息
IPUMS-international disseminates population census microdata at no cost for 69 countries. Currently, a series of 212 samples totaling almost a half billion person records are available to researchers. Registration is ...
详细信息
The need of improving the privacy on public datasets is becoming more and more important because the number of public available datasets is growing very fast. This forced the continuous research to find better protect...
详细信息
Many statistical agencies nowadays operate or envision tools for ad hoc creation and visualization of aggregate tables. Such tools can indeed increase the efficiency of those parts of the data production process that ...
详细信息
The question of microdata access is solved in most of the EU Member States and the access to national data is basically possible in one or another way. This infrastructure can be used now to satisfy the strong demand ...
详细信息
暂无评论