Search results - Inner Mongolia University Library

An Introduction to Verification of Visualization Techniques

2015

Authors: Tiago Etiene, Robert M. Kirby, Claudio T. Silva

ISBN: (digital) 9781627058346

As we increase our reliance on computer-generated information, often using it as part of our decision-making process, we must devise tools to assess the correctness of that information. Consider, for example, software embedded in vehicles, used for simulating aircraft performance, or used in medical imaging. In those cases, software correctness is of paramount importance as there's little room for error. Software verification is one of the tools available to attain such goals. Verification is a well-known and widely studied subfield of computer science and computational science; its goal is to increase confidence in a software implementation by verifying that the software does what it is supposed to do. The goal of this book is to introduce the reader to software verification in the context of visualization. In the same way we became more dependent on commercial software, we have also increased our reliance on visualization software. The reason is simple: visualization is the lens through which users can understand complex data, and as such it must be verified. The explosion in our ability to amass data requires tools not only to store and analyze data, but also to visualize it. This book comprises six chapters. After an introduction to the goals of the book, we present a brief description of both worlds of visualization (Chapter 2) and verification (Chapter 3). We then proceed to illustrate the main steps of the verification pipeline for visualization algorithms. We focus on two classic volume visualization techniques, namely, Isosurface Extraction (Chapter 4) and Direct Volume Rendering (Chapter 5). We explain how to verify implementations of those techniques and report the latest results in the field of verification of visualization techniques. The last chapter concludes the book and highlights new research topics for the future.


Author Correction: Reference component analysis of single-cell transcriptomes elucidates cellular heterogeneity in human colorectal tumors


Nature Genetics, 2018, vol. 50, no. 12, p. 1754

Authors: Huipeng Li, Elise T Courtois, Debarka Sengupta, Yuliana Tan, Kok Hao Chen, Jolene Jie Lin Goh, Say Li Kong, Clarinda Chua, Lim Kiat Hon, Wah Siew Tan, Mark Wong, Igor Cima, Min-Han Tan, Lawrence J K Wee, Axel M Hillmer, Iain Beehuat Tan, Paul Robson, Shyam Prabhakar

Affiliations: Computational and Systems Biology Genome Institute of Singapore Singapore Singapore. Developmental Cellomics Laboratory Genome Institute of Singapore Singapore Singapore. Department of Computer Science and Engineering and Center for Computational Biology Indraprastha Institute of Information Technology Delhi India. Synthetic Biology Genome Institute of Singapore Singapore Singapore. Cancer Therapeutics and Stratified Oncology Genome Institute of Singapore Singapore Singapore. Department of Medical Oncology National Cancer Centre Singapore Singapore Singapore. Department of Pathology Singapore General Hospital Singapore Singapore. Department of Colorectal Surgery Singapore General Hospital Singapore Singapore. Institute of Bioengineering and Nanotechnology Singapore Singapore. Data Analytics Department Institute for Infocomm Research Singapore Singapore. Cancer Therapeutics and Stratified Oncology Genome Institute of Singapore Singapore Singapore. iain.tan.b.h@singhealth.com.sg. Department of Medical Oncology National Cancer Centre Singapore Singapore Singapore. iain.tan.b.h@singhealth.com.sg. Program in Cancer and Stem Cell Biology Duke-NUS Medical School Singapore Singapore. iain.tan.b.h@singhealth.com.sg. Developmental Cellomics Laboratory Genome Institute of Singapore Singapore Singapore. paul.robson@***. The Jackson Laboratory for Genomic Medicine Farmington Connecticut USA. paul.robson@***. Department of Genetics and Genome Sciences Institute for Systems Genomics University of Connecticut Farmington Connecticut USA. paul.robson@***. Department of Biological Sciences National University of Singapore Singapore Singapore. paul.robson@***. Computational and Systems Biology Genome Institute of Singapore Singapore Singapore. prabhakars@gis.a-star.edu.sg.

In the version of the article published, the author list is not accurate. Igor Cima and Min-Han Tan should have been authors, appearing after Mark Wong in the author list, while Paul Jongjoon Choi should not have been listed as an author. Igor Cima and Min-Han Tan both have the affiliation Institute of Bioengineering and Nanotechnology, Singapore, Singapore, and their contributions should have been noted in the Author Contributions section as "I.C. preprocessed Primary Cell Atlas data with inputs from M.-H.T." The following description of the contribution of Paul Jongjoon Choi should not have appeared: "P.J.C. supported the smFISH experiments." In the 'RCA: global panel' section of the Online Methods, the following sentence should have appeared as the second sentence: "An expression atlas of human primary cells (the Primary Cell Atlas) was preprocessed similarly to in ref. 55," with new reference 55 (Cima, I. et al. Tumor-derived circulating endothelial cell clusters in colorectal cancer. Sci. Transl. Med. 8, 345ra89, 2016).


Using Feature Selection to Improve the Utility of Differentially Private Data Publishing


Procedia Computer Science, 2014, vol. 37, pp. 511-516

Authors: Yasser Jafer, Stan Matwin, Marina Sokolova

Affiliations: School of Electrical Engineering and Computer Science, University of Ottawa, Canada; Institute for Big Data Analytics, Dalhousie University, Canada; Institute for Computer Science, Polish Academy of Sciences; Faculty of Medicine, University of Ottawa, Canada

Protection of patients' privacy is an obligation enforced by laws and regulations in the US, Canada, and other jurisdictions. With the exponential growth in the exchange of personal health information (PHI) brought about by e-health, there is a need for smart algorithms that help the data publisher protect PHI. Among existing privacy models, differential privacy is considered one of the strongest privacy protection techniques because it does not make any assumption about the attacker's background knowledge. One way to achieve differential privacy in the non-interactive mode is to derive a contingency table of the raw data over the database domain, to add noise to each count, and to publish the resulting noisy table of counts. This approach, however, is not suitable for high-dimensional data with large domains, as the added noise substantially destroys the utility of the data. In this work, we show that when K-anonymity is preceded by feature selection, it is possible to obtain a contingency table with higher counts. As a result, when noise is added to satisfy differential privacy, its distorting effect is minimized and high utility of the data is preserved. We propose the TOP_Diff algorithm, which offers a trade-off between the anonymization level K and the privacy budget ε, and enables us to publish privacy-preserving datasets with high utility. Our approach is capable of handling both numerical and categorical features.
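
The noisy contingency table described in this abstract can be illustrated with a minimal Python sketch. This is not the authors' TOP_Diff algorithm. It assumes the standard Laplace mechanism with sensitivity 1 (neighboring datasets differ by adding or removing one record), and the function names and toy data are hypothetical.

```python
import random
from collections import Counter
from itertools import product


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponential draws with mean `scale`
    # follows a Laplace(0, scale) distribution.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def noisy_contingency_table(records, domains, epsilon, seed=None):
    """Return {cell: noisy count} for every cell of the full domain.

    Assumption: one record changes exactly one cell count by 1, so the
    noise scale is 1 / epsilon (sensitivity 1).
    """
    rng = random.Random(seed)
    counts = Counter(records)
    scale = 1.0 / epsilon
    # Noise is added to every cell, including cells whose true count is 0.
    return {cell: counts.get(cell, 0) + laplace_noise(scale, rng)
            for cell in product(*domains)}


# Hypothetical toy data: (age group, smoker) pairs.
records = [("young", "no"), ("young", "no"), ("old", "yes"), ("old", "no")]
domains = [("young", "old"), ("no", "yes")]
table = noisy_contingency_table(records, domains, epsilon=1.0, seed=0)
for cell, value in sorted(table.items()):
    print(cell, round(value, 2))
```

The number of cells is the product of the attribute domain sizes, so with many attributes most true counts are zero or very small while the noise scale stays at 1/ε. That is the utility problem the abstract points to, and it is why fewer features (and hence higher counts per cell) help.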

Keywords: Privacy; Feature Selection; K-anonymity; Differential Privacy; Classification


Extremal optimization-based semi-supervised algorithm with conflict pairwise constraints for community detection


International Conference on Advances in Social Network Analysis and Mining, ASONAM

Authors: Lei Li, Mei Du, Guanfeng Liu, Xuegang Hu, Gongqing Wu

Affiliations: School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, Anhui, China; Soochow Advanced Data Analytics Lab, Soochow University, Suzhou, Jiangsu, China

Research on community structure is key to analyzing network functionality and topology, so detecting and analyzing community structure is significant. When an actual system is abstracted into a network, especially a large-scale one, mistaken or missing connections between nodes are inevitable. In addition, in real applications we can sometimes obtain prior information in the form of pairwise constraints between nodes besides topology information, although these constraints may be inaccurate or conflicting. Such noise in the network-related information dramatically reduces the accuracy of community detection. Hence, in this paper, we introduce a dissimilarity index to determine the trustworthiness of pairwise constraints and to resolve conflicts among them. Then, focusing on community detection with false or conflicting connections, we propose a pairwise constrained structure-enhanced extremal optimization-based semi-supervised algorithm (PCSEO-SS). Experimental results on real and synthetic networks show that, compared with existing semi-supervised community detection approaches, PCSEO-SS can mitigate the problem of false or conflicting connections to some extent and detect the community structure more precisely.

Keywords: Communities; Optimization; Detection algorithms; Indexes; Accuracy; Algorithm design and analysis; Noise


Privacy-aware filter-based feature selection


IEEE International Conference on Big Data

Authors: Yasser Jafer, Stan Matwin, Marina Sokolova

Affiliations: School of Electrical Engineering and Computer Science, University of Ottawa, Canada; Institute for Big Data Analytics, Dalhousie University, Canada; Polish Academy of Sciences, Institute for Computer Science, Poland; Faculty of Medicine, University of Ottawa, Canada

ISBN: (print) 9781479956678

The large amount of digital information collected and stored in databases creates new opportunities for knowledge discovery and data mining. The datasets, however, may contain personally identifiable information that needs to be protected. Given the high dimensionality of many large datasets, dimensionality reduction such as feature selection becomes indispensable. In this work, we aim to incorporate privacy into the very process of feature selection and, as such, propose a privacy-aware filter-based feature selection method (PF-IFR). Our method enables data custodians to define a trade-off measure for controlling the amount of privacy and efficacy using filter-based feature selection techniques.

Keywords: Correlation; Privacy; Accuracy; Data privacy; Filtering algorithms; Educational institutions; Publishing


Methods for Evaluating Medical Tests and Biomarkers


Diagnostic and Prognostic Research, 2017, vol. 1, no. 1, pp. 1-34

Authors: Gowri Gopalakrishna, Miranda Langendam, Patrick Bossuyt, Mariska Leeflang, Rob Scholten, Anna Noel-Storr, James Thomas, Iain Marshall, Byron Wallace, Penny Whiting, Clare Davenport, Gowri GopalaKrishna, Isabel de Salis, Sue Mallett, Robert Wolff, Marie Westwood, Jos Kleinen, Richard Riley, Gary Collins, Hans Reitsma, Karel Moons, Antonia Zapf, Katharina Kramer, Annika Hoyer, Oliver Kuss, J. Ensor, R. D. Riley, J. J. Deeks, E. C. Martin, Gerta Rücker, Martin Schumacher, Susanne Steinhauser, Joie Ensor, Kym Snell, Brian Willis, Jon Deeks, Thomas Debray, Lavinia Ferrante di Ruffano, Sian Taylor-Phillips, Chris Hyde, Stuart A. Taylor, Gauraang Batnagar, Lavinia Ferrante Di Ruffano, Farah Seedat, Aileen Clarke, Sarah Byron, Frances Nixon, Rebecca Albrow, Thomas Walker, Carla Deakin, Zhivko Zhelev, Harriet Hunt, Yaling Yang, Lucy Abel, James Buchanan, Thomas Fanshawe, Bethany Shinkins, Laure Wynants, Sabine Van Huffel, Jan Verbakel, Dirk Timmerman, Ben Van Calster, Aeliko Zwinderman, Jason Oke, Jack O’Sullivan, Rafael Perera, Brian Nicholson, Hannah L. Bromley, Tracy E. Roberts, Adele Francis, Denniis Petrie, G. Bruce Mann, Kinga Malottki, Holly Smith, Lucinda Billingham, Alice Sitch, Oke Gerke, Mie Holm-Vilstrup, Eivind Antonsen Segtnan, Ulrich Halekoh, Poul Flemming Høilund-Carlsen, Bernard G. Francq, Jac Dinnes, Julie Parkes, Walter Gregory, Jenny Hewison, Peter Selby, Doug Altman, William Rosenberg, Julien Asselineau, Paul Perez, Aïssatou Paye, Emilie Bessede, Cécile Proust-Lima, Christiana Naaktgeboren, Joris de Groot, Johannes Reitsma, Anne Rutjes, Emmanuel Ogundimu, Jonathan Cook, Yannick Le Manach, Yvonne Vergouwe, Romin Pajouheshnia, Rolf Groenwold, Karen Moons, Linda Peelen, Bavo De Cock, Daan Nieboer, Ewout W. Steyerberg, Micael J. Pencina, Jennifer Cooper, Nick Parsons, Chris Stinton, Steve Smith, Andy Dickens, Rachel Jordan, Alexandra Enocson, David Fitzmaurice, Peymane Adab, Charles Boachie, Gaj Vidmar, Karoline Freeman, Martin Connock, Rachel Court, Carl Moons, Jessica Harris, Zoe Plummer, Barnaby Reeves, Chris Rogers, Andrew Mumford, Kurtis Lee, Veerle Verheyden, Gianni D. Angelini, Gavin J. Murphy, Jeremy Huddy, Melody Ni, George Hanna

Affiliations: Department of Clinical Epidemiology Biostatistics & Bioinformatics Academic Medical Center University of Amsterdam Amsterdam The Netherlands; Department of Clinical Epidemiology and Biostatistics Academic Medical Center University of Amsterdam Amsterdam Netherlands; Cochrane Netherlands Julius Center for Health Sciences and Primary Care University Medical Center Utrecht Utrecht Netherlands; Cochrane Dementia and Cognitive Improvement Group University of Oxford Oxford UK; EPPI-Centre Department of Social Science University College London London UK; Division of Health and Social Care Research King’s College London London UK; College of Computer and Information Science Northeastern University Boston USA; University Hospitals Bristol NHS Foundation Trust School of Social and Community Medicine Bristol UK; Institute of Applied Health Research University of Birmingham Birmingham UK; School of Social and Community Medicine University of Bristol Bristol UK; Kleijnen Systematic Reviews Ltd York UK; Research Institute for Primary Care and Health Sciences Keele University Keele UK; Centre for Statistics in Medicine Nuffield Department of Orthopaedics Rheumatology and Musculoskeletal Sciences University of Oxford Oxford UK; Julius Center for Health Sciences and Primary Care University Medical Center Utrecht Utrecht The Netherlands; Cochrane Netherlands University Medical Center Utrecht Utrecht The Netherlands; Julius Center for Health Sciences and Primary Care University Medical Center Utrecht 3584 CG Utrecht Netherlands; Department of Medical Statistics University Medical Center Göttingen Göttingen Germany; Institute for Biometry and Epidemiology German Diabetes Center Leibniz Institute for Diabetes Research at Heinrich Heine University Düsseldorf Germany; Manchester Pharmacy School University of Manchester Manchester UK; Institute for Medical Biometry and Statistics Faculty of Medicine and Medical Center – University of Freiburg Stefan-Meier-Str. 26 79104 Freiburg Germany


Highlights from the 11th ISCB Student Council Symposium 2015. Dublin, Ireland. 10 July 2015


BMC Bioinformatics, 2016, vol. 17 Suppl 3, no. 3, p. 95

Authors: Katie Wilkins, Mehedi Hassan, Margherita Francescatto, Jakob Jespersen, R. Gonzalo Parra, Bart Cuypers, Dan DeBlasio, Alexander Junge, Anupama Jigisha, Farzana Rahman, Griet Laenen, Sander Willems, Lieven Thorrez, Yves Moreau, Nagarajan Raju, Sonia Pankaj Chothani, C. Ramakrishnan, Masakazu Sekijima, M. Michael Gromiha, Paddy J Slator, Nigel J Burroughs, Przemysław Szałaj, Zhonghui Tang, Paul Michalski, Oskar Luo, Xingwang Li, Yijun Ruan, Dariusz Plewczynski, Giulia Fiscon, Emanuel Weitschek, Massimo Ciccozzi, Paola Bertolazzi, Giovanni Felici, Pieter Meysman, Manu Vanaerschot, Maya Berg, Hideo Imamura, Jean-Claude Dujardin, Kris Laukens, Westa Domanova, James R. Krycer, Rima Chaudhuri, Pengyi Yang, Fatemeh Vafaee, Daniel J. Fazakerley, Sean J. Humphrey, David E. James, Zdenka Kuncic

Affiliations: Plant Pathology and Plant-Microbe Biology Section School of Integrative Plant Science Cornell University Ithaca USA; Graduate Field of Computational Biology Cornell University Ithaca USA; School of Computing & Mathematics University of South Wales Cardiff UK; Department of Genome Biology for Neurodegenerative Diseases German Center for Neurodegenerative Diseases (DZNE) within the Helmholtz Association Germany; Department of Surgery Massachusetts General Hospital The Broad Institute of MIT and Harvard Boston USA; Department of Systems Biology Technical University of Denmark Kemitorvet Denmark; Protein Physiology Lab Facultad de Ciencias Exactas y Naturales Universidad de Buenos Aires Buenos Aires Argentina; Molecular Parasitology Unit (MPU) Institute of Tropical Medicine Antwerp Belgium; Advanced Database Research and Modeling (ADReM) research group University of Antwerp Antwerpen Belgium; Department of Computer Science University of Arizona Tucson USA; Department of Veterinary Clinical and Animal Sciences Center for non-coding RNA in Technology and Health University of Copenhagen Copenhagen Denmark; University College Dublin Dublin Ireland; Department of Electrical Engineering (ESAT) STADIUS Center for Dynamical Systems Signal Processing and Data Analytics KU Leuven Leuven Belgium; Minds Medical IT Department Leuven Belgium; Scientific Institute of Public Health (WIV-ISP) Platform of Biotechnology and Molecular Biology (PBB) Brussels Belgium; Department of Development and Regeneration @ Kulak KU Leuven Kortrijk Belgium; Department of Biotechnology Bhupat and Jyoti Metha School of Biosciences Indian Institute of Technology Madras Chennai India; Philips Research North America Briarcliff Manor USA; Global Scientific Information and Computing Center (GSIC) Tokyo Institute of Technology Tokyo Japan; Systems Biology Centre University of Warwick Senate House Coventry UK; Systems Biology Doctoral Training Centre University of Warwick Senate House Coventry UK; Center for B

A1 Highlights from the eleventh ISCB Student Council Symposium 2015. Katie Wilkins, Mehedi Hassan, Margherita Francescatto, Jakob Jespersen, R. Gonzalo Parra, Bart Cuypers, Dan DeBlasio, Alexander Junge, Anupama Jigisha, Farzana Rahman
O1 Prioritizing a drug’s targets using both gene expression and structural similarity. Griet Laenen, Sander Willems, Lieven Thorrez, Yves Moreau
O2 Organism specific protein-RNA recognition: A computational analysis of protein-RNA complex structures from different organisms. Nagarajan Raju, Sonia Pankaj Chothani, C. Ramakrishnan, Masakazu Sekijima, M. Michael Gromiha
O3 Detection of Heterogeneity in Single Particle Tracking Trajectories. Paddy J Slator, Nigel J Burroughs
O4 3D-NOME: 3D NucleOme Multiscale Engine for data-driven modeling of three-dimensional genome architecture. Przemysław Szałaj, Zhonghui Tang, Paul Michalski, Oskar Luo, Xingwang Li, Yijun Ruan, Dariusz Plewczynski
O5 A novel feature selection method to extract multiple adjacent solutions for viral genomic sequences classification. Giulia Fiscon, Emanuel Weitschek, Massimo Ciccozzi, Paola Bertolazzi, Giovanni Felici
O6 A Systems Biology Compendium for Leishmania donovani. Bart Cuypers, Pieter Meysman, Manu Vanaerschot, Maya Berg, Hideo Imamura, Jean-Claude Dujardin, Kris Laukens
O7 Unravelling signal coordination from large scale phosphorylation kinetic data. Westa Domanova, James R. Krycer, Rima Chaudhuri, Pengyi Yang, Fatemeh Vafaee, Daniel J. Fazakerley, Sean J. Humphrey, David E. James, Zdenka Kuncic

Keywords: Visceral Leishmaniasis; Feature Selection Method; Frequent Itemset Mining; Leishmania Donovani; Student Council


Novel SNP improves differential survivability and mortality in non-small cell lung cancer patients


BMC Genomics, 2014, vol. 15, no. 9, pp. 1-7

Authors: Mah, Tzia Liang; Yap, Xin Ning Adeline; Limviphuvadh, Vachiranee; Li, Nanpu; Sridharan, Srinath; Kuralmani, Vellaisemy; Feng, Mengling; Liem, Natalia; Adhikari, Sharmila; Yong, Wei Peng; Soo, Ross A; Maurer-Stroh, Sebastian; Eisenhaber, Frank; Tong, Joo Chuan

Affiliations: Institute for Infocomm Research, Data Analytics Department, 1 Fusionopolis Way #21-01 Connexis South Tower, Singapore 138632, Singapore; National University Health System, Department of Haematology-Oncology, 5 Lower Kent Ridge Road, Singapore 119074, Singapore; Bioinformatics Institute, 30 Biopolis Street #07-01 Matrix, Singapore 138671, Singapore; Nanyang Technological University (NTU), School of Biological Sciences (SBS), 60 Nanyang Drive, 637551, Singapore; Nanyang Technological University (NTU), School of Computer Engineering (SCE), 50 Nanyang Drive, 637553, Singapore; National University of Singapore (NUS), Department of Biological Sciences (DBS), 8 Medical Drive 4, 117597, Singapore; National University of Singapore, Department of Biochemistry, Yong Loo Lin School of Medicine, Singapore 117597, Singapore; Institute of High Performance Computing, 1 Fusionopolis Way #16-16 Connexis, 138632, Singapore; National University of Singapore, Cancer Science Institute of Singapore, Singapore

Background: Non-small cell lung cancer (NSCLC) is a major cause of cancer-related death worldwide due to poor patient prognosis and clinical outcome. Here, we studied the genetic variations underlying NSCLC pathogenesis based on their association with patient outcome after gemcitabine therapy. Results: Bioinformatics analysis was used to investigate possible effects of POLA2 G583R (POLA2+1747 GG/GA, dbSNP ID: rs487989) in terms of protein function. Using biostatistics, POLA2+1747 GG/GA (rs487989, POLA2 G583R) was identified as strongly associated with mortality rate and survival time among NSCLC patients. It was also shown, via green fluorescent protein (GFP)-tagging and confocal laser scanning microscopy analysis, that POLA2+1747 GG/GA is functionally significant for protein localization. The single nucleotide polymorphism (SNP) causes DNA polymerase alpha subunit B to localize in the cytoplasm instead of the nucleus. This inhibits DNA replication in cancer cells and confers a protective effect in individuals with this SNP. Conclusions: The results suggest that POLA2+1747 GG/GA may be used as a prognostic biomarker of patient outcome in NSCLC pathogenesis. © 2014 Mah et al.; licensee BioMed Central Ltd.

Keywords: Biomarker; Genetic variant; Mortality; Non-small cell lung cancer; Survival; Outcome


11th German Conference on Chemoinformatics (GCC 2015) Abstracts


Journal of Cheminformatics, 2016, vol. 8, Suppl. 1, pp. 1-27

Authors: [Anonymous]

Affiliations: GDCh-CIC Division Associated Board Member Beilstein-Institut zur Förderung der Chemischen Wissenschaften Trakehner Str. 7-9 60487 Frankfurt Germany. ufechner@beilstein-institut.de. Division Medicinal Chemistry Amsterdam Institute for Molecules Medicines and Systems (AIMMS) VU University Amsterdam The Netherlands. Centre for Bioinformatics Uni Hamburg Bundesstr. 43 20146 Hamburg Germany. Sanofi-Aventis Deutschland GmbH 65926 Frankfurt am Main Germany. stefan.guessregen@***. Sanofi-Aventis Deutschland GmbH 65926 Frankfurt am Main Germany. GlaxoSmithKline Stevenage SG1 2NY UK. nicola.j.richmond@***. Discngine Paris 75011 France. Organisch-Chemisches Institut Westfälische Wilhelms-Universität Münster Germany. marwin.segler@wwu.de. Organisch-Chemisches Institut Westfälische Wilhelms-Universität Münster Germany. Bundeskriminalamt Wiesbaden Central Analytics II 65173 Wiesbaden Germany. Department of Chemistry and Biochemistry University of California Santa Barbara CA 93111 USA. shea@chem.ucsb.edu. Department of Chemistry and Biochemistry University of California Santa Barbara CA 93111 USA. Chemistry Department Ludwig-Maximilians-Universität München Butenandtstr. 7 81377 Munich Germany. Inorganic Chemistry and Center for Nanointegration University of Duisburg-Essen Essen Germany. CAM-D Technologies Essen Germany. Institute for Bioinformatics and Chemoinformatics Westphalian University of Applied Sciences Recklinghausen Germany. Institute for Bioinformatics and Chemoinformatics Westphalian University of Applied Sciences Recklinghausen Germany. achim.zielesny@w-hs.de. Leiden University Leiden Netherlands. j.fraaije@chem.leidenuniv.nl. Culgi BV Berlin Germany. Physikalische Chemie III TU Dortmund 44227 Dortmund Germany. stefan.kast@tu-dortmund.de. Centre for Molecular Informatics Department of Chemistry University of Cambridge Lensfield Road Cambridge CB2 1EW United Kingdom. kcb27@cam.ac.uk. Centre for Molecular Informatics Department of Chemistry


Detecting click fraud in online advertising: a data mining approach


The Journal of Machine Learning Research, 2014, vol. 15, no. 1

Authors: Kevin Murphy, Bernhard Schölkopf, Richard Oentaryo, Ee-Peng Lim, Michael Finegold, David Lo, Feida Zhu, Clifton Phua, Eng-Yeow Cheu, Ghim-Eng Yap, Kelvin Sim, Minh Nhut Nguyen, Kasun Perera, Bijay Neupane, Mustafa Faisal, Zeyar Aung, Wei Lee Woon, Wei Chen, Dhaval Patel, Daniel Berrar

Affiliations: Google; Living Analytics Research Centre, Singapore Management University, Singapore; SAS Institute Pte. Ltd., Singapore; Data Analytics Department, Institute for Infocomm Research, Singapore; Masdar Institute of Science and Technology, Abu Dhabi, United Arab Emirates; Institute for Infocomm Research, Singapore; Department of Computer Science and Engineering, Indian Institute of Technology Roorkee, Roorkee, Uttarakhand, India; Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology, Yokohama, Japan

Click fraud, the deliberate clicking on advertisements with no real interest in the product or service offered, is one of the most daunting problems in online advertising. Building an effective fraud detection method is thus pivotal for online advertising businesses. We organized a Fraud Detection in Mobile Advertising (FDMA) 2012 Competition, opening the opportunity for participants to work on real-world fraud data from BuzzCity Pte. Ltd., a global mobile advertising company based in Singapore. In particular, the task is to identify fraudulent publishers who generate illegitimate clicks, and distinguish them from normal publishers. The competition was held from September 1 to September 30, 2012, attracting 127 teams from more than 15 countries. The mobile advertising data are unique and complex, involving heterogeneous information, noisy patterns with missing values, and a highly imbalanced class distribution. The competition results provide a comprehensive study of the usability of data mining-based fraud detection approaches in a practical setting. Our principal findings are that features derived from fine-grained time-series analysis are crucial for accurate fraud detection, and that ensemble methods offer promising solutions to highly imbalanced nonlinear classification tasks with mixed variable types and noisy/missing patterns. The competition data remain available for further studies at http://***/fdma2012/.
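
As a rough illustration of what a feature "derived from fine-grained time-series analysis" might look like, the Python sketch below buckets one publisher's click timestamps into fixed intervals and summarizes the per-bucket counts. The abstract does not specify the competitors' actual features; the function name, the 60-second bucket width, and the chosen statistics are assumptions made here for illustration only.

```python
from collections import Counter
from statistics import mean, pstdev


def click_rate_features(timestamps, bucket_seconds=60):
    """Summarize one publisher's clicks as per-bucket count statistics.

    `timestamps` are click times in seconds (hypothetical input format).
    """
    if not timestamps:
        return {"total": 0, "mean_per_bucket": 0.0,
                "std_per_bucket": 0.0, "max_per_bucket": 0}
    buckets = Counter(int(t // bucket_seconds) for t in timestamps)
    first, last = min(buckets), max(buckets)
    # Include the empty buckets between the first and last click, so that
    # bursty traffic shows up as a high standard deviation.
    per_bucket = [buckets.get(b, 0) for b in range(first, last + 1)]
    return {
        "total": len(timestamps),
        "mean_per_bucket": mean(per_bucket),
        "std_per_bucket": pstdev(per_bucket),
        "max_per_bucket": max(per_bucket),
    }


# Hypothetical publisher: three clicks in the first minute, one in the third.
features = click_rate_features([0, 10, 20, 130])
print(features)
```

Feature vectors of this kind, computed per publisher, could then be fed to any classifier; the abstract reports that ensemble methods worked well on the imbalanced data but does not tie that finding to a particular library or model.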

Keywords: ensemble learning; feature engineering; fraud detection; imbalanced classification

