检索结果-内蒙古大学图书馆

A Deep Learning-Based Approach to Detect Correct Suryanamaskara Pose

SN computer science 2022年第5期3卷 337页

作者： Bhaumik, Ujjayanta Singh, Koushlendra Kumar Akbari, Akbar Sheikh Bajpai, Manish Kumar University College London United Kingdom Machine Vision and Intelligence Lab Department of Computer Science and Engineering National Institute of Technology Jamshedpur India School of Computing Creative Technologies and Engineering at Leeds Beckett University Leeds United Kingdom IIITDM Jabalpur India

We present a technique to analyse Suryanamaskar poses using keypoint estimation and statistical analysis. The proposed approach uses a trained model based on COCO keypoint detection dataset and uses it to determine keypoints in yoga poses. Our work uses the keypoint detection to suggest a self yoga correction system. A novel dataset, Surya-yoga, containing 10000 Suryanamaskara poses has been generated and made publicly available. The model presented in this paper performed better on the COCO dataset and combined COCO and Surya-yoga dataset when tested using part affinity fields. The work also presents an analytical method of distinguishing different Suryanamaskar poses alongside deep learning methods. © 2022, The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd.

关键词： Keypoint detection Pose detection Suryanamaskar Yoga

来源：评论

学校读者我要写书评

暂无评论

vision Transformer Based Automated Model for Enhancing Lung Cancer Classification

Vision Transformer Based Automated Model for Enhancing Lung ...

引用

IEEE International Workshop on Imaging Systems and Techniques (IST)

作者： Akbar Sheikh Akbari Arvind Kumar B Ramachandra Reddy Koushlendra Kumar Singh Masahiro Takei Leeds Beckett University UK Machine Vision and Intelligence Lab National Institute of Technology Jamshedpur Jharkhand India Computer Science and Engineering National Institute of Technology Jamshedpur Jharkhand India Department of Mechanical Engineering Chiba University Chiba Japan

ISBN: (数字)9798350378214

ISBN: (纸本)9798350378221

Lung cancer is one of the leading causes of cancer related mortality. The early detection and classification of the cancers tissues will reduce the mortalities rate. The present research focus on the development of automated classification model for lung and colon cancers tissues based on the histopathology images. The present work encompasses a vision transformer (ViT) based model to enhance diagnostic accuracy of lung cancers tissues. The proposed model utilizes the self-attention mechanism of ViT to focus on essential features present in histopathologicals images. The proposed model has been validated using two different dataset namely LC25000 & IQ-OTH/NCCD with 25000 & 1096 images respectively. The performance of proposed model is compared with traditional convolutional neural network (CNN) model and it has been observed the based model outforms better in terms of accuracy which - 98.80% & 99.09% respectively for datasets.

关键词： computer vision Analytical models Accuracy Lungs Lung cancer Imaging Transformers Convolutional neural networks Reliability Testing

来源：评论

学校读者我要写书评

暂无评论

Automatic Polyp Segmentation with Multiple Kernel Dilated Convolution Network

arXiv

引用

arXiv 2022年

作者： Tomar, Nikhil Kumar Srivastava, Abhishek Bagci, Ulas Jha, Debesh School of Informatics and Computer Science Indira Gandhi National Open University India Computer Vision and Pattern Recognition Unit Indian Statistical Institute India Machine and Hybrid Intelligence Lab Department of Radiology Feinberg School of Medicine Northwestern University United States

The detection and removal of precancerous polyps through colonoscopy is the primary technique for the prevention of colorectal cancer worldwide. However, the miss rate of colorectal polyp varies significantly among the endoscopists. It is well known that a computer-aided diagnosis (CAD) system can assist endoscopists in detecting colon polyps and minimize the variation among endoscopists. In this study, we introduce a novel deep learning architecture, named MKDCNet, for automatic polyp segmentation robust to significant changes in polyp data distribution. MKDCNet is simply an encoder-decoder neural network that uses the pre-trained ResNet50 as the encoder and novel multiple kernel dilated convolution (MKDC) block that expands the field of view to learn more robust and heterogeneous representation. Extensive experiments on four publicly available polyp datasets and cell nuclei dataset show that the proposed MKDCNet outperforms the state-of-the-art methods when trained and tested on the same dataset as well when tested on unseen polyp datasets from different distributions. With rich results, we demonstrated the robustness of the proposed architecture. From an efficiency perspective, our algorithm can process at (≈ 45) frames per second on RTX 3090 GPU. MKDCNet can be a strong benchmark for building real-time systems for clinical colonoscopies. The code of the proposed MKDCNet is available at https://***/nikhilroxtomar/MKDCNet. © 2022, CC BY.

关键词： Convolution

来源：评论

学校读者我要写书评

暂无评论

Inconsistency Distillation For Consistency:Enhancing Multi-View Clustering via Mutual Contrastive Teacher-Student Leaning

Inconsistency Distillation For Consistency:Enhancing Multi-V...

引用

IEEE International Conference on Data Mining (ICDM)

作者： Dunqiang Liu Shu-Juan Peng Xin Liu Lei Zhu Zhen Cui Taihao Li Dept. of Comput. Sci. & Fujian Key Lab. of Big Data Intelligence and Security Huaqiao University Xiamen China Zhejiang Lab Hangzhou China Xiamen Key Lab. of Computer Vision and Pattern Recognition Huaqiao University Xiamen China Key Lab. of Computer Vision and Machine Learning (Huaqiao University) Fujian Province University Xiamen China School of Information Sci. and Eng. Shandong Normal University Jinan China School of Computer Sci. and Eng. Nanjing University of Science and Technology Nanjing China

Multi-view clustering has attracted more attention recently since many real-world data are comprised of different representations or views. Recent multi-view clustering works mainly exploit the instance consistency to obtain the shared representations across different views, and apply a single-view clustering method to perform data partitions. However, these existing methods often ignore the inconsistency of instance associations within the views, which may enlarge the intra-class diversity among the views and therefore degrade the clustering performance. To address this issue, this paper proposes an efficient mutual contrastive teacher-student leaning (MC-TSL) model to enhance the multi-view clustering, which is the first attempt to study the inconsistency distillation for consistency learning. First, the proposed MC-TSL approach exploits a view-specific encoder with two heads, an instance encoding head and a semantic distillation head, respectively, for capturing the consistent and discriminative feature representations. To be specific, the former head exploits a cross-view contrastive learning method to obtain a redundancy-free consistent representation at the instance level, while the latter head designs a mutual teacher-student learning module to capture the intra-view information at semantic level. By training these two heads in an end-to-end manner, the discriminative multi-view embeddings are efficiently obtained and refined by minimizing the weighted sum of the reconstruction loss, contrastive loss and contrast distillation loss. Extensive experiments verify the superiorities of the proposed MC-TSL framework and show its competitive clustering performances.

关键词： Training Learning systems Clustering methods Semantics Encoding Data mining

来源：评论

学校读者我要写书评

暂无评论

NTIRE 2020 Challenge on Video Quality Mapping: Methods and Results

NTIRE 2020 Challenge on Video Quality Mapping: Methods and R...

引用

IEEE computer Society Conference on computer vision and Pattern Recognition Workshops (CVPRW)

作者： Dario Fuoli Zhiwu Huang Martin Danelljan Radu Timofte Hua Wang Longcun Jin Dewei Su Jing Liu Jaehoon Lee Michal Kudelski Lukasz Bala Dmitry Hrybov Marcin Mozejko Muchen Li Siyao Li Bo Pang Cewu Lu Chao Li Dongliang He Fu Li Shilei Wen Computer Vision Lab ETH Zurich Switzerland School of Software Engineering South China University of Technology Guangdong China Multimedia and Computer Vision Lab East China Normal University Shanghai China Department of Electronics and Computer Engineering Hanyang University Seoul Korea TCL Research Europe Warsaw Poland Machine Vision and Intelligence Group Department of Computer Science Shanghai Jiao Tong University Shanghai China Department of Computer Vision Technology (VIS) Baidu Inc. Beijing China

ISBN: (数字)9781728193601

ISBN: (纸本)9781728193618

This paper reviews the NTIRE 2020 challenge on video quality mapping (VQM), which addresses the issues of quality mapping from source video domain to target video domain. The challenge includes both a supervised track (track 1) and a weakly-supervised track (track 2) for two benchmark datasets. In particular, track 1 offers a new Internet video benchmark, requiring algorithms to learn the map from more compressed videos to less compressed videos in a supervised training manner. In track 2, algorithms are required to learn the quality mapping from one device to another when their quality varies substantially and weakly- aligned video pairs are available. For track 1, in total 7 teams competed in the final test phase, demonstrating novel and effective solutions to the problem. For track 2, some existing methods are evaluated, showing promising solutions to the weakly-supervised video quality mapping problem.

关键词： Video recording Quality assessment Generative adversarial networks Target tracking Training Cameras Image coding

来源：评论

学校读者我要写书评

暂无评论

SoccerNet 2023 Challenges Results

arXiv

引用

arXiv 2023年

作者： Cioppa, Anthony Giancola, Silvio Somers, Vladimir Magera, Floriane Zhou, Xin Mkhallati, Hassan Deliège, Adrien Held, Jan Hinojosa, Carlos Mansourian, Amir M. Miralles, Pierre Barnich, Olivier De Vleeschouwer, Christophe Alahi, Alexandre Ghanem, Bernard Van Droogenbroeck, Marc Kamal, Abdullah Maglo, Adrien Clapés, Albert Abdelaziz, Amr Xarles, Artur Orcesi, Astrid Scott, Atom Liu, Bin Lim, Byoungkwon Chen, Chen Deuser, Fabian Yan, Feng Yu, Fufu Shitrit, Gal Wang, Guanshuo Choi, Gyusik Kim, Hankyul Guo, Hao Fahrudin, Hasby Koguchi, Hidenari Ardö, Håkan Salah, Ibrahim Yerushalmy, Ido Muhammad, Iftikar Uchida, Ikuma Be'ery, Ishay Rabarisoa, Jaonary Lee, Jeongae Fu, Jiajun Yin, Jianqin Xu, Jinghang Nang, Jongho Denize, Julien Li, Junjie Zhang, Junpei Kim, Juntae Synowiec, Kamil Kobayashi, Kenji Zhang, Kexin Habel, Konrad Nakajima, Kota Jiao, Licheng Ma, Lin Wang, Lizhi Wang, Luping Li, Menglong Zhou, Mengying Nasr, Mohamed Abdelwahed, Mohamed Liashuha, Mykola Falaleev, Nikolay Oswald, Norbert Jia, Qiong Pham, Quoc-Cuong Song, Ran Hérault, Romain Peng, Rui Chen, Ruilong Liu, Ruixuan Baikulov, Ruslan Fukushima, Ryuto Escalera, Sergio Lee, Seungcheon Chen, Shimin Ding, Shouhong Someya, Taiga Moeslund, Thomas B. Li, Tianjiao Shen, Wei Zhang, Wei Li, Wei Dai, Wei Luo, Weixin Zhao, Wending Zhang, Wenjie Yang, Xinquan Ma, Yanbiao Joo, Yeeun Zeng, Yingsen Gan, Yiyang Zhu, Yongqiang Zhong, Yujie Ruan, Zheng Li, Zhiheng Huang, Zhijian Meng, Ziyu Belgium Saudi Arabia Sportradar Norway UCLouvain Belgium EPFL Switzerland EVS Broadcast Equipment Belgium Baidu Research United States Belgium Sharif University of Technology Iran Footovision France Zewail City of Science Technology and Innovation Egypt Université Paris-Saclay CEA France Universitat de Barcelona Spain Computer Vision Center Spain Nagoya University Japan Research Center for Applied Mathematics and Machine Intelligence Zhejiang Lab China AIBrain United States OPPO Research Institute China Germany Meituan China Tencent Youtu Lab China Amazon Prime Video Sport United States Sogang University Korea Republic of The University of Tokyo Japan Spiideo Sweden University of Tsukuba Japan School of Artificial Intelligence Beijing University of Posts and Telecommunications China Normandie Univ INSA Rouen LITIS France Shanghai Jiao Tong University China Key Laboratory of Intelligent Perception and Image Understanding The Ministry of Education Xidian University China NASK - National Research Institute Poland Robo Space China Tongji University China Sportlight Technology United Kingdom School of Control Science and Engineering Shandong University China lRomul Russia Aalborg University Denmark Turing AI Cultures GmbH Germany Information Systems Technology and Design Singapore University of Technology and Design Singapore Sun Yat-sen University China

The SoccerNet 2023 challenges were the third annual video understanding challenges organized by the SoccerNet team. For this third edition, the challenges were composed of seven vision-based tasks split into three main themes. The first theme, broadcast video understanding, is composed of three high-level tasks related to describing events occurring in the video broadcasts: (1) action spotting, focusing on retrieving all timestamps related to global actions in soccer, (2) ball action spotting, focusing on retrieving all timestamps related to the soccer ball change of state, and (3) dense video captioning, focusing on describing the broadcast with natural language and anchored timestamps. The second theme, field understanding, relates to the single task of (4) camera calibration, focusing on retrieving the intrinsic and extrinsic camera parameters from images. The third and last theme, player understanding, is composed of three low-level tasks related to extracting information about the players: (5) re-identification, focusing on retrieving the same players across multiple views, (6) multiple object tracking, focusing on tracking players and the ball through unedited video streams, and (7) jersey number recognition, focusing on recognizing the jersey number of players from tracklets. Compared to the previous editions of the SoccerNet challenges, tasks (2-3-7) are novel, including new annotations and data, task (4) was enhanced with more data and annotations, and task (6) now focuses on end-to-end approaches. More information on the tasks, challenges, and leader-boards are available on https://***. Baselines and development kits can be found on https://***/SoccerNet. © 2023, CC BY.

关键词： computer vision

来源：评论

学校读者我要写书评

暂无评论

Lessons Learned from Assessing Trustworthy AI in Practice

引用

Digital Society 2023年第3期2卷 1-25页

作者： Vetter, Dennis Amann, Julia Bruneault, Frédérick Coffee, Megan Düdder, Boris Gallucci, Alessio Gilbert, Thomas Krendl Hagendorff, Thilo van Halem, Irmhild Hickman, Eleanore Hildt, Elisabeth Holm, Sune Kararigas, Georgios Kringen, Pedro Madai, Vince I. Wiinblad Mathez, Emilie Tithi, Jesmin Jahan Westerlund, Magnus Wurth, Renee Zicari, Roberto V. Computational Vision and Artificial Intelligence Lab Goethe University Frankfurt Frankfurt Am Main Germany Z-Inspection® Initiative Venice Italy Health Ethics and Policy Lab ETH Zurich Zurich Switzerland Strategy and Innovation Careum Foundation Zurich Switzerland Philosophie Departement Collège André-Laurendeau Montréal Canada École Des Médias Université du Québec À Montréal Montréal Canada Department of Medicine Division of Infectious Diseases and Immunology New York University Grossman School of Medicine New York City USA Department of Computer Science University of Copenhagen Copenhagen Denmark Digital Life Initiative Cornell Tech New York City USA Cluster of Excellence “Machine Learning: New Perspectives for Science” University of Tuebingen Tuebingen Germany School of Law University of Bristol Bristol UK Center for the Study of Ethics in the Professions Illinois Institute of Technology Chicago USA Department of Business Management and Analytics Arcada University of Applied Sciences Helsinki Finland Department of Food & Resource Economics University of Copenhagen Copenhagen Denmark Department of Physiology Faculty of Medicine University of Iceland Reykjavik Iceland QUEST Centre for Responsible Research Berlin Institute of Health Charité Universitätsmedizin Berlin Berlin Germany Faculty of Computing Engineering and the Built Environment School of Computing and Digital Technology Birmingham City University Birmingham UK Parallel Computing Labs Intel Santa Clara USA School of Economics Innovation and Technology Kristiania University College Oslo Norway Data Science Graduate School Seoul National University Seoul South Korea

Building artificial intelligence (AI) systems that adhere to ethical standards is a complex problem. Even though a multitude of guidelines for the design and development of such trustworthy AI systems exist, these guidelines focus on high-level and abstract requirements for AI systems, and it is often very difficult to assess if a specific system fulfills these requirements. The Z-Inspection® process provides a holistic and dynamic framework to evaluate the trustworthiness of specific AI systems at different stages of the AI lifecycle, including intended use, design, and development. It focuses, in particular, on the discussion and identification of ethical issues and tensions through the analysis of socio-technical scenarios and a requirement-based framework for ethical and trustworthy AI. This article is a methodological reflection on the Z-Inspection® process. We illustrate how high-level guidelines for ethical and trustworthy AI can be applied in practice and provide insights for both AI researchers and AI practitioners. We share the lessons learned from conducting a series of independent assessments to evaluate the trustworthiness of real-world AI systems, as well as key recommendations and practical suggestions on how to ensure a rigorous trustworthiness assessment throughout the lifecycle of an AI system. The results presented in this article are based on our assessments of AI systems in the healthcare sector and environmental monitoring, where we used the framework for trustworthy AI proposed in the Ethics Guidelines for Trustworthy AI by the European Commission’s High-Level Expert Group on AI. However, the assessment process and the lessons learned can be adapted to other domains and include additional frameworks.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Why is the Winner the Best?

Why is the Winner the Best?

引用

Conference on computer vision and Pattern Recognition (CVPR)

作者： M. Eisenmann A. Reinke V. Weru M. D. Tizabi F. Isensee T. J. Adler S. Ali V. Andrearczyk M. Aubreville U. Baid S. Bakas N. Balu S. Bano J. Bernal S. Bodenstedt A. Casella V. Cheplygina M. Daum M. De Bruijne A. Depeursinge R. Dorent J. Egger D. G. Ellis S. Engelhardt M. Ganz N. Ghatwary G. Girard P. Godau A. Gupta L. Hansen K. Harada M. Heinrich N. Heller A. Hering A. Huaulmé P. Jannin A. E. Kavur O. Kodym M. Kozubek J. Li H. Li J. Ma C. Martín-Isla B. Menze A. Noble V. Oreiller N. Padoy S. Pati K. Payette T. Rädsch J. Rafael-Patiño V. Singh Bawa S. Speidel C. H. Sudre K. Van Wijnen M. Wagner D. Wei A. Yamlahi M. H. Yap C. Yuan M. Zenk A. Zia D. Zimmerer D. Aydogan B. Bhattarai L. Bloch R. Brüngel J. Cho C. Choi Q. Dou I. Ezhov C. M. Friedrich C. Fuller R. R. Gaire A. Galdran Á. García Faura M. Grammatikopoulou S. Hong M. Jahanifar I. Jang A. Kadkhodamohammadi I. Kang F. Kofler S. Kondo H. Kuijf M. Li M. Luu T. Martinčič P. Morais M. A. Naser B. Oliveira D. Owen S. Pang J. Park S. Park S. Płotka E. Puybareau N. Rajpoot K. Ryu N. Saeed A. Shephard P. Shi D. Štepec R. Subedi G. Tochon H. R. Torres H. Urien J. L. Vilaça K. A. Wahid H. Wang J. Wang L. Wang X. Wang B. Wiestler M. Wodzinski F. Xia J. Xie Z. Xiong S. Yang Y. Yang Z. Zhao K. Maier-Hein P. F. Jäger A. Kopp-Schneider L. Maier-Hein Division of Intelligent Medical Systems German Cancer Research Center (DKFZ) Heidelberg Germany Helmholtz Imaging German Cancer Research Center (DKFZ) Heidelberg Germany Faculty of Mathematics and Computer Science Heidelberg University Heidelberg Germany Division of Biostatistics German Cancer Research Center (DKFZ) Heidelberg Germany Division of Medical Image Computing German Cancer Research Center (DKFZ) Heidelberg Germany Faculty of Engineering and Physical Sciences School of Computing University of Leeds Leeds UK Institute of Informatics School of Management HES-SO Valais-Wallis University of Applied Sciences and Arts Western Switzerland Sierre Switzerland Department of Nuclear Medicine and Molecular Imaging Lausanne University Hospital Lausanne Switzerland Technische Hochschule Ingolstadt Ingolstadt Germany Center for Artificial Intelligence and Data Science for Integrated Diagnostics (AI2D) and Center for Biomedical Image Computing and Analytics (CBICA) University of Pennsylvania Philadelphia PA USA Department of Pathology and Laboratory Medicine Perelman School of Medicine University of Pennsylvania Philadelphia PA USA Department of Radiology Perelman School of Medicine University of Pennsylvania Philadelphia PA USA Department of Radiology University of Washington Seattle WA USA Department of Computer Science Wellcome/EPSRC Centre for Interventional and Surgical Sciences (WEISS) University College London London UK Universitat Autònoma de Barcelona & Computer Vision Center Barcelona Spain Division of Translational Surgical Oncology National Center for Tumor Diseases (NCT/UCC) Dresden Dresden Germany Department of Advanced Robotics Istituto Italiano di Tecnologia Italy Department of Electronics Information and Bioengineering Politecnico di Milano Milan Italy IT University of Copenhagen Copenhagen Denmark Department of General Visceral and Transplantation Surgery Heidelberg University Hospital Heidelberg Germany Department of Radiology and Nuc

International benchmarking competitions have become fundamental for the comparative performance assessment of image analysis methods. However, little attention has been given to investigating what can be learnt from these competitions. Do they really generate scientific progress? What are common and successful participation strategies? What makes a solution superior to a competing method? To address this gap in the literature, we performed a multicenter study with all 80 competitions that were conducted in the scope of IEEE ISBI 2021 and MICCAI 2021. Statistical analyses performed based on comprehensive descriptions of the submitted algorithms linked to their rank as well as the underlying participation strategies revealed common characteristics of winning solutions. These typically include the use of multi-task learning (63%) and/or multi-stage pipelines (61%), and a focus on augmentation (100%), image preprocessing (97%), data curation (79%), and post-processing (66%). The “typical” lead of a winning team is a computer scientist with a doctoral degree, five years of experience in biomedical image analysis, and four years of experience in deep learning. Two core general development strategies stood out for highly-ranked teams: the reflection of the metrics in the method design and the focus on analyzing and handling failure cases. According to the organizers, 43% of the winning algorithms exceeded the state of the art but only 11% completely solved the respective domain problem. The insights of our study could help researchers (1) improve algorithm development strategies when approaching new problems, and (2) focus on open research questions revealed by this work.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Deep Multi-Model Fusion for Single-Image Dehazing

Deep Multi-Model Fusion for Single-Image Dehazing

引用

International Conference on computer vision (ICCV)

作者： Zijun Deng Lei Zhu Xiaowei Hu Chi-Wing Fu Xuemiao Xu Qing Zhang Jing Qin Pheng-Ann Heng South China University of Technology Guangdong Provincial Key Laboratory of Computer Vision and Virtual Reality Technology Shenzhen Institutes of Advanced Technology CAS The Chinese University of Hong Kong State Key Laboratory of Subtropical Building Science Guangdong Provincial Key Lab of Computational Intelligence and Cyberspace Information Sun Yat-sen University The Hong Kong Polytechnic University CAS Key Laboratory of Human-Machine Intelligence-Synergy Systems Shenzhen Institutes of Advanced Technology CAS

ISBN: (数字)9781728148038

ISBN: (纸本)9781728148045

This paper presents a deep multi-model fusion network to attentively integrate multiple models to separate layers and boost the performance in single-image dehazing. To do so, we first formulate the attentional feature integration module to maximize the integration of the convolutional neural network (CNN) features at different CNN layers and generate the attentional multi-level integrated features (AMLIF). Then, from the AMLIF, we further predict a haze-free result for an atmospheric scattering model, as well as for four haze-layer separation models, and then fuse the results together to produce the final haze-free image. To evaluate the effectiveness of our method, we compare our network with several state-of-the-art methods on two widely-used dehazing benchmark datasets, as well as on two sets of real-world hazy images. Experimental results demonstrate clear quantitative and qualitative improvements of our method over the state-of-the-arts.

关键词： Atmospheric modeling Predictive models Scattering Computational modeling Neural networks computer vision Solid modeling

来源：评论

学校读者我要写书评

暂无评论

Biomedical image analysis competitions: The state of current participation practice

arXiv

引用

arXiv 2022年

作者： Eisenmann, Matthias Reinke, Annika Weru, Vivienn Tizabi, Minu Dietlinde Isensee, Fabian Adler, Tim J. Godau, Patrick Cheplygina, Veronika Kozubek, Michal Maier-Hein, Klaus Jäger, Paul F. Kopp-Schneider, Annette Maier-Hein, Lena Ali, Sharib Gupta, Anubha Kybic, Jan Noble, Alison de Solórzano, Carlos Ortiz Pachade, Samiksha Petitjean, Caroline Sage, Daniel Wei, Donglai Wilden, Elizabeth Alapatt, Deepak Andrearczyk, Vincent Baid, Ujjwal Bakas, Spyridon Balu, Niranjan Bano, Sophia Bawa, Vivek Singh Bernal, Jorge Bodenstedt, Sebastian Casella, Alessandro Choi, Jinwook Commowick, Olivier Daum, Marie Depeursinge, Adrien Dorent, Reuben Egger, Jan Eichhorn, Hannah Engelhardt, Sandy Ganz, Melanie Girard, Gabriel Hansen, Lasse Heinrich, Mattias Heller, Nicholas Hering, Alessa Huaulmé, Arnaud Kim, Hyunjeong Li, Hongwei Bran Landman, Bennett Li, Jianning Ma, Jun Martel, Anne Martín-Isla, Carlos Menze, Bjoern Nwoye, Chinedu Innocent Oreiller, Valentin Padoy, Nicolas Pati, Sarthak Payette, Kelly Sudre, Carole van Wijnen, Kimberlin Vardazaryan, Armine Vercauteren, Tom Wagner, Martin Wang, Chuanbo Yap, Moi Hoon Yu, Zeyun Yuan, Chun Zenk, Maximilian Zia, Aneeq Zimmerer, David Bao, Rina Choi, Chanyeol Cohen, Andrew Dzyubachyk, Oleh Galdran, Adrian Gan, Tianyuan Guo, Tianqi Gupta, Pradyumna Haithami, Mahmood Ho, Edward Jang, Ikbeom Li, Zhili Luo, Zhengbo Lux, Filip Makrogiannis, Sokratis Müller, Dominik Oh, Young-Tack Pang, Subeen Pape, Constantin Polat, Gorkem Reed, Charlotte Rosalie Ryu, Kanghyun Scherr, Tim Thambawita, Vajira Wang, Haoyu Wang, Xinliang Xu, Kele Yeh, Hung Yeo, Doyeob Yuan, Yixuan Zeng, Yan Zhao, Xin Abbing, Julian Adam, Jannes Adluru, Nagesh Agethen, Niklas Ahmed, Salman Al Khalil, Yasmina Alenyà, Mireia Alhoniemi, Esa An, Chengyang Arega, Tewodros Weldebirhan Avisdris, Netanell Aydogan, Dogu Baran Bai, Yingbin Calisto, Maria Baldeon Basaran, Berke Doga Beetz, Marcel Bian, Hao Blansit, Kevin Bloch, Louise Bohnsack, Robert Bosticardo, Sara Breen, Jack Brudfors, Mikael Brüngel, Raphael Cabezas, Mariano Cacciola, Alb Heidelberg Division of Intelligent Medical Systems Germany Heidelberg HI Helmholtz Imaging Germany Faculty of Mathematics and Computer Science Heidelberg University Heidelberg Germany Heidelberg Division of Biostatistics Germany Heidelberg Division of Medical Image Computing Germany Heidelberg HI Applied Vision Lab Germany IT University of Copenhagen Copenhagen Denmark Centre for Biomedical Image Analysis Masaryk University Brno Czech Republic Heidelberg Interactive Machine Learning Group Germany Faculty of Mathematics and Computer Science and Medical Faculty Heidelberg University Heidelberg Germany NCT Heidelberg DKFZ University Hospital Heidelberg Germany School of Computing University of Leeds Leeds United Kingdom SBILab Department of ECE IIIT-Delhi India Faculty of Electrical Engineering Czech Technical University Prague Czech Republic Institute of Biomedical Engineering University of Oxford United Kingdom Center for Applied Medical Research Pamplona Spain Shri Guru Gobind Singhji Institute of Engineering and Technology Maharashtra Nanded India Université de Rouen Normandie France Lausanne Switzerland School of Engineering and Applied Science Harvard University United States ICube University of Strasbourg CNRS France Institute of Informatics School of Management HES-SO Valais-Wallis University of Applied Sciences and Arts Western Switzerland Techno-Pôle 3 Sierre3960 Switzerland Department of Nuclear Medicine and Molecular Imaging Lausanne University Hospital Rue du Bugnon 46 LausanneCH-1011 Switzerland University of Pennsylvania PhiladelphiaPA United States Department of Radiology University of Washington United States Wellcome EPSRC Centre for Interventional and Surgical Sciences University College London London United Kingdom Visual Artificial Intelligence Lab Oxford Brookes University Oxford United Kingdom Universitat Autònoma de Barcelona & Computer Vision Center Spain Dresden Fetscherstraße 74 PF 64 Dresden01307 Germany

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps. © 2022, CC BY.

关键词： Benchmarking

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：