检索结果-内蒙古大学图书馆

43rd Annual Meeting of the Cognitive science Society: Comparative Cognition: Animal Minds, CogSci 2021

作者： An, Sungeun Rugaber, Spencer Weigel, Emily Goel, Ashok School of Interactive Computing Georgia Institute of Technology AtlantaGA30308 United States School of Computer Science Georgia Institute of Technology AtlantaGA30308 United States School of Biological Sciences Georgia Institute of Technology AtlantaGA30332 United States

Virtual laboratories that enable novice scientists to construct, evaluate and revise models of complex systems heavily involve parameter estimation tasks. We seek to understand novice strategies for parameter estimation in model exploration to design better cognitive supports for them. We conducted a study of 50 college students for a parameter estimation task in exploring an ecological model. We identified three types of behavioral patterns and their underlying cognitive strategies. Specifically, the students used systematic search, problem decomposition and reduction, and global search followed by local search as their cognitive strategies. © Cognitive science Society: Comparative Cognition: Animal Minds, CogSci *** rights reserved.

关键词： Parameter estimation

来源：评论

学校读者我要写书评

暂无评论

DISCRETE REPRESENTATIONS STRENGTHEN VISION TRANSFORMER ROBUSTNESS

arXiv

引用

arXiv 2021年

作者： Mao, Chengzhi Jiang, Lu Dehghani, Mostafa Vondrick, Carl Sukthankar, Rahul Essa, Irfan Google Research Computer Science Columbia University United States School of Interactive Computing Georgia Insitute of Technology United States

Vision Transformer (ViT) is emerging as the state-of-the-art architecture for image recognition. While recent studies suggest that ViTs are more robust than their convolutional counterparts, our experiments find that ViTs trained on ImageNet are overly reliant on local textures and fail to make adequate use of shape information. ViTs thus have difficulties generalizing to out-of-distribution, real-world data. To address this deficiency, we present a simple and effective architecture modification to ViT’s input layer by adding discrete tokens produced by a vector-quantized encoder. Different from the standard continuous pixel tokens, discrete tokens are invariant under small perturbations and contain less information individually, which promote ViTs to learn global information that is invariant. Experimental results demonstrate that adding discrete representation on four architecture variants strengthens ViT robustness by up to 12% across seven ImageNet robustness benchmarks while maintaining the performance on ImageNet. Copyright © 2021, The Authors. All rights reserved.

关键词： Image recognition

来源：评论

学校读者我要写书评

暂无评论

Unanticipated Lessons from Communities: Navigating Society-CenteredResearch in the AI Era

Unanticipated Lessons from Communities: Navigating Society-C...

引用

2025 CHI Conference on Human Factors in computing Systems, CHI EA 2025

作者： Wang, Ding Denton, Remi Sinha, Anoop K. Sheth, Shruti Wilcox, Lauren Mustafa, Maryam Eslami, Motahhare Holstein, Ken Sap, Maarten Smith, Angela D.R. Parker, Andrea G. Kumar, Neha Karusala, Naveena Dillahunt, Tawanna R. Katzman, Jared Lee Google AtlantaGA United States Google New YorkNY United States Google Mountain ViewCA United States School of Interactive Computing Georgia Institute of Technology AtlantaGA United States Lahore University of Management Sciences Lahore Pakistan School of Computer Science Carnegie Mellon University PittsburghPA United States Language Technologies Institute Carnegie Mellon Unviersity PittsburghPA United States School of Information University of Texas at Austin AustinTX United States Georgia Tech AtlantaGA United States School of Information University of Michigan Ann ArborMI United States

ISBN: (纸本)9798400713958

As AI technologies increasingly integrate into daily life, their deployment often overlooks the complexities of the communities they aim to serve. This gap is particularly acute for marginalized communities, where AI can exacerbate inequalities due to techno-solutionism – the tendency to frame technology as a one-size-fits-all solution. This Special Interest Group (SIG) will explore unexpected lessons from community-led AI initiatives, emphasizing strategies for meaningful collaboration, shared ownership, and equitable partnerships. Through discussions and storytelling, the SIG aims to advance community-centered approaches to AI development and foster a robust, sustained network of researchers and practitioners. © 2025 Copyright held by the owner/author(s).

关键词： Community-based Research Participatory Methods Society-Centered AI

来源：评论

学校读者我要写书评

暂无评论

Enhancing Collaborative Inference on Heterogeneous Edge Devices via Adaptive Ensemble Knowledge Distillation

引用

IEEE Journal on Selected Areas in Communications 2025年

作者： Wu, Shangrui Li, Yupeng Wang, Wenhua Guo, Jianxiong Fan, Wentao Liu, Qin Jia, Weijia Yu, Shui Cao, Jiannong Wang, Tian Beijing Normal-Hong Kong Baptist University Guangdong Provincial/Zhuhai Key Laboratory of IRADS Department of Computer Science Zhuhai China Hong Kong Baptist University Hong Kong Hong Kong Hong Kong Baptist University Department of Interactive Media Hong Kong Hong Kong Beijing Normal University Institute of Artificial Intelligence and Future Networks Zhuhai China Xi'an University of Posts and Telecommunications Shaanxi Key Laboratory of Information Communication Network and Security Shaanxi Xi'an China Hunan University College of Computer Science and Electronic Engineering Changsha China University of Technology Sydney School of Computer Science Sydney Australia The Hong Kong Polytechnic University Department of Computing Hong Kong Hong Kong

The integration of edge computing with deep neural networks (DNNs) is crucial for intelligent industrial cyber-physical systems. Typically, deploying DNNs on heterogeneous edge devices relies on methods like model compression and partitioning. However, these approaches often result in homogeneous models across devices. This homogeneity limits the collective capability of edge computing systems, particularly in terms of generalization to diverse data distributions and adaptation to dynamic industrial environments. In this work, we propose to treat each DNN on an edge device as an independent model, aggregating their capabilities via ensemble learning to enhance generalization and dynamic adaptability. To realize this, we introduce the Adaptive Ensemble Knowledge Distillation Framework (AEKDF), combining cloud-based model training with edge computing based collaborative inference. In the cloud, AEKDF develops an enhanced Born Again Network that generates diverse, lightweight models tailored to specific edge devices through knowledge distillation. This process ensures model diversity which is critical to effective ensemble learning. On the edge, AEKDF employs an adaptive ensemble technique that aggregates prediction logits across devices, enabling rapid adaptation to changing environments and maintaining inference efficiency. Our extensive evaluations conducted on a realistic prototype demonstrate the substantial boost in predictive performance achieved by our AEKDF, showcasing a 4% to 10% accuracy improvement on the CIFAR-100 compared to conventional single-model approaches, while maintaining low latency. © 1983-2012 IEEE.

关键词： Deep neural networks

来源：评论

学校读者我要写书评

暂无评论

A Novel U-Net Based CNN Algorithm for High-Accuracy Weather Forecasting

A Novel U-Net Based CNN Algorithm for High-Accuracy Weather ...

引用

International Conference on Computational Intelligence and Communication Networks (CICN)

作者： Thirumurugan Shanmugam Chirag Chandrashekar Arun Kumar Sivaraman Janakiraman Nithiyanantham Priya Ravindran S T Sarvesh Narayanan Department of Computing and Information Sciences University of Technology and Applied Sciences Suhar Oman School of CSE Vellore Institute of Technology Chennai India Digital Engineering Solution Centre Photon Interactive Inc. Ontario Canada Department of ECE K.L.N. College of Engineering Tamilnadu India School of Computer Science Dalhousie University Halifax Canada

ISBN: (数字)9798331505264

ISBN: (纸本)9798331505271

Accurate weather forecasting is essential for sectors like agriculture, aviation, and disaster management. However, deep learning algorithms face challenges in prediction accuracy due to issues like vanishing gradients, overfitting, and high computational demands. This research proposes a novel U-Net based architecture utilizing a Convolutional Neural Network (CNN) bottleneck layer to improve weather forecasting. Key features include a skip-connection mechanism, modified weight update rules, Gaussian-mutation operations, and the Adam optimizer for enhanced feature extraction and faster, more accurate predictions. The model was tested using precipitation data from Doppler Weather Radar (DWR) Chennai Radar and weather parameters from European Centre for Medium-Range Weather Forecasts (ECMWF). A dedicated GeoServer facilitates realtime data processing. Experimental results show the proposed algorithm achieves 97.5% accuracy, outperforming CNN and long short-term memory (LSTM) models by 5.84% and 2.41%, respectively.

关键词： Accuracy Computational modeling Atmospheric modeling Weather forecasting Predictive models Feature extraction Prediction algorithms Convolutional neural networks Long short term memory Meteorology

来源：评论

学校读者我要写书评

暂无评论

LuciEntry: Towards Understanding the Design of Lucid Dream Induction 25

LuciEntry: Towards Understanding the Design of Lucid Dream I...

引用

Proceedings of the 2025 ACM Designing interactive Systems Conference

作者： Po-Yao (Cosmos) Wang Xiao Zoe Fang Gabriel Ducos Nathaniel Yung Xiang Lee Antony Smith Loose Rohit Rajesh Nethmini Botheju Eric Chen Maria F. Montoya Alexandra Kitson Karen Konkoly Rohan Sagi Rakesh Patibanda Nathan W Whitmore Mahdad Jafarzadeh Esfahani Jialin Deng Jiajun Bu Martin Dresler Don Samitha Elvitigala Nathan Arthur Semertzidis Florian ‘Floyd’ Mueller Exertion Games Lab Department of Human Centred Computing Monash University Melbourne VIC Australia Department of Computer Science and Technology Zhejiang University Hangzhou China Exertion Games Lab Department of Human Centred Computing Monash University CLAYTON VIC Australia Exertion Games Lab Department of Human Centred Computing Monash University Clayton Victoria Australia Exertion Games Lab Department of Human Centred Computing Monash University CLAYTON Victoria Australia Exertion Games Lab Department of Human Centred Computing Monash University CLAYTON Australia Exertion Games Lab Department of Human-Centred Computing Monash University Melbourne VIC Australia School of Interactive Arts and Technology Simon Fraser University Surrey British Columbia Canada cognitive neuroscience lab Northwestern University Evanston Illinois USA Media Lab Massachusetts Institute of Technology Cambridge Massachusetts USA a Donders Institute for Brain Cognition and Behaviour Radboudumc Nijmegen Netherlands Exertion Games Lab Department of Human Centred Computing Department of Human-Centred Computing Monash University Melbourne Victoria Australia College of Computer Science Zhejiang University Hangzhou Zhejiang China Donders Institute Nijmegen Netherlands Exertion Games Lab Department of Human Centred Computing Monash University Melbourne Australia Centre for Artificial Intelligence in Mental Health Innovation Institute for Social Neuroscience Melbourne Victoria Australia

来源：评论

学校读者我要写书评

暂无评论

Causal Perception in Question-Answering Systems 21

Causal Perception in Question-Answering Systems

引用

Proceedings of the 2021 CHI Conference on Human Factors in computing Systems

作者： Po-Ming Law Leo Yu-Ho Lo Alex Endert John Stasko Huamin Qu Georgia Institute of Technology United States Department of Computer Science and Engineering The Hong Kong University of Science and Technology China School of Interactive Computing Georgia Institute of Technology United States

ISBN: (纸本)9781450380966

Root cause analysis is a common data analysis task. While question-answering systems enable people to easily articulate a why question (e.g., why students in Massachusetts have high ACT Math scores on average) and obtain an answer, these systems often produce questionable causal claims. To investigate how such claims might mislead users, we conducted two crowdsourced experiments to study the impact of showing different information on user perceptions of a question-answering system. We found that in a system that occasionally provided unreasonable responses, showing a scatterplot increased the plausibility of unreasonable causal claims. Also, simply warning participants that correlation is not causation seemed to lead participants to accept reasonable causal claims more cautiously. We observed a strong tendency among participants to associate correlation with causation. Yet, the warning appeared to reduce the tendency. Grounded in the findings, we propose ways to reduce the illusion of causality when using question-answering systems.

关键词： correlation and causation question answering

来源：评论

学校读者我要写书评

暂无评论

Touchstone benchmark: are we on the right way for evaluating AI algorithms for medical segmentation? 24

Touchstone benchmark: are we on the right way for evaluating...

引用

Proceedings of the 38th International Conference on Neural Information Processing Systems

作者： Pedro R. A. S. Bassi Wenxuan Li Yucheng Tang Fabian Isensee Zifu Wang Jieneng Chen Yu-Cheng Chou Saikat Roy Yannick Kirchhoff Maximilian Rokuss Ziyan Huang Jin Ye Junjun He Tassilo Wald Constantin Ulrich Michael Baumgartner Klaus H. Maier-Hein Paul Jaeger Yiwen Ye Yutong Xie Jianpeng Zhang Ziyang Chen Yong Xia Zhaohu Xing Lei Zhu Yousef Sadegheih Afshin Bozorgpour Pratibha Kumari Reza Azad Dorit Merhof Pengcheng Shi Ting Ma Yuxin Du Fan Bai Tiejun Huang Bo Zhao Haonan Wang Xiaomeng Li Hanxue Gu Haoyu Dong Jichen Yang Maciej A. Mazurowski Saumya Gupta Linshan Wu Jiaxin Zhuang Hao Chen Holger Roth Daguang Xu Matthew B. Blaschko Sergio Decherchi Andrea Cavalli Alan L. Yuille Zongwei Zhou Department of Computer Science Johns Hopkins University and Department of Pharmacy and Biotechnology University of Bologna and Center for Biomolecular Nanotechnologies Istituto Italiano di Tecnologia Department of Computer Science Johns Hopkins University NVIDIA Division of Medical Image Computing German Cancer Research Center (DKFZ) and Helmholtz Imaging German Cancer Research Center (DKFZ) ESAT-PSI KU Leuven Division of Medical Image Computing German Cancer Research Center (DKFZ) and Faculty of Mathematics and Computer Science Heidelberg University Division of Medical Image Computing German Cancer Research Center (DKFZ) and Faculty of Mathematics and Computer Science Heidelberg University and HIDSS4Health - Helmholtz Information and Data Science School for Health Shanghai Jiao Tong University Shanghai Artificial Intelligence Laboratory Division of Medical Image Computing German Cancer Research Center (DKFZ) Division of Medical Image Computing German Cancer Research Center (DKFZ) and Pattern Analysis and Learning Group Department of Radiation Oncology Heidelberg University Hospital Helmholtz Imaging German Cancer Research Center (DKFZ) and Interactive Machine Learning Group (IML) DKFZ School of Computer Science and Engineering Northwestern Polytechnical University Australian Institute for Machine Learning The University of Adelaide College of Computer Science and Technology Zhejiang University Hong Kong University of Science and Technology (Guangzhou) Hong Kong University of Science and Technology (Guangzhou) and Hong Kong University of Science and Technology Faculty of Informatics and Data Science University of Regensburg Faculty of Electrical Engineering and Information Technology RWTH Aachen University Faculty of Informatics and Data Science University of Regensburg and Fraunhofer Institute for Digital Medicine MEVIS Electronic & Information Engineering School Harbin Institute of Technology (Shenzhen) Shanghai Jiao Tong University and Beijing Academy of Artificial Intelligence (BAAI) S

ISBN: (纸本)9798331314385

How can we test AI performance? This question seems trivial, but it isn't. Standard benchmarks often have problems such as in-distribution and small-size test sets, oversimplified metrics, unfair comparisons, and short-term outcome pressure. As a consequence, good performance on standard benchmarks does not guarantee success in real-world scenarios. To address these problems, we present Touchstone, a large-scale collaborative segmentation benchmark of 9 types of abdominal organs. This benchmark is based on 5,195 training CT scans from 76 hospitals around the world and 5,903 testing CT scans from 11 additional hospitals. This diverse test set enhances the statistical significance of benchmark results and rigorously evaluates AI algorithms across out-of-distribution scenarios. We invited 14 inventors of 19 AI algorithms to train their algorithms, while our team, as a third party, independently evaluated these algorithms. In addition, we also evaluated pre-existing AI frameworks—which, differing from algorithms, are more flexible and can support different algorithms—including MONAI from NVIDIA, nnU-Net from DKFZ, and numerous other open-source frameworks. We are committed to expanding this benchmark to encourage more innovation of AI algorithms for the medical domain.

关键词：

来源：评论

学校读者我要写书评

暂无评论

BISECT: Learning to split and rephrase sentences with bitexts

arXiv

引用

arXiv 2021年

作者： Kim, Joongwon Maddela, Mounica Kriz, Reno Xu, Wei Callison-Burch, Chris Department of Computer and Information Science University of Pennsylvania School of Interactive Computing Georgia Institute of Technology Human Language Technology Center of Excellence Johns Hopkins University

An important task in NLP applications such as sentence simplification is the ability to take a long, complex sentence and split it into shorter sentences, rephrasing as necessary. We introduce a novel dataset and a new model for this 'split and rephrase' task. Our BISECT training data consists of 1 million long English sentences paired with shorter, meaning-equivalent English sentences. We obtain these by extracting 1-2 sentence alignments in bilingual parallel corpora and then using machine translation to convert both sides of the corpus into the same language. BISECT contains higher quality training examples than previous Split and Rephrase corpora, with sentence splits that require more significant modifications. We categorize examples in our corpus, and use these categories in a novel model that allows us to target specific regions of the input sentence to be split and edited. Moreover, we show that models trained on BISECT can perform a wider variety of split operations and improve upon previous state-of-the-art approaches in automatic and human evaluations. © 2021, CC BY.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Enhancing predictive imaging biomarker discovery through treatment effect analysis

arXiv

引用

arXiv 2024年

作者： Xiao, Shuhan Klein, Lukas Petersen, Jens Vollmuth, Philipp Jaeger, Paul F. Maier-Hein, Klaus H. Heidelberg Division of Medical Image Computing Germany Faculty of Mathematics and Computer Science Heidelberg University Germany DKFZ Heidelberg Interactive Machine Learning Group Germany Institute for Machine Learning ETH Zürich Switzerland DKFZ Heidelberg Helmholtz Imaging Germany Clinic for Neuroradiology University Hospital Bonn Germany Medical Faculty Bonn University of Bonn Germany Pattern Analysis and Learning Group Department of Radiation Oncology Heidelberg University Hospital Germany

Identifying predictive covariates, which forecast individual treatment effectiveness, is crucial for decision-making across different disciplines such as personalized medicine. These covariates, referred to as biomarkers, are extracted from pre-treatment data, often within randomized controlled trials, and should be distinguished from prognostic biomarkers, which are independent of treatment assignment. Our study focuses on discovering predictive imaging biomarkers, specific image features, by leveraging pretreatment images to uncover new causal relationships. Unlike labor-intensive approaches relying on handcrafted features prone to bias, we present a novel task of directly learning predictive features from images. We propose an evaluation protocol to assess a model’s ability to identify predictive imaging biomarkers and differentiate them from purely prognostic ones by employing statistical testing and a comprehensive analysis of image feature attribution. We explore the suitability of deep learning models originally developed for estimating the conditional average treatment effect (CATE) for this task, which have been assessed primarily for their precision of CATE estimation while overlooking the evaluation of imaging biomarker discovery. Our proof-of-concept analysis demonstrates the feasibility and potential of our approach in discovering and validating predictive imaging biomarkers from synthetic outcomes and real-world image datasets. Our code is available at https://***/MIC-DKFZ/predictive_ image_biomarker_analysis. © 2024, CC BY.

关键词： Decision making

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：