检索结果-内蒙古大学图书馆

7th International Conference on Medical Imaging with Deep Learning, MIDL 2024

作者： Föllmer, Bernhard Schulze, Kenrick Wald, Christian Stober, Sebastian Samek, Wojciech Dewey, Marc Department of Radiology Charité-Universitätsmedizin Berlin Freie Universität Berlin Humboldt-Universität zu Berlin Berlin10117 Germany Institute of Mathematics Technical University of Berlin Berlin Germany Artificial Intelligence Lab. Otto-von-Guericke-Universität Magdeburg Germany Department of Artificial Intelligence Fraunhofer Heinrich Hertz Institute Berlin Germany BIFOLD – Berlin Institute for the Foundations of Learning and Data Berlin Germany Department of Electrical Engineering and Computer Science Technical University of Berlin Berlin Germany partner site Berlin Berlin Germany

Annotating medical images for segmentation tasks is a time-consuming process that requires expert knowledge. Active learning can reduce this annotation cost and achieve optimal model performance by selecting only the most informative samples for annotation. However, the effectiveness of active learning sample selection strategies depends on the model architecture and training procedure used. The nnUNet has achieved impressive results in various automated medical image segmentation tasks due to its self-configuring pipeline for automated model design and training. This raises the question of whether the nnUNet is applicable in an active learning setting to avoid cumbersome manual configuration of the training process and improve accessibility for non-experts in deep learning-based segmentation. This paper compares various sample selection strategies in an active learning setting in which the self-configuring nnUNet is used as the segmentation model. Additionally, we propose a new sample selection strategy for UNet-like architectures: USIM - Uncertainty-Aware Submodular Mutual Information Measure. The method combines uncertainty and submodular mutual information to select batches of uncertain, diverse, and representative samples. We evaluate the performance gain and labeling costs on three medical image segmentation tasks with different segmentation challenges. Our findings demonstrate that utilizing nnUNet as the segmentation model in an active learning setting is feasible, and most sampling strategies outperform random sampling. Furthermore, we demonstrate that our proposed method yields a significant improvement compared to existing baseline methods. © 2024 CC-BY 4.0, B. Föllmer, K. Schulze, C. Wald, S. Stober, W. Samek & M. Dewey.

关键词： Active learning

来源：评论

学校读者我要写书评

暂无评论

Highsimb: A Concrete Blockchain High Simulation with Contract Vulnerability Detection for Ethereum and Hyperledger Fabric 4th

Highsimb: A Concrete Blockchain High Simulation with Contra...

引用

4th International Conference on Machine Learning for Cyber Security, ML4CS 2022

作者： Huang, Pengfei Jie, Wanqing Voundi Koe, Arthur Sandor Hou, Ruitao Yan, Hongyang Nouioua, Mourad Thien, Phan Duc Mbous Ikong, Jacques Lancine, Camara Institute of Artificial Intelligence and Blockchain Guangzhou University Guangzhou510006 China Pazhou Lab Guangzhou510330 China Faculty of Mathematics and Computer Science University of Mohamed Bachir El Ibrahimi Bordj Bou Arreridj34030 Algeria Faculty of Information Technology Nam Dinh University of Technology Education Nam Dinh420000 Viet Nam Department of Electrical and Telecommunications Engineering National Advanced School of Engineering - University of Yaounde I Yaounde337 Cameroon Department of Computer Science University of Social Sciences and Management Bamako2575 Mali

ISBN: (纸本)9783031200984

Blockchain testing plays a critical role in the maturation of blockchain technology by ensuring the quality of implemented functional and non-functional requirements. In the new global economy, rapid time to market has become a central issue: developers fail to scrutinize their blockchain designs prior to deployment and customers undergo negative experiences that hurt the widespread adoption of the blockchain technology. Previous published studies aimed for effective blockchain simulators. However, existing solutions exhibit several drawbacks: they rely on guesswork, conceal low-level implementation details, lack expected realistic outcomes and automated testing, as well as lag in smart contract vulnerability analysis. In this paper, we introduce highsimb: the first concrete blockchain high simulation platform for Ethereum and Hyperledger Fabric that supports smart contract vulnerability detection. Unlike a testnet, the blockchain tester can customize any low-level detail to achieve realistic expected results under automated testing. Theoretical analysis demonstrates our concrete simulator is highly observable, supports realistic feedback, is scalable, detects smart contract vulnerabilities, has strong white-box testing capabilities and automates experiments. Our framework complements existing blockchain simulators and introduces a novel development paradigm for blockchain testing. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

关键词： Blockchain

来源：评论

学校读者我要写书评

暂无评论

Direct Shooting Method for Numerical Optimal Control: A Modified Transcription Approach

arXiv

引用

arXiv 2024年

作者： Tang, Jiawei Zhong, Yuxing Wang, Pengyu Chen, Xingzhou Wu, Shuang Shi, Ling The Department of Electronic and Computer Engineering Hong Kong University of Science and Technology Clear Water Bay Hong Kong Shenzhen Key Laboratory of Robotics Perception and Intelligence The Department of Electronic and Electrical Engineering Southern University of Science and Technology Shenzhen China The Noah’s Ark Lab Huawei Canada

Direct shooting is an efficient method to solve numerical optimal control. It utilizes the Runge-Kutta scheme to discretize a continuous-time optimal control problem making the problem solvable by nonlinear programming solvers. However, conventional direct shooting raises a contradictory dynamics issue when using an augmented state to handle high-order systems. This paper fills the research gap by considering the direct shooting method for high-order systems. We derive the modified Euler and Runge-Kutta-4 methods to transcribe the system dynamics constraint directly. Additionally, we provide the global error upper bounds of our proposed methods. A set of benchmark optimal control problems shows that our methods provide more accurate solutions than existing approaches. © 2024, CC BY.

关键词： Nonlinear programming

来源：评论

学校读者我要写书评

暂无评论

Deepwalk-aware graph convolutional networks

引用

science China(Information sciences) 2022年第5期65卷 81-95页

作者： Taisong JIN Huaqiang DAI Liujuan CAO Baochang ZHANG Feiyue HUANG Yue GAO Rongrong JI Media Analytics and Computing Lab Department of Computer Science and Technology School of InformaticsXiamen University Media Analytics and Computing Lab Department of Artificial Intelligence School of InformaticsXiamen University School of Automation Science and Electrical Engineering Beihang University Tencent Youtu Lab School of Software Tsinghua University

Graph convolutional networks(GCNs) provide a promising way to extract the useful information from graph-structured data. Most of the existing GCNs methods usually focus on local neighborhood information based on specific convolution operations, and ignore the global structure of the input data. To extract the latent representation for the graph-structured data more effectively, we introduce a deepwalk strategy into GCNs to efficiently explore the global graph information. This strategy can complement the local neighborhood information of a graph, resulting in the more robust representation for the graph *** fusion of the local neighboring and global structured information of a graph can further facilitate deep feature learning at the output layer of GCNs for node classification. Experimental results show that the proposed model has achieved state-of-the-art results on three benchmark datasets including Cora, Citeseer,and Pubmed citation networks.

关键词： graph convolutional networks global information fusion node classification

来源：评论

学校读者我要写书评

暂无评论

Byzantine Robust Aggregation in Federated Distillation with Adversaries

Byzantine Robust Aggregation in Federated Distillation with ...

引用

International Conference on distributed Computing Systems

作者： Wenrui Li Hanlin Gu Sheng Wan Zhirong Luan Wei Xi Lixin Fan Qiang Yang Badong Chen Institute of Artificial Intelligence and Robotics Xi'an Jiaotong University Xi'an China WeBank AI Lab Shenzhen China Department of Computer Science Hong Kong University of Science and Technology Hong Kong China School of Electrical Engineering Xi'an University of Technology Xi'an China Department of Computer Science Xi'an Jiaotong University Xi'an China

ISBN: (数字)9798350386059

ISBN: (纸本)9798350386066

Federated learning empowers privacy-preserving, multi-party secure model training without the necessity of sharing raw data. In recent years, knowledge distillation has emerged as a promising solution to address the significant challenge of model heterogeneity within federated learning. However, current research often overlooks the potential threats posed by Byzantine attacks, which can significantly compromise the security of federated distillation. Previous work on Byzantine attacks has been primarily focused on manipulating local gradients to compromise global model, lacking attacks on logits in knowledge distillation scenarios. In this paper, we introduce two innovative attacks, shedding light on the inherent risks in federated distillation. The proposed attacks include a top-k attack, which perturbs the top k values of logits in each column, and an impersonation attack, which emulates knowledge significantly deviating from the norm. To counter such attacks, we propose a robust aggregation strategy-FedTGD (Federated Top Guard Distillation), designed to ensure robust distillation with heterogeneous models. Specifically, FedTGD incorporates Density-Based Spatial Clustering of Applications with Noise (DBSCAN) and maximum cosine similarity on top-k values of logits to select benign knowledge. Experimental evaluations conducted on FEMNIST and CIFAR100 datasets, considering scenarios for both IID and Non-IID, reveal that top-k attack results in a substantial 27.16% accuracy reduction for FedMD. In contrast, our aggregation method shows a marginal 0.7% accuracy decrease under top-k attacks, outperforming state-of-the-art baselines.

关键词： Training Accuracy Federated learning Computational modeling Noise Impersonation attacks Data models

来源：评论

学校读者我要写书评

暂无评论

Training Artificial Neural Networks by Coordinate Search Algorithm

arXiv

引用

arXiv 2024年

作者： Rokhsatyazdi, Ehsan Rahnamayan, Shahryar Miyandoab, Sevil Zanjani Bidgoli, Azam Asilian Tizhoosh, H.R. Lab Department of Electrical Computer and Software Engineering Ontario Tech University OshawaON Canada Department of Engineering Brock University St. CatharinesON Canada Faculty of Science Wilfrid Laurier University WaterlooON Canada Rhazes Lab Department of Artificial Intelligence and Informatics Mayo Clinic RochesterMN United States

Training Artificial Neural Networks (ANNs) poses a challenging and critical problem in machine learning. Despite the effectiveness of gradient-based learning methods, such as Stochastic Gradient Descent (SGD), in training neural networks, they do have several limitations. For instance, they require differentiable activation functions, and cannot optimize a model based on several independent non-differentiable loss functions simultaneously;for example, the F1-score, which is used during testing, can be used during training when a gradient-free optimization algorithm is utilized. Furthermore, the training (i.e., optimization of weights) in any DNN can be possible with a small size of the training dataset. To address these concerns, we propose an efficient version of the gradient-free Coordinate Search (CS) algorithm, an instance of General Pattern Search (GPS) methods, for training (i.e., optimizing) neural networks. The proposed algorithm can be used with non-differentiable activation functions and tailored to multi-objective/multi-loss problems. Finding the optimal values for weights of ANNs is a large-scale optimization problem. Therefore, instead of finding the optimal value for each variable, which is the common technique in classical CS, we accelerate optimization and convergence by bundling the variables (i.e., weights). In fact, this strategy is a form of dimension reduction for optimization problems. Based on the experimental results, the proposed method is comparable with the SGD algorithm, and in some cases, it outperforms the gradient-based approach. Particularly, in situations with insufficient labeled training data, the proposed CS method performs better. The performance plots demonstrate a high convergence rate, highlighting the capability of our suggested method to find a reasonable solution with fewer function calls. As of now, the only practical and efficient way of training ANNs with hundreds of thousands of weights is gradient-based algorithms such

关键词： Optimization

来源：评论

学校读者我要写书评

暂无评论

Building Text and Speech Benchmark Datasets and Models for Low-Resourced East African Languages: Experiences and Lessons

Applied AI Letters

引用

Applied AI Letters 2024年第2期5卷

作者： Nakatumba-Nabende, Joyce Babirye, Claire Nabende, Peter Tusubira, Jeremy Francis Mukiibi, Jonathan Wairagala, Eric Peter Mutebi, Chodrine Bateesa, Tobius Saul Nahabwe, Alvin Tusiime, Hewitt Katumba, Andrew Department of Computer Science Makerere University Kampala Uganda Makerere Artificial Intelligence Lab Makerere University Kampala Uganda Department of Information Systems Makerere University Kampala Uganda Department of Electrical and Computer Engineering Makerere University Kampala Uganda

Africa has over 2000 languages;however, those languages are not well represented in the existing natural language processing ecosystem. African languages lack essential digital resources to effectively engage in advancing language technologies. There is a need to generate high-quality natural language processing resources for low-resourced African languages. Obtaining high-quality speech and text data is expensive and tedious because it can involve manual sourcing and verification of data sources. This paper discusses the process taken to curate and annotate text and speech datasets for five East African languages: Luganda, Runyankore-Rukiga, Acholi, Lumasaba, and Swahili. We also present results obtained from baseline models for machine translation, topic modeling and classification, sentiment classification, and automatic speech recognition tasks. Finally, we discuss the experiences, challenges, and lessons learned in creating the text and speech datasets. © 2024 The Authors. Applied AI Letters published by John Wiley & Sons Ltd.

关键词： automatic speech recognition low-resourced language machine translation speech dataset text dataset topic modeling

来源：评论

学校读者我要写书评

暂无评论

HCA-NET: Hierarchical Context Attention Network for Intervertebral Disc Semantic labeling

HCA-NET: Hierarchical Context Attention Network for Interver...

引用

IEEE International Symposium on Biomedical Imaging

作者： Afshin Bozorgpour Bobby Azad Reza Azad Yury Velichko Ulas Bagci Dorit Merhof Faculty of Informatics and Data Science University of Regensburg Germany Electrical Engineering and Computer Science Department South Dakota State University USA Faculty of Electrical Engineering and Information Technology RWTH Aachen University Germany Machine and Hybrid Intelligence Lab Northwestern University Chicago IL USA Fraunhofer Institute for Digital Medicine MEVIS Germany

ISBN: (数字)9798350313338

ISBN: (纸本)9798350313345

Accurate and automated segmentation of intervertebral discs (IVDs) in medical images is crucial for assessing spine-related disorders, such as osteoporosis, vertebral fractures, or IVD herniation. We present HCA-Net, a novel contextual attention network architecture for semantic labeling of IVDs, with a special focus on exploiting prior geometric information. Our approach excels at processing features across different scales and effectively consolidating them to capture the intricate spatial relationships within the spinal cord. To achieve this, HCA-Net models IVD labeling as a pose estimation problem, aiming to minimize the discrepancy between each predicted IVD location and its corresponding actual joint location. In addition, we introduce a skeletal loss term to reinforce the model’s geometric dependence on the spine. This loss function is designed to constrain the model’s predictions to a range that matches the general structure of the human vertebral skeleton. As a result, the network learns to reduce the occurrence of false predictions and adaptively improves the accuracy of IVD location estimation. Through extensive experimental evaluation on multi-center spine datasets, our approach consistently outperforms previous state-of-the-art methods on both MRI T1w and T2w modalities. The code-base is accessible to the public on GitHub.

关键词： Adaptation models Accuracy Spinal cord Semantics Pose estimation Predictive models Skeleton

来源：评论

学校读者我要写书评

暂无评论

IMPROVING EQUIVARIANT NETWORKS WITH PROBABILISTIC SYMMETRY BREAKING

arXiv

引用

arXiv 2025年

作者： Lawrence, Hannah Portilheiro, Vasco Zhang, Yan Kaba, Sékou-Oumar Department of Electrical Engineering and Computer Science MIT United States Gatsby Computational Neuroscience Unit UCL United Kingdom Samsung – SAIT AI Lab Montreal Canada Mila – Quebec Artficial Intelligence Institute Canada McGill University Canada

Equivariance encodes known symmetries into neural networks, often enhancing generalization. However, equivariant networks cannot break symmetries: the output of an equivariant network must, by definition, have at least the same self-symmetries as the input. This poses an important problem, both (1) for prediction tasks on domains where self-symmetries are common, and (2) for generative models, which must break symmetries in order to reconstruct from highly symmetric latent spaces. This fundamental limitation can be addressed by considering equivariant conditional distributions, instead of equivariant functions. We present novel theoretical results that establish necessary and sufficient conditions for representing such distributions. Concretely, this representation provides a practical framework for breaking symmetries in any equivariant network via randomized canonicalization. Our method, SymPE (Symmetry-breaking Positional Encodings), admits a simple interpretation in terms of positional encodings. This approach expands the representational power of equivariant networks while retaining the inductive bias of symmetry, which we justify through generalization bounds. Experimental results demonstrate that SymPE significantly improves performance of group-equivariant and graph neural networks across diffusion models for graphs, graph autoencoders, and lattice spin system modeling. © 2025, CC BY-NC-SA.

关键词： Graph neural networks

来源：评论

学校读者我要写书评

暂无评论

Beyond the Horizon: Decoupling UAVs Multi-View Action Recognition via Partial Order Transfer

arXiv

引用

arXiv 2025年

作者： Liu, Wenxuan Zhong, Xian Zhou, Zhuo Yang, Siyuan Lin, Chia-Wen Kot, Alex Chichung School of Computer Science Peking University Beijing100871 China Hubei Key Laboratory of Transportation Internet of Things School of Computer Science and Artificial Intelligence Wuhan University of Technology Wuhan430070 China Rapid-Rich Object Search Lab School of Electrical and Electronic Engineering Nanyang Technological University Singapore639798 Singapore School of Computer Science Wuhan University Wuhan430072 China Department of Electrical Engineering National Tsing Hua University Hsinchu30013 Taiwan

Action recognition in unmanned aerial vehicles (UAVs) poses unique challenges due to significant view variations along the vertical spatial axis. Unlike traditional ground-based settings, UAVs capture actions from a wide range of altitudes, resulting in considerable appearance discrepancies. We introduce a multi-view formulation tailored to varying UAV altitudes and empirically observe a partial order among views, where recognition accuracy consistently decreases as the altitude increases. This motivates a novel approach that explicitly models the hierarchical structure of UAV views to improve recognition performance across altitudes. To this end, we propose the Partial Order Guided Multi-View Network (POG-MVNet), designed to address drastic view variations by effectively leveraging view-dependent information across different altitude levels. The framework comprises three key components: a View Partition (VP) module, which uses the head-to-body ratio to group views by altitude;an Order-aware Feature Decoupling (OFD) module, which disentangles action-relevant and view-specific features under partial order guidance;and an Action Partial Order Guide (APOG), which leverages the partial order to transfer informative knowledge from easier views to support learning in more challenging ones. We conduct experiments on DRONE-ACTION, MOD20, and UAV datasets, demonstrating that POG-MVNet significantly outperforms competing methods. For example, POG-MVNet achieves a 4.7% improvement on DRONE-ACTION dataset and a 3.5% improvement on UAV dataset compared to state-of-the-art methods ASAT and FAR. The code for POG-MVNet will be made available soon. Copyright © 2025, The Authors. All rights reserved.

关键词： Drones

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：