Search Results - Inner Mongolia University Library

arXiv, 2024

Authors: de la Rosa, Ezequiel; Reyes, Mauricio; Liew, Sook-Lei; Hutton, Alexandre; Wiest, Roland; Kaesmacher, Johannes; Hanning, Uta; Hakim, Arsany; Zubal, Richard; Valenzuela, Waldo; Robben, David; Sima, Diana M.; Anania, Vincenzo; Brys, Arne; Meakin, James A.; Mickan, Anne; Broocks, Gabriel; Heitkamp, Christian; Gao, Shengbo; Liang, Kongming; Zhang, Ziji; Siddiquee, Md Mahfuzur Rahman; Myronenko, Andriy; Ashtari, Pooya; Van Huffel, Sabine; Jeong, Hyun-Su; Yoon, Chi-Ho; Kim, Chulhong; Huo, Jiayu; Ourselin, Sebastien; Sparks, Rachel; Clèrigues, Albert; Oliver, Arnau; Lladó, Xavier; Chalcroft, Liam; Pappas, Ioannis; Bertels, Jeroen; Heylen, Ewout; Moreau, Juliette; Hatami, Nima; Frindel, Carole; Qayyum, Abdul; Mazher, Moona; Puig, Domenec; Lin, Shao-Chieh; Juan, Chun-Jung; Hu, Tianxi; Boone, Lyndon; Goubran, Maged; Liu, Yi-Jui; Wegener, Susanne; Kofler, Florian; Ezhov, Ivan; Shit, Suprosanna; Hernandez Petzsche, Moritz R.; Menze, Bjoern; Kirschke, Jan S.; Wiestler, Benedikt

Affiliations: Department of Quantitative Biomedicine, University of Zurich, Zurich, Switzerland; Department of Informatics, Technical University Munich, Germany; Icometrix, Leuven, Belgium; ARTORG Center for Biomedical Research, University of Bern, Bern, Switzerland; Department of Radiation Oncology, University Hospital Bern, University of Bern, Switzerland; University of Bern, Bern, Switzerland; Chan Division of Occupational Science and Occupational Therapy, University of Southern California, Los Angeles, CA, United States; Stevens Neuroimaging and Informatics Institute, Department of Neurology, Keck School of Medicine, University of Southern California, United States; University Institute of Diagnostic and Interventional Neuroradiology, Inselspital, Bern, Switzerland; University Institute of Diagnostic and Interventional Neuroradiology, University Hospital Bern, Inselspital, University of Bern, Bern, Switzerland; Department of Diagnostic and Interventional Neuroradiology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany; Department of Medical Imaging, Radboud University Medical Center, Institute for Health Sciences, Nijmegen, Netherlands; Deepwise AI Lab, Beijing, China; Beijing University of Posts and Telecommunications, Beijing, China; School of Computing and Augmented Intelligence, Arizona State University, Tempe, AZ, United States; NVIDIA, Santa Clara, CA, United States; STADIUS Center for Dynamical Systems, Signal Processing and Data Analytics, KU Leuven, Leuven, Belgium; Pohang, Korea, Republic of; School of Biomedical Engineering & Imaging Sciences, King’s College London, United Kingdom; Institute of Computer Vision and Robotics, University of Girona, Spain; Wellcome Centre for Human Neuroimaging, University College London, London, United Kingdom; Laboratory of Neuro Imaging, Stevens Institute for Neuroimaging and Informatics, Keck School of Medicine, University of Southern California, Los Angeles, United States; KU Leuven, Leuven, Belgium; CREATIS, Université Lyon1, CNRS UMR5220, INSERM U1206, INSA-Lyon, Villeurbanne696

Diffusion-weighted MRI (DWI) is essential for stroke diagnosis, treatment decisions, and prognosis. However, image and disease variability hinder the development of generalizable AI algorithms with clinical value. We address this gap by presenting a novel ensemble algorithm derived from the 2022 Ischemic Stroke Lesion Segmentation (ISLES) challenge. ISLES’22 provided 400 patient scans with ischemic stroke from various medical centers, facilitating the development of a wide range of cutting-edge segmentation algorithms by the research community. By assessing them against a hidden test set, we identified strengths, weaknesses, and potential biases. Through collaboration with leading teams, we combined top-performing algorithms into an ensemble model that overcomes the limitations of individual solutions. Our ensemble model combines the individual algorithms’ strengths and achieved superior ischemic lesion detection and segmentation accuracy (median Dice score: 0.82, median lesion-wise F1 score: 0.86) on our internal test set compared to individual algorithms. This accuracy generalized well across diverse image and disease variables. Furthermore, the model excelled in extracting clinical biomarkers like lesion types and affected vascular territories. Notably, in a Turing-like test, neuroradiologists consistently preferred the algorithm’s segmentations over manual expert efforts, highlighting increased comprehensiveness and precision. Validation using a real-world external dataset (N=1686) confirmed the model’s generalizability (median Dice score: 0.82, median lesion-wise F1 score: 0.86). The algorithm’s outputs also demonstrated strong correlations with clinical scores (admission NIHSS and 90-day mRS) on par with or exceeding expert-derived results, underlining its clinical relevance. This study offers two key findings. First, we present an ensemble algorithm that detects and segments ischemic stroke lesions on DWI across diverse scenarios on par with expert (neuro)rad

Keywords: Diagnosis


Classify EEG and Reveal Latent Graph Structure with Spatio-Temporal Graph Convolutional Neural Network


IEEE International Conference on Data Mining (ICDM)

Authors: Xiaoyu Li; Buyue Qian; Jishang Wei; An Li; Xuan Liu; Qinghua Zheng

Affiliations: School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, Shaanxi, China; HP Labs, Palo Alto, CA, USA; National Engineering Lab for Big Data Analytics, Xi’an Jiaotong University, Xi’an, Shaanxi, China

Electroencephalogram (EEG) is a test that detects brain activity using multiple electrodes placed on the scalp. Multiple channels of EEG signals are recorded through the electrodes and are widely used in applications such as neurological disease diagnosis, emotion recognition, and behavior modeling. Recently, deep learning methods have been applied to classify EEG signals, where the different EEG channels are usually treated as a 2D grid input to the machine learning model. This data format does not consider the complex connections among the EEG channels. In our work, we treat EEG signals as frames of a graph, and propose an end-to-end edge-aware spatio-temporal graph convolutional neural network for EEG classification. Specifically, we iteratively apply a graph convolutional layer spatially and a standard convolutional layer temporally. Since there is no prior knowledge about the exact connections among EEG channels, our model initializes the connections as a complete graph and applies a learnable mask to capture graph structure at different levels. Furthermore, we propose an iterative method, based on the information aggregation in the graph convolution mechanism, to reveal the latent graph structure. Empirical evaluation shows that our model achieves superior performance over state-of-the-art methods for EEG classification, and the learnt and revealed latent EEG graph structure is verified to be meaningful by neuroscientists.
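The masked graph convolution mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the row normalisation, the ReLU, the function name `masked_graph_conv`, and all numeric values are assumptions made purely for illustration. The sketch starts from a complete graph over the channels, scales each edge by a mask entry (learnable in the paper, fixed numbers here), and aggregates neighbour features for one time frame.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def masked_graph_conv(x, mask, weight):
    """One spatial graph-convolution step on a masked complete graph.

    x:      channels x features (one time frame of EEG features)
    mask:   channels x channels edge weights (a learnable parameter in the
            paper; plain numbers in this sketch)
    weight: features x out_features projection
    """
    n = len(x)
    # Complete graph with self-loops: every channel is linked to every channel.
    adjacency = [[1.0] * n for _ in range(n)]
    # The element-wise mask decides how strongly each edge contributes.
    masked = [[adjacency[i][j] * mask[i][j] for j in range(n)] for i in range(n)]
    # Row-normalise so each channel takes a weighted average of its neighbours
    # (an assumed normalisation, chosen for simplicity).
    normalised = []
    for row in masked:
        total = sum(row)
        normalised.append([v / total if total else 0.0 for v in row])
    aggregated = matmul(normalised, x)
    projected = matmul(aggregated, weight)
    # ReLU non-linearity (also an assumption of this sketch).
    return [[max(0.0, v) for v in row] for row in projected]


# Three channels, two features per channel, hypothetical values.
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
mask = [[1.0, 0.0, 0.0],   # channel 0 only listens to itself
        [0.5, 0.5, 0.0],   # channel 1 averages channels 0 and 1
        [0.0, 0.0, 1.0]]   # channel 2 only listens to itself
weight = [[1.0, 0.0], [0.0, 1.0]]  # identity projection
out = masked_graph_conv(x, mask, weight)
print(out)
```

In a trained model the mask entries would be updated by gradient descent, and large entries could then be read as candidate connections between channels; the temporal convolution described in the abstract is omitted here.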


Towards a Personalized Item Recommendation Approach in Social Tagging Systems Using Intuitionistic Fuzzy DBSCAN


10th International Conference on Intelligent Human-Machine Systems and Cybernetics, IHMSC 2018

Authors: Guan, Chun; Yuen, Kevin Kam Fung; Yue, Yong

Affiliations: Department of Computer Science, University of Liverpool, Liverpool, United Kingdom; Research Institute of Big Data Analytics, Department of Computer Science and Software Engineering, Xi'an Jiaotong-Liverpool University, Suzhou, China; School of Business, University of Social Sciences, Singapore, Singapore; Department of Computer Science and Software Engineering, Xi'an Jiaotong-Liverpool University, Suzhou, China

ISBN: (print) 9781538658369

In folksonomies, users annotate items with abundant personalized tags. The tags can be used in recommendation systems to produce meaningful information. The Density-Based Spatial Clustering of Applications with Noise (DBSCAN) can be applied in recommendation systems. This paper proposes the Intuitionistic Fuzzy DBSCAN (IF-DBSCAN) for the personalized item recommendation in folksonomy. The IF-DBSCAN is used to cluster items with respect to the user-defined tags. The Intuitionistic Fuzzy Set (IFS) is used to represent tag values which are vague and uncertain. DBSCAN can cluster items with the tags represented by using IFSs into different groups. An example of movie recommendation is demonstrated for the applicability of the proposed method. © 2018 IEEE.

Keywords: Recommender systems


On privacy protection of latent dirichlet allocation model training


arXiv, 2019

Authors: Zhao, Fangyuan; Ren, Xuebin; Yang, Shusen; Yang, Xinyu

Affiliations: School of Computer Science and Technology, Xi'an Jiaotong University, China; National Engineering Laboratory for Big Data Analytics, Xi'an Jiaotong University, China; Ministry of Education Key Lab For Intelligent Networks and Network Security, Xi'an Jiaotong University, China

Latent Dirichlet Allocation (LDA) is a popular topic modeling technique for discovery of hidden semantic architecture of text datasets, and plays a fundamental role in many machine learning applications. However, like many other machine learning algorithms, the process of training an LDA model may leak the sensitive information of the training datasets and bring significant privacy risks. To mitigate the privacy issues in LDA, we focus on studying privacy-preserving algorithms of LDA model training in this paper. In particular, we first develop a privacy monitoring algorithm to investigate the privacy guarantee obtained from the inherent randomness of the collapsed Gibbs sampling (CGS) process in a typical LDA training algorithm on centralized curated datasets. Then, we further propose a locally private LDA training algorithm on crowdsourced data to provide local differential privacy for individual data contributors. The experimental results on real-world datasets demonstrate the effectiveness of our proposed algorithms. Copyright © 2019, The Authors. All rights reserved.

Keywords: Statistics


Analyzing the Structure of Attention in a Transformer Language Model


arXiv, 2019

Authors: Vig, Jesse; Belinkov, Yonatan

Affiliations: Palo Alto Research Center, Machine Learning and Data Science Group, Interaction and Analytics Lab, Palo Alto, CA, United States; Harvard John A. Paulson School of Engineering and Applied Sciences, MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, United States

The Transformer is a fully attention-based alternative to recurrent networks that has achieved state-of-the-art results across a range of NLP tasks. In this paper, we analyze the structure of attention in a Transformer language model, the GPT-2 small pretrained model. We visualize attention for individual instances and analyze the interaction between attention and syntax over a large corpus. We find that attention targets different parts of speech at different layer depths within the model, and that attention aligns with dependency relations most strongly in the middle layers. We also find that the deepest layers of the model capture the most distant relationships. Finally, we extract exemplar sentences that reveal highly specific patterns targeted by particular attention heads. Copyright © 2019, The Authors. All rights reserved.

Keywords: Recurrent neural networks


Big-data clustering: K-Means or k-indicators?


arXiv, 2019

Authors: Chen, Feiyu; Yang, Yuchen; Xu, Liwei; Zhang, Taiping; Zhang, Yin

Affiliations: School of Big Data and Software Engineering, Chongqing University, Chongqing, China; Department of Computational and Applied Mathematics, Rice University, Houston, TX, United States; School of Mathematical Sciences, University of Electronic Science and Technology of China, Chengdu, Sichuan, China; College of Computer Science, Chongqing University, Chongqing, China; Institute for Data and Decision Analytics, Chinese University of Hong Kong, Shenzhen, China

The K-means algorithm is arguably the most popular data clustering method, commonly applied to processed datasets in some "feature spaces", as in spectral clustering. Highly sensitive to initializations, however, K-means encounters a scalability bottleneck with respect to the number of clusters K as this number grows in big data applications. In this work, we promote a closely related model called K-indicators model and construct an efficient, semi-convex-relaxation algorithm that requires no randomized initializations. We present extensive empirical results to show advantages of the new algorithm when K is large. In particular, using the new algorithm to start the K-means algorithm, without any replication, can significantly outperform the standard K-means with a large number of currently state-of-the-art random replications. Copyright © 2019, The Authors. All rights reserved.

Keywords: K-means clustering


FAIR Principles: Interpretations and Implementation Considerations


Data Intelligence, 2020, Vol. 2, No. 1, pp. 10-29, 293-302, 322

Authors: Annika Jacobsen; Ricardo de Miranda Azevedo; Nick Juty; Dominique Batista; Simon Coles; Ronald Cornet; Melanie Courtot; Merce Crosas; Michel Dumontier; Chris T. Evelo; Carole Goble; Giancarlo Guizzardi; Karsten Kryger Hansen; Ali Hasnain; Kristina Hettne; Jaap Heringa; Rob W.W. Hooft; Melanie Imming; Keith G. Jeffery; Rajaram Kaliyaperumal; Martijn G. Kersloot; Christine R. Kirkpatrick; Tobias Kuhn; Ignasi Labastida; Barbara Magagna; Peter McQuilton; Natalie Meyers; Annalisa Montesanti; Mirjam van Reisen; Philippe Rocca-Serra; Robert Pergl; Susanna-Assunta Sansone; Luiz Olavo Bonino da Silva Santos; Juliane Schneider; George Strawn; Mark Thompson; Andra Waagmeester; Tobias Weigel; Mark D. Wilkinson; Egon L. Willighagen; Peter Wittenburg; Marco Roos; Barend Mons; Erik Schultes

Affiliations: Leiden University Medical Center, Leiden 2333 ZA, The Netherlands; Institute of Data Science, Maastricht University, Universiteitssingel 60, Maastricht 6229 ER, The Netherlands; Department of Computer Science, The University of Manchester, Oxford Road, Manchester M13 9PL, UK; Oxford e-Research Centre, Department of Engineering Sciences, University of Oxford, Oxford OX1 3PJ, UK; School of Chemistry, Faculty of Engineering and Physical Sciences, University of Southampton, SO17 1BJ, UK; Amsterdam UMC, University of Amsterdam, Amsterdam 1000 GG, The Netherlands; European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK; Harvard University, Cambridge, Massachusetts 02138, USA; Department of Bioinformatics–BiGCaT, NUTRIM, Maastricht University, Maastricht 6229 ER, The Netherlands; Conceptual and Cognitive Modeling Research Group (CORE), Free University of Bozen-Bolzano, Bolzano 39100, Italy; Aalborg University, Aalborg DK-9220, Denmark; Insight Centre for Data Analytics, National University of Ireland Galway, H91 TK33, Ireland; Centre for Digital Scholarship, Leiden University Libraries, Leiden 2333 ZA, The Netherlands; Department of Computer Science, Vrije Universiteit Amsterdam, De Boelelaan 1105, 1081 HV Amsterdam, The Netherlands; Dutch Techcentre for Life Sciences (DTL), Utrecht, The Netherlands; SURF, Utrecht 3511 EP, The Netherlands; Keith G Jeffery Consultants, Faringdon, UK; Castor EDC, Paasheuvelweg 25, Wing 5D, 1105 BP Amsterdam, The Netherlands; San Diego Supercomputer Center, University of California San Diego, La Jolla, California 92093, USA; Learning and Research Resources Centre (CRAI), Universitat de Barcelona, 08007 Barcelona, Spain; Environment Agency Austria, A-1090 Vienna, Austria; University of Notre Dame, 75004 Paris, France; Health Research Board (HRB), Dublin 2, DO2 H638, Ireland; Liacs Institute of Advanced Computer Science, Leiden University, 2311 GJ Leiden, The Netherlands; Czech Technical University in Prague, Faculty of Information Technology (FIT CTU), 16000 Prague 6, Czech Republic; GO FAIR International Support & Coordination Office (GFISCO), Leiden, The Netherlands; Harvard Catalys

The FAIR principles have been widely cited, endorsed and adopted by a broad range of stakeholders since their publication in *** intention, the 15 FAIR guiding principles do not dictate specific technological implementations, but provide guidance for improving Findability, Accessibility, Interoperability and Reusability of digital *** has likely contributed to the broad adoption of the FAIR principles, because individual stakeholder communities can implement their own FAIR ***, it has also resulted in inconsistent interpretations that carry the risk of leading to incompatible ***, while the FAIR principles are formulated on a high level and may be interpreted and implemented in different ways, for true interoperability we need to support convergence in implementation choices that are widely accessible and (re)-*** introduce the concept of FAIR implementation considerations to assist accelerated global participation and convergence towards accessible, robust, widespread and consistent FAIR *** self-identified stakeholder community may either choose to reuse solutions from existing implementations, or when they spot a gap, accept the challenge to create the needed solution, which, ideally, can be used again by other communities in the ***, we provide interpretations and implementation considerations (choices and challenges) for each FAIR principle.

Keywords: FAIR guiding principles; FAIR implementation; FAIR convergence; FAIR communities; choices and challenges


Unsupervised Conditional Adversarial Networks for Tax Evasion Detection


IEEE International Conference on Big Data

Authors: Rongzhe Wei; Bo Dong; Qinghua Zheng; Xulyu Zhu; Jianfei Ruan; Huan He

Affiliations: Qian Xuesen College, School of Mathematics and Statistics, Xi’an Jiaotong University, Xi’an, China; National Engineering Lab of Big Data Analytics, School of Distance Education, Xi’an Jiaotong University, Xi’an, China; School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, China; SPKLSTN Lab, Xi’an Jiaotong University, Xi’an, China

ISBN: (digital) 9781728108582

ISBN: (print) 9781728108599

The identification of tax evasion plays an important role in ensuring tax order, promoting the level of tax collection and management, and reducing tax losses. With the advancements in data mining technology, many machine learning techniques have yielded results in identifying tax evasion. However, to realize satisfactory performance, these models require large amounts of human annotated data. In the tax field, unlabeled tax data are abundant, data annotation in a single region is expensive, and the distributions of characteristics differ among regions; these factors pose substantial difficulties in the development of an identification model. Existing tax evasion detection methods are either trained for single-region tasks, in which case they perform poorly on inter-region tax evasion identification due to the discrepancies in feature distributions, or utilize labeled data from both the target-task field and different but related auxiliary fields to reuse and transfer knowledge of the target domain data, in which case they cannot deal with scenarios in which there are no labeled data in target audit tasks. Although current unsupervised transfer learning techniques can train models in labeled regions for unlabeled regions, large intra-class distribution discrepancies cannot be perfectly minimized in tax evasion detection scenarios. To better address the above challenges, this paper proposes a general architecture, namely, the unsupervised conditional adversarial networks (UCAN) for tax evasion detection, which is the first approach to solve audit tasks in unlabeled target domains via inter-region transfer. Our architecture establishes an adversarial neural network adding label information in the distribution adapter, which can granularly adapt the joint probability distribution (JPD) of the data. We introduce a constraint that is based on the conditional maximum mean discrepancy (CMMD) of the extracted features to align the conditional probability distribution (CPD)

Keywords: Finance; Feature extraction; Task analysis; Adaptation models; Computer architecture; Data models; Data mining


Automatic Extraction of Medication Mentions from Tweets-Overview of the BioCreative VII Shared Task 3 Competition


Database: The Journal of Biological Databases and Curation, 2023, Vol. 2023, No. 2023, p. baac108

Authors: Davy Weissenbacher; Karen O'Connor; Siddharth Rawal; Yu Zhang; Richard Tzong-Han Tsai; Timothy Miller; Dongfang Xu; Carol Anderson; Bo Liu; Qing Han; Jinfeng Zhang; Igor Kulev; Berkay Köprü; Raul Rodriguez-Esteban; Elif Ozkirimli; Ammer Ayach; Roland Roller; Stephen Piccolo; Peijin Han; V G Vinod Vydiswaran; Ramya Tekumalla; Juan M Banda; Parsa Bagherzadeh; Sabine Bergler; João F Silva; Tiago Almeida; Paloma Martinez; Renzo Rivera-Zavala; Chen-Kai Wang; Hong-Jie Dai; Luis Alberto Robles Hernandez; Graciela Gonzalez-Hernandez

Affiliations: Department of Computational Biomedicine Cedars-Sinai Medical Center Los Angeles CA USA. DBEI The Perelman School of Medicine University of Pennsylvania Philadelphia PA USA. Department of Computer Science and Information Engineering National Central University No. 300 Zhongda Rd Zhongli District Taoyuan 320 Taiwan. IoX Center National Taiwan University Da'an District Section 4 Roosevelt Rd No. 1 Barry Lam Hall Taipei 106 Taiwan. Research Center for Humanities and Social Sciences Academia Sinica No. 128 Section 2 Academia Rd Nangang District Taipei 115 Taiwan. Computational Health Informatics Program Boston Children's Hospital Boston MA USA. Department of Pediatrics Harvard Medical School Boston MA USA. NVIDIA Santa Clara CA USA. Department of Statistics Florida State University Tallahassee FL USA. Data and Analytics Chapter F. Hoffmann-La Roche Ltd Switzerland. Pharmaceutical Research and Early Development Roche Innovation Center Basel Switzerland. Speech and Language Technology Lab DFKI Berlin Germany. Department of Biology Brigham Young University Provo UT USA. Department of Computational Medicine and Bioinformatics Medical School University of Michigan Ann Arbor MI USA. Department of Learning Health Sciences Medical School University of Michigan Ann Arbor MI USA. School of Information University of Michigan Ann Arbor MI USA. Department of Computer Science Georgia State University Atlanta GA USA. CLaC Labs Concordia University Montreal Canada. DETI Institute of Electronics and Informatics Engineering of Aveiro University of Aveiro Portugal. Department of Computation University of A Coruña Spain. Computer Science and Engineering Department Universidad Carlos III de Madrid Madrid Spain. Big Data Laboratory Chunghwa Telecom Laboratories Taoyuan Taiwan. Department of Computer Science National Yang Ming Chiao Tung University Hsinchu Taiwan. Department of Electrical Engineering College of Electrical Engineering and Computer Sci

This study presents the outcomes of the shared task competition BioCreative VII (Task 3) focusing on the extraction of medication names from a Twitter user's publicly available tweets (the user's 'timeline'). In general, detecting health-related tweets is notoriously challenging for natural language processing tools. The main challenge, aside from the informality of the language used, is that people tweet about any and all topics, and most of their tweets are not related to health. Thus, finding those tweets in a user's timeline that mention specific health-related concepts such as medications requires addressing extreme imbalance. Task 3 called for detecting tweets in a user's timeline that mention a medication name and, for each detected mention, extracting its span. The organizers made available a corpus consisting of 182 049 tweets publicly posted by 212 Twitter users with all medication mentions manually annotated. The corpus exhibits the natural distribution of positive tweets, with only 442 tweets (0.2%) mentioning a medication. This task was an opportunity for participants to evaluate methods that are robust to class imbalance beyond the simple lexical match. A total of 65 teams registered, and 16 teams submitted a system run. This study summarizes the corpus created by the organizers and the approaches taken by the participating teams for this challenge. The corpus is freely available at https://***/tasks/biocreative-vii/track-3/. The methods and the results of the competing systems are analyzed with a focus on the approaches taken for learning from class-imbalanced data.
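The degree of imbalance reported in this abstract can be checked with a short calculation. The sketch below uses only the two counts given above; the "label everything negative" baseline and the choice of F1 as the illustrative metric are assumptions, not a system or an official metric taken from the shared task.

```python
total_tweets = 182_049      # corpus size reported in the abstract
positive_tweets = 442       # tweets mentioning a medication

positive_rate = positive_tweets / total_tweets


def precision_recall_f1(tp, fp, fn):
    """Return precision, recall and F1, treating empty denominators as 0."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return precision, recall, f1


# Hypothetical baseline: label every tweet as "no medication".
tp, fp, fn = 0, 0, positive_tweets
tn = total_tweets - positive_tweets
accuracy = (tp + tn) / total_tweets
precision, recall, f1 = precision_recall_f1(tp, fp, fn)

print(f"positive rate: {positive_rate:.2%}")
print(f"all-negative accuracy: {accuracy:.2%}, F1: {f1:.2f}")
```

Such a baseline is right on almost every tweet yet finds no medication mention at all, which is why accuracy alone says little on a corpus this skewed.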


Ontology-based automatic reclassification of tissues and organs in histological images


12th Alberto Mendelzon International Workshop on Foundations of Data Management, AMW 2018

Authors: Mazo, Claudia; Trujillo, Maria; Alegre, Enrique; Salazar, Liliana

Affiliations: Universidad Del Valle, Computer and Systems Engineering School, Cali, Colombia; University College Dublin, CeADAR: Centre for Applied Data Analytics Research, School of Computer Science, I-Dublin, Ireland; OncoMark Limited, I-Dublin, Ireland; Universidad de León, Industrial and Informatics Engineering School, León, Spain; Universidad Del Valle, Morphology Department, Cali, Colombia

Heterogeneous data sources produce different types of data that cannot be treated in the same way. In this paper, two sources of data are considered: image and human knowledge. The former is represented using visual descriptors and the latter is represented using an ontology. These data sources are integrated for the automatic classification of tissues and organs of the human cardiovascular system. Firstly, visual descriptors (texture descriptors) are used in the automatic classification using a cascade Support Vector Machine. Secondly, the obtained classification results are refined using a histological ontology of the human cardiovascular system, which either confirms or reclassifies them. The final classification results are more precise than those obtained using only image data, in all cases. Keywords: Automatic Classification; Histological Ontology; Histology Images; Image Processing. © 2018 CEUR-WS. All rights reserved.

Keywords: Ontology

