检索结果-内蒙古大学图书馆

arXiv 2024年

作者： Shi, Yuhui Sheng, Qiang Cao, Juan Mi, Hao Hu, Beizhe Wang, Danding Key Lab of Intelligent Information Processing Chinese Academy of Sciences Institute of Computing Technology Chinese Academy of Sciences China University of Chinese Academy of Sciences China Xi’an Jiaotong University China

With the rapidly increasing application of large language models (LLMs), their abuse has caused many undesirable societal problems such as fake news, academic dishonesty, and information pollution. This makes AI-generated text (AIGT) detection of great importance. Among existing methods, white-box methods are generally superior to black-box methods in terms of performance and generalizability, but they require access to LLMs’ internal states and are not applicable to black-box settings. In this paper, we propose to estimate word generation probabilities as pseudo white-box features via multiple re-sampling to help improve AIGT detection under the black-box setting. Specifically, we design POGER, a proxy-guided efficient re-sampling method, which selects a small subset of representative words (e.g., 10 words) for performing multiple re-sampling in black-box AIGT detection. Experiments on datasets containing texts from humans and seven LLMs show that POGER outperforms all baselines in macro F1 under black-box, partial white-box, and out-of-distribution settings and maintains lower re-sampling costs than its existing counterparts. © 2024, CC BY-NC-SA.

关键词： Computational linguistics

来源：评论

学校读者我要写书评

暂无评论

FediLive: A Framework for Collecting and Preprocessing Snapshots of Decentralized Online Social Networks 25

FediLive: A Framework for Collecting and Preprocessing Snaps...

引用

Companion Proceedings of the ACM on Web Conference 2025

作者： Shaojie Min Shaobin Wang Yaxiao Luo Min Gao Qingyuan Gong Yu Xiao Yang Chen Shanghai Key Lab of Intelligent Information Processing School of Computer Science Fudan University Shanghai China Research Institute of Intelligent Complex Systems Fudan University Shanghai China Department of Information and Communications Engineering Aalto University Espoo Finland

ISBN: (纸本)9798400713316

Decentralized online social networks such as Mastodon have emerged quickly during the past years, and offer unique opportunities to investigate user behavior, moderation strategies, and community evolution. However, their decentralized nature imposes challenges for data collection and preprocessing, particularly for obtaining a real-time snapshot in a timely manner. This paper introduces FediLive, a framework designed to rapidly collect and preprocess the live feeds from Mastodon, generate a comprehensive snapshot in real-time, including user-generated contents, interaction networks, and users' demographic attributes. Such a snapshot could further be leveraged for data analysis from different angles, leading to a deeper understanding of user activities on Mastodon. Using FediLive, we collected a 13-day snapshot of Mastodon, covering the publicly-visible activities of all Mastodon users. Our study demonstrates the usefulness of FediLive, and reveals its potential in facilitating data-driven analysis for decentralized online social networks.

关键词： decentralized online social networks

来源：评论

学校读者我要写书评

暂无评论

EDA: Enhanced Domain-Adversarial Training for Anatomical Landmark Detection 22

EDA: Enhanced Domain-Adversarial Training for Anatomical Lan...

引用

22nd IEEE International Symposium on Biomedical Imaging, ISBI 2025

作者： Yang, Fan Zhou, S. Kevin School of Biomedical Engineering Division of Life Sciences and Medicine University of Science and Technology of China Anhui Hefei 230026 China Center for Medical Imaging Robotics Analytic Computing & Learning (MIRACLE) Suzhou Institute for Advanced Research University of Science and Technology of China Jiangsu Suzhou 215123 China Key Laboratory of Precision and Intelligent Chemistry Ustc Anhui Hefei 230026 China Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS) Institute of Computing Technology Cas Beijing 100190 China

ISBN: (纸本)9798331520526

Manually annotating anatomical landmarks in medical images requires experienced clinicians and is a labor-intensive process. However, recent AI-assisted methods for landmark detection often rely on the training and test data originating from the same domain. This work introduces a novel unsupervised domain adaptation (UDA) framework aimed at anatomical landmark detection from a medical image, designed to bridge the gap between a source domain with labels and an unlabeled target domain. Specifically, we have developed a new Domain-Adversarial Network that incorporates skip connections to transfer and fuse high-resolution feature maps. Additionally, we proposed Dynamic Gaussian Learning, which allows the model to escape from local error regions. We carry out experiments on landmark detection for both head and chest, and the results demonstrate that our method achieves state-of-the-art performances in each experiment. © 2025 IEEE.

关键词： Anatomical Landmark Detection Unsupervised Domain Adaptation

来源：评论

学校读者我要写书评

暂无评论

Harnessing Hierarchical label Distribution Variations in Test Agnostic Long-tail Recognition 41

Harnessing Hierarchical Label Distribution Variations in Tes...

引用

41st International Conference on Machine Learning, ICML 2024

作者： Yang, Zhiyong Xu, Qianqian Wang, Zitai Li, Sicong Han, Boyu Bao, Shilong Cao, Xiaochun Huang, Qingming School of Computer Science and Tech. University of Chinese Academy of Sciences China Key Lab. of Intelligent Information Processing Institute of Computing Tech. CAS China Institute of Information Engineering CAS China School of Cyber Security University of Chinese Academy of Sciences China School of Cyber Science and Tech. Shenzhen Campus of Sun Yat-sen University China BDKM University of Chinese Academy of Sciences China

This paper explores test-agnostic long-tail recognition, a challenging long-tail task where the test label distributions are unknown and arbitrarily imbalanced. We argue that the variation in these distributions can be broken down hierarchically into global and local levels. The global ones reflect a broad range of diversity, while the local ones typically arise from milder changes, often focused on a particular neighbor. Traditional methods predominantly use a Mixture-of-Expert (MoE) approach, targeting a few fixed test label distributions that exhibit substantial global variations. However, the local variations are left unconsidered. To address this issue, we propose a new MoE strategy, DirMixE, which assigns experts to different Dirichlet meta-distributions of the label distribution, each targeting a specific aspect of local variations. Additionally, the diversity among these Dirichlet meta-distributions inherently captures global variations. This dual-level approach also leads to a more stable objective function, allowing us to sample different test distributions better to quantify the mean and variance of performance outcomes. Theoretically, we show that our proposed objective benefits from enhanced generalization by virtue of the variance-based regularization. Comprehensive experiments across multiple benchmarks confirm the effectiveness of DirMixE. The code is available at https://***/scongl/DirMixE. Copyright 2024 by the author(s)

关键词： Benchmarking

来源：评论

学校读者我要写书评

暂无评论

X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation

arXiv

引用

arXiv 2024年

作者： Guo, Pinxue Li, Wanyun Huang, Hao Hong, Lingyi Zhou, Xinyu Chen, Zhaoyu Li, Jinglun Jiang, Kaixun Zhang, Wei Zhang, Wenqiang Shanghai Engineering Research Center of AI & Robotics Academy for Engineering & Technology Fudan University China Shanghai Key Lab of Intelligent Information Processing School of Computer Science Fudan University China Engineering Research Center of AI & Robotics Ministry of Education Academy for Engineering & Technology Shanghai Key Lab of Intelligent Information Processing School of Computer Science Fudan University China

Multi-modal Video Object Segmentation (VOS), including RGB-Thermal, RGB-Depth, and RGB-Event, has garnered attention due to its capability to address challenging scenarios where traditional VOS methods struggle, such as extreme illumination, rapid motion, and background distraction. Existing approaches often involve designing specific additional branches and performing full-parameter fine-tuning for fusion in each task. However, this paradigm not only duplicates research efforts and hardware costs but also risks model collapse with the limited multi-modal annotated data. In this paper, we propose a universal framework named X-Prompt for all multi-modal video object segmentation tasks, designated as RGB+X. The X-Prompt framework first pretrains a video object segmentation foundation model using RGB data, and then utilize the additional modality of the prompt to adapt it to downstream multi-modal tasks with limited data. Within the X-Prompt framework, we introduce the Multi-modal Visual Prompter (MVP), which allows prompting foundation model with the various modalities to segment objects precisely. We further propose the Multi-modal Adaptation Experts (MAEs) to adapt the foundation model with pluggable modality-specific knowledge without compromising the generalization capacity. To evaluate the effectiveness of the X-Prompt framework, we conduct extensive experiments on 3 tasks across 4 benchmarks. The proposed universal X-Prompt framework consistently outperforms the full fine-tuning paradigm and achieves state-of-the-art performance. Code: https://***/PinxueGuo/***. Copyright © 2024, The Authors. All rights reserved.

关键词： Image segmentation

来源：评论

学校读者我要写书评

暂无评论

Fact-Preserved Personalized News Headline Generation

Fact-Preserved Personalized News Headline Generation

引用

IEEE International Conference on Data Mining (ICDM)

作者： Zhao Yang Junhong Lian Xiang Ao Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS) Institute of Computing Technology CAS Beijing China University of Chinese Academy of Sciences Beijing China Institute of Intelligent Computing Technology Suzhou CAS

Personalized news headline generation, aiming at generating user-specific headlines based on readers’ preferences, burgeons a recent flourishing research direction. Existing studies generally inject a user interest embedding into an encoder-decoder headline generator to make the output personalized, while the factual consistency of headlines is inadequate to be verified. In this paper, we propose a framework Fact-Preserved Personalized News Headline Generation (short for FPG), to prompt a tradeoff between personalization and consistency. In FPG, the similarity between the candidate news to be exposed and the historical clicked news is used to give different levels of attention to key facts in the candidate news, and the similarity scores help to learn a fact-aware global user embedding. Besides, an additional training procedure based on contrastive learning is devised to further enhance the factual consistency of generated headlines. Extensive experiments conducted on a real-world benchmark PENS 1 validate the superiority of FPG, especially on the tradeoff between personalization and factual consistency. 1 https://***/***

关键词：

来源：评论

学校读者我要写书评

暂无评论

A TRACK ASSOCIATION ALGORITHM BASED ON MOTION DYNAMICS FEATURES IN ENTOMOLOGICAL RADAR

A TRACK ASSOCIATION ALGORITHM BASED ON MOTION DYNAMICS FEATU...

引用

IET International Radar Conference 2023, IRC 2023

作者： Li, Biao Liu, Hanzhe Zhang, Tianran Cai, Jiong Wang, Rui The Radar Research Lab School of Information and Electronics Beijing Institute of Technology Beijing100081 China The Advanced Technology Research Institute Beijing Institute of Technology Jinan250300 China Beijing Key Laboratory of Embedded Real-time Information Processing Technology Beijing100081 China Beijing Institute of Electronic System Engineering Beijing100854 China

ISBN: (纸本)9781839539954

When tracking densely distributed targets such as insects, the traditional trajectory association algorithm exhibits poor correlation performance, leading to a decline in multi-target tracking efficiency. In this paper, based on the behavioral characteristics of insects gathering and co-orienting during aerial flight, a motion dynamic feature-assisted multi-target tracking method is proposed. Firstly, a three-dimensional biological distribution state matrix is constructed through spatial gridization. Then, a three-dimensional circular convolution is employed to estimate the target's motion dynamic feature, assisting in multi-target trajectory initialization. Finally, the improvement effect of the method on trajectory tracking accuracy is validated through simulations and migration insect data. © The Institution of Engineering & Technology 2023.

关键词： Radar tracking

来源：评论

学校读者我要写书评

暂无评论

Magnetic Topological Dirac Semimetal Transition Driven by SOC in EuMg_(2)Bi_(2)

引用

Chinese Physics Letters 2024年第1期41卷 63-67页

作者：王佳萌钱浩吉姜琦乔山叶茂 State Key Laboratory of Functional Materials for Informatics Shanghai Institute of Microsystem and Information TechnologyChinese Academy of SciencesShanghai 200050China Center of Materials Science and Optoelectronics Engineering University of Chinese Academy of SciencesBeijing 100049China Research Center for Intelligent Chips and Devices Zhejiang LabHangzhou 311121China Center for Transformative Science ShanghaiTech UniversityShanghai 201210China School of Physical Science and Technology ShanghaiTech UniversityShanghai 201210China Shanghai Synchrotron Radiation Facility Shanghai Advanced Research InstituteChinese Academy of SciencesShanghai 201204China

Magnetic topological semimetals have been at the forefront of condensed matter physics due to their ability to exhibit exotic transport *** the interplay between magnetic and topological orders in systems with broken time-reversal symmetry is crucial for realizing non-trivial quantum *** delve into the electronic structure of the rare-earth-based antiferromagnetic Dirac semimetal EuMg_(2)Bi_(2) using first-principles calculations and angle-resolved photoemission *** calculations reveal that the spin-orbit coupling(SOC)in EuMg_(2)Bi_(2) prompts an insulator to topological semimetal transition,with the Dirac bands protected by crystal *** linearly dispersive states near the Fermi level,primarily originating from Bi 6p orbitals,are observed on both the(001)and(100)surfaces,confirming that EuMg_(2)Bi_(2) is a three-dimensional topological Dirac *** research offers pivotal insights into the interplay between magnetism,SOC and topological phase transitions in spintronics applications.

关键词： spectroscopy topological Dirac

来源：评论

学校读者我要写书评

暂无评论

An Improved Method for Rockfall Detection and Tracking Based on Video Stream

An Improved Method for Rockfall Detection and Tracking Based...

引用

IET International Radar Conference 2023, IRC 2023

作者： Wang, Longyue Wang, Songge Xie, Xin Deng, Yunkai Tian, Weiming Radar Research Lab School of Information and Electronics Beijing Institute of Technology Beijing China Chongqing Innovation Center Beijing Institute of Technology Chongqing China Beijing Key Laboratory of Embedded Real-time Information Processing Technology Beijing Institute of Technology Beijing China Advanced Technology Research Institute Beijing Institute of Technology Jinan China

ISBN: (纸本)9781839539954

Rockfall events occur frequently in mountainous areas. To address the problems of missed detection, false detection, and trajectory interruption when using the deep learning-based online multiple object tracking methods to detect rockfalls, this paper proposes a rockfall detection and tracking method based on video streams. In the detection stage, three-frame difference method is utilized to obtain the moving targets from the video streams, and they are combined with the detection results of the rock detector obtained by the offline-trained YOLOX model. In the tracking stage, data association is firstly performed based on the rockfall detection results. For the existing trajectories that are not matched at the current moment, re-matching is performed by combining the moving object detection results to achieve accurate tracking of rockfalls. Simulations and field experiments prove that the detection method proposed in this paper can effectively separate the rockfalls in the video, and the detected rockfalls have high precision. Besides, it significantly improves the accuracy of rockfall tracking, effectively suppressing phenomena such as trajectory interruption during tracking. © The Institution of Engineering & Technology 2023.

关键词： Rock bursts

来源：评论

学校读者我要写书评

暂无评论

SES: Bridging the Gap Between Explainability and Prediction of Graph Neural Networks

SES: Bridging the Gap Between Explainability and Prediction ...

引用

International Conference on Data Engineering

作者： Zhenhua Huang Kunhao Li Shaojie Wang Zhaohong Jia Wentao Zhu Sharad Mehrotra Anhui University Hefei China Key Lab of Intelligent Computing and Signal Processing of Ministry of Education Hefei China Amazon Research Seattle USA University of California Irvine Irvine USA

ISBN: (数字)9798350317152

ISBN: (纸本)9798350317169

Despite the Graph Neural Networks' (GNNs) pro-ficiency in analyzing graph data, achieving high-accuracy and interpretable predictions remains challenging. Existing GNN interpreters typically provide post-hoc explanations disjointed from GNNs' predictions, resulting in misrepresentations. Self-explainable GNNs offer built-in explanations during the training process. However, they cannot exploit the explanatory outcomes to augment prediction performance, and they fail to provide high-quality explanations of node features and require additional processes to generate explainable subgraphs, which is costly. To address the aforementioned limitations, we propose a self-explained and self-supervised graph neural network (SES) to bridge the gap between explainability and prediction. SES comprises two processes: explainable training and enhanced predictive learning. During explainable training, SES employs a global mask generator co-trained with a graph encoder and directly produces crucial structure and feature masks, reducing time consumption and providing node feature and subgraph explanations. In the enhanced predictive learning phase, mask-based positive-negative pairs are constructed utilizing the ex-planations to compute a triplet loss and enhance the node representations by contrastive learning. Extensive experiments demonstrate the superiority of SES on multiple datasets and tasks. SES outperforms baselines on real-world node classification datasets by notable margins of up to 2.59% and achieves state-of-the-art (SOTA) performance in explanation tasks on synthetic datasets with improvements of up to 3.0%. Moreover, SES delivers more coherent explanations on real-world datasets, has a fourfold increase in Fidelity+ score for explanation quality, and demonstrates faster training and expla-nation generating times. To our knowledge, SES is a pioneering GNN to achieve SOTA performance on both explanation and prediction tasks.

关键词： Training Bridges Accuracy Reliability engineering Data engineering Graph neural networks Generators

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：