检索结果-内蒙古大学图书馆

arXiv 2025年

作者： Zhang, Yuwei Jin, Zhi Xing, Ying Li, Ge Liu, Fang Zhu, Jiaxin Dou, Wensheng Wei, Jun Affiliated with Nanjing Institute of Software Technology University of Chinese Academy of Sciences Nanjing China Key Laboratory of System Software Chinese Academy of Sciences Institute of Software Chinese Academy of Sciences University of Chinese Academy of Sciences Beijing China Key Laboratory of High Confidence Software Technologies Peking University Ministry of Education School of Computer Science Peking University Beijing China School of Computer Science Wuhan University Wuhan China School of Intelligent Engineering and Automation Beijing University of Posts and Telecommunications Beijing China State Key Laboratory of Complex & Critical Software Environment School of Computer Science and Engineering Beihang University Beijing China

Bug fixing holds significant importance in software development and maintenance. Recent research has made substantial strides in exploring the potential of large language models (LLMs) for automatically resolving software bugs. However, a noticeable gap in existing approaches lies in the oversight of collaborative facets intrinsic to bug resolution, treating the process as a single-stage endeavor. Moreover, most approaches solely take the buggy code snippet as input for LLMs during the patch generation stage. To mitigate the aforementioned limitations, we introduce a novel stage-wise framework named PATCH. Specifically, we first augment the buggy code snippet with corresponding dependence context and intent information to better guide LLMs in generating the correct candidate patches. Additionally, by taking inspiration from bug management practices, we decompose the bug-fixing task into four distinct stages: bug reporting, bug diagnosis, patch generation, and patch verification. These stages are performed interactively by LLMs, aiming to simulate the collaborative behavior of programmers during the resolution of software bugs. By harnessing these collective contributions, PATCH effectively enhances the bug-fixing capability of LLMs. We implement PATCH by employing the powerful dialogue-based LLM ChatGPT. Our evaluation on the widely used bug-fixing benchmark BFP demonstrates that PATCH has achieved better performance than state-of-the-art LLMs. Copyright © 2025, The Authors. All rights reserved.

关键词： computer software maintenance

来源：评论

学校读者我要写书评

暂无评论

NetScribed: A Deep Learning Approach for Machine-Based Melody Transcription of Audio Files 7th

NetScribed: A Deep Learning Approach for Machine-Based Melo...

引用

7th International Conference on Applied Informatics, ICAI 2024

作者： Volschenk, Francois van Der Haar, Dustin Academy of Computer Science and Software Engineering University of Johannesburg Cnr University Road and Kingsway Avenue Auckland Park Gauteng Johannesburg2092 South Africa

ISBN: (纸本)9783031751431

Automatic Music Transcription (AMT) entails creating an algorithm that converts an acoustic signal from an audio file into the corresponding sheet music representation. This paper uses deep learning methods and models AMT as a translation problem, comparing the effectiveness of an instance-based translation approach using an MLP to a sequence-based approach using an RNN. The models were trained on the EsAc dataset and evaluated using MUSTER metrics. The results show that the instance-based model better classifies the correct pitch. However, the sequence-based approach outperforms the instance-based approach on all other aspects of the MUSTER metrics, producing a 98% accuracy. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Deep learning

来源：评论

学校读者我要写书评

暂无评论

On EDA-Driven Learning for SAT Solving 23

On EDA-Driven Learning for SAT Solving

引用

Proceedings of the 60th Annual ACM/IEEE Design Automation Conference

作者： Min Li Zhengyuan Shi Qiuxia Lai Sadaf Khan Shaowei Cai Qiang Xu Department of Computer Science and Engineering The Chinese University of Hong Kong Hong Kong S.A.R Communication University of China China State Key Laboratory of Computer Science Institute of Software Chinese Academy of Sciences China

ISBN: (纸本)9798350323481

We present DeepSAT, a novel end-to-end learning framework for the Boolean satisfiability (SAT) problem. Unlike existing solutions trained on random SAT instances with relatively weak supervision, we propose applying the knowledge of the well-developed electronic design automation (EDA) field for SAT solving. Specifically, we first resort to logic synthesis algorithms to pre-process SAT instances into optimized and-inverter graphs (AIGs). By doing so, the distribution diversity among various SAT instances can be dramatically reduced, which facilitates improving the generalization capability of the learned model. Next, we regard the distribution of SAT solutions being a product of conditional Bernoulli distributions. Based on this observation, we approximate the SAT solving procedure with a conditional generative model, leveraging a novel directed acyclic graph neural network (DAGNN) with two polarity prototypes for conditional SAT modeling. To effectively train the generative model, with the help of logic simulation tools, we obtain the probabilities of nodes in the AIG being logic '1' as rich supervision. We conduct comprehensive experiments on various SAT problems. Our results show that, DeepSAT achieves significant accuracy improvements over state-of-the-art learning-based SAT solutions, especially when generalized to SAT instances that are relatively large or with diverse distributions.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Encryption Traffic Classification Based on Mining Traffic Context and Transport Relationship

Encryption Traffic Classification Based on Mining Traffic Co...

引用

IEEE Conference on Wireless Communications and Networking

作者： Weilin Gai Runqing Zhang Huiyuan Zhang Yu Guo Jun Yin Peng Zhang TCA Institute of Software Chinese Academy of Sciences Beijing China School of Computer Science and Technology University of Chinese Academy of Sciences Beijing China National Computer Network Emergency Response TechnicalTeam/Coordination Center of China Beijing China School of Cyber Science and Engineering Nanjing University of Science and Technology Nanjing China

ISBN: (数字)9798350368369

ISBN: (纸本)9798350368376

This paper proposes a novel ETC-MTCTR, which is designed to enable more accurate, versatile and efficient traffic classification in the context of multi-scenario, low-resource encrypted traffic. Through three modules of Datagram Token conversion, pretraining and fine-tuning, the method uses large-scale unlabeled encrypted traffic for pretraining, mining and learning the traffic context and transmission relationship of encrypted traffic classification tasks, so that a small number of labeled data samples can be effectively used in the fine-tuning stage. Significantly improve the performance of the model on specific downstream classification tasks, enhance the accuracy, adaptability and robustness of the model in diverse environments, limited resources and new encryption security protocols, and realize efficient encryption traffic classification in multi-scenario and low-resource background. The results show that ETC-MTCTR achieves the best performance on three tasks: encryption malware classification, VPN encrypted traffic classification, and TLS 1.3 encryption application classification. Its F1 score is improved by 0.22% in the classification task of encrypted malware, 1.4% in the classification task of VPN encrypted traffic App, 4.56% in the classification task of VPN encrypted traffic Service, and 9.89% in the classification task of TLS 1.3 encrypted application, which is significantly better than other comparison methods.

关键词： Adaptation models Accuracy Protocols Malware Robustness Encryption Virtual private networks Data mining

来源：评论

学校读者我要写书评

暂无评论

High-Fidelity Image Inpainting with Multimodal Guided GAN Inversion

arXiv

引用

arXiv 2025年

作者： Zhang, Libo Yu, Yongsheng Yao, Jiali Fan, Heng Institute of Software Chinese Academy of Sciences Beijing China Hangzhou Institute for Advanced Study University of Chinese Academy of Sciences Hangzhou China Department of Computer Science and Engineering University of North Texas DentonTX United States

Generative Adversarial Network (GAN) inversion have demonstrated excellent performance in image inpainting that aims to restore lost or damaged image texture using its unmasked content. Previous GAN inversion-based methods usually utilize well-trained GAN models as effective priors to generate the realistic regions for missing holes. Despite excellence, they ignore a hard constraint that the unmasked regions in the input and the output should be the same, resulting in a gap between GAN inversion and image inpainting and thus degrading the performance. Besides, existing GAN inversion approaches often consider a single modality of the input image, neglecting other auxiliary cues in images for improvements. Addressing these problems, we propose a novel GAN inversion approach, dubbed MMInvertFill, for image inpainting. MMInvertFill contains primarily a multimodal guided encoder with a pre-modulation and a GAN generator with F&W+ latent space. Specifically, the multimodal encoder aims to enhance the multi-scale structures with additional semantic segmentation edge texture modalities through a gated mask-aware attention module. Afterwards, a pre-modulation is presented to encode these structures into style vectors. To mitigate issues of conspicuous color discrepancy and semantic inconsistency, we introduce the F&W+ latent space to bridge the gap between GAN inversion and image inpainting. Furthermore, in order to reconstruct faithful and photorealistic images, we devise a simple yet effective Soft-update Mean Latent module to capture more diversified in-domain patterns for generating high-fidelity textures for massive corruptions. In our extensive experiments on six challenging datasets, including CelebA-HQ [25], Places2 [75], OST [51], CityScapes [8], MetFaces [22] and Scenery [62], we show that our MMInvertFill qualitatively and quantitatively outperforms other state-of-the-arts and it supports the completion of out-of-domain images effectively. Our project webpage incl

关键词： Semantic Segmentation

来源：评论

学校读者我要写书评

暂无评论

PROTOTYPE CONTRASTIVE CONSISTENCY LEARNING FOR SEMI-SUPERVISED MEDICAL IMAGE SEGMENTATION

arXiv

引用

arXiv 2025年

作者： He, Shihuan Lai, Zhihui Wang, Ruxin Kong, Heng Shenzhen Institutes of Advanced Technology Chinese Academy of Sciences China Computer Vision Institute College of Computer Science and Software Engineering Shenzhen University China Department of Breast and Thyroid Surgery Baoan Central Hospital of Shenzhen China

Medical image segmentation is a crucial task in medical image analysis, but it can be very challenging especially when there are less labeled data but with large unlabeled data. Contrastive learning has proven to be effective for medical image segmentation in semi-supervised learning by constructing contrastive samples from partial pixels. However, although previous contrastive learning methods can mine semantic information from partial pixels within images, they ignore the whole context information of unlabeled images, which is very important to precise segmentation. In order to solve this problem, we propose a novel prototype contrastive learning method called Prototype Contrastive Consistency Segmentation (PCCS) for semi-supervised medical image segmentation. The core idea is to enforce the prototypes of the same semantic class to be closer and push the prototypes in different semantic classes far away from each other. Specifically, we construct a signed distance map and an uncertainty map from unlabeled images. The signed distance map is used to construct prototypes for contrastive learning, and then we estimate the prototype uncertainty from the uncertainty map as tradeoff among prototypes. In order to obtain better prototypes, based on the student-teacher architecture, a new mechanism named prototype updating prototype is designed to assist in updating the prototypes for contrastive learning. In addition, we propose an uncertainty-consistency loss to mine more reliable information from unlabeled data. Extensive experiments on medical image segmentation demonstrate that PCCS achieves better segmentation performance than the state-of-the-art methods. The code is available at https://***/comphsh/PCCS. © 2025, CC BY.

关键词： Contrastive Learning

来源：评论

学校读者我要写书评

暂无评论

Energy-Efficient Tactile-Driven Rule Configuration and Anomaly Detection in Industrial IoT Systems

引用

IEEE Internet of Things Journal 2025年

作者： Tan, Lizhuang Singh, Amritpal Zhang, Wei Pei, Hongjuan Zhang, Peiying Chahal, Prabhjot Kaur Singh, Maninderpal Ji'nan250014 China Pohang University of Science and Technology Department of Computer Science and Engineering Pohang37673 Korea Republic of Northumbria University Department of Computer and Information Sciences United Kingdom University of Chinese Academy of Sciences School of Engineering Science Beijing100049 China China University of Petroleum Qingdao Institute of Software College of Computer Science and Technology East China Qingdao266580 China Chandigarh Group of Colleges CEC Punjab Mohali India Southern Methodist University Electrical and Computer Engineering Department Dallas United States

The Industrial Internet of Things (IIoT) enables communication among automation systems, machinery, and sensors in an industrial setting. To optimize critical industrial operations, a substantial volume of data concerning diverse in-factory activities and automation services is generated by IoT devices and sensors. This data is subsequently transferred to distant processing systems for analysis and decision-making. Nevertheless, a substantial latency in data transmission or any abnormality in the generated data may result in delayed or erroneous decisions, consequently impacting the efficacy of essential industrial systems. To address these challenges, we established an intelligent network architecture utilizing software-defined networking that achieves tactile latencies efficiently while handling industrial data traffic in an energy-efficient manner. To address the initial challenge, the suggested architecture utilizes the Self-Organized Maps approach to distinguish between industrial traffic requiring tactile latencies and non-tactile traffic. We utilize a binary tree-based flow table mapping method to enhance flow table matching and decrease lookup times. To address the second challenge, we employ the Support Vector Machine technique to identify anomalies in real-time industrial data traffic. The Hadoop system and Mininet emulator are utilized to evaluate the proposed architecture using the UNSW dataset. The results demonstrate the effectiveness of the suggested solution in providing energy-efficient tactile assurances and identifying anomalies in traffic. © 2014 IEEE.

关键词： Table lookup

来源：评论

学校读者我要写书评

暂无评论

DDA: Distillation-Driven Acceleration of the Reverse Diffusion Process for Stochastic Multi-Ship Trajectory Prediction

DDA: Distillation-Driven Acceleration of the Reverse Diffusi...

引用

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

作者： Kun Ma Qilong Han Jingzheng Yao Changmao Wu Yuntao Zhang College of Computer Science And Technology Harbin Engineering University Harbin China Yantai Research Institute Harbin Engineering University Yantai China Institute of Software Chinese Academy of Sciences Beijing China College of Shipbuilding Engineering Harbin Engineering University Harbin China

ISBN: (数字)9798350368741

ISBN: (纸本)9798350368758

Modeling stochastic multi-ship trajectories is vital for maritime safety and interaction efficiency. Recent researches show that diffusion models excel in trajectory prediction, surpassing GANs and VAEs in generation quality, diversity and stability. However, their slow sampling speed remains a major limitation, as producing high-quality trajectories typically requires hundreds of denoising steps. We introduce DDA, a novel method that accelerates multi-ship trajectory generation by distilling the reverse diffusion process, progressively reducing sampling steps by half while minimizing quality loss. We use CVAE-based encoder to map multimodal inputs into state embeddings in the latent space, and use distillation diffusion in the latent space to more quickly and better represent multi-ship trajectories. The diffusion model uses Transformer-based core, and we incorporate SO(2) invariance and equivariance to enhance model representation. Validation on real-world AIS datasets shows that the student model retains high-quality trajectory generation while sampling speed is approximately 30 times faster.

关键词： Diffusion processes Stochastic processes Transformer cores Signal processing Diffusion models Transformers Stability analysis Trajectory Artificial intelligence Speech processing

来源：评论

学校读者我要写书评

暂无评论

Instantaneous Trajectory Prediction via Latent Bidirectional Cooperative Diffusion

Instantaneous Trajectory Prediction via Latent Bidirectional...

引用

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

作者： Kun Ma Qilong Han Jingzheng Yao Changmao Wu Chunrui Na College of Computer Science And Technology Harbin Engineering University Harbin China Yantai Research Institute Harbin Engineering University Yantai China Chinese Academy of Sciences Institute of Software Beijing China College of Shipbuilding Engineering Harbin Engineering University Harbin China

ISBN: (数字)9798350368741

ISBN: (纸本)9798350368758

In real-world scenarios, extreme cases where pedestrians suddenly emerge from blind spots or occlusions, leaving only a minimal amount of observable trajectory points, occur frequently. This presents a significant challenge for autonomous driving and robotic navigation, where pedestrian safety and timely response are critical considerations. To address this challenge, we propose a framework for instantaneous trajectory prediction using Latent bidirectional Cooperative Diffusion (LCD). It designs a complementary mechanism that constructs a coupled bidirectional cooperative diffusion model. LCD simultaneously and progressively generates unobserved past trajectories and future trajectories, feeding each other as conditions into the cross-attention module for mutual guidance. This framework employs CVAE as its encoder to map the observed multi-model trajectories into a high-dimensional latent space to enhance complex representations. Experiments conducted on the ETH/UCY and SDD datasets demonstrate the superiority of our framework.

关键词： Pedestrians Navigation Signal processing Diffusion models Liquid crystal displays Trajectory Safety Speech processing Robots Autonomous vehicles

来源：评论

学校读者我要写书评

暂无评论

Multi-Modal Multi-Kernel Graph Learning for Autism Prediction and Biomarker Discovery

IEEE Transactions on Computational Biology and Bioinformatic...

引用

IEEE Transactions on Computational Biology and Bioinformatics 2025年第2期22卷 842-854页

作者： Jin Liu Junbin Mao Hanhe Lin Hulin Kuang Shirui Pan Xusheng Wu Shan Xie Fei Liu Yi Pan Hunan Provincial Key Lab on Bioinformatics School of Computer Science and Engineering Central South University Changsha China Xinjiang Engineering Research Center of Big Data and Intelligent Software School of software Xinjiang University Wulumuqi China Hunan Province Key Lab on Bioinformatics School of Computer Science and Engineering Central South University Changsha China School of Science and Engineering University of Dundee Dundee U.K. School of Information and Communication Technology Griffith University Gold Coast QLD Australia Shenzhen Health Development Research and Data Management Center Shenzhen China Faculty of Computer Science and Control Engineering Shenzhen University of Advanced Technology Shenzhen China Shenzhen Institute of Advanced Technology Chinese Academy of Sciences Shenzhen China

Graph learning-based multi-modal integration and classification is one of the most challenging tasks for disease prediction. To effectively offset the negative impact among modalities in the process of multi-modal integration and heterogeneous information extractions from graphs, we propose a novel method called Multi-modal Multi-Kernel Graph Learning (MMKGL). To solve the problem of negative impact among modalities, we propose a multi-modal graph embedding module to construct a multi-modal graph. Different from conventional methods that manually construct static graphs for all modalities, each modality generates a separate graph by adaptive learning, where a function graph and a supervision graph are introduced for optimization during the multi-graph fusion embedding process. We then propose a multi-kernel graph learning module to extract heterogeneous information from the multi-modal graph. The information in the multi-modal graph at different levels is aggregated by convolutional kernels with different receptive field sizes, followed by generating a cross-kernel discovery tensor for disease prediction. Our method is evaluated on the benchmark Autism Brain Imaging Data Exchange (ABIDE) dataset and outperforms the state-of-the-art methods. In addition, discriminative brain regions associated with autism are identified by our model, providing guidance for the study of autism pathology.

关键词： Diseases Autism Random access memory Data mining Kernel Electronic mail Training Neuroimaging Graph convolutional networks Bioinformatics

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：