Search Results - Inner Mongolia University Library

2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025

Authors: Zhang, Haodi; Wang, Yichi; Jian, Yifan; Jiang, Jiahui; Bai, Zhaohai; Ma, Lin. Affiliations: College of Computer Science and Software Engineering, Shenzhen University, China; Center for Agricultural Resources Research, Institute of Genetic and Developmental Biology, Chinese Academy of Sciences, China; State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, China

ISBN: (print) 9798350368741

Climate downscaling is crucial for detailed small-scale analysis and for acquiring climate data in regions without weather stations. Operator learning has proven potential for this task. However, several challenges remain in operator learning, such as multimodal fusion, spatiotemporal fusion, and adaptation to the input state and query. To address these challenges, we propose a Spatiotemporal Multimodal Fusion Operator with a State-Query Coupled Kernel (SMCK). This framework includes a latent space fusion encoder that encodes climate variables using position-wise multihead attention for multimodal fusion and integrates historical information to generate robust and precise representations. Additionally, we introduce a state-query coupled kernel that combines radial basis functions and discrete Fourier encoding to enhance query location representation, while also adapting to the state to obtain the coupled kernel. Extensive experiments demonstrate that our method achieves state-of-the-art performance and provides strong support for climate downscaling and the planning of climate-related strategies. © 2025 IEEE.

Keywords: Climate downscaling; Deep learning; Multimodal fusion; Operator learning; Time series forecasting
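The abstract above mentions combining radial basis functions with a discrete Fourier encoding to represent query locations. The following is only a rough sketch of that general idea, not the SMCK kernel itself: the function names, the Gaussian RBF form, the power-of-two frequencies, and the assumption that coordinates are normalized to [0, 1] are all illustrative choices, and the state-dependent coupling described in the abstract is not modeled here.

```python
import math

def rbf_features(x, centers, gamma=1.0):
    """Gaussian radial basis features of a scalar coordinate x (assumed form)."""
    return [math.exp(-gamma * (x - c) ** 2) for c in centers]

def fourier_features(x, num_freqs=3):
    """Sine/cosine features of x at frequencies 1, 2, 4, ... (assumed scheme)."""
    feats = []
    for k in range(num_freqs):
        angle = 2.0 * math.pi * (2 ** k) * x
        feats.extend([math.sin(angle), math.cos(angle)])
    return feats

def encode_query(lat, lon, centers, gamma=1.0, num_freqs=3):
    """Concatenate RBF and Fourier features for a (lat, lon) query in [0, 1]^2."""
    out = []
    for coord in (lat, lon):
        out.extend(rbf_features(coord, centers, gamma))
        out.extend(fourier_features(coord, num_freqs))
    return out

# Hypothetical RBF centers spread over the normalized coordinate range.
centers = [0.0, 0.5, 1.0]
vec = encode_query(0.5, 0.25, centers)
print(len(vec))  # 2 coordinates * (3 RBF + 6 Fourier) features
```

The RBF part responds to how close a query is to fixed reference points, while the sinusoidal part gives a multi-scale description of position; concatenating them is one simple way to combine the two.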

Grammar-Based Code Representation: Is It a Worthy Pursuit for LLMs?

arXiv, 2025

Authors: Liang, Qingyuan; Zhang, Zhao; Sun, Zeyu; Lin, Zheng; Luo, Qi; Xiao, Yueyi; Chen, Yizhou; Zhang, Yuqun; Zhang, Haotian; Zhang, Lu; Chen, Bin; Xiong, Yingfei. Affiliations: School of Computer Science, Peking University, China; Kuaishou Technology, China; Institute of Software, Chinese Academy of Sciences, China; Department of Computer Science and Engineering, Southern University of Science and Technology, China

Grammar serves as a cornerstone in programming languages and software engineering, providing frameworks to define the syntactic space and program structure. Existing research demonstrates the effectiveness of grammar-based code representations in small-scale models, showing their ability to reduce syntax errors and enhance performance. However, as language models scale to the billion level or beyond, syntax-level errors become rare, making it unclear whether grammar information still provides performance benefits. To explore this, we develop a series of billion-scale GrammarCoder models, incorporating grammar rules in the code generation process. Experiments on HumanEval (+) and MBPP (+) demonstrate a notable improvement in code generation accuracy. Further analysis shows that grammar-based representations enhance LLMs' ability to discern subtle code differences, reducing semantic errors caused by minor variations. These findings suggest that grammar-based code representations remain valuable even in billion-scale models, not only by maintaining syntax correctness but also by improving semantic differentiation. © 2025, CC BY.

Keywords: Semantics

Lightweight underwater garbage target detection algorithm based on improved YOLOV7-TINY

4th International Conference on Computer Vision, Application, and Algorithm, CVAA 2024

Authors: Liu, Xinpei; Xiong, Hailing; Gao, Mingjie; Liu, Wei. Affiliations: College of Computer and Information Science Southwest School of Software, Southwest University, Chongqing 400715, China; College of Electronic and Information Engineering, Southwest University, Chongqing, China; Business School, The Taihu Lake University, Chongqing, China; Silkworm Academy Palace, Southwest University, Chongqing, China

ISBN: (digital) 9781510687622

ISBN: (print) 9781510687615

In the field of object detection, deep learning has been used extensively, especially in algorithms like Yolov7, which have achieved significant accuracy improvements. However, traditional convolutional neural networks are computationally intensive and require powerful GPU support, making their deployment on embedded devices difficult. This presents a problem for researchers, as the high device requirements hinder related research work. Consequently, more people opt to use lightweight networks, such as Yolov7-tiny. However, when Yolov7-tiny is used for underwater garbage detection, it achieves good mAP (mean Average Precision) at an IOU (Intersection over Union) of 0.5, but its performance is not satisfactory in the mAP range of 0.5 to 0.95. This limitation may be attributed to the performance trade-off made when lightening the model. To address these issues, this paper proposes an enhanced Yolov7-tiny method for detecting underwater trash objects. First, the algorithm employs an enhanced Ghost convolutional feature extraction module, which starts with conventional convolutions using a smaller number of channels and then performs grouped convolutions to obtain partial output feature maps. The feature maps obtained from the first convolutional step are then added to the channels obtained from the second grouped convolution step. This design effectively reduces model complexity while extracting richer feature information. Second, the algorithm utilizes the CA (Channel Attention) mechanism to weight channels based on their positional information, thereby efficiently extracting features. By learning position weights in the feature maps, the network can concentrate more on important feature regions. Lastly, the algorithm combines the Repeated Weighted Bi-directional Feature Pyramid Network (BIFPN) for feature fusion. BIFPN employs multiple down-sampling steps for short skip connections,

Keywords: Convolution
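The abstract above contrasts mAP at an IoU of 0.5 with mAP over the range 0.5 to 0.95. As a minimal sketch of why the second metric is stricter, the snippet below computes IoU for axis-aligned boxes and checks a prediction against the ten thresholds 0.50, 0.55, ..., 0.95 used in COCO-style evaluation. It assumes the abstract refers to that convention; the box format `(x1, y1, x2, y2)` and the helper names are illustrative, and a full mAP computation (ranking by confidence, precision-recall integration) is deliberately left out.

```python
def iou(box_a, box_b):
    """Intersection over Union of two boxes given as (x1, y1, x2, y2)."""
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

# IoU thresholds 0.50, 0.55, ..., 0.95 (COCO-style averaging range).
THRESHOLDS = [round(0.5 + 0.05 * i, 2) for i in range(10)]

def matched_thresholds(pred, truth):
    """Thresholds at which pred would count as a true positive for truth."""
    score = iou(pred, truth)
    return [t for t in THRESHOLDS if score >= t]

pred = (0, 0, 10, 10)   # hypothetical predicted box
truth = (0, 0, 8, 9)    # hypothetical ground-truth box
hits = matched_thresholds(pred, truth)
print(iou(pred, truth), hits)
```

A loosely localized box like this one counts as correct at the 0.5 threshold but is rejected at the higher ones, which is how a detector can score well on mAP at 0.5 while scoring poorly on the averaged range.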

MixSSC: Forward-Backward Mixture for Vision-based 3D Semantic Scene Completion

IEEE Transactions on Circuits and Systems for Video Technology, 2025

Authors: Wang, Meng; Ding, Yan; Liu, Yumeng; Qin, Yunchuan; Li, Ruihui; Tang, Zhuo. Affiliations: The College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China; Beijing Key Laboratory of Human-Computer Interaction, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China

The vision-based semantic scene completion task aims to predict dense geometric and semantic 3D scene representations from 2D images. However, 3D modeling from a single view is an ill-posed problem, limited by the field of view and by occlusion in the input image. Moreover, existing methods tend to produce erroneous scene hallucinations and overly smooth boundary segmentation due to a lack of information. To address this problem, we propose MixSSC, which mixes the sparsity of forward projection with the denseness of depth-prior backward projection. The aim is to use sparse features to fill information-poor regions and dense features to enhance visible regions. Specifically, we develop the forward-backward mixture module, which generates a scene mixture voxel representation by leveraging the benefits of both forward and backward projection. Subsequently, we design the semantic-spatial fusion module, which uses a coarse-to-fine approach to process mixture voxel features at the semantic-spatial level. Extensive experimental results on the SemanticKITTI, SSCBench-KITTI-360 and nuScenes datasets demonstrate the superiority of MixSSC. © 1991-2012 IEEE.

Keywords: Semantic Segmentation

Enhanced Fire Detection for Industrial and Environmental Safety using IoT-Integrated Vision Sensors and Residual Clutch Attention Network and Depthwise Distinguishable Convolutional Neural Networks

3rd International Conference on Intelligent Data Communication Technologies and Internet of Things, IDCIoT 2025

Authors: Christy Sujatha, D.; Singh, Priyanshu; Vallikannu, C.; Varman, Ravi; Skanda, M.G.; Gavaskar, T. Affiliations: Department of Software Engineering, Periyar Maniammai Institute of Science & Technology, Tamil Nadu, Thanjavur, India; Department of Computer Science and Engineering, R.V College of Engineering, Karnataka, Bangalore, India; Department of Humanities and Sciences, Rajalakshmi Engineering College, Tamil Nadu, Chennai, India; Department of Electrical and Electronics Engineering, Karpagam Academy of Higher Education, Tamil Nadu, Coimbatore, India; Department of Industrial and Production Engineering, SJCE, JSS Science and Technology University, Karnataka, Mysore, India; Department of Mechanical Engineering, St. Joseph's College of Engineering, Tamil Nadu, Chennai, India

ISBN: (print) 9798331527549

Fire detection is an important factor in improving safety in industrial and environmental settings. Conventional flame detectors, which rely largely on heat and smoke sensing, suffer from time lag, low sensitivity, and limited coverage relative to the requirements of large industrial sites and vast natural terrain vulnerable to fire. These limitations make such systems poorly suited to dynamic, high-risk situations, where alarms are effective only with higher accuracy and faster reaction times. To remedy these limitations, this research introduces a fire detection system using IoT and vision sensors that combines a Residual Clutch Attention Network (RCAN) with a Depthwise Distinguishable Convolutional Neural Network (DDCNN). The architecture is tuned using the Crayfish Optimization Algorithm (COA), which adjusts model parameters to improve response time and accuracy. By incorporating feedback from sensors placed at different locations together with readings from the environment, the RCAN-DDCNN model can quickly identify fire patterns and respond by activating sprinkler systems and setting off alarms. The proposed system is a superior option for fire detection because it is scalable, adaptive, and highly precise, making it a strong development in fire detection technology for industry and the environment. The introduced approach attains an accuracy as high as 99%. © 2025 IEEE.

Keywords: Sprinkler systems (fire fighting)

LoRAGuard: An Effective Black-box Watermarking Approach for LoRAs

arXiv, 2025

Authors: Lv, Peizhuo; Xiahou, Yiran; Li, Congyi; Sun, Mengjie; Zhang, Shengzhi; Chen, Kai; Zhang, Yingjun. Affiliations: Institute of Information Engineering, Chinese Academy of Sciences, China; Department of Computer Science, Metropolitan College, Boston University, United States; Institute of Software, Chinese Academy of Sciences, China

LoRA (Low-Rank Adaptation) has achieved remarkable success in the parameter-efficient fine-tuning of large models. The trained LoRA matrix can be integrated with the base model through an addition or negation operation to improve performance on downstream tasks. However, the unauthorized use of LoRAs to generate harmful content highlights the need for effective mechanisms to trace their usage. A natural solution is to embed watermarks into LoRAs to detect unauthorized misuse. However, existing methods struggle when multiple LoRAs are combined or a negation operation is applied, as these can significantly degrade watermark performance. In this paper, we introduce LoRAGuard, a novel black-box watermarking technique for detecting unauthorized misuse of LoRAs. To support both addition and negation operations, we propose the Yin-Yang watermark technique, where the Yin watermark is verified during the negation operation and the Yang watermark during the addition operation. Additionally, we propose a shadow-model-based watermark training approach that significantly improves effectiveness in scenarios involving multiple integrated LoRAs. Extensive experiments on both language and diffusion models show that LoRAGuard achieves nearly 100% watermark verification success and demonstrates strong effectiveness. Copyright © 2025, The Authors. All rights reserved.

Keywords: Watermarking
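The abstract above refers to integrating a LoRA with the base model through addition or negation. A minimal sketch of that arithmetic follows, assuming the usual low-rank update `W = W0 + s * B @ A` and reading "negation" as subtracting the same update; the helper names, the toy matrices, and the `sign`/`scale` parameters are hypothetical, and nothing here reproduces the watermarking method itself.

```python
def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def apply_lora(w0, b, a, sign=1.0, scale=1.0):
    """Return w0 + sign * scale * (b @ a); sign=+1 adds, sign=-1 negates."""
    delta = matmul(b, a)
    return [[w0[i][j] + sign * scale * delta[i][j]
             for j in range(len(w0[0]))] for i in range(len(w0))]

w0 = [[1.0, 0.0], [0.0, 1.0]]   # toy 2x2 base weight
b = [[1.0], [2.0]]              # 2x1 low-rank factor (rank 1)
a = [[0.5, 0.25]]               # 1x2 low-rank factor

added = apply_lora(w0, b, a, sign=+1.0)     # base plus the LoRA update
negated = apply_lora(w0, b, a, sign=-1.0)   # base minus the LoRA update
print(added, negated)
```

Because the two operations push the weights in opposite directions, a watermark tuned for one does not automatically survive the other, which is the situation the abstract says the Yin-Yang design is meant to cover.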

PATCH: Empowering Large Language Model with Programmer-Intent Guidance and Collaborative-Behavior Simulation for Automatic Bug Fixing

arXiv, 2025

Authors: Zhang, Yuwei; Jin, Zhi; Xing, Ying; Li, Ge; Liu, Fang; Zhu, Jiaxin; Dou, Wensheng; Wei, Jun. Affiliations: Affiliated with Nanjing Institute of Software Technology, University of Chinese Academy of Sciences, Nanjing, China; Key Laboratory of System Software, Chinese Academy of Sciences, Institute of Software, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Beijing, China; Key Laboratory of High Confidence Software Technologies, Peking University, Ministry of Education, School of Computer Science, Peking University, Beijing, China; School of Computer Science, Wuhan University, Wuhan, China; School of Intelligent Engineering and Automation, Beijing University of Posts and Telecommunications, Beijing, China; State Key Laboratory of Complex & Critical Software Environment, School of Computer Science and Engineering, Beihang University, Beijing, China

Bug fixing holds significant importance in software development and maintenance. Recent research has made substantial strides in exploring the potential of large language models (LLMs) for automatically resolving software bugs. However, a noticeable gap in existing approaches lies in the oversight of collaborative facets intrinsic to bug resolution, treating the process as a single-stage endeavor. Moreover, most approaches solely take the buggy code snippet as input for LLMs during the patch generation stage. To mitigate the aforementioned limitations, we introduce a novel stage-wise framework named PATCH. Specifically, we first augment the buggy code snippet with corresponding dependence context and intent information to better guide LLMs in generating the correct candidate patches. Additionally, by taking inspiration from bug management practices, we decompose the bug-fixing task into four distinct stages: bug reporting, bug diagnosis, patch generation, and patch verification. These stages are performed interactively by LLMs, aiming to simulate the collaborative behavior of programmers during the resolution of software bugs. By harnessing these collective contributions, PATCH effectively enhances the bug-fixing capability of LLMs. We implement PATCH by employing the powerful dialogue-based LLM ChatGPT. Our evaluation on the widely used bug-fixing benchmark BFP demonstrates that PATCH has achieved better performance than state-of-the-art LLMs. Copyright © 2025, The Authors. All rights reserved.

Keywords: Computer software maintenance

NetScribed: A Deep Learning Approach for Machine-Based Melody Transcription of Audio Files

7th International Conference on Applied Informatics, ICAI 2024

Authors: Volschenk, Francois; van Der Haar, Dustin. Affiliation: Academy of Computer Science and Software Engineering, University of Johannesburg, Cnr University Road and Kingsway Avenue, Auckland Park, Gauteng, Johannesburg 2092, South Africa

ISBN: (print) 9783031751431

Automatic Music Transcription (AMT) entails creating an algorithm that converts an acoustic signal from an audio file into the corresponding sheet music representation. This paper uses deep learning methods and models AMT as a translation problem, comparing the effectiveness of an instance-based translation approach using an MLP to a sequence-based approach using an RNN. The models were trained on the EsAc dataset and evaluated using MUSTER metrics. The results show that the instance-based model better classifies the correct pitch. However, the sequence-based approach outperforms the instance-based approach on all other aspects of the MUSTER metrics, producing a 98% accuracy. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

Keywords: Deep learning

On EDA-Driven Learning for SAT Solving

Proceedings of the 60th Annual ACM/IEEE Design Automation Conference

Authors: Min Li; Zhengyuan Shi; Qiuxia Lai; Sadaf Khan; Shaowei Cai; Qiang Xu. Affiliations: Department of Computer Science and Engineering, The Chinese University of Hong Kong, Hong Kong S.A.R; Communication University of China, China; State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, China

ISBN: (print) 9798350323481

We present DeepSAT, a novel end-to-end learning framework for the Boolean satisfiability (SAT) problem. Unlike existing solutions trained on random SAT instances with relatively weak supervision, we propose applying the knowledge of the well-developed electronic design automation (EDA) field for SAT solving. Specifically, we first resort to logic synthesis algorithms to pre-process SAT instances into optimized and-inverter graphs (AIGs). By doing so, the distribution diversity among various SAT instances can be dramatically reduced, which facilitates improving the generalization capability of the learned model. Next, we regard the distribution of SAT solutions as a product of conditional Bernoulli distributions. Based on this observation, we approximate the SAT solving procedure with a conditional generative model, leveraging a novel directed acyclic graph neural network (DAGNN) with two polarity prototypes for conditional SAT modeling. To effectively train the generative model, with the help of logic simulation tools, we obtain the probabilities of nodes in the AIG being logic '1' as rich supervision. We conduct comprehensive experiments on various SAT problems. Our results show that DeepSAT achieves significant accuracy improvements over state-of-the-art learning-based SAT solutions, especially when generalized to SAT instances that are relatively large or with diverse distributions.
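The abstract above mentions using logic simulation to obtain the probability that each AIG node evaluates to logic '1'. The sketch below shows one simple way such probabilities could be estimated, by random simulation of a toy and-inverter graph. The gate encoding, the function name `simulate_aig`, and the uniform random input patterns are assumptions made for illustration; this is not the DeepSAT pipeline or any particular EDA tool.

```python
import random

def simulate_aig(num_inputs, and_gates, patterns=2000, seed=0):
    """Estimate P(node = 1) for each node of a toy AIG by random simulation.

    Nodes 0..num_inputs-1 are primary inputs. Each AND gate is a tuple
    (left, left_inverted, right, right_inverted) referring to earlier nodes.
    """
    rng = random.Random(seed)
    total = num_inputs + len(and_gates)
    ones = [0] * total
    for _ in range(patterns):
        # Draw each primary input as an independent fair coin flip.
        values = [rng.random() < 0.5 for _ in range(num_inputs)]
        for left, l_inv, right, r_inv in and_gates:
            lv = values[left] != l_inv    # XOR with the flag applies the inverter
            rv = values[right] != r_inv
            values.append(lv and rv)
        for i, v in enumerate(values):
            ones[i] += v
    return [count / patterns for count in ones]

# Toy circuit: node 2 = x0 AND x1, node 3 = (NOT x0) AND (NOT x1),
# node 4 = node 2 AND node 3, which can never be true.
gates = [(0, False, 1, False), (0, True, 1, True), (2, False, 3, False)]
probs = simulate_aig(2, gates)
print(probs)
```

Per-node probabilities of this kind give a denser training signal than a single satisfiable/unsatisfiable label, which appears to be what the abstract means by "rich supervision".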

Encryption Traffic Classification Based on Mining Traffic Context and Transport Relationship

IEEE Conference on Wireless Communications and Networking

Authors: Weilin Gai; Runqing Zhang; Huiyuan Zhang; Yu Guo; Jun Yin; Peng Zhang. Affiliations: TCA, Institute of Software, Chinese Academy of Sciences, Beijing, China; School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing, China; National Computer Network Emergency Response Technical Team/Coordination Center of China, Beijing, China; School of Cyber Science and Engineering, Nanjing University of Science and Technology, Nanjing, China

ISBN: (digital) 9798350368369

ISBN: (print) 9798350368376

This paper proposes ETC-MTCTR, a novel method designed to enable more accurate, versatile, and efficient traffic classification for multi-scenario, low-resource encrypted traffic. The method comprises three modules: Datagram Token conversion, pretraining, and fine-tuning. It pretrains on large-scale unlabeled encrypted traffic to mine and learn the traffic context and transmission relationships relevant to encrypted traffic classification, so that a small number of labeled samples can be used effectively in the fine-tuning stage. This significantly improves the model's performance on specific downstream classification tasks, enhances its accuracy, adaptability, and robustness under diverse environments, limited resources, and new encryption security protocols, and achieves efficient encrypted traffic classification in multi-scenario, low-resource settings. The results show that ETC-MTCTR achieves the best performance on three tasks: encrypted malware classification, VPN encrypted traffic classification, and TLS 1.3 encrypted application classification. Its F1 score improves by 0.22% on encrypted malware classification, 1.4% on VPN encrypted traffic App classification, 4.56% on VPN encrypted traffic Service classification, and 9.89% on TLS 1.3 encrypted application classification, significantly outperforming the comparison methods.

Keywords: Adaptation models; Accuracy; Protocols; Malware; Robustness; Encryption; Virtual private networks; Data mining