ISBN: 9798350385878 (electronic)
ISBN: 9798350385885 (print)
The advancement of long-read sequencing technologies has led to a significant increase in biological sequencing big data. Although several reference-free compressors, both dedicated and general-purpose, are available for reducing the storage cost of long-read data, choosing a suitable one is challenging due to the shortage of thorough and systematic evaluations of their lossless compression effectiveness. In this study, we performed benchmark examinations on 30 compressors, including 11 specialized for long reads and 19 general-purpose ones, using 31 real-world datasets spanning different sequencing platforms, species, and read lengths. Each lossless compressor was evaluated on 13 performance measures, including compression strength, compression robustness, and the time and peak memory required for compression and decompression. Additionally, for future long-read data compressors, we outlined research directions covering the security of privacy-sensitive sequence data, hardware parallel acceleration, parameter-tuning frameworks, and integrated hardware-algorithm system design. We summarized the results as the Long Reads Compression Benchmark, available at https://***/fahaihi/LRCB.
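For reference, the sketch below shows how a single compressor run could be scored on two of the thirteen measures named above (compression time and compression ratio). It is a generic illustration, not the LRCB harness; the gzip command and file names in the usage comment are placeholders standing in for any of the evaluated compressors.

```python
# Minimal sketch: time one compressor invocation and compute its compression
# ratio. Peak memory, another measure named above, is typically collected
# separately (e.g. with `/usr/bin/time -v`) and is omitted here.
import os
import subprocess
import time

def compress_and_score(cmd, original_path, compressed_path):
    """Run one compressor command; return (wall-clock seconds, compression ratio)."""
    start = time.perf_counter()
    subprocess.run(cmd, check=True)
    wall_seconds = time.perf_counter() - start
    ratio = os.path.getsize(original_path) / os.path.getsize(compressed_path)
    return wall_seconds, ratio

# Hypothetical usage with gzip as a stand-in compressor:
# secs, ratio = compress_and_score(["gzip", "-kf", "reads.fastq"],
#                                  "reads.fastq", "reads.fastq.gz")
```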
ISBN: 9798350368741 (electronic)
ISBN: 9798350368758 (print)
This paper addresses the limitations of the Contrastive Language-Image Pre-training (CLIP) model's image encoder and proposes WSSS-ECFE, a segmentation model with enhanced CLIP feature extraction, aiming to improve performance on the Weakly Supervised Semantic Segmentation (WSSS) task. WSSS-ECFE employs the Enhanced Bottleneck module proposed in this paper and adds a dynamic residual connection to improve the model's handling of complex scenes. In terms of implementation, the Enhanced Bottleneck module uses the Swish activation function and depthwise separable convolution to strengthen the model's feature extraction and segmentation capability, and applies multiple attention mechanisms to further refine the feature representation and segmentation accuracy. On the public PASCAL VOC 2012 and MS COCO 2014 datasets, the model achieves 82.6% and 56.3% mean intersection over union (mIoU) respectively, reaching state-of-the-art performance among models with low resource requirements.
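As an illustration of the kind of block described above, the following PyTorch sketch combines a depthwise separable convolution, the Swish (SiLU) activation, a simple channel-attention gate, and a learnable residual weight. The layer sizes and attention design are assumptions for illustration and do not reproduce the paper's exact Enhanced Bottleneck module.

```python
import torch
import torch.nn as nn

class EnhancedBottleneckSketch(nn.Module):
    """Illustrative bottleneck block; not the WSSS-ECFE implementation."""
    def __init__(self, channels, reduction=4):
        super().__init__()
        # Depthwise separable convolution: depthwise 3x3 followed by pointwise 1x1.
        self.depthwise = nn.Conv2d(channels, channels, 3, padding=1, groups=channels)
        self.pointwise = nn.Conv2d(channels, channels, 1)
        self.act = nn.SiLU()  # SiLU is the Swish activation
        # Simple squeeze-and-excitation style channel attention.
        self.attn = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),
            nn.Conv2d(channels, channels // reduction, 1),
            nn.SiLU(),
            nn.Conv2d(channels // reduction, channels, 1),
            nn.Sigmoid(),
        )
        # Learnable scalar controlling how much of the identity path is mixed in.
        self.res_weight = nn.Parameter(torch.tensor(1.0))

    def forward(self, x):
        out = self.act(self.pointwise(self.depthwise(x)))
        out = out * self.attn(out)            # channel attention
        return out + self.res_weight * x      # "dynamic" residual connection

# x = torch.randn(1, 64, 32, 32); y = EnhancedBottleneckSketch(64)(x)
```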
ISBN: 9798400713316 (print)
In the continually evolving landscape of online platforms, integrating multimodal information into recommender systems offers a promising avenue for enhancing our understanding of user preferences and product insights. Traditional models rely primarily on user-item interactions, whereas multimodal systems exploit additional data modalities (text, images, audio, and video) to improve recommendation accuracy. However, existing multimodal recommendation architectures often fail to fully exploit the potential synergy between multimodal feature extraction and the recommendation process, leading to domain bias and false positives. In this work, we introduce ModalSync, a novel multimodal pre-training framework designed to synchronize multimodal features with user behaviors, closely aligning with human perceptual processes. Unlike previous approaches that use pre-trained generic encoders, ModalSync incorporates a pre-training method that combines unsupervised and supervised strategies, thereby fostering a harmonious relationship between interaction graphs and multimodal data. Our framework addresses domain bias by infusing recommendation-specific interaction data into the feature extraction process, and reduces false positives by directing encoder attention towards crucial attributes. Furthermore, ModalSync introduces a staged co-training module that strategically adjusts the training dynamics of the feature extractors and GNNs, promoting an effective and seamless fusion of multimodal information. Extensive experiments on three public datasets demonstrate that ModalSync significantly outperforms existing methods, achieving state-of-the-art results.
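The staged co-training idea can be pictured with the rough sketch below, which warms up the recommendation GNN with frozen multimodal encoders before unfreezing them for joint fine-tuning. The stage boundary, model interfaces, and optimizer settings are illustrative assumptions rather than ModalSync's actual schedule.

```python
import torch

def staged_co_training(encoder, gnn, loader, loss_fn, epochs=20, warmup=5):
    """Stage 1: train the GNN with the multimodal encoder frozen.
    Stage 2: unfreeze the encoder and train both components jointly."""
    opt = torch.optim.Adam(list(encoder.parameters()) + list(gnn.parameters()), lr=1e-3)
    for epoch in range(epochs):
        freeze_encoder = epoch < warmup
        for p in encoder.parameters():
            p.requires_grad = not freeze_encoder
        for users, items, modal_feats, labels in loader:
            feats = encoder(modal_feats)          # multimodal feature extraction
            scores = gnn(users, items, feats)     # recommendation scores
            loss = loss_fn(scores, labels)
            opt.zero_grad()
            loss.backward()
            opt.step()
```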
As a new stage in the development of the cloud computing paradigm, serverless computing has the high-level abstraction characteristic of shielding underlying details, which makes it extremely challenging for users to choose a suitable serverless platform. To address this, targeting the jointcloud computing scenario of heterogeneous serverless platforms across multiple clouds, this paper presents FCloudless, a jointcloud collaborative mechanism with cross-cloud detection of the full-lifecycle performance of serverless platforms. Based on a benchmark metric set that probes the performance-critical stages of the full lifecycle, the paper proposes a performance optimization algorithm driven by the detected performance data: it takes into account all key stages that affect performance during the lifecycle of a function and predicts the overall performance by combining the scores of local stages with dynamic weights. We evaluate FCloudless on AWS, AliYun, and Azure. The experimental results show that FCloudless can detect the underlying performance of serverless platforms hidden in the black box, and that its optimization algorithm can select the optimal scheduling strategy for various applications in a jointcloud environment. FCloudless reduces runtime by 23.3% and 24.7% for cold and warm invocations respectively under cost constraints.
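The score-combination step can be illustrated with the minimal sketch below, in which the predicted overall performance of a platform is a weighted sum of per-stage scores. The stage names and weights are hypothetical and do not reflect FCloudless's actual metrics or weighting rule.

```python
def predict_overall(stage_scores, stage_weights):
    """Combine per-stage scores with (normalized) dynamic weights into one prediction."""
    total_weight = sum(stage_weights.values())
    return sum(stage_scores[s] * stage_weights[s] / total_weight for s in stage_scores)

# Hypothetical probed stages of a function's lifecycle on one platform:
scores  = {"cold_start": 0.62, "runtime": 0.81, "network": 0.74, "storage": 0.55}
weights = {"cold_start": 3.0,  "runtime": 2.0,  "network": 1.0,  "storage": 0.5}
print(predict_overall(scores, weights))
```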
In intelligent transportation systems (ITSs), integrating pedestrians and vehicles into traffic management models is essential for developing realistic and safe solutions. However, current systems often fail to simula...
The correctness and robustness of a neural network model usually scale with its depth and width. To cope with complex applications, current neural network models are becoming deeper and wider, which imposes high memory and compute capacity requirements on the training process. Multi-accelerator parallelism, which deploys multiple accelerators in parallel to train a neural network, is a promising answer to both challenges. Among parallel schemes, the pipeline parallel scheme has a great advantage in training speed, but its memory requirements are relatively higher than those of other schemes. To address this challenge of the pipeline parallel scheme, we propose a data transfer mechanism that effectively reduces the peak memory usage of the training process through real-time data transfer. In our experiments, we implement the design and apply it to PipeDream, a mature pipeline parallel scheme. The memory requirement of the training process is reduced by up to 48.5%, while the speed loss is kept within a reasonable range.
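The general idea of trading transfer time for lower peak device memory can be sketched as follows in PyTorch: activations are stashed in host memory after the forward pass of a micro-batch and copied back just before its backward pass. This is only a conceptual illustration, not the mechanism the paper implements on top of PipeDream.

```python
import torch

class ActivationOffloader:
    """Stash activations on the host and restore them on demand."""
    def __init__(self):
        self._stash = {}

    def offload(self, key, activation):
        # non_blocking copies can overlap the transfer with ongoing compute
        self._stash[key] = activation.to("cpu", non_blocking=True)

    def restore(self, key, device="cuda"):
        return self._stash.pop(key).to(device, non_blocking=True)

# offloader = ActivationOffloader()
# offloader.offload(microbatch_id, hidden)    # after the forward of a micro-batch
# hidden = offloader.restore(microbatch_id)   # right before its backward pass
```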
With the development of Deep Learning (DL), Deep Neural Network (DNN) models have become more complex. At the same time, the development of the Internet makes it easy to obtain large datasets for DL training. Large-scale model parameters and training data raise the level of AI by improving the accuracy of DNN models, but they also pose more severe challenges to the hardware training platform, because training a large model requires computing and memory resources that can easily exceed the capacity of a single processor. In this context, integrating more processors into a hierarchical system for distributed training is a key direction for the development of training platforms. In distributed training, collective communication operations (including all-to-all, all-reduce, and all-gather) take up a large share of training time, making the interconnection network between computing nodes one of the most critical factors affecting system performance. The hierarchical torus topology, combined with the Ring All-Reduce collective communication algorithm, is one of the current mainstream distributed interconnection networks; however, we argue that its communication performance is not optimal. In this work, we first design a new intra-package communication topology, i.e., a switch-based fully connected topology, which shortens the time consumed by cross-node communication. Then, considering the characteristics of this topology, we carefully devise more efficient all-reduce and all-gather communication algorithms. Finally, combined with the torus topology, we implement a novel distributed DL training platform. Compared with the hierarchical torus, our platform improves communication efficiency and provides a 1.16-2.68x speedup in distributed training of DNN models.
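For context, the baseline Ring All-Reduce algorithm mentioned above can be simulated in a few lines: each of n nodes contributes a vector of n chunks, and after 2*(n-1) neighbor-exchange steps every node holds the element-wise sum. This is the standard textbook algorithm, not the paper's switch-based variant.

```python
def ring_all_reduce(vectors):
    """Simulate ring all-reduce among n nodes; each node holds a vector of n chunks."""
    n = len(vectors)
    chunks = [list(v) for v in vectors]   # chunks[i][c]: node i's value for chunk c
    # Reduce-scatter phase: after n-1 steps, node i holds the full sum of chunk (i+1) % n.
    for step in range(n - 1):
        sends = [(i, (i - step) % n, chunks[i][(i - step) % n]) for i in range(n)]
        for i, c, value in sends:          # all sends of a step happen "simultaneously"
            chunks[(i + 1) % n][c] += value
    # All-gather phase: n-1 more steps circulate the finished chunks around the ring.
    for step in range(n - 1):
        sends = [(i, (i + 1 - step) % n, chunks[i][(i + 1 - step) % n]) for i in range(n)]
        for i, c, value in sends:
            chunks[(i + 1) % n][c] = value
    return chunks

# Every node ends up with the element-wise sum [1, 2, 3]:
# print(ring_all_reduce([[1, 0, 0], [0, 2, 0], [0, 0, 3]]))
```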
ISBN: 9781665424288 (print)
Many anomaly detection applications can provide partially observed anomalies, but only limited work addresses this setting. Additionally, a number of anomaly detectors focus on learning a particular model of the normal/abnormal class. However, the intra-class model might be too complicated to be learned accurately, and handling data whose anomalies/inliers follow skewed and heterogeneous distributions remains a non-trivial task. To address these problems, this paper proposes an anomaly detection method that leverages Partially Labeled anomalies via Surrogate supervision-based Deviation learning (termed PLSD). The original supervision (i.e., known anomalies and a set of explored inliers) is transformed into semantically rich surrogate supervision signals (i.e., the anomaly-inlier and inlier-inlier classes) via vector concatenation. The different relationships and interactions between anomalies and inliers are then learned directly and efficiently, thanks to the neural network's connection property. Anomaly scoring is performed with the trained network and the high-efficacy inliers. Extensive experiments show that PLSD significantly outperforms state-of-the-art semi/weakly-supervised anomaly detectors.
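The surrogate-supervision construction can be illustrated with the small sketch below, which concatenates feature vectors to form anomaly-inlier pairs (label 1) and inlier-inlier pairs (label 0). The sampling strategy and pair counts are illustrative assumptions, not the exact procedure used by PLSD.

```python
import numpy as np

def build_surrogate_pairs(anomalies, inliers, n_pairs, rng=np.random.default_rng(0)):
    """anomalies / inliers: lists of 1-D feature vectors. Returns (X, y) for a binary classifier."""
    X, y = [], []
    for _ in range(n_pairs):
        i = inliers[rng.integers(len(inliers))]
        if rng.random() < 0.5:                       # anomaly-inlier pair
            a = anomalies[rng.integers(len(anomalies))]
            X.append(np.concatenate([a, i])); y.append(1)
        else:                                        # inlier-inlier pair
            j = inliers[rng.integers(len(inliers))]
            X.append(np.concatenate([i, j])); y.append(0)
    return np.stack(X), np.array(y)
```

The resulting pairs can train an ordinary binary classifier whose output, when a new point is paired against known inliers, serves as a deviation-style anomaly score.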
ISBN: 9781665421263 (print)
Payload anomaly detection can discover malicious behaviors hidden in network packets. Payloads are hard to handle due to their wide range of possible characters and complex semantic context, which makes identifying abnormal payloads a non-trivial task. Prior art relies solely on the n-gram language model to extract features, which leads to an ultra-high-dimensional feature space and fails to fully capture contextual semantics. Accordingly, this paper proposes a word embedding-based, context-sensitive network flow payload anomaly detection method (termed WECAD). First, WECAD obtains an initial feature representation of the payload through a word embedding-based method. Then, we propose a corpus pruning algorithm that applies cosine-similarity clustering and frequency distributions to prune inconsequential characters, keeping only the essential characters to reduce the computation space. Subsequently, we propose a context learning algorithm that employs co-occurrence matrix transformation and introduces a backward step size to account for the order relationships among essential characters. Comprehensive experiments on real-world intrusion detection datasets validate the effectiveness of our method.
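Two of the ingredients mentioned above, frequency-based character pruning and a backward-looking co-occurrence matrix, can be sketched as follows. The coverage threshold and backward step size are assumptions, and the cosine-similarity clustering used in the paper's pruning step is omitted for brevity.

```python
from collections import Counter
import numpy as np

def prune_corpus(payloads, keep_ratio=0.5):
    """Keep the most frequent characters until they cover keep_ratio of the corpus."""
    counts = Counter(ch for p in payloads for ch in p)
    total = sum(counts.values())
    kept, covered = [], 0
    for ch, c in counts.most_common():
        kept.append(ch)
        covered += c
        if covered / total >= keep_ratio:
            break
    return kept

def cooccurrence(payloads, vocab, back_step=3):
    """Count how often each essential character is preceded by another within back_step positions."""
    idx = {ch: i for i, ch in enumerate(vocab)}
    M = np.zeros((len(vocab), len(vocab)))
    for p in payloads:
        seq = [idx[ch] for ch in p if ch in idx]
        for t, cur in enumerate(seq):
            for prev in seq[max(0, t - back_step):t]:   # only look backwards
                M[prev, cur] += 1
    return M
```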
In this paper, we propose an approach to assess the ability of developers based on their behavior data from OSS. Specifically, we classify developers' ability into code ability, project management ability, and soc...