检索结果-内蒙古大学图书馆

SSRN 2024年

作者： Li, Hongliang Wang, Zichen Zhao, Hairui Zhang, Meng Li, Xiang Xu, Haixiao College of Computer Science and Technology Jilin University Qianjin street 2699 Jilin Changchun130012 China Key Laboratory of Symbolic Computation and Knowledge Engineering The Ministry of Education Qianjin street 2699 Jilin Changchun130012 China High Performance Computing Center Qianjin street 2699 Jilin Changchun130012 China

Training Deep Learning (DL) models are becoming more time-consuming, thus interruptions to the training processes are inevitable. Existing fault-tolerant work adopted checkpoint/recovery mechanism from traditional HPC platforms that makes periodical persistent copies of model states to save execution progress. These checkpoints have fixed time intervals that they are evenly placed across the training. We can obtain an optimal checkpointing interval for an HPC job with the precondition that the progress of a job is proportional to its execution time. Unfortunately, it is not the case in DL model training where a DL training job yields diminishing returns across its lifetime. It makes the early progress of a DL training job more valuable than the later ones. Even placement of checkpoints would either increase the risks in the early stages or waste resources overprotecting the latter stages. Meanwhile, the issue can get amplified for exploratory training jobs, where early terminations are common. Moreover, in data parallelism, the state-of-art quality-driven scheduling strategies allocate more resources for the early stages of a job than the later ones to accelerate the training progress which further amplifies the issue. This paper introduces a novel checkpointing interval problem for exploratory DL training jobs based on model convergence progress. We present COCI, an approach to compute optimal checkpointing configuration for a DL training job, minimizing the fault-tolerant overhead, including checkpointing cost and recovery cost. We implement COCI based on state-of-art iteration-level checkpointing mechanism, as a pluggable module compatible with PyTorch. COCI requires no extra user input. We conduct comprehensive evaluations with real DL application setups. The experimental results show that COCI reduces up to 40.18% fault-tolerant overhead compared to existing state-of-the-art DL fault-tolerant methods in serial scenario, 57.26% in data parallel scenario and 63.8

关键词： Iterative methods

来源：评论

学校读者我要写书评

暂无评论

NON-RADIATING ELASTIC SOURCES IN INHOMOGENEOUS ELASTIC MEDIA AT CORNERS WITH APPLICATIONS

arXiv

引用

arXiv 2025年

作者： Diao, Huaian Geng, Yueran Tang, Ruixiang School of Mathematics Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education Jilin University Changchun China School of Mathematics Jilin University Changchun130012 China Department of Mathematics City University of Hong Kong Kowloon Hong Kong

This paper is concerned with non-radiating elastic sources in inhomogeneous elastic media. We demonstrate that the value of non-radiating elastic sources must vanish at convex corners of their support, provided the sources exhibit Hölder continuous regularity near the corner. Additionally, their gradient must satisfy intricate algebraic relationships with the angles defining the underlying corners, assuming the sources have C1,α regularity with α ∈ (0, 1) in the neighborhood of the corners. To perform microlocal analysis around the corners, we employ the so-called complex geometrical optics (CGO) solutions as test functions within a partial differential system. These characterizations of non-radiating elastic sources in inhomogeneous elastic media at corners enable us to establish the unique identifiability results for determining the position and shape of radiating elastic sources by a single far-field measurement, both locally and globally. The uniqueness result by a single far-field measurement is a challenging problem with a colorful history in inverse scattering. Indeed, when the support of a radiating elastic source is a convex polygon, we can simultaneously determine the shape of the elastic source and its values at the corners, provided the source is Hölder continuous at the corner. Furthermore, when the source function exhibits C1,α regularity in the neighborhood of a corner, the gradient of the source function at that corner can also be generically *** Codes 35Q74, 35R30, 74B05, 86A22 © 2025, CC BY.

关键词： Geometrical optics

来源：评论

学校读者我要写书评

暂无评论

Application of Hybrid Monocular SLAM Method in Augmented Reality 5

Application of Hybrid Monocular SLAM Method in Augmented Rea...

引用

2020 5th International Seminar on Computer Technology, Mechanical and Electrical engineering, ISCME 2020

作者： Zhang, Zuoming Wang, Hanwen Shu, Man Wang, Xin College of Software Engineering Jilin University Changchun Jilin130000 China Key Laboratory of Symbolic Computation and Knowledge Engineer of Ministry of Education Jilin University Changchun Jilin130000 China

In this paper, we design a hybrid (semi-direct) approach to simultaneous localization and mapping (SLAM) for monocular cameras and apply it to augmented reality (AR) for monocular cameras. We combine the advantagesof the direct method and the feature point method. We use both photometric bundle adjustment which is robust to camera exposure time and motion bundle adjustment which is geometrically robust based on feature points to do tracking process. This approach can maintain an intuitive direct local map as well as a reusable global sparse feature point map. Through the processing of point clouds, such as PCA plane detection and grid reconstruction, we greatly improve the effect of the augmented reality system. © Published under licence by IOP Publishing Ltd.

关键词： Augmented reality

来源：评论

学校读者我要写书评

暂无评论

Automating Discussion Structure Re-Organization for Github Issues

SSRN

引用

SSRN 2023年

作者： Bai, Shuotong Liu, Lei Meng, Chenkun Liu, Huaxiao College of Computer Science and Technology Jilin University Jilin Changchun130012 China Key Laboratory of Symbolic Computation and Knowledge Engineering Ministry of Education Jilin University Jilin Changchun130012 China

As a popular social code hosting platform, GitHub encourages developers to discuss and leave opinions on issues for better repository development and closer team collaboration. However, popular issues can be bloated over the time, in particular, the linear format of GitHub issue discussions makes it difficult for developers to organize and extract useful information. For a better understanding of GitHub issue discussions, we first conduct an empirical study. Among the 14 most-starred repositories, we notice that 16,740 issues contain more than 10 comments. Then, more commented issue discussions refer to additional repository contributions and draw more developer attention. Hence, we narrow our perspective to the issue discussions with more than 10 comments. For 50.29% of these popular discussions, the topics of content are subject to change explicitly, and more than 36% of consecutive comment pairs do not host response relationships. In addition, just 40% of the comments on GitHub use the @ or quoting functions when commenting on other people's comments, but these functions are not only used for responding, but also for referencing, informing others, etc. Based on these results, these popular discussions suffer from intertwined content of various topics and ambiguous response linkage. To mitigate the situation, we propose a new approach IRA to automatically re-organize GitHub issue discussions, aiming at converting an issue discussion with the linear structure into a discussion tree with key information. The experimental results show that our approach outperforms other baselines, and achieves an average improvement of 19.97%, 14,25% on metrics of ACC and F1-score in the task of predicting response relationships, as well as gets 15.78%, 51.72%, 26.92%, 21.03%, 22.08%, 25.59% improvement in terms of parent accuracy, Variation of Information, One-to-One Overlap, and all Exact Match metrics in the re-organizing task. To investigate human perspectives on our re-organized

关键词： Deep learning

来源：评论

学校读者我要写书评

暂无评论

Application of Monocular Direct Vision Odometry in Augmented Reality 5

Application of Monocular Direct Vision Odometry in Augmented...

引用

2020 5th International Seminar on Computer Technology, Mechanical and Electrical engineering, ISCME 2020

作者： Zhang, Zuoming Wang, Zixuan Wang, Hanwen Wang, Xin College of Software Engineering Jilin University Changchun Jilin130000 China Key Laboratory of Symbolic Computation and Knowledge Engineer of Ministry of Education Jilin University Changchun Jilin130000 China

In recent years, the unlabeled augmented reality system has been gradually applied to various mobile devices, among which stable, accurate, and fast registration is the key to realizing this function. For this technique, this paper introduces camera exposure parameters and puts the data association and pose estimation into a unified nonlinear optimization problem. Moreover, the direct monocular vision odometer is transplanted into the augmented reality system through the position adjustment module. We compare it with the traditional visual odometry method that matches the feature points. The results show that this improved method can be used to track more quickly and build a more visual semi-dense point cloud map, which can be used to support the registration and tracking of virtual objects in augmented reality. © Published under licence by IOP Publishing Ltd.

关键词： Augmented reality

来源：评论

学校读者我要写书评

暂无评论

RFR-WWANet: Weighted Window Attention-Based Recovery Feature Resolution Network for Unsupervised Image Registration

arXiv

引用

arXiv 2023年

作者： Ma, Mingrui Wang, Tao Wang, Weijie Song, Lei Liu, Guixia College of Computer Science and Technology Jilin University Changchun China Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education Changchun China Department of Information Engineering and Computer Science University of Trento Trento Italy

The Swin transformer has recently attracted attention in medical image analysis due to its computational efficiency and long-range modeling capability. Owing to these properties, the Swin Transformer is suitable for establishing more distant relationships between corresponding voxels in different positions in complex abdominal image registration tasks. However, the registration models based on transformers combine multiple voxels into a single semantic token. This merging process limits the transformers to model and generate coarse-grained spatial information. To address this issue, we propose Recovery Feature Resolution Network (RFRNet), which allows the transformer to contribute fine-grained spatial information and rich semantic correspondences to higher resolution levels. Furthermore, shifted window partitioning operations are inflexible, indicating that they cannot perceive the semantic information over uncertain distances and automatically bridge the global connections between windows. Therefore, we present a Weighted Window Attention (WWA) to build global interactions between windows automatically. It is implemented after the regular and cyclic shift window partitioning operations within the Swin transformer block. The proposed unsupervised deformable image registration model, named RFR-WWANet, detects the long-range correlations, and facilitates meaningful semantic relevance of anatomical structures. Qualitative and quantitative results show that RFR-WWANet achieves significant improvements over the current state-of-the-art methods. Ablation experiments demonstrate the effectiveness of the RFRNet and WWA designs. Our code is available at https://***/MingR-Ma/RFR-WWANet. Copyright © 2023, The Authors. All rights reserved.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Deep Learning-Based Prediction of Myelosuppression in Lymphoma Patients During Chemotherapy Using Multimodal Radiological Images with Subcutaneous Adipose Tissue

Deep Learning-Based Prediction of Myelosuppression in Lympho...

引用

International Conference on Image, Vision and Intelligent Systems, ICIVIS 2023

作者： Du, Tianming Sun, Hongzan Yang, Jinzhu Grzegorzek, Marcin Li, Chen Microscopic Image and Medical Image Analysis Group College of Medicine and Biological Information Engineering Northeastern University Shenyang China Shengjing Hospital China Medical University Shenyang China Key Laboratory of Intelligent Computing in Medical Image Ministry of Education Northeastern University Shenyang China Institute of Medical Informatics University of Luebeck Luebeck Germany Department of Knowledge Engineering University of Economics in Katowice Katowice Poland

ISBN: (纸本)9789819708543

Lymphoma is a malignant tumor, and diffuse large B-cell lymphoma (DLBCL) is the most common type of non-Hodgkin's lymphoma. Due to its biological characteristics, surgical treatment is difficult. The main treatment for DLBCL is chemotherapy, with the R-CHOP regimen being the most common. The vast majority of patients require lifelong treatment. Myelosuppression during chemotherapy is the most common adverse reaction in DLBCL patients and directly affects the progress of chemotherapy. Accurately predicting whether patients need early intervention before chemotherapy can greatly improve their prognosis. In this paper, we propose a neural network that uses PET/CT images of subcutaneous adipose tissue before treatment to predict myelosuppression in DLBCL patients. The model achieves a classification accuracy of 93.57%. This indicates that the growth distribution pattern and metabolic characteristics of adipose tissue are important for DLBCL patients. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

关键词： Chemotherapy

来源：评论

学校读者我要写书评

暂无评论

Spatiotemporal Transformer for Data Inference and Long Prediction in Sparse Mobile CrowdSensing

Spatiotemporal Transformer for Data Inference and Long Predi...

引用

IEEE Annual Joint Conference: INFOCOM, IEEE Computer and Communications Societies

作者： En Wang Weiting Liu Wenbin Liu Chaocan Xiang Bo Yang Yongjian Yang College of Computer Science and Technology Jilin University China Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education Jilin University China College of Computer Science Chongqing University China

Mobile CrowdSensing (MCS) is a data sensing paradigm that recruits users carrying mobile terminals to collect data. As its variant, Sparse MCS has been further proposed for large-scale and fine-grained sensing task with the advantage of collecting only a few data to infer unsensed data. However, in many real-world scenarios, such as early prevention of epidemic, people are interested in not only the data at the current, but also in the future or even long-term future, and the latter may be more important. Long-term prediction not only reduces sensing cost, but also identifies trends or other characteristics of the data. In this paper, we propose a spatiotemporal model based on Transformer to infer and predict the data with sparse sensed data by utilizing spatiotemporal relationships. We design a spatiotemporal feature embedding to embed the prior spatiotemporal information of sensing map into the model to guide model learning. Moreover, we also design a novel multi-head spatiotemporal attention mechanism to dynamically capture spatiotemporal relationships among data. Extensive experiments have been conducted on three types of typical urban sensing tasks, which verify the effectiveness of our proposed algorithms in improving the inference and long-term prediction accuracy with the sparse sensed data.

关键词：

来源：评论

学校读者我要写书评

暂无评论

A Coarse-to-fine Cascaded Evidence-Distillation Neural Network for Explainable Fake News Detection 29

A Coarse-to-fine Cascaded Evidence-Distillation Neural Netwo...

引用

29th International Conference on Computational Linguistics, COLING 2022

作者： Yang, Zhiwei Ma, Jing Chen, Hechang Lin, Hongzhan Luo, Ziyang Chang, Yi College of Computer Science and Technology Jilin University Changchun China Department of Computer Science Hong Kong Baptist University Hong Kong School of Artificial Intelligence International Center of Future Science Jilin University China Key Laboratory of Symbolic Computation and Knowledge Engineering Ministry of Education

Existing fake news detection methods aim to classify a piece of news as true or false and provide veracity explanations, achieving remarkable performances. However, they often tailor automated solutions on manual fact-checked reports, suffering from limited news coverage and debunking delays. When a piece of news has not yet been fact-checked or debunked, certain amounts of relevant raw reports are usually disseminated on various media outlets, containing the wisdom of crowds to verify the news claim and explain its verdict. In this paper, we propose a novel Coarse-to-fine Cascaded Evidence-Distillation (CofCED) neural network for explainable fake news detection based on such raw reports, alleviating the dependency on fact-checked ones. Specifically, we first utilize a hierarchical encoder for web text representation, and then develop two cascaded selectors to select the most explainable sentences for verdicts on top of the selected top-K reports in a coarse-to-fine manner. Besides, we construct two explainable fake news datasets, which is publicly available. Experimental results demonstrate that our model significantly outperforms state-of-the-art detection baselines and generates high-quality explanations from diverse evaluation perspectives. © 2022 Proceedings - International Conference on Computational Linguistics, COLING. All rights reserved.

关键词： Distillation

来源：评论

学校读者我要写书评

暂无评论

3DSEAVNet: 3D-Squeeze-and-Excitation Networks for Audio-Visual Saliency Prediction

3DSEAVNet: 3D-Squeeze-and-Excitation Networks for Audio-Visu...

引用

International Joint Conference on Neural Networks (IJCNN)

作者： Silong Liang Chunxiao Li Naying Cui Minghui Sun Hao Xue College of Software Engineering JiLin University Changchun China College of Computer Science and Technology JiLin University Changchun China Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education JiLin University Changchun China

Video saliency prediction is an important task in the field of computer vision. Most of the existing video saliency prediction methods only focus on image information, and the audio information is often ignored. This leads to an incomplete perception mode, which makes it difficult to achieve optimal performance. SENet is an excellent attention mechanism-based network. It significantly enhances the performance of 2D convolutional networks. However, whether the 3D convolutional network can be applied to this attention mechanism network remains to be studied. In order to solve the above problems, we propose a saliency prediction network for audio-visual fusion to extract and predict various information in videos. At the same time, we improve the traditional SENet to make it applicable in 3D convolutional neural networks and discuss its role. Compared with the state-of-the-art methods, our model has strong competitiveness in multiple data sets.

关键词：

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：