检索结果-内蒙古大学图书馆

3rd International Conference on Electrical, Electronics, Information and Communication Technologies, ICEEICT 2024

作者： Mathur, Abeer Tejpal, Moulik Bhargava, Kshitiz Natarajan, Krishnaraj Singh, Manvendra Vellore Institute of Technology School of Computer Science & Engineering Department of Software Systems Vellore India Vellore Institute of Technology School of Computer Science & Engineering Department of Database Systems Vellore India

ISBN: (纸本)9798350369083

The extensive spread of DeepFake images on the internet has emerged as a significant challenge, with applications ranging from harmless entertainment to harmful acts like blackmail, misinformation, and spreading false propaganda. To tackle this issue, this paper introduces a sophisticated DeepFake detection model designed to identify and mitigate the increase of these deceptive images. The model architecture integrates an ensemble approach, combining the strengths of two pre-trained Convolutional Neural Network (CNN) models - MobileNet and Xception - with a novel CNN architecture, the Advanced CNN (ACNN). This rigorous validation process enabled the model to achieve a high accuracy rate of 97.89% in detecting DeepFakes. The successful implementation of this ensemble CNN approach demonstrates its effectiveness in distinguishing between real and fabricated imagery with high precision. This research makes a substantial contribution to the field of digital image forensics, offering a reliable tool for stakeholders across various sectors to identify and counteract the spread of DeepFake images online. © 2024 IEEE.

关键词： Generative adversarial networks

来源：评论

学校读者我要写书评

暂无评论

PAR-Net: An Enhanced Dual-Stream CNN-ESN Architecture for Human Physical Activity Recognition

引用

SENSORS 2024年第6期24卷 1908页

作者： Khan, Imran Ullah Lee, Jong Weon Sejong Univ Dept Software Mixed Real & Interact Lab Seoul 05006 South Korea

Physical exercise affects many facets of life, including mental health, social interaction, physical fitness, and illness prevention, among many others. Therefore, several AI-driven techniques have been developed in the literature to recognize human physical activities. However, these techniques fail to adequately learn the temporal and spatial features of the data patterns. Additionally, these techniques are unable to fully comprehend complex activity patterns over different periods, emphasizing the need for enhanced architectures to further increase accuracy by learning spatiotemporal dependencies in the data individually. Therefore, in this work, we develop an attention-enhanced dual-stream network (PAR-Net) for physical activity recognition with the ability to extract both spatial and temporal features simultaneously. The PAR-Net integrates convolutional neural networks (CNNs) and echo state networks (ESNs), followed by a self-attention mechanism for optimal feature selection. The dual-stream feature extraction mechanism enables the PAR-Net to learn spatiotemporal dependencies from actual data. Furthermore, the incorporation of a self-attention mechanism makes a substantial contribution by facilitating targeted attention on significant features, hence enhancing the identification of nuanced activity patterns. The PAR-Net was evaluated on two benchmark physical activity recognition datasets and achieved higher performance by surpassing the baselines comparatively. Additionally, a thorough ablation study was conducted to determine the best optimal model for human physical activity recognition.

关键词： physical activity recognition deep learning machine learning skeleton data echo state networks

来源：评论

学校读者我要写书评

暂无评论

PNSP: Overcoming catastrophic forgetting using Primary Null Space Projection in continual learning

引用

PATTERN RECOGNITION LETTERS 2024年 179卷 137-143页

作者： Zhou, DaiLiang Song, YongHong Xi An Jiao Tong Univ Sch Software Engn Xian 710049 Peoples R China

Continual Learning (CL) plays a crucial role in enhancing learning performance for both new and previous tasks in continuous data streams, thus contributing to the advancement of cognitive computing. However, CL faces a fundamental challenge known as the stability -plasticity quandary. In this research, we present an innovative and effective CL algorithm called Primary Null Space Projection (PNSP) to strike a balance between network plasticity and stability. PNSP consists of three main components. Firstly, it leverages the NSP-LRA algorithm to project the gradient of network parameters from previous tasks into a meticulously designed null space. NSP-LRA harnesses high -dimensional geometric information extracted from the feature covariance matrix through low -rank approximation algorithm to obtain the basis of null space dynamically. This process constructs an innovation null space and ensures the continuous updating of orthonormal bases to accommodate changes in the input data. Secondly, we propose a Consistency -guided Task -specific Feature Learning (CTFL) mechanism to tackle the issue of catastrophic forgetting and facilitate continual learning. CTFL achieves this by aligning feature vectors and maintaining consistent feature learning directions, thereby preventing the loss of previously acquired knowledge. Lastly, we introduce Label Guided Self -Distillation (LGSD), a technique that utilizes true labels to guide the distillation process and incorporates a dynamic temperature mechanism to enhance performance. To evaluate the effectiveness of our proposed method, we conduct experiments on the CIFAR100 and TinyImageNet datasets. The results demonstrate significant improvements over state-of-the-art methods. We have made the implementation code of our approach available for reference.

关键词： Continual learning Catastrophic forgetting Null space Low-rank approximation Feature alignment

来源：评论

学校读者我要写书评

暂无评论

Remote operation of the DIII-D National Fusion Facility

引用

NUCLEAR FUSION 2024年第7期64卷 076004-076004页

作者： Schissel, D. P. Cho, E. Flanagan, S. Garcia, F. Liu, C. Margo, M. Nguyen, J. Nguyen, P. Parker, C. Penaflor, B. Pederson, T. Piglowski, D. Rivas, E. Shapov, R. Shen, H. Short, B. Waddell, T. Kalling, R. Gen Atom San Diego CA 92121 USA Kalling Software Kirkland WA USA

Full remote scientific operation of the DIII-D National Fusion Facility is now possible through significant advances in the computer science hardware and software infrastructure made over the last decade. Capabilities around information visualization, data movement, and communication have all been enhanced. The level of capability deployed to remotely operate DIII-D required an infrastructure advancement over what had previously been achieved in the fusion community. The large quantity of real-time data that is automatically displayed on DIII-D's control room screens can now be visualized by remote participants via web-based applications. New audio/video solutions using the VoIP and instant messaging application Discord have been implemented to mimic the dynamic and ad-hoc scientific conversations that are critical in successfully operating an experimental campaign. Discord's ability for a user to rapidly move between audio channels, text with images, and share screens is a significant enhancement over traditional videoconferencing tools. In addition, multiple combinations of broadcast audio are made available via a web-based application to allow remote participants to simultaneously listen to general announcements/sounds while conducting their own specific conversations. Secure methodologies have been put into place to allow remote control of hardware including DIII-D's plasma control system application. Secure methods also included the ability of the on-site team to closely coordinate their work with remote team members which has been enhanced through extensions to the wireless network and the use of tablet computers for audio/video/screen sharing. However, no amount of software can fully replace the need for 'hands on hardware.' This infrastructure was severely stress tested during the COVID-19 pandemic where occupancy of the DIII-D control room was restricted. Operational efficiency during the pandemic, measured in discharges per hour, remained high (3.8 +/- 0.8

关键词： DIII-D Tokamak remote operation remote collaboration

来源：评论

学校读者我要写书评

暂无评论

AG-HCRL: Air-Ground Collaborative Crowdsensing Based on Deep Reinforcement Learning in City Sensing 10

AG-HCRL: Air-Ground Collaborative Crowdsensing Based on Deep...

引用

10th IEEE Smart World Congress, SWC 2024

作者： Zhao, Kaixing Zhou, Yingying Xue, Huiwen Ding, Lige He, Liang Guo, Bin Northwestern Polytechnical University School of Software Xi'an China Beijing University of Posts and Telecommunications School of Computer Science Beijing China Northwestern Polytechnical University School of Computer Science Xi'an China

ISBN: (纸本)9798331520861

The rapid development of unmanned technology has brought new opportunities for mobile sensing in different fields. Naturally, traditional mobile crowdsensing (MCS) based on mobile device users and unmanned vehicle sensing (UVS) based on unmanned aerial vehicle (UAV) and unmanned ground vehicle (UGV) are especially complementary when performing sensing tasks. To further improve the intelligence in city sensing tasks, we propose an air-ground collaborative crowdsourcing framework based on deep reinforcement learning, called AG-HCRL, which is a heterogeneous crowdsensing design leveraging both manpower and unmanned vehicles (including aerial and ground). The proposed AG-HCRL framework models the different characteristics of MCS as well as UVS, achieving high-quality city sensing in a cost-effective way. A series of extensive evaluations based on simulations confirm the validity and superiority of AG-HCRL in simulated city sensing. © 2024 IEEE.

关键词： Deep reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

Robust Activity Recognition Based on Human Skeleton for Video Surveillance 9

Robust Activity Recognition Based on Human Skeleton for Vide...

引用

9th IEEE Smart World Congress, SWC 2023

作者： Zhang, Xiang Xie, Lei Bu, Yanling Lin, Zhenjie Wang, Liming Nanjing University State Key Laboratory for Novel Software Technology China Nanjing University of Aeronautics and Astronautics College of Computer Science and Technology China China Southern Power Grid Digital Platform Technology Company China

ISBN: (纸本)9798350319804

Activity recognition is an important task in video analysis, which can be used in accident monitoring and other daily applications. Traditional activity recognition methods are mainly based on the pixel-level analysis of 2D images. However, they achieve poor robustness in various complex environments, and are vulnerable to the perspective distortion caused by the fixed camera view. To address these challenges, we propose a robust skeleton-based human activity recognition method using a fixed monocular surveillance camera. We encode human skeleton with more critical motion information like pairwise distances between keypoints to capture high-level motion modality. Besides, we normalize skeleton data to eliminate the defects of 2D frames, such as the impact of distance on skeleton scale. Furthermore, we propose a skeleton calibration method based on perspective transformation to adapt our method to the deployment environment of surveillance cameras, especially different downward pitch angles. Experimental results show the recognition accuracy of our system reaches 91 percent with a frame rate of 10 FPS. © 2023 IEEE.

关键词： Activity recognition Human skeleton Surveillance video

来源：评论

学校读者我要写书评

暂无评论

Modeling full information with graph network for joint entity-relation extraction 4

Modeling full information with graph network for joint entit...

引用

4th International Conference on Algorithms, Computing and Artificial Intelligence, ACAI 2021

作者： Wan, Qian Wei, Luona Chen, Xinhai Liu, Jie Science and Technology on Parallel and Distributed Processing Laboratory National University of Defense Technology Hunan Changsha China College of Systems Engineering National University of Defense Technology Hunan Changsha China Laboratory of Software Engineering for Complex Systems National University of Defense Technology Hunan Changsha China

ISBN: (纸本)9781450385053

Fully capturing contextual information and analyzing the association between entity semantics and type is helpful for joint extraction task: 1) The context can reflect the part of speech and semantics of entity. 2) The entity type is closely related to the relation between entities. Previous research used to simply embed the contextual information into shallow layer of the model, ignoring the association between entity semantics and type. In this paper, we propose a graph network with full-information modeling to explicitly model different-level information in the text. The contextual information of entity is dynamically embedded in each span representation to improve the reasoning ability. To capture the fine-grained association between the semantics and type of entity, the graph network uses the feature of entity types to generate edge information between different nodes. Experimental results show that our model outperforms previous models on the CoNLL04 dataset and obtains competitive results on the SciERC dataset in both entity recognition and relation extraction. Extensive additional experiments further verify the effectiveness of the model. © 2021 ACM.

关键词： Natural language processing systems

来源：评论

学校读者我要写书评

暂无评论

Image Tampering Detection With Frequency-Aware Attention and Multiview Fusion

IEEE Transactions on Artificial Intelligence

引用

IEEE Transactions on Artificial Intelligence 2025年第3期6卷 614-625页

作者： Xu, Xu Chen, Junxin Lv, Wenrui Wang, Wei Zhang, Yushu Dalian University of Technology School of Software Dalian116621 China Northeastern University School of Computer Science and Engineering Shenyang110004 China Shenzhen MSU-BIT University Guangdong-Hong Kong-Macao Joint Laboratory for Emotion Intelligence and Pervasive Computing Artificial Intelligence Research Institute Shenzhen518172 China Jiangxi University of Finance and Economics School of Computing and Artificial Intelligence Nanchang330013 China

Manipulated images are flooding our daily lives, which poses a threat to social security. Recently, many studies have focused on image tampering detection. However, they have poor performance on independent validation due to differences in image scenes and tampering methods. The key question is how to design a network that is able to adaptively enhance the tampering information and suppress the generalization features during training. To this end, we propose a dual-branch network with a frequency adaptation paradigm and a feature fusion module for robust tampering image detection. First, this paradigm is designed to adaptively highlight tampering features through frequency conversion and learnable weight. Second, a feature fusion module is developed to filter redundant features and dynamically fuse two-branch features. Experiments on eight typical datasets demonstrate that our model has advantages over state-of-the-art algorithms, and our paradigm can well empower semantic segmentation networks for tampering detection. © 2024 IEEE.

关键词： Semantic Segmentation

来源：评论

学校读者我要写书评

暂无评论

DOC: Text Recognition via Dual Adaptation and Clustering

引用

IEEE TRANSACTIONS ON MULTIMEDIA 2023年 25卷 9071-9081页

作者： Ding, Xue-Ying Liu, Xiao-Qian Luo, Xin Xu, Xin-Shun Shandong Univ Sch Software Jinan 250101 Peoples R China

More recently, unsupervised domain adaptation has been introduced to text image recognition tasks for serious domain shift problem, which can transfer knowledge from source domains to target ones. Moreover, in unsupervised domain adaptation for text recognition, there is no label information in the target domain to supervise the domain adaptation, especially at the character. Several existing methods regard a text image as a whole and perform only on global feature adaptation, neglecting local-level feature adaptation, i.e., characters. Others methods only focus their attention on word-level feature alignment while ignoring the categories of local-level characters. To address these issues, we propose a text recognition model via Dual adaptatiOn and Clustering, DOC for short. Regarding word-level, we construct a Global Discriminator for global feature adaptation to reduce text layout bias between source and target domains. Regarding character-level, we propose an Adaptive Feature Clustering (AFC) module, which can extract invariant character features through a local-level discriminator for adaptation. Moreover, it enhances the local-feature adaptation by a clustering scheme, which evaluates the feature adaptation by leveraging the knowledge from the source domain as much as possible. In this way, it can pay more attention to the differences in fine-grained characters. Extensive experiments on benchmark datasets demonstrate that our framework can achieve state-of-the-art performance.

关键词： Feature extraction Adaptation models Text recognition Task analysis Image recognition Training Data models unsupervised domain adaptation domain shift clustering

来源：评论

学校读者我要写书评

暂无评论

Integrating Real-Time and Non-Real-Time Collaborative Programming: Workflow, Techniques, and Prototypes

引用

Proceedings of the ACM on Human-Computer Interaction 2023年第GROUP期7卷 1-19页

作者： Ma, Yifan Qi, Batu Xu, Wenhua Wang, Mingjie Du, Bowen Fan, Hongfei School of Software Engineering Tongji University Shanghai China College of Design and Innovation Tongji University Shanghai China Department of Computer Science University of Warwick Coventry United Kingdom

Real-time collaborative programming enables a group of programmers to edit shared source code at the same time, which significantly complements the traditional non-real-time collaborative programming supported by version control systems. However, one critical issue with this emerging technique is the lack of integration with non-real-time collaboration. Specifically, contributions from multiple programmers in a real-time collaboration session cannot be distinguished and accurately recorded in the version control system. In this study, we propose a scheme that integrates real-time and non-real-time collaborative programming with a novel workflow, and contribute enabling techniques to realize such integration. As a proof-of-concept, we have successfully implemented two prototype systems named CoEclipse and CoIDEA, which allow programmers to closely collaborate in a real-time fashion while preserving the work's compatibility with traditional non-real-time collaboration. User evaluation and performance experiments have confirmed the feasibility of the approach and techniques, demonstrated the good system performance, and presented the satisfactory usability of the prototypes. © 2023 ACM.

关键词： Control systems

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：