检索结果-内蒙古大学图书馆

A Novel Deep Auto-Encoder Based Linguistics Clustering Model for Social Text

ACM Transactions on Asian and Low-Resource Language Information Processing 1000年

作者： Muhammad Waseem Akram Muhammad Salman Muhammad Farrukh Bashir Syed Muhammad Saad Salman Thippa Reddy Gadekallu Abdul Rehman Javed COMSATS University Islamabad Pakistan The Australian National University Australia Riphah International University Pakistan National University of Computer and Emerging Sciences Pakistan School of Information Technology and Engineering Vellore Institute of Technology India Department of Cyber Security Air University Pakistan

The wide adoption of media and social media has increased the amount of digital content to an enormous level. Natural language processing (NLP) techniques provide an opportunity to extract and explore meaningful information from a large amount of text. Among natural languages, Urdu is one of the widely used languages worldwide for spoken and written communications. Due to its wide adopt-ability, digital content in the Urdu language is increasing briskly, especially with social media and online NEWS feeds. Government agencies and advertisers must filter and understand the content to analyze the trends and cohorts in their interest and national prerogative. Clustering is considered a baseline and one of the first steps in natural language understanding. There are many state-of-the-art clustering techniques specifically for English, French, and Arabic, but no significant research has been conducted in Urdu language processing. Doing it for short text segments is challenging because of limited features and the absence of meaningful language discourse and nuance. Many rule-based NLP techniques are adopted to overcome these issues, relying on human-designed features and rules. Therefore, these methods do not promise remarkable results. Alongside NLP, deep learning techniques are pretty efficient in capturing contextual information with minimal noise compared to other traditional methods. By taking on this challenging job, we develop a deep learning-based technique for Urdu short text clustering for the very first time without a human-designed feature. In this paper, we propose a method of short text clustering using a deep neural network that automatically learns feature representations and clustering assignments simultaneously. This method learns clustering objectives by converting the high dimensional feature space to a low dimensional feature space. Our experiments on the Urdu NEWS headlines dataset show remarkable results compared to state-of-the-art methods.

关键词： Clustering Urdu Social media Text Low resource language

来源：评论

学校读者我要写书评

暂无评论

TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model

引用

ACM Transactions on Multimedia Computing, Communications, and Applications 1000年

作者： Jiahao Lyu Jin Wei Gangyan Zeng Zeng Li Enze Xie Wei Wang Can Ma Yu Zhou Institute of Information Engineering Chinese Academy of Sciences & School of Cyber Security University of Chinese Academy of Sciences China Lenovo Research China School of Cyber Science and Engineering Nanjing University of Science and Technology China Shanghai Artificial Intelligence Laboratory China Institute of Information Engineering Chinese Academy of Sciences China VCIP & TMCC & DISSec College of Computer Science Nankai University China

Existing scene text spotters are designed to locate and transcribe texts from images. However, it is challenging for a spotter to achieve precise detection and recognition of scene texts simultaneously. Inspired by the glimpse-focus spotting pipeline of human beings and impressive performances of Pre-trained Language Models (PLMs) on visual tasks, we ask: 1) “Can machines spot texts without precise detection just like human beings?”, and if yes, 2)“Is text block another alternative for scene text spotting other than word or character?” To this end, our proposed scene text spotter leverages advanced PLMs to enhance performance without fine-grained detection. Specifically, we first use a simple detector for block-level text detection to obtain rough positional information. Then, we finetune a PLM using a large-scale OCR dataset to achieve accurate recognition. Benefiting from the comprehensive language knowledge gained during the pre-training phase, the PLM-based recognition module effectively handles complex scenarios, including multi-line, reversed, occluded, and incomplete-detection texts. Taking advantage of the fine-tuned language model on scene recognition benchmarks and the paradigm of text block detection, extensive experiments demonstrate the superior performance of our scene text spotter across multiple public benchmarks. Additionally, we attempt to spot texts directly from an entire scene image to demonstrate the potential of PLMs, even Large Language Models (LLMs).

关键词： Scene Text Spotting Pre-trained Language Model

来源：评论

学校读者我要写书评

暂无评论

Mobile Application Development: Practice and Experience 1

引用

丛书名： Studies in Systems, Decision and Control

1000年

作者： Jagannath Singh Debasish Das Lov Kumar Aneesh Krishna

来源：评论

学校读者我要写书评

暂无评论

Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application

引用

ACM Transactions on Intelligent Systems and technology 1000年

作者： Chuanpeng Yang Yao Zhu Wang Lu Yidong Wang Qian Chen Chenlong Gao Bingjie Yan Yiqiang Chen Institute of Information Engineering Chinese Academy of Sciences & School of Cyber Security University of Chinese Academy of Sciences China Zhejiang University China Tsinghua University China Peking University China Institute of Computing Technology Chinese Academy of Sciences University of Chinese Academy of Sciences China Institute of Computing Technology Chinese Academy of Sciences China

Large Language Models (LLMs) have showcased exceptional capabilities in various domains, attracting significant interest from both academia and industry. Despite their impressive performance, the substantial size and computational demands of LLMs pose considerable challenges for practical deployment, particularly in environments with limited resources. The endeavor to compress language models while maintaining their accuracy has become a focal point of research. Among the various methods, knowledge distillation has emerged as an effective technique to enhance inference speed without greatly compromising performance. This paper presents a thorough survey from three aspects: method, evaluation, and application, exploring knowledge distillation techniques tailored specifically for LLMs. Specifically, we divide the methods into white-box KD and black-box KD to better illustrate their differences. Furthermore, we also explored the evaluation tasks and distillation effects between different distillation methods, and proposed directions for future research. Through in-depth understanding of the latest advancements and practical applications, this survey provides valuable resources for researchers, paving the way for sustained progress in this field.

关键词： Knowledge Distillation Large Language Models Evaluation

来源：评论

学校读者我要写书评

暂无评论

Information security, Privacy and Digital forensics 1

引用

丛书名： Lecture Notes in Electrical engineering

1000年

作者： Sankita J. Patel Bhavesh N. Gohil Naveen Kumar Chaudhary S. S. Iyengar

来源：评论

学校读者我要写书评

暂无评论

Interventional Feature Generation for Few-shot Learning

引用

ACM Transactions on Multimedia Computing, Communications, and Applications 1000年

作者： Shuo Wang Jinda Lu Huixia Ben Yanbin Hao Xingyu Gao Meng Wang School of Data Science School of Information Science and Technology University of Science and Technology of China China School of Cyber Science and Technology University of Science and Technology of China China School of Public Security and Emergency Management Anhui University of Science and Technology China School of Computer Science and Information Engineering School of Artificial Intelligence Hefei University of Technology China Institute of Microelectronics Chinese Academy of Sciences and University of Chinese Academy of Sciences Beijing China

Few-shot learning (FSL) aims to classify a novel object into a specific category under limited training samples. This is a challenging task since (1) the features expressed by pre-trained knowledge introduce perceived bias and then constrain the classification space, and (2) the use of general hallucination techniques based on global features fails to escape the limited classification space, resulting in suboptimal improvements. To solve these issues, this paper proposes an interventional feature generation (IFG) method. Specifically, we first use the relations of the categories or instances as interventional operations to implicitly constrain the feature representations (pre-trained knowledge) into different classification subsets. Then, we employ a parameter-free feature generation strategy to enrich each subset’s training samples of the support category. In other words, IFG provides a multi-subsets learning strategy to reduce the influence of perceived bias, enrich the diversity of generated features, and improve the robustness of the few-shot classifier. We apply our method to four benchmark datasets and observe state-of-the-art performance across all experiments. Specifically, compared to the baseline on the Mini-ImageNet dataset, our approach yields accuracy improvements of 6.03% and 3.46% for 1 and 5 support training samples, respectively. Furthermore, the proposed interventional feature generation technique can improve classifier performance in other FSL methods, demonstrating its versatility and potential for broader applications. The code is available at https://***/ShuoWangCS/IFG-FSL/.

关键词： Few-Shot Learning Interventional Operation Data Hallucination

来源：评论

学校读者我要写书评

暂无评论

Artificial Intelligence in Practice 1

Artificial Intelligence in Practice

引用

丛书名： Signals and Communication technology

1000年

作者： S.S. Iyengar Seyedsina Nabavirazavi Yashas Hariprasad Prasad HB C. Krishna Mohan

ISBN: (数字)9783031893278

ISBN: (纸本)9783031893261;9783031893292

This book provides a comprehensive exploration of how Artificial Intelligence (AI) is being applied in the fields of cyber security and digital forensics. The book delves into the cutting-edge techniques that are reshaping the way we protect and investigate digital information. From identifying cyber threats in real-time to uncovering hidden evidence in complex digital cases, this book offers practical insights and real-world examples. Whether you’re a professional in the field or simply interested in understanding how AI is revolutionizing digital security, this book will guide you through the latest advancements and their implications for the future. Includes application of AI in solving real cyber security and digital forensics challenges, offering tangible examples; Shows how AI methods from machine / deep learning to NLP can be used for cyber defenses and in forensic investigations; Explores emerging trends and future possibilities, helping readers stay ahead of the curve in a rapidly evolving field.

关键词： Communications engineering, Networks Artificial Intelligence Systems and Data security

来源：评论

学校读者我要写书评

暂无评论

cyber security and Digital forensics 1

引用

丛书名： Lecture Notes in Networks and Systems

1000年

作者： Nihar Ranjan Roy Sudeep Tanwar Usha Batra

来源：评论

学校读者我要写书评

暂无评论

New Contribution to the Analysis and Evaluation of Safety Architectures 1

引用

1000年

作者： Ossmane Krini

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：