检索结果-内蒙古大学图书馆

Effective IoT-based crop disease prediction using localise search traversing coupled with deep convolutional neural network classifier

引用

International Journal of Wireless and Mobile Computing 2024年第2期26卷 168-181页

作者： Vani, B.V. Guruprakash, C.D. Information Science and Engineering Sri Siddhartha Institute of Technology Karnataka Tumkur India Department Computer Science and Engineering Sri Siddhartha Institute of Technology Karnataka Tumkur India

Predicting crop disease on the image obtained from the affected crop has been a potential research topic. In this research, the Localise Search Optimisation Algorithm (LSOA) enabled deep Convolutional Neural Network (deep CNN) is used to predict the crop disease for which the dominant statistical and texture features are utilised and LSOA as a training algorithm. The experiments were done on an apple data set and a corn data set, and the results show that the LSOA-deep CNN model attains 98.474% of accuracy, 92.837% of sensitivity and 99.00% of specificity in k-fold training data and 94.683% of accuracy, 95.489% of specificity and 99.00% of specificity with 80% training data for the corn data set. With the apple data set, the developed method achieves 94.587% of accuracy, 99.00% sensitivity and 99.00% specificity under k-fold training, while for the 80% of training, 97.959% accuracy, 96.233% sensitivity and 99.005% specificity are attained. © 2024 Inderscience Enterprises Ltd.

关键词： Wireless sensor networks

来源：评论

学校读者我要写书评

暂无评论

Automatic hate speech detection in audio using machine learning algorithms

引用

International Journal of Speech technology 2024年第2期27卷 447-469页

作者： Imbwaga, Joan L. Chittaragi, Nagatatna B. Koolagudi, Shashidhar G. Computer Science and Engineering National Institute of Technology Karnataka Karnataka Surathkal575025 India Information Science and Engineering Siddaganga Institute of Technology Karnataka Tumkur572103 India

Even though every individual is entitled to freedom of speech, some limitations exist when this freedom is used to target and harm another individual or a group of people, as it translates to hate speech. In this study, the proposed research deals with detection of hate speech for English and Kiswahili languages from audio. The dataset used in this work was collected manually from YouTube videos and then converted to audio. Audio-based features namely spectral, temporal, prosodic and excitation source features were extracted and used to train various machine learning classifiers. Initial experiments were conducted for English language and later on for Kiswahili language. However, it is observed from literature that research activities on Kiswahili language is comparatively lesser. The scores calculated for accuracy, recall, precision, auc and f1 score in detecting hate speech, suggest that Random Forest classifier performed better for English language while the Extreme Gradient Boosting classifier performed better for Kiswahili language. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： Machine learning

来源：评论

学校读者我要写书评

暂无评论

Video forgery localization using inter-frame denoising and intra-frame segmentation

引用

Multimedia Tools and Applications 2025年 1-17页

作者： Banerjee, Debanik Chittaragi, Nagaratna B. Koolagudi, Shashidhar G. Department of Computer Science and Engineering National Institute of Technology Karnataka Karnataka Surathkal575025 India Department of Information Science and Engineering Siddaganaga Institute of Technology Karnataka Tumkur572103 India

Video forgery detection has been necessary with recent spurt in fake videos like Deepfakes and doctored videos from multiple video capturing devices. In this paper, we provide a novel technique of detecting fake videos by creating an ensemble network, based on statistical and deep learning methods to detect interframe forgery and intraframe forgery in forged videos separately. In this paper, Noise signature extraction of a particular image capturing sensor and an Autoencoder-based Convolutional Neural Network model (CNN) are used to localize the forged regions. We have trained the model to localize Deepfake video forgeries as well as copy-paste forgeries with effective results in the test data. The proposed fake video detector can be applied at the back-end of on-line video aggregating services and check their authenticity to verify the genuineness of videos. The results achieved have shown better performances in detecting fake videos compared to existing methods. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2025.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

OCRBench: on the hidden mystery of OCR in large multimodal models

引用

science China(Information sciences) 2024年第12期67卷 23-35页

作者： Yuliang LIU Zhang LI Mingxin HUANG Biao YANG Wenwen YU Chunyuan LI Xu-Cheng YIN Cheng-Lin LIU Lianwen JIN Xiang BAI School of Artificial Intelligence and Automation Huazhong University of Science and Technology School of Electronic and Information Engineering South China University of Technology Microsoft Research School of Computer & Communication Engineering University of Science and Technology Beijing Institute of Automation Chinese Academy of Sciences School of Software Engineering Huazhong University of Science and Technology

Large models have recently played a dominant role in natural language processing and multimodal vision-language learning. However, their effectiveness in text-related visual tasks remains relatively unexplored. In this paper, we conducted a comprehensive evaluation of large multimodal models, such as GPT4V and Gemini, in various text-related visual tasks including text recognition, scene text-centric visual question answering(VQA), document-oriented VQA, key information extraction(KIE), and handwritten mathematical expression recognition(HMER). To facilitate the assessment of optical character recognition(OCR) capabilities in large multimodal models, we propose OCRBench, a comprehensive evaluation benchmark. OCRBench contains 29 datasets, making it the most comprehensive OCR evaluation benchmark available. Furthermore, our study reveals both the strengths and weaknesses of these models, particularly in handling multilingual text, handwritten text, non-semantic text, and mathematical expression *** importantly, the baseline results presented in this study could provide a foundational framework for the conception and assessment of innovative strategies targeted at enhancing zero-shot multimodal *** evaluation pipeline and benchmark are available at https://***/Yuliang-Liu/Multimodal OCR.

关键词： large multimodal model OCR text recognition scene text-centric VQA document-oriented VQA key information extraction handwritten mathematical expression recognition

来源：评论

学校读者我要写书评

暂无评论

Robust clustering of Ethereum transactions using time leakage from fixed nodes

引用

Blockchain(Research and Applications) 2023年第1期4卷 48-57页

作者： Congcong Yu Chen Yang Zheng Che Liehuang Zhu School of Computer Science Beijing Institute of TechnologyBeijing100081China School of Cyberspace Science and Technology Beijing Institute of TechnologyBeijing100081China

Ethereum has received increasing attention as the first blockchain platform to support smart *** mining has become an important tool for analyzing Ethereum ***,existing methods have the disadvantage of covering partial transactions and being vulnerable to privacy-enhancing *** this paper,we propose a scheme for transaction correlation with the node as an entity,which can cover all transactions while being resistant to privacy-enhancing *** timestamps relayed from N fixed nodes to describe the network properties of transactions,we cluster transactions that enter the network from the same source *** results show that our method can determine with 97%precision whether two transactions enter the network from the same source node.

关键词： Blockchain Ethereum Transaction correlation Network analysis Data mining

来源：评论

学校读者我要写书评

暂无评论

Hybrid RMDL-CNN for speech recognition from unclear speech signal

引用

International Journal of Speech technology 2025年第1期28卷 195-217页

作者： Bhargava, Raja Arivazhagan, N. Babu, Kunchala Suresh Research Scholar Department of Computer Science and Engineering SRM Institute of Science and Technology Kattankulathur Chennai India Department of Computational Intelligence SRM Institute of Science and Technology Kattankulathur Chennai India Department of Computer Science and Engineering Potti Sriramulu Chaluvadi Mallikarjuna Rao college of Engineering and Technology Andhra Pradesh Vijayawada India

ASR is an effectual approach, which converts human speech into computer actions or text format. It involves extracting and determining the noise feature, the audio model, and the language model. The extraction and determination of the noise feature is a crucial aspect of speech recognition, serving as both a process of information compression and signal deconvolution. ASR schemes are mostly employed in smart homes, smart appliances, and biometric schemes. Yet, traditional approaches offer very low performance because of a noisy environment. Moreover, local differences and accents negatively influence the ASR scheme execution during the conversion of the speech signals. This paper introduces a hybrid RMDL-CNN method to address these challenges. At first, the input of unclear speech is carried out by the dataset. Then, signal pre-processing is done by employing a Gaussian filter. After that, voice enhancement is accomplished by employing nonlinear spectral subtraction. Later, the speech word is segmented from the enhanced output based on the Attentional Encoder-Decoder approach and finally, the speech is recognized using the proposed RMDL-CNN. The RMDL-CNN method is devised by the combination of RMDL and CNN. Furthermore, the established RMDL-CNN is accessed for its efficiency based on several values of k-group value, as well as learning data. In addition, the introduced RMDL-CNN approach for speech recognition achieved better accuracy, PPV, as well as NPV of 0.909, 0.947, and 0.917 for dataset 1. Moreover, the RMDL-CNN has achieved the highest accuracy of 0.909, PPV of 0.926 and NPV of 0.888 for dataset 2. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2025.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

Explainable hate speech detection using LIME

引用

International Journal of Speech technology 2024年第3期27卷 793-815页

作者： Imbwaga, Joan L. Chittaragi, Nagaratna B. Koolagudi, Shashidhar G. Computer Science and Engineering National Institute of Technology Karnataka Karnataka Surathkal575025 India Information Science and Engineering Siddaganga Institute of Technology Karnataka Tumkur572103 India

Free speech is essential, but it can conflict with protecting marginalized groups from harm caused by hate speech. Social media platforms have become breeding grounds for this harmful content. While studies exist to detect hate speech, there are significant research gaps. First, most studies used text data instead of other modalities such as videos or audio. Second, most studies explored traditional machine learning algorithms. However, due to the increase in complexities of computational tasks, there is need to employ complex techniques and methodologies. Third, majority of the research studies have either been evaluated using very few evaluation metrics or not statistically evaluated at all. Lastly, due to the opaque, black-box nature of the complex classifiers, there is need to use explainability techniques. This research aims to address these gaps by detecting hate speech in English and Kiswahili languages using videos manually collected from YouTube. The videos were converted to text and used to train various classifiers. The performance of these classifiers was evaluated using various evaluation and statistical measurements. The experimental results suggest that the random forest classifier achieved the highest results for both languages across all evaluation measurements compared to all classifiers used. The results for English language were: accuracy 98%, AUC 96%, precision 99%, recall 97%, F1 98%, specificity 98% and MCC 96% while the results for Kiswahili language were: accuracy 90%, AUC 94%, precision 93%, recall 92%, F1 94%, specificity 87% and MCC 75%. These results suggest that the random forest classifier is robust, effective and efficient in detecting hate speech in any language. This also implies that the classifier is reliable in detecting hate speech and other related problems in social media. However, to understand the classifiers’ decision-making process, we used the Local Interpretable Model-agnostic Explanations (LIME) technique to explain the

关键词： Random forests

来源：评论

学校读者我要写书评

暂无评论

A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback 41

A Unified Linear Programming Framework for Offline Reward Le...

引用

41st International Conference on Machine Learning, ICML 2024

作者： Kim, Kihyun Zhang, Jiawei Ozdaglar, Asuman Parrilo, Pablo Department of Electrical Engineering and Computer Science Massachusetts Institute of Technology Cambridge United States

Inverse Reinforcement Learning (IRL) and Reinforcement Learning from Human Feedback (RLHF) are pivotal methodologies in reward learning, which involve inferring and shaping the underlying reward function of sequential decision-making problems based on observed human demonstrations and feedback. Most prior work in reward learning has relied on prior knowledge or assumptions about decision or preference models, potentially leading to robustness issues. In response, this paper introduces a novel linear programming (LP) framework tailored for offline reward learning. Utilizing pre-collected trajectories without online exploration, this framework estimates a feasible reward set from the primal-dual optimality conditions of a suitably designed LP, and offers an optimality guarantee with provable sample efficiency. Our LP framework also enables aligning the reward functions with human feedback, such as pairwise trajectory comparison data, while maintaining computational tractability and sample efficiency. We demonstrate that our framework potentially achieves better performance compared to the conventional maximum likelihood estimation (MLE) approach through analytical examples and numerical experiments. Copyright 2024 by the author(s)

关键词： Contrastive Learning

来源：评论

学校读者我要写书评

暂无评论

LLM chatbots as a language practice tool: a user study 13

LLM chatbots as a language practice tool: a user study

引用

13th Workshop on Natural Language Processing for computer Assisted Language Learning, NLP4CALL 2024

作者： Tyen, Gladys Caines, Andrew Buttery, Paula ALTA Institute Dept. of Computer Science & Technology University of Cambridge United Kingdom

ISBN: (纸本)9789180757744

Second language learners often experience language anxiety when speaking with others in their target language. As the generative capabilities of Large Language Models (LLMs) continue to improve, we investigate the possibility of using an LLM as a conversation practice tool. We conduct a user study with 160 English language learners, where an LLM chatbot is used to simulate real-world conversations. We present our findings on 1) how an interactive session with a chatbot might impact performance in real-world conversations;2) whether the learning experience differs for learners of different proficiency levels;3) how changes in difficulty affects the learner’s experience;and 4) how online, synchronous conversation provided by an LLM compares with a purely receptive experience. Additionally, we propose a simple yet effective way to detect linguistic complexity on-the-fly: clicking on words to reveal dictionary definitions. We demonstrate that clicks correlate well with linguistic complexity and indicate which words learners find difficult to understand. © 2024 NLP4CALL. All Rights Reserved.

关键词： Contrastive Learning

来源：评论

学校读者我要写书评

暂无评论

A Review on Smart Healthcare employing Quantum Internet of Things

引用

IEEE Engineering Management Review 2024年 1-9页

作者： Sutradhar, Kartick Venkatesh, Ranjitha Venkatesh, Priyanka Department of Computer Science and Engineering Indian Institute of Information Technology Sri City India Department of Computer Science and Engineering GITAM School of Technology Bengaluru India Department of Computer Science and Engineering Presidency University Bengaluru India

The Quantum Internet of Things (QIoT) in the healthcare industry holds the promise of transforming patient care, diagnostics, and medical research. Quantum-enhanced sensors, communication, and computation offer unprecedented capabilities that can revolutionize how healthcare services are delivered and experienced. This paper explores the potential of QIoT in the context of smart healthcare, where interconnected quantum-enabled devices and systems create an ecosystem that enhances data security, enables real-time monitoring, and advances medical knowledge. We delve into the applications of quantum sensors in precise health monitoring, the role of quantum communication in secure telemedicine, and the computational power of quantum computing in drug discovery and personalized medicine. We discuss challenges such as technical feasibility, scalability, and regulatory considerations, along with the emerging trends and opportunities in this transformative field. By examining the intersection of quantum technologies and smart healthcare, this paper aims to shed light on the novel approaches and breakthroughs that could redefine the future of healthcare delivery and patient outcomes. IEEE

关键词： Internet of things

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：