检索结果-内蒙古大学图书馆

Automated Detection of Offensive images and Sarcastic Memes in Social Media Through NLP

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND applications 2024年第7期15卷 1415-1425页

作者： Purnima, Tummala Rao, Ch Koteswara VIT AP Univ Sch Comp Sci Near Vijayawada Amaravati 522237 Andhra Pradesh India

In this digital era, social media is one of the key platforms for collecting customer feedback and reflecting their views on various aspects, including products, services, brands, events, and other topics of interest. However, there is a rise of sarcastic memes on social media, which often convey contrary meaning to the implied sentiments and challenge traditional machine learning identification techniques. The memes, blending text and visuals on social media, are difficult to discern solely from the captions or images, as their humor often relies on subtle contextual cues requiring a nuanced understanding for accurate interpretation. Our study introduces Offensive images and Sarcastic Memes Detection to address this problem. Our model employs various techniques to identify sarcastic memes and offensive images. The model uses Optical Character Recognition (OCR) and bidirectional long-short term memory (Bi-LSTM) for sarcastic meme detection. For offensive image detection, the model employs Autoencoder LSTM, deep learning models such as Densenet and mobilenet, and computer vision techniques like Feature Fusion Process (FFP) based on Transfer Learning (TL) with image Augmentation. The study showcases the effectiveness of the proposed methods in achieving high accuracy in detecting offensive content across different modalities, such as text, memes, and images. Based on tests conducted on real-world datasets, our model has demonstrated an accuracy rate of 92% on the Hateful Memes Challenge dataset. The proposed methodology has also achieved a Testing Accuracy (TA) of 95.7% for Densenet with transfer learning on the NPDI dataset and 95.12% on the Pornography dataset. Moreover, implementing Transfer Learning with a Feature Fusion Process (FFP) has resulted in a TA of 99.45% for the NPDI dataset and 98.5% for the Pornography dataset.

关键词： Deep learning natural language processing offen sive images sarcastic memes toxic content detection

来源：评论

学校读者我要写书评

暂无评论

Red Deer Optimization with Artificial Intelligence Enabled image Captioning System for Visually Impaired People

引用

Computer Systems Science & Engineering 2023年第8期46卷 1929-1945页

作者： Anwer Mustafa Hilal Fadwa Alrowais Fahd N.Al-Wesabi Radwa Marzouk Department of Computer and Self Development Preparatory Year DeanshipPrince Sattam bin Abdulaziz UniversityAlKharjSaudi Arabia Department of Computer Sciences College of Computer and Information SciencesPrincess Nourah bint Abdulrahman UniversityP.O.Box 84428Riyadh11671Saudi Arabia Department of Computer Science College of Science&Art at MahayilKing Khalid UniversityMahayilSaudi Arabia Department of Information Systems College of Computer and Information SciencesPrincess Nourah bint Abdulrahman UniversityP.O.Box 84428Riyadh11671Saudi Arabia Department of Mathematics Faculty of ScienceCairo UniversityGiza12613Egypt

The problem of producing a natural language description of an image for describing the visual content has gained more attention in natural language processing(NLP)and computer vision(CV).It can be driven by applications like image retrieval or indexing,virtual assistants,image understanding,and support of visually impaired people(VIP).Though the VIP uses other senses,touch and hearing,for recognizing objects and events,the quality of life of those persons is lower than the standard *** image captioning generates captions that will be read loudly to the VIP,thereby realizing matters happening around *** article introduces a Red Deer Optimization with Artificial Intelligence Enabled image Captioning System(RDOAI-ICS)for Visually Impaired *** presented RDOAI-ICS technique aids in generating image captions for *** presented RDOAiiCS technique utilizes a neural architectural search network(NASNet)model to produce image ***,the RDOAI-ICS technique uses the radial basis function neural network(RBFNN)method to generate a textual *** enhance the performance of the RDOAI-ICS method,the parameter optimization process takes place using the RDO algorithm for NasNet and the butterfly optimization algorithm(BOA)for the RBFNN model,showing the novelty of the *** experimental evaluation of the RDOAI-ICS method can be tested using a benchmark *** outcomes show the enhancements of the RDOAI-ICS method over other recent image captioning approaches.

关键词： machine learning image captioning visually impaired people parameter tuning artificial intelligence metaheuristics

来源：评论

学校读者我要写书评

暂无评论

How to combine issues related to autonomous vehicles - a proposal with a literature review 26

How to combine issues related to autonomous vehicles - a pro...

引用

26th IEEE Signal processing: Algorithms, Architectures, Arrangements, and applications, SPA 2023

作者： Balcerek, Julian Pawlowski, Pawel Poznan University of Technology Division of Signal Processing and Electronic Systems Institute of Automatic Control and Robotics Poznan Poland

ISBN: (纸本)9798350304985

This article proposes a model that combines the issues related to autonomous vehicles into seven groups. The groups are included in mutual iterations between the user, the autonomous vehicle and the environment. They are: vehicle automation, environmental sensors, automatic perception of the environment based on the processing of vision signals, communication with external devices, interaction with the user, direct transmission of signals to the environment, and future development plans. Bearing in mind the proposed model, various latest solutions presented in the state-of-the-art literature or already offered in vehicles are collected and described. © 2023 Division of Signal processing and Electronic Systems, Poznan University of Technology (DSPES PUT).

关键词： image processing

来源：评论

学校读者我要写书评

暂无评论

Epidemiological Mucormycosis treatment and diagnosis challenges using the adaptive properties of computer vision techniques based approach: a review

引用

MULTIMEDIA TOOLS AND applications 2022年第10期81卷 14217-14245页

作者： Nira Kumar, Harekrishna GLA Univ Dept Elect & Commun Mathura 281406 India

As everyone knows that in today's time Artificial Intelligence, machine Learning and Deep Learning are being used extensively and generally researchers are thinking of using them everywhere. At the same time, we are also seeing that the second wave of corona has wreaked havoc in India. More than 4 lakh cases are coming in 24 h. In the meantime, news came that a new deadly fungus has come, which doctors have named Mucormycosis (Black fungus). This fungus also spread rapidly in many states, due to which states have declared this disease as an epidemic. It has become very important to find a cure for this life-threatening fungus by taking the help of our today's devices and technology such as artificial intelligence, data learning. It was found that the CT-Scan has much more adequate information and delivers greater evaluation validity than the chest X-Ray. After that the steps of image processing such as pre-processing, segmentation, all these were surveyed in which it was found that accuracy score for the deep features retrieved from the ResNet50 model and SVM classifier using the Linear kernel function was 94.7%, which was the highest of all the findings. Also studied about Deep Belief Network (DBN) that how easy it can be to diagnose a life-threatening infection like fungus. Then a survey explained how computer vision helped in the corona era, in the same way it would help in epidemics like Mucormycosis.

关键词： Mucormycosis Computer vision Black fungus Artificial intelligence Deep learning

来源：评论

学校读者我要写书评

暂无评论

A survey on multimodal bidirectional machine learning translation of image and natural language processing

引用

EXPERT SYSTEMS WITH applications 2024年 235卷

作者： Nam, Wongyung Jang, Beakcheol Yonsei Univ Grad Sch 50 Yonsei Ro Seoul 03722 South Korea

Advances in multimodal machine learning help artificial intelligence to resemble human intellect more closely, which perceives the world from multiple modalities. We surveyed state-of-the-art research on the modalities of bidirectional machine learning translation of image and natural language processing (NLP), which address a considerable proportion of human life. Recently, with the advances in deep learning model architectures and learning methods in the fields of image and NLP, considerable progress has been made in multimodal machine learning translations that can be built by integrating image and NLP. Our goal is to explore and summarize state-of-the-art research on multimodal machine learning translation and present a taxonomy for the multimodal bidirectional machine learning translation of image and NLP. Furthermore, we reviewed the evaluation metrics and compared state-of-the-art approaches that influences this field. We believe that this survey will become a cornerstone of future research by discussing the challenges in multimodal machine learning translation and direction of future research based on understanding state-of-the-art research in the field.

关键词： Computer vision and natural language processing Deep learning image captioning image synthesis machine learning Multimodal

来源：评论

学校读者我要写书评

暂无评论

An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance

An image speaks a thousand words, but can everyone listen? O...

引用

2024 Conference on Empirical Methods in Natural Language processing, EMNLP 2024

作者： Khanuja, Simran Ramamoorthy, Sathyanarayanan Song, Yueqi Neubig, Graham Carnegie Mellon University United States

ISBN: (纸本)9798891761643

Given the rise of multimedia content, human translators increasingly focus on culturally adapting not only words but also other modalities such as images to convey the same meaning. While several applications stand to benefit from this, machine translation systems remain confined to dealing with language in speech and text. In this work, we introduce a new task of translating images to make them culturally relevant. First, we build three pipelines comprising state-of-the-art generative models to do the task. Next, we build a two-part evaluation dataset - (i) concept: comprising 600 images that are cross-culturally coherent, focusing on a single concept per image;and (ii) application: comprising 100 images curated from real-world applications. We conduct a multi-faceted human evaluation of translated images to assess for cultural relevance and meaning preservation. We find that as of today, image-editing models fail at this task, but can be improved by leveraging LLMs and retrievers in the loop. Best pipelines can only translate 5% of images for some countries in the easier concept dataset and no translation is successful for some countries in the application dataset, highlighting the challenging nature of the task. Our project webpage is here and our code, data and model outputs can be found here. © 2024 Association for Computational Linguistics.

关键词： machine translation

来源：评论

学校读者我要写书评

暂无评论

High-definition event frame generation using SoC FPGA devices 26

High-definition event frame generation using SoC FPGA device...

引用

26th IEEE Signal processing: Algorithms, Architectures, Arrangements, and applications, SPA 2023

作者： Blachut, Krzysztof Kryjak, Tomasz AGH University of Krakow Embedded Vision Systems Group Department of Automatic Control and Robotics Krakow Poland

ISBN: (纸本)9798350304985

In this paper we have addressed the implementation of the accumulation and projection of high-resolution event data stream (HD - 1280×720 pixels) onto the image plane in FPGA devices. The results confirm the feasibility of this approach, but there are a number of challenges, limitations and trade-offs to be considered. The required hardware resources of selected data representations, such as binary frame, event frame, exponentially decaying time surface and event frequency, were compared with those available on several popular platforms from AMD Xilinx. The resulting event frames can be used for typical vision algorithms, such as object classification and detection, using both classical and deep neural network methods. © 2023 Division of Signal processing and Electronic Systems, Poznan University of Technology (DSPES PUT).

关键词： System-on-chip

来源：评论

学校读者我要写书评

暂无评论

SHAPE DETECTION IN AN image USING PARALLELIZED TRADITIONAL image ANALYSIS TECHNIQUES 58

SHAPE DETECTION IN AN IMAGE USING PARALLELIZED TRADITIONAL I...

引用

58th Annual International Telemetering Conference, ITC 2023

作者： Loreto Cornídez, Alan Manuel Fuentes Gutiérrez, Rubén Diego College of Electrical and Computer Engineering The University of Arizona TucsonAZ85721 United States

Modern day computer vision applications are frequently implemented using machine learning approaches. While these implementations can perform very well, the performance is heavily dependent on sufficient and accurate training data. Due to a lack of adequate training data, the Arizona Autonomous Vehicles Club (AZA) decided to implement the generalized hough transform to detect shapes in a live video feed from an unmanned aerial system (UAS). The hough transform is computationally intensive and since real-time performance is required, a serial approach may not have the execution speed necessary for the application. image processing techniques include matrix multiplication and convolution operations which are highly parallelizable. Therefore, the algorithm was parallelized and implemented on a graphics processing unit (GPU). Performance profiling was done on both machine learning and traditional approaches where execution time and accuracy were compared. © 2023 International Foundation for Telemetering. All rights reserved.

关键词： Unmanned aerial vehicles (UAV)

来源：评论

学校读者我要写书评

暂无评论

Guest Editorial: Smart Measurement in machine vision for Challenging applications

引用

IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE 2023年第8期26卷 3-3页

作者： Venkatesan, C. AI-Turjman, Fadi Pelusi, Danilo HKBK Coll Engn Dept ECE Bengaluru India Near East Univ Dept Artificial Engn Istanbul Turkiye Univ Teramo Dept CSE Teramo Italy

Smart measurements are widely deployed in many applications due to the technology advancement. For various industrial applications, automated inspection and analysis based on the image is provided by machine vision. For the measurements in these applications, sensors must be connected. machine vision tries to creatively combine already existing technology and use them to address current issues. The term "measurement" is frequently used to refer to many tasks and is the cornerstone of industrial automation and security deployment. This Special Issue of Instrumentation & Measurement Magazine addresses some novel achievements in the measurement and instrumentation science and technology fields. It advances machine vision concerning production, application of smart materials, measurement and estimation techniques, etc. The variety of selected papers reflects the efforts made by the authors to focus either on methodological aspects or technical issues. In particular, three papers have been accepted for publication, reflecting several aspects of the abovementioned fields by covering machine vision and image processing technology.

关键词： Special issues and sections Smart devices Measurement techniques machine vision

来源：评论

学校读者我要写书评

暂无评论

A comprehensive study of feature extraction techniques for plant leaf disease detection

引用

MULTIMEDIA TOOLS AND applications 2022年第1期81卷 367-419页

作者： Vishnoi, Vibhor Kumar Kumar, Krishan Kumar, Brajesh Gurukula Kangri Vishwavidyalaya Dept Comp Sci Haridwar 249404 Uttarakhand India MJP Rohilkhand Univ Dept Comp Sci & IT Bareilly 243006 Uttar Pradesh India

Agriculture has been the most primary source of the livelihood of man for thousands of years. Even today, it provides subsistence to about 50% of the world population. Plant diseases are the serious cause of big losses to crop production every year worldwide. It is necessary to keep the plants healthy at various stages of their growth/development to deal with the financial losses from plant diseases. Symptoms of infections are visible mainly at plant leaves;thus leaves are commonly used to detect and identify the diseases. Detecting the disease through visual observation is itself a challenging task and requires a lot of human expertise. image processing techniques along with computational intelligence or soft computing techniques can be used to provide a better assistance for disease detection to the farmers. A disease in plants can be detected based on its symptoms extracted in the form of features. Feature extraction techniques thus play a vital role in such systems. The paper emphasizes on the review of hand-crafted and deep learning based feature extraction with their merits and demerits. It provides a comprehensive discussion on a variety of image features such as color, texture, and shape for various disorders in different cultures.

关键词： Computer vision image processing Plant leaf diseases Feature extraction Classification machine learning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：