In this paper, a methodology for real-time image classification on multimedia platforms is developed. For this purpose, six feedforward neural network models were trained with images from two databases, preprocessed by three texture extraction methods: local binary pattern-uniform (LBP-U), gray level co-occurrence matrix (GLCM), and wavelet image scattering (WIS). The databases consist of 157,448 images of sections with thumbnails of the platform content (mosaics), representing 14 classes, and 38,214 images with descriptions of the available content (descriptors), representing 11 classes; all images have a resolution of 1280 x 720 pixels. The six models (three for mosaics and three for descriptors) were validated with images from the databases that were not part of the training process to obtain their performance metrics. The training and validation process was repeated 30 times, and the average results were compared. The best-performing models for each database were the neural networks trained with the wavelet image scattering method: for mosaics, 99.97 +/- 0.01 % accuracy, 99.99 +/- 0.01 % specificity, 99.84 +/- 0.06 % sensitivity, 99.59 +/- 0.13 % precision, and 99.71 +/- 0.08 % F1 score with a response time of 0.7349 seconds; for descriptors, 99.90 +/- 0.03 % accuracy, 99.94 +/- 0.02 % specificity, 99.58 +/- 0.15 % sensitivity, 98.63 +/- 0.55 % precision, and 99.09 +/- 0.30 % F1 score with a response time of 0.6227 seconds. These results confirm the effectiveness of WIS-based models for classifying multimedia platform images with the characteristics of the databases used. Tuning the remaining methods is suggested to improve their performance.
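As a rough illustration of the kind of pipeline described, the sketch below extracts LBP-uniform texture histograms and feeds them to a small feedforward classifier. The network sizes, preprocessing details, and databases here are assumptions for illustration; the paper's actual architectures are not reproduced.

```python
# Illustrative sketch of an LBP-U texture pipeline feeding a feedforward
# classifier; hidden layer size and feature parameters are assumptions.
import numpy as np
from skimage.feature import local_binary_pattern
from sklearn.neural_network import MLPClassifier

def lbp_u_features(gray_image, points=8, radius=1):
    """Histogram of uniform LBP codes as a fixed-length texture descriptor."""
    codes = local_binary_pattern(gray_image, points, radius, method="uniform")
    # 'uniform' LBP with P sampling points yields P + 2 distinct code values.
    hist, _ = np.histogram(codes, bins=points + 2, range=(0, points + 2))
    return hist / hist.sum()  # normalize so descriptors are comparable

def train_classifier(images, labels):
    """images: grayscale frames (e.g., 720p screenshots); labels: classes."""
    features = np.stack([lbp_u_features(img) for img in images])
    model = MLPClassifier(hidden_layer_sizes=(64,), max_iter=500)
    model.fit(features, labels)
    return model
```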
Image encryption is a fundamental component of modern data security that guarantees the integrity, privacy, and confidentiality of sensitive visual content. This paper provides a thorough examination of image encrypti...
ISBN: (Print) 9798400704123
The rapid advancement of technology has been revolutionizing the field of sports media, where there is a growing need for sophisticated data processing methods. Current methodologies for extracting information from soccer broadcast videos to generate game highlights and summaries for social media are predominantly manual and rely heavily on text-based NLP techniques, overlooking the rich visual and auditory information available. In response to this challenge, our research introduces SoccerSum, a tool that integrates computer vision and audio analysis with advanced language models such as GPT-4. This multimodal approach enables automated, enriched content summarization, including detection of players and key field elements, thereby enhancing the metadata used in summarization algorithms. SoccerSum uniquely combines textual and visual data, offering a comprehensive solution for generating accurate, platform-specific content. This development represents a significant advancement in automated, data-driven sports media dissemination and sets a new benchmark in soccer information extraction. A video of the demo can be found here: https://***/za4VIi2ARXY.
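To make the multimodal idea concrete, the sketch below folds detector output into an LLM summarization prompt. The event schema, prompt wording, and function names are assumptions for illustration, not SoccerSum's actual interface; only the OpenAI client calls are real API.

```python
# Illustrative sketch: merging visual detections into a GPT-4 prompt, loosely
# in the spirit of SoccerSum's multimodal summarization. The event schema and
# prompt below are hypothetical, not the tool's actual pipeline.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

def summarize_highlight(events, platform="X"):
    """events: dicts from upstream detectors, e.g.
    {"minute": 63, "type": "goal", "player": "No. 9", "zone": "penalty box"}"""
    event_lines = "\n".join(
        f"- {e['minute']}' {e['type']} by {e['player']} ({e['zone']})"
        for e in events
    )
    prompt = (
        f"Write a short {platform} post summarizing these detected "
        f"soccer events:\n{event_lines}"
    )
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content
```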
ISBN: (Print) 9798400704123
HTTP Adaptive Streaming (HAS) has emerged as the predominant solution for delivering video content on the Internet. The urgency of the climate crisis has accentuated the demand for investigations into the environmental impact of HAS techniques. In HAS, clients rely on adaptive bitrate (ABR) algorithms to drive the quality selection for video segments. These algorithms typically prioritize maximizing video quality under favorable network conditions, disregarding the impact on energy consumption. Further research is needed to thoroughly investigate the effects on energy consumption of bitrate and other video parameters such as resolution and codec. In this paper, we propose COCONUT, a content COnsumption eNergy measUrement daTaset for adaptive video streaming, collected through a digital multimeter on various types of client devices, such as a laptop and a smartphone, streaming MPEG-DASH segments. Furthermore, we analyze the dataset and derive insights into the influence on energy consumption of multiple codecs, various video encoding parameters (segment length, framerate, bitrate, and resolution), and decoding type, i.e., hardware or software. We gather and categorize these measurements based on segment retrieval through the network interface card (NIC), decoding, and rendering. Additionally, we compare the impact of different HAS players on energy consumption. This research offers valuable perspectives on the energy usage of streaming devices, which could contribute to a media consumption experience that is both more sustainable and resource-efficient.
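A minimal analysis sketch for a COCONUT-style dataset follows, grouping energy by codec and decoding type and by measurement phase. The file name and column names ("codec", "decode_type", "energy_j", "device", "phase") are assumptions; the released dataset may use a different schema.

```python
# Hypothetical analysis sketch for a COCONUT-style energy dataset.
import pandas as pd

df = pd.read_csv("coconut_measurements.csv")  # assumed file and schema

# Mean energy per segment, split by codec and hardware vs. software decoding.
by_codec = (
    df.groupby(["codec", "decode_type"])["energy_j"]
      .mean()
      .unstack("decode_type")
)
print(by_codec)

# Share of energy spent in each measurement phase (NIC retrieval, decoding,
# rendering), normalized per device.
phase_share = (
    df.groupby(["device", "phase"])["energy_j"].sum()
      .groupby(level="device")
      .transform(lambda s: s / s.sum())
)
print(phase_share)
```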
The way we create, consume and interact with multimedia content has changed significantly in recent years with the advent of affordable recording devices and easy sharing and access in the form of mobile phones. With ...
ISBN: (Print) 9798400701481
To assess the quality of multimedia content, create datasets, and train objective quality metrics, one needs to collect subjective opinions from annotators. Different subjective methodologies exist, from direct rating with single or double stimuli to indirect rating with pairwise comparisons. Triplet- and quadruplet-based comparisons are a type of indirect rating. From these comparisons and preferences on stimuli, we can place the assessed stimuli on a perceptual scale (e.g., from low to high quality). The Maximum Likelihood Difference Scaling (MLDS) solver is one such algorithm, working with triplets and quadruplets. A participant is asked to compare intervals inside pairs of stimuli: (a,b) and (c,d), where a, b, c, d are stimuli forming a quadruplet. However, one limitation is that the perceptual scales retrieved from stimuli of different contents are usually not comparable. We previously offered a solution to measure the inter-content scale of multiple contents. This paper presents an open-source Python implementation of the method and demonstrates its use on three datasets collected in an in-lab environment. We compared the accuracy and effectiveness of the method using pairwise, triplet, and quadruplet comparisons for intra-content annotations. The code is available here: https://***/andreaspastor/MLDS_inter_content_scaling.
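For intuition, the sketch below fits an MLDS-style scale from quadruplet judgments under the standard Gaussian observer model. It is a minimal sketch of the underlying idea, not the authors' implementation linked above; the noise level and optimizer settings are assumptions.

```python
# Minimal MLDS-style sketch for quadruplet judgments. For quadruplet
# (a, b, c, d), the annotator reports whether the perceived difference
# within (c, d) exceeds that within (a, b).
import numpy as np
from scipy.optimize import minimize
from scipy.stats import norm

def neg_log_likelihood(psi, quadruplets, responses, sigma=0.1):
    """psi: perceptual scale value per stimulus.
    responses[i] = 1 if interval (c, d) was judged larger than (a, b)."""
    a, b, c, d = quadruplets.T
    delta = np.abs(psi[d] - psi[c]) - np.abs(psi[b] - psi[a])
    p = norm.cdf(delta / sigma)          # Gaussian observer model
    p = np.clip(p, 1e-9, 1 - 1e-9)       # avoid log(0)
    return -np.sum(responses * np.log(p) + (1 - responses) * np.log(1 - p))

def fit_scale(quadruplets, responses, n_stimuli):
    x0 = np.linspace(0, 1, n_stimuli)    # initial guess: evenly spaced scale
    result = minimize(
        lambda psi: neg_log_likelihood(psi, quadruplets, responses),
        x0, method="L-BFGS-B",
    )
    return result.x
```

In practice, implementations anchor the scale endpoints (e.g., fix the first and last psi values) for identifiability and estimate or fix the noise level sigma.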
ISBN: (Digital) 9781665484220
ISBN: (Print) 9781665484220
Subscribers seeking to access multimedia content rely significantly on wireless networks. Multimedia access is influenced and impaired by variability in the availability of wireless access links. The impairment arises from rain-induced attenuation, resulting in a multimedia content viewing gap (MCVG). The proposed research addresses the challenge of the MCVG and proposes incorporating artificial intelligence with computing and networking entities to enable the creation of multimedia content. This is done aboard entities located in the subscriber residence instead of entities outside the subscriber residence. Performance analysis shows that the proposed mechanism outperforms the existing mechanism, reducing access costs by at least 22.6% and by up to 71% on average. The proposed mechanism also enhances the content access duration by 35.6%.
ISBN: (Print) 9798350387919; 9798350387902
Multimedia content rendering is an important compute-intensive task required for multimedia content generation. Content rendering is executed aboard ground-based data centres. The operation of terrestrial data centres is inefficient when a significant amount of power is expended on cooling, alongside a high water footprint (WF). This results in a high power usage effectiveness (PUE). The high PUE and WF can be reduced via alternative, freely cooled data centres such as the underwater data centre. This research proposes a render workload migration strategy enabling an underwater data centre to execute the render workload. Furthermore, the underwater data centre hosts mini nuclear reactors that enhance its power accessibility instead of relying only on highly variable renewable energy resources in the underwater environment. Performance evaluation shows that the proposed approach improves PUE and accessible power by an average of 26.5% and 63.8%, respectively.
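For reference, PUE is the standard ratio of total facility energy to IT equipment energy; a freely cooled site lowers the numerator. The worked sketch below uses illustrative numbers, not figures from the paper.

```python
# PUE = total facility energy / IT equipment energy.
# The input values below are illustrative assumptions only.
def pue(total_facility_kwh, it_equipment_kwh):
    return total_facility_kwh / it_equipment_kwh

terrestrial = pue(total_facility_kwh=170.0, it_equipment_kwh=100.0)  # 1.70
underwater = pue(total_facility_kwh=110.0, it_equipment_kwh=100.0)   # 1.10

improvement = (terrestrial - underwater) / terrestrial
print(f"PUE improvement: {improvement:.1%}")  # ~35.3% with these inputs
```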
ISBN: (Print) 9798400714672
Video data can be slow to process due to the size of video streams and the computational complexity needed to decode, transform, and encode them. These challenges are particularly significant in interactive applications, such as quickly generating compilation videos from a user search. We look at optimizing access to source video segments in multimedia systems where multiple separately encoded copies of video sources are available, such as proxy/optimized media in conventional non-linear video editors or VOD streams in content distribution networks. Rather than selecting a single source to use (e.g., "use the lowest-bitrate 720p source"), we specify a minimum visual quality (e.g., "use any frames with VMAF >= 85"). This quality constraint and the needed segment bounds are used to find the lowest-latency operations to decode a segment from multiple available sources with diverse bitrates, resolutions, and codecs. This approach uses higher-quality, slower-to-decode sources if their encoding is better aligned with the specific segment bounds, which can provide faster access than using just one lower-quality source. We provide a general solution to this Quality-Aware Multi-Source Selection problem with optimal computational complexity. We create a dataset using adaptive-bitrate streaming Video on Demand sources from YouTube's CDN. We evaluate our algorithm on simple segment decoding as well as embedded into a larger editing system, a declarative video editor. Our evaluation shows up to 23% lower-latency access, depending on segment length, at identical visual quality levels.
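The greedy sketch below captures the core idea: for a requested segment, pick the source that meets a VMAF floor with the lowest estimated decode latency, accounting for keyframe lead-in. It is a simplification under assumed cost and keyframe models; the paper's optimal algorithm is more general.

```python
# Simplified sketch of quality-constrained multi-source selection.
# Cost model and field names are assumptions for illustration.
from dataclasses import dataclass

@dataclass
class Source:
    name: str
    vmaf: float               # measured quality of this encoding
    decode_cost_per_s: float  # estimated decode time per second of video
    keyframes: list           # keyframe timestamps, in seconds

def decode_latency(src, start, end):
    # Decoding must begin at the last keyframe at or before `start`, so
    # poorly aligned keyframes inflate the span that must be decoded.
    lead_in = max((k for k in src.keyframes if k <= start), default=0.0)
    return (end - lead_in) * src.decode_cost_per_s

def select_source(sources, start, end, min_vmaf=85.0):
    eligible = [s for s in sources if s.vmaf >= min_vmaf]
    if not eligible:
        raise ValueError("no source satisfies the quality constraint")
    return min(eligible, key=lambda s: decode_latency(s, start, end))
```

This illustrates why a higher-quality, slower-to-decode source can still win: if its keyframe sits right at the segment start, its decoded span is shorter than that of a cheaper but misaligned source.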
ISBN: (Print) 9798350362169
This scientific paper addresses the pervasive issue of spoofing in biometric user identification, focusing on its manifestation in facial recognition systems. Spoofing involves deceptive communication originating from an untrusted source, aiming to gain unauthorized access to sensitive information. The study delves into the vulnerabilities of facial biometrics, particularly in scenarios where malicious actors attempt to imitate a user's face using masks, photos, or digital means.