The proceedings contain 27 papers. The topics discussed include: hierarchical photo stream segmentation using context;giving order to image queries;logo detection using wavelet co-occurrence histograms;a novel approac...
详细信息
ISBN:
(纸本)9780819469922
The proceedings contain 27 papers. The topics discussed include: hierarchical photo stream segmentation using context;giving order to image queries;logo detection using wavelet co-occurrence histograms;a novel approach to personal photo album representation and management;facial features matching using a virtual structuring element;picture management using person retrieval for consumer image collections;distributed wireless face recognition system;improving multimedia retrieval with a video OCR;improving scene detection by using gradual shot transitions as cues from film grammar;semantic video indexing using context-dependent fusion;highlight summarization in golf videos using audio signals;content-based image retrieval using greedy routing;evaluation of content-based features for user-centered image retrieval in small medial collections;and visual search engine for product images.
In this paper, a methodology for real-time image classification on multimedia platforms has been developed. For this purpose, six feedforward neural network models were trained with images from two databases, which we...
详细信息
In this paper, a methodology for real-time image classification on multimedia platforms has been developed. For this purpose, six feedforward neural network models were trained with images from two databases, which were preprocessed by three texture extraction methods: local binary pattern-uniform (LBP-U), gray level co-occurrence matrix (GLCM), and wavelet image scattering (WIS). The databases used consist of 157,448 images of the sections with the thumbnails of the platform content (mosaics) representing 14 classes and 38,214 images with the descriptions of the available content (descriptors) representing 11 classes, where all images have a resolution of 1280 x 720 pixels. The six models (three for mosaics and three for descriptors) were validated with images from the databases, which were not part of the training process, to obtain their performance metrics. The training and validation process was performed 30 times, and the average results were compared. The most outstanding models for each database were the neural networks trained with the wavelet image scattering method, with metrics of 99.97 +/- 0.01 % accuracy, 99.99 +/- 0.01 % specificity, 99.84 +/- 0.06 % sensitivity, 99.59 +/- 0.13 % precision and 99.71 +/- 0. 08 % of F1 score with a response time of 0.7349 seconds for the model trained with mosaics and with metrics of 99.90 +/- 0.03 % of accuracy, 99.94 +/- 0.02 % of specificity, 99.58 +/- 0.15 % of sensitivity, 98.63 +/- 0.55 % of precision and 99.09 +/- 0.30 % of F1 score with a response time of 0.6227 seconds for the model trained with descriptor images. The results are very significant due to the high efficiency obtained and confirm the effectiveness of the models with the WIS method for the classification of multimedia platform images with the characteristics of the databases used. It is suggested that the remaining methods be adjusted to improve their performance.
In today's information-rich society, the rapid dissemination of news content across digital platforms plays a pivotal role in shaping public perception and discourse, yet they remain underutilized for sentiment an...
详细信息
In today's information-rich society, the rapid dissemination of news content across digital platforms plays a pivotal role in shaping public perception and discourse, yet they remain underutilized for sentiment analysis compared to other social media data. This study addresses this gap by using advanced Natural Language Processing (NLP) techniques to analyze Modern Standard Arabic (MSA) news content, categorizing headlines into themes like politics, business, education, weather, and sport, and applying sentiment analysis to classify them as positive or negative. These models form the foundation of our Societal Wellbeing Scoring and Monitoring System, which quantifies and monitors societal sentiment across key Quality of Life (QoL) aspects in Moroccan news. Our approach utilizes an LDA topic modeling approach in the preprocessing stage on the Moroccan News Arabic Dataset (MNAD) to enhance the quality of selected data sample before building classification models. Following this, we compare deep learning models with AraBERT and asafaya/bert-base-arabic (ASAFAYA) word embeddings to traditional machine learning algorithms using term frequency-inverse document frequency (TF-IDF). Notably, the GRU model with ASAFAYA embeddings excelled in the topic modeling task, achieving a 94.74% accuracy. We further analyzed public sentiment using data from Hibapress, a Moroccan news website, by scraping titles, likes, and dislikes to assign positive and negative sentiment scores. Leveraging a Bi-GRU model with ASAFAYA embeddings, we achieved an accuracy of 90.23%. We propose the Societal Wellbeing Scoring and Monitoring System, an aspect-based sentiment analysis framework designed to assign a wellbeing score over a specified sample or time period. Our findings highlight the effectiveness of combining topic modeling with deep learning-driven sentiment analysis to derive actionable insights from Arabic news, thereby enhancing the understanding of societal dynamics in Morocco.
The proceedings contain 15 papers. The topics discussed include: discriminative genre-independent audio-visual scene change detection;a random walk through human behavior;flexible user interface for efficient content-...
The proceedings contain 15 papers. The topics discussed include: discriminative genre-independent audio-visual scene change detection;a random walk through human behavior;flexible user interface for efficient content-based video surveillance retrieval: design and evaluation;an automated object-level video editing tool;ImageSeeker: a content-based image retrieval system;extraction of salient regions of interest using visual attention models;research on subjective stereoscopic image quality assessment;image quality assessment in multimedia applications;document description: what works for images should also work for text?;an annotation database for multimedia scientific data;a model of multimodal fusion for medical applications;and binary and nonbinary description of hypointensity for search and retrieval of brain MR images.
The proceedings contain 25 papers. The topics discussed include: a model-based conceptual clustering of moving objects in video surveillance;image watermarking based on color quantization process;search and retrieval ...
详细信息
ISBN:
(纸本)0819466190
The proceedings contain 25 papers. The topics discussed include: a model-based conceptual clustering of moving objects in video surveillance;image watermarking based on color quantization process;search and retrieval of medical images for improved diagnosis of neurodegenerative diseases;assessment of end-user response to sports highlights extraction for personal video recorders;examining user interactions with video retrieval systems;automatic and user-centric approaches to video summary evaluation;ontology driven search engine;adaptation of video game UVW mapping to 3D visualization of gene expression patterns;classification of yeast cells from image features to evaluate pathogen conditions;data mining learning bootstrap through semantic thumbnail analysis;a spatiotemporal decomposition strategy for personal home video management;and analysis of unstructured video based on camera motion.
The proceedings contain 31 papers. The topics discussed include: location-aware gang graffiti acquisition and browsing on a mobile device;dietary intake assessment using integrated sensors and software;FCam for multip...
ISBN:
(纸本)9780819489517
The proceedings contain 31 papers. The topics discussed include: location-aware gang graffiti acquisition and browsing on a mobile device;dietary intake assessment using integrated sensors and software;FCam for multiple cameras;continuously adjustable Pulfrich spectacles for mobile devices;parameters of the human 3D gaze while observing portable autostereoscopic display: a model and measurement results;deblocking of mobile stereo video;SUPL support for mobile devices;measuring ionizing radiation with a mobile device;design and evaluation of security multimedia warnings for children's smartphones;using wi-fi hotspots as an intrusion vector into corporate networks;frame rate up-conversion assisted with camera auto exposure information;and fused fibonacci-like (p,q) sequences with compression and barcoding applications.
The proceedings contain 37 papers. The topics discussed include: mobile 3D quality of experience evaluation: a hybrid data collection and analysis approach;overcome the shortcoming in mobile stereoscopy;comparative st...
ISBN:
(纸本)9780819484185
The proceedings contain 37 papers. The topics discussed include: mobile 3D quality of experience evaluation: a hybrid data collection and analysis approach;overcome the shortcoming in mobile stereoscopy;comparative study of autostereoscopic displays for mobile devices;subjective evaluation of mobile 3D video content: depth range versus compression artifacts;development of 3D mobile receiver for stereoscopic video and data service in T-DMB;a right scaled depth sense formed by using a distorted objective space based on CG stereoscopy;smart travel guide: from internet image database to intelligent system;revised benchmarking of contact-less fingerprint scanners for forensic fingerprint detection: challenges and results for chromatic white light scanners (CWL);and optimizing bandwidth and storage requirements for mobile images using perceptual-based JPEG recompression.
The proceedings contain 36 papers. The topics discussed include: contextual advertisement placement in printed media;content-based image retrieval with ontological ranking;a case study on rule-based and CRF-based auth...
ISBN:
(纸本)9780819479334
The proceedings contain 36 papers. The topics discussed include: contextual advertisement placement in printed media;content-based image retrieval with ontological ranking;a case study on rule-based and CRF-based author extraction methods;new performance evaluation models for character detection in images;navigating web search results;cloud-based printing for mobile devices;using ePub as framework for the automated collection, tagging, and transformation of web content for cross-media publication;a web-based rapid assessment tool for production publishing solutions;an investigation of document aesthetics for web-to-print repurposing of small-medium business marketing collateral;learning from graphic designers: using grids as a scaffolding for automatic print layout;ubiquitous picture-rich content representation;a novel XML-based document format with printing quality for web publishing;and faces from the web: automatic selection and composition of media for casual screen consumption and printed artwork.
We present a multimedia information analysis framework for content-based browsing of video. Specifically, we develop algorithms for the automated extraction of video highlights in sports video that are based on audio,...
详细信息
We present a multimedia information analysis framework for content-based browsing of video. Specifically, we develop algorithms for the automated extraction of video highlights in sports video that are based on audio, text, and image features. The extracted annotations are used to build applications for selective browsing of sports videos. Such summarization techniques enable content-based indexing of multimedia documents for efficient storage and retrieval. In addition, in the context of the newly emerging standard MPEG-7, these methods will enable applications that use MPEG-7 descriptions. As this standard provides only the syntax for representing such descriptions and not specific algorithms for extracting them, these algorithms are of great value for establishing MPEG-7 as an accepted standard. We provide experimental results for the proposed algorithms on several hours of sports programs that prove the feasibility of efficient video access techniques in a multimedia environment.
With the increasing popularity of the WWW, the main challenge in computer science has become content-based retrieval of multimedia objects. access to multimedia objects in databases has long been limited to the inform...
详细信息
With the increasing popularity of the WWW, the main challenge in computer science has become content-based retrieval of multimedia objects. access to multimedia objects in databases has long been limited to the information provided in manually assigned keywords. Now, with the integration of feature-detection algorithms in database systems software, content-based retrieval can be fully integrated with query processing. We describe our experimentation platform under development, making database technology available to multimedia. Our approach is based on the new notion of feature databases. Its architecture fully integrates traditional query processing and content-based retrieval techniques.
暂无评论