the proceedings contain 76 papers from the intelligentdataengineering and automatedlearning - ideal2005: 6thinternationalconference. Proceedings. the topics discussed include: synthetic environment representatio...
详细信息
the proceedings contain 76 papers from the intelligentdataengineering and automatedlearning - ideal2005: 6thinternationalconference. Proceedings. the topics discussed include: synthetic environment representational semantics using the web ontology language;model trees for classification of hybrid data types;unsupervised image segmentation using penalized fuzzy clustering algorithm;mining job logos using incremental attribute-oriented approach;design of simple structure neural voltage regulator for power systems;designing an optimal network using the cross-entropy method;efficient spatial clustring algorithm using binary tree;matching peptide sequences with mass spectra;and toward transitive dependence in MAS.
作者:
Mitic, PeterUCL
Dept Comp Sci Gower St London WC1E 6BT England
We present a quantitative definition of reputation risk, formulated in terms of a reputation time series comprising daily sentiment measurements. Self Supported learning is used to quantify reputation risk by progress...
详细信息
ISBN:
(纸本)9783031777301;9783031777318
We present a quantitative definition of reputation risk, formulated in terms of a reputation time series comprising daily sentiment measurements. Self Supported learning is used to quantify reputation risk by progressively refining an initial proposal for a Minimum Acceptable Sentiment, calculated from descriptive statistics of the reputation data. the derived values are validated using a "sense test" based on a Loess quantile. the results show that the Minimum Acceptable Sentiment value is given approximately by a two standard deviation lower tail of the observed data.
Universitat Polit`ecnica de Val`encia (UPV) faces challenges in managing its Alfresco document repository, which contains 600,000 PDF files, of which only 100,000 are correctly categorised. Manual classification is la...
详细信息
ISBN:
(纸本)9783031777301;9783031777318
Universitat Polit`ecnica de Val`encia (UPV) faces challenges in managing its Alfresco document repository, which contains 600,000 PDF files, of which only 100,000 are correctly categorised. Manual classification is laborious and error-prone, hindering information retrieval and advanced search capabilities. this project presents an automated pipeline that integrates optical character recognition (OCR) and machine learning to efficiently classify documents. Our approach distinguishes between scanned and digital documents, accurately extracts text and categorises it into 51 predefined categories using models such as BERT and RF. By improving document organisation and accessibility, this work optimises UPV's document management and paves the way for advanced search technologies and real-time classification systems.
this paper presents a novel deep-learning pipeline to segment large railway datasets with minimal manual annotation, notoriously time consuming. the pipeline adapts DINOv2 [11] for labeling point clouds, with tailored...
详细信息
ISBN:
(纸本)9783031777301;9783031777318
this paper presents a novel deep-learning pipeline to segment large railway datasets with minimal manual annotation, notoriously time consuming. the pipeline adapts DINOv2 [11] for labeling point clouds, with tailored self-distillation pre-training and fine-tuning. the adopted transformer architecture successfully generalizes to multiple railway datasets, with a lightweight pipeline that outperforms manual labeling speed by a factor of 6, despite requiring a final segmentation check and correction. this groundbreaking achievement bridges the gap between the need for annotated point clouds in railway industry and the lack of publicly available annotated datasets.
Machine learning models often excel in controlled environments but may struggle with noisy, incomplete, or shifted real-world data. Ensuring that these models maintain high performance despite these imperfections is c...
详细信息
ISBN:
(纸本)9783031777370;9783031777387
Machine learning models often excel in controlled environments but may struggle with noisy, incomplete, or shifted real-world data. Ensuring that these models maintain high performance despite these imperfections is crucial for practical applications, such as medical diagnosis or autonomous driving. this paper introduces a novel framework to systematically analyse the robustness of Machine learning models against noisy data. We propose two empirical methods: (1) Noise Tolerance Estimation, which calculates the noise level a model can withstand without significant degradation in performance, and (2) Robustness Ranking, which ranks Machine learning models by their robustness at specific noise levels. Utilizing Cohen's kappa statistic, we measure the consistency between a model's predictions on original and perturbed datasets. Our methods are demonstrated using various datasets and Machine learning techniques, identifying models that maintain reliability under noisy conditions.
this research study proposes an IoT-based solution for intelligent energy management, designed to monitor and optimize power consumption in both residential and industrial environments. the system will integrate an AI...
详细信息
Exploring how literary authors influence each other and how intertextual connections can be found among them is a complex problem that is handled manually by experts. this research field, intertextuality, seeks to und...
详细信息
ISBN:
(纸本)9783031777301;9783031777318
Exploring how literary authors influence each other and how intertextual connections can be found among them is a complex problem that is handled manually by experts. this research field, intertextuality, seeks to understand how writers relate to each other and how the works of one writer have reflected on the work of another. this analysis usually provides insight into the historical, cultural, biographical, and literary contexts that shape literary works. From a computational point of view, the approaches related to this problem have mainly dealt with problems of style classification, known as authorship attribution, but none have so far dealt withthe problem of intertextuality. this paper proposes a novel approach based on contrastive learning to generate authorship embeddings that encapsulate the stylistic signatures of narrative writers and that can be projected into graphical representations to discern similarities between a dataset of literary writers. the embeddings generated withthis approach aim to represent authorship styles are created and evaluated using a dataset of books from Project Gutemberg. During the evaluation, we perform a book and chunk-level evaluation, showing good performance in both cases. Finally, we present different graphical representations and provide a deep analysis of the relations that arise between the writers.
the integration of the Internet of things (IoT) into various aspects of daily life is accelerated by the proliferation of cognitive capabilities and AI-driven applications. Traditional automated systems rely on centra...
详细信息
In the realm of pattern recognition, the automated detection of handwritten text or symbols poses intricate challenges in the field of handwriting recognition. the paper introduces a novel approach that considers the ...
详细信息
暂无评论