image classification is one of the most fundamental capabilities of machine vision intelligence. In this work, we revisit the image classification task using visually-grounded language models (VLMs) such as GPT-4V and...
Although AI systems have been applied in various fields and achieved impressive performance, their safety and reliability are still a big concern. This is especially important for safety-critical tasks. One shared cha...
详细信息
ISBN:
(纸本)9783031282409;9783031282416
Although AI systems have been applied in various fields and achieved impressive performance, their safety and reliability are still a big concern. This is especially important for safety-critical tasks. One shared characteristic of these critical tasks is their risk sensitivity, where small mistakes can cause big consequences and even endanger life. There are several factors that could be guidelines for the successful deployment of AI systems in sensitive tasks: (i) failure detection and out-ofdistribution (OOD) detection;(ii) overfitting identification;(iii) uncertainty quantification for predictions;(iv) robustness to data perturbations. These factors are also challenges of current AI systems, which are major blocks for building safe and reliable AI. Specifically, the current AI algorithms are unable to identify common causes for failure detection. Furthermore, additional techniques are required to quantify the quality of predictions. All these contribute to inaccurate uncertainty quantification, which lowers trust in predictions. Hence obtaining accurate model uncertainty quantification and its further improvement are challenging. To address these issues, many techniques have been proposed, such as regularization methods and learning strategies. As vision and language are the most typical data type and have many open source benchmark datasets, this thesis will focus on vision-language data processing for tasks like classification, image captioning, and vision question answering. In this thesis, we aim to build a safeguard by further developing current techniques to ensure the accurate model uncertainty for safety-critical tasks.
This research presents an innovative approach to accurately extract multi-scale landscape characters by combining geographic information systems (GIS) and graphic processingalgorithms. It is essential to recognize an...
详细信息
FL (Federated learning) has grown in popularity as a field of research that allows for the training of an algorithm over many decentralised servers that are having local data samples without requiring data exchange. N...
详细信息
Computer assisted diagnosis(CAD) of diseases provides more accurate and precise diagnostic reports towards better information regarding the medical condition of patients. A clinician can minimize the error by applying...
详细信息
ISBN:
(纸本)9798350350661;9798350350654
Computer assisted diagnosis(CAD) of diseases provides more accurate and precise diagnostic reports towards better information regarding the medical condition of patients. A clinician can minimize the error by applying his experience acquired by practice, cognitive intuition or scientific research backed by laboratory reports and computer assisted medical image analysis. The findings by the experts based on the analysis of such data are crucial as the suggested treatment is dependent on evaluation at this stage. Machine learning techniques while applied in the medical field performs decision making by mimicking the steps performed by a medical expert in diagnosing the disease, but using algorithms rather intuitive. It brings out accurate medical data through analysis of images performed by computing devices that can reveal valuable information regarding the disease prognosis. Computer aided disease diagnosis with state-of-the-art machine learning and deep learning offers seamless assistance in medical care with near human accuracy. Technology integrated medical support systems combined with sophisticated algorithms can reduce the number of false positive incidents as well as false negative cases. Convolutional neural network (CNN) models can be trained using handcrafted features to derive conclusive inferences for binary class as well as multi-class classification. Artificial Intelligence (AI) supported techniques in disease diagnosis provide assistance to medical experts in decision making by virtue of cloud based data analytics tools for storage and computing.
This paper presents a comprehensive comparative analysis of image partitioning and compression mechanisms, two fundamental techniques in imageprocessing and data compression. image partitioning involves dividing an i...
详细信息
This article proposes a model that combines the issues related to autonomous vehicles into seven groups. The groups are included in mutual iterations between the user, the autonomous vehicle and the environment. They ...
详细信息
This study focuses on enhancing the security of image transmission in Networking systems of Artificial Intelligence (NSAI) by implementing an advanced encryption algorithm (AEA) based on chaotic algorithms. The resear...
详细信息
Watermarks in historical manuscripts are figural shapes serving as tokens for provenance research (e.g. scribe identification, dating, papermill attribution, scribe-papermaker relation, trading, etc.) in Humanities su...
详细信息
ISBN:
(纸本)9783031705427;9783031705434
Watermarks in historical manuscripts are figural shapes serving as tokens for provenance research (e.g. scribe identification, dating, papermill attribution, scribe-papermaker relation, trading, etc.) in Humanities such as Musicology. As of today, they come in a variety of formats: digitized handtracings and rubbings, X-ray based imagery and, more recently, thermograms acquired with infrared (IR) cameras - all of which have been made accessible via image data bases in libraries or archives like the watermark information system (WZIS). A key use case from a scholar's perspective is the search for similar or even equal watermarks in whatever digitized data collections. Non-surprisingly, the prerequisite is the availability of a versatile, reliable, and user-friendly tool comprising methods from digital imageprocessing (IP) and pattern recognition (PR). In our paper, we focus on bridging the gap between digitized thermograms of music manuscripts and watermark classification for similarity-based search through (i) a state-of-the-art (SOTA) analysis, (ii) a resulting conceptual design based on well-understood SOTA as well as novel methods, (iii) an easy-to-use implementation, and (iv) an experimental validation as Proof-of-Concept (PoC). The current system performance is characterized using thermograms recently made openly available within the DRACMarkS project as well as WZIS. The experimental results clearly demonstrate success in bridging the existing gap hence also setting a baseline for an as yet lacking benchmark.
To date, the use of computer vision in biomedicine with the use of artificial intelligent systems, which in turn receive information from images, and then give out new knowledge and final conclusions about the disease...
详细信息
暂无评论