Line segment detection is a fundamental procedure in computer vision, pattern recognition, and image analysis applications. The paper proposes a novel method for wide line segment detection especially endpoints determ...
ISBN: (print) 9798400706028
Misinformation is a widespread problem in the wake of flourishing social media. One of the most common forms of misinformation is cheapfakes - multimedia content that is manipulated with simple techniques such as miscaptioning to create misleading or false narratives. Due to their simplicity and appealing nature, cheapfakes pose a formidable threat to the reliability, transparency and integrity of the Internet. The ACM ICMR 2024 Grand Challenge on Detecting Cheapfakes tasks participants with detecting out-of-context cheapfakes - images with misleading or irrelevant captions. This paper introduces a lightweight approach that enhances cheapfake detection by combining prompt engineering with image captioning running on an interleaved image-text model. Testing on a public dataset gives an accuracy of 82.9%. Despite its modest accuracy, our method demonstrates the potential for applying better mixed-media learning models for context understanding and Visual Question Answering.
Fog rectification is a crucial preprocessing step in enhancing image quality for applications like autonomous driving and object recognition, particularly in foggy and hazy conditions. This paper presents a streaming ...
The study of frequency components derived from the Discrete Cosine Transform (DCT) has been widely used in image analysis. In recent years it has been observed that significant information can be extrapolated from them ab...
ISBN: (print) 9783031821554; 9783031821561
Diabetic Retinopathy (DR) is an eye disease associated with chronic diabetes. It remains the primary cause of visual impairment and blindness among the global working-age population. Early detection of DR is crucial for ensuring timely diagnosis and effective treatment. This paper proposes a new homogeneous ensemble-based approach constructed using a set of hybrid architectures as base learners and two combination rules (weighted and hard voting) for referable DR detection, using fundus images from the Messidor-2, Kaggle DR, and APTOS datasets. The hybrid architectures are created using deep feature extraction techniques, dimensionality reduction techniques to reduce the size of the extracted features, and a decision tree (DT) algorithm for classification. The results show the potential of the proposed approach, which achieved high accuracy values over the three datasets: 90.65%, 93.01%, and 83.32% using the APTOS, Kaggle DR, and Messidor-2 datasets respectively. Therefore, we recommend the proposed approach, since it is effective for referable DR classification and represents a promising tool to assist ophthalmologists in diagnosing DR.
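The two combination rules named in the abstract above, hard voting and weighted voting, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the binary labels (1 = referable DR, 0 = non-referable), and the weights are all hypothetical, and the abstract does not say how the paper derives its weights (validation accuracy of each base learner is one common choice).

```python
from collections import Counter


def hard_vote(predictions):
    """Return the label predicted by the most base learners.

    Ties resolve to the label that appears first in `predictions`.
    """
    return Counter(predictions).most_common(1)[0][0]


def weighted_vote(predictions, weights):
    """Return the label with the largest total weight across base learners."""
    totals = {}
    for label, weight in zip(predictions, weights):
        totals[label] = totals.get(label, 0.0) + weight
    return max(totals, key=totals.get)


# Hypothetical outputs of three base learners for one fundus image.
predictions = [1, 0, 0]
# Hypothetical per-learner weights (assumed here, not taken from the paper).
weights = [0.9, 0.3, 0.4]

hard_result = hard_vote(predictions)
weighted_result = weighted_vote(predictions, weights)
print(hard_result, weighted_result)
```

In this made-up case the two rules disagree: two of three learners predict class 0, so hard voting returns 0, while the single highly weighted learner outweighs the other two (0.9 against 0.7), so weighted voting returns 1.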
Building on advances in natural language processing and computer vision, this work introduces a robust image captioning model that harnesses the capabilities of attention mechanisms. The model encompasses s...
Convolutional neural networks (CNNs) have shown impressive advantages in various applications like computer vision and speech recognition. However, applications of CNNs to P300 detection are still at an early stage. Dif...
In current cooperative robot competitions, a robot needs to use its own camera module to collect real-time images and identify enemy robots from the collected images. However, due to the interferenc...
The integration of advanced technologies into education has emerged as a vital solution to bridge the accessibility gap for visually challenged students. This paper introduces an innovative system that uses image capt...
Owing to the progress of information technology, library management systems are being intelligently automated. Book barcode recognition is key to improving the quality of service, but its performance is affected by lig...