Despite the advancements in neural network technologies driving interactive image segmentation forward, challenges persist, especially concerning segmentation ambiguities caused by overlapping or visually similar obje...
详细信息
ISBN:
(纸本)9789819784899;9789819784905
Despite the advancements in neural network technologies driving interactive image segmentation forward, challenges persist, especially concerning segmentation ambiguities caused by overlapping or visually similar objects against complex backgrounds, as well as intricate object boundaries. Addressing these challenges, we introduce FusionNet, focusing on effective feature fusion. Firstly, the Hierarchical Context Fusion Module aids in grasping holistic structures and multi-scale contextual information of target objects. Secondly, the Attention Feature Fusion Module captures more representative feature expressions. This design empowers FusionNet to capture details and contextual relationships better, thereby enhancing segmentation accuracy. For fine-grained boundary details, we propose the Local Correction Module, refining local mask details meticulously. This module initially focuses on information around newly clicked areas, employing discriminative correction feedback for enhanced detail processing accuracy. Rigorous experimentations on datasets like SBD, DAVIS, GrabCut, and Berkeley validate our model's effectiveness, with segmentation results strongly supporting the superiority of our approach.
This research introduces "Jaddah,"an innovative AI-based system for the automated detection of road infrastructure defects using advanced computer vision and machine learning techniques. The system addresses...
详细信息
Recent Compositional Zero-Shot Learning (CZSL) methods increasingly adopt the pre-trained vision-language models to capture the contextual relations between image and text spaces. However, the single-class-token desig...
详细信息
This brief presents a current-mode sensing-computing fusion system for advanced Internet of Things (IoT) machinevision. The key contributions of our work are: (a) a low-voltage of 0.9V multiply-accumulate (MAC) compu...
详细信息
This brief presents a current-mode sensing-computing fusion system for advanced Internet of Things (IoT) machinevision. The key contributions of our work are: (a) a low-voltage of 0.9V multiply-accumulate (MAC) computing macro with reconfigurable weights is proposed, enabling efficient on-sensor feature extraction;(b) a weight-flipping method for processing negative signals is employed, reducing both power consumption and circuit complexity;(c) a convolutional horizontal shifting technique with fixed weights is equipped, eliminating power consumption associated with weight updates. A 16 x 16 CCIS prototype, fabricated using a 0.18 mu m CMOS process, achieves a power efficiency of 27.7pJ/frame-pixel at 2000fps. Experimental evaluations in edge feature extraction demonstrate a 32% reduction in power consumption, highlighting the efficiency gains of our approach.
A group of eye conditions known as glaucoma impair the optic nerve, which is in charge of sending visual data from the eye to the brain. Glaucoma impacts 3.54% of adults aged 40 to 80 around the world. Early detection...
详细信息
A group of eye conditions known as glaucoma impair the optic nerve, which is in charge of sending visual data from the eye to the brain. Glaucoma impacts 3.54% of adults aged 40 to 80 around the world. Early detection of glaucoma is crucial as it can prevent total optic nerve damage, which would cause irreversible vision loss. It is possible for specialists to diagnose glaucoma medically, but treatment options are either expensive or time-consuming and requires ongoing care from medical professionals. There have been numerous initiatives at streamlining all components of the glaucoma categorization process, however these models are challenging for users to comprehend the key predictors, resulting in them being unreliable for use by medical experts. The study uses eye fundus images to classify glaucoma patients using three distinct Deep Learning techniques: Convolutional neural network, Visual Geometry Group 16 (VGG16), and Global Context Network (GC-Net). In addition, several data pre-processing techniques are used to avoid overfitting and achieve high accuracy. This research compares and analyses the performance of various architectures using the aforementioned techniques. The CNN model had the best accuracy of 83% when in contrast to the other deep learning models.
Matrix-vector multiplication (MVM) operations play an important role in applications such as data processing and artificial neural networks. To meet the growing demand for computing power, the photonic MVM processor p...
详细信息
Matrix-vector multiplication (MVM) operations play an important role in applications such as data processing and artificial neural networks. To meet the growing demand for computing power, the photonic MVM processor provides what we believe to be a new computing architecture. In this paper, we propose a reconfigurable parallel MVM (RP-MVM) processor. To further improve the parallel computing dimension, wavelength division multiplexing (WDM) and digital subcarrier multiplexing (DSM) technologies were first incorporated into the photonic MVM. Compared with the traditional WDM-MVM architecture, the parallelism of RP-MVM scheme is increased by N times, where N is the carrier number of DSM signal. Moreover, the input data channel can be dynamically adjusted without changing the hardware scale, which improves the flexibility of computing system. The simulation results show that the RP-MVM scheme can achieve parallel computing operations of eight MVMs, with a computing speed of 128 GOPs. For a random 6-bit resolution data sequence, the root mean square error (RMSE) of calculation results is on the order of 1E-3. In addition, for the image edge extraction task based on Roberts operator, this scheme can realize the parallel processing of four grayscale images. Therefore, the proposed scheme provides an alternative approach for realizing a highly parallel and reconfigurable large-scale photonic MVM architecture.
The exploration of sentiments through facial expressions is a captivating domain with applications across security, healthcare, and human–computer interaction, where understanding sentiments is primarily about interp...
详细信息
The proceedings contain 128 papers. The special focus in this conference is on Data Science, machine Learning and applications. The topics include: Digitization of Monuments – An Impact on the Tourist Experience with...
ISBN:
(纸本)9789819780426
The proceedings contain 128 papers. The special focus in this conference is on Data Science, machine Learning and applications. The topics include: Digitization of Monuments – An Impact on the Tourist Experience with Special Reference to Hampi;resume Parser Using machine Learning;IOT Based Smart Hydroponics System;comparative Study of machine Learning and Deep Learning Techniques for Cancer Disease Detection;High Thruput Modulation Approaches Used in Next Generation WiF’s Under Multi-impairments Environments with MATLAB Codes;skin Disease Detection;root Vegetable Crop Recommendation System Based on Soil Properties and Environmental Factors;deep Learning Model Development for an Automatic Healthcare Edge Computing Application;Empathetic Conversations in Mental Health: Fine-Tuning LLMs for Supportive AI Interactions;exploring Block Chain Technology with applications, and Future Prospects;a Comprehensive Review of Soft Computing Enabled Techniques for IoT Security: State-of-the-Art and Challenges Ahead;Performance Analysis of machine Learning Algorithms on Imbalanced Datasets Using SMOTE Technique;An AI Based Nutrient Tracking and Analysis System;power Saving Mechanism for Street Lights System Using IoT;Automatic Login System Using ATTINY85 IC;forecasting Stock Prices: A Comparative Analysis of machine Learning, Deep Learning, and Statistical Approaches;smart vision Bot;robots in Logistics: Apprehension of Current Status and Future Trends in Indian Warehouses;smart Healthcare: Enhancing Patient Well-Being with IoT;Detection of B-ALL Using CNN Model and Deep Learning;a Comprehensive Analysis for Advancements and Challenges in Deep Learning Models for imageprocessing;a Comprehensive Survey on Enhancing Patient Care Through Deep Learning and IoT-Enabled Healthcare Innovations;attention-Based image Caption Generation.
The proceedings contain 128 papers. The special focus in this conference is on Data Science, machine Learning and applications. The topics include: Digitization of Monuments – An Impact on the Tourist Experience with...
ISBN:
(纸本)9789819780303
The proceedings contain 128 papers. The special focus in this conference is on Data Science, machine Learning and applications. The topics include: Digitization of Monuments – An Impact on the Tourist Experience with Special Reference to Hampi;resume Parser Using machine Learning;IOT Based Smart Hydroponics System;comparative Study of machine Learning and Deep Learning Techniques for Cancer Disease Detection;High Thruput Modulation Approaches Used in Next Generation WiF’s Under Multi-impairments Environments with MATLAB Codes;skin Disease Detection;root Vegetable Crop Recommendation System Based on Soil Properties and Environmental Factors;deep Learning Model Development for an Automatic Healthcare Edge Computing Application;Empathetic Conversations in Mental Health: Fine-Tuning LLMs for Supportive AI Interactions;exploring Block Chain Technology with applications, and Future Prospects;a Comprehensive Review of Soft Computing Enabled Techniques for IoT Security: State-of-the-Art and Challenges Ahead;Performance Analysis of machine Learning Algorithms on Imbalanced Datasets Using SMOTE Technique;An AI Based Nutrient Tracking and Analysis System;power Saving Mechanism for Street Lights System Using IoT;Automatic Login System Using ATTINY85 IC;forecasting Stock Prices: A Comparative Analysis of machine Learning, Deep Learning, and Statistical Approaches;smart vision Bot;robots in Logistics: Apprehension of Current Status and Future Trends in Indian Warehouses;smart Healthcare: Enhancing Patient Well-Being with IoT;Detection of B-ALL Using CNN Model and Deep Learning;a Comprehensive Analysis for Advancements and Challenges in Deep Learning Models for imageprocessing;a Comprehensive Survey on Enhancing Patient Care Through Deep Learning and IoT-Enabled Healthcare Innovations;attention-Based image Caption Generation.
Summarization approaches are currently proposed solutions that focus on meaningfully reducing different types of data such as text, audio, and video. Many techniques such as machine learning, signal processing, image ...
详细信息
暂无评论