This paper addresses the diagnosis system of cancer disease using a single setting framework. Most of the radiologists and image specialists are identifying the disease in naked eye. When many conventional systems are...
详细信息
This paper addresses the diagnosis system of cancer disease using a single setting framework. Most of the radiologists and image specialists are identifying the disease in naked eye. When many conventional systems are used to assess or see a patient's disorder condition, it rarely detects the disease all at once in certain situations. Patients are facing difficulties, when the condition of disease is increasing. Thus, this paper focusses the condition of patient seeing the disease image and developed a single setting framework using a convolutional neural network (CNN) architecture with the help of deep learning approaches. The framework contains several deep learning strategies which are used to determine the patient's relevant illness through affected image, such as mass detection using You-Only-Look-Once (YOLO) approach and the crucial aspect of segmentation by full resolution convolutional networks (FrCN). In last the CNN model is considered for classification. This paper is considered to implement our model using breast cancer disease. The different classifiers and cross-validation tests are taken for evaluating validation matrix items. Comparisons of the existing model with the proposed model are made for improving the diagnosis system. For example, the method Inception V3 for accuracy and AUC are 86.77 and 85.89 on MIAS database whereas proposed model got 99.54 and 98.85 on same evaluation items. Our findings show that the proposed diagnostic model outperforms on conventional detection, segmentation, and classification methods. Thus, our diagnosis process worked much better using deep learning and suggested approaches which will help and facilitate the diagnosis of each contaminated region. In each stage of imageprocessing of the infected region, the suggested diagnostics method could support radiologists.
Brain cancer is one of the most deadly illnesses. It causes abnormal cells to grow in the brain. Planning for treatment and the prognosis of patients with brain tumors depend greatly on early diagnosis. Brain tumors c...
详细信息
Despite the great progress of unsupervised domain adaptation (UDA) with the deep neuralnetworks, current UDA models are opaque and cannot provide promising explanations, limiting their applications in the scenarios t...
详细信息
Despite the great progress of unsupervised domain adaptation (UDA) with the deep neuralnetworks, current UDA models are opaque and cannot provide promising explanations, limiting their applications in the scenarios that require safe and controllable model decisions. At present, a surge of work focuses on designing deep interpretable methods with adequate data annotations and only a few methods consider the distributional shift problem. Most existing interpretable UDA methods are post-hoc ones, which cannot facilitate the model learning process for performance enhancement. In this paper, we propose an inherently interpretable method, named Transferable Conceptual Prototype Learning (TCPL), which could simultaneously interpret and improve the processes of knowledge transfer and decision-making in UDA. To achieve this goal, we design a hierarchically prototypical module that transfers categorical basic concepts from the source domain to the target domain and learns domain-shared prototypes for explaining the underlying reasoning process. With the learned transferable prototypes, a self-predictive consistent pseudo-label strategy that fuses confidence, predictions, and prototype information, is designed for selecting suitable target samples for pseudo annotations and gradually narrowing down the domain gap. Comprehensive experiments show that the proposed method can not only provide effective and intuitive explanations but also outperform previous state-of-the-arts. Code is available at https://***/file/d/1b1EHFghiF1ExD-Cn1HYg75VutfkXWp60/view?usp=sharing.
This work lays the groundwork for creating an automated system for the inventory of urban waste elements. Our primary contribution is the development of, to the best of our knowledge, the first re-identification syste...
详细信息
This work lays the groundwork for creating an automated system for the inventory of urban waste elements. Our primary contribution is the development of, to the best of our knowledge, the first re-identification system for urban waste elements that uses artificial Intelligence and Computer Vision, incorporating information from a classification module and geolocation context to enhance post-processing performance. This re-identification system helps to create and update inventories by determining if a new image matches an existing element in the inventory based on visual similarity or, if not, by adding it as a new identity (new class or new identity of an existing class). Such a system could be highly valuable to local authorities and waste management companies, offering improved facility maintenance, geolocation, and additional applications. This work also addresses the dynamic nature of urban environments and waste management elements by exploring Continual Learning strategies to adapt pretrained systems to new settings with different urban elements. Experimental results show that the proposed system operates effectively across various container types and city layouts. These findings were validated through testing in two different Spanish locations, a "City" and a "Campus", differing in size, illumination conditions, seasons, urban design and container appearance. For the final re-identification system, the baseline system achieves 53.18 mAP (mean Average Precision) in the simple scenario, compared to 21.54 mAP in the complex scenario, with additional challenging unseen variability. Incorporating the proposed post-processing techniques significantly improved results, reaching 74.14 mAP and 71.75 mAP in the simple and complex scenario respectively.
Target contour extraction is a key task in the field of imageprocessing, which is of great significance for applications such as image segmentation, object detection, and scene understanding. Traditional methods are ...
详细信息
Convolutional neuralnetworks (CNNs) play an important role in an increasing number of imageprocessing tasks. There is an obvious demand to improve their classification performance and efficiency. Current research in...
详细信息
ISBN:
(纸本)9783031611360;9783031611377
Convolutional neuralnetworks (CNNs) play an important role in an increasing number of imageprocessing tasks. There is an obvious demand to improve their classification performance and efficiency. Current research in this area tends to focus on developing increasingly complex models and algorithms to achieve this end. However, research into computer vision techniques and data augmentation tends to be neglected. This paper demonstrates that even a very simple CNN model achieves high performance in surface defect classification on the NEU dataset thanks to image preprocessing and data augmentation. The initial F1-score of 0.9646 without image preprocessing increases to 0.9727 when preprocessing is carried out. The simple CNN then achieves an F1-score of 0.9854 after data augmentation.
Deep neuralnetworks trained on large datasets have achieved good results in image denoising. However, networks trained on specific datasets often have poor generalization, which is not conducive to practical applicat...
详细信息
Deep neural network has achieved remarkable progress in artificial intelligence. By employing hierarchical abstraction, weight sharing, and local receptive fields, CNNs are highly effective at extracting features from...
详细信息
Deep neural network has achieved remarkable progress in artificial intelligence. By employing hierarchical abstraction, weight sharing, and local receptive fields, CNNs are highly effective at extracting features from spatial data and capturing significant patterns and structures. However, CNNs are inherently designed for grid-like structures, like images, they may have difficulty processing sequential or temporal data and may not be able to deal with temporal dependencies well. The ability of LSTMs (Long Short-Term Memory networks) to capture long-term relationships makes them suitable for processing sequential data;additionally, applying suitable activation functions, like sigmoid and tanh, improves their capacity to model complex temporal patterns. The paper presents an innovative LSTM model that incorporates a newly introduced activation function, termed modified_sigmoid, specifically applied to the input and forget gates, offering potential improvements for capturing and preserving essential features in image data. Following that, the recently introduced LSTM model is applied to datasets including MNIST, fashion_MNIST, and brain tumors in DICOM and NIfTI formats. Fourteen activation functions are compared with the modified_sigmoid function across the datasets MNIST and fashion_MNIST. In the end, a hybrid CNN-LSTM network, integrating the modified_sigmoid function, is employed to detect brain tumors by utilizing the higher-level features. The accuracy and validation results are impressive, showing no signs of overfitting. When there are no channels in the input, the model achieves a perfect 100% accuracy, and overall accuracy is very high at 99.87% extracted by CNN-LSTM.
In this article, we investigate the spontaneity issue in facial expression sequence generation. Current leading methods in the field are commonly reliant on manually adjusted conditional variables to direct the model ...
详细信息
In this article, we investigate the spontaneity issue in facial expression sequence generation. Current leading methods in the field are commonly reliant on manually adjusted conditional variables to direct the model to generate a specific class of expression. We propose a neural network-based method which uses Gaussian noise to model spontaneity in the generation process, removing the need for manual control of conditional generation variables. Our model takes two sequential images as input, with additive noise, and produces the next image in the sequence. We trained two types of models: single-expression, and mixed-expression. With single-expression, unique facial movements of certain emotion class can be generated;with mixed expressions, fully spontaneous expression sequence generation can be achieved. We compared our method to current leading generation methods on a variety of publicly available datasets. Initial qualitative results show our method produces visually more realistic expressions and facial action unit (AU) trajectories;initial quantitative results using image quality metrics (SSIM and NIQE) show the quality of our generated images is higher. Our approach and results are novel in the field of facial expression generation, with potential wider applications to other sequence generation tasks.
artificial intelligence (AI) hardware is a growing area of research that focuses on implementing specialized hardware chips designed for machine learning, neuralnetworks, and their applications. The AI hardware and r...
ISBN:
(数字)9781638285632
ISBN:
(纸本)9781638285625
artificial intelligence (AI) hardware is a growing area of research that focuses on implementing specialized hardware chips designed for machine learning, neuralnetworks, and their applications. The AI hardware and related chips include the design of efficient processors, memory, and dedicated circuits running AI workloads at extreme efficiency and processing speeds. At the heart of neural network implementations, there are models of neurons that are primarily memory functions capable of learning and adapting to new information. Memory is essential for enabling various learning functions and is inherent in all intelligent beings. Memristors as devices, and the systems built with them, have shown to be of great promise for use in analog neural computing. Every attempt to create an energy-efficient CMOS-based general purpose neural network processor that can compete with human intelligence seems to have failed. Memristive systems and devices are compatible and scalable with CMOS technology and show response behavior to stimuli similar to a biological neuron. This has prompted a closer look at memristive systems in academia and industry through the lens of beyond CMOS technologies, algorithms, and applications. In this monograph, in-memory computing is presented with the memristor as the enabling memory element. The practical memristor device faces several challenges when targeting on-chip implementations. Often, there are conductance variabilities of different forms resulting from device-to-device variability, aging, circuit parasitics, read instabilities, various types of noises, and conductance drifts. This variability and how it can be analysed is introduced, along with the concept of super-resolution for compensating errors in analog computing. The application of memristive processing is also shown through echo-state networks for energy-efficient computing and image filtering processing for RF applications.
暂无评论