Light-sheet fluorescence microscopy (LSFM) is a minimally invasive and high throughput imaging technique ideal for capturing large volumes of tissue with sub-cellular resolution. A fundamental requirement for LSFM is ...
详细信息
Light-sheet fluorescence microscopy (LSFM) is a minimally invasive and high throughput imaging technique ideal for capturing large volumes of tissue with sub-cellular resolution. A fundamental requirement for LSFM is a seamless overlap of the light-sheet that excites a selective plane in the specimen, with the focal plane of the objective lens. However, spatial heterogeneity in the refractive index of the specimen often results in violation of this requirement when imaging deep in the tissue. To address this issue, autofocus methods are commonly used to refocus the focal plane of the objective-lens on the light-sheet. Yet, autofocus techniques are slow since they require capturing a stack of images and tend to fail in the presence of spherical aberrations that dominate volume imaging. To address these issues, we present a deeplearning-based autofocus framework that can estimate the position of the objective-lens focal plane relative to the light-sheet, based on two defocused images. This approach outperforms or provides comparable results with the best traditional autofocus method on small and large image patches respectively. When the trained network is integrated with a custom-built LSFM, a certainty measure is used to further refine the network's prediction. The network performance is demonstrated in real-time on cleared genetically labeled mouse forebrain and pig cochleae samples. Our study provides a framework that could improve light-sheet microscopy and its application toward imaging large 3D specimens with high spatial resolution. (c) 2021 Optical Society of America under the terms of the OSA Open Access Publishing Agreement
In recent years, major capability improvements at synchrotron beamlines have given researchers the ability to capture more complex structures at a higher resolution within a very short time. This opens up the possibil...
详细信息
In recent years, major capability improvements at synchrotron beamlines have given researchers the ability to capture more complex structures at a higher resolution within a very short time. This opens up the possibility of studying dynamic processes and observing resulting structural changes over time. However, such studies can create a huge quantity of 3D image data, which presents a major challenge for segmentation and analysis. Here tomography experiments at the Australian synchrotron source are examined, which were used to study bread dough formulations during rising and baking, resulting in over 460 individual 3D datasets. The current pipeline for segmentation and analysis involves semi-automated methods using commercial software that require a large amount of user input. This paper focuses on exploring machine learning methods to automate this process. The main challenge to be faced is in generating adequate training datasets to train the machine learning model. Creating training data by manually segmenting realimages is very labour-intensive, so instead methods of automatically creating synthetic training datasets which have the same attributes of the original images have been tested. The generated synthetic images are used to train a U-Net model, which is then used to segment the original bread dough images. The trained U-Net outperformed the previously used segmentation techniques while taking less manual effort. This automated model for data segmentation would alleviate the time-consuming aspects of experimental workflow and would open the door to perform 4D characterization experiments with smaller time steps.
Poor treatment outcomes result from the fact that oral cancer is frequently identified at an advanced stage, making it a serious and sometimes lethal disease. Thus, it is essential to develop quick and accurate proced...
详细信息
With the increasing demand for computing speed and real-time data processing in various fields, deeplearning and convolutional neural networks are more and more widely used in the field of computer vision. FPGA-based...
详细信息
Rail surface defect inspection is a vital element for ensuring railway safety. We critically analyze the application of deep convolutional neural networks (CNN) in Rail Surface Defect Perception (RSDP). We scrutinize ...
Rail surface defect inspection is a vital element for ensuring railway safety. We critically analyze the application of deep convolutional neural networks (CNN) in Rail Surface Defect Perception (RSDP). We scrutinize 43 studies, revealing how CNN, YOLO, R-CNN, and Semantic Segmentation techniques, have revolutionized RSDP by surpassing traditional inspection methods in speed, accuracy, and efficiency. Our examination underscores the necessity for comprehensive and standardized datasets to support the varied and complex nature of rail surface defects. Additionally, we highlight the persistent challenges such as limited defect samples and the imperative for real-time detection capabilities. The paper details the performance and adaptation of various CNN models for RSDP, charting a course for future inquiry. We advocate for enhanced datasets, unified defect classification, and refined deeplearning models, aiming to bolster the progress in CNN-driven RSDP technology for enhanced railway safety.
This paper addresses the problem of sheet-image-based on-line audio-to-score alignment also known as score following. Drawing inspiration from object detection, a conditional neural network architecture is proposed th...
详细信息
ISBN:
(纸本)9789082797060
This paper addresses the problem of sheet-image-based on-line audio-to-score alignment also known as score following. Drawing inspiration from object detection, a conditional neural network architecture is proposed that directly predicts x,y coordinates of the matching positions in a complete score sheet image at each point in time for a given musical performance. Experiments are conducted on a synthetic polyphonic piano benchmark dataset and the new method is compared to several existing approaches from the literature for sheet-image-based score following as well as an Optical Music Recognition baseline. The proposed approach achieves new state-of-the-art results and furthermore significantly improves the alignment performance on a set of real-world piano recordings by applying Impulse Responses as a data augmentation technique.
Segmentation of a touching component to separate its constituent text and nontext parts is always a very crucial but challenging task toward developing a comprehensive document imageprocessing (DIP) system. This is b...
详细信息
Segmentation of a touching component to separate its constituent text and nontext parts is always a very crucial but challenging task toward developing a comprehensive document imageprocessing (DIP) system. This is because, irrespective of document types, either printed or handwritten, the nontext parts need to be suppressed first before processing the text parts through an optical character recognition (OCR) system. Although a good number of attempts have been made to address this issue for printed documents, the same for regular handwritten document images is almost none. However, the appearance of touching components where a nontext part gets joined with a text part is a common issue in freestyle handwriting. To this end, in this work, we tailor-make a generative adversarial network (GAN)-based model with a suitable loss function that we name tsegGAN. We also prepare an in-house data set by collecting touching components from different real-world handwritten documents to evaluate our model. The performance comparison of our model with state-of-the-art GAN models shows that tsegGAN has outperformed the others with a significant margin.
As an important branch in the field of imageprocessing, target detection requires more and more real-time algorithm with its wide application in industry, agriculture, medical and military industries, which also puts...
详细信息
deeplearning methods have shown a promising approach to reliable automated pavement condition survey in recent years. However, the training of models requires large quantities of annotated data, which is normally tim...
详细信息
ISBN:
(数字)9780784483503
ISBN:
(纸本)9780784483503
deeplearning methods have shown a promising approach to reliable automated pavement condition survey in recent years. However, the training of models requires large quantities of annotated data, which is normally time consuming, expensive, and sometimes difficult to obtain. This research aims to explore the viability of using synthetic pavement image data to train convolutional neural networks (CNNs) for automated pavement crack detection. A procedural approach of generating synthetic pavement crack image data is proposed. Perlin noise is adopted to mimic the real-world cracks, and simple textures are used to control the generated crack type. Mask R-CNN is used to train on the synthetic data developed in this study. Both synthetic and real data sets are used to evaluate the performance of the trained model. The results indicate that training a crack detection model using only synthetic data can reach almost the same level of accuracy as using the real data.
image recognition, powered by machine learning (ML), has significantly advanced applications in both dance movement recognition and robotic vision. This review examines key ML techniques, including Convolutional Neura...
详细信息
ISBN:
(数字)9798350356755
ISBN:
(纸本)9798350356762
image recognition, powered by machine learning (ML), has significantly advanced applications in both dance movement recognition and robotic vision. This review examines key ML techniques, including Convolutional Neural Networks (CNNs), deep Neural Networks (DNNs), Self-Organizing Maps (SOMs), and Long Short-Term Memory (LSTM) networks, alongside pose estimation methods like OpenPose and Part Affinity Fields (PAFs). These techniques enhance dance classification, real-time feedback, and motion analysis, with OpenPose + LSTMs and PAFs + LSTMs demonstrating the highest accuracy. Notwithstanding progress, obstacles such as high computational costs, data dependency, and real-time implementation challenges persist. Beyond dance, these methods are critical in robotic vision, intelligent automation, and industrial imageprocessing, enabling autonomous robotic navigation, defect detection in manufacturing, and AI-driven motion tracking. By leveraging human movement analysis for robotics, ML improves human-robot interaction, robotic-assisted rehabilitation, and industrial automation. Despite progress, challenges such as high computational demands, data dependency, and real-time constraints remain. This review explores future directions, including multimodal data fusion, hybrid AI models, and real-time optimization, bridging the gap between AI-driven motion systems and intelligent automation to enhance adaptability and efficiency across domains.
暂无评论