This paper introduces an innovative and efficient solution for achieving precise optical fiber alignment in fiber optic splicing applications. By harnessing the capabilities of FPGA-based real-time imageprocessing, t...
详细信息
This paper introduces an innovative and efficient solution for achieving precise optical fiber alignment in fiber optic splicing applications. By harnessing the capabilities of FPGA-based real-time imageprocessing, the system surpasses the required frame rate of 20 frames per second, ensuring swift and accurate alignment. With a verified resolution of approximately 1 pixel per mu m, the system offers high precision suitable for various applications. Through advanced imageprocessing techniques, the system calculates precise correction values for each frame, resulting in fast and accurate alignment of fiber cores and cladding. This unique feature enhances performance and adaptability, leading to substantial reduction in splice wastage. By integrating motorized alignment stages and guided machinevision, the system provides a robust and automated approach to fiber optic alignment. The FPGA-driven stepper motors enable precise and controlled fiber movements with micrometer-level step resolutions. The calculated correction factors effectively guide the alignment process, with deviations from actual results impressively low - a maximum of 12 mu m in the Y correction factor and 7 mu m in the X correction factor. This reliability makes the system a crucial component in any optical fiber splicing machine. Furthermore, the paper emphasizes the ability to handle specialty fiber alignment contributes significantly to cost-effective manufacturing of fiber optic components and modules. By ensuring precise and optimal fiber alignment, the system mitigates signal loss, leading to improved performance and reduced production costs.
Spatial frequency processing is an essential technology for extracting morphological information from an optical image. Although various Fourier-based flat optical elements have been proposed as spatial-frequency filt...
详细信息
Spatial frequency processing is an essential technology for extracting morphological information from an optical image. Although various Fourier-based flat optical elements have been proposed as spatial-frequency filters to realize imageprocessing, their transfer functions are statically fixed once fabricated, thus limiting the versatile, dynamic functionalities and practical applications. Here, a novel practical tuning strategy is demonstrated to realize switchable spatial-frequency processing for edge-enhanced and bright-field imaging by employing a tunable hydrogel-scalable nanoslide. By utilizing multilayered metallic and hydrogel stacks construction, the nanoslide directly manipulates the optical spatial frequency in the wavevector domain and exhibits opposite imageprocessing at different wavelength channels due to the cavity-induced wavelength-sensitivity. More intriguingly, via controlling the ambient humidity, the angular-dependent optical response of the nanoslide can be effectively tuned for dynamic edge-enhanced imaging due to the hydrogel's inflation from moisture. In addition, the nanoslide is readily fabricated at a large scale and integrated into compact imaging systems, such as a biomicroscope. With the advantages of a high numerical aperture approximate to 0.8, polarization-insensitive, microscopy-compatible, and facile architecture, the proposed hydrogel-based nanoslide can find potential applications in machinevision, real-time imageprocessing, biological imaging, and analog computing.
Within Artificial Intelligence, Deep Learning (DL) represents a paradigm that has been showing unprecedented performance in image and audio processing by supporting or even replacing humans in defect and anomaly detec...
详细信息
Within Artificial Intelligence, Deep Learning (DL) represents a paradigm that has been showing unprecedented performance in image and audio processing by supporting or even replacing humans in defect and anomaly detection. The railway sector is expected to benefit from DL applications, especially in predictive maintenance applications, where smart audio and video sensors can be leveraged yet kept distinct from safety-critical functions. Such separation is crucial, as it allows for improving system dependability with no impact on its safety certification. This is further supported by the development of DL in other transportation domains, such as automotive and avionics, opening for knowledge transfer opportunities and highlighting the potential of such a paradigm in railways. In order to summarize the recent state-of-the-art while inquiring about future opportunities, this paper reviews DL approaches for the analysis of data generated by acoustic and visual sensors in railway maintenance applications that have been published until August 31st, 2021. In this paper, the current state of the research is investigated and evaluated using a structured and systematic method, in order to highlight promising approaches and successful applications, as well as to identify available datasets, current limitations, open issues, challenges, and recommendations about future research directions.
In the field of nuclear fusion fueling, the distribution of fuel beam parameters is of great significance for optimizing injection techniques and for in-depth study of the interaction between particles and plasma. To ...
详细信息
ISBN:
(数字)9798350355413
ISBN:
(纸本)9798350355420
In the field of nuclear fusion fueling, the distribution of fuel beam parameters is of great significance for optimizing injection techniques and for in-depth study of the interaction between particles and plasma. To obtain the beam velocity distribution of the supersonic molecular fueling beam, a U-net model has been applied in our previous research, successfully capturing the mapping relationship between density distribution and velocity distribution, and effectively predicts velocity distributions across a wide range of parameters. To effectively apply this model to the inversion of experimental data, it is imperative to rigorously investigate its noise resistance capabilities. This paper focuses on enhancing algorithmic robustness through noise-induced dataset augmentation. The results suggest that under 20% noise interference, the model can still achieve stable prediction of the velocity distribution of the supersonic molecular beam, providing strong support for subsequent research.
Aerial image segmentation algorithms are used in many scenarios such as traffic warning, road assistance, terrain mapping and military reconnaissance. It has been widely applied in both military and civilian applicati...
详细信息
Automated text-to-image (image synthesis) and image-to-text (image captioning) generation are two of the most challenging and cutting-edge fields of study in Computer vision (CV) in conjunction with Natural Language P...
Automated text-to-image (image synthesis) and image-to-text (image captioning) generation are two of the most challenging and cutting-edge fields of study in Computer vision (CV) in conjunction with Natural Language processing (NLP). The image-to-text synthesis, also known as image Captioning (IC), has numerous applications in visual assistance, machinevision, remote security, healthcare, and remote sensing, among others. Since their origin, the IC frameworks have been constructed utilizing a two-subcomponent pipeline consisting of the visual feature extraction and natural language modeling subcomponents. The recent surge of interest in the applications of sequential deep learning models in machine translation has resulted in the development of efficient language modeling architectures. Deep sequential modeling architectures, such as Recurrent Neural Networks (RNN), Long Short Term Memory (LSTM), and Gated Recurrent Units (GRU), tackle the intricacies of the multi-modular space far better than existing machine learning-based translation approaches. The tremendous growth of deep language modeling architectures in IC necessitates a comprehensive review of its literature. In this survey, we conduct an exhaustive and analytical analysis of the language modeling architectures used in IC frameworks for caption generation, along with their training corpus datasets. We also identify a list of open research issues and potential research areas for future work.
In an age where the Internet is dominated by visual content, the generation of animated captions has become a must. It has always been an interesting study for researchers in the Department of Artificial Intelligence....
详细信息
ISBN:
(数字)9798350370249
ISBN:
(纸本)9798350370270
In an age where the Internet is dominated by visual content, the generation of animated captions has become a must. It has always been an interesting study for researchers in the Department of Artificial Intelligence. Enabling the machine to describe images with the same skillful accuracy as the human has important applications in various fields such as robotic vision, manufacturing, and beyond This project integrates recurrent neural networks with is a topic of contextual parallelism dedicated to extracting features from images Natural Language processing Computer vision and integrating them seamlessly, this research provides insights a it goes further on this interdisciplinary topic. Additionally, annotations for the sample images are created and performed a comparative analysis of different feature extraction and encoder patterns to determine which model provided the highest accuracy and delivered the desired results.
machine Learning Algorithms for Signal and imageprocessing Enables readers to understand the fundamental concepts of machine and deep learning techniques with interactive, real-life applications within signal and ima...
详细信息
ISBN:
(数字)9781119861843;9781119861836
ISBN:
(纸本)9781119861829
machine Learning Algorithms for Signal and imageprocessing Enables readers to understand the fundamental concepts of machine and deep learning techniques with interactive, real-life applications within signal and imageprocessingmachine Learning Algorithms for Signal and imageprocessing aids the reader in designing and developing real-world applications using advances in machine learning to aid and enhance speech signal processing, imageprocessing, computer vision, biomedical signal processing, adaptive filtering, and text processing. It includes signal processing techniques applied for pre-processing, feature extraction, source separation, or data decompositions to achieve machine learning tasks. Written by well-qualified authors and contributed to by a team of experts within the field, the work covers a wide range of important topics, such as:
Speech recognition, image reconstruction, object classification and detection, and text processing
Healthcare monitoring, biomedical systems, and green energy
How various machine and deep learning techniques can improve accuracy, precision rate recall rate, and processing time
Real applications and examples, including smart sign language recognition, fake news detection in social media, structural damage prediction, and epileptic seizure detection
Professionals within the field of signal and imageprocessing seeking to adapt their work further will find immense value in this easy-to-understand yet extremely comprehensive reference work. It is also a worthy resource for students and researchers in related fields who are looking to thoroughly understand the historical and recent developments that have been made in the field.
The phenotype of edible fungus basically relies on visual observation and empirical judgment at present, and there is still a lack of reports on the phenotypic techniques and applications for cultivation of seafood mu...
详细信息
The proceedings contain 37 papers. The special focus in this conference is on Artificial Intelligence and its applications. The topics include: The Hybrid Cardiac Risk Assessment and Prediction Model Using Convolution...
ISBN:
(纸本)9783031843938
The proceedings contain 37 papers. The special focus in this conference is on Artificial Intelligence and its applications. The topics include: The Hybrid Cardiac Risk Assessment and Prediction Model Using Convolutional Neural Networks;deep Learning applications for Malaria Detection and Diagnosis: A Review;Environmental Considerations in the Ethics of AI Adoption in Healthcare: Striving for Sustainable and Responsible Practices;harnessing the Power of Cognitive Computing: Assessing Point-of-Care Decision Support Tools in Oncology Practice;Critical Analysis in Use of AI in Health Care Management;classification and Prediction of Spinal Tuberculosis Disease Using Optimization of Convolution Neural Network Using Spatial and Temporal Constraints;artificial Intelligence for Remote Healthcare in Underserved Areas: Enhancing Access and Quality of Healthcare Delivery;machine Learning Based Skin Cancer Detection and Recognitions Techniques in IoT Environment;validation of a Chronic Kidney Disease Prediction System Using machine Learning Techniques;real-Time Feedback Detection Using Emotion Detection and Facial Recognition;revolutionizing vision Tasks: Unlocking Potential Through Patch-Based Approaches;automated Knee Implant Identification from 2D Templates Using imageprocessing and Artificial Intelligence – An Experimental Approach;crop Analysis and Classification Based on Phenotype Using Ensemble Learning;Classification and Identification with Health Benefit Assessment and Nutrient Profile of Brewed Tea Utilizing Computer vision with ML and DL and Sensory Approaches;enhancing Industrial Automation Flexibility Through Neural Network-Empowered machinevisionapplications.
暂无评论