The article focuses on the concepts of Cell image Segmentation (CIS) and the gradual introduction of cell counting. Motivated by the rapid development of machine learning (ML) methods, which is carried out in this inv...
详细信息
The article focuses on the concepts of Cell image Segmentation (CIS) and the gradual introduction of cell counting. Motivated by the rapid development of machine learning (ML) methods, which is carried out in this investigation. ML is evolving from theory to practical applications, with deep neural network models extensively used in academia and business for various applications, including image counting and natural language processing. These advancements can greatly influence medical imaging technologies, data processing, diagnostics, and healthcare in general. Main objectives of the research are to provide an overview of biological cell counting methods in microscopic images and to explore deep learning (DL)-based image segmentation approaches. The study expertly describes current trends, cutting-edge learning technologies, and platforms utilized for DL approaches. Cell counting is one of the most researched and challenging subjects in computer vision systems. Academics are increasingly interested in this area due to its real-time applications in biology, biochemistry, medical diagnostics, computer vision-based cell tracking systems for large populations, and stem cell manufacturing. Counting cells in the biological field is beneficial. For instance, the ratio of white blood cells to cancer cells in the blood can help determine the origin of a disease. Biologists also need to count cells within cell cultures to monitor the time-dependent growth of cells during bacterial experiments. Numerous methods for cell counting have been developed, after addressing the challenges with Cell Counting algorithms;the article explores promising future directions in CIS and cell counting research fields.
In agricultural settings, handling of soft fruit is critical to ensuring quality and safety. This study introduces a novel opto-tactile sensing approach designed to enhance the handling and assessment of soft fruit, w...
详细信息
In agricultural settings, handling of soft fruit is critical to ensuring quality and safety. This study introduces a novel opto-tactile sensing approach designed to enhance the handling and assessment of soft fruit, with a case example of strawberries. Our approach utilises a Robotiq 2F-85 gripper equipped with the DIGIT vision-Based Tactile Sensor (vBTS) and attached to a Universal Robot UR10e. In contrast to force-based approaches, we introduce a novel purely image-based processing software pipeline for quantifying localised surface deformations in soft fruit. The system integrates fast and explainable imageprocessing techniques applying image differencing, denoising, K-means clustering for unsupervised classification, morphological operations, and connected components analysis (CCA) to quantify surface deformations accurately. A calibration of the imageprocessing pipeline using a rubber ball showed that the system effectively captured and analysed the rubber ball's surface deformations, benefiting from its uniform elasticity and predictable response to compression. As a soft fruit case example, the imageprocessing pipeline was subsequently applied to strawberries, blueberries, and raspberries, demonstrating that the calibration parameters derived from the rubber ball could effectively assess surface deformations in soft fruits. Despite the complex, nonlinear deformation characteristics inherent to strawberries, blueberries, and raspberries, the pipeline exhibited robust performance, capturing and quantifying subtle surface changes. These findings underscore the system's capacity for precise deformation analysis in delicate materials, offering major potential for further refinement and adaptation. This novel approach of proposing and testing an imageprocessing pipeline lays the groundwork for enhancing the handling and assessment of materials with intricate mechanical properties, paving the way for broader applications in sensitive agricultural and industrial
Soil erosion, primarily driven by water and wind, poses a significant environmental challenge globally, leading to land degradation and geo-hazards. Despite various empirical methods, image analysis, and machine learn...
详细信息
Soil erosion, primarily driven by water and wind, poses a significant environmental challenge globally, leading to land degradation and geo-hazards. Despite various empirical methods, image analysis, and machine learning techniques employed to address this issue, effective mitigation tools remain lacking. This study presents an innovative framework integrating imageprocessing (IP) and machine learning (ML) to enhance the understanding, quantification, and prediction of soil erosion processes. Laboratory flume experiments were conducted to capture erosion images, which were pre-processed using techniques such as Contrast Limited Adaptive Histogram Equalization (CLAHE) to improve image quality. Supervised ML models, including Logistic Regression (LR), K-Nearest Neighbor (KNN), Support vector machine (SvM), Decision Tree (DT), and Random Forest (RF), were applied to classify eroded and non-eroded soil areas. The model's performance was rigorously evaluated using metrics such as precision, recall, and F1-score. The results demonstrated that KNN and RF outperformed other models in predicting soil erosion, with KNN exhibiting the least variation (2.39%) compared to the reference erosion profile. This study underscores the potential of an IP and ML ensemble framework for precise soil erosion quantification and prediction, offering practical applications for erosion mitigation. The open-source code and dataset are available at https://***/mlgeotech/***.
Lying in the cross-section of computer vision and natural language processing, vision language models are capable of processingimages and text at once. These models are helpful in various tasks: text generation from ...
详细信息
Lying in the cross-section of computer vision and natural language processing, vision language models are capable of processingimages and text at once. These models are helpful in various tasks: text generation from image and vice versa, image-text retrieval, or visual navigation. Besides building a model trained on a dataset for a task, people also study general-purpose models to utilize many datasets for multitasks. Their two primary applications are image captioning and visual question answering. For English, large datasets and foundation models are already abundant. However, for vietnamese, they are still limited. To expand the language range, this work proposes a pretrained general-purpose image-text model named visualRoBERTa. A dataset of 600k images with captions (translated MS COCO 2017 from English to vietnamese) is introduced to pretrain visualRoBERTa. The model's architecture is built using Convolutional Neural Network and Transformer blocks. Fine-tuning visualRoBERTa shows promising results on the vivQA dataset with 34.49% accuracy, 0.4173 BLEU 4, and 0.4390 RougeL (in visual question answering task), and best outcomes on the sviIC dataset with 0.6685 BLEU 4, 0.6320 RougeL (in image captioning task).
In this study, an ad-hoc imageprocessing pipeline has been developed and proposed for the purpose of semantically segmenting wheat kernel data acquired through near-infrared hyperspectral imaging (HSI). The Gaussian ...
详细信息
In this study, an ad-hoc imageprocessing pipeline has been developed and proposed for the purpose of semantically segmenting wheat kernel data acquired through near-infrared hyperspectral imaging (HSI). The Gaussian Mixture Model (GMM), characterized as a soft clustering method, has been employed for this task, yielding noteworthy results in both kernel and germ segmentation. A comparative analysis was conducted, wherein GMM was compared with two hard clustering methods, hierarchical clustering and k-means, as well as other common clustering algorithms prevalent in food HSI applications. Notably, GMM exhibited the highest accuracy, with a Jaccard index of 0.745, surpassing hierarchical clustering at 0.698 and k-means at 0.652. Furthermore, the spectral variations observed in wheat kernel topology can be used for semantic image segmentation, especially in the context of selecting the germ portion within the wheat kernels. These findings carry practical significance for professionals in the fields of hyperspectral imaging (HSI) and machinevision, particularly for food product quality assessment and real-time inspection.
The specular reflection of objects is an important factor affecting image display quality, which poses challenges to tasks such as pattern recognition and machinevision detection. At present, specular removal for a s...
详细信息
The specular reflection of objects is an important factor affecting image display quality, which poses challenges to tasks such as pattern recognition and machinevision detection. At present, specular removal for a single real image is a crucial pre-processing step to improve the performance of computer vision algorithms. Despite notable approaches tailored for handling synthesized and pre-simplified images with dark backgrounds, real-time separation of specular reflection for a single real image remains a challenging problem. This paper proposes a novel specular removal method to separate the specular reflection for a single real image accurately and efficiently based on the dark channel prior. Initially, a modified-specular-free (MSF) image is developed using the dark channel prior, which can derive a direct estimation of specular reflection. Next, the image chromaticity spaces are established to represent the pixel intensity. Then, the maximum chromaticity value of the modified MSF image is extracted to guide the filtering of the specular reflection, treating the specular pixels as noise in the chromaticity space. Finally, the image without specular reflection can be obtained using the restored maximum chromaticity value based on the dichromatic reflection model. The superiority of this method is to achieve highquality specular reflection separation quickly without destroying the geometric features of the real image. Compared with the state-of-the-art methods, experimental results show that the proposed algorithm can achieve the best subjective visual effect and satisfactory quantitative performance. In addition, this approach can be implemented efficiently to meet real-time requirements, promising to be applied to computer vision measurement and inspection applications.
Breakout is the most serious production accident in continuous casting and must be detected and predicted by stable and reliable *** sticking region,which forms on the local copper plate and expanded into a"v&quo...
详细信息
Breakout is the most serious production accident in continuous casting and must be detected and predicted by stable and reliable *** sticking region,which forms on the local copper plate and expanded into a"v"shape,is the typical precursor of ***,computer vision technology was exploited to visualize the temperature change rate of the copper plate based on the temperature signals from thermocouples;then,the static and dynamic features of the abnormal sticking region were ***,logistic regression and Adaboost models were used to study and identify these features,resulting in the development of a mold breakout prediction model based on computer vision and machine *** test results demonstrate that the proposed model can effectively distinguish anomalous temperature patterns and considerably reduce false alarms without any missing *** a result,the proposed method could offer valuable insights into the realm of abnormality detection and prediction during continuous casting process.
Embedded computer vision systems are increasingly being adopted across various domains, playing a pivotal role in enabling advanced technologies such as autonomous vehicles and industrial automation. Their cost-effect...
详细信息
Embedded computer vision systems are increasingly being adopted across various domains, playing a pivotal role in enabling advanced technologies such as autonomous vehicles and industrial automation. Their cost-effectiveness, compact size, and portability make them particularly well-suited for diverse implementations and operations. In real-time scenarios, these systems must process visual data with minimal latency, which is crucial for immediate decision-making. However, these solutions continue to face significant challenges related to computational efficiency, memory usage, and accuracy. This research addresses these challenges by enhancing classification methodologies, specifically in Gray Level Co-occurrence Matrix (GLCM) feature extraction and Support vector machine (SvM) classifiers. To maintain a high level of accuracy while preserving performance, a smaller feature set is selected following a comprehensive complexity analysis and is further refined through Correlation-based Feature Selection (CFS). The proposed method achieves an overall classification accuracy of 84.76% with a feature set reduced by 79.2%, resulting in a 72.45% decrease in processing time, a 50% reduction in storage requirements, and up to a 77.8% decrease in memory demand during prediction. These improvements demonstrate the effectiveness of the proposed approach in improving the adaptability and capabilities of embedded vision systems (EvS), optimizing their performance under the constraints of real-time limited-resource environments.
X-ray imaging technology has been used for decades in clinical tasks to reveal the internal condition of different organs, and in recent years, it has become more common in other areas such as industry, security, and ...
详细信息
X-ray imaging technology has been used for decades in clinical tasks to reveal the internal condition of different organs, and in recent years, it has become more common in other areas such as industry, security, and geography. The recent development of computer vision and machine learning techniques has also made it easier to automatically process X-ray images and several machine learning-based object (anomaly) detection, classification, and segmentation methods have been recently employed in X-ray image analysis. Due to the high potential of deep learning in related imageprocessingapplications, it has been used in most of the studies. This survey reviews the recent research on using computer vision and machine learning for X-ray analysis in industrial production and security applications and covers the applications, techniques, evaluation metrics, datasets, and performance comparison of those techniques on publicly available datasets. We also highlight some drawbacks in the published research and give recommendations for future research in computer vision-based X-ray analysis.
The computer vision-based analysis of railway superstructure has gained significant attention in railway engineering. This approach utilises advanced imageprocessing and machine learning techniques to extract valuabl...
详细信息
The computer vision-based analysis of railway superstructure has gained significant attention in railway engineering. This approach utilises advanced imageprocessing and machine learning techniques to extract valuable information from visual data captured in the railway track environment. By analysing images from various sources such as cameras, drones, or sensors, computer vision algorithms can accurately detect and classify different components of the ballast superstructure, including the catenary system support, rail surface and profile, fastening system, sleeper, and ballast layer. This enables the automated assessment of the railway track's condition, stability, and maintenance needs. This paper comprehensively reviews the recent advancements, challenges, and potential applications of computer vision techniques in analysing railway superstructure. It discusses various vision-based methodologies and machine-learning approaches utilised in this context. Furthermore, it examines the benefits and limitations of computer vision-based analysis and presents future research directions for improving its applicability in railway track engineering.
暂无评论