Interactive robots are intelligent auxiliary tools that can monitor sports events in real-time and provide entertainment information services. This article studies the application of entertainment interactive robots b...
详细信息
Interactive robots are intelligent auxiliary tools that can monitor sports events in real-time and provide entertainment information services. This article studies the application of entertainment interactive robots based on deeplearning in the referee assistance mode of sports competitions. The system first uses a camera to capture real-timeimages of volleyball matches, then preprocesses the images using imageprocessing algorithms, and uses deeplearning algorithms to recognize and track the balls and players in the images. By training the model, the system can accurately determine key information such as player actions. deeplearning technology is used to train interactive entertainment robots to identify and analyze key decision events in games. Through image recognition, action analysis and rule matching, the robot monitors the game process in realtime and determines the referee's possible errors in the decision. The system generates the penalty results based on the penalty rules and competition rules, and displays them to the referee and audience through display screens or sound prompts. After experimental verification, the volleyball match referee judgment assistance system based on imageprocessing and deeplearning has performed excellently in terms of accuracy and speed. Compared to manual referees, the system can identify and track the ball and players more quickly, reduce the possibility of misjudgments, and improve the fairness of the game.
The medical image is a set of all organizations, institutions, and resources whose primary goal is to improve health. The extensive growth of medical data increases the utility of machine learning and deeplearning in...
详细信息
deeplearning models have been a huge success in image recognition which hence can be used for the purpose of text generation. In the field of imaging science, captioning images and videos is regarded as an intellectu...
详细信息
ISBN:
(数字)9798350391770
ISBN:
(纸本)9798350391787
deeplearning models have been a huge success in image recognition which hence can be used for the purpose of text generation. In the field of imaging science, captioning images and videos is regarded as an intellectually difficult job. Visual Geometry Group (VGG); is a standard deep Convolutional Neural Network (CNN) architecture with multiple layers, specifically focusing on the integration of CNN for image feature extraction. Exploring this underlying method, the use of another model is essential for caption generation. Here the Recurrent Neural Network (RNN) comes in use for caption generation from the extracted features. Models named Long Short-Term Memory (LSTM) based on RNN and Bidirectional encoder representation transformer (BERT) based on Transformers have been prominent in ensuring accurate results. The Flicker8k dataset is used which provides a variety of information useful for model training. By testing validation data along with evaluation metrics, we analyze the effectiveness of different models to create consistent and descriptive headlines. Extending our inquiry to encompass title generation using transformer models, while also exploring learning techniques for real-time title generation and delivery using the Open-CV library available in Python to get the output from the camera and display it on screen. The result shows that the LSTM is the best model for captioning, with an accuracy of 65.07% at the epochs of 300 and the BERT model has an accuracy of 31% at the epochs of 2. The findings of this study not only contribute to advancing subtitle enhancement methodologies but also broaden the potential applications of deeplearning techniques in this domain.
The incorporation of distributed deeplearning for medical imageprocessing in cloud settings is the subject of this study. The findings demonstrate the high viability and significant performance advantages realized b...
The incorporation of distributed deeplearning for medical imageprocessing in cloud settings is the subject of this study. The findings demonstrate the high viability and significant performance advantages realized by cloud-based distributed systems, notably significant processingtime savings, outstanding diagnostic accuracy, as well as improved scalability. The consequences for security and privacy have been discussed, with a focus on effective safeguards for private medical information. There is a void in the literature about resource and cost-effectiveness optimization tactics used in cloud-based systems. Future research must concentrate on resource optimization tactics for economic sustainability, study developing security risks and privacy techniques, and incorporate real-world implementations in order to improve this topic. This study informs the use of distributed deeplearning in cloud-based medical imageprocessing as well as adds to the body of knowledge in healthcare technology.
In order to improve the intellective level of water resources management, a real-time water level recognition method based on deep-learning algorithms and image-processing techniques is proposed in this paper. The rec...
详细信息
Recent advances in camera design and imaging technology have enabled the capture of high-quality images using smartphones. However, due to the limited dynamic range of digital cameras, the quality of photographs captu...
详细信息
Recent advances in camera design and imaging technology have enabled the capture of high-quality images using smartphones. However, due to the limited dynamic range of digital cameras, the quality of photographs captured in environments with highly imbalanced lighting often results in poor-quality images. To address this issue, most devices capture multi-exposure frames and then use some multi-exposure fusion method to merge those frames into a final fused image. Nevertheless, most traditional and current deeplearning approaches are unsuitable for real-time applications on mobile devices due to their heavy computational and memory requirements. We propose MobileMEF, a new method for multi-exposure fusion based on an encoder-decoder deeplearning architecture with efficient building blocks tailored for mobile devices. This efficient design makes MobileMEF capable of processing 4K resolution images in less than 2 s on mid-range smartphones. MobileMEF outperforms state-of-the-art techniques regarding full-reference quality measures and computational efficiency (runtime and memory usage), making it ideal for real-time applications on hardware-constrained devices. Our code is available at: https://***/LucasKirsten/MobileMEF.
The proceedings contain 27 papers. The topics discussed include: fast multi-modal reuse: co-occurrence pre-trained deeplearning models;deeplearning for fast super-resolution reconstruction from multiple images;an ef...
ISBN:
(纸本)9781510626577
The proceedings contain 27 papers. The topics discussed include: fast multi-modal reuse: co-occurrence pre-trained deeplearning models;deeplearning for fast super-resolution reconstruction from multiple images;an efficient algorithm for fast block matching motion estimation using an adaptive threshold scheme;low exposure image frame generation algorithms for feature extraction and classification;parallel image and video self-recovery scheme with high recovery capability;learning optimal actions with imperfect images;CNN classification based on global and local features;kalman-based motion estimation in video surveillance systems for safety applications;and recent advances in integrated photonic-electronic technologies for high-speed processing and communication circuits for light-based transducers.
One of the most important occupations in India is agriculture. Out of all the crops, cotton is the best and is crucial to the agricultural economy of the country. In India, 40-50 million people work in the cotton trad...
详细信息
ISBN:
(数字)9798350319019
ISBN:
(纸本)9798350319026
One of the most important occupations in India is agriculture. Out of all the crops, cotton is the best and is crucial to the agricultural economy of the country. In India, 40-50 million people work in the cotton trade and processing, while six million farmers directly depend on the crop. The cotton leaf disease has grown in importance over the last few decades, resulting in losses to crops, farming operations, and financial resources. To achieve this aim, we first need to acquire different images of cotton plants. We can use imageprocessing techniques to analyze dead leaf images and extract features like color, texture, and other characteristics with the deep CNN model’s assistance. In addition to being less expensive and more straightforward, automatic disease detection supports machine vision, which offers image-based automated process control and inspection. To properly train the algorithm, we will be using a dataset of approximately 1752(approximately 440 images in each class) images classified into different categories according to the diseases. This model will be developed using tools present in Anaconda such as Jupyter Notebook, Spyder etc. The results of this project will demonstrate whether using it in real-time applications is feasible and whether traditional or manual disease and pest identification could benefit from the use of IT- based solutions.
Cardiovascular diseases (CVD) seriously threaten the health of human beings, and they have caused widespread concern in recent years. At present, the diagnosis of CVD is mainly conducted by computed tomography (CT), e...
详细信息
Cardiovascular diseases (CVD) seriously threaten the health of human beings, and they have caused widespread concern in recent years. At present, the diagnosis of CVD is mainly conducted by computed tomography (CT), echocardiography and nuclear magnetic resonance (NMR) technologies. NMR imaging technology is widely used in medical applications owing to its characteristics of high resolution and very low radiation. However, manual NMR image segmentation is time-consuming and error-prone, which has led to the research on automatic NMR image segmentation technologies. Researchers tend to explore the ventricular NRM image segmentation to improve the accuracy of CVD diagnosis. In this study, based on deeplearning technology, we propose a layered Mask R-CNN segmentation method to segment ventricular NMR images. The experimental results show that the mean dice metrics (DM) of left ventricular segmentation and right ventricular segmentation are 0.92 and 0.89, and the Hausdorff distance (HD) metrics are 4.78 mm and 7.03 mm. Our research indicates that the proposed novel method has great potential to automate the ventricular NMR image segmentation. We also discuss the automatic abnormal ventricular systolic function detection method based on the proposed layered segmentation model.
The coffee industry contributes to the economic restructuring of many countries, often associated with a closed process from production to consumption. The green coffee bean grading standard provided by the Specialty ...
详细信息
The coffee industry contributes to the economic restructuring of many countries, often associated with a closed process from production to consumption. The green coffee bean grading standard provided by the Specialty Coffee Association (SCA) is one of the best methods for grading coffee beans. Traditionally, the assessment of quality and classification of coffee beans relies on visual examination, which demands significant time and effort and is easily inaccurate. deeplearning technology, characterized by precision, velocity, and veracity, can be adopted to empower the reduction of human labor and improve the productivity, quality, and efficiency of these tasks. Therefore, this paper aims to address these issues by implementing deeplearning to classify coffee bean quality in realtime by integrating the system with a cloud-based solution. First, imageprocessing and data augmentation techniques are employed to handle the coffee bean image data. Subsequently, the model is trained using YOLOv8, a framework for object recognition, and OpenCV, an open-source imageprocessing technology, to classify coffee beans. Finally, an application is developed for real-time video and image-streaming coffee bean recognition using React Native, NodeJS, and Python. The experimental results provide empirical evidence that our system enhances accuracy and efficiency in the tasks of classifying coffee bean quality in nine distinct varieties of coffee beans, with the time required reduced to a mere 1 to 3 seconds. Our system can be a useful solution for coffee producers, processors, and traders without relying on stationary equipment, especially in large farms or warehouses.
暂无评论