This article studies the three-dimensional (3D) image analysis and sports training methods of sports technical characteristics. This research uses the current sports technology diagnostic 3D video analysis system as a...
详细信息
This article studies the three-dimensional (3D) image analysis and sports training methods of sports technical characteristics. This research uses the current sports technology diagnostic 3D video analysis system as a platform to build a database and knowledge base based on athletes' 3D sports information and sports parameters and uses algorithms based on artificial intelligence machine-learning machines to analyze sports data, learn from it, and learn from sports technology. Actions make analytical decisions and predictions. Then, it analyzes the human-motion behavior with the concept of traditional and virtual reality technology. The effectiveness of athletes' technical movements, using mathematical statistics, artificial intelligence, and other research methods, integrates and draws on the research methods of sports biomechanics, graphical imaging, human anatomy, expert systems, and neuralnetworks. A neural network not only inherits certain characteristics of biology but also has its own unique characteristics, such as large-scale parallel processing, strong fault tolerance, and self-learning functions. neuralnetworks have a wide range of applications in information processing, pattern recognition, optimization, and other issues. By analyzing the application status of artificial intelligence technology in sports, the development prospects of sports training based on artificial intelligence can be inferred. Based on the acquisition of sports-related data, the evaluation of functional action modes, sports techniques, etc., is established. The multi-target feedback training method ultimately helps athletes improve their training level. Experimental data show that for the human body walking toward the camera, the rotation angle between adjacent frames is close to 0 degrees, and the translational position is basically 5 cm. The experimental results show that 3D image analysis and related sports training methods based on specific sports technical characteristics are
Instrument tone recognition systems have over time had the highest application value and significance in information retrieval. Notably, the traditional systems and methods often rely on convolutional neuralnetworks ...
详细信息
Executing deep neuralnetworks (DNN) based vision tasks on edge devices encounters challenging scenarios of significant and continually evolving data domains (e.g. background or subpopulation shift). With limited reso...
详细信息
Executing deep neuralnetworks (DNN) based vision tasks on edge devices encounters challenging scenarios of significant and continually evolving data domains (e.g. background or subpopulation shift). With limited resources, the state-of-the-art domain adaptation (DA) methods either cause high training overheads on large DNN models, or incur significant accuracy losses when adapting small/compressed models in an online fashion. The inefficient resource scheduling among multiple applications further degrades their overall model accuracy. In this paper, we present ElasticDNN, a framework that enables online DNN remodeling for applications encountering evolving domain drifts at edge. Its first key component is the master-surrogate DNN models, which can dynamically generate a small surrogate DNN by retaining and training the large master DNN's most relevant regions pertinent to the new domain. The second novelty of ElasticDNN is the filter-grained resource scheduling, which allocates GPU resources based on online accuracy estimation and DNN remodeling of co-running applications. We fully implement ElasticDNN and demonstrate its effectiveness through extensive experiments. The results show that, compared to existing online DA methods using the same model sizes, ElasticDNN improves accuracy by 23.31% and reduces adaption time by 35.67x. In the more challenging multi-application scenario, ElasticDNN improves accuracy by an average of 25.91%.
作者:
Yin, ShiLiu, HuiCent South Univ
Sch Traff & Transportat Engn Inst Artificial Intelligence & Robot IAIR Key Lab Traff Safety TrackMinist Educ Changsha 410075 Hunan Peoples R China
image dehazing is critical for enhancing image quality in applications such as autonomous driving, surveillance, and remote sensing. This paper presents an innovative image dehazing model based on a multi-branch and m...
详细信息
image dehazing is critical for enhancing image quality in applications such as autonomous driving, surveillance, and remote sensing. This paper presents an innovative image dehazing model based on a multi-branch and multi scale feature fusion network that leverages spatial and frequency information. The model features a multi-branch architecture that combines local and global features through depthwise separable convolutions and state space models, effectively capturing both detailed and comprehensive information to improve dehazing performance. Additionally, a specialized module integrates spatial and frequency domain information by utilizing convolutional layers and Fourier transforms, enabling comprehensive haze removal through the fusion of these two domains. A feature fusion mechanism incorporates channel attention and residual connections, dynamically adjusting the importance of different channel features while preserving the global structural information of the input image. Furthermore, this is the first model to combine Mamba and convolution layers for driving scene image dehazing, achieving global feature extraction with linear complexity. Each image is processed in only 0.030 s, with a frame rate of 32.41 FPS and a processing efficiency of 67.96 MPx/s, ensuring high efficiency suitable for real-time applications. Extensive experiments on real-world foggy driving scene datasets demonstrate the superior performance of the proposed method, providing reliable visual perception capabilities and significantly improving adaptability and robustness in complex environments.
The most prevalent form of cancer globally is breast cancer, which predominantly impacts women. Early detection ensures successful treatment of breast cancer, significantly improving patients' survival chances. Va...
详细信息
ISBN:
(纸本)9798331541859;9798331541842
The most prevalent form of cancer globally is breast cancer, which predominantly impacts women. Early detection ensures successful treatment of breast cancer, significantly improving patients' survival chances. Various imaging modalities, including mammography and ultrasound, are utilized for breast cancer screening. Incorporating new technologies is essential for better patient management, particularly for those with malignant masses. artificial intelligence can assist radiologists by training neuralnetworks to detect breast lesions on mammograms or ultrasounds using deep learning techniques. In this article, the YOLOv9 network is trained on two public ultrasound databases, UDIAT and BUSIS. The network successfully localized malignant and benign masses with a precision of 83%, a recall of 82%, and a mAP of 87% in the UDIAT dataset. In the BUSIS dataset, our model achieved a precision of 75%, a recall of 88%, and a mAP of 90%. Furthermore, we used real Moroccan cases to evaluate the model's performance.
Recommender systems aim to improve the user experience in a world where data and available alternatives are expanding at an unprecedented rate. Integrating Natural Language processing and artificialneuralnetworks ha...
详细信息
ISBN:
(纸本)9783031624940;9783031624957
Recommender systems aim to improve the user experience in a world where data and available alternatives are expanding at an unprecedented rate. Integrating Natural Language processing and artificialneuralnetworks have resulted in better performance when compared to other recommender systems. This paper showcases the optimization of an artificialneural network-based recommender system that is used for drug recommendation, where the optimization process involves adopting ResNet-50 and a Multiple Criteria Decision Making-based recommender system to tune the learning rate of the neural network models on which the system is based. Results show that our proposed approach leads to a system that outperforms the existing similar systems.
The existing learning-based image dehazing methods usually adopt the encoder-decoder architecture with convolutional neuralnetworks to estimate latent haze-free images from hazy images. However, the limited receptive...
详细信息
The existing learning-based image dehazing methods usually adopt the encoder-decoder architecture with convolutional neuralnetworks to estimate latent haze-free images from hazy images. However, the limited receptive field of convolutional neuralnetworks may not effectively capture structure-level information, causing the model to be unable to the haze density. To solve this problem, this paper proposes a bi-decoder structure with a dense non-pooling encoder to enhance the structural features that are closely related to the haze density. Compared with conventional methods, the main advantage of our method is the integration of an additional coarse decoder in the encoder-decoder architecture, where a hybrid feature convolution (HFC) block is utilized to enlarge the receptive field to extract the structure of the image. Besides the difference in the network, the inputs of the fine and coarse decoders are non-pooling and pooling respectively. Moreover, a multi-scale feature attention (MSFA) module is proposed to selectively enhance the haze-relevant feature outputs of fine and coarse decoders. Experimental results on synthetic and real-world datasets demonstrate that the proposed method outperforms most state-of-the-art methods in terms of image quality and quantitative metrics. Especially in the NH-HAZE dataset, its PSNR exceeds other methods by more than 2.13 dB. In the end, this paper applies this dehazing technology to object detection. The code of this paper and data are available online at https://***/Qiaoyu-K/Bi-Decoder-Dehazing.
Cyberbullying on social media using hate speech in text is applying depreciatory dialect in message dispatches on online forums to abuse, defile, as well as ill-treat recipients. A report by the New Indian Express sta...
详细信息
Cyberbullying on social media using hate speech in text is applying depreciatory dialect in message dispatches on online forums to abuse, defile, as well as ill-treat recipients. A report by the New Indian Express stated that 93% of Indian children were subordinated to cyberbullying out of which 45% were bullied by strangers and 48% were bullied by people known to them. The existing system uses feature extraction using count vectorizer with Support Vector Machine classifier to give an accuracy of 94.78%. The proposed system uses feature extraction using tokenization and padding with artificialneuralnetworks Classifier to achieve an accuracy of 95.85%. The system examines the text content of social media dispatches using Natural Language processing through artificialneuralnetworks.
With the rise of deep learning, the cross collision between artificial intelligence and art, represented by image style transfer, has attracted high attention in the fields of graphic and image technology and art. Bas...
详细信息
Computer vision tasks, such as object recognition, using deep learning find their place in a variety of contexts including agriculture. Regarding data, the coupling of RGB and depth modalities has already proven to be...
详细信息
ISBN:
(纸本)9798331541859;9798331541842
Computer vision tasks, such as object recognition, using deep learning find their place in a variety of contexts including agriculture. Regarding data, the coupling of RGB and depth modalities has already proven to be beneficial for object recognition over the use of RGB-only images. However, the lack of neural network architectures and large-size datasets dedicated to the depth modality forces us to use backbones pre-trained on RGB data using large datasets such as imageNet. While works proposed by Eitel et al. and Aakerberg et al. rely on colorizing the depth values to match an RGB format, they do not take full advantage of the geometric properties carried by the depth modality. We demonstrated principal curvatures when used to color-encode the depth values retain more information related to the object's shape. The proposition was evaluated on the Washington RGB-D dataset and gave mitigated results mainly explained by a high confusion between similarly shaped objects, which represent an important fraction of the dataset. With the introduction of superclasses based on the geometric shape of objects (sphere, cylinder, cube,...) our model performed higher than the previous work, e.g. 3.1% precision increase for the sphere superclass. While presenting some limitations, this work opens the path for further developments.
暂无评论