the existing satellite navigation and positioning technology can’t meet the requirement of target positioning accuracy, and the results are not obtained in time. In order to improve the accuracy and real-time of sate...
详细信息
the scarcity of accurately labeled data critically hampers the usage of deep learning models. this issue is highlighted in areas (e.g., biological sciences) where data annotation results in an expert-demanding, labor-...
详细信息
ISBN:
(数字)9798350376036
ISBN:
(纸本)9798350376043
the scarcity of accurately labeled data critically hampers the usage of deep learning models. this issue is highlighted in areas (e.g., biological sciences) where data annotation results in an expert-demanding, labor-intensive and error-prone task. While state-of-the-art semi-supervised approaches have proven effective in circumventing this limitation, their reliance on pre-trained architectures and large validation sets to deliver effective solutions still poses a challenge. In this work we introduce an iterative contrastive-based meta-pseudo-Iabeling method for training non-pre-trained custom CNN architectures for image classification in conditions of limited labeled and abundant unlabeled data, with no dependency on a validation set. It generates multiple models across a few iterations, which are in turn exploited in an ensemble manner to label the unlabeled data and train a final classifier. Our approach starts by capitalizing on contrastive learning to enhance the representation ability of two collaborative networks while eliminating the need of pre-trained architectures. then, during each iteration, the networks are trained within a teacher-student based cross-training setup, where OPFSemi (teacher) propagates labels from labeled to unlabeled on the non-linear 2D latent space projections of each network's (student) deep features; afterward, the pseudo-labels withthe highest top 10% confidence, per class, are picked to fine-tune the other network in a cross-training manner, jointly mitigating confirmation bias and overfitting while improving the generalization ability of the networks as iterations evolve. Our method is evaluated on three challenging biological image datasets with only 5 % of labeled samples, demonstrating its effectiveness and robustness when compared to two direct baselines and six state-of-the-art methods from three different semi-supervised learning paradigms.
During a football match, the information is manually collected by humans. However, the correctness of the football match data is difficult to check because of the game's speed, and thus, human errors can occur. th...
详细信息
ISBN:
(数字)9781728156316
ISBN:
(纸本)9781728156323
During a football match, the information is manually collected by humans. However, the correctness of the football match data is difficult to check because of the game's speed, and thus, human errors can occur. this paper presents an automatic football match event detection from the scoreboard using a deep learning algorithm. the proposed method can reduce human error and performs the detection faster. In this study, the detection was trained with 30,000 data of Goals, Substitutions and Cards scoreboard from 68 matches of English Premier League 2017-2018 broadcast videos. the detection was tested with 80 sub-testing videos. these videos were prepared from 20 full matches broadcast videos, which consisted of 12 full matches from the year 2017-2018 and 8 full matches from the year 2018-2019. the proposed method contains three main steps: data gathering and augmentation, object detection for scoreboard visualization forms, and the event classification. the scoreboard detection is performed with a Single-Shot MultiBox Detector. the event classification employs the majority vote and time frame technique. the experimental results show an accuracy rate of 1.00 withthe expected event scoreboards, comprised of Goal, Substitution, and Card events.
Virtual Humans (VHs) with high levels of anthropomorphism in visual appearance and behavior can enhance user experience in movies, games, and other interactive media. Users often seek human-like representations that i...
详细信息
ISBN:
(数字)9798350376036
ISBN:
(纸本)9798350376043
Virtual Humans (VHs) with high levels of anthropomorphism in visual appearance and behavior can enhance user experience in movies, games, and other interactive media. Users often seek human-like representations that include realistic movement, emotion, and gender, fostering a stronger sense of identification with VHs. Studies from Psychology have shown that people tend to evaluate characteristics of others within their own group differently, known as the in-group advantage. For example, women may be better at recognizing emotions in other women than in men. Researchers have also noted differences in feature recognition based on boththe gender of the person and the gender of the perceiver, a phenomenon that also extends to VHs. Understanding how humans perceive VHs is crucial for improving user experience and representation in virtual environments. Gender is a key anthropomorphic characteristic in VHs, essential for representativeness, human identification, and user comfort. Typically, VHs assigned a specific gender exhibit stereotyped features such as movements, clothing, hairstyles, and colors, designed to be easily recognized by users. Insights into gender representation in VHs can guide the industry in modeling and animating VHs to achieve the desired impact. this tutorial introduces a methodology for modeling gender in VHs, from genderless virtual babies to genderless virtual adults, focusing on visual and behavioral aspects. Participants will learn how to create genderless VHs, enhancing gender representation and making virtual environments more inclusive for a diverse audience.
Convolutional Neural Networks (CNNs) have been widely used in many computer applications. the growth in deep neural networks and machine learning applications has resulted in the state-of-the-art in CNN architectures ...
详细信息
ISBN:
(数字)9781728147703
ISBN:
(纸本)9781728147710
Convolutional Neural Networks (CNNs) have been widely used in many computer applications. the growth in deep neural networks and machine learning applications has resulted in the state-of-the-art in CNN architectures becoming more and more complex. Millions of multiply-accumulate (MACC) operations are needed in this kind of processing. To deal withthese massive computing requirements, accelerating CNNs on FPGAs has become a viable solution for balancing power efficiency and processing speed. In this paper, we propose an approximate high-speed implementation of the convolution stage of a CNN computing architecture, the Approximate Multiply-Accumulate Array. Compared withthe traditional multiply-accumulate operation, this implementation converts multiplications into additions and systolic accumulate operations. A key feature is the logarithmic addition with iterative residual error reduction stages which, in principle, allows to trade off power, area and speed with accuracy through for specific data using different configurations. Here, we present experiments where we configure the approximate multiplier in different ways, changing number of iteration stages as well as the bit width of the data and investigate the impact on overall accuracy. In this paper we present initial experiments evaluating the architecture's error using random input data, and Sobel Edge detection is used to investigate the proposed architecture with regard to its use in image-processing CNNs. the experimental results show that the proposed approximate architecture is up to 10.7% faster than a competitive FPGA implementation of an exact multiplier when running the convolution kernel over a test image, and that residual errors after two iterations reach 1.6% for 8-bit inputs and 0.001% for 12-bit inputs on average, based on 10,000 random samples.
In recent years, significant advancements have been made in the field of anomaly detection. Expanding anomaly detection tasks to new scenarios has become a development avenue. Anomaly detection for the outer packaging...
详细信息
ISBN:
(数字)9798350356670
ISBN:
(纸本)9798350356687
In recent years, significant advancements have been made in the field of anomaly detection. Expanding anomaly detection tasks to new scenarios has become a development avenue. Anomaly detection for the outer packaging of supermarket goods is a challenging task due to the diversity of goods’ appearances, which results in some detection methods suffering from limited sensitivity to anomalies and inadequate target mining. To address these issues, we propose a novel unsupervised anomaly detection method based on multi-branch feature extraction and foreground cosine structure mining strategy. Firstly, we use anomaly synthesis technique to enhance the model’s perception of anomalies and construct more precise decision boundaries in the feature space. Secondly, a multi-branch feature extractor extracts and integrates multi-level and multi-scale features, providing rich texture and semantic information to improve the model’s adaptability in real-world scenarios. Finally, the foreground cosine loss focuses on target mining while supplementing local structural information, further enhancing the model’s ability to detect subtle anomalies. To validate the effectiveness of the proposed method, extensive experiments were conducted on the GoodsAD dataset, sampled from supermarket scenarios. Experimental results show that this method achieves significant performance in image-level detection and pixel-level localization.
At present, the rapid development of information technology has promoted the diversification of education methods. Online education, as an important supplement to traditional education and an important means of educat...
详细信息
ISBN:
(数字)9798350356670
ISBN:
(纸本)9798350356687
At present, the rapid development of information technology has promoted the diversification of education methods. Online education, as an important supplement to traditional education and an important means of education reform, has become an integral part of the education and teaching process. However, the long-term development of online education still faces many problems, such as difficulties in grasping students' classroom learning states, poor interaction between teachers and students, and difficulty in ensuring the teaching effect. thanks to the development of deep learning technology and the availability of large-scale facial expression databases, significant progress has been made in the field of facial expression recognition. In this paper, we propose a teaching platform combined with artificial intelligence. this platform uses CNN (Convolutional Neural Network) to recognize students' classroom behavior, specifically their facial expressions, to analyze their learning states and provide timely feedback on classroom teaching effects. the platform can also read video, use advanced imageprocessing and pattern recognition to find expressions and gestures in the video, and mark them, thereby recognizing students' classroom statuses and monitoring their listening statuses. this platform can also generate student behavior reports and visual statistical charts to help teachers better understand students' learning situations and progress, provide teachers with accurate and real-time teaching feedback, help teachers adjust teaching strategies and methods, and improve teaching quality. Finally, after the preliminary test, it is confirmed that the platform has met the initial design requirements and has high identification accuracy and strong data analysis abilities.
暂无评论