Due to its superior performance and fewer parameters, CAM++ has become the state-of-the-art model for speaker verification tasks. This model uses 2D convolutional blocks to extract front-end features, which are then f...
ISBN (digital): 9798350330649
ISBN (print): 9798350330656
The utilization of data analytics to gain insights into the game of basketball has seen a remarkable surge in the past decade. Leagues such as the National Basketball Association are continuously exploring innovative methods to analyze game data, an approach that has significantly influenced the dynamics of the game. But to perform these analyses, a growing amount of data is needed, which is traditionally annotated by humans. This work proposes a 3-stage system able to automatically acquire relevant basketball game data from a broadcast video. The first stage is an object detector combined with a tracking algorithm to extract the main elements present in a basketball game video. Then, the players' visual information is analyzed to identify the players based on pixel color analysis and number recognition. Finally, a statistics generation algorithm assigns the game events to the corresponding player and team, so that the system can be used as an aid for box score annotation in major leagues, low-cost annotation in amateur games, or in-depth game video analysis.
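The pixel color analysis step can be pictured with a minimal sketch, assuming each detected player arrives as an RGB crop and that each team has one known reference jersey color. The `TEAM_COLORS` values and the `assign_team` helper are hypothetical names chosen for illustration, not the system described in the abstract.

```python
import numpy as np

# Hypothetical reference jersey colors (RGB); real values would come from the game footage.
TEAM_COLORS = {
    "home": np.array([200.0, 30.0, 30.0]),
    "away": np.array([240.0, 240.0, 240.0]),
}

def assign_team(crop, team_colors=TEAM_COLORS):
    """Return the team whose reference color is closest to the crop's mean color.

    crop: array of shape (height, width, 3) holding RGB pixel values.
    """
    # Average all pixels of the crop into a single RGB triple.
    mean_color = crop.reshape(-1, 3).mean(axis=0)
    # Pick the team with the smallest Euclidean distance in RGB space.
    return min(team_colors, key=lambda team: np.linalg.norm(mean_color - team_colors[team]))

# Synthetic crop filled with a reddish color, standing in for a detected player.
example_crop = np.full((40, 20, 3), (190, 40, 35), dtype=np.uint8)
example_team = assign_team(example_crop)
```

A practical version would probably restrict the average to the torso region and mask out court pixels, since a plain mean over the whole bounding box mixes in background color.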
ISBN (digital): 9798350330649
ISBN (print): 9798350330656
This paper presents a multifunctional quadruped robot, specifically engineered for comprehensive air quality monitoring and emergency assistance. Designed to navigate through urban environments, the robot autonomously collects and transmits real-time air quality data to a centralized database. This interdisciplinary paper integrates robotics, environmental science, and emergency response protocols to propose novel solutions for smart city infrastructure, public health, and disaster management. Specifically, this research highlights the critical role that autonomous systems play in monitoring environmental conditions and advancing safety protocols. This research leverages a comprehensive array of sensors including CO2 (MH-Z19), particle (SDS 011), GPS (ADA 746), DHT 22, MQ 9, and ADS 1115 converter, each meticulously implemented to ensure precise environmental data collection and analysis. Furthermore, the integration of cutting-edge technologies such as Robot Operating System (ROS), .NET MAUI, Python, and MariaDB establishes a robust framework for seamless operation, data processing, and secure storage within the quadruped robot system.
ISBN (digital): 9798350330649
ISBN (print): 9798350330656
This paper presents the creation of an innovative autonomous security robot designed to perform security functions with efficiency and reliability. The robot boasts mapping capabilities, which it utilizes to facilitate autonomous patrol in designated areas. Its primary operations involve the use of computer vision to detect violence, identify weapons and dangerous items, and recognize individuals. Critical incidents are met with an immediate alarm and the subsequent transmission of data to a central security server, which then generates comprehensive reports displayed through a web application for security personnel. The application itself features remote control of the robot, incident report management, status updates, and incident analytics. The robot demonstrates substantial real-world application potential, particularly in crowded environments where it could outperform conventional surveillance. The project combines concepts of engineering, computer science, and cybersecurity, functioning per design but with considerable potential for future refinement and expansion, embodying the concept of an evolving technological solution.
Tissue P systems are a class of distributed and parallel computing models inspired by inter-cellular communication and cooperation between cells. In this work, a variant of tissue P systems, named tissue P systems with look-ahead mode, is discussed as a way to decrease the inherent non-determinism of tissue P systems and to help implement them on computers. Such systems are proved to be universal by simulating register machines, and they are also proved to be able to efficiently solve computationally hard problems by means of a space-time tradeoff, which is illustrated with a polynomial-time solution to the 3-coloring problem.
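The space-time tradeoff can be illustrated with a small sequential simulation. This is a sketch under stated assumptions, not the construction from the paper: every "cell" holds a partial coloring, each step divides every cell into three, and a final check keeps the proper colorings. The number of steps grows linearly with the number of vertices while the number of cells grows as 3^n. Look-ahead mode is not modeled, and `divide` and `solve_three_coloring` are hypothetical names.

```python
from itertools import chain

def divide(cell):
    """Split one cell (a partial coloring) into three, one per color of the next vertex."""
    return [cell + (color,) for color in range(3)]

def solve_three_coloring(n_vertices, edges):
    """Generate all candidate colorings by repeated division, then check them.

    Returns the proper colorings and the population size after each step.
    """
    cells = [()]   # one initial cell with no vertex colored yet
    sizes = [1]    # number of cells after each division step
    for _ in range(n_vertices):
        # In a P system all cells would divide in the same step; here we loop over them.
        cells = list(chain.from_iterable(divide(cell) for cell in cells))
        sizes.append(len(cells))
    # Checking stage: keep the colorings where no edge joins two equal colors.
    proper = [cell for cell in cells if all(cell[u] != cell[v] for u, v in edges)]
    return proper, sizes

triangle = [(0, 1), (1, 2), (0, 2)]
triangle_colorings, triangle_sizes = solve_three_coloring(3, triangle)
```

Run on a conventional computer, the loop over cells makes the total work exponential; the polynomial running time relies on the model dividing all cells in parallel.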
ISBN (digital): 9798350330649
ISBN (print): 9798350330656
This paper presents an Intelligent Monitoring System that utilizes the Internet of Things (IoT) and Artificial Intelligence (AI) technologies to automate the class attendance process reliably and efficiently. Conventional approaches for attendance tracking have been laborious and have consumed a significant amount of time, with errors and inconsistencies being a common occurrence. In contrast, the Intelligent Monitoring System combines object detection & recognition AI models, wireless communication, and cloud monitoring to generate reliable attendance data that can be used for various purposes, such as tracking student-by-student attendance data and monitoring overall attendance statistics. The system comprises an ID reader that uses radio frequency tags, a facial recognition system that uses a camera and AI algorithms, and a cloud monitoring system for attendance statistics. The proposed system is designed to overcome the challenges of traditional attendance-taking processes and provide a solution that is accurate, reliable, and efficient.
Representation learning is a challenging but essential task in audiovisual learning. A key challenge is to generate strong cross-modal representations while still capturing discriminative information contained in unimodal features. Properly capturing this information is important to increase accuracy and robustness in audiovisual tasks. Focusing on emotion recognition, this study proposes novel cross-modal ladder networks to capture modality-specific information while building strong cross-modal representations. Our method utilizes representations from a backbone network to implement unsupervised auxiliary tasks to reconstruct intermediate layer representations across the acoustic and visual networks. The skip connections between the cross-modal encoder and decoder provide powerful modality-specific and multimodal representations for emotion recognition. On the CREMA-D corpus, our model achieves precision, recall, and F1 scores above 80% on a six-class problem.
ISBN (digital): 9798350371901
ISBN (print): 9798350371918
Ultrasonic waves provide an effective means of transmitting information through solid media, such as metal pipes and bars. However, the complex geometry of these materials causes reverberations that degrade the quality of communication. In this paper, we propose denoising approaches using time reversal and inverse filtering methods for blind deconvolution. These methods enable significant improvements in signal-to-noise ratio (SNR), with the inverse filter demonstrating superior performance in both simulated and real-world scenarios. Our results show that this approach provides a robust solution for improving ultrasonic communication in complex solid channels.
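The difference between the two equalizers can be seen in a minimal, noise-free sketch. It assumes the channel impulse response `h` is known, which is a simplification: the paper addresses the blind case, where the channel must be estimated. The channel taps, the regularization constant `eps`, and all function names are hypothetical.

```python
import numpy as np

def time_reversal_filter(h):
    """Time-reversal (matched) filter: the channel impulse response played backwards."""
    return h[::-1]

def inverse_filter_response(h, n_fft, eps=1e-3):
    """Regularized frequency-domain inverse: conj(H) / (|H|^2 + eps)."""
    H = np.fft.fft(h, n_fft)
    return np.conj(H) / (np.abs(H) ** 2 + eps)

rng = np.random.default_rng(0)
h = np.array([1.0, 0.0, 0.6, 0.0, 0.3])     # hypothetical reverberant channel
symbols = rng.choice([-1.0, 1.0], size=64)  # transmitted binary symbols
received = np.convolve(symbols, h)          # channel output, no noise added

# Time reversal: the cascade channel + filter equals the autocorrelation of h,
# which peaks at lag len(h) - 1 but keeps side lobes (residual reverberation).
tr_out = np.convolve(received, time_reversal_filter(h))
tr_est = tr_out[len(h) - 1 : len(h) - 1 + len(symbols)] / np.sum(h ** 2)

# Inverse filter: divide out the channel in the frequency domain.
n_fft = 128  # at least len(received), so the FFT product matches linear convolution
G = inverse_filter_response(h, n_fft)
inv_est = np.real(np.fft.ifft(np.fft.fft(received, n_fft) * G))[: len(symbols)]

tr_mse = np.mean((tr_est - symbols) ** 2)
inv_mse = np.mean((inv_est - symbols) ** 2)
```

In this toy setting the inverse filter removes the echoes almost entirely, while time reversal only concentrates energy at the main peak. With measurement noise the regularization constant matters, because a plain inverse amplifies noise at frequencies where the channel response is weak.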
A dual-band polarization-diversity omnidirectional glass antenna is presented in this paper. A stepped glass dielectric resonator (DR) is fed by a probe and dual loops for vertical and horizontal polarizations, respec...