The discovery of frequent generators of high utility itemsets (FGHUIs) holds great importance as they provide concise representations of frequent high utility itemsets (FHUIs). FGHUIs are crucial for generating nonred...
详细信息
As extended reality becomes more ubiquitous, people will more frequently interact with computer systems using gestures instead of peripheral devices. However, previous works have shown that using traditional gestures ...
详细信息
ISBN:
(纸本)9783031355950;9783031355967
As extended reality becomes more ubiquitous, people will more frequently interact with computer systems using gestures instead of peripheral devices. However, previous works have shown that using traditional gestures (pointing, swiping, etc.) in mid-air causes fatigue, rendering them largely unsuitable or long-term use. Some of the same researchers have promoted "microgestures"-smaller gestures requiring less gross motion-as a solution, but to date there is no dataset of intentional microgestures available to train computer vision algorithms for use in downstream interactions with computer systems such as agents deployed on XR headsets. As a step toward addressing this challenge, we present a novel video dataset of microgestures, classification results from a variety of ML models showcasing the feasibility (and difficulty) of detecting these fine-grained movements, present a demonstration of a novel keyframe detection method as a way to increase recognition accuracy, and discuss the challenges in developing robust recognition of microgestures for human-computer interaction.
Various essential functions in living organisms are performed by binding of proteins with other molecules (ligands). Proper detection and analysis of ligand binding locations (cavities) leads towards the success of th...
详细信息
Generating music-related notations offers assistance for musicians in the path of replicating the music using a specific instrument. In this paper, we evaluate the state-of-the-art guitar tablature transcription netwo...
详细信息
ISBN:
(纸本)9783031500688;9783031500695
Generating music-related notations offers assistance for musicians in the path of replicating the music using a specific instrument. In this paper, we evaluate the state-of-the-art guitar tablature transcription network named TabCNN against state-of-the-art computer vision networks. The evaluation is performed using the same dataset as well as the same evaluation metrics of TabCNN. Furthermore, we propose a new CNN-based network named TabInception to transcribe guitar-related notations, also called guitar tablatures. The network relies on a custom inception block converged by dense layers. The TabInception network outperforms the TabCNN in terms of multi-pitch precision (MP), tablature precision (TP), and tablature F-measure (TF). Moreover, the Swin Transformer achieves the best score in terms of multi-pitch recall (MR) and tablature recall (TR), while the Vision Transformer achieves the best score in terms of multi-pitch F-measure (MF). Motivated by the previous insights, we train the networks with more epochs and propose another network named Inception Transformer (InT) to surpass all the estimation metrics of TabCNN using a single network. The InT network relies on an inception block converged by a Transformer Encoder. The TabInception and the InT network outperformed all estimation metrics of TabCNN except the tablature disambiguation rate (TDR) when trained using a bigger epoch size.
In Opportunistic Network based mobile data diversion algorithms, the need for multi-hop transmission makes the selection of the next hop node critical. The traditional Prophet algorithm calculates the encounter delive...
详细信息
Computed tomography (CT) reconstruction faces difficulties in dealing with artifacts caused by imperfect imaging processes. Deep learning-based CT reconstruction models have been proposed to address these challenges, ...
详细信息
This paper introduces a novel crowdsourcing worker selection algorithm, enhancing annotation quality and reducing costs. Unlike previous studies targeting simpler tasks, this study contends with the complexities of la...
详细信息
Group signatures allow users to sign messages on behalf of the group without prevealing their identities. However, the opening authority can trace signatures back to their source, raising concerns about privacy. To ad...
详细信息
Accurately bounding the worst-case execution time (WCET) is crucial for efficient real-time system design. Precisely analyzing whether a memory reference results in a cache miss or a cache hit significantly impac...
详细信息
Visual localization is a challenging task involving precise camera position and orientation estimation from an image. Specifically, existing panorama datasets suffer from a limitation in the number of available omnidi...
详细信息
暂无评论