Few-shot video object segmentation(FSVOS) aims to segment a specific object throughout a video sequence when only the first-frame annotation is given. In this study, we develop a fast target-aware learning approach fo...
详细信息
Few-shot video object segmentation(FSVOS) aims to segment a specific object throughout a video sequence when only the first-frame annotation is given. In this study, we develop a fast target-aware learning approach for FSVOS, where the proposed approach adapts to new video sequences from its firstframe annotation through a lightweight procedure. The proposed network comprises two models. First, the meta knowledge model learns the general semantic features for the input video image and up-samples the coarse predicted mask to the original image size. Second, the target model adapts quickly from the limited support set. Concretely, during the online inference for testing the video, we first employ fast optimization techniques to train a powerful target model by minimizing the segmentation error in the first frame and then use it to predict the subsequent frames. During the offline training, we use a bilevel-optimization strategy to mimic the full testing procedure to train the meta knowledge model across multiple video *** proposed method is trained only on an individual public video object segmentation(VOS) benchmark without additional training sets and compared favorably with state-of-the-art methods on DAVIS-2017, with a J &F overall score of 71.6%, and on YouT ubeVOS-2018, with a J &F overall score of 75.4%. Meanwhile,a high inference speed of approximately 0.13 s per frame is maintained.
Educational teaching apps are primarily available in app stores to educate students in various contexts. Lack of educational resources, physical and mental health conditions, and poverty cause some students to skip sc...
Educational teaching apps are primarily available in app stores to educate students in various contexts. Lack of educational resources, physical and mental health conditions, and poverty cause some students to skip school and move on to the next school grade without completing the course content of the previous grade. Most of the available apps focus on specific content to cover. The Smart Primary Education Tutor (SPET) teaching app specifically focuses on the missed content by analyzing their knowledge gap and providing lessons to cover the missed content. The main objective of SPET is to develop a methodology to identify the gap in student knowledge and fill the knowledge gap by teaching using smart techniques. SPET is determined to identify students’ interactions (attention, emotions) with the system to identify students’ ability to use the learning tool, identifying gaps in students’ knowledge levels compared to their actual grades using activities and voice-based technologies, teaching to cover the knowledge gap by providing engaging activities and lessons and evaluating students by conducting a final assessment and analyze students’ knowledge and performance obtained through the system. Students between the ages of 5 and 8 are targeted in the community to apply. The solution embeds deep learning-based models including attention classification models using head posture estimation, facial expression recognition, and eye gaze estimation, speech recognition models to identify provided verbal answers, handwriting recognition models to evaluate student performance, and smart teaching. The child emotion recognition model achieved 93% accuracy. The Attention span evaluation model achieved 85% accuracy. The handwritten numerical and English character data recognition model which detects answers for the final assessment paper achieved 85% percent of accuracy.
Failures of Induction Motors (IMs) can lead to unscheduled downtime and interruption in industry processes. This paper concentrates on the detection of the stator’s inter-turn faults which are one of the most frequen...
详细信息
ISBN:
(数字)9798350371628
ISBN:
(纸本)9798350371635
Failures of Induction Motors (IMs) can lead to unscheduled downtime and interruption in industry processes. This paper concentrates on the detection of the stator’s inter-turn faults which are one of the most frequent causes of failures in IMs. The proposed detection method is based on a similarity index that uses the current waveform. To be more specific, the proposed algorithm presents a full-cycle sliding-window-based index based on cosine similarity that only uses current signals for detection of the stator’s inter-turn faults. The proposed index cuts the phase difference before/after the disturbance and, as a result, it only depends on the size variations of the current waveform. The proposed method is technically unaffected by non-fault transient conditions including voltage imbalance, voltage sag, voltage swell, and heavy load changes. The performance of the proposed method is validated with numerous simulated scenarios and has good accuracy and speed of convergence.
Cancer survival prediction is pivotal in tailoring individualized treatment strategies and guiding clinician decision-making. Yet, existing methodologies grapple with efficiently harnessing the intricate distribution ...
Cancer survival prediction is pivotal in tailoring individualized treatment strategies and guiding clinician decision-making. Yet, existing methodologies grapple with efficiently harnessing the intricate distribution of medical data spanning various modalities. In response, we present SAMMS, an advanced multi-omics multimodal deep learning framework tailored for survival prediction. SAMMS leverages the robust image segmentation model, "Segment Anything" to adeptly characterize pathological images. This prowess is further enhanced by integrating multi-omics data and clinical insights, facilitating holistic modeling across a diverse modal spectrum. The framework weaves a modality-specific subnetwork with a cross-modality common subnetwork, meticulously capturing intra-modality nuances and inter-modality correlations. SAMMS eclipsed its contemporaries by delivering remarkable performance on TCGA’s LGG and KIRC tumor datasets. A battery of analyses underscored SAMMS’s unparalleled capability to distill multifaceted insights from multimodal datasets, yielding richer and more integrative multimodal representations. Such strides promise significant advancements in cancer survival analytics, bolstering the precision and efficacy of patient-centric treatments, disease oversight, and clinical decision processes.
A model for evaluating the availability of a fault-tolerant cluster with a constraint on the maximum query service time is proposed. Node recovery in the cluster involves replacing the entire computational node with t...
A model for evaluating the availability of a fault-tolerant cluster with a constraint on the maximum query service time is proposed. Node recovery in the cluster involves replacing the entire computational node with the latest backup results before the failure. Information update after the backup is achieved through the replication of this information from the working nodes of the cluster. The purpose of the article is to assess the reliability of the cluster and its readiness to timely service requests critical to the waiting time. Restoring up-to-date data in a cluster node after previously entering the data saved during backup into it requires the use of resources of the node in which the data is being restored and the node containing up-to-date data. The specified pair of nodes is excluded from the request servicing process, which reduces the likelihood of timely maintenance. Information recovery in case of failures of all nodes containing replicas of up-to-date data requires more time compared to replication from the memory of a functioning node.
Compromised air quality on a construction site could lead to significant adverse respiratory impacts on workers. This paper introduces an alert sensing methodology to develop practical health and wellbeing requirement...
详细信息
The explosive growth of pervasive and diverse digital products demands the importance of addressing emotional design within the sphere of product development. As a result, there has been a significant focus from both ...
The explosive growth of pervasive and diverse digital products demands the importance of addressing emotional design within the sphere of product development. As a result, there has been a significant focus from both industry and academia on studying consumer perspectives related to a wide range of pervasive and digital products, adopting a variety of methods, tools, and resources to delve into user emotional experiences. However, a comprehensive study focusing on users' emotions, incorporating a precisely defined semantic or lexicon tailored for this type of user interface (UI) product design remains elusive. Therefore, this study aims to synthesize and validate a comprehensive array of emotional keywords applicable for pervasive and digital products through an extensive literature search and a rigorous validation procedure involving language experts. The result is a meticulously refined set of 175 keywords that will serve as a valuable point of reference for all endeavors associated with evaluating and configuring the emotional structure of pervasive and digital product UI experiences. The study's findings have the potential to provide valuable guidance to industry practitioners and academic researchers, contributing to the improvement of UI design landscape and the development of new systems by enhancing their insight into users' emotional experiences through the utilization of pertinent emotional keywords in a wide range of digital and pervasive UI product designs.
The article proposes models of the structural reliability of a multipath routing network with the possibility of its reconfiguration when switching path segments. The models are focused on optimizing the placement of ...
The article proposes models of the structural reliability of a multipath routing network with the possibility of its reconfiguration when switching path segments. The models are focused on optimizing the placement of segment switching nodes, aimed at achieving maximum network performance with the option of connecting request sources to the available communication paths with servers. The study considers the options for connecting each source of requests with one of the paths or with all paths for connecting to servers. The paper shows the existence of an optimal placement of path switching nodes, in which the maximum reliability of the system is achieved.
With society’s increasing data production and the corresponding demand for systems that are capable of utilizing them, the big data domain has gained significant importance. However, besides the systems’ actual impl...
With society’s increasing data production and the corresponding demand for systems that are capable of utilizing them, the big data domain has gained significant importance. However, besides the systems’ actual implementation, their testing also needs to be considered. For this, oftentimes, proper test data sets are necessary. This publication discusses several different approaches how these can be provisioned and, further, highlights the respective advantages, disadvantages, and suitable application scenarios. In doing so, researchers and practitioners that are implementing big data applications and need to test them, or who are generally interested in the domain, are supported in their own considerations on how to obtain test data.
Data compression is a trending field that is used in data storage and data transmission systems. Lossy compression means that data cannot be completely retrieved while in lossless compression the compressed data must ...
详细信息
ISBN:
(纸本)9781665482387
Data compression is a trending field that is used in data storage and data transmission systems. Lossy compression means that data cannot be completely retrieved while in lossless compression the compressed data must be reconstructed exactly. Lossless data compression is used in compressing binary files, telemetry data and high-fidelity medical and scientific images where details are crucial. There is no generic compression algorithm that gives best compression ratio on all data pattern. In this paper, we propose a hybrid lossless hardware architecture that compresses most of data patterns such as repeated data, Gaussian distribution data and images. A profiling-before-compressing and then choosing the right compression hardware is proposed. The proposed design is a highly parallelized architecture that can compress/decompress 64 bytes/cycle with minor overhead. Moreover, it provides high compression ratio on small block sizes as well as large ones.
暂无评论