Vertically Aligned LCD is one of the most widely used optical modes in LCD TV and mobile devices. It is also a very important category of passive LC displays used in Automobiles. It has the advantages of perfect dark ...
详细信息
Speech is a fundamental means of human interaction. Speaker Identification (SI) plays a crucial role in various applications, such as authentication systems, forensic investigation, and personal voice assistance. Howe...
详细信息
Speech is a fundamental means of human interaction. Speaker Identification (SI) plays a crucial role in various applications, such as authentication systems, forensic investigation, and personal voice assistance. However, achieving robust and secure SI in both open and closed environments remains challenging. To address this issue, researchers have explored new techniques that enable computers to better understand and interact with humans. Smart systems leverage Artificial Neural Networks (ANNs) to mimic the human brain in identifying speakers. However, speech signals often suffer from interference, leading to signal degradation. The performance of a Speaker Identification System (SIS) is influenced by various environmental factors, such as noise and reverberation in open and closed environments, respectively. This research paper is concerned with the investigation of SI using Mel-Frequency Cepstral Coefficients (MFCCs) and polynomial coefficients, with an ANN serving as the classifier. To tackle the challenges posed by environmental interference, we propose a novel approach that depends on symmetric comb filters for modeling. In closed environments, we study the effect of reverberation on speech signals, as it occurs due to multiple reflections. To address this issue, we model the reverberation effect with comb filters. We explore different domains, including time, Discrete Wavelet Transform (DWT), Discrete Cosine Transform (DCT), and Discrete Sine Transform (DST) domains for feature extraction to determine the best combination for SI in case of reverberation environments. Simulation results reveal that DWT outperforms other transforms, leading to a recognition rate of 93.75% at a Signal-to-Noise Ratio (SNR) of 15 dB. Additionally, we investigate the concept of cancelable SI to ensure user privacy, while maintaining high recognition rates. Our simulation results show a recognition rate of 97.5% at 0 dB using features extracted from speech signals and their DCTs. Fo
One of the most important problems in computer vision and remote sensing is object detection, which identifies particular categories of diverse things in pictures. Two crucial data sources for public security are the ...
详细信息
Student success models might be prone to develop weak spots, i.e., examples hard to accurately classify due to insufficient representation during model creation. This weakness is one of the main factors undermining us...
详细信息
Traffic sign recognition plays a pivotal role in modern intelligent transportation systems, contributing significantly to traffic management and road safety. This thesis presents a comprehensive investigation into the...
详细信息
Mental-image based spatiotemporal (4D) language understanding in human was considered from the viewpoint of cognitive science and simulated based on the mental image model proposed in mental image directed semantic th...
详细信息
This paper aims at studying the ability of deep machine learning to predict software faults based on object-oriented metrics. This research investigated software faults from the perspective of fault-proneness, faults ...
详细信息
In skeleton-based action recognition, graph convolutional networks (GCN) have been applied to extract features based on the dynamic of the human body and the method has achieved excellent results recently. However, GC...
详细信息
Analytical studies of network epidemiology almost exclusively focus on the extreme situations where the timescales of network dynamics are well separated (longer or shorter) from that of epidemic propagation. In reali...
详细信息
Analytical studies of network epidemiology almost exclusively focus on the extreme situations where the timescales of network dynamics are well separated (longer or shorter) from that of epidemic propagation. In realistic scenarios, however, these timescales could be similar, which has profound implications for epidemic modeling (e.g., one can no longer reduce the dimensionality of epidemic models). Combining Monte Carlo simulations and mean-field theory, we analyze the critical behavior of susceptible-infected-susceptible epidemics in the vicinity of the critical threshold on the activity-driven model of temporal networks. We find that the persistence of links in the network causes the threshold to decrease as the recovery rate increases. Dynamic correlations (coming from being close to infected nodes increases the likelihood of infection) drive the threshold in the opposite direction. These two counteracting effects make epidemic criticality in temporal networks a remarkably complex phenomenon.
Traditional dental plaque detection relies on medical staining reagents and professional intervention. Deep learning-based automatic staining-free dental plaque segmentation provides an alternative for patients to per...
详细信息
ISBN:
(纸本)9798350386226
Traditional dental plaque detection relies on medical staining reagents and professional intervention. Deep learning-based automatic staining-free dental plaque segmentation provides an alternative for patients to perform plaque detection at home without staining reagents. However, existing methods still struggle with low-contrast visual features between unstained plaque and healthy teeth. To address this, we propose a Frequency-Guided Network (FGN) for low-contrast staining-free dental plaque segmentation. We observe that dental plaque tends to concentrate specifically near the junction between the teeth and the gingiva. This junction demonstrates abrupt changes in pixel values, indicating high-frequency regions in the image. In other words, dental plaque tends to appear near the high-frequency regions of oral endoscope images. Exploiting this characteristic, we employ a frequency-guided decoupling module to separate the image into high-frequency and low-frequency regions automatically and expand the high-frequency region to encompass nearby potential dental plaque. Then we supervise two regions individually to specifically focus on the expended high-frequency region for localizing nearby dental plaque. Additionally, we propose a high-to-low frequency multiple tasks framework. In the first phase, the network segments the teeth region, and then we input the teeth mask into the second phase. In the second stage, the teeth mask allows us to have a higher frequency at the junction between the teeth and gums, thereby enhancing the effectiveness of frequency-guided decoupling. Furthermore, FGN integrates a frequency-driven refinement module to enhance the guidance quality of the teeth mask for the second phase. Extensive evaluations of the oral endoscope dataset demonstrate that our method outperforms existing high-performance segmentation methods. User studies also confirm that our approach achieves superior results to experienced dentists. https://frequency-guided-netw
暂无评论