Over the past decade, social media platforms have been key in spreading rumors, leading to significant negative impacts. To counter this, the community has developed various Rumor Detection (RD) algorithms to automati...
详细信息
In the heavy clutter environment, the information capacity is large,the relationships among information are complicated, and track initiationoften has a high false alarm rate or missing alarm rate. Obviously, it is ad...
详细信息
In the heavy clutter environment, the information capacity is large,the relationships among information are complicated, and track initiationoften has a high false alarm rate or missing alarm rate. Obviously, it is adifficult task to get a high-quality track initiation in the limited measurementcycles. This paper studies the multi-target track initiation in heavy *** first, a relaxed logic-based clutter filter algorithm is presented. In thealgorithm, the raw measurement is filtered by using the relaxed logic *** not only design a kind of incremental and adaptive filtering gate, but alsoadd the angle extrapolation based on polynomial extrapolation. The algorithm eliminates most of the clutter and obtains the environment with highdetection rate and less clutter. Then, we propose a fuzzy sequential Houghtransform-based track initiation algorithm. The algorithm establishes a newmeshing rule according to system noise to balance the relationship between thegrid granularity and the track initiation quality. And a flexible superpositionmatrix based on fuzzy clustering is constructed, which avoids the transformation error caused by 0–1 voting method in traditional Hough *** addition, the algorithm allows the superposition matrixes of nonadjacentcycles to be associated to overcome the shortcoming that the track can’t beinitiated in time when the measurements appear in an intermittent way. Anda slope verification method is introduced to detect formation-intensive serialtracks. Last, the sliding window method is employed to feedback the trackinitiation results timely and confirm the track. Simulation results verify thatthe proposed algorithms can initiate the tracks accurately in heavy clutter.
The worldwide spread of COVID-19 has made a severe impact on human health and life. It has shown rapid propagation, long in vitro survival, and a long incubation period. More seriously, COVID-19 is more susceptible to...
详细信息
This paper studies semantic segmentation primarily under image-level weak-supervision. Most stateof-the-art technologies have recently used deep classification networks to create small and sparse discriminatory seed r...
详细信息
This paper studies semantic segmentation primarily under image-level weak-supervision. Most stateof-the-art technologies have recently used deep classification networks to create small and sparse discriminatory seed regions of each interest target as pseudo-labels for training segmentation networks, which achieve inferior performance compared with the fully supervised *** propose a Dilated convolutional pixels affinity network(DCPAN) to localize and expand the seed regions of objects to bridge this gap. Although introduced dilated convolutional units enable capture of additional location information of objects, it falsely highlighted true negative regions as dilated rate enlarge. To address this problem,we properly integrate dilated convolutional units with different dilated rates and self-attention mechanisms to obtain pixel affinity measure matrix for promoting classification network to generate high-quality object seed regions as pseudo-labels; thus, the performance of the segmentation network is boosted. Furthermore,although our approach seems simple, our method obtains a competitive performance, and experiments show that the performance of DCPAN outperforms other state-ofart approaches in weakly-supervised settings, which only use image-level labels on the Pascal VOC 2012 dataset.
In the context of industrial automation and intelligent monitoring, accurate identification of the industrial environment is crucial to risk management. However, traditional scene classification methods struggle to ca...
详细信息
With the complexity of the functions of modern buildings, the problem of vertical traffic in buildings is becoming more and more prominent. As the only vertical transportation, the elevator is a necessary prerequisite...
With the complexity of the functions of modern buildings, the problem of vertical traffic in buildings is becoming more and more prominent. As the only vertical transportation, the elevator is a necessary prerequisite for the development of modern buildings to solve its group control distribution problem. In this paper, a vertical traffic scheduling control method is proposed based on dual fuzzy neural network, a recognition strategy is designed to identify the passenger flow models. The dual fuzzy neural network is used to first identify the traffic network model in which the elevator is located, and then combine the corresponding weights and the confidence of the group control allocation to complete the group control scheduling. Finally, the proposed method is verified through a semi-physical simulation platform, proving the correctness and effectivness for traffic pattern recognition and scheduling strategy optimization of group controllers.
Zero-shot Chain-of-Thought (CoT) prompting emerges as a simple and effective strategy for enhancing the performance of large language models (LLMs) in real-world reasoning tasks. Nonetheless, the efficacy of a singula...
The evolution of advanced artificial intelligence generated content approaches has heightened concerns about deepfake, due to the sophisticated forgeries and concealed appearances they produce. To this end, the pre-tr...
详细信息
The evolution of advanced artificial intelligence generated content approaches has heightened concerns about deepfake, due to the sophisticated forgeries and concealed appearances they produce. To this end, the pre-trained Vision Transformer (ViT) model has become a de facto choice for deepfake detection, thanks to its powerful learning capability. Despite favorable results achieved by existing ViT-based methods, they have inherent limitations that could result in suboptimal performance in scenarios with continuously evolving forgery techniques, such as overfitting to single forgery patterns or placing excessive emphasis on dominant forgery regions. In this paper, we propose CUTA, a simple yet effective deepfake detection paradigm that utilizes ViT adapters as the medium and fully exploits the spatial- and frequency-domain features of given images to overcome the limitations of existing methods. Specifically, CUTA focuses on frequency domain masking within the input space, which obscures parts of the high-frequency image to intensify the training challenge while preserving subtle forgery cues in the frequency domain to facilitate comprehensive forgery representations. Furthermore, we propose two task-customized modules within the ViT model, i.e., the texture enhancement module and the multi-scale perceptron module, to seamlessly integrate local texture and rich contextual features. These two modules ensure an organic interaction between the task-specific forgery patterns and general semantic features within the pre-trained ViT framework. The experimental results on several publicly available benchmark datasets demonstrate CUTA’s superiority in performance, particularly showcasing its significant advantages in both cross-dataset and cross-manipulation scenarios. 2005-2012 IEEE.
Gaze input is a popular hands-free input method that allows for intuitive and rapid pointing but lacks a confirmation mechanism. This study introduces GazePuffer, an interaction method that combines puffing cheeks wit...
Gaze input is a popular hands-free input method that allows for intuitive and rapid pointing but lacks a confirmation mechanism. This study introduces GazePuffer, an interaction method that combines puffing cheeks with gaze. We explored the design space of mouth gestures, proposed a set of candidate gestures, filtered them through user subjective evaluation, and selected five basic gestures and four variations. We determined the corresponding virtual reality (VR) actions for these gestures through brainstorming. We achieved an accuracy of 93.8% in recognizing the five basic mouth gestures using the built-in sensors of the head-mounted display devices. We compared GazePuffer with two baseline methods in target selection tasks, demonstrating that GazePuffer is on par with Gaze&Pinch in throughput and speed, slightly outperforming Gaze&Dwell. Finally, we showcased the applicability of GazePuffer in real VR interaction tasks, with users generally finding it usable and effortless.
Large-scale pre-trained Vision Transformer (ViT) models have demonstrated remarkable performance on visual tasks but are computationally expensive to transfer to downstream tasks. Parameter-Efficient Fine-Tuning (PEFT...
详细信息
暂无评论