Compressed sensing (CS) is an emerging signal acquisition theory that directly collects signals in a compressed form if they are sparse on some certain basis. This paper focuses on the realization of CS on speech sign...
详细信息
Compressed sensing (CS) is an emerging signal acquisition theory that directly collects signals in a compressed form if they are sparse on some certain basis. This paper focuses on the realization of CS on speech signals. Observing that different kind speech frames have different intra-frame correlations, we propose a frame-based adaptive compressed sensing framework for speech signals, which applies adaptive projection matrix. Experimental results show significant improvement of speech reconstruction quality by using such adaptive approach against using traditional non-adaptive projection matrix.
With the development of surveillance cameras, more communication resources are used to transmit surveillance videos. However, some previous video compression methods have a fixed compression standard and are not effec...
详细信息
ISBN:
(数字)9798350378412
ISBN:
(纸本)9798350378429
With the development of surveillance cameras, more communication resources are used to transmit surveillance videos. However, some previous video compression methods have a fixed compression standard and are not effective in unknown scenarios. To address this problem, we propose a Task-driven Semantic communication with Unsupervised Semantic Segmentation (TSCUSS) for surveillance video, in order to improve transmission efficiency. First, at the transmitter, we segment the video into semantic foreground and fixed background model. Second, in transmission, real-time transmission is performed for the semantic foreground and single transmission is performed for the background model. Third, at the receiver, the semantic foreground and background model are seamlessly merged to fully reproduce the original semantic content. Finally, experiments show that our proposed method achieves an average bandwidth saving of $78.34 \%$ on both CDnet 2014 and ABODA datasets, and our semantic foreground segmentation dominates in terms of accuracy and structural similarity of foreground-background fusion.
As networks have grown in size and complexity, the existing isolated add-on security systems of internet are confronting some serious challenges. This paper outlines the background, concept, evolution and basic proper...
详细信息
As networks have grown in size and complexity, the existing isolated add-on security systems of internet are confronting some serious challenges. This paper outlines the background, concept, evolution and basic properties of the trustworthy network, and projects the future development of the network in terms of its access control, modeling of trust relationship, quantification and decisionmaking. Then put forward a strengthen TNC to fix the deficiencies.
Single-photon interference is the essential key for the recently well-known twin-field quantum key distribution (TF-QKD) to break the linear rate-distance limit and requires beams with identical polarizations. Inspire...
详细信息
Single-photon interference is the essential key for the recently well-known twin-field quantum key distribution (TF-QKD) to break the linear rate-distance limit and requires beams with identical polarizations. Inspired by this fact and aiming to improve the secret key rate, we propose a hybrid high-dimensional QKD named the interfering-or-not-interfering QKD (INI-QKD), in which both the polarization and phase degrees of freedom (DOFs) are adopted as information carriers. The protocol's security proof is analyzed based on entanglement distillation and three defined effective events, X1, X2, and X3. The simulation shows that the X1 event, from which only the phase information is extracted, exceeds TF-QKD's variants, while the X2 and X3 events, in which both types of information are decoded, can achieve twice the secret key rate as measurement-device-independent QKD (MDI-QKD). It is also proven that by adding the polarization DOF, INI-QKD obtains more resistance to phase mismatch than TF-QKD. Remarkably, these can all be achieved by simply altering the TF-QKD's measurement setup to that of MDI-QKD.
Multi-modal semantic communication has attracted great attention due to its broad application prospects. However, the existing multi-modal semantic communications mostly focus on task-oriented approaches, which ignore...
Multi-modal semantic communication has attracted great attention due to its broad application prospects. However, the existing multi-modal semantic communications mostly focus on task-oriented approaches, which ignore the correlation among multi-modal data, leading to a decrease in the robustness. In this paper, we propose a deep learning enabled semantic communication system with cross-modal alignment, called CA_DeepSC, which effectively utilizes the correlation across multi-modal signals to enhance the robustness of transmission. Firstly, we train the semantic encoder at the transmitter to learn the relationship of cross-modal alignment at the semantic level. Meanwhile, the cross-modal alignment allows to modify the errors caused by semantic or physical noise. Secondly, we propose a novel cross-modal amendment scheme that dynamically assigns weights to auxiliary multi-modal semantic information based on their correlation levels, and integrates modal semantic information with auxiliary modal semantic information at the receiver, optimizing the performance on recovery. Finally, experimental results demonstrate that CA_DeepSC effectively reduces semantic distortion caused by semantic and physical noise, thereby improving the quality and robustness in the multi-modal semantic communication.
In 6G communication, semantic communication is considered one of the most promising directions to fulfill users’ demands for immersive multi-modal experiences, low latency, and high reliability. We proposes a cross-m...
In 6G communication, semantic communication is considered one of the most promising directions to fulfill users’ demands for immersive multi-modal experiences, low latency, and high reliability. We proposes a cross-modal semantic communication approach based on deep learning, where both semantic coding and decoding are carefully crafted to provide optimum performance. Firstly, cross-modal semantic fusion is designed to enable end-to-end data transmission, driven by various task requirements of multi-modal business users. In addition, the proposed approach for evaluation on the semantic similarity is highly effective. It consists of a siamese network and a pseudo-siamese network, which can accurately obtain the matching loss between modal contents. Finally, the simulation results show that the proposed cross-modal semantic communication approach outperforms traditional communication systems, especially in low SNR scenarios. The similarity of cross-modal semantic communication improves by more than 53% compared to the traditional approaches, demonstrating its superiority and feasibility. Overall, our solution can meet the increasing demands of modern communication and facilitate seamless and intuitive experiences for users.
With the development of next-generation mobile communication, semantic communication is considered as a novel technology that focuses on conveying the meaning of a message to the intended users. The key of semantic co...
With the development of next-generation mobile communication, semantic communication is considered as a novel technology that focuses on conveying the meaning of a message to the intended users. The key of semantic communication is that the receiver and sender should have the similar background knowledge, hence the semantic knowledge base is the corner-stone of semantic communication. However, how to select an appropriate content as the background knowledge from huge amount of knowledge base data is a challenge in realizing a semantic communication system. Caching partial semantic knowledge bases at edge servers in advance is a promising approach to overcome the above issue. In this paper, we propose a novel semantic knowledge base deployment architecture, in which the specific content of the semantic knowledge base cached at the edge server is determined by the user’s personal preferences, fact relevance, and the sender-receiver matching degree. The semantic knowledge base deployment architecture is affected by conditions such as latency and storage capacity of the edge server. Based on preference-based reinforcement learning (PbRL), we propose a policy ranking algorithm to drive the replacement of the categorical cache for semantic knowledge bases to maximize the hit ratio and the sender-receiver matching degree under the constraints of latency and caching capacity. Simulation results show that our algorithm can obtain higher knowledge base hit ratio and matching degree.
In recent years, volunteer service in China has made an unprecedented development, but the credibility and traceability of volunteer service time management are confronted with serious challenges. Sin
ISBN:
(纸本)9781467389808
In recent years, volunteer service in China has made an unprecedented development, but the credibility and traceability of volunteer service time management are confronted with serious challenges. Sin
In human-computer interaction, we need to use human's multiple sensory channels and motion channels to improve its naturalness and efficiency. These channels include voice, handwriting, posture, vision, expression...
详细信息
In human-computer interaction, we need to use human's multiple sensory channels and motion channels to improve its naturalness and efficiency. These channels include voice, handwriting, posture, vision, expression, etc. Visual-based human-computer interaction systems are often mentioned in the literature. The system uses a camera to monitor and track the operator's posture, and sets corresponding commands to control the movement of the robot. On top of that, we add tactile feedback to allow the robot to accomplish more tasks. In the vision system, first we obtain color images and depth images through a depth camera. Secondly, we estimate the human posture through OpenPose. Finally, we map the motion of the human arm to the motion of the robotic arm. In the tactile system, first we control the movement of the magic hand through the exoskeleton. Secondly, we obtain the force values of the fingers through the magic hand. Finally, we provide force feedback to each finger through the exoskeleton.
To solve the problem of rapid QoS declining under network situation as well as improve the judging and dynamical adjusting ability that network system towards user QoS, in this paper, we proposed a dynamical self-conf...
详细信息
暂无评论