The state-of-the-art face recognition systems frequently rely on training with large image dataset acquired from several users on a single device. However, these databases usually contain sensitive information, which ...
详细信息
ISBN:
(数字)9798350375022
ISBN:
(纸本)9798350375039
The state-of-the-art face recognition systems frequently rely on training with large image dataset acquired from several users on a single device. However, these databases usually contain sensitive information, which raises substantial privacy concerns. To address this issue, we propose the use of federated learning for a supervised face recognition. Federated learning enables model training while maintaining privacy by leveraging decentralized edge devices. In our method, each edge device trains a model individually, which is subsequently sent to a secure aggregator. We use GAN to introduce different data without sending actual data. The aggregator combines various models into a global one, which is subsequently disseminated to the edge devices. Experiments with CelebA datasets show that federated learning effectively protects privacy while maintaining performance levels.
In terms of mortality rates, gastric cancer is second only to lung cancer. Manual gastric slice pathology examination is labor-intensive and prone to observer bias. Endoscopy of the upper digestive tract is commonly u...
详细信息
Mobile robots deployed in the homes of disabled people and older adults can play a vital role as companions and aid in activities of daily living, enhancing people's quality of life. For these robots to effectivel...
详细信息
In recent years, the deep learning method has gained widespread popularity in hyperspectral image classification (HSI) tasks because of remarkable feature extraction capability. However, conventional deep learning app...
In recent years, the deep learning method has gained widespread popularity in hyperspectral image classification (HSI) tasks because of remarkable feature extraction capability. However, conventional deep learning approaches often yield suboptimal classification performance due to the limited discrimination ability of the extracted features. Moreover, the challenge of obtaining high classification performance with a limited number of samples in hyperspectral images (HSIs) has emerged as a prominent research area of interest. To address this concern, this letter provides a novel 3D convolution attention network (3D-CAN) for feature extraction and HSIs classification. The attention mechanism employed in this context enables capturing both the long-range spatial dependencies of HSIs in the horizontal and vertical directions and the varying significance levels across distinct spectral channels. The experiments are executed on both the new data sets and the publicly accessible benchmark data sets to expose the efficiency and durability of the proposed model. Empirically, it has been demonstrated that the presented model succeeds over the remaining state-of-the-art approaches in terms of classification accuracy.
Multi-scale networks rely heavily on the aggregation process to aggregate feature information into other feature maps. However, it has disadvantages in aggregating information, where the localization information becom...
Multi-scale networks rely heavily on the aggregation process to aggregate feature information into other feature maps. However, it has disadvantages in aggregating information, where the localization information becomes inconsistent. An Enhancing Path Aggregation Network through concatenation and attention mechanism is proposed here to improve the aggregation path for generating the feature maps in multi-scale network. This approach helps to keep information consistently during the process of aggregation. The proposed model is evaluated using the COCO dataset, which achieves significant improvements on several metrics.
Rate control is one of the promising approaches to mitigate channel congestion in wireless vehicular networks once resource allocation-based techniques are overloaded due to too many requests. Recently, state-of-the-a...
Rate control is one of the promising approaches to mitigate channel congestion in wireless vehicular networks once resource allocation-based techniques are overloaded due to too many requests. Recently, state-of-the-art rate control methods have been proposed to mitigate channel congestion by reducing the data rate from low-risk vehicles. However, there is a lack of fairness guarantees during the risk assessment. As a result, if many vehicles are at the same risk level, there are possibilities that several of them may not be able to access the wireless channels to transmit messages on time due to the long queue of channel occupation requests. In this work, we present a novel fairness risk-based transmission control model, namely F-RTC, to automatically adjust the message broadcasting rate of a group of vehicles under channel congestion. F-RTC works based on Double Deep Q-learning Networks with a centralized control center to measure Jain's fairness index for each vehicle. F-RTC can suggest cutting down or increasing the data rate of the vehicles based on their score results. The evaluation results show that F-RTC outperforms the state-of-the-art risk-based rate control methods in terms of congestion rate reduction and channel busy ratio while having no significant impact on the packet delivery rate. We release the source code of the learning model for interesting readers at https://***/lanvernon/FRTC.
With the development of virtual reality (VR) and augmented reality (AR) devices, handheld controllers and camera-based hand tracking are the most common methods for interacting with the virtual world. This paper propo...
详细信息
ISBN:
(数字)9798350366501
ISBN:
(纸本)9798350366518
With the development of virtual reality (VR) and augmented reality (AR) devices, handheld controllers and camera-based hand tracking are the most common methods for interacting with the virtual world. This paper proposes arm motion tracking and hand gesture recognition methods based on a single IMU (Inertial Measurement Unit) sensor embedded in a wristband. This approach is more convenient than handheld controllers and more energy-efficient than camera-based tracking. We utilize two deep learning models: one for arm motion tracking and the other for hand gesture recognition. When training the arm motion tracking model, we use two IMU sensors-one placed on the wrist to provide forearm pose data and the other placed above the elbow to provide upper arm pose data. We use the elbow’s IMU data as the reference for training the model and calculate the real-time upper arm pose vector based on the wrist’s pose data. We train the hand gesture recognition model using only the wrist-mounted IMU sensor. This model captures different vibrations in the IMU during various gestures (such as grab, release, snap, and tap) for training. We achieve real-time arm motion tracking and hand gesture recognition using only a single IMU sensor. The angle error for arm motion tracking is only 15 degrees, and the hand gesture recognition accuracy reaches 95%.
We study the use of PPO (Proximal Policy Optimization) algorithm for trading ETFs belonging to different asset classes. The studied asset classes include common stocks, bonds, REIT (Real Estate Investment Trust), gold...
We study the use of PPO (Proximal Policy Optimization) algorithm for trading ETFs belonging to different asset classes. The studied asset classes include common stocks, bonds, REIT (Real Estate Investment Trust), gold, and future contracts on agricultural commodities. When properly training the PPO agent with the proposed ratio allocation strategy, the agent outperforms the static, periodical rebalancing approach.
The use of spatial audio plugins (SAPs) with Ambisonics processing and binaural rendering has become widespread in the last decade, thanks to their increased accessibility and usability. SAPs are particularly relevant...
The use of spatial audio plugins (SAPs) with Ambisonics processing and binaural rendering has become widespread in the last decade, thanks to their increased accessibility and usability. SAPs are particularly relevant in scenarios involving real-time music playing with headphones, such as networked music performance and individual recreational music-making using backing tracks. However, a crucial issue that has been largely overlooked thus far is the measurement of the processing latency introduced by currently available SAPs. Identifying which SAPs are the fastest is essential to enable designers, musicians, and researchers to create time-sensitive applications involving 3D audio. To bridge this gap, we compared nine systems formed by different SAPs that enable 3D audio management. We measured the latency of each system throughout the third-order Ambisonics plugins pipeline: encoding, room simulation, sound scene rotation, and binaural decoding. In particular, the measurements were performed utilizing different buffer sizes. Results showed that to achieve a minimization of the latency, it is necessary to use a combination of different SAPs from different systems. Based on our measurements, we propose two spatial audio systems that mix different SAPs. Considering a sampling rate of 48 kHz, a Dell Alienware x15 R2 laptop running the Windows 10 operating system, and an RME Fireface UFX sound card, the two systems achieved an overall latency of 0.33 ms and 0.94 ms respectively.
Cardiovascular disease is an immense threat in today's world. Every year lots of people died by this disease. Though medical science evolved and evaluated much extends still in many areas of the world there are la...
详细信息
暂无评论