This paper considers multi-view video and audio transmission on ICN (Information-Centric Networking)/CCN (Content-Centric Networking). Routers in ICN/CCN can cache content. Besides, the capacity of routers' caches...
详细信息
ISBN:
(数字)9798350374537
ISBN:
(纸本)9798350374544
This paper considers multi-view video and audio transmission on ICN (Information-Centric Networking)/CCN (Content-Centric Networking). Routers in ICN/CCN can cache content. Besides, the capacity of routers' caches is finite, so various cache control schemes have been proposed to improve cache efficiency. This paper presents and evaluates a new and suitable control scheme for multi-view video and audio transmission. For this purpose, we construct a network environment with Cefore. We assess application-level QoS (Quality of Service) and QoE (Quality of Experience). We then show the effectiveness of the proposed control scheme.
This paper compares video and audio QoE of OFDMA multi-user transmission and reliable groupcast over wireless LAN. We assume video and audio transmission to several terminals simultaneously. As a reliable groupcast me...
详细信息
ISBN:
(数字)9798350364866
ISBN:
(纸本)9798350364873
This paper compares video and audio QoE of OFDMA multi-user transmission and reliable groupcast over wireless LAN. We assume video and audio transmission to several terminals simultaneously. As a reliable groupcast method, we employ Unsolicited Retry, a technique of IEEE 802.11aa GroupCast with Retries. We utilize IEEE 802.11ax for OFDMA transmission. We evaluate application-level QoS by computer simulation. We then assess QoE through a subjective experiment with video and audio streams generated by the output timing obtained from the simulation. We notice that each method has a situation to be applied appropriately.
Low resolution image-face recognition system is one of the challenging aspects of face recognition models' development. From machine learning, deep learning, and into ensemble learning are implemented to develop f...
详细信息
ISBN:
(数字)9798331520762
ISBN:
(纸本)9798331520779
Low resolution image-face recognition system is one of the challenging aspects of face recognition models' development. From machine learning, deep learning, and into ensemble learning are implemented to develop face recognition models. Including mpdCNN that published by Mishra in a journal article. This mpdCNN already achieved 88.6% accuracy score on SCFace dataset while published. Based on literature review, ensemble learning model can be improved by modifying parallel layers. The purpose of this research is to improve mpdCNN's performance by implementing some modifications include adding parallel layers, alternating the fusion layers, expanding the layers, and adding residual connections. Some modifications were successfully improved mpdCNN's accuracy score. By adding parallel layers and residual connections into mpdCNN's architecture, the modified mpdCNN that proposed in this research achieved 92.33% accuracy score, measured by Rank-k evaluation metric.
Automatic Speech Recognition (ASR) is useful for converting speech into text. ASR is needed to display automatic subtitles on movies or when conducting video conferencing. The use of deep learning in ASR applications ...
Automatic Speech Recognition (ASR) is useful for converting speech into text. ASR is needed to display automatic subtitles on movies or when conducting video conferencing. The use of deep learning in ASR applications is currently still dominated by Long Short-Term Memory (LSTM) and Recurrent Neural Network (RNN). Several previous studies used the Transformer deep learning model in building the ASR model. The accuracy results obtained are better than the LSTM or RNN models. However, ASR research using the Transformer model is still very limited. This paper will discuss the use of the Transformer model for ASR in Indonesian. The dataset used is the Indonesian language private speech dataset. The experimental results show that the Word Error Rate (WER) and Character Error Rate (CER) produced by the proposed model using the Indonesian language primary dataset are almost the same as using the English language public dataset where the resulting WER and CER are 27.34 and 7.96 for Indonesian and 25.28 and 6.02 for English.
LoRa's biggest advantage is its flexibility, which is the ability to increase or decrease data rate and range while decreasing or increasing sensitivity. Whenever propagation conditions change frequently, this fun...
详细信息
The ability to differentiate various products in the retail store plays an essential role to provide effectiveness to customers and reduce or even eliminate long queues. However, traditional machine learning algorithm...
详细信息
The ability to differentiate various products in the retail store plays an essential role to provide effectiveness to customers and reduce or even eliminate long queues. However, traditional machine learning algorithms are incapable of recognizing many subordinate categories in various retail product. This paper aims to recognize the retail product recognition algorithm based on the YOLOv7 model in terms of intra-class variations, with the sub-categories of brand and size. We used two schemes of the dataset to compare recognition performance between them. Firstly, the YOLOv7 is applied in the two schemes of the dataset that is annotated with the subordinate category to detect the brand as meta category. Secondly, the proposed method is applied by adding the object size classification into the YOLOv7 model where the square area of the bounding box was calculated to classify the product according to size. Confidence score and square area are used to verify the object and to obtain the product size, which represents sub-category of the product. The experimental results show that our proposed method achieves higher recall compared to baseline object detection.
Real-world practical systems inherently exhibit non-linearities in their dynamics. Also, it is known that a time-varying delay exists in the system state or input-output. Combined, it affects the stability of the clos...
详细信息
Real-world practical systems inherently exhibit non-linearities in their dynamics. Also, it is known that a time-varying delay exists in the system state or input-output. Combined, it affects the stability of the closed-loop system. It also increases the complexity of the controller design. In-depth controller design research on the class of Nonlinear Systems with Time-Varying Delay (NSTVD) has been the focus of the control community for many years. However, there is a lack of Systematic Literature Review (SLR) and classifications of the papers on this topic. This paper aims to review controller design utilizing a neural network model for the class of NSTVD systems. The study employs Kitchenham’s SLR method to gather, analyze and synthesize published papers from reliable databases between 2017 and 2021. The bibliometric analysis for the selected 38 papers reveals the prolific authors, countries, affiliations, publishers, co-authorship network, co-occurrences of keywords, and ten most-cited papers. Finally, this paper developed a conceptual map outlining six multi-layered findings: the addressed problem, control design method, nonlinear system properties, time-varying delay properties, system constraint properties, and actuator limit properties. A brief qualitative analysis of the ten most-cited papers is performed based on the map. The findings highlighted that the proposed methods have shown encouraging results in the simulation domain and can be used as a source of inspiration for future studies and implementation of the neural controller design of the NSTVD system.
Modern healthcare systems demand comprehensive information systems but face obstacles during adoption. Organizational and structural complexity, especially decentralized systems, challenges the integrated management a...
详细信息
This work presents the application of the Continuous Wavelet Transform employing the complex Morlet mother wavelet for the modal identification of electromechanical transients. The observed oscillations are decoupled ...
This work presents the application of the Continuous Wavelet Transform employing the complex Morlet mother wavelet for the modal identification of electromechanical transients. The observed oscillations are decoupled into simpler oscillatory modes through analytical expressions, from which the relative oscillation frequencies are estimated for each mode. Additionally, the most influential modes in the detected transient are identified based on the amplitude of each oscillation. To demonstrate the effectiveness of the method, two simulations were conducted: one using a synthetic multimodal signal and the other using a unimodal signal obtained from tests on the Power System Stabilizer of a generator from the Belo Monte Hydroelectric Power Plant.
The development of technology and artificial intelligence, especially in the era of Industrial Revolution 4.0 almost covers all fields, one of which is e-learning. In its development, it is very important to determine...
详细信息
暂无评论