During conversations, humans are capable of inferring the intention of the speaker at any point of the speech to prepare the following action promptly. Such ability is also the key for conversational systems to achieve rhythmic and natural conversation. To achieve this, the automatic speech recognition (ASR) used for transcribing the speech in real time must reach high accuracy without delay. In streaming ASR, high accuracy is assured by attending to look-ahead frames, which increases delay. To tackle this trade-off, we propose a multiple latency streaming ASR to achieve high accuracy with zero look-ahead. The proposed system contains two encoders that operate in parallel: a primary encoder generates accurate outputs utilizing look-ahead frames, and an auxiliary encoder recognizes the look-ahead portion of the primary encoder without look-ahead. The proposed system is built on the contextual block streaming (CBS) architecture, which leverages block processing and has a high affinity for the multiple latency architecture. Various methods are also studied for architecting the system, including shifting the network to perform as different encoders, as well as generating both encoders' outputs in one encoding pass.
This paper investigates the wireless communication with a novel architecture of antenna arrays, termed modular extremely large-scale array (XL-array), where array elements of an extremely large number/size are regularly mounted on a shared platform with both horizontally and vertically interlaced *** module consists of a moderate/flexible number of array elements with the inter-element distance typically in the order of the signal wavelength, while different modules are separated by the relatively large inter-module distance for convenience of practical *** accurately modelling the signal amplitudes and phases, as well as projected apertures across all modular elements, we analyse the near-field signal-to-noise ratio (SNR) performance for modular XL-array *** on the non-uniform spherical wave (NUSW) modelling, the closed-form SNR expression is derived in terms of key system parameters, such as the overall modular array size, distances of adjacent modules along all dimensions, and the user's three-dimensional (3D) *** addition, with the number of modules in different dimensions increasing infinitely, the asymptotic SNR scaling laws are ***, we show that our proposed near-field modelling and performance analysis include the results for existing array architectures/modelling as special cases, e.g., the collocated XL-array architecture, the uniform plane wave (UPW) based far-field modelling, and the modular extremely large-scale uniform linear array (XL-ULA) of *** simulation results are presented to validate our findings.
The ability to inform transportation businesses and regulatory bodies about the demand for transportation services, and about how resources may best be deployed to meet this need, is strategic to resource allocation and planning of the national transportation system. As a case in point, the pandemic has forced the government to reevaluate the number of utility buses and their ridership and to implement various restrictions and regulations within society. One of the parameters often affected by these restrictions is the seating capacity of Public Utility Vehicles. This study provides a proof of concept of an easier and more efficient way to monitor capacity in a Public Utility Bus through the use of a people counter. Photoelectric sensors are used to monitor the number of people entering and exiting the bus, with the information uploaded to ThingSpeak and also displayed onboard on an LCD. The system is capable of bi-directional counting through the use of two pairs of sensors, and works most accurately when the sensor pairs are at least 12 cm apart and passengers board and depart at intervals of one second or greater.
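The bi-directional counting described above can be illustrated with a minimal sketch. It assumes that direction is inferred from the order in which the two beams are broken, which the abstract does not state explicitly. The class name `PeopleCounter`, the sensor labels `"outer"` and `"inner"`, and the `max_gap_s` pairing window are hypothetical and are not taken from the study.

```python
from typing import Optional, Tuple


class PeopleCounter:
    """Infer boarding/alighting from the order of two beam-break events.

    Assumption (not from the study): the "outer" sensor pair sits nearer the
    door, so outer-then-inner means a passenger boarded and inner-then-outer
    means a passenger left.
    """

    def __init__(self, max_gap_s: float = 1.0) -> None:
        self.max_gap_s = max_gap_s  # hypothetical pairing window
        self.count = 0
        self._pending: Optional[Tuple[str, float]] = None

    def on_beam_break(self, sensor: str, t: float) -> None:
        if sensor not in ("outer", "inner"):
            raise ValueError(f"unknown sensor: {sensor!r}")
        if self._pending is not None:
            first, t0 = self._pending
            # A different sensor firing soon after the first completes a pass.
            if first != sensor and t - t0 <= self.max_gap_s:
                step = 1 if first == "outer" else -1
                self.count = max(self.count + step, 0)  # never below zero
                self._pending = None
                return
        # Otherwise remember this event as the start of a possible pass.
        self._pending = (sensor, t)
```

Feeding the events `("outer", 0.0)` and `("inner", 0.3)` raises the count by one; the reverse order lowers it. A real deployment would also need debouncing and a policy for two passengers overlapping in the doorway, which this sketch leaves out.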
A scene plane information recognition method is demonstrated based on data fusion using a single ToF camera. This approach effectively tackles general LiDAR's deficiencies in identifying planar content, achieving ...
To assist drivers, the researchers propose a car park occupancy monitoring system that uses a Raspberry Pi to obtain photos needed by YOLOv7 to determine the presence of vehicles in a parking area. OpenCV is used to count the number of available parking spaces from the analysis of YOLOv7. The information is uploaded to the internet in real-time for access by users of the parking area. The data is uploaded and processed in Google Sheets and is accessed by a chatbot, using FlowXO. The chatbot is then deployed in Facebook Messenger, available to the targeted end users.
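As a rough illustration of the counting step, the sketch below takes vehicle bounding boxes (such as a detector might return) and a list of parking-slot rectangles, and reports how many slots are free. The slot coordinates, the function names, and the rule that a slot counts as occupied when a vehicle's box center falls inside it are all assumptions made for this example; the abstract does not describe how the authors map detections to spaces, and no YOLOv7 or OpenCV call is used here.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) in pixels


def box_center(box: Box) -> Tuple[float, float]:
    x1, y1, x2, y2 = box
    return ((x1 + x2) / 2, (y1 + y2) / 2)


def contains(slot: Box, point: Tuple[float, float]) -> bool:
    x1, y1, x2, y2 = slot
    px, py = point
    return x1 <= px <= x2 and y1 <= py <= y2


def count_free_slots(slots: List[Box], vehicle_boxes: List[Box]) -> int:
    """Return the number of slots that contain no vehicle box center."""
    centers = [box_center(b) for b in vehicle_boxes]
    occupied = sum(1 for slot in slots if any(contains(slot, c) for c in centers))
    return len(slots) - occupied


# Hypothetical layout: three side-by-side slots, two detected vehicles.
slots = [(0, 0, 10, 20), (10, 0, 20, 20), (20, 0, 30, 20)]
vehicles = [(1, 2, 9, 18), (21, 3, 29, 17)]
free = count_free_slots(slots, vehicles)
```

The center-in-slot rule is simple but can miscount vehicles parked across two slots; an overlap-ratio test would be a natural alternative.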
Although there have been multiple studies related to the tracking of mosquito wingbeat frequencies, not much has been done yet on Philippine mosquitoes. Additionally, most of the current methods for tracking mosquitoes involve actively trapping mosquitoes and evaluating them in a separate laboratory. As a response, an acoustic sensor module was developed using an Arduino microprocessor to identify and classify mosquitoes. Mosquitoes were lured and zapped. The module took sound input using an omnidirectional microphone and a parabolic dish housed in a pipe. The Arduino used digital signal processing to decrease background noise, identify probable mosquito wingbeats, and categorize different mosquito species according to the frequency of the wingbeats. Audacity was used to create recordings for reference, along with manual checking. ThingSpeak, an internet platform, received classified data and provided real-time display and analysis. Real-time classifications from the module were shown, and a histogram showed how frequently identified mosquito wingbeat frequencies were distributed. This made it possible for users to keep an eye on and track the mosquito population in the region where the module was placed. The findings showed that UV light attracts mosquitoes more potently than yeast. However, it was mentioned that combining yeast and UV light as luring techniques would be interesting for further study. The monitoring of mosquito populations is made easier by the interface with ThingSpeak, which offers real-time data display.
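The frequency-based classification described above can be sketched as follows. This is a desktop Python illustration, not the Arduino code used in the study: it estimates the dominant frequency of a short audio frame with a naive discrete Fourier transform and looks it up in a band table. The band limits and the labels `species_A` and `species_B` are placeholders invented for the example and are not measured wingbeat ranges.

```python
import cmath
import math
from typing import Dict, List, Optional, Tuple

# Placeholder bands in Hz; real limits would come from reference recordings.
HYPOTHETICAL_BANDS: Dict[str, Tuple[float, float]] = {
    "species_A": (300.0, 450.0),
    "species_B": (450.0, 650.0),
}


def dominant_frequency(samples: List[float], sample_rate: float) -> float:
    """Return the frequency (Hz) of the strongest non-DC DFT bin."""
    n = len(samples)
    best_bin, best_mag = 0, 0.0
    for k in range(1, n // 2):
        acc = sum(s * cmath.exp(-2j * math.pi * k * i / n)
                  for i, s in enumerate(samples))
        if abs(acc) > best_mag:
            best_bin, best_mag = k, abs(acc)
    return best_bin * sample_rate / n


def classify(freq_hz: float) -> Optional[str]:
    """Map a frequency to the first band containing it, or None."""
    for label, (low, high) in HYPOTHETICAL_BANDS.items():
        if low <= freq_hz < high:
            return label
    return None


# Synthetic 500 Hz tone sampled at 8 kHz (400 samples, 20 Hz resolution).
fs = 8000.0
tone = [math.sin(2 * math.pi * 500.0 * i / fs) for i in range(400)]
peak = dominant_frequency(tone, fs)
label = classify(peak)
```

The naive transform costs O(n²) operations per frame; on a microcontroller an FFT or a Goertzel filter over a few candidate frequencies would be the more practical choice.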
The ability to monitor falls, especially for the elderly, is deemed crucial to providing a quality and timely healthcare response. However, there have been minimal efforts in centralizing such activity for efficient hospital management. This paper presents the development of a full-stack fall monitoring system with edge computing and machine learning technologies. Using the 3-axis accelerometer of a smartphone, motion data is collected and sent directly to an edge computing platform, wherein a shallow neural network is trained to classify the motion data into positional states: stable, falling sidewards, falling flat, and standing up. A confusion matrix is presented to evaluate the performance of the neural network model, both in training and in real time. A cloud-based approach using ReactJS for front-end integration and Firebase's Cloud Firestore with NodeJS embedded capabilities for real-time data storage and embedded classification is implemented.
Chronic pulmonary diseases remain a prevalent threat globally. With the emergence of COVID-19 and its transmission, there has been a rapid increase in the number of deaths due to respiratory illnesses. In this study, lung sound classifications were performed using a Thinklabs One digital stethoscope and through the utilization of Long Short-Term Memory (LSTM) in the classification of a person's lung auscultation record into either the normal, crackle, wheeze, or stridor categories with a 92.50% accuracy. Performance evaluation of this system was also done to cross-check for the validity of the algorithm modeled through Edge Impulse, which provided a 92.77% accuracy. The integration of the system adopted an Android-based mobile application as the pulmonary monitoring platform that records a person's general respiratory health data. The inputs from the mobile application were anonymously stored in a centralized database system correspondingly for post-processing and analysis.
Machine learning offers a valuable way of dealing with generic and urgent problems in the community with the aid of mathematical concepts. Among the pressing dilemmas it addresses is climate change, which has an im...
We study the problem of reconfiguring one minimum s-t-separator A into another minimum s-t-separator B in some n-vertex graph G containing two non-adjacent vertices s and t. We consider several variants of the problem ...