With advanced artificial intelligence and deep learning techniques, a growing number of data sources are playing more and more critical roles in planning and operating transportation services. The General Transit Feed...
详细信息
With advanced artificial intelligence and deep learning techniques, a growing number of data sources are playing more and more critical roles in planning and operating transportation services. The General Transit Feed Specification (GTFS), with standard open-source data in both static and real-time formats, is being widely used in public transport planning and operation management. However, compared to other extensively studied data sources such as smart card data and GPS trajectory data, the GTFS data lacks proper investigation yet. Utilization of the GTFS data is challenging for both transport planners and researchers due to its difficulty and complexity of understanding, processing, and leveraging the raw data. In this paper, a GTFS data acquisition and processing framework is proposed to offer an efficient and effective benchmark tool for converting and fusing the GTFS data to a ready-to-use format. To validate and test the proposed framework, a multivariate multistep Long Short-Term Memory is developed to predict train delay with minor anomaly in Sydney as a case study. The contribution of this new framework will render great potential for broader applications and deeper research.
Question generation based on conversational context is a difficult problem to solve. A widely used technique for generating quality questions using fine-tuned models relies on a suitable answer and the context, usuall...
详细信息
Question generation based on conversational context is a difficult problem to solve. A widely used technique for generating quality questions using fine-tuned models relies on a suitable answer and the context, usually the passage. But when it comes to conversational settings, the questions generated are not of the highest quality as they lack the contextual element in the question, especially due to the lack of co-reference resolution of the entity. Furthermore, in most of the evaluation techniques for generating questions, there seems to be a lack of utilizing powerful question-answering systems to judge the answerability of the questions generated. The most prevalent metric used for judging machine-generated text against the human gold standard, BLUE, unfortunately doesn't factor in whether a question answering system would be able to answer the question, but instead focuses mostly on the number of substrings that match against each other. Various question generation models following a generalized encoder-decoder architecture were evaluated using semantic textual similarity for both the generated questions and the generated answers. Although higher parameters in a model usually lend to better performance, our experiment displayed that such is not always the case, at least when there is a massive amount of context missing.
Growing amounts of information sources are required as the quantity of available information from various sources, both self governing and heterogeneous data maintenance requirement increases. A major component of the...
详细信息
Learning distance functions between complex objects, such as the Wasserstein distance to compare point sets, is a common goal in machine learning applications. However, functions on such complex objects (e.g., point s...
Learning distance functions between complex objects, such as the Wasserstein distance to compare point sets, is a common goal in machine learning applications. However, functions on such complex objects (e.g., point sets and graphs) are often required to be invariant to a wide variety of group actions e.g. permutation or rigid transformation. Therefore, continuous and symmetric product functions (such as distance functions) on such complex objects must also be invariant to the product of such group actions. We call these functions symmetric and factor-wise group invariant functions (or SFGI functions in short). In this paper, we first present a general neural network architecture for approximating SFGI functions. The main contribution of this paper combines this general neural network with a sketching idea to develop a specific and efficient neural network which can approximate the p-th Wasserstein distance between point sets. Very importantly, the required model complexity is independent of the sizes of input point sets. On the theoretical front, to the best of our knowledge, this is the first result showing that there exists a neural network with the capacity to approximate Wasserstein distance with bounded model complexity. Our work provides an interesting integration of sketching ideas for geometric problems with universal approximation of symmetric functions. On the empirical front, we present a range of results showing that our newly proposed neural network architecture performs comparatively or better than other models (including a SOTA Siamese Autoencoder based approach). In particular, our neural network generalizes significantly better and trains much faster than the SOTA Siamese AE. Finally, this line of investigation could be useful in exploring effective neural network design for solving a broad range of geometric optimization problems (e.g., k-means in a metric space).
Recent advances in generative artificial intelligence (AI), and particularly the integration of large language models (LLMs), have had considerable impact on multiple domains. Meanwhile, enhancing dynamic network perf...
详细信息
A huge number of text documents are divided into a limited number of groups using the unsupervised learning technique known as text clustering. While the clusters contain different text documents, each cluster contain...
A huge number of text documents are divided into a limited number of groups using the unsupervised learning technique known as text clustering. While the clusters contain different text documents, each cluster contains similar documents. Swarm intelligence (SI) optimization methods have been successfully used to resolve a variety of optimization issues, including difficulties with text document grouping. However, the traditional SI algorithms have many drawbacks including the local optima, low convergence rate, and low accuracy. Hence, the present research work proposes a novel text clustering approach based on bacterial colony optimization (BCO) for separating text documents based on similarity. Three separate text document datasets are used for the experiments, and performance is assessed using three different performance measures. The novel BCO approach produces high accuracy and a quick convergence rate, according to the analysis of the results. The suggested BCO text clustering strategy compares several text clustering methods to analyze strength and robustness.
Picking actions performed by industrial robots may be evaluated using a new architecture in this article. It has always been a goal of humankind to imbue their products with lifelike characteristics in an effort to cr...
详细信息
Various acceleration approaches for Policy Gradient (PG) have been analyzed within the realm of Reinforcement Learning (RL). However, the theoretical understanding of the widely used momentum-based acceleration method...
详细信息
One of the most critical problems in the field of string algorithms is the longest common subsequence problem (LCS). The problem is NP-hard for an arbitrary number of strings but can be solved in polynomial time for a...
One of the most critical problems in the field of string algorithms is the longest common subsequence problem (LCS). The problem is NP-hard for an arbitrary number of strings but can be solved in polynomial time for a fixed number of strings. In this paper, we select a typical parallel LCS algorithm and integrate it into our large-scale string analysis algorithm library to support different types of large string analysis. Specifically, we take advantage of the high-level parallel language, Chapel, to integrate Lu and Liu's parallel LCS algorithm into Arkouda, an open-source framework. Through Arkouda, data scientists can easily handle large string analytics on the back-end high-performance computing resources from the front-end Python interface. The Chapel-enabled parallel LCS algorithm can identify the longest common subsequences of two strings, and experimental results are given to show how the number of parallel resources and the length of input strings can affect the algorithm's performance.
Handwritten signature verification is a crucial task with applications spanning authentication, financial transactions, and legal documents. In scenarios where only a single reference signature is available, the chall...
Handwritten signature verification is a crucial task with applications spanning authentication, financial transactions, and legal documents. In scenarios where only a single reference signature is available, the challenge of accurate verification becomes pronounced due to variations in writing styles, distortions, and limited labeled data. In this paper, we propose a novel Siamese-Transformer network tailored for handwritten signature verification using few-shot learning. By synergizing Siamese neural networks and Transformer architectures, our model excels in capturing contextual relationships and discerning genuine from forged signatures. A triplet loss function facilitates discriminative feature learning. Convolution layers extract local features from an image, while the transformer component utilizes these local features to capture global dependencies within signatures. Experimental results on benchmark datasets showcase the model’s superior performance in few-shot verification scenarios, marking it as a promising advancement in signature verification and few-shot learning techniques.
暂无评论