In the realm of music AI, arranging rich and structured multi-track accompaniments from a simple lead sheet presents significant challenges. Such challenges include maintaining track cohesion, ensuring long-term coher...
ISBN:
(纸本)9798331314385
In the realm of music AI, arranging rich and structured multi-track accompaniments from a simple lead sheet presents significant challenges. Such challenges include maintaining track cohesion, ensuring long-term coherence, and optimizing computational efficiency. In this paper, we introduce a novel system that leverages prior modelling over disentangled style factors to address these challenges. Our method presents a two-stage process: initially, a piano arrangement is derived from the lead sheet by retrieving piano texture styles; subsequently, a multi-track orchestration is generated by infusing orchestral function styles into the piano arrangement. Our key design is the use of vector quantization and a unique multi-stream Transformer to model the long-term flow of the orchestration style, which enables flexible, controllable, and structured music generation. Experiments show that by factorizing the arrangement task into interpretable sub-stages, our approach enhances generative capacity while improving efficiency. Additionally, our system supports a variety of music genres and provides style control at different composition hierarchies. We further show that our system achieves superior coherence, structure, and overall arrangement quality compared to existing baselines.
We study private stochastic convex optimization (SCO) under user-level differential privacy (DP) constraints. In this setting, there are n users (e.g., cell phones), each possessing m data items (e.g., text messages),...
Deep learning-based hyperspectral image (HSI) compression has recently attracted great attention in remote sensing due to the growth of hyperspectral data archives. Most of the existing models achieve either spectral ...
详细信息
This research presents the development and evaluation of SPEAR, an advanced voice-activated personal desktop assistant designed to address challenges in existing virtual assistant technology, such as limited language ...
详细信息
Investigating the interactions among particles in high-energy physics is essential for various tasks, such as reconstructing particle decays. The neural relational inference encoder model, capable of capturing inter-e...
详细信息
The digitization of Electronic Health Records (EHRs) has brought a revolution in health care delivery and even practices in reporting key organizational HR processes. However, these advancements have their merits and ...
详细信息
The 'Smart Vehicle Monitoring System' presents a comprehensive solution for enhancing road safety and user authentication in the realm of modern transportation. This system integrates advanced technologies to ...
详细信息
Light Detection and Ranging (LiDAR) technology is one of the integral parts of systems involving connected vehicles. It provides the necessary two-dimensional accuracy to enable proper navigation, object identificatio...
详细信息
The data-cleaning approach applies the capabilities of large language models to reduce the noise in the extracted and received data from healthcare sources. The aim will be to clean the collected and extracted data by...
详细信息
Strengthening network security is a must in today's digital landscape. Existing Intrusion Detection Systems commonly make use of deep learning techniques such as Deep Neural Networks to identify malicious & an...
详细信息
暂无评论