ISBN (print): 9781450388078
Procedural content generation via machine learning (PCGML) has demonstrated its usefulness as a content and game creation approach and has been shown to support human creativity. An important facet of creativity is combinational creativity: the recombination, adaptation, and reuse of ideas and concepts between and across domains. In this paper, we present a PCGML approach for level generation that recombines, adapts, and reuses structural patterns from several domains to approximate unseen domains. We extend prior work on example-driven Binary Space Partitioning for recombining and reusing patterns across multiple domains, and incorporate variational autoencoders (VAEs) for generating unseen structures. We evaluate our approach by blending across 7 domains and subsets of those domains. We show that our approach blends domains together while retaining structural components. Additionally, by using different groups of training domains, our approach can generate both 1) levels that reproduce and capture features of a target domain, and 2) levels with vastly different properties from the input domains.
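The example-driven Binary Space Partitioning that this abstract builds on is not detailed here; as a rough, hypothetical sketch of plain BSP (the grid dimensions, size threshold, and split rule are illustrative assumptions, not the authors' configuration), a level region can be recursively divided into leaf slots that structural patterns could later fill:

```python
import random

def bsp_partition(x, y, w, h, max_size, rng):
    """Recursively split a (w x h) region until every leaf fits max_size."""
    if w <= max_size and h <= max_size:
        return [(x, y, w, h)]  # leaf region: a candidate slot for a pattern
    # Split along the longer axis at a random interior point.
    if w >= h:
        cut = rng.randint(1, w - 1)
        return (bsp_partition(x, y, cut, h, max_size, rng)
                + bsp_partition(x + cut, y, w - cut, h, max_size, rng))
    cut = rng.randint(1, h - 1)
    return (bsp_partition(x, y, w, cut, max_size, rng)
            + bsp_partition(x, y + cut, w, h - cut, max_size, rng))

rng = random.Random(0)
leaves = bsp_partition(0, 0, 32, 16, 8, rng)
# The leaves tile the original region exactly.
assert sum(w * h for _, _, w, h in leaves) == 32 * 16
```

In an example-driven variant, each leaf would then be filled with a pattern sampled from one of several training domains, which is where the blending happens.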
Emotional Voice Conversion (EVC) aims to convert the emotional state of speech from one emotion to another while preserving the linguistic information and identity of the speaker. However, many studies are limited by the requirement for parallel speech data between different emotional patterns, which is not widely available in real-life applications. Furthermore, the annotation of emotional data is highly time-consuming and labor-intensive. To address these problems, we propose SGEVC, a novel semi-supervised generative model for emotional voice conversion. We demonstrate that using as little as 1% labeled data is sufficient to achieve EVC. Experimental results show that our proposed model achieves state-of-the-art (SOTA) performance and consistently outperforms EVC baseline frameworks.
An electrocardiogram (ECG) provides crucial information about an individual's health status. Researchers utilize ECG data to develop learners for a variety of tasks, ranging from diagnosing ECG abnormalities to estimating time to death, here modeled as individual survival distributions (ISDs). The way the ECG is represented is important for creating an effective learner. While many traditional ECG-based prediction models rely on hand-crafted features, such as heart rate, this study aims to achieve a better representation. The effectiveness of various ECG-based feature extraction methods, whether supervised or unsupervised, for predicting ISDs has not been explored previously. The study uses a large ECG dataset from 244,077 patients with over 1.6 million 12-lead ECGs, each labeled with the patient's disease as one or more International Classification of Diseases (ICD) codes. We explored extracting high-level features from ECG traces using various approaches, then trained models that used these ECG features (along with age and sex), across a range of training sizes, to estimate patient-specific ISDs. The results showed that the supervised feature extraction method produced ECG features that estimate ISD curves better than ECG features obtained from unsupervised or knowledge-based methods. Supervised ECG features required fewer training instances (as few as 500) to learn ISD models that outperformed the baseline model using only age and sex, whereas unsupervised and knowledge-based ECG features required over 5,000 training samples to do so. The study's findings may assist researchers in selecting the most appropriate approach for extracting high-level features from ECG signals to estimate patient-specific ISD curves.
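As a generic illustration of the ISD target described above (not the authors' actual learner), a discrete individual survival distribution can be obtained from per-interval hazard predictions by chaining survival probabilities; the hazard values below are invented for illustration:

```python
def survival_curve(hazards):
    """Convert per-interval hazard probabilities h_i into a discrete
    individual survival distribution S(t) = prod_{i<=t} (1 - h_i)."""
    surv, s = [], 1.0
    for h in hazards:
        s *= (1.0 - h)
        surv.append(s)
    return surv

# A hypothetical patient whose predicted risk grows over time.
curve = survival_curve([0.05, 0.10, 0.20, 0.40])
# The survival curve is monotonically non-increasing.
assert all(curve[i] >= curve[i + 1] for i in range(len(curve) - 1))
```

Whatever features the extractor produces, age and sex included, a model of this form only needs to output the per-interval hazards; the curve itself follows mechanically.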
ISBN (digital): 9783031442131
ISBN (print): 9783031442124; 9783031442131
Understanding how to learn feature representations for images and generate high-quality images under unsupervised learning is challenging. One of the main difficulties in feature learning has been the problem of posterior collapse in variational inference. This paper proposes a hierarchical aggregated vector-quantized variational autoencoder, called TransVQ-VAE. First, multi-scale feature information based on a hierarchical Transformer is complementarily encoded to represent the global and structural dependencies of the input features. It is then compared against the latent encoding space via a linear difference to reduce the feature dimensionality. Finally, the decoder generates synthetic samples with higher diversity and fidelity than previous models. In addition, we propose a dual self-attention module in the encoding process that uses spatial and channel information to capture distant texture correlations, contributing to the consistency and realism of the generated images. Experimental results on the MNIST, CIFAR-10, CelebA-HQ, and ImageNet datasets show that our approach significantly improves the diversity and visual quality of the generated images.
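The vector-quantization step at the heart of any VQ-VAE-style bottleneck, of which TransVQ-VAE is a variant, is a nearest-codebook lookup; the tiny two-dimensional codebook below is an illustrative assumption, not the paper's configuration:

```python
def quantize(vectors, codebook):
    """Map each encoder output vector to the index of its nearest
    codebook entry (squared Euclidean distance), as in a VQ-VAE bottleneck."""
    def sqdist(a, b):
        return sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    return [min(range(len(codebook)), key=lambda k: sqdist(v, codebook[k]))
            for v in vectors]

codebook = [[0.0, 0.0], [1.0, 1.0], [0.0, 1.0]]
codes = quantize([[0.1, -0.2], [0.9, 1.2]], codebook)
assert codes == [0, 1]
```

In a full model the decoder sees the selected codebook entries rather than the raw encoder outputs, which is what makes the latent space discrete and helps avoid posterior collapse.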
ISBN (digital): 9783031431487
ISBN (print): 9783031431470; 9783031431487
Semantic image synthesis (SIS) refers to the problem of generating realistic imagery given a semantic segmentation mask that defines the spatial layout of object classes. Beyond the quality of the generated images, most approaches in the literature put effort into increasing generation diversity in terms of style, i.e., texture. However, they all neglect a different capability: manipulating the layout provided by the mask. Currently, the only way to do so is manually, by means of graphical user interfaces. In this paper, we describe a network architecture that automatically manipulates or generates the shape of object classes in semantic segmentation masks, with a specific focus on human faces. Our proposed model embeds the mask class-wise into a latent space where each class embedding can be independently edited. A bi-directional LSTM block and a convolutional decoder then output a new, locally manipulated mask. We report quantitative and qualitative results on the CelebMask-HQ dataset, which show that our model can both faithfully reconstruct and modify a segmentation mask at the class level. We also show that our model can be placed before a SIS generator, opening the way to fully automatic control of both shape and texture. Code available at https://***/TFonta/Semantic-VAE.
ISBN (print): 9783031054914; 9783031054907
Textiles are among the common necessities of our lives, and the quality of textile products is closely tied to the quality of the fabric materials, so the fabric and textile industry inspects fabric quality before the materials are processed. Traditionally, defects on the fabric surface were detected by human eyes; inspection standards were unreliable due to inspector fatigue and subjective judgment, and the process consumed considerable labor and time. Automated detection methods are therefore gradually being introduced into the textile industry as one of its important processes. With the rapid advancement of deep learning technology, deep neural networks have brought revolutionary changes to computer vision. This research employs an unsupervised deep learning model that combines a variational autoencoder (VAE) and a generative adversarial network (GAN) to detect fabric defects. The proposed fabric inspection networks, called FINs, use only non-defective fabric data to train the model, which avoids the need of traditional detection methods to collect a large amount of defect data. During model training, we introduce the structural similarity index to help the model learn the defect-free texture characteristics of fabric surfaces. With this method, surface defects can be found and the defective areas repaired; after segmentation, the position of each defect can be marked, and the detection results reach a good degree of accuracy.
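The structural similarity index mentioned above can be illustrated with a simplified global (non-windowed) variant on flat grayscale patches; a real fabric-inspection pipeline would apply a windowed SSIM over full images, and the constants below follow the common defaults rather than anything specified in this abstract:

```python
def ssim(x, y, c1=0.01 ** 2, c2=0.03 ** 2):
    """Global structural similarity between two equal-length grayscale
    patches with pixel values in [0, 1]; 1.0 means identical structure."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    vx = sum((a - mx) ** 2 for a in x) / n
    vy = sum((b - my) ** 2 for b in y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    return (((2 * mx * my + c1) * (2 * cov + c2))
            / ((mx ** 2 + my ** 2 + c1) * (vx + vy + c2)))

patch = [0.2, 0.4, 0.6, 0.8]
assert abs(ssim(patch, patch) - 1.0) < 1e-9   # identical patches
assert ssim(patch, [0.8, 0.6, 0.4, 0.2]) < 1.0  # reversed gradient
```

Training a reconstruction model against 1 − SSIM rather than plain pixel error is one common way to make it sensitive to texture structure, which is presumably why the authors introduce it.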
ISBN (print): 9798400701061
Timbre is high-dimensional and sensuous, making it difficult for musical-instrument learners to improve their timbre. Some systems exist to improve timbre, but they require expert labeling for timbre evaluation; merely visualizing the results of unsupervised learning, on the other hand, lacks intuitive feedback because human perception is not considered. We therefore employ crossmodal correspondences for intuitive visualization of timbre. We designed TimToShape, a system that visualizes timbre with 2D shapes based on the user's input of timbre-shape correspondences. TimToShape generates a shape morphed by linear interpolation according to the timbre's position in a latent space obtained by unsupervised learning with a variational autoencoder (VAE). We confirmed that people perceived shapes generated by TimToShape as corresponding more closely to timbre than randomly generated shapes. Furthermore, a user study with six violin players revealed that TimToShape was well received in terms of visual clarity and interpretability.
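The shape morphing by linear interpolation described above reduces, in its simplest form, to vertex-wise interpolation between two user-provided anchor shapes, with the interpolation weight derived from the timbre's position in the latent space; the anchor coordinates below are hypothetical:

```python
def morph(shape_a, shape_b, t):
    """Linearly interpolate two 2D shapes (matched vertex lists) by t in [0, 1]."""
    return [((1 - t) * ax + t * bx, (1 - t) * ay + t * by)
            for (ax, ay), (bx, by) in zip(shape_a, shape_b)]

# Hypothetical anchors: a "soft" rounded shape and a "sharp" spiky one.
soft = [(0.0, 1.0), (1.0, 0.0), (0.0, -1.0), (-1.0, 0.0)]
sharp = [(0.0, 2.0), (0.2, 0.0), (0.0, -2.0), (-0.2, 0.0)]
halfway = morph(soft, sharp, 0.5)
assert halfway[0] == (0.0, 1.5)
```

With more than two anchors, the same idea generalizes to a weighted blend whose weights come from distances to anchor points in the VAE latent space.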
ISBN (digital): 9783031282386
ISBN (print): 9783031282379; 9783031282386
Recent progress in pose-estimation methods enables the extraction of sufficiently precise 3D human skeleton data from ordinary videos, which offers great opportunities for a wide range of applications. However, such spatio-temporal data are typically extracted in the form of a continuous skeleton sequence without any information about semantic segmentation or annotation. To make the extracted data reusable for further processing, there is a need to access them based on their content. In this paper, we introduce a universal retrieval approach that compares any two skeleton sequences based on the temporal order and similarities of their underlying segments. The similarity of segments is determined by their content-preserving low-dimensional code representation, which is learned using the variational autoencoder principle in an unsupervised way. The quality of the proposed representation is validated in retrieval and classification scenarios; our proposal outperforms the state-of-the-art approaches in effectiveness and reaches speed-ups of up to 64x on common skeleton sequence datasets.
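The paper compares sequences of segment codes in temporal order; one generic way to do that (a sketch of the general idea, not the authors' exact similarity measure) is dynamic time warping over the low-dimensional codes, shown here with made-up 2-D codes:

```python
def dtw(seq_a, seq_b, dist):
    """Dynamic-time-warping cost between two segment-code sequences:
    respects temporal order while allowing local stretching."""
    inf = float("inf")
    n, m = len(seq_a), len(seq_b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = dist(seq_a[i - 1], seq_b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1],
                                 cost[i - 1][j - 1])
    return cost[n][m]

euclid = lambda a, b: sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5
a = [(0.0, 0.0), (1.0, 1.0), (2.0, 0.0)]
assert dtw(a, a, euclid) == 0.0           # identical sequences cost nothing
assert dtw(a, [(0.0, 0.0), (2.0, 0.0)], euclid) > 0.0
```

Because each segment is a short fixed-size code rather than raw skeleton frames, this comparison runs on far smaller inputs, which is where the reported speed-ups plausibly come from.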
The end-to-end singing voice synthesis (SVS) model VISinger [1] can achieve better performance than the typical two-stage model with fewer parameters. However, VISinger has several problems: a text-to-phase problem, where the end-to-end model learns a meaningless text-to-phase mapping; a glitch problem, where the harmonic components corresponding to the periodic signal of voiced segments undergo sudden changes with audible artefacts; and a low sampling rate, as 24 kHz does not meet the needs of high-fidelity generation at full-band rates (44.1 kHz or higher). In this paper, we propose VISinger 2, which addresses these issues by integrating digital signal processing (DSP) methods with VISinger. Specifically, inspired by recent advances in differentiable digital signal processing (DDSP) [2], we incorporate a DSP synthesizer into the decoder. The DSP synthesizer consists of a harmonic synthesizer and a noise synthesizer that generate periodic and aperiodic signals, respectively, from the latent representation z in VISinger. It supervises the posterior encoder to extract a latent representation free of phase information, preventing the prior encoder from modelling the text-to-phase mapping. To avoid glitch artefacts, HiFiGAN is modified to accept the waveforms generated by the DSP synthesizer as a condition for producing the singing voice. Moreover, with the improved waveform decoder, VISinger 2 manages to generate 44.1 kHz singing audio with richer expression and better quality. Experiments on the OpenCpop corpus [3] show that VISinger 2 outperforms VISinger, CpopSing, and RefineSinger in both subjective and objective metrics. Our audio samples and source code are available (1).
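The harmonic half of the DSP synthesizer described above is, in DDSP style, a sum of sinusoids at integer multiples of a fundamental frequency. The sketch below omits the noise branch and the time-varying control derived from the latent z; the fundamental, amplitudes, and duration are illustrative assumptions:

```python
import math

def harmonic_synth(f0, amps, sr=44100, dur=0.01):
    """Sum-of-sinusoids harmonic synthesizer: one sinusoid per harmonic
    of f0, weighted by amps, evaluated at sample rate sr for dur seconds."""
    n = int(sr * dur)
    return [sum(a * math.sin(2 * math.pi * f0 * (k + 1) * t / sr)
                for k, a in enumerate(amps))
            for t in range(n)]

# A 220 Hz tone with three decaying harmonics, 10 ms at 44.1 kHz.
wave = harmonic_synth(220.0, [0.6, 0.3, 0.1])
assert len(wave) == 441
```

Because such a signal is periodic and phase-continuous by construction, conditioning the neural vocoder on it is a plausible way to suppress the sudden harmonic changes the abstract calls glitches.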
ISBN (print): 9798400703270
As Internet applications continue to scale up, microservice architecture has become increasingly popular due to its flexibility and logical structure. Anomaly detection in traces that record inter-microservice invocations is essential for diagnosing system failures. Deep learning-based approaches allow accurate modeling of structural features (i.e., call paths) and latency features (i.e., call response times), which can determine whether a particular trace sample is anomalous. However, the point-wise manner employed by these methods incurs substantial detection overhead and is impractical given the massive volume of traces (billion-level). Furthermore, the point-wise approach lacks high-level information, as identical sub-structures across multiple traces may be encoded differently. In this paper, we introduce the first Group-wise Trace anomaly detection algorithm, named GTrace. This method categorizes traces into distinct groups based on their shared substructure, such as the entire tree or a sub-tree. A group-wise variational autoencoder (VAE) is then employed to obtain structural representations. Moreover, the innovative "predicting latency with structure" learning paradigm associates the grouped structure with the latency distribution within each group. The group-wise design enables representation caching and batched inference strategies, which significantly reduce the detection burden on the system. Our comprehensive evaluation reveals that GTrace outperforms state-of-the-art methods in both performance (2.64% to 195.45% improvement in AUC and 2.31% to 40.92% improvement in best F-score) and efficiency (21.9x to 28.2x speedup). We have deployed and assessed the proposed algorithm on eBay's microservices cluster, and our code is available at https://***/NetManAIOps/***.
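The "group by shared substructure" idea behind GTrace can be sketched by canonicalizing each call tree into a structural key, so traces with identical structure share one representation. The service names and tree encoding below are hypothetical, and the real system's grouping and group-wise VAE are far more involved:

```python
def structure_key(node):
    """Canonical key for a call tree node (name, children): the service
    name plus the sorted keys of its children, so invocation order
    does not change the group."""
    name, children = node
    return (name, tuple(sorted(structure_key(c) for c in children)))

def group_traces(traces):
    """Bucket traces by structural key; each bucket shares one encoding."""
    groups = {}
    for trace in traces:
        groups.setdefault(structure_key(trace), []).append(trace)
    return groups

# Two traces with the same call structure (in different order), one different.
t1 = ("gateway", [("auth", []), ("cart", [("db", [])])])
t2 = ("gateway", [("cart", [("db", [])]), ("auth", [])])
t3 = ("gateway", [("auth", [])])
groups = group_traces([t1, t2, t3])
assert len(groups) == 2
```

Caching one representation per key instead of encoding every trace is what makes batched, group-level detection cheaper than the point-wise alternative.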