检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

29,402 篇 会议
1,396 册 图书
219 篇 期刊文献

馆藏范围

31,015 篇 电子文献
2 种 纸本馆藏

日期分布

学科分类号

17,281 篇 工学
- 13,622 篇 计算机科学与技术...
- 5,203 篇 软件工程
- 2,970 篇 机械工程
- 2,648 篇 光学工程
- 1,412 篇 控制科学与工程
- 1,409 篇 电气工程
- 1,333 篇 信息与通信工程
- 656 篇 生物工程
- 576 篇 仪器科学与技术
- 513 篇 生物医学工程（可授...
- 465 篇 电子科学与技术（可...
- 251 篇 化学工程与技术
- 213 篇 安全科学与工程
- 141 篇 交通运输工程
- 132 篇 建筑学
- 121 篇 材料科学与工程（可...
- 119 篇 土木工程
5,056 篇 理学
- 3,130 篇 物理学
- 2,404 篇 数学
- 824 篇 生物学
- 802 篇 统计学（可授理学、...
- 299 篇 系统科学
- 228 篇 化学
3,830 篇 医学
- 3,799 篇 临床医学
- 186 篇 基础医学(可授医学...
- 140 篇 药学(可授医学、理...
1,061 篇 管理学
- 617 篇 图书情报与档案管...
- 469 篇 管理科学与工程(可...
- 146 篇 工商管理
373 篇 艺术学
- 373 篇 设计学（可授艺术学...
116 篇 法学
81 篇 农学
48 篇 教育学
43 篇 经济学
18 篇 军事学
8 篇 文学

主题

12,594 篇 computer vision
5,699 篇 pattern recognit...
3,181 篇 training
2,264 篇 cameras
2,179 篇 computational mo...
2,117 篇 feature extracti...
2,049 篇 image segmentati...
1,971 篇 visualization
1,967 篇 shape
1,642 篇 robustness
1,491 篇 layout
1,476 篇 three-dimensiona...
1,444 篇 computer science
1,338 篇 computer archite...
1,295 篇 object detection
1,221 篇 semantics
1,144 篇 face recognition
1,107 篇 conferences
1,077 篇 benchmark testin...
1,056 篇 humans

机构

137 篇 univ sci & techn...
134 篇 tsinghua univers...
134 篇 univ chinese aca...
118 篇 chinese univ hon...
101 篇 microsoft resear...
97 篇 zhejiang univers...
95 篇 national laborat...
93 篇 shanghai jiao to...
93 篇 zhejiang univ pe...
85 篇 university of sc...
79 篇 shanghai ai lab ...
78 篇 swiss fed inst t...
65 篇 microsoft res as...
62 篇 adobe research
62 篇 computer vision ...
61 篇 peking univ peop...
58 篇 univ oxford oxfo...
57 篇 google mountain ...
57 篇 hong kong univ s...
56 篇 google res mount...

作者

107 篇 umapada pal
81 篇 van gool luc
68 篇 zhang lei
59 篇 timofte radu
41 篇 yang yi
37 篇 loy chen change
37 篇 hanqing lu
33 篇 liu yang
33 篇 xiaoou tang
32 篇 nassir navab
32 篇 wang liang
30 篇 tian qi
29 篇 h. bischof
29 篇 jan-michael frah...
29 篇 vittorio murino
29 篇 darrell trevor
27 篇 li xin
27 篇 vasconcelos nuno
27 篇 murino vittorio
27 篇 chen chen

语言

30,712 篇 英文
236 篇 其他
93 篇 中文
6 篇 土耳其文
2 篇 日文
2 篇 俄文

检索条件"任意字段=Conference on Computer Vision and Pattern Recognition"

共 31017 条记录，以下是4851-4860 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

Personalized Outfit Recommendation with Learnable Anchors

Personalized Outfit Recommendation with Learnable Anchors

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Lu, Zhi Hu, Yang Chen, Yan Zeng, Bing Univ Elect Sci & Technol China Chengdu Sichuan Peoples R China Univ Sci & Technol China Hefei Anhui Peoples R China

ISBN: (纸本)9781665445092

The multimedia community has recently seen a tremendous surge of interest in the fashion recommendation problem. A lot of efforts have been made to model the compatibility between fashion items. Some have also studied users' personal preferences for the outfits. There is, however, another difficulty in the task that hasn't been dealt with carefully by previous work. Users that are new to the system usually only have several (less than 5) outfits available for learning. With such a limited number of training examples, it is challenging to model the user's preferences reliably. In this work, we propose a new solution for personalized outfit recommendation that is capable of handling this case. We use a stacked self-attention mechanism to model the high-order interactions among the items. We then embed the items in an outfit into a single compact representation within the outfit space. To accommodate the variety of users' preferences, we characterize each user with a set of anchors, i.e. a group of learnable latent vectors in the outfit space that are the representatives of the outfits the user likes. We also learn a set of general anchors to model the general preference shared by all users. Based on this representation of the outfits and the users, we propose a simple but effective strategy for the new user profiling tasks. Extensive experiments on large scale real-world datasets demonstrate the performance of our proposed method.

关键词： Training computer vision Computational modeling pattern recognition Reliability Task analysis Surges

来源：评论

学校读者我要写书评

暂无评论

Keep your Eyes on the Lane: Real-time Attention-guided Lane Detection

Keep your Eyes on the Lane: Real-time Attention-guided Lane ...

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Tabelini, Lucas Berriel, Rodrigo Paixao, Thiago M. Badue, Claudine De Souza, Alberto F. Oliveira-Santos, Thiago Univ Fed Espirito Santo UFES Vitoria ES Brazil Inst Fed Espirito Santo IFES Vitoria ES Brazil

ISBN: (纸本)9781665445092

Modern lane detection methods have achieved remarkable performances in complex real-world scenarios, but many have issues maintaining real-time efficiency, which is important for autonomous vehicles. In this work, we propose LaneATT: an anchor-based deep lane detection model, which, akin to other generic deep object detectors, uses the anchors for the feature pooling step. Since lanes follow a regular pattern and are highly correlated, we hypothesize that in some cases global information may be crucial to infer their positions, especially in conditions such as occlusion, missing lane markers, and others. Thus, this work proposes a novel anchor-based attention mechanism that aggregates global information. The model was evaluated extensively on three of the most widely used datasets in the literature. The results show that our method outperforms the current state-of-the-art methods showing both higher efficacy and efficiency. Moreover, an ablation study is performed along with a discussion on efficiency trade-off options that are useful in practice.

关键词： computer vision Codes Lane detection Computational modeling Aggregates Detectors Feature extraction

来源：评论

学校读者我要写书评

暂无评论

MDFIDNet: Multi-domain Feature Integration Denoising Network 27th

MDFIDNet: Multi-domain Feature Integration Denoising Network

引用

27th International conference on pattern recognition, ICPR 2024

作者： Das, Debashis Maji, Suman Kumar Department of Computer Science and Engineering Indian Institute of Technology Bihar Patna801106 India

ISBN: (纸本)9783031781247

In the realm of computer vision, image denoising remains a formidable challenge with profound implications for fields like medical imaging, remote sensing, and photography. Despite notable advancements in deep learning, there are enduring challenges: current convolutional neural networks (CNNs) frequently struggle with training complexities due to their emphasis on increased network depth. At the same time, these networks often fail to adequately consider the crucial role of gradient information in the denoising process. Furthermore, there is a distinct gap in leveraging transform domain analysis in image denoising. This study addresses these limitations with MDFIDNet, a novel triple-phase attentive fusion network tailored for image denoising. MDFIDNet integrates three independent feature extraction pipelines: a frequency domain processing pipeline (FDP) enhanced by a multi-scale convolutional attention Block (MSCAB), a spatial domain processing pipeline (SDP) focusing on detail feature preservation, and a gradient-domain processing pipeline (GDP) driven by multidirectional gradient information. Experimental validation demonstrates that MDFIDNet surpasses existing benchmarks, exhibiting robust performance across diverse datasets. Comprehensive ablation studies underscore the individual contributions of each network component, elucidating the novel advancements that underpin MDFIDNet’s superior denoising efficacy. The source code and further details are available in the https://***/debashis15/MDFIDNet. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

Semantic-Aware Video Text Detection

Semantic-Aware Video Text Detection

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Feng, Wei Yin, Fei Zhang, Xu-Yao Liu, Cheng-Lin Chinese Acad Sci Natl Lab Pattern Recognit NLPR Inst Automat Beijing 100190 Peoples R China Univ Chinese Acad Sci Sch Artificial Intelligence Beijing 100049 Peoples R China CAS Ctr Excellence Brain Sci & Intelligence Techn Beijing 100190 Peoples R China

ISBN: (纸本)9781665445092

Most existing video text detection methods track texts with appearance features, which are easily influenced by the change of perspective and illumination. Compared with appearance features, semantic features are more robust cues for matching text instances. In this paper, we propose an end-to-end trainable video text detector that tracks texts based on semantic features. First, we introduce a new character center segmentation branch to extract semantic features, which encode the category and position of characters. Then we propose a novel appearance-semanticgeometry descriptor to track text instances, in which semantic features can improve the robustness against appearance changes. To overcome the lack of character-level annotations, we propose a novel weakly-supervised character center detection module, which only uses word-level annotated real images to generate character-level labels. The proposed method achieves state-of-the-art performance on three video text benchmarks ICDAR 2013 Video, Minetto and RT-IK, and two Chinese scene text benchmarks CA-SIA10K and MSRA-TD500.

关键词： computer vision Text recognition Semantics Lighting Detectors Benchmark testing Feature extraction

来源：评论

学校读者我要写书评

暂无评论

LayoutGMN: Neural Graph Matching for Structural Layout Similarity

LayoutGMN: Neural Graph Matching for Structural Layout Simil...

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Patil, Akshay Gadi Li, Manyi Fisher, Matthew Savva, Manolis Zhang, Hao Simon Fraser Univ Burnaby BC Canada Adobe Res San Jose CA USA

ISBN: (纸本)9781665445092

We present a deep neural network to predict structural similarity between 2D layouts by leveraging Graph Matching Networks (GMN). Our network, coined LayoutGMN, learns the layout metric via neural graph matching, using an attention-based GMN designed under a triplet network setting. To train our network, we utilize weak labels obtained by pixel-wise Intersection-over-Union (IoUs) to define the triplet loss. Importantly, LayoutGMN is built with a structural bias which can effectively compensate for the lack of structure awareness in IoUs. We demonstrate this on two prominent forms of layouts, viz., floorplans and UI designs, via retrieval experiments on large-scale datasets. In particular, retrieval results by our network better match human judgement of structural layout similarity compared to both IoUs and other baselines including a state-of-theart method based on graph neural networks and image convolution. In addition, LayoutGMN is the first deep model to offer both metric learning of structural layout similarity and structural matching between layout elements.

关键词： Measurement Deep learning computer vision Convolution Computational modeling Layout Graph neural networks

来源：评论

学校读者我要写书评

暂无评论

Activate or Not: Learning Customized Activation

Activate or Not: Learning Customized Activation

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Ma, Ningning Zhang, Xiangyu Liu, Ming Sun, Jian Hong Kong Univ Sci & Technol Hong Kong Peoples R China MEGVII Technol Beijing Peoples R China

ISBN: (纸本)9781665445092

We present a simple, effective, and general activation function we term ACON which learns to activate the neurons or not. Interestingly, we find Swish, the recent popular NAS-searched activation, can be interpreted as a smooth approximation to ReLU. Intuitively, in the same way, we approximate the more general Maxout family to our novel ACON family, which remarkably improves the performance and makes Swish a special case of ACON. Next, we present meta-ACON, which explicitly learns to optimize the parameter switching between non-linear (activate) and linear (inactivate) and provides a new design space. By simply changing the activation function, we show its effectiveness on both small models and highly optimized large models (e.g. it. improves the ImageNet top-1 accuracy rate by 6.7% and 1.8% on MobileNet0.25 and ResNet-152, respectively). Moreover, our novel ACON can be naturally transferred to object detection and semantic segmentation, showing that ACON is an effective alternative in a variety of tasks. Code is available at https: // ***/nmaac/acon.

关键词： Image segmentation computer vision Codes Semantics Neurons Switches Object detection

来源：评论

学校读者我要写书评

暂无评论

Radar-Camera Pixel Depth Association for Depth Completion

Radar-Camera Pixel Depth Association for Depth Completion

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Long, Yunfei Morris, Daniel Liu, Xiaoming Castro, Marcos Chakravarty, Punarjay Narayanan, Praveen Michigan State Univ E Lansing MI 48824 USA Ford Motor Co Dearborn MI 48121 USA

ISBN: (纸本)9781665445092

While radar and video data can be readily fused at the detection level, fusing them at the pixel level is potentially more beneficial. This is also more challenging in part due to the sparsity of radar, but also because automotive radar beams are much wider than a typical pixel combined with a large baseline between camera and radar, which results in poor association between radar pixels and color pixel. A consequence is that depth completion methods designed for LiDAR and video fare poorly for radar and video. Here we propose a radar-to-pixel association stage which learns a mapping from radar returns to pixels. This mapping also serves to densify radar returns. Using this as a first stage, followed by a more traditional depth completion method, we are able to achieve imageguided depth completion with radar and video. We demonstrate performance superior to camera and radar alone on the nuScenes dataset. Our source code is available at https://***/longyunf/rc-pda.

关键词： Training computer vision Laser radar Radar measurements Image color analysis Radar detection Radar

来源：评论

学校读者我要写书评

暂无评论

Adversarial Generation of Continuous Images

Adversarial Generation of Continuous Images

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Skorokhodov, Ivan Ignatyev, Savva Elhoseiny, Mohamed King Abdullah Univ Sci & Technol KAUST Thuwal Saudi Arabia Skolkovo Inst Sci & Technol Moscow Russia

ISBN: (纸本)9781665445092

In most existing learning systems, images are typically viewed as 2D pixel arrays. However, in another paradigm gaining popularity, a 2D image is represented as an implicit neural representation (INR) - an MLP that predicts an RGB pixel value given its (x, y) coordinate. In this paper, we propose two novel architectural techniques for building INR-based image decoders: factorized multiplicative modulation and multi-scale INRs, and use them to build a state-of-the-art continuous image GAN. Previous attempts to adapt INRs for image generation were limited to MNIST-like datasets and do not scale to complex real-world data. Our proposed INR-GAN architecture improves the performance of continuous image generators by several times, greatly reducing the gap between continuous image GANs and pixel-based ones. Apart from that, we explore several exciting properties of the INR-based decoders, like out-of-the-box superresolution, meaningful image-space interpolation, accelerated inference of low-resolution images, an ability to extrapolate outside of image boundaries, and strong geometric prior. The project page is located at https://***/inr-gan.

关键词： Learning systems Interpolation computer vision Image synthesis Superresolution Modulation Solids

来源：评论

学校读者我要写书评

暂无评论

Single Image Dehazing Using Bounded Channel Difference Prior

Single Image Dehazing Using Bounded Channel Difference Prior

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Zhao, Xuan Air To Air Missile Res Inst Luoyang Henan Peoples R China

ISBN: (纸本)9781665448994

The single image dehazing task has made significant progress recently, aiming to recover the contrast and color of the scattered image. Many patch prior based dehazing methods have been explored, and this paper proposes another single image dehazing method by analyzing the prior information of local dehazed patches. With our observation, when the estimated transmission value varies from the ground-truth transmission value to 1, the output value of a metric function decrease correspondingly, which is defined based on the difference maps among three RGB channels of local dehazed patches normalized using global atmospheric light. Under additional bounding, the local transmission value can be estimated accurately. To reduce computation time, the whole image is divided into many small patches, and within each patch, we estimate a transmission value accurately. We further use weighted interpolation and guided filtering to refine the edges and details of the rough transmission map. Finally, we evaluate the proposed method using Fattal's synthetic haze images, SOTS dataset, and a wide variety of real-world haze images. Experiments show that our method outperforms other state-of-the-art dehazing algorithms by a large margin, especially on synthetic noisy haze images.

关键词： Interpolation computer vision Image color analysis Filtering Image edge detection conferences Estimation

来源：评论

学校读者我要写书评

暂无评论

Railroad is not a Train: Saliency as Pseudo-pixel Supervision for Weakly Supervised Semantic Segmentation

Railroad is not a Train: Saliency as Pseudo-pixel Supervisio...

引用

IEEE/CVF conference on computer vision and pattern recognition (CVPR)

作者： Lee, Seungho Lee, Minhyun Lee, Jongwuk Shim, Hyunjung Yonsei Univ Seoul South Korea Sungkyunkwan Univ Seoul South Korea

ISBN: (数字)9781665445092

ISBN: (纸本)9781665445092

Existing studies in weakly-supervised semantic segmentation (WSSS) using image-level weak supervision have several limitations: sparse object coverage, inaccurate object boundaries, and co-occurring pixels from non-target objects. To overcome these challenges, we propose a novel framework, namely Explicit Pseudo-pixel Supervision (EPS), which learns from pixel-level feedback by combining two weak supervisions;the image-level label provides the object identity via the localization map and the saliency map from the off-the-shelf saliency detection model offers rich boundaries. We devise a joint training strategy to fully utilize the complementary relationship between both information. Our method can obtain accurate object boundaries and discard co-occurring pixels, thereby significantly improving the quality of pseudo-masks. Experimental results show that the proposed method remarkably outperforms existing methods by resolving key challenges of WSSS and achieves the new state-of-the-art performance on both PASCAL VOC 2012 and MS COCO 2014 datasets. The code is available at https://***/halbielee/EPS.

关键词： Location awareness Training Image segmentation computer vision Codes Semantics pattern recognition

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 482 483 484 485 486 487 488 489 490 491 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：