ISBN (Print): 9781665445092
Point clouds produced by 3D scanning are often sparse, non-uniform, and noisy. Recent upsampling approaches aim to generate a dense point set while achieving both distribution uniformity and proximity-to-surface, and possibly amending small holes, all in a single network. After revisiting the task, we propose to disentangle it based on its multi-objective nature and formulate two cascaded sub-networks, a dense generator and a spatial refiner. The dense generator infers a coarse but dense output that roughly describes the underlying surface, while the spatial refiner fine-tunes the coarse output by adjusting the location of each point. Specifically, we design a pair of local and global refinement units in the spatial refiner to evolve a coarse feature map. Also in the spatial refiner, we regress a per-point offset vector to further adjust the coarse outputs at a fine scale. Extensive qualitative and quantitative results on both synthetic and real-scanned datasets demonstrate the superiority of our method over state-of-the-art approaches.
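As a rough illustration of the two-stage design described above, the sketch below cascades a dense generator and an offset-regressing spatial refiner in PyTorch. All module structures, feature sizes, and the upsampling ratio are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class DenseGenerator(nn.Module):
    """Expands N input points into r*N coarse points (illustrative stand-in)."""
    def __init__(self, ratio=4, feat_dim=64):
        super().__init__()
        self.ratio = ratio
        self.encode = nn.Sequential(nn.Conv1d(3, feat_dim, 1), nn.ReLU(),
                                    nn.Conv1d(feat_dim, feat_dim, 1))
        self.decode = nn.Conv1d(feat_dim, 3, 1)

    def forward(self, xyz):                          # xyz: (B, 3, N)
        f = self.encode(xyz)                         # per-point features
        f = f.repeat_interleave(self.ratio, dim=2)   # duplicate r times: (B, C, r*N)
        return self.decode(f), f                     # coarse points + feature map

class SpatialRefiner(nn.Module):
    """Regresses a per-point offset vector that fine-tunes the coarse output."""
    def __init__(self, feat_dim=64):
        super().__init__()
        self.offset = nn.Sequential(nn.Conv1d(feat_dim + 3, feat_dim, 1), nn.ReLU(),
                                    nn.Conv1d(feat_dim, 3, 1))

    def forward(self, coarse_xyz, feats):
        delta = self.offset(torch.cat([coarse_xyz, feats], dim=1))
        return coarse_xyz + delta                    # adjust each point's location

gen, refiner = DenseGenerator(), SpatialRefiner()
sparse = torch.randn(2, 3, 256)                      # a batch of sparse inputs
coarse, feats = gen(sparse)                          # rough but dense surface
dense = refiner(coarse, feats)                       # (2, 3, 1024) refined points
```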
ISBN (Print): 9781665445092
There are rich synchronized audio and visual events in our daily life. Inside these events, audio scenes are associated with corresponding visual objects; meanwhile, sounding objects can indicate and help to separate their individual sounds in the audio track. Based on this observation, in this paper we propose a cyclic co-learning (CCoL) paradigm that can jointly learn sounding object visual grounding and audio-visual sound separation in a unified framework. Concretely, we leverage grounded object-sound relations to improve the results of sound separation. Meanwhile, benefiting from discriminative information in the separated sounds, we improve training example sampling for sounding object grounding, which builds a co-learning cycle between the two tasks and makes them mutually beneficial. Extensive experiments show that the proposed framework outperforms recent approaches on both tasks, and that the two tasks benefit from each other through cyclic co-learning.
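The toy sketch below illustrates one way the co-learning cycle could be wired: per-object separation error is used to weight which grounded objects are trusted as positives for the grounding loss. The modules, shapes, and the softmax-based trust weighting are assumptions made for illustration; the paper's actual sampling strategy may differ.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Toy stand-ins: a grounding head scoring K candidate objects, and a separator
# producing one spectrogram mask per grounded object.
B, K, D, T, Fq = 2, 4, 32, 50, 64                # batch, objects, feat dim, time, freq
grounder = nn.Linear(D, 1)                       # object feature -> sounding score
separator = nn.Linear(D, T * Fq)                 # object feature -> separation mask

regions = torch.randn(B, K, D)                   # visual object features
mixture = torch.randn(B, T, Fq)                  # mixed audio spectrogram
target = torch.randn(B, K, T, Fq)                # toy per-object source targets

scores = grounder(regions).squeeze(-1)           # (B, K) grounding confidence
masks = separator(regions).view(B, K, T, Fq).sigmoid()
est = masks * mixture.unsqueeze(1)               # separated per-object sources

# The cycle: objects whose sounds separate well become trusted positives for
# the grounding loss, while grounding supplies the separator's conditioning.
per_obj_err = F.mse_loss(est, target, reduction='none').mean(dim=(2, 3))  # (B, K)
trust = torch.softmax(-per_obj_err.detach(), dim=1)
ground_loss = -(trust * F.log_softmax(scores, dim=1)).sum(dim=1).mean()
loss = per_obj_err.mean() + ground_loss
loss.backward()
```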
ISBN (Print): 9781665445092
In applications such as optical see-through and projector augmented reality, producing images amounts to solving non-negative image generation, where one can only add light to an existing image. Most image generation methods, however, are ill-suited to this problem setting, as they make the assumption that one can assign arbitrary color to each pixel. In fact, naive application of existing methods fails even in simple domains such as MNIST digits, since one cannot create darker pixels by adding light. We know, however, that the human visual system can be fooled by optical illusions involving certain spatial configurations of brightness and contrast. Our key insight is that one can leverage this behavior to produce high quality images with negligible artifacts. For example, we can create the illusion of darker patches by brightening surrounding pixels. We propose a novel optimization procedure to produce images that satisfy both semantic and non-negativity constraints. Our approach can incorporate existing state-of-the-art methods, and exhibits strong performance in a variety of tasks including image-to-image translation and style transfer.
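A minimal sketch of the constrained optimization described above: the added light is reparameterized through a softplus so it can never be negative, and a plain MSE objective stands in for the paper's semantic constraints.

```python
import torch
import torch.nn.functional as F

base = torch.rand(1, 3, 64, 64)                  # existing image we may only brighten
target = torch.rand(1, 3, 64, 64)                # desired appearance
raw = torch.full_like(base, -3.0, requires_grad=True)
opt = torch.optim.Adam([raw], lr=0.05)

for step in range(200):
    light = F.softplus(raw)                      # reparameterize: added light >= 0
    out = (base + light).clamp(max=1.0)          # cannot exceed full brightness
    loss = F.mse_loss(out, target)               # placeholder semantic objective
    opt.zero_grad()
    loss.backward()
    opt.step()
# Dark target pixels stay unreachable; the optimizer can only compensate by
# shaping the light added elsewhere, which is where the illusion-based insight enters.
```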
ISBN (Print): 9781665445092
Large scale image classification datasets often contain noisy labels. We take a principled probabilistic approach to modelling input-dependent, also known as heteroscedastic, label noise in these datasets. We place a multivariate Normal distributed latent variable on the final hidden layer of a neural network classifier. The covariance matrix of this latent variable models the aleatoric uncertainty due to label noise. We demonstrate that the learned covariance structure captures known sources of label noise between semantically similar and co-occurring classes. Compared to standard neural network training and other baselines, we show significantly improved accuracy on ImageNet ILSVRC 2012 79.3% (+2.6%), ImageNet-21k 47.0% (+1.1%) and JFT 64.7% (+1.6%). We set a new state-of-the-art result on WebVision 1.0 with 76.6% top-1 accuracy. These datasets range from over 1M to over 300M training examples and from 1k classes to more than 21k classes. Our method is simple to use, and we provide an implementation that is a drop-in replacement for the final fully-connected layer in a deep classifier.
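A hedged sketch of such a heteroscedastic final layer: an input-dependent low-rank-plus-diagonal covariance on the logits, with Monte Carlo samples averaged through the softmax. The rank, sample count, and parameterization here are illustrative choices, not the authors' released implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HeteroscedasticHead(nn.Module):
    """Final-layer replacement: a Normal latent on the logits whose
    input-dependent low-rank-plus-diagonal covariance models label noise."""
    def __init__(self, in_dim, num_classes, rank=6, samples=16):
        super().__init__()
        self.mean = nn.Linear(in_dim, num_classes)
        self.diag = nn.Linear(in_dim, num_classes)          # diagonal scales
        self.lowrank = nn.Linear(in_dim, num_classes * rank)
        self.rank, self.samples, self.C = rank, samples, num_classes

    def forward(self, h):                                   # h: (B, in_dim)
        mu = self.mean(h)
        d = F.softplus(self.diag(h))                        # (B, C), positive
        V = self.lowrank(h).view(-1, self.C, self.rank)     # (B, C, R)
        eps_d = torch.randn(self.samples, *d.shape, device=h.device)
        eps_v = torch.randn(self.samples, h.shape[0], self.rank, device=h.device)
        logits = mu + d * eps_d + torch.einsum('bcr,sbr->sbc', V, eps_v)
        return logits.softmax(dim=-1).mean(dim=0)           # MC-averaged probs

head = HeteroscedasticHead(in_dim=512, num_classes=10)
probs = head(torch.randn(4, 512))                           # (4, 10)
loss = F.nll_loss(probs.clamp_min(1e-8).log(), torch.tensor([1, 2, 3, 4]))
```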
ISBN (Print): 9781665445092
Although deep neural networks (DNNs) have achieved tremendous performance on diverse vision challenges, they are surprisingly susceptible to adversarial examples, which are crafted by intentionally perturbing benign samples in a human-imperceptible fashion. This poses security concerns for the deployment of DNNs in practice, particularly in safety- and security-sensitive domains. To investigate the robustness of DNNs, transfer-based attacks have recently attracted growing interest due to their high practical applicability: attackers craft adversarial samples with local models and employ the resultant samples to attack a remote black-box model. However, existing transfer-based attacks frequently suffer from low success rates due to overfitting to the adopted local model. To boost the transferability of adversarial samples, we propose to improve the robustness of synthesized adversarial samples via adversarial transformations. Specifically, we employ an adversarial transformation network to model the most harmful distortions that can destroy adversarial noise, and we require the synthesized adversarial samples to become resistant to such adversarial transformations. Extensive experiments on the ImageNet benchmark demonstrate the superiority of our method over state-of-the-art baselines in attacking both undefended and defended models.
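The min-max idea might be sketched as follows: a small transformation network T is trained to destroy the adversarial noise, while the perturbation is simultaneously optimized to fool a surrogate model both with and without T applied. The surrogate, transformation network, budget, and step sizes are all toy assumptions.

```python
import torch
import torch.nn as nn

model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))  # local surrogate
T = nn.Conv2d(3, 3, 3, padding=1)                                # transformation net
x, y = torch.rand(1, 3, 32, 32), torch.tensor([3])
delta = torch.zeros_like(x, requires_grad=True)
opt_T = torch.optim.Adam(T.parameters(), lr=1e-3)
ce = nn.CrossEntropyLoss()

for step in range(50):
    x_adv = (x + delta).clamp(0, 1)
    # Inner step: train T to destroy the adversarial noise (restore the label).
    opt_T.zero_grad()
    ce(model(T(x_adv.detach())), y).backward()
    opt_T.step()
    # Outer step: delta must fool the model both with and without T applied.
    loss = -ce(model(x_adv), y) - ce(model(T(x_adv)), y)
    loss.backward()
    with torch.no_grad():
        delta -= 0.01 * delta.grad.sign()     # ascend the classification loss
        delta.clamp_(-8 / 255, 8 / 255)       # keep perturbation imperceptible
        delta.grad.zero_()
```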
ISBN (Digital): 9798350365474
ISBN (Print): 9798350365481
In real-world scenarios, due to a series of image degradations, obtaining high-quality, clear content photos is challenging. While significant progress has been made in synthesizing high-quality images, previous methods for image restoration and enhancement often overlooked the characteristics of different degradations. They applied the same structure to address various types of degradation, resulting in less-than-ideal restoration outcomes. Inspired by the notion that high/low frequency information is applicable to different degradations, we introduce HLNet, a Bracketing Image Restoration and Enhancement method based on high-low frequency decomposition. Specifically, we employ two modules for feature extraction: shared weight modules and non-shared weight modules. In the shared weight modules, we use SCConv to extract common features from different degradations. In the non-shared weight modules, we introduce the High-Low Frequency Decomposition Block (HLFDB), which employs different methods to handle high-low frequency information, enabling the model to address different degradations more effectively. Compared to other networks, our method takes into account the characteristics of different degradations, thus achieving higher-quality image restoration.
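One plausible reading of the high-low frequency decomposition is sketched below: a blurred copy of the feature map carries the low frequencies, the residual carries the high frequencies, and each band is processed by its own branch. This is an illustrative stand-in, not the paper's exact HLFDB, and the SCConv-based shared modules are omitted.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class HighLowFreqBlock(nn.Module):
    """A blurred copy carries low frequencies, the residual carries high
    frequencies, and each band is handled by its own branch."""
    def __init__(self, ch):
        super().__init__()
        self.low_branch = nn.Conv2d(ch, ch, 3, padding=1)   # smooth content
        self.high_branch = nn.Conv2d(ch, ch, 3, padding=1)  # edges and texture

    def forward(self, x):
        low = F.avg_pool2d(x, 4)                            # crude low-pass
        low = F.interpolate(low, size=x.shape[-2:],
                            mode='bilinear', align_corners=False)
        high = x - low                                      # residual = high freq
        return self.low_branch(low) + self.high_branch(high) + x

out = HighLowFreqBlock(ch=16)(torch.randn(1, 16, 64, 64))   # shape preserved
```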
ISBN (Print): 9781665445092
Recently, some methods have focused on learning local relations among parts of pedestrian images for person re-identification (Re-ID), as this offers powerful representation capabilities. However, they only capture the intra-local relation among parts within a single pedestrian image and ignore the inter-local relation among parts from different images, which results in incomplete local relation information. In this paper, we propose a novel deep graph model named Heterogeneous Local Graph Attention Networks (HLGAT) to simultaneously model the inter-local and intra-local relations in a completed local graph. Specifically, we first construct the completed local graph using local features, and we resort to the attention mechanism to aggregate the local features when learning the inter-local and intra-local relations, so as to emphasize the importance of different local features. For the inter-local relation, we propose an attention regularization loss that constrains the attention weights based on the identities of local features, in order to describe the inter-local relation accurately. For the intra-local relation, we propose to inject contextual information into the attention weights to account for structure information. Extensive experiments on Market-1501, CUHK03, DukeMTMC-reID and MSMT17 demonstrate that the proposed HLGAT outperforms state-of-the-art methods.
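A toy sketch of attention over a completed local graph: part features from several images form one node set, so a single attention pass covers both intra-local edges (parts within an image) and inter-local edges (parts across images), and an identity-based regularizer in the spirit of the attention regularization loss can be added on top. All shapes and the regularizer's exact form are guesses for illustration.

```python
import torch
import torch.nn as nn

B, P, D = 3, 6, 32                          # images, parts per image, feature dim
parts = torch.randn(B * P, D)               # all part nodes share one graph
attn = nn.MultiheadAttention(embed_dim=D, num_heads=4, batch_first=True)
nodes = parts.unsqueeze(0)                  # the whole graph as one sequence
out, weights = attn(nodes, nodes, nodes)    # weights: (1, B*P, B*P), covering
                                            # intra-image and cross-image edges

# Identity-based regularizer: pull attention toward parts sharing an identity.
ids = torch.arange(B).repeat_interleave(P)              # identity of each node
same_id = (ids[:, None] == ids[None, :]).float()
target = same_id / same_id.sum(dim=1, keepdim=True)     # uniform over same-ID parts
attn_reg = ((weights.squeeze(0) - target) ** 2).mean()
```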
Automatic floor plan analysis, rooted in computer vision and pattern recognition, aims to extract critical insights from architectural and interior design drawings, with applications in architecture, real estate, and ...
Many advancements of mobile cameras aim to reach the visual quality of professional DSLR cameras. Great progress has been shown over the last years in optimizing the sharp regions of an image and in creating virtual portrait effects with artificially blurred backgrounds. Bokeh is the aesthetic quality of the blur in out-of-focus areas of an image. It is a popular technique among professional photographers, and for this reason, a new goal in computational photography is to optimize the Bokeh effect. This paper introduces EBokehNet, an efficient state-of-the-art solution for Bokeh effect transformation and rendering. Our method can render Bokeh from an all-in-focus image, or transform the Bokeh of one lens to the effect of another lens, without harming the sharp foreground regions of the image. Moreover, we can control the shape and strength of the effect by feeding the lens properties, i.e. type (Sony or Canon) and aperture, into the neural network as an additional input. Our method is a winning solution of the NTIRE 2023 Lens-to-Lens Bokeh Effect Transformation Challenge and state-of-the-art on the EBB benchmark.
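The lens-conditioning idea, feeding lens type and aperture into the network as extra inputs, might look roughly like this sketch; the embedding-plus-broadcast conditioning and the tiny residual network are invented for illustration and are not EBokehNet's actual architecture.

```python
import torch
import torch.nn as nn

class LensConditionedBokeh(nn.Module):
    """Renders a residual bokeh effect conditioned on lens type and aperture."""
    def __init__(self, n_lens_types=2, ch=16):
        super().__init__()
        self.lens_embed = nn.Embedding(n_lens_types, ch)
        self.aperture_proj = nn.Linear(1, ch)
        self.net = nn.Sequential(nn.Conv2d(3 + ch, ch, 3, padding=1), nn.ReLU(),
                                 nn.Conv2d(ch, 3, 3, padding=1))

    def forward(self, img, lens_type, aperture):
        cond = self.lens_embed(lens_type) + self.aperture_proj(aperture)  # (B, ch)
        cond = cond[:, :, None, None].expand(-1, -1, *img.shape[-2:])
        return img + self.net(torch.cat([img, cond], dim=1))  # residual rendering

net = LensConditionedBokeh()
out = net(torch.rand(1, 3, 64, 64),
          lens_type=torch.tensor([0]),      # e.g. 0 = Sony, 1 = Canon
          aperture=torch.tensor([[1.8]]))   # f-number as a scalar input
```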
ISBN (Digital): 9798350353006
ISBN (Print): 9798350353013
We propose a novel self-supervised embedding to learn how actions sound from narrated in-the-wild egocentric videos. Whereas existing methods rely on curated data with known audio-visual correspondence, our multimodal contrastive-consensus coding (MC3) embedding reinforces the associations between audio, language, and vision when all modality pairs agree, while diminishing those associations when any one pair does not. We show our approach can successfully discover how the long tail of human actions sound from egocentric video, outperforming an array of recent multimodal embedding techniques on two datasets (Ego4D and EPIC-Sounds) and multiple cross-modal tasks.
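A speculative sketch of the consensus gating: pairwise InfoNCE losses between audio, language, and vision embeddings are scaled by how often all three pairwise agreements are positive. The gating rule and threshold are illustrative guesses, not the published MC3 objective.

```python
import torch
import torch.nn.functional as F

B, D = 8, 128                                     # clips per batch, embedding dim
audio = F.normalize(torch.randn(B, D, requires_grad=True), dim=1)
text = F.normalize(torch.randn(B, D, requires_grad=True), dim=1)
video = F.normalize(torch.randn(B, D, requires_grad=True), dim=1)

def nce(a, b, tau=0.07):
    """Standard InfoNCE between two aligned batches of embeddings."""
    logits = a @ b.t() / tau
    return F.cross_entropy(logits, torch.arange(a.shape[0]))

# Per-sample agreement of each modality pair (cosine of matched embeddings).
agree = torch.stack([(audio * text).sum(1),
                     (audio * video).sum(1),
                     (text * video).sum(1)])       # (3, B)
# Consensus gate: only samples where every pair agrees reinforce the embedding.
consensus = (agree.min(dim=0).values > 0.0).float().mean()
loss = consensus * (nce(audio, text) + nce(audio, video) + nce(text, video))
loss.backward()
```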