检索结果-内蒙古大学图书馆

您好，读者！请登录

内蒙古大学图书馆

首页
概况
党建
资源
服务
科研支持
- 论文收录引用证明
- 科技查新
知识产权
档案馆
帮助

咨询与建议

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

您的常用邮箱：*

您的手机号码：*

问题描述：

当前已输入0个字，您还可以输入200个字

全部搜索
期刊论文
图书
学位论文
标准
纸本馆藏
外文资源发现
数据库导航
超星发现

高级检索

分类表

所选分类

>> <<

限定检索结果

标题

标题
作者
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

作者

作者
标题
主题词
出版物名称
出版社
机构
学科分类号
摘要
ISBN
ISSN
基金资助
索书号

文献类型

12,844 篇 会议
13 篇 期刊文献
2 册 图书

馆藏范围

12,859 篇 电子文献
0 种 纸本馆藏

日期分布

学科分类号

7,573 篇 工学
- 6,863 篇 计算机科学与技术...
- 880 篇 机械工程
- 814 篇 软件工程
- 435 篇 控制科学与工程
- 360 篇 光学工程
- 306 篇 电气工程
- 209 篇 仪器科学与技术
- 124 篇 信息与通信工程
- 91 篇 生物工程
- 62 篇 生物医学工程（可授...
- 39 篇 电子科学与技术（可...
- 34 篇 安全科学与工程
- 26 篇 化学工程与技术
- 21 篇 交通运输工程
- 20 篇 建筑学
- 18 篇 土木工程
2,957 篇 医学
- 2,956 篇 临床医学
- 15 篇 基础医学(可授医学...
- 12 篇 药学(可授医学、理...
700 篇 理学
- 359 篇 物理学
- 225 篇 数学
- 175 篇 系统科学
- 95 篇 统计学（可授理学、...
- 93 篇 生物学
- 22 篇 化学
201 篇 艺术学
- 201 篇 设计学（可授艺术学...
84 篇 管理学
- 59 篇 图书情报与档案管...
- 25 篇 管理科学与工程(可...
- 14 篇 工商管理
23 篇 法学
- 21 篇 社会学
5 篇 农学
4 篇 教育学
2 篇 经济学
1 篇 军事学

主题

6,464 篇 computer vision
2,688 篇 training
2,437 篇 pattern recognit...
1,780 篇 computational mo...
1,522 篇 visualization
1,348 篇 three-dimensiona...
1,091 篇 computer archite...
1,063 篇 semantics
997 篇 benchmark testin...
976 篇 codes
970 篇 conferences
854 篇 feature extracti...
830 篇 cameras
771 篇 task analysis
707 篇 deep learning
646 篇 image segmentati...
611 篇 object detection
595 篇 shape
554 篇 transformers
538 篇 neural networks

机构

132 篇 univ sci & techn...
122 篇 carnegie mellon ...
120 篇 tsinghua univ pe...
114 篇 univ chinese aca...
113 篇 chinese univ hon...
94 篇 tsinghua univers...
91 篇 zhejiang univ pe...
91 篇 swiss fed inst t...
85 篇 peng cheng lab p...
81 篇 university of ch...
80 篇 zhejiang univers...
77 篇 shanghai ai lab ...
77 篇 peng cheng labor...
75 篇 university of sc...
69 篇 shanghai jiao to...
68 篇 shanghai jiao to...
67 篇 alibaba grp peop...
67 篇 stanford univ st...
66 篇 univ hong kong p...
64 篇 sensetime res pe...

作者

77 篇 timofte radu
63 篇 van gool luc
45 篇 zhang lei
36 篇 yang yi
36 篇 luc van gool
34 篇 tao dacheng
31 篇 loy chen change
29 篇 chen chen
28 篇 sun jian
28 篇 qi tian
25 篇 li xin
24 篇 liu yang
24 篇 tian qi
24 篇 ying shan
23 篇 wang xinchao
23 篇 zha zheng-jun
23 篇 boxin shi
21 篇 zhou jie
21 篇 vasconcelos nuno
20 篇 luo ping

语言

12,851 篇 英文
7 篇 其他
1 篇 中文

检索条件"任意字段=IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops"

共 12859 条记录，以下是4791-4800 订阅

全选清除本页清除全部题录导出标记到"检索档案"

详细简洁

排序：

NICE: CVPR 2023 Challenge on Zero-shot Image Captioning

NICE: CVPR 2023 Challenge on Zero-shot Image Captioning

引用

ieee computer Society conference on computer vision and pattern recognition workshops (CVPRW)

作者： Taehoon Kim Pyunghwan Ahn Sangyun Kim Sihaeng Lee Mark Marsden Alessandra Sala Seung Hwan Kim Bohyung Han Kyoung Mu Lee Honglak Lee Kyounghoon Bae Xiangyu Wu Yi Gao Hailiang Zhang Yang Yang Weili Guo Jianfeng Lu Youngtaek Oh Jae Won Cho Dong-Jin Kim In So Kweon Junmo Kim Wooyoung Kang Won Young Jhoo Byungseok Roh Jonghwan Mun Solgil Oh Kenan Emir Ak Gwang-Gook Lee Yan Xu Mingwei Shen Kyomin Hwang Wonsik Shin Kamin Lee Wonhark Park Dongkwan Lee Nojun Kwak Yujin Wang Yimu Wang Tiancheng Gu Xingchang Lv Mingmao Sun

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

In this report, we introduce NICE (New frontiers for zero-shot Image Captioning Evaluation) project 1 and share the results and outcomes of 2023 challenge. This project is designed to challenge the computer vision community to develop robust image captioning models that advance the state-of-the-art both in terms of accuracy and fairness. Through the challenge, the image captioning models were tested using a new evaluation dataset that includes a large variety of visual concepts from many domains. There was no specific training data provided for the challenge, and therefore the challenge entries were required to adapt to new types of image descriptions that had not been seen during training. This report includes information on the newly proposed NICE dataset, evaluation methods, challenge results, and technical details of top-ranking entries. We expect that the outcomes of the challenge will contribute to the improvement of AI models on various vision-language tasks.

关键词： Training Adaptation models Visualization computer vision Computational modeling conferences Training data

来源：评论

学校读者我要写书评

暂无评论

The New Agronomists: Language Models are Experts in Crop Management

The New Agronomists: Language Models are Experts in Crop Man...

引用

ieee computer Society conference on computer vision and pattern recognition workshops (CVPRW)

作者： Jing Wu Zhixin Lai Suiyao Chen Ran Tao Pan Zhao Naira Hovakimyan University of Illinois at Urbana-Champaign Cornell University University of South Florida University of Alabama

ISBN: (数字)9798350365474

ISBN: (纸本)9798350365481

Crop management plays a crucial role in determining crop yield, economic profitability, and environmental sustainability. Despite the availability of management guidelines, optimizing these practices remains a complex and multifaceted challenge. In response, previous studies have explored using reinforcement learning with crop simulators, typically employing simple neural-network-based reinforcement learning (RL) agents. Building on this foundation, this paper introduces a more advanced intelligent crop management system. This system uniquely combines RL, a language model (LM), and crop simulations facilitated by the Decision Support System for Agrotechnology Transfer (DSSAT). We utilize deep RL, specifically a deep Q-network, to train management policies that process numerous state variables from the simulator as observations. A novel aspect of our approach is the conversion of these state variables into more informative language, facilitating the language model’s capacity to understand states and explore optimal management practices. The empirical results reveal that the LM exhibits superior learning capabilities. Through simulation experiments with maize crops in Florida (US) and Zaragoza (Spain), the LM not only achieves state-of-the-art performance under various evaluation metrics but also demonstrates a remarkable improvement of over 49% in economic profit, coupled with reduced environmental impact when compared to baseline methods. Our code is available at https://***/jingwu6/LM_AG.

关键词： Decision support systems Profitability Noise Green products Crops Deep reinforcement learning pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Cross-Domain Gradient Discrepancy Minimization for Unsupervised Domain Adaptation

Cross-Domain Gradient Discrepancy Minimization for Unsupervi...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Du, Zhekai Li, Jingjing Su, Hongzu Zhu, Lei Lu, Ke Univ Elect Sci & Technol China Chengdu Sichuan Peoples R China Shandong Normal Univ Jinan Shandong Peoples R China

ISBN: (纸本)9781665445092

Unsupervised Domain Adaptation (UDA) aims to generalize the knowledge learned from a well-labeled source domain to an unlabled target domain. Recently, adversarial domain adaptation with two distinct classifiers (bi-classifier) has been introduced into UDA which is effective to align distributions between different domains. Previous bi-classifier adversarial learning methods only focus on the similarity between the outputs of two distinct classifiers. However, the similarity of the outputs cannot guarantee the accuracy of target samples, i.e., traget samples may match to wrong categories even if the discrepancy between two classifiers is small. To challenge this issue, in this paper, we propose a cross-domain gradient discrepancy minimization (CGDM) method which explicitly minimizes the discrepancy of gradients generated by source samples and target samples. Specifically, the gradient gives a cue for the semantic information of target samples so it can be used as a good supervision to improve the accuracy of target samples. In order to compute the gradient signal of target smaples, we further obtain target pseudo labels through a clustering-based self-supervised learning. Extensive experiments on three widely used UDA datasets show that our method surpasses many previous state-of-the-arts.

关键词： computer vision Semantics Minimization Adversarial machine learning pattern recognition Reliability

来源：评论

学校读者我要写书评

暂无评论

Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place recognition

Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descript...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Hausler, Stephen Garg, Sourav Xu, Ming Milford, Michael Fischer, Tobias Queensland Univ Technol QUT Ctr Robot Brisbane Qld Australia

ISBN: (纸本)9781665445092

Visual Place recognition is a challenging task for robotics and autonomous systems, which must deal with the twin problems of appearance and viewpoint change in an always changing world. This paper introduces Patch-NetVLAD, which provides a novel formulation for combining the advantages of both local and global descriptor methods by deriving patch-level features from NetVLAD residuals. Unlike the fixed spatial neighborhood regime of existing local keypoint features, our method enables aggregation and matching of deep-learned local features defined over the feature-space grid. We further introduce a multi-scale fusion of patch features that have complementary scales (i.e. patch sizes) via an integral feature space and show that the fused features are highly invariant to both condition (season, structure, and illumination) and viewpoint (translation and rotation) changes. Patch-NetVLAD achieves state-of-the-art visual place recognition results in computationally limited scenarios, validated on a range of challenging real-world datasets, including winning the Facebook Mapillary Visual Place recognition Challenge at ECCV2020. It is also adaptable to user requirements, with a speed-optimised version operating over an order of magnitude faster than the state-of-the-art. By combining superior performance with improved computational efficiency in a configurable framework, Patch-NetVLAD is well suited to enhance both stand-alone place recognition capabilities and the overall performance of SLAM systems.

关键词： Visualization computer vision Simultaneous localization and mapping Social networking (online) Autonomous systems Face recognition Lighting

来源：评论

学校读者我要写书评

暂无评论

Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes

Synthesizing Long-Term 3D Human Motion and Interaction in 3D...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Wang, Jiashun Xu, Huazhe Xu, Jingwei Liu, Sifei Wang, Xiaolong Univ Calif San Diego San Diego CA 92093 USA Univ Calif Berkeley Berkeley CA USA Shanghai Jiao Tong Univ Shanghai Peoples R China NVIDIA Santa Clara CA USA

ISBN: (纸本)9781665445092

Synthesizing 3D human motion plays an important role in many graphics applications as well as understanding human activity. While many efforts have been made on generating realistic and natural human motion, most approaches neglect the importance of modeling human-scene interactions and affordance. On the other hand, affordance reasoning (e.g., standing on the floor or sitting on the chair) has mainly been studied with static human pose and gestures, and it has rarely been addressed with human motion. In this paper, we propose to bridge human motion synthesis and scene affordance reasoning. We present a hierarchical generative framework to synthesize long-term 3D human motion conditioning on the 3D scene structure. Building on this framework, we further enforce multiple geometry constraints between the human mesh and scene point clouds via optimization to improve realistic synthesis. Our experiments show significant improvements over previous approaches on generating natural and physically plausible human motion in a scene.

关键词： Graphics Geometry computer vision Three-dimensional displays Affordances Computational modeling Cognition

来源：评论

学校读者我要写书评

暂无评论

Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction

Greedy Hierarchical Variational Autoencoders for Large-Scale...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Wu, Bohan Nair, Suraj Martin-Martin, Roberto Li Fei-Fei Finn, Chelsea Stanford Univ Stanford CA 94305 USA

ISBN: (纸本)9781665445092

A video prediction model that generalizes to diverse scenes would enable intelligent agents such as robots to perform a variety of tasks via planning with the model. However, while existing video prediction models have produced promising results on small datasets, they suffer from severe underfitting when trained on large and diverse datasets. To address this underfitting challenge, we first observe that the ability to train larger video prediction models is often bottlenecked by the memory constraints of GPUs or TPUs. In parallel, deep hierarchical latent variable models can produce higher quality predictions by capturing the multi-level stochasticity of future observations, but end-to-end optimization of such models is notably difficult. Our key insight is that greedy and modular optimization of hierarchical autoencoders can simultaneously address both the memory constraints and the optimization challenges of large-scale video prediction. We introduce Greedy Hierarchical Variational Autoencoders (GHVAEs), a method that learns highfidelity video predictions by greedily training each level of a hierarchical autoencoder. In comparison to state-of-the-art models, GHVAEs provide 17-55% gains in prediction performance on four video datasets, a 35-40% higher success rate on real robot tasks, and can improve performance monotonically by simply adding more modules.

关键词： Training Visualization Memory management Stacking Predictive models Planning pattern recognition

来源：评论

学校读者我要写书评

暂无评论

Learnable Motion Coherence for Correspondence Pruning

Learnable Motion Coherence for Correspondence Pruning

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Liu, Yuan Liu, Lingjie Lin, Cheng Dong, Zhen Wang, Wenping Univ Hong Kong Hong Kong Peoples R China Saarland Informat Campus MPI Informat Saarbrucken Germany Wuhan Univ Wuhan Peoples R China Texas A&M Univ College Stn TX 77843 USA

ISBN: (纸本)9781665445092

Motion coherence is an important clue for distinguishing true correspondences from false ones. Modeling motion coherence on sparse putative correspondences is challenging due to their sparsity and uneven distributions. Existing works on motion coherence are sensitive to parameter settings and have difficulty in dealing with complex motion patterns. In this paper, we introduce a network called Laplacian Motion Coherence Network (LMCNet) to learn motion coherence property for correspondence pruning. We propose a novel formulation of fitting coherent motions with a smooth function on a graph of correspondences and show that this formulation allows a closed-form solution by graph Laplacian. This closed-form solution enables us to design a differentiable layer in a learning framework to capture global motion coherence from putative correspondences. The global motion coherence is further combined with local coherence extracted by another local layer to robustly detect inlier correspondences. Experiments demonstrate that LMCNet has superior performances to the state of the art in relative camera pose estimation and correspondences pruning of dynamic scenes(1).

关键词： computer vision Laplace equations Closed-form solutions Pose estimation Fitting Dynamics Neural networks

来源：评论

学校读者我要写书评

暂无评论

DANNet: A One-Stage Domain Adaptation Network for Unsupervised Nighttime Semantic Segmentation

DANNet: A One-Stage Domain Adaptation Network for Unsupervis...

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Wu, Xinyi Wu, Zhenyao Guo, Hao Ju, Lili Wang, Song Univ South Carolina Columbia SC 29208 USA Farsee2 Technol Ltd Shenzhen Guangdong Peoples R China

ISBN: (纸本)9781665445092

Semantic segmentation of nighttime images plays an equally important role as that of daytime images in autonomous driving, but the former is much more challenging due to poor illuminations and arduous human annotations. In this paper, we propose a novel domain adaptation network (DANNet) for nighttime semantic segmentation without using labeled nighttime image data. It employs an adversarial training with a labeled daytime dataset and an unlabeled dataset that contains coarsely aligned day-night image pairs. Specifically, for the unlabeled day-night image pairs, we use the pixel-level predictions of static object categories on a daytime image as a pseudo supervision to segment its counterpart nighttime image. We further design a re-weighting strategy to handle the inaccuracy caused by misalignment between day-night image pairs and wrong predictions of daytime images, as well as boost the prediction accuracy of small objects. The proposed DANNet is the first one-stage adaptation framework for nighttime semantic segmentation, which does not train additional day-night image transfer models as a separate pre-processing stage. Extensive experiments on Dark Zurich and Nighttime Driving datasets show that our method achieves state-of-the-art performance for nighttime semantic segmentation.

关键词： Training Bridges Image segmentation computer vision Annotations Semantics Neural networks

来源：评论

学校读者我要写书评

暂无评论

Group Collaborative Learning for Co-Salient Object Detection

Group Collaborative Learning for Co-Salient Object Detection

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Fan, Qi Fan, Deng-Ping Fu, Huazhu Tang, Chi-Keung Shao, Ling Tai, Yu-Wing HKUST Hong Kong Peoples R China Incept Inst Artificial Intelligence Abu Dhabi U Arab Emirates Kuaishou Technol Beijing Peoples R China

ISBN: (纸本)9781665445092

We present a novel group collaborative learning framework (GCoNet) capable of detecting co-salient objects in real time (16ms), by simultaneously mining consensus representations at group level based on the two necessary criteria: 1) intra-group compactness to better formulate the consistency among co-salient objects by capturing their inherent shared attributes using our novel group affinity module;2) inter-group separability to effectively suppress the influence of noisy objects on the output by introducing our new group collaborating module conditioning the inconsistent consensus. To learn a better embedding space without extra computational overhead, we explicitly employ auxiliary classification supervision. Extensive experiments on three challenging benchmarks, i.e., CoCA, CoSOD3k, and Cosal2015, demonstrate that our simple GCoNet outperforms 10 cutting-edge models and achieves the new state-of-the-art. We demonstrate this paper's new technical contributions on a number of important downstream computer vision applications including content aware co-segmentation, co-localization based automatic thumbnails, etc.

关键词： computer vision Codes Computational modeling Semantics Object detection Benchmark testing Collaborative work

来源：评论

学校读者我要写书评

暂无评论

Orthogonal Over-Parameterized Training

Orthogonal Over-Parameterized Training

引用

ieee/cvf conference on computer vision and pattern recognition (CVPR)

作者： Liu, Weiyang Lin, Rongmei Liu, Zhen Rehg, James M. Paull, Liam Xiong, Li Song, Le Weller, Adrian Univ Cambridge Cambridge England Max Planck Inst Intelligent Syst Stuttgart Germany Emory Univ Atlanta GA 30322 USA Univ Montreal Mila Montreal PQ Canada Georgia Inst Technol Atlanta GA 30332 USA Alan Turing Inst London England

ISBN: (纸本)9781665445092

The inductive bias of a neural network is largely determined by the architecture and the training algorithm. To achieve good generalization, how to effectively train a neural network is of great importance. We propose a novel orthogonal over-parameterized training (OPT) framework that can provably minimize the hyperspherical energy which characterizes the diversity of neurons on a hypersphere. By maintaining the minimum hyperspherical energy during training, OPT can greatly improve the empirical generalization. Specifically, OPT fixes the randomly initialized weights of the neurons and learns an orthogonal transformation that applies to these neurons. We consider multiple ways to learn such an orthogonal transformation, including unrolling orthogonalization algorithms, applying orthogonal parameterization, and designing orthogonality-preserving gradient descent. For better scalability, we propose the stochastic OPT which performs orthogonal transformation stochastically for partial dimensions of neurons. Interestingly, OPT reveals that learning a proper coordinate system for neurons is crucial to generalization. We provide some insights on why OPT yields better generalization. Extensive experiments validate the superiority of OPT over the standard training.

关键词： Training computer vision Scalability Neurons Optimized production technology computer architecture pattern recognition

来源：评论

学校读者我要写书评

暂无评论

没有更多数据了...

全选清除本页清除全部题录导出标记到“检索档案”

共500页 << < 476 477 478 479 480 481 482 483 484 485 > >>

检索报告对象比较合并检索0

隐藏清空

合并搜索

回到顶部

执行限定条件

内容：

评分：

请选择保存的检索档案：

请选择收藏分类：

订阅名称：

通借通还

温馨提示：

图书名称：

借书校区：

取书校区：

手机号码：

邮箱地址：

一卡通帐号：

电话和邮箱必须正确填写，我们会与您联系确认。

联系人：

所在院系：

联系邮箱：

联系电话：

内蒙古自治区呼和浩特市赛罕区大学西街235号邮编: 010021

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：