ISBN (Print): 9798350353006
While recent supervised methods for reference-based object counting continue to improve the performance on benchmark datasets, they have to rely on small datasets due to the cost associated with manually annotating dozens of objects in images. We propose UnCounTR, a model that can learn this task without requiring any manual annotations. To this end, we construct "Self-Collages", images with various pasted objects as training samples, that provide a rich learning signal covering arbitrary object types and counts. Our method builds on existing unsupervised representations and segmentation techniques to successfully demonstrate for the first time the ability of reference-based counting without manual supervision. Our experiments show that our method not only outperforms simple baselines and generic models such as FasterRCNN and DETR, but also matches the performance of supervised counting models in some domains.
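The "Self-Collages" idea lends itself to a compact sketch: paste a random number of object crops onto a background and use the paste count as a free counting label. The snippet below is an illustrative reading of that recipe, not the authors' code; in the paper, crops and reference exemplars come from unsupervised segmentation rather than being given.

```python
import random
from PIL import Image

def make_self_collage(background: Image.Image, object_crops: list[Image.Image],
                      max_count: int = 20) -> tuple[Image.Image, int]:
    """Paste a random number of copies of one object crop onto a background.

    The paste count serves as a free counting label, so no manual annotation
    is needed. Illustrative sketch: assumes crops are smaller than the
    background and already masked (RGBA) where applicable.
    """
    collage = background.copy()
    count = random.randint(1, max_count)
    obj = random.choice(object_crops)  # one object type per collage
    for _ in range(count):
        w, h = obj.size
        x = random.randint(0, collage.width - w)
        y = random.randint(0, collage.height - h)
        collage.paste(obj, (x, y), obj if obj.mode == "RGBA" else None)
    return collage, count
```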
ISBN (Print): 9798350353013; 9798350353006
We introduce Florence-2, a novel vision foundation model with a unified, prompt-based representation for various computer vision and vision-language tasks. While existing large vision models excel in transfer learning, they struggle to perform diverse tasks with simple instructions, a capability that implies handling the complexity of various spatial hierarchies and semantic granularities. Florence-2 was designed to take text prompts as task instructions and generate desirable results in text form, whether it be captioning, object detection, grounding, or segmentation. This multi-task learning setup demands large-scale, high-quality annotated data. To this end, we co-developed FLD-5B, which consists of 5.4 billion comprehensive visual annotations on 126 million images, collected using an iterative strategy of automated image annotation and model refinement. We adopted a sequence-to-sequence structure to train Florence-2 to perform versatile and comprehensive vision tasks. Extensive evaluations on numerous tasks demonstrated Florence-2 to be a strong vision foundation model contender with unprecedented zero-shot and fine-tuning capabilities.
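The unified text-prompt interface can be exercised directly if one assumes the publicly released Hugging Face checkpoint (microsoft/Florence-2-base), which is outside the abstract itself; the usage below follows that model card's pattern and may change over time.

```python
from PIL import Image
from transformers import AutoProcessor, AutoModelForCausalLM

# Assumes the released Hugging Face checkpoint. Swapping the task token
# (<CAPTION>, <OD>, <CAPTION_TO_PHRASE_GROUNDING>, ...) switches the task;
# everything is answered as text and parsed back into structured outputs.
processor = AutoProcessor.from_pretrained("microsoft/Florence-2-base",
                                          trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("microsoft/Florence-2-base",
                                             trust_remote_code=True)

image = Image.open("example.jpg")
task = "<OD>"  # object detection via the same seq2seq interface
inputs = processor(text=task, images=image, return_tensors="pt")
ids = model.generate(input_ids=inputs["input_ids"],
                     pixel_values=inputs["pixel_values"], max_new_tokens=1024)
text = processor.batch_decode(ids, skip_special_tokens=False)[0]
print(processor.post_process_generation(text, task=task,
                                        image_size=(image.width, image.height)))
```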
ISBN (Print): 9798350353006
Active recognition enables robots to intelligently explore novel observations, thereby acquiring more information while circumventing undesired viewing conditions. Recent approaches favor learning policies from simulated or collected data, wherein appropriate actions are more frequently selected when the recognition is accurate. However, most recognition modules are developed under the closed-world assumption, which makes them ill-equipped to handle unexpected inputs, such as the absence of the target object in the current observation. To address this issue, we propose treating active recognition as a sequential evidence-gathering process, providing step-by-step uncertainty quantification and reliable prediction under evidence combination theory. Additionally, the reward function developed in this paper effectively characterizes the merit of actions when operating in open-world environments. To evaluate the performance, we collect a dataset from an indoor simulator, encompassing various recognition challenges such as distance, occlusion levels, and visibility. Through a series of experiments on recognition and robustness analysis, we demonstrate the necessity of introducing uncertainties to active recognition and the superior performance of the proposed method.
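The evidence-combination machinery the abstract invokes is, in its classical form, Dempster's rule: belief masses from successive views are fused, and mass placed on the full hypothesis set models "unknown", which is what lets the agent flag open-world inputs. The sketch below shows only the classical rule; the paper's exact evidential formulation may differ.

```python
from itertools import product

def dempster_combine(m1: dict[frozenset, float],
                     m2: dict[frozenset, float]) -> dict[frozenset, float]:
    """Dempster's rule of combination for two mass functions.

    Keys are frozensets of hypotheses; values are belief masses summing to 1.
    Mass assigned to the full frame represents "unknown". Classical rule
    only; sketch of the evidence-combination idea, not the paper's model.
    """
    combined: dict[frozenset, float] = {}
    conflict = 0.0
    for (b, mb), (c, mc) in product(m1.items(), m2.items()):
        inter = b & c
        if inter:
            combined[inter] = combined.get(inter, 0.0) + mb * mc
        else:
            conflict += mb * mc  # contradictory evidence is renormalized away
    norm = 1.0 - conflict
    return {a: v / norm for a, v in combined.items()}

# Two views give evidence over {cup, bowl}; mass on the full set = "unknown".
theta = frozenset({"cup", "bowl"})
view1 = {frozenset({"cup"}): 0.6, theta: 0.4}
view2 = {frozenset({"cup"}): 0.5, frozenset({"bowl"}): 0.2, theta: 0.3}
print(dempster_combine(view1, view2))
```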
ISBN (Print): 9798350353006
Leveraging few-shot datasets in prompt learning for Vision-Language Models eliminates the need for manual prompt engineering while highlighting the necessity of accurate annotations for the labels. However, high-level or complex label noise challenges prompt learning for Vision-Language Models. To address this issue, we propose a new framework for improving its robustness. Specifically, we introduce Joint Adaptive Partitioning for Label Refurbishment (JoAPR), a structured framework encompassing two key steps. 1) Data Partitioning, where we differentiate between clean and noisy data using joint adaptive thresholds. 2) Label Refurbishment, where we correct the labels based on the partition outcomes before retraining the network. Our comprehensive experiments confirm that JoAPR substantially enhances the robustness of prompt learning for Vision-Language Models against label noise, offering a promising direction for future research.
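The two steps can be pictured with a common stand-in for the partitioning: fit a two-component Gaussian mixture to per-sample losses, treat the low-loss component as clean, and refurbish the rest with the model's own predictions. JoAPR's actual joint adaptive thresholds differ; this is a hedged sketch of the workflow with illustrative names throughout.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def partition_and_refurbish(losses, labels, pred_probs, clean_thresh=0.5):
    """Split samples into clean/noisy by loss, then refurbish noisy labels.

    Stand-in for JoAPR's joint adaptive partitioning: a two-component GMM
    on per-sample losses, where the low-loss component is treated as clean.
    Illustrative sketch, not the paper's implementation.
    """
    losses = np.asarray(losses, dtype=float).reshape(-1, 1)
    labels = np.asarray(labels)
    pred_probs = np.asarray(pred_probs)
    gmm = GaussianMixture(n_components=2, random_state=0).fit(losses)
    clean_comp = int(np.argmin(gmm.means_.ravel()))   # low loss = likely clean
    p_clean = gmm.predict_proba(losses)[:, clean_comp]
    is_clean = p_clean > clean_thresh
    # Refurbish: keep labels judged clean, replace the rest with the model's
    # current prediction before retraining.
    refurbished = np.where(is_clean, labels, pred_probs.argmax(axis=1))
    return is_clean, refurbished
```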
ISBN (Print): 9798350353006
We present InstructDiffusion, a unified and generic framework for aligning computer vision tasks with human instructions. Unlike existing approaches that integrate prior knowledge and pre-define the output space (e.g., categories and coordinates) for each vision task, we cast diverse vision tasks into a human-intuitive image-manipulation process whose output space is a flexible and interactive pixel space. Concretely, the model is built upon the diffusion process and is trained to predict pixels according to user instructions, such as encircling the man's left shoulder in red or applying a blue mask to the left car. InstructDiffusion can handle a variety of vision tasks, including understanding tasks (such as segmentation and keypoint detection) and generative tasks (such as editing and enhancement), and outperforms prior methods on novel datasets. This represents a solid step towards a generalist modeling interface for vision tasks, advancing artificial general intelligence in the field of computer vision.
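At training time, "predict pixels according to user instructions" amounts to instruction-conditioned denoising. The step below is a generic sketch under standard DDPM-style noising, with a hypothetical `denoiser` conditioned on the source image by channel-concatenation and on an instruction embedding via cross-attention; the paper's actual recipe differs in details.

```python
import torch
import torch.nn.functional as F

def diffusion_instruction_step(denoiser, src_img, tgt_img, text_emb, t,
                               alpha_bars):
    """One instruction-conditioned diffusion training step (sketch).

    `denoiser` is a hypothetical eps-prediction UNet; `alpha_bars` is a 1-D
    tensor of cumulative noise-schedule products, `t` a batch of timesteps.
    Generic DDPM-style objective, not the paper's exact recipe.
    """
    noise = torch.randn_like(tgt_img)
    a_bar = alpha_bars[t].view(-1, 1, 1, 1)
    noisy = a_bar.sqrt() * tgt_img + (1 - a_bar).sqrt() * noise
    # Condition on the source image by channel-concatenation; the
    # instruction embedding enters the denoiser via cross-attention.
    eps_hat = denoiser(torch.cat([noisy, src_img], dim=1), t, text_emb)
    return F.mse_loss(eps_hat, noise)
```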
ISBN (Print): 9798350353006
Adiabatic quantum computing (AQC) is a promising approach for discrete and often NP-hard optimization problems. Current AQCs make it possible to implement problems of research interest, which has sparked the development of quantum representations for many computer vision tasks. Despite requiring multiple measurements from the noisy AQC, current approaches utilize only the best measurement, discarding the information contained in the remaining ones. In this work, we explore the potential of using this information for probabilistic balanced k-means clustering. Instead of discarding non-optimal solutions, we propose using them to compute calibrated posterior probabilities at little additional computational cost. This allows us to identify ambiguous solutions and data points, which we demonstrate on a D-Wave AQC on synthetic tasks and real visual data.
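One plausible reading of "use the non-optimal measurements" is to Boltzmann-weight every sampled solution by its energy instead of keeping only the argmin; duplicate measurements then accumulate probability mass. The function below is illustrative, with `temperature` as a stand-in for the calibration the paper performs, and is not the paper's estimator.

```python
import numpy as np

def posterior_from_samples(solutions, energies, temperature=1.0):
    """Boltzmann-weighted posterior over repeated annealer measurements.

    Every measured solution gets weight exp(-E/T) rather than keeping only
    the single lowest-energy one. Illustrative sketch; the temperature
    would need calibration in practice.
    """
    energies = np.asarray(energies, dtype=float)
    logits = -(energies - energies.min()) / temperature  # shift for stability
    probs = np.exp(logits) / np.exp(logits).sum()
    posterior: dict[tuple, float] = {}
    for sol, p in zip(map(tuple, solutions), probs):
        posterior[sol] = posterior.get(sol, 0.0) + p     # merge duplicates
    return posterior

# Three measurements of a 4-point, k=2 clustering; two of them agree:
print(posterior_from_samples([[0, 0, 1, 1], [0, 0, 1, 1], [0, 1, 0, 1]],
                             energies=[-3.2, -3.2, -2.9]))
```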
ISBN (Print): 9798350365474
We introduce a multimodal vision framework for precision livestock farming, harnessing the power of GroundingDINO, HQSAM, and ViTPose models. This integrated suite enables comprehensive behavioral analytics from video data without invasive animal tagging. GroundingDINO generates accurate bounding boxes around livestock, while HQSAM segments individual animals within these boxes. ViTPose estimates key body points, facilitating posture and movement analysis. Demonstrated on a sheep dataset with grazing, running, sitting, standing, and walking activities, our framework extracts invaluable insights: activity and grazing patterns, interaction dynamics, and detailed postural evaluations. Applicable across species and video resolutions, this framework revolutionizes non-invasive livestock monitoring for activity detection, counting, health assessments, and posture analyses. It empowers data-driven farm management, optimizing animal welfare and productivity through AI-powered behavioral understanding.
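The data flow of the three-stage pipeline is easy to sketch. The three callables below stand in for GroundingDINO, HQSAM, and ViTPose wrappers; they are hypothetical placeholders injected as parameters, not those libraries' actual APIs.

```python
from typing import Any, Callable

def analyze_frame(frame: Any,
                  detect_boxes: Callable,        # hypothetical GroundingDINO wrapper
                  segment_in_box: Callable,      # hypothetical HQSAM wrapper
                  estimate_keypoints: Callable,  # hypothetical ViTPose wrapper
                  prompt: str = "sheep") -> list[dict]:
    """Run one video frame through the detect -> segment -> pose pipeline.

    Sketches the data flow only; the three callables are placeholders for
    the real model APIs.
    """
    records = []
    for box in detect_boxes(frame, prompt):      # text-prompted box per animal
        records.append({
            "box": box,
            "mask": segment_in_box(frame, box),           # per-animal mask
            "keypoints": estimate_keypoints(frame, box),  # body keypoints
        })
    return records  # feeds counting, activity and posture analysis downstream
```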
ISBN (Print): 9798350365474
Pooling layers (e.g., max and average) may overlook important information encoded in the spatial arrangement of pixel intensity and/or feature values. We propose a novel lacunarity pooling layer that aims to capture the spatial heterogeneity of the feature maps by evaluating the variability within local windows. The layer operates at multiple scales, allowing the network to adaptively learn hierarchical features. The lacunarity pooling layer can be seamlessly integrated into any artificial neural network architecture. Experimental results demonstrate the layer's effectiveness in capturing intricate spatial patterns, leading to improved feature extraction capabilities. The proposed approach holds promise in various domains, especially in agricultural image analysis tasks. This work contributes to the evolving landscape of artificial neural network architectures by introducing a novel pooling layer that enriches the representation of spatial features. Our code is publicly available.
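A single-scale sketch of the idea: gliding-box lacunarity is the ratio of the second moment to the squared first moment, Lambda = E[X^2] / E[X]^2, computed per window, so heterogeneous windows score high where average pooling would report the same mean. The multi-scale aggregation and exact normalization in the paper may differ; this is an assumption-laden sketch.

```python
import torch
import torch.nn.functional as F

def lacunarity_pool(x: torch.Tensor, window: int = 3,
                    eps: float = 1e-6) -> torch.Tensor:
    """Pool feature maps by local lacunarity, Lambda = E[X^2] / E[X]^2.

    Measures variability relative to the mean inside each window instead of
    just the max/mean value. Single-scale sketch; the paper additionally
    aggregates across scales.
    """
    x = x.clamp(min=0)  # lacunarity assumes non-negative "mass" (e.g. post-ReLU)
    mean = F.avg_pool2d(x, window, stride=window)
    mean_sq = F.avg_pool2d(x * x, window, stride=window)
    return mean_sq / (mean * mean + eps)  # second moment / squared first moment

# x = torch.rand(1, 8, 32, 32); lacunarity_pool(x).shape -> (1, 8, 10, 10)
```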
Irrigation systems can vary widely in scale, from smallscale subsistence farming to large commercial agriculture (see Fig. 1 ). The heterogeneity in irrigation practices and systems across different regions adds to th...
ISBN (Print): 9798350365474
Affordance grounding refers to the task of finding the area of an object with which one can interact. It is a fundamental but challenging task, as a successful solution requires comprehensive understanding of a scene in multiple aspects, including detection, localization, and recognition of objects and their parts; the geospatial configuration/layout of the scene; 3D shapes and physics; as well as the functionality and potential interactions of objects and humans. Much of this knowledge is hidden and lies beyond the image content and the supervised labels of a limited training set. In this paper, we make an attempt to improve the generalization capability of current affordance grounding by taking advantage of the rich world, abstract, and human-object-interaction knowledge of pre-trained large-scale vision-language models [40]. On the AGD20K benchmark, our proposed model demonstrates a significant performance gain over competing methods for in-the-wild object affordance grounding. We further demonstrate that it can ground affordances for objects in random Internet images, even when both objects and actions are unseen during training.