Search Results - Inner Mongolia University Library

Document type

8,227 conference papers
158 journal articles
36 books

Collection scope

8,420 electronic documents
1 print holding

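Subject classification

Engineering: 5,789
- Computer Science and Technology...: 5,044
- Software Engineering: 3,605
- Optical Engineering: 1,541
- Information and Communication Engineering: 867
- Electrical Engineering: 663
- Control Science and Engineering: 642
- Mechanical Engineering: 501
- Biomedical Engineering (degrees conferrable in ...): 449
- Electronic Science and Technology (degrees conferrable in ...): 374
- Bioengineering: 349
- Instrument Science and Technology: 237
- Chemical Engineering and Technology: 119
- Architecture: 101
- Civil Engineering: 92
- Safety Science and Engineering: 72
- Materials Science and Engineering (degrees conferrable in ...): 58
- Transportation Engineering: 52
Science: 3,203
- Physics: 1,985
- Mathematics: 1,904
- Statistics (degrees conferrable in Science, ...): 579
- Biology: 408
- Chemistry: 126
- Systems Science: 57
Management: 488
- Library, Information and Archives Manag...: 329
- Management Science and Engineering (degrees conferrable in ...): 176
- Business Administration: 55
Medicine: 424
- Clinical Medicine: 407
- Basic Medicine (degrees conferrable in Medicine...): 105
- Pharmacy (degrees conferrable in Medicine, Sci...): 79
Art: 54
- Design (degrees conferrable in Art...): 53
Agriculture: 53
Law: 45
Education: 28
Economics: 18
Military Science: 11
Literature: 5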

Topic

image processing: 1,329
computer vision: 1,100
image segmentati...: 895
pattern recognit...: 663
image reconstruc...: 538
image analysis: 515
cameras: 501
layout: 451
shape: 374
computer science: 366
feature extracti...: 318
face recognition: 268
image recognitio...: 263
robustness: 260
humans: 243
pixel: 202
image edge detec...: 200
object recogniti...: 192
object detection: 189
pattern recognit...: 188

Institution

department of co...: 23
microsoft resear...: 20
center for autom...: 17
the robotics ins...: 16
national laborat...: 15
institute of ima...: 15
institute of ima...: 15
department of co...: 15
institute of com...: 15
department of co...: 14
tsinghua univers...: 14
school of comput...: 14
nec research ins...: 14
school of comput...: 14
robotics institu...: 13
institute for ro...: 13
computer science...: 12
carnegie mellon ...: 11
swiss fed inst t...: 11
inria sophia-ant...: 11

Author

anon: 31
huang thomas s.: 27
jain anil k.: 25
s.k. nayar: 24
nayar shree k.: 22
haralick robert ...: 22
timofte radu: 19
shum heung-yeung: 18
aggarwal j.k.: 18
zhang lei: 17
hancock edwin r.: 17
van gool luc: 16
g. healey: 15
davis larry s.: 14
rosenfeld azriel: 14
t. kanade: 14
r. szeliski: 14
ahuja narendra: 14
k. ikeuchi: 13
chellappa rama: 13

Language

English: 8,151
Other: 182
Chinese: 88
Turkish: 3
Spanish: 1
Japanese: 1
Portuguese: 1

Search query: "Any field = Proceedings - IEEE Computer Society Conference on Pattern Recognition and Image Processing."

8,421 records in total; showing records 551-560, sorted by relevance.

FINER: Flexible Spectral-Bias Tuning in Implicit NEural Representation by Variable-periodic Activation Functions

Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Zhen Liu, Hao Zhu, Qi Zhang, Jingde Fu, Weibing Deng, Zhan Ma, Yanwen Guo, Xun Cao

Affiliations: School of Electronic Science and Engineering, Nanjing University, Nanjing, China; AI Lab, Tencent Company, Shenzhen, China; Department of Mathematics, Nanjing University, Nanjing, China; Department of Computer Science and Technology, Nanjing University, Nanjing, China

ISBN (digital): 9798350353006

ISBN (print): 9798350353013

Implicit Neural Representation (INR), which utilizes a neural network to map coordinate inputs to corresponding attributes, is causing a revolution in the field of signal processing. However, current INR techniques suffer from a restricted capability to tune their supported frequency set, resulting in imperfect performance when representing complex signals with multiple frequencies. We have identified that this frequency-related problem can be greatly alleviated by introducing variable-periodic activation functions, for which we propose FINER. By initializing the bias of the neural network within different ranges, sub-functions with various frequencies in the variable-periodic function are selected for activation. Consequently, the supported frequency set of FINER can be flexibly tuned, leading to improved performance in signal representation. We demonstrate the capabilities of FINER in the contexts of 2D image fitting, 3D signed distance field representation, and 5D neural radiance fields optimization, and we show that it outperforms existing INRs.

Keywords: Three-dimensional displays; Shape; Frequency-domain analysis; Neural networks; Fitting; Signal processing; Rendering (computer graphics)

Tiny-PIRATE: A Tiny model with Parallelized Intelligence for Real-time Analysis as a Traffic countEr

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Synh Viet-Uyen Ha, Nhat Minh Chung, Tien-Cuong Nguyen, Hung Ngoc Phan

Affiliations: Int Univ, Sch Comp Sci & Engn, Ho Chi Minh City, Vietnam; Vietnam Natl Univ, Ho Chi Minh City, Vietnam

ISBN (print): 9781665448994

Due to the rapid growth in the number of vehicles over the last decade, there has been a dramatic increase in demand for highway capacity analysis. Vehicle counting, in particular, has become a key element of vision-based intelligent traffic systems deployed across metropolitan areas. Most methods solved the vehicle counting problem under the assumption of state-of-the-art computing systems. However, large-scale deployment of such systems for multi-camera processing is very inefficient. With the recent advancement of cost-efficient Internet-of-Things (IoT) devices alongside machine learning methods developed specifically for such devices, solving the vehicle counting problem for real-time traffic analysis on IoT edge devices, and thereby facilitating its large-scale deployment have become highly favorable. In this paper, we propose a framework of vehicle counting designed specifically for IoT edge computers which follows the detection-tracking-counting (DTC) model. The proposed solution aims at addressing the multimodality of contextual dynamics in traffic scenes with a small detector model, a robust tracker and a counting process that accurately estimate both a vehicle's motion of interest and its exit time from observation areas. Experimental results on AI City 2021 Track-1 Dataset showed that ours outperformed related methods with promising results regarding both accuracy and execution speed.

Keywords: Road transportation; Tracking; Image edge detection; Computational modeling; Urban areas; Detectors; Real-time systems

An Efficient 3D Synthetic Model Generation Pipeline for Human Pose Data Augmentation

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Vyas, Kathan; Jiang, Le; Liu, Shuangjun; Ostadabbas, Sarah

Affiliation: Northeastern Univ, Augmented Cognit Lab (ACLab), Boston, MA 02115, USA

ISBN (print): 9781665448994

3D modeling of articulated bodies of humans or animals and using these models for synthetic 2D and 3D pose data generation can mitigate the small data challenges faced by many critical applications such as healthcare. In this paper, we present our efficient 3D synthetic model generation (3D-SMG) pipeline used for body pose data augmentation. 3D-SMG pipeline starts with scanning point clouds from various angles around the subject using an off-the-shelf RGBD camera. We then implement a dual objective iterative closest point (ICP) algorithm that uses both color (if available) as well as geometric information from point cloud and apply a pose graph node optimization to form one single rigid body mesh. 3D-SMG also includes a series of post-processing steps to obtain a smooth mesh at the end of the pipeline. The approach allows it to be applied to any articulated object such as a human body or an animal. Our experiments also show high level of accuracy in dimensions of obtained 3D meshes, when compared to the original subject. As the final step towards developing augmented pose dataset, we perform model rigging to articulate the 3D model of the subject and generate dynamic avatars within variety of context-feasible poses.

Keywords: Solid modeling; Surface reconstruction; Three-dimensional displays; Image color analysis; Pipelines; Cameras; Data models

Real-time Monocular Depth Estimation with Sparse Supervision on Mobile

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Yucel, Mehmet Kerim; Dimaridou, Valia; Drosou, Anastasios; Saa-Garriga, Albert

Affiliations: Samsung Res UK, Staines, England; Ctr Res & Technol Hellas (CERTH), Informat Technol Inst, Thessaloniki, Greece

ISBN (print): 9781665448994

Monocular (relative or metric) depth estimation is a critical task for various applications, such as autonomous vehicles, augmented reality and image editing. In recent years, with the increasing availability of mobile devices, accurate and mobile-friendly depth models have gained importance. Increasingly accurate models typically require more computational resources, which inhibits the use of such models on mobile devices. The mobile use case is arguably the most unrestricted one, which requires highly accurate yet mobile-friendly architectures. Therefore, we try to answer the following question: How can we improve a model without adding further complexity (i.e. parameters)? Towards this end, we systematically explore the design space of a relative depth estimation model from various dimensions and we show, with key design choices and ablation studies, even an existing architecture can reach highly competitive performance to the state of the art, with a fraction of the complexity. Our study spans an in-depth backbone model selection process, knowledge distillation, intermediate predictions, model pruning and loss rebalancing. We show that our model, using only DIW as the supervisory dataset, achieves 0.1156 WHDR on DIW with 2.6M parameters and reaches 37 FPS on a mobile GPU, without pruning or hardware-specific optimization. A pruned version of our model achieves 0.1208 WHDR on DIW with 1M parameters and reaches 44 FPS on a mobile GPU.

Keywords: Runtime; Computational modeling; Graphics processing units; Estimation; Predictive models; Mobile handsets; Real-time systems

Ceramic Decoration Extraction Method Based on Computer Vision and Image Processing

2nd International Conference on Networking, Communications and Information Technology, NetCIT 2022

Author: Li, Li

Affiliation: Xi'An Academy of Fine Arts, Shaanxi, Xi'an 10065, China

ISBN (print): 9781665492737

China is one of the first countries to invent pottery in the world. Like India, West Asia, Japan and the central Balkans, our ancestors invented pottery in the Neolithic period of primitive society. As a product of computer technology, the development and application of graphics processing technology has been extended to all sectors of society, especially in the field of ceramic decorative pattern design. The speed and accuracy of traditional intelligent manufacturing detection cannot meet the current detection requirements. Machine vision detection technology captures the image through the vision system, uses the Prewitt directional derivative approximation operator to detect the edge of the copied image, and completes the template matching and matching value weighting by transforming the image detection space into the parameter space. After non-maximum suppression, all directions of corner response function are processed, and the right corner output and its direction are obtained. After expanding the corner area, the detection area is defined. Zernike moment discretization is used to process the real part and imaginary part of the image, and the image features are extracted. Combining the coordinates of each corner and sub-pixel coordinates of the image, the fake area and the real area are determined, so as to realize the image tampering detection. In this paper, we focus on the body patterns of ceramic artworks, aiming at high-quality reproduction and design re-creation, deeply study the collection and image correction technology of three-dimensional patterns of ceramic artworks, and deeply analyze the algorithm of feature detection and extraction. © 2022 IEEE.

Keywords: Extraction
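
The abstract above names the Prewitt operator for edge detection. As a rough illustration only, the following sketch applies the standard 3x3 Prewitt kernels to a small grayscale grid and returns the gradient magnitude. It is not the paper's pipeline: the function names and the toy image are assumptions made for this example, and the template matching, corner response, and Zernike moment steps are not covered.

```python
import math

# Standard Prewitt kernels (well-known definitions, not taken from the paper).
PREWITT_X = [[-1, 0, 1], [-1, 0, 1], [-1, 0, 1]]    # responds to left-right intensity change
PREWITT_Y = [[-1, -1, -1], [0, 0, 0], [1, 1, 1]]    # responds to top-bottom intensity change


def correlate3x3(image, kernel, row, col):
    """Weighted sum of the 3x3 neighbourhood centred on (row, col)."""
    total = 0
    for dr in range(3):
        for dc in range(3):
            total += kernel[dr][dc] * image[row + dr - 1][col + dc - 1]
    return total


def prewitt_magnitude(image):
    """Gradient magnitude for interior pixels; border pixels are left at 0.0."""
    rows, cols = len(image), len(image[0])
    out = [[0.0] * cols for _ in range(rows)]
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            gx = correlate3x3(image, PREWITT_X, r, c)
            gy = correlate3x3(image, PREWITT_Y, r, c)
            out[r][c] = math.hypot(gx, gy)
    return out


# Hypothetical toy image: a vertical step edge between dark (0) and bright (10).
image = [
    [0, 0, 10, 10],
    [0, 0, 10, 10],
    [0, 0, 10, 10],
    [0, 0, 10, 10],
]
edges = prewitt_magnitude(image)
print(edges[1][1])  # 30.0: three rows each contribute a difference of 10
```

A uniform image yields zero everywhere, so only intensity transitions produce a response.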


Blurry-Consistency Segmentation Framework with Selective Stacking on Differential Interference Contrast 3D Breast Cancer Spheroid

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Thanh-Huy Nguyen, Thi Kim Ngan Ngo, Mai Anh Vu, Ting-Yuan Tu

Affiliation: Department of Biomedical Engineering, IMBSL Lab, National Cheng Kung University, Tainan, ROC

ISBN (digital): 9798350365474

ISBN (print): 9798350365481

The ability of three-dimensional (3D) spheroid modeling to study the invasive behavior of breast cancer cells has drawn increased attention. The deep learning-based image processing framework is very effective at speeding up the cell morphological analysis process. Out-of-focus photos taken while capturing 3D cells under several z-slices, however, could negatively impact the deep learning model. In this work, we created a new algorithm to handle blurry images while preserving the stacked image quality. Furthermore, we proposed a unique training architecture that leverages consistency training to help reduce the bias of the model when dense-slice stacking is applied. Additionally, the model's stability is increased under the sparse-slice stacking effect by utilizing the self-training approach. The new blurring stacking technique and training flow are combined with the suggested architecture and self-training mechanism to provide an innovative yet easy-to-use framework. Our methods produced noteworthy experimental outcomes in terms of both quantitative and qualitative aspects.

Keywords: Training; Solid modeling; Three-dimensional displays; Microscopy; Stacking; Computer architecture; Breast cancer

DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional Transformer

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Wei Dong, Han Zhou, Ruiyi Wang, Xiaohong Liu, Guangtao Zhai, Jun Chen

Affiliations: McMaster University; Shanghai Jiao Tong University

ISBN (digital): 9798350365474

ISBN (print): 9798350365481

Image dehazing, a pivotal task in low-level vision, aims to restore the visibility and detail from hazy images. Many deep learning methods with powerful representation learning capability demonstrate advanced performance on non-homogeneous dehazing, however, these methods usually struggle with processing high-resolution images (e.g., 4000 × 6000) due to their heavy computational demands. To address these challenges, we introduce an innovative non-homogeneous Dehazing method via Deformable Convolutional Transformer-like architecture (DehazeDCT). Specifically, we first design a transformer-like network based on deformable convolution v4, which offers long-range dependency and adaptive spatial aggregation capabilities and demonstrates faster convergence and forward speed. Furthermore, we leverage a lightweight Retinex-inspired transformer to achieve color correction and structure refinement. Extensive experiment results and highly competitive performance of our method in NTIRE 2024 Dense and Non-Homogeneous Dehazing Challenge, ranking second among all 16 submissions, demonstrate the superior capability of our proposed method. The code is available: https://***/movingforward100/Dehazing_R.

Keywords: Convolutional codes; Representation learning; Image color analysis; Convolution; Computer architecture; Transformer cores; Transformers

Human Spine Motion Capture using Perforated Kinesiology Tape

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Hendrik Hachmann, Bodo Rosenhahn

Affiliation: Institute for Information Processing (tnt) / L3S, Leibniz University Hannover

In this work, we present a marker-based multi-view spine tracking method that is specifically adjusted to the requirements for movements in sports. A maximal focus is on the accurate detection of markers and fast usage of the system. For this task, we take advantage of the prior knowledge of the arrangement of dots in perforated kinesiology tape. We detect the tape and its dots using a Mask R-CNN and a blob detector. Here, we can focus on detection only while skipping any image-based feature encoding or matching. We conduct a reasoning in 3D by a linear program and Markov random fields, in which the structure of the kinesiology tape is modeled and the shape of the spine is optimized. In comparison to state-of-the-art systems, we demonstrate that our system achieves high precision and marker density, is robust against occlusions, and capable of capturing fast movements.


ViTA: An Efficient Video-to-Text Algorithm using VLM for RAG-based Video Analysis System

IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Authors: Md Adnan Arefeen, Biplob Debnath, Md Yusuf Sarwar Uddin, Srimat Chakradhar

Affiliations: NEC Laboratories America; University of Missouri-Kansas City

ISBN (digital): 9798350365474

ISBN (print): 9798350365481

Retrieval-augmented generation (RAG) is used in natural language processing (NLP) to provide query-relevant information in enterprise documents to large language models (LLMs). Such enterprise context enables the LLMs to generate more informed and accurate responses. When enterprise data is primarily videos, AI models like vision language models (VLMs) are necessary to convert information in videos into text. While essential, this conversion is a bottleneck, especially for large corpus of videos. It delays the timely use of enterprise videos to generate useful *** propose ViTA, a novel method that leverages two unique characteristics of VLMs to expedite the conversion process. As VLMs output more text tokens, they incur higher latency. In addition, large (heavyweight) VLMs can extract intricate details from images and videos, but they incur much higher latency per output token when compared to smaller (lightweight) VLMs that may miss details. To expedite conversion, ViTA first employs a lightweight VLM to quickly understand the gist or overview of an image or a video clip, and directs a heavyweight VLM (through prompt engineering) to extract additional details by using only a few (preset number of) output tokens. Our experimental results show that ViTA expedites the conversion time by as much as 43%, without compromising the accuracy of responses when compared to a baseline system that only uses a heavyweight VLM.

Keywords: Computer vision; Accuracy; Large language models; Conferences; Natural language processing; Data models; Pattern recognition

Multi-scale Feature Fusion Residual Shrinkage Network for COVID-19 Diagnosis

2022 IEEE SmartWorld, 19th IEEE International Conference on Ubiquitous Intelligence and Computing, 2022 IEEE International Conference on Autonomous and Trusted Vehicles Conference, 22nd IEEE International Conference on Scalable Computing and Communications, 2022 IEEE International Conference on Digital Twin, 8th IEEE International Conference on Privacy Computing and 2022 IEEE International Conference on Metaverse, SmartWorld/UIC/ATC/ScalCom/DigitalTwin/PriComp/Metaverse 2022

Authors: Lin, Jiale; Wen, Zhuangfei; Qiu, Zhao

Affiliations: Hainan University, School of Computer Science and Technology, Haikou, China; Haikou Hospital of the Maternal and Child Health, Department of Pediatrics, Haikou, China

ISBN (print): 9798350346558

With the normalization of prevention and control of COVID-19, the market of smart healthcare will be further opened. Smart healthcare and active assisted living have important applications in infectious disease prevention and control, hierarchical diagnosis and treatment, home monitoring, and other scenarios. CT scans have proven to be an effective way to appraise whether a patient is infected with COVID-19. However, due to the analysis process of CT detection being very complicated and requiring the attention of domain experts, computer-aided diagnosis systems based on artificial intelligence have received more attention from industry and academia. In this article, we propose an innovative deep learning model to detect COVID-19 from CT images, the Multi-scale Feature Fusion Residual Shrinkage Network (MSRSN). For the sake of eliminating the interference of noise information in CT images and enhancing the accuracy of CT scan image recognition, we insert soft thresholding as a nonlinear transform layer into the deep structure to effectively remove noise-related features. Furthermore, considering that reasonable setting of thresholds is often challenging, the MSRSN has been developed to integrate the MS-CAM module as a trainable module to adaptively determine the threshold, so it does not need professional knowledge in noise processing. The MSRSN achieved an accuracy of 0.9722, a specificity of 0.95 as well as an F1 score of 0.97 on the SARS-CoV-2 CT-Scan dataset. Compared with CNNs in existence as well as related state-of-the-art works, the results achieved by the MSRSN were excellent. Our results demonstrate that the MSRSN can assist specialists in screening as well as can aid in diagnosing patients with suspected COVID-19. © 2022 IEEE.

Keywords: COVID-19
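
The abstract above describes inserting soft thresholding as a nonlinear layer to suppress noise-related features. A minimal sketch of the standard soft-thresholding function is shown below, assuming a fixed, hand-picked threshold `tau`. In the MSRSN the threshold is reportedly learned by the MS-CAM module, which is not reproduced here; the function name and sample values are hypothetical.

```python
def soft_threshold(values, tau):
    """Shrink each value toward zero by tau; values with |v| <= tau become 0.0.

    This is the textbook definition sign(v) * max(|v| - tau, 0), with a fixed
    tau chosen by hand (an assumption; the paper learns its threshold).
    """
    if tau < 0:
        raise ValueError("tau must be non-negative")
    out = []
    for v in values:
        magnitude = abs(v) - tau
        if magnitude <= 0:
            out.append(0.0)          # small (noise-like) responses are zeroed
        elif v > 0:
            out.append(magnitude)    # large positive responses shrink by tau
        else:
            out.append(-magnitude)   # large negative responses shrink by tau
    return out


# Hypothetical feature values, for illustration only.
features = [-2.0, -0.3, 0.0, 0.4, 1.5]
print(soft_threshold(features, 0.5))  # [-1.5, 0.0, 0.0, 0.0, 1.0]
```

Unlike a hard cutoff, the surviving values are also reduced by `tau`, which keeps the mapping continuous.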



Address: 235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021
