咨询与建议

限定检索结果

文献类型

  • 50 篇 会议
  • 38 篇 期刊文献

馆藏范围

  • 88 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 63 篇 工学
    • 40 篇 计算机科学与技术...
    • 35 篇 软件工程
    • 14 篇 光学工程
    • 14 篇 信息与通信工程
    • 11 篇 生物工程
    • 10 篇 安全科学与工程
    • 9 篇 生物医学工程(可授...
    • 6 篇 机械工程
    • 3 篇 电气工程
    • 3 篇 控制科学与工程
    • 2 篇 力学(可授工学、理...
    • 2 篇 交通运输工程
    • 1 篇 仪器科学与技术
    • 1 篇 电子科学与技术(可...
    • 1 篇 化学工程与技术
  • 34 篇 理学
    • 12 篇 数学
    • 12 篇 物理学
    • 12 篇 生物学
    • 5 篇 统计学(可授理学、...
    • 2 篇 系统科学
    • 1 篇 化学
    • 1 篇 大气科学
    • 1 篇 海洋科学
  • 7 篇 管理学
    • 5 篇 管理科学与工程(可...
    • 3 篇 工商管理
    • 2 篇 图书情报与档案管...
  • 3 篇 法学
    • 3 篇 社会学
  • 2 篇 经济学
    • 2 篇 应用经济学
  • 2 篇 医学
    • 2 篇 临床医学
  • 1 篇 教育学
    • 1 篇 教育学

主题

  • 6 篇 visualization
  • 6 篇 feature extracti...
  • 5 篇 three-dimensiona...
  • 4 篇 semantics
  • 4 篇 bit rate
  • 3 篇 image segmentati...
  • 3 篇 convolution
  • 3 篇 predictive model...
  • 3 篇 computer vision
  • 3 篇 encoding
  • 3 篇 training
  • 2 篇 navier stokes eq...
  • 2 篇 semantic segment...
  • 2 篇 face detection
  • 2 篇 video coding
  • 2 篇 deep neural netw...
  • 2 篇 target tracking
  • 2 篇 security systems
  • 2 篇 educational inst...
  • 2 篇 stereo image pro...

机构

  • 27 篇 institute of ima...
  • 9 篇 institute of ima...
  • 7 篇 shanghai key lab...
  • 6 篇 shanghai key lab...
  • 4 篇 artificial intel...
  • 4 篇 moe key lab of a...
  • 4 篇 shanghai key lab...
  • 3 篇 university of pe...
  • 3 篇 the institute of...
  • 3 篇 department of co...
  • 3 篇 vector institute...
  • 3 篇 erlangen-nürnber...
  • 3 篇 radboud institut...
  • 3 篇 department of bi...
  • 3 篇 centre for medic...
  • 3 篇 general robotics...
  • 3 篇 department of di...
  • 3 篇 university of po...
  • 3 篇 department of ra...
  • 3 篇 department of ra...

作者

  • 19 篇 yang hua
  • 13 篇 zheng shibao
  • 9 篇 hua yang
  • 8 篇 zhai guangtao
  • 7 篇 jun zhou
  • 6 篇 xiaokang yang
  • 6 篇 zhou qin
  • 6 篇 xiao gu
  • 5 篇 chang zhigang
  • 5 篇 shibao zheng
  • 5 篇 pan renjie
  • 4 篇 ya zhang
  • 4 篇 haase robert
  • 4 篇 müller henning
  • 4 篇 hu menghan
  • 4 篇 baumgartner mich...
  • 4 篇 renjie pan
  • 4 篇 isensee fabian
  • 4 篇 zhou jun
  • 3 篇 kreshuk anna

语言

  • 85 篇 英文
  • 3 篇 其他
检索条件"机构=Institute of Image Processing and Network Engineering"
88 条 记 录,以下是1-10 订阅
排序:
M-RAT: a Multi-grained Retrieval Augmentation Transformer for image Captioning  17th
M-RAT: a Multi-grained Retrieval Augmentation Transformer f...
收藏 引用
17th Asian Conference on Computer Vision, ACCV 2024
作者: Song, Jiayan Pan, Renjie Zhou, Jun Yang, Hua Institute of Image Communication and Network Engineering Shanghai Jiao Tong University Shanghai200240 China Shanghai Key Lab of Digital Media Processing and Transmission Shanghai200240 China
Current encoder-decoder methods for image captioning mai-nly consist of an object detection module (two-stage), or rely on big models with large-scale datasets to improve the effectiveness, which leads to increasing c... 详细信息
来源: 评论
Learning group interaction for sports video understanding from a perspective of athlete
收藏 引用
Frontiers of Computer Science 2024年 第4期18卷 175-188页
作者: Rui HE Zehua FU Qingjie LIU Yunhong WANG Xunxun CHEN Intelligent Recognition and Image Processing(IRIP)Lab School of Computer Science and EngineeringBeihang UniversityBeijing 100191China Hangzhou Innovation Institute Behang UniversityHangzhou 310051China National Computer Network Emergency Response Technical Team/Coordination Center of China(CNCERT or CNCERT/CC) Beijing 100029China
Learning activities interactions between small groups is a key step in understanding team sports *** research focusing on team sports videos can be strictly regarded from the perspective of the audience rather than th... 详细信息
来源: 评论
Hydrodynamics-Informed Neural network for Simulating Dense Crowd Motion Patterns  24
Hydrodynamics-Informed Neural Network for Simulating Dense C...
收藏 引用
32nd ACM International Conference on Multimedia, MM 2024
作者: Zhou, Yanshan Lai, Pingrui Yu, Jiaqi Xiong, Yingjie Yang, Hua Institute of Image Communication and Network Engineering Shanghai Jiao Tong University Shanghai China Shanghai Key Lab of Digital Media Processing and Transmission Shanghai Jiao Tong University Shanghai China
With global occurrences of crowd crushes and stampedes, dense crowd simulation has been drawing great attention. In this research, our goal is to simulate dense crowd motions under six classic motion patterns, more sp... 详细信息
来源: 评论
L2RT-FIQA: Face image Quality Assessment via Learning-to-Rank Transformer  9th
L2RT-FIQA: Face Image Quality Assessment via Learning-to-Ra...
收藏 引用
9th International Forum on Digital Multimedia Communication, IFTC 2022
作者: Chen, Zehao Yang, Hua Institute of Image Communication and Network Engineering Shanghai Jiao Tong University Shanghai China Shanghai Key Lab of Digital Media Processing and Transmission Shanghai China
Face recognition (FR) systems are easily constrained by complex environmental situations in the wild. To ensure the accuracy of FR systems, face image quality assessment (FIQA) is applied to reject low-quality face im... 详细信息
来源: 评论
FC-GNN: Recovering Reliable and Accurate Correspondences from Interferences
FC-GNN: Recovering Reliable and Accurate Correspondences fro...
收藏 引用
Conference on Computer Vision and Pattern Recognition (CVPR)
作者: Haobo Xu Jun Zhou Hua Yang Renjie Pan Cunyan Li Institute of Image Communication and Network Engineering Shanghai Jiao Tong University Shanghai Key Lab of Digital Media Processing and Transmission
Finding correspondences between images is essential for many computer vision tasks and sparse matching pipelines have been popular for decades. However, matching noise within and between images, along with inconsisten... 详细信息
来源: 评论
Adaptive and Collaborative Multi-scale Alignment for Text-Based Person Search
Adaptive and Collaborative Multi-scale Alignment for Text-Ba...
收藏 引用
2023 IEEE International Conference on Visual Communications and image processing, VCIP 2023
作者: Yang, Xinxin Pan, Renjie Yang, Hua Institute of Image Communication and Network Engineering Shanghai Jiao Tong University Shanghai Key Lab of Digital Media Processing and Transmission Shanghai China Shanghai Jiao Tong University China MoE Key Lab of Artificial Intelligence AI Institute China
Text-To-image person search is challenging due to the cross-scale correspondences and information inequality between modalities. Specifically, images and text are complexly linked at different scales and images are us... 详细信息
来源: 评论
AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment
arXiv
收藏 引用
arXiv 2025年
作者: Cao, Yuqin Min, Xiongkuo Gao, Yixuan Sun, Wei Zhai, Guangtao Institute of Image Communication and Network Engineering Shanghai Key Laboratory of Digital Media Processing and Transmissions Shanghai Jiao Tong University Shanghai China
Many video-to-audio (VTA) methods have been proposed for dubbing silent AI-generated videos. An efficient quality assessment method for AI-generated audio-visual content (AGAV) is crucial for ensuring audio-visual qua... 详细信息
来源: 评论
Physics-Environment Interaction network for Dense Crowd Behavior Recognition
SSRN
收藏 引用
SSRN 2024年
作者: Yu, Jiaqi Zhou, Yanshan Pan, Renjie Lai, Pingrui Yang, Hua Institute of Image Communication and Network Engineering Shanghai Key Lab of Digital Media Processing and Transmission Shanghai Jiao Tong University Shanghai200240 China
The analysis of large-scale crowd behavior plays a crucial role in public safety. However, intelligent systems face three major challenges in analyzing dense crowd behavior: the severe occlusion between individuals, t... 详细信息
来源: 评论
UNQA: Unified No-Reference Quality Assessment for Audio, image, Video, and Audio-Visual Content
arXiv
收藏 引用
arXiv 2024年
作者: Cao, Yuqin Min, Xiongkuo Gao, Yixuan Sun, Wei Lin, Weisi Zhai, Guangtao The Institute of Image Communication and Network Engineering Shanghai Key Laboratory of Digital Media Processing and Transmissions Shanghai Jiao Tong University Shanghai200240 China The School of Computer Science and Engineering Nanyang Technological University Singapore639798 Singapore
As multimedia data flourishes on the Internet, quality assessment (QA) of multimedia data becomes paramount for digital media applications. Since multimedia data includes multiple modalities including audio, image, vi... 详细信息
来源: 评论
Spatial-Temporal Constrained Pseudo-labeling for Unsupervised Person Re-identification via GCN Inference  18th
Spatial-Temporal Constrained Pseudo-labeling for Unsupervis...
收藏 引用
18th International Forum of Digital Multimedia Communication, IFTC 2021
作者: Ling, Sen Yang, Hua Liu, Chuang Chen, Lin Zhao, Hongtian The Institute of Image Communication and Network Engineering Department of Electronic Engineering Shanghai Jiao Tong University Shanghai China Shanghai Key Laboratory of Digital Media Processing and Transmission Shanghai Jiao Tong University Shanghai China MoE Key Lab of Artificial Intelligence AI Institute Shanghai Jiao Tong University Shanghai China
Most existing unsupervised person re-identification (Re-ID) methods primarily depend on the cluster distance, and merely exploit the available source labeled data to assign pseudo labels for the unannotated data. Wher... 详细信息
来源: 评论