Search query: "Subject term = Multi-Modal Large Language Model"
14 records; showing 11-20
ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Li, Xiaoqi; Zhang, Mingxu; Geng, Yiran; Geng, Haoran; Long, Yuxing; Shen, Yan; Zhang, Renrui; Liu, Jiaming; Dong, Hao
Affiliations: Peking Univ Sch Comp Sci Beijing Peoples R China; Beijing Univ Posts & Telecommun Beijing Peoples R China; CUHK MMLab Hong Kong Peoples R China
Robot manipulation relies on accurately predicting contact points and end-effector directions to ensure successful operation. However, learning-based robot manipulation, trained on a limited category within a simulato...

Harnessing the Power of MLLMs for Transferable Text-to-Image Person ReID
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Authors: Tan, Wentao; Ding, Changxing; Jiang, Jiayu; Wang, Fei; Zhan, Yibing; Tao, Dapeng
Affiliations: South China Univ Technol Guangzhou Peoples R China; Pazhou Lab Guangzhou Peoples R China; JD Explore Acad Beijing Peoples R China; Yunnan Univ Kunming Yunnan Peoples R China; Yunnan United Vis Technol Co Ltd Kunming Yunnan Peoples R China
Text-to-image person re-identification (ReID) retrieves pedestrian images according to textual descriptions. Manually annotating textual descriptions is time-consuming, restricting the scale of existing datasets and t...

Diagram Formalization Enhanced Multi-Modal Geometry Problem Solver
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Authors: Zhang, Zeren; Cheng, Jo-Ku; Deng, Jingyang; Tian, Lu; Ma, Jinwen; Qin, Ziran; Zhang, Xiaokai; Zhu, Na; Leng, Tuo
Affiliations: School of Mathematical Sciences, Peking University, Beijing 100871, China; 01.AI, Beijing, China; School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China; School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China
Mathematical reasoning remains an ongoing challenge for AI models, especially for geometry problems, which require both linguistic and visual signals. As the vision encoders of most MLLMs are trained on natural scenes...

Stories of QRIO and PINO, and Beyond: Lessons Learned from Small Humanoid Projects From R&D to Business
International Journal of Humanoid Robotics, 2024, Vol. 21, No. 1
Authors: Fujita, Masahiro; Kawanami, Yasunori; Miyazawa, Kiyokazu; Kinoshita, Masaya; Sawai, Kunihito; Yamasaki, Fuminori; Matsui, Tatsuya; Endo, Ken; Ishiguro, Shu; Kitano, Hiroaki
Affiliations: Sony Grp Corp Technol Infrastruct Ctr AI Technol Div 2-10-1 Osaki Shinagawa Tokyo Japan; Sony Grp Corp Creat Ctr Creat Platform 1-7-1 Konan Minato Ku Tokyo Japan; iXs Co Ltd 7-7 Shin Kawasaki Kawasaki Kanagawa Japan; Flower Robot Inc J House 301 Tokyo Japan; Sony Comp Sci Labs Inc 3-14-13 Higashigotanda Tokyo Japan; XiBorg Inc 4026-34-3 Jingumae Tokyo Japan; Chiba Inst Technol Future Robot Technol Ctr 2-17-1 Tsudanuma Narashino Chiba 2750016 Japan; Sony Grp Corp 1-7-1 Konan Minato Ku Tokyo Japan; Okinawa Inst Sci & Technol Grad Sch 1919-1 Tancha Okinawa 9040495 Japan
In 1997, Sony announced AIBO, a fully autonomous small quadruped robot for home entertainment, and in 1999 the company began selling it as a consumer product. Soon after development, two small humanoid robots were ann...