Search Results - Inner Mongolia University Library



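Advanced search help

Field codes:

- T = title (book title, article title)
- A = author (responsible party)
- K = subject term
- P = publication name
- PU = publisher name
- O = organization (author affiliation, degree-granting institution, patent applicant)
- L = Chinese Library Classification number
- C = discipline classification number
- U = all fields
- Y = year (publication year, degree year, standard release year)

Search rules:

AND means "and"; OR means "or"; NOT means "does not contain". Operators must be uppercase, with one space on each side.

Search examples:

Example 1: (K=图书馆学 OR K=情报学) AND A=范并思 AND Y=1982-2016
(subject term "library science" or "information science", author 范并思, years 1982 to 2016)
Example 2: P=计算机应用与软件 AND (U=C++ OR U=Basic) NOT K=Visual AND Y=2011-2016
(publication 计算机应用与软件, any field containing "C++" or "Basic", subject term not "Visual", years 2011 to 2016)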
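The field codes and operator rules above can be assembled programmatically. The following is a minimal sketch, assuming only the syntax described in this help text; the helper names (`term`, `year_range`, `group`, `combine`) are hypothetical and are not part of any library interface. It reproduces Example 1 as a query string.

```python
# Field codes and operators as listed in the help text.
FIELD_CODES = {"T", "A", "K", "P", "PU", "O", "L", "C", "U", "Y"}
OPERATORS = {"AND", "OR", "NOT"}


def term(field: str, value: str) -> str:
    """Build a single FIELD=value term (hypothetical helper)."""
    if field not in FIELD_CODES:
        raise ValueError(f"unknown field code: {field}")
    return f"{field}={value}"


def year_range(start: int, end: int) -> str:
    """Build a Y=start-end term, as in the examples."""
    return term("Y", f"{start}-{end}")


def group(expression: str) -> str:
    """Wrap an expression in parentheses."""
    return f"({expression})"


def combine(operator: str, *parts: str) -> str:
    """Join parts with an uppercase operator and one space on each side."""
    if operator not in OPERATORS:
        raise ValueError(f"operator must be one of {sorted(OPERATORS)}")
    return f" {operator} ".join(parts)


# Example 1 from the help text.
query = combine(
    "AND",
    group(combine("OR", term("K", "图书馆学"), term("K", "情报学"))),
    term("A", "范并思"),
    year_range(1982, 2016),
)
print(query)
```

Validating the operator and field code up front mirrors the stated rules (uppercase operators, a fixed set of field codes); how the search system itself handles malformed input is not documented here.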


Refine search results

Document type

Conference papers (3)

Collection

Electronic documents (3)
Print holdings (0)

Discipline classification

Engineering (3)
- Computer Science and Technology... (3)
- Information and Communication Engineering (1)
- Control Science and Engineering (1)
- Software Engineering (1)
Management (1)
- Management Science and Engineering (... (1)
- Library, Information and Archives Man... (1)

Subject

data version con... (3)
physical data la... (1)
data platform (1)
ipfs (1)
software enginee... (1)
empirical softwa... (1)
dataset manageme... (1)
software evoluti... (1)
data streaming a... (1)
blockchain (1)
se4ai (1)

Institution

centre borelli u... (1)
univ calgary cal... (1)
concordia univ m... (1)
queens univ king... (1)
apple inc cupert... (1)

Author

gagneja anupriya (1)
yang jinqiu (1)
abdellatif ahmad (1)
fathollahzadeh p... (1)
bindal aanchal (1)
shihab emad (1)
shah vishrut (1)
zhao kaiyu (1)
muss timothy (1)
arya rajat (1)
sugden laura (1)
bhatia sandeep (1)
pacheco lorena b... (1)
hernandez jose a... (1)
wu ming-chuan (1)
chen tse-hsun (p... (1)
rabbi fazle (1)
paliwal mudit ma... (1)
agrawal pulkit (1)
raman sethu (1)

Language

English (2)
Other (1)

Search condition: "Subject term = Data Version Control"

3 records in total


DGChain: Data control version for trustworthy reproducibility with Blockchain

6th International Conference on Blockchain Computing and Applications, BCCA 2024

Author: Hernandez, Jose Armando (Centre Borelli, Université Paris-Saclay, Gif-sur-Yvette, France)

ISBN: (print) 9798350351538

This work presents the DGChain (data-Git- for Blockchain) project. This Python package allows version control of data in blockchain and IPFS based on a DAO (decentralized autonomous organization) for managing data in the development cycles of reproducible computational scientific research. Analyzes the benefits of using this Blockchain-Based Decentralized Architecture to mediate collaborative interactions between developers compared to existing solutions. Presents a use case in developing a medical research project and typical IRIS example to offer the traceability of changes and provenance of metadata, data, and code in data / Software version control systems through management of intrinsic hash-based persistent, immutable CIDs (Content Identifier) recorded in Merkle trees in the development cycle of its main products, publication, software source code, and data to guarantee reproducibility and trustworthiness in computational scientific research using DGChain. © 2024 IEEE.

Keywords: Blockchain; data version control; IPFS; software engineering


DVC in Open Source ML-development: The Action and the Reaction

IEEE/ACM 3rd International Conference on AI Engineering - Software Engineering for AI (CAIN)

Authors: Pacheco, Lorena Barreto Simedo; Rahman, Musfiqur; Rabbi, Fazle; Fathollahzadeh, Pouya; Abdellatif, Ahmad; Shihab, Emad; Chen, Tse-Hsun (Peter); Yang, Jinqiu; Zou, Ying

Affiliations: Concordia Univ, Montreal, PQ, Canada; Queens Univ, Kingston, ON, Canada; Univ Calgary, Calgary, AB, Canada

ISBN: (print) 9798400705915

Machine Learning (ML) systems are gaining popularity, reshaping various domains ranging from customer services to software engineering. The effectiveness of ML systems is dependent on the quality of their training data. Therefore, practitioners invest substantial time experimenting with different data, parameters, and models to guarantee the quality of the end system. Prior work highlighted unique challenges of developing ML systems, particularly concerning versioning data and models. Recently, various tools such as DVC and MLFlow have emerged to aid developers in the storage and tracking of data. Despite their growing popularity, very little is known about their usage patterns and impact on open-source software (OSS) systems. To address this gap, we conducted an empirical study on 56 GitHub OSS projects that use DVC to understand the DVC usage pattern and the impact of using DVC on the software development process. We found that versioning and tracking is the most adopted DVC feature, being utilized by all 56 projects and being the only adopted feature in 85.7% of them. Furthermore, we found that DVC has a significant impact on the software development process indicators such as the number of created pull requests (PRs), and the number of bug-fix commits. For instance, our findings showed that DVC causes a peak in the number of commits and PRs at the moment of the adoption, followed by a long-term decrease. We believe that our findings can assist practitioners in tailoring tools to better meet user requirements and help organizations realize potential outcomes of adopting such tools.

Keywords: Empirical Software Engineering; data version control; Software Evolution; SE4AI


Data Platform for Machine Learning

ACM SIGMOD International Conference on Management of Data (SIGMOD)

Authors: Agrawal, Pulkit; Arya, Rajat; Bindal, Aanchal; Bhatia, Sandeep; Gagneja, Anupriya; Godlewski, Joseph; Low, Yucheng; Muss, Timothy; Paliwal, Mudit Manu; Raman, Sethu; Shah, Vishrut; Shen, Bochao; Sugden, Laura; Zhao, Kaiyu; Wu, Ming-Chuan

Affiliation: Apple Inc, Cupertino, CA 95014, USA

ISBN: (print) 9781450356435

In this paper, we present a purpose-built data management system, MLdp, for all machine learning (ML) datasets. ML applications pose some unique requirements different from common conventional data processing applications, including but not limited to: data lineage and provenance tracking, rich data semantics and formats, integration with diverse ML frameworks and access patterns, trial-and-error driven data exploration and evolution, rapid experimentation, reproducibility of the model training, strict compliance and privacy regulations, etc. Current ML systems/services, often named MLaaS, to date focus on the ML algorithms, and offer no integrated data management system. Instead, they require users to bring their own data and to manage their own data on either blob storage or on file systems. The burdens of data management tasks, such as versioning and access control, fall onto the users, and not all compliance features, such as terms of use, privacy measures, and auditing, are available. MLdp offers a minimalist and flexible data model for all varieties of data, strong version management to guarantee reproducibility of ML experiments, and integration with major ML frameworks. MLdp also maintains the data provenance to help users track lineage and dependencies among data versions and models in their ML pipelines. In addition to table-stake features, such as security, availability and scalability, MLdp's internal design choices are strongly influenced by the goal to support rapid ML experiment iterations, which cycle through data discovery, data exploration, feature engineering, model training, model evaluation, and back to data discovery. The contributions of this paper are: 1) to recognize the needs and to call out the requirements of an ML data platform, 2) to share our experiences in building MLdp by adopting existing database technologies to the new problem as well as by devising new solutions, and 3) to call for actions from our communities on future challeng

Keywords: data platform; data streaming access; data version control; dataset management for machine learning; physical data layout



Address: 235 Daxue West Street, Saihan District, Hohhot, Inner Mongolia Autonomous Region. Postal code: 010021
