ISBN (print): 9781605584874
We present a novel method for key term extraction from text documents. In our method, a document is modeled as a graph of semantic relationships between the terms of that document. We exploit a remarkable feature of this graph: terms related to the main topics of the document tend to bunch up into densely interconnected subgraphs, or communities, while unimportant terms fall into weakly interconnected communities or even become isolated vertices. We apply graph community detection techniques to partition the graph into thematically cohesive groups of terms, and we introduce a criterion function to select the groups that contain key terms while discarding groups of unimportant terms. To weight terms and to determine semantic relatedness between them, we exploit information extracted from Wikipedia. This approach gives us two advantages. First, it allows effective processing of multi-theme documents. Second, it is good at filtering out noise in the document, such as navigation bars or headers in web pages. Evaluations show that the method outperforms existing approaches, producing key terms with higher precision and recall. Additional experiments on web pages show that our method is substantially more effective on noisy and multi-theme documents than existing methods. Copyright is held by the International World Wide Web Conference Committee (IW3C2).
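The pipeline above can be sketched in a few lines. This is a minimal, illustrative version only: the relatedness scores and term weights are placeholders for the Wikipedia-derived values the abstract mentions, the exact form of the criterion function is assumed (average term weight against a threshold), and full community detection is stood in for by connected components of the graph after weak edges are dropped.

```python
# Sketch of community-based key term extraction. Edge weights and term
# weights are assumed inputs (the paper derives them from Wikipedia).
from collections import defaultdict

def communities(edges, threshold=0.3):
    """Group terms into communities: connected components of the term graph
    after dropping weak edges (a simple stand-in for community detection)."""
    adj = defaultdict(set)
    nodes = set()
    for a, b, w in edges:
        nodes.update((a, b))
        if w >= threshold:
            adj[a].add(b)
            adj[b].add(a)
    seen, groups = set(), []
    for n in nodes:
        if n in seen:
            continue
        stack, comp = [n], set()
        while stack:
            v = stack.pop()
            if v in comp:
                continue
            comp.add(v)
            stack.extend(adj[v] - comp)
        seen |= comp
        groups.append(comp)
    return groups

def extract_key_terms(edges, term_weights, threshold=0.3, min_avg=0.5):
    """Keep communities passing an assumed criterion: average term weight."""
    key = []
    for comp in communities(edges, threshold):
        avg = sum(term_weights.get(t, 0.0) for t in comp) / len(comp)
        if avg >= min_avg:
            key.extend(comp)
    return sorted(key)
```

Noise such as navigation links tends to form its own low-weight community here, so it is discarded by the criterion rather than by ad-hoc filtering.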
Extracting information from tables is an important and rather complex part of information retrieval. For the task of extracting objects from HTML tables, we introduce methods for determining table orientation and for processing aggregating objects (such as Total rows) and scattered headers (super-row labels, subheaders).
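One common way to decide table orientation, sketched below under assumptions not stated in the abstract: cells belonging to one attribute tend to share a type, so if objects occupy rows, the columns of the data region are type-homogeneous, and vice versa. The heuristic only discriminates when attributes mix numeric and textual types.

```python
# Illustrative orientation heuristic; the paper's actual method may differ.
def is_numeric(cell):
    try:
        float(cell.replace(",", ""))
        return True
    except ValueError:
        return False

def orientation(table):
    """Return 'horizontal' if objects occupy rows under a header row,
    'vertical' if objects occupy columns beside a header column."""
    def homogeneity(lines):
        # Fraction of cells agreeing with the majority type, per line.
        scores = []
        for line in lines:
            n = sum(is_numeric(c) for c in line)
            scores.append(max(n, len(line) - n) / len(line))
        return sum(scores) / len(scores)
    data = [row[1:] for row in table[1:]]  # strip both candidate headers
    cols = [[r[j] for r in data] for j in range(len(data[0]))]
    return "horizontal" if homogeneity(cols) >= homogeneity(data) else "vertical"
```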
This paper presents an experimental evaluation of state-of-the-art approaches to automatic term recognition based on multiple features: a machine learning method and a voting algorithm. We show that in most cases the machine learning approach obtains the best results and needs little training data; we also identify the best subsets of the popular features.
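For concreteness, a Borda-style rank combination is one plausible instance of a multi-feature voting algorithm (the abstract does not specify the exact formulation): each feature ranks all candidate terms by its score, and a candidate's total is the sum of its per-feature ranks, lower being better.

```python
# Hypothetical voting combiner over term-hood features (e.g. TF-IDF, C-value).
def vote(feature_scores, top_k):
    """feature_scores: {feature_name: {term: score}}. Returns top_k terms
    by summed Borda rank across all features."""
    candidates = sorted(next(iter(feature_scores.values())))
    totals = {c: 0 for c in candidates}
    for scores in feature_scores.values():
        ranked = sorted(candidates, key=lambda c: -scores[c])
        for rank, c in enumerate(ranked):
            totals[c] += rank
    return sorted(candidates, key=lambda c: totals[c])[:top_k]
```

Unlike the machine learning approach, such voting needs no training data at all, which is the usual trade-off between the two families compared in the paper.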
Binary code comparison tools are widely used to analyze vulnerabilities, search for malicious code, detect copyright violations, etc. The article discusses three tools - BCC, BinDiff, Diaphora. Those are based on stat...
Distribution is a well-known solution for increasing performance and providing load balancing when optimal resource utilization is needed. Together with replication, it also improves reliability, accessibility, and fault tolerance. However, since the amount of data is large, there is a problem of maintaining meta-information about the distribution and of finding the needed data fragments during query execution. These problems are well understood, but they have not received much attention in the context of XML data management. This paper presents research in progress that examines the possibility of managing meta-information about XML data distribution by extending an auxiliary index structure called DataGuide.
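A DataGuide summarizes an XML document by storing each distinct label path exactly once, which makes it a natural carrier for distribution metadata. The sketch below shows one way such an extension could look; the node class, site names, and the idea of annotating every path node with the set of sites holding matching fragments are illustrative assumptions, not the paper's concrete design.

```python
# Hypothetical DataGuide extended with distribution meta-information.
class DataGuideNode:
    """One node per distinct label path; `sites` records which cluster
    nodes store fragments reachable by that path."""
    def __init__(self):
        self.children = {}   # label -> DataGuideNode
        self.sites = set()   # sites holding fragments for this path

def add_path(root, labels, site):
    """Register that `site` stores data under the given label path."""
    node = root
    for label in labels:
        node = node.children.setdefault(label, DataGuideNode())
        node.sites.add(site)

def sites_for(root, labels):
    """Answer the routing question during query execution: which sites
    must be contacted for this label path?"""
    node = root
    for label in labels:
        if label not in node.children:
            return set()
        node = node.children[label]
    return node.sites
```

With this structure a query over `/catalog/book` can be routed to only the sites that actually hold book fragments, instead of being broadcast to the whole cluster.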
ISBN (print): 9781457706066
There are thousands of software libraries being developed in the modern world: completely new libraries emerge, and new versions of existing ones appear regularly. Unfortunately, the developers of many libraries focus on developing the functionality of the library itself but neglect ensuring high quality and backward compatibility of the application programming interfaces (APIs) their libraries provide. The best practice for addressing these aspects is an automated regression test suite that can be run regularly (e.g., nightly) against the current development version of the library. Such a test suite ensures early detection of any regressions in the quality or compatibility of the library. But developing a good test suite can cost a significant amount of effort, which becomes an inhibiting factor for library developers when deciding on a QA policy; that is why many libraries have no test suite at all. This paper discusses an approach for low-cost automatic generation of basic tests for shared libraries, based on information automatically extracted from the library header files and on additional information about the semantics of some library data types. Such tests can call the APIs of target libraries with correct parameters and can detect typical problems like crashes "out of the box". Using this method significantly lowers the barrier to developing an initial version of library tests, which can then be gradually improved with a more powerful test development framework as resources appear. The method is based on analyzing API signatures and type definitions obtained from the library header files and on creating parameter initialization sequences by comparing target function parameter types with other functions' return values or out-parameters (usually it is necessary to call some function to get a correct parameter value for another function, and the initialization sequence of the necessary function calls can be quite long). The paper also descri
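The core of the initialization-sequence construction can be sketched as a recursive producer search: for each parameter type of the target function, find a function whose return type produces it, and build the chain of calls transitively. The signature representation and the example API names below are illustrative, not taken from any real library.

```python
# Sketch of initialization-sequence generation from API signatures.
# `functions` maps a name to (parameter_types, return_type), as would be
# extracted from header files.
def init_sequence(target, functions):
    """Return an ordered list of calls that initializes every parameter
    of `target` before calling it."""
    order = []

    def produce(type_name, stack=()):
        if type_name in stack:               # guard against type cycles
            raise ValueError(f"cannot construct {type_name}")
        for name, (params, ret) in functions.items():
            if ret == type_name:
                for p in params:             # producer may need setup too
                    produce(p, stack + (type_name,))
                if name not in order:
                    order.append(name)
                return
        raise ValueError(f"no producer for {type_name}")

    params, _ = functions[target]
    for p in params:
        produce(p)
    order.append(target)
    return order
```

This mirrors the observation in the abstract that initialization chains can be long: a cursor needs a database handle, which in turn needs an open call, and so on.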
This paper discusses a problem of ensuring backward binary compatibility when developing shared libraries. Linux (and GCC environment) is used as the main example. Breakage of the compatibility may result in crashing ...
ISBN (print): 0769523129
The paper is devoted to the analysis of a strategy of computation distribution on heterogeneous parallel systems. According to this strategy, the processes of a parallel program are distributed over the processors according to the processors' performance, and data are distributed evenly between the processes. The paper presents an algorithm that computes the optimal number of processes and their distribution over the processors, minimizing the execution time of an application. Processor performance is modeled as a function of the number of processes running on the processor and the amount of data processed by the processor.
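A brute-force version of this optimization is easy to state, which makes the strategy concrete: enumerate all per-processor process counts, give each process an equal data share, and take the split minimizing the slowest processor's time. The paper's actual algorithm is presumably far more efficient than this exhaustive sketch, and the performance model here (a per-processor function of the process count only) simplifies the two-argument model described above.

```python
# Exhaustive sketch of the even-data distribution strategy.
from itertools import product

def best_distribution(perf, total_data, max_procs):
    """perf[i](k): performance of processor i when running k processes.
    Each of the n processes gets total_data / n. Returns (n, split, time)
    for the split minimizing the maximum per-processor time."""
    best = None
    for split in product(range(max_procs + 1), repeat=len(perf)):
        n = sum(split)
        if n == 0:
            continue
        share = total_data / n
        t, ok = 0.0, True
        for i, k in enumerate(split):
            if k == 0:
                continue
            p = perf[i](k)
            if p <= 0:
                ok = False
                break
            t = max(t, k * share / p)  # time of processor i
        if ok and (best is None or t < best[2]):
            best = (n, split, t)
    return best
```

With constant performances the optimum simply assigns process counts proportional to processor speed; the interesting cases are perf functions that degrade as processes are added.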
The paper discusses problems arising in the development of avionics systems and considers how discrete-event simulation based on architecture models at the early stages of avionics design can help mitigate some of them. A tool for simulating AADL architecture models augmented with behavioural specifications is presented, and its main design decisions are discussed.
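At its core, such a tool rests on a discrete-event loop: a time-ordered queue of events whose handlers may schedule further events. The sketch below shows only that generic loop; all AADL-specific semantics (threads, ports, dispatch protocols) are omitted, and the periodic "dispatch" example is illustrative rather than taken from the tool.

```python
# Minimal discrete-event simulation loop of the kind such a tool builds on.
import heapq

def simulate(events, handlers, until):
    """events: initial (time, name) pairs. handlers: name -> list of
    (delay, next_event) scheduled whenever that event fires. Returns the
    trace of fired events up to time `until`."""
    queue = list(events)
    heapq.heapify(queue)
    trace = []
    while queue:
        time, name = heapq.heappop(queue)
        if time > until:
            break
        trace.append((time, name))
        for delay, nxt in handlers.get(name, []):
            heapq.heappush(queue, (time + delay, nxt))
    return trace
```

A periodic AADL thread, for instance, maps naturally onto an event that reschedules itself with its period as the delay.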