ISBN (print): 9783031706448; 9783031706455
This article outlines the digitization process and methodology applied to the archive of parliamentary questions from the 1st Parliamentary Term (1974-1977) of the Hellenic Parliament. A collaborative pilot project involving parliament, academia, and a research center facilitated the conversion of printed material to open data. The main tasks of the project include capturing digital images, developing a custom Optical Character Recognition (OCR) software solution that employs machine learning, and rigorously validating the accuracy of the output for a fragmented polytonic corpus of variable quality written in Katharevousa, a variety of Modern Greek. The article discusses the approach and challenges as well as the initial results of the digitization effort, emphasizing ongoing research steps. Overall, 1,674 images were digitally processed, corresponding to 1,338 questions. Following algorithmic training, character recognition accuracy exceeds 98.5%. Successful implementation streamlines similar future digitization operations in the vast parliamentary archives, while enabling in-depth studies of parliamentary control in the turbulent period of the immediate post-junta era in Greece. A preliminary comparative analysis with a corpus of newer parliamentary questions (2009-2019) provides insights and incentives for the further study of the characteristics and evolution of the Greek language.
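The abstract does not say how the 98.5% character accuracy is computed. As a minimal sketch, assuming accuracy is defined as one minus the character error rate (Levenshtein distance divided by the length of the ground-truth text), the metric could look as follows. The function names are illustrative, and the NFC normalization step is an assumption added so that precomposed and decomposed polytonic characters compare as equal.

```python
import unicodedata


def levenshtein(a: str, b: str) -> int:
    """Edit distance (insertions, deletions, substitutions) between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                 # delete ca
                            curr[j - 1] + 1,             # insert cb
                            prev[j - 1] + (ca != cb)))   # substitute or match
        prev = curr
    return prev[-1]


def char_accuracy(reference: str, hypothesis: str) -> float:
    """Assumed metric: 1 - (edit distance / reference length), floored at 0."""
    # Normalize so that e.g. alpha + combining acute equals the precomposed letter.
    ref = unicodedata.normalize("NFC", reference)
    hyp = unicodedata.normalize("NFC", hypothesis)
    if not ref:
        return 1.0 if not hyp else 0.0
    return max(0.0, 1.0 - levenshtein(ref, hyp) / len(ref))
```

Under this definition, a page of 1,000 ground-truth characters with 15 character errors scores 0.985.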
ISBN (digital): 9783031696510
ISBN (print): 9783031696503; 9783031696510
In this paper, we investigate different approaches for generating synthetic microdata from open-source aggregated data. Specifically, we focus on macro-to-micro data synthesis. We explore the potential of the Gaussian copula framework to estimate joint distributions from aggregated data. Our generated synthetic data is intended for educational and software testing use cases. We propose three scenarios to achieve realistic and high-quality synthetic microdata: (1) zero knowledge, (2) internal knowledge, and (3) external knowledge. The three scenarios assume different levels of knowledge about the underlying properties of the real microdata, i.e., standard deviations and covariates. Our evaluation includes matching tests to assess the privacy of the synthetic datasets. Our results indicate that macro-to-micro synthesis achieves better privacy preservation than other methods, demonstrating both the potential and the challenges of synthetic data generation in maintaining data privacy while providing useful data for analysis.
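As a rough illustration of the Gaussian copula idea, the sketch below draws two correlated standard normals, maps them to uniforms, and pushes each uniform through the inverse CDF of a marginal. It is a sketch under stated assumptions and not the authors' pipeline: the marginals (a normal "age" and an exponential "income"), the aggregate values, and the correlation `rho` are all hypothetical.

```python
import math
import random
from statistics import NormalDist


def sample_gaussian_copula(n, rho, inv_cdf_x, inv_cdf_y, seed=0):
    """Draw n pairs whose dependence follows a Gaussian copula with correlation rho."""
    rng = random.Random(seed)
    std = NormalDist()  # standard normal, used for the copula's uniform transform
    eps = 1e-12         # keep uniforms strictly inside (0, 1)
    rows = []
    for _ in range(n):
        z1 = rng.gauss(0.0, 1.0)
        # Second normal correlated with the first (2x2 Cholesky factor written out).
        z2 = rho * z1 + math.sqrt(1.0 - rho * rho) * rng.gauss(0.0, 1.0)
        u1 = min(max(std.cdf(z1), eps), 1.0 - eps)
        u2 = min(max(std.cdf(z2), eps), 1.0 - eps)
        rows.append((inv_cdf_x(u1), inv_cdf_y(u2)))
    return rows


# Hypothetical published aggregates: mean/std of age, mean income.
AGE = NormalDist(mu=40.0, sigma=12.0)
MEAN_INCOME = 30000.0


def income_inv_cdf(u):
    """Inverse CDF of an exponential marginal with the assumed mean."""
    return -MEAN_INCOME * math.log(1.0 - u)


rows = sample_gaussian_copula(5000, 0.6, AGE.inv_cdf, income_inv_cdf)
```

In a zero-knowledge setting the dispersion and dependence parameters would themselves have to be guessed, whereas the other two scenarios would supply them from internal or external sources.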
ISBN (print): 9783031500787; 9783031500770
This paper provides an overview of spherical blossoms, called splossoms, and some of their implications. The blossom of a polynomial is a multi-affine function on Euclidean space with the same number of variables as the degree of the polynomial. It provides many insights into the polynomial and simplifies methods in ways not otherwise apparent. One example is the de Casteljau algorithm for computing and subdividing a Bézier curve. This report describes a blossom for a parametric de Casteljau-like curve on the sphere, leading to similar insights and simplification of algorithms on the sphere. Two earlier such methods are the well-known SLERP and SQUAD interpolations of points on the sphere. These methods are reformulated with our new concept, the splossom, which plays the role of a blossom in spherical space. Some of its implications are briefly sketched to illustrate its potential. The splossom itself is neatly described in terms of spinors in Geometric Algebra. This development follows the Geometric Algebra approach and points to considerable further research within its broad vista.
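For orientation, the two classical ingredients mentioned above can be sketched briefly. The blossom of a Bézier curve can be evaluated by running de Casteljau with a different parameter at each level, and SLERP interpolates along the great arc between two unit vectors. This is the standard Euclidean blossom and the standard SLERP formula only, not the splossom construction of the paper. The function names are illustrative.

```python
import math


def lerp(p, q, t):
    """Affine interpolation between points p and q."""
    return tuple((1.0 - t) * a + t * b for a, b in zip(p, q))


def blossom(control_points, params):
    """Blossom of a degree-n Bezier curve: de Casteljau with one parameter per level."""
    pts = list(control_points)
    if len(params) != len(pts) - 1:
        raise ValueError("need exactly one parameter per de Casteljau level")
    for t in params:
        pts = [lerp(pts[i], pts[i + 1], t) for i in range(len(pts) - 1)]
    return pts[0]


def slerp(p, q, t):
    """Spherical linear interpolation between unit vectors p and q."""
    dot = max(-1.0, min(1.0, sum(a * b for a, b in zip(p, q))))
    omega = math.acos(dot)  # angle between p and q
    if omega < 1e-12:
        return tuple(p)     # coincident points: nothing to interpolate
    if math.pi - omega < 1e-12:
        raise ValueError("slerp is undefined for antipodal points")
    s = math.sin(omega)
    w0 = math.sin((1.0 - t) * omega) / s
    w1 = math.sin(t * omega) / s
    return tuple(w0 * a + w1 * b for a, b in zip(p, q))
```

Passing the same parameter at every level, as in `blossom(ctrl, (t, t, t))`, recovers the curve point at `t`. Mixing the values 0 and 1 recovers the control points, which is the property that makes subdivision fall out of the blossom.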
ISBN (print): 9783031646256; 9783031646263
We are entering a new era in which software systems are becoming ever larger and more complex, so composing such systems by manual means is becoming infeasible. To address this challenge, self-organising software models represent a promising direction since they allow the (bottom-up) emergence of complex computational structures from simple rules. In this paper, we propose an abstract machine, called the composition machine, which allows the definition and the execution of such models. Unlike typical abstract machines, our proposal does not compute individual programs but enables the emergence of multiple programs at once. In particular, we present the machine's semantics and demonstrate its operation with well-known rules from the realm of Boolean logic and elementary cellular automata.
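To make "simple rules" concrete, here is a minimal sketch of one update step of an elementary cellular automaton using the standard Wolfram rule numbering. It illustrates only the kind of rule the abstract refers to and is not the composition machine itself. The function name and the wrap-around boundary are assumptions.

```python
def eca_step(cells, rule):
    """Apply one step of elementary cellular automaton `rule` (0-255) to a ring of 0/1 cells."""
    n = len(cells)
    out = []
    for i in range(n):
        left, centre, right = cells[(i - 1) % n], cells[i], cells[(i + 1) % n]
        # The three-cell neighbourhood, read as a binary number, selects one bit of the rule.
        index = (left << 2) | (centre << 1) | right
        out.append((rule >> index) & 1)
    return out
```

For example, Rule 90 sets each cell to the XOR of its two neighbours, so a single live cell spreads outward in the familiar Sierpinski pattern.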
ISBN (print): 9783031560682; 9783031560699
The field and community of Information Retrieval (IR) are changing and evolving in response to the latest developments and advances in Artificial Intelligence (AI) and research culture. As the field and community reorient and reconsider their positioning within computing and information sciences more generally, it is timely to gather and discuss more seriously our field's vision for the future, the challenges and threats that the community and field face, and the bold new research questions and problems that are emerging as we re-imagine search. This workshop aims to provide a forum for the IR community to voice and discuss their concerns and pitch proposals for building and strengthening the field and community.
ISBN (print): 9783031646041; 9783031646058
This paper investigates the Dynamic Capacitated Profitable Tour Problem with Stochastic Requests (DCPTPSR), a variant of the Traveling Salesman Problem (TSP) with profits. In the DCPTPSR, online decisions must be made for accepting and scheduling requests over a finite number of periods. Requests follow a discrete-time stochastic process, and each request is characterized by a location, demand, and prize. Accepted requests must be served on a TSP tour such that the collected prize minus the transportation costs is maximized. The DCPTPSR has practical applications in food delivery and less-than-truckload transportation, where requests arrive in an online fashion and immediate decisions about acceptance and scheduling must be made. We model the DCPTPSR as a Markov Decision Process (MDP) and propose a Stochastic Dynamic Programming (SDP) algorithm for solving the problem to optimality. To address the computational challenges involved in SDP, we present a framework that integrates Reinforcement Learning (RL) as an alternative solution method. We perform an extensive numerical study in which instances with up to 25 incoming requests can be solved by SDP, while our RL approach can adequately solve instances with up to 100 incoming requests. In particular, the performance of the RL approach is very close to that of the optimal SDP policy, and it outperforms both the first-come, first-served heuristic and the first-accept traveling salesman algorithm. The latter algorithm accepts requests whenever the available capacity allows and afterward serves these demands on an optimal TSP tour. Instances with scarce capacity in particular show considerable potential for savings when request acceptance and transportation scheduling decisions are made simultaneously.
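The backward-induction idea behind SDP can be sketched on a heavily simplified accept/reject model. This is an assumption-laden toy, not the paper's algorithm: routing is replaced by a fixed per-request service cost, at most one request arrives per period, and the request types in `REQUEST_TYPES` are hypothetical. The state is the pair (period, remaining capacity).

```python
from functools import lru_cache

# Hypothetical request types: (arrival probability, demand, prize, service cost).
# With the remaining probability, no request arrives in a period.
REQUEST_TYPES = (
    (0.3, 1, 4.0, 1.0),
    (0.2, 2, 9.0, 2.0),
)


def optimal_expected_profit(horizon, capacity, request_types=REQUEST_TYPES):
    """Expected profit of the optimal accept/reject policy, by backward induction."""
    p_none = 1.0 - sum(p for p, _, _, _ in request_types)

    @lru_cache(maxsize=None)
    def value(t, cap):
        if t == horizon:
            return 0.0
        reject = value(t + 1, cap)       # value of carrying all capacity forward
        expected = p_none * reject       # no arrival this period
        for p, demand, prize, cost in request_types:
            best = reject
            if demand <= cap:            # acceptance is feasible
                accept = prize - cost + value(t + 1, cap - demand)
                best = max(best, accept)
            expected += p * best
        return expected

    return value(0, capacity)
```

Even in this toy, the optimal policy may reject a feasible low-prize request to keep capacity for a more valuable one later, which is the effect the abstract attributes to scarce-capacity instances. In the full problem the state space also tracks accepted locations, which suggests why exact SDP stops scaling at around 25 requests.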
ISBN (print): 9783031541285; 9783031541292
RISC-V is a recently developed open instruction set architecture that is gaining a lot of attention. To improve the security of these systems and design efficient countermeasures, a better understanding of vulnerabilities to novel and future attacks is mandatory. This paper demonstrates that RISC-V is susceptible to Jump-Oriented Programming, a class of complex code-reuse attacks. We provide an analysis of new dispatcher gadgets we discovered, and show how they can be used together to build a stealthy attack that bypasses existing protections. We implemented a proof-of-concept attack on an embedded web server compiled for RISC-V, in which we introduced a vulnerability allowing an attacker to read an arbitrary file from the remote host machine.
ISBN (print): 9783031613043; 9783031613050
Game-based learning is an effective pedagogical approach with a demonstrated capacity to activate learner engagement, inspire motivation, and enhance the overall learning experience. The application of educational robotics has also attracted a lot of attention in recent years across educational levels and domains. Despite their appeal and the positive learning outcomes associated with such innovative pedagogies, the synergistic edifying impact of blending them remains largely unexplored. The aim of this study is to present a synthesis of empirical evidence on game-based learning and educational robotics. A systematic literature review is conducted focusing on empirical research published between 2019 and 2023. The analysis reveals prevalent methodological approaches and pedagogical theories framing learning and instruction, as well as the most widely employed robotics and gaming platforms. The study sheds light not only on the benefits of embracing game-based learning and educational robotics, but also on the barriers and challenges associated with adopting such innovative pedagogies. Ultimately, the study attempts to portray the impact of these approaches on learning and transferable skills development.
ISBN (print): 9783031711145; 9783031711152
In Atlantico, Colombia, the Departmental Health Secretariat has been proactive in promoting healthy lifestyles to prevent non-communicable diseases (NCDs), adhering to the strategies outlined in the Health Action Plan (PAS) as recommended by the Ministry of Health's Promotion and Prevention and Epidemiology and Demography Directorates. However, the efficacy of these activities is often hampered by a reliance on external data sources or studies from regions with dissimilar health determinants. This paper highlights the role of business intelligence tools, including dimensional fact models, multidimensional databases, ETL (Extract, Transform, and Load) processes, and data visualization dashboards, in enhancing local data analysis, thereby providing valuable insights for public health management and research within Atlantico. By leveraging accurate, localized data, this approach strengthens decision-making processes in NCD prevention. The study aims to ascertain the impact of a healthy diet and physical activity on preventing NCDs among Atlantico's vulnerable populations, using a confirmatory research methodology that employs observation, surveys, and interviews within the constraints of the COVID-19 pandemic. The findings underscore the critical role of diet and exercise in NCD prevention and demonstrate the efficacy of employing digital analytical tools and business intelligence to inform user-centric health interventions. This research lays the groundwork for developing evidence-based, locally tailored NCD prevention strategies, marking a significant advancement in public health initiatives in Atlantico.
ISBN (print): 9783031712937; 9783031712944
We study randomized generation of sequences of test inputs to a system using Prolog. Prolog is a natural fit for generating test sequences that have a complex, logically interdependent structure. To counter the problems posed by a large (or infinite) set of possible tests, randomization is a natural choice. We study the impact that randomization in conjunction with SLD resolution has on test performance. To this end, this paper proposes two strategies for adding randomization to a test-generating program. One strategy works on top of standard Prolog semantics, whereas the other alters the SLD selection function. We analyze the mean time to reach a test case and the mean number of generated test cases in the framework of Markov chains. Finally, we provide an additional empirical evaluation and comparison of the two approaches.
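The "mean time to reach a test case" has a standard Markov-chain reading: if states stand for stages of a randomized derivation and transitions for random choices, the mean hitting time h of a target state satisfies h(target) = 0 and h(s) = 1 + sum over s' of P(s, s') h(s'). The sketch below solves this system by fixed-point iteration. The three-state chain in the usage note is hypothetical and not taken from the paper, and the code is written in Python rather than Prolog purely for illustration.

```python
def mean_hitting_times(transitions, target, tol=1e-12, max_iter=100_000):
    """Mean number of steps to reach `target` from each state of a Markov chain.

    `transitions` maps each state to a dict {next_state: probability}.
    Assumes the target is reachable from every state, so the iteration converges.
    """
    h = {state: 0.0 for state in transitions}
    h[target] = 0.0
    for _ in range(max_iter):
        delta = 0.0
        for state, row in transitions.items():
            if state == target:
                continue
            # One step is spent now, plus the expected remaining time afterwards.
            new = 1.0 + sum(p * h[nxt] for nxt, p in row.items())
            delta = max(delta, abs(new - h[state]))
            h[state] = new
        if delta < tol:
            break
    return h
```

As a usage example, take a chain where `start` moves to itself or to `mid` with probability 0.5 each, and `mid` moves back to `start` or on to `target` with probability 0.5 each. The function then gives a mean hitting time of 6 steps from `start` and 4 from `mid`.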