L*-Based Learning of Markov Decision Processes
Abstract
An influential active learning technique is Angluin’s L∗ algorithm for regular languages, which inspired several generalisations from DFAs to other automata-based modelling formalisms. In this work, we study L∗-based learning of deterministic Markov decision processes, first assuming an ideal setting with perfect information. Then, we relax this assumption and present a novel learning algorithm that collects information by sampling system traces via testing. Experiments with the implementation of our sampling-based algorithm suggest that it achieves better accuracy than state-of-the-art passive learning techniques with the same amount of test data. Unlike existing learning algorithms with predefined states, our algorithm learns the complete model structure including the states.
Original language | English |
---|---|
Title of host publication | Formal Methods - The Next 30 Years |
Editors | Maurice H. ter Beek, Annabelle McIver, José N. Oliveira |
Place of Publication | Cham |
Publisher | Springer |
Pages | 651 - 669 |
Number of pages | 19 |
ISBN (Electronic) | 978-3-030-30942-8 |
ISBN (Print) | 978-3-030-30941-1 |
Publication status | Published - 2019 |
Event | International Symposium on Formal Methods: FM 2019 - Porto, Portugal; Duration: 7 Oct 2019 → 11 Oct 2019 |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Volume | 11800 |
Conference
Conference | International Symposium on Formal Methods |
---|---|
Abbreviated title | FM 2019 |
Country/Territory | Portugal |
City | Porto |
Period | 7/10/19 → 11/10/19 |
Fields of Expertise
- Information, Communication & Computing
Projects
- 1 Finished
Dependable Internet of Things
Boano, C. A. (Co-Investigator (CoI)), Kubin, G. (Co-Investigator (CoI)), Bloem, R. (Co-Investigator (CoI)), Horn, M. (Co-Investigator (CoI)), Pernkopf, F. (Co-Investigator (CoI)), Zakany, N. (Co-Investigator (CoI)), Mangard, S. (Co-Investigator (CoI)), Witrisal, K. (Co-Investigator (CoI)), Römer, K. U. (Co-Investigator (CoI)), Aichernig, B. (Co-Investigator (CoI)), Bösch, W. (Co-Investigator (CoI)), Baunach, M. C. (Co-Investigator (CoI)), Tappler, M. (Co-Investigator (CoI)), Malenko, M. (Co-Investigator (CoI)), Weiser, S. (Co-Investigator (CoI)), Eichlseder, M. (Co-Investigator (CoI)), Leitinger, E. (Co-Investigator (CoI)), Grosinger, J. (Co-Investigator (CoI)), Großwindhager, B. (Co-Investigator (CoI)), Ebrahimi, M. (Co-Investigator (CoI)), Alothman Alterkawi, A. B. (Co-Investigator (CoI)), Knoll, C. (Co-Investigator (CoI)), Teschl, R. (Co-Investigator (CoI)), Saukh, O. (Co-Investigator (CoI)), Rath, M. (Co-Investigator (CoI)), Steinberger, M. (Co-Investigator (CoI)), Steinbauer-Wagner, G. (Co-Investigator (CoI)) & Tranninger, M. (Co-Investigator (CoI))
1/01/16 → 31/03/22
Project: Research project
Research output
- 1 Article
L*-Based Learning of Markov Decision Processes (Extended Version)
Tappler, M., Aichernig, B., Bacci, G., Eichlseder, M. & Larsen, K. G., Aug 2021, In: Formal Aspects of Computing. 33, 4-5, p. 575-615, 41 p.
Research output: Contribution to journal › Article › peer-review
Open Access