A decision heuristic for Monte Carlo tree search doppelkopf agents

Lade...
Vorschaubild
Dateien
Zu diesem Dokument gibt es keine Dateien.
Datum
2017
Autor:innen
Dockhorn, Alexander
Hewelt, Matthias
Kruse, Rudolf
Herausgeber:innen
Kontakt
ISSN der Zeitschrift
Electronic ISSN
ISBN
Bibliografische Daten
Verlag
Schriftenreihe
Auflagebezeichnung
URI (zitierfähiger Link)
ArXiv-ID
Internationale Patentnummer
Angaben zur Forschungsförderung
Projekt
Open Access-Veröffentlichung
Core Facility der Universität Konstanz
Gesperrt bis
Titel in einer weiteren Sprache
Forschungsvorhaben
Organisationseinheiten
Zeitschriftenheft
Publikationstyp
Beitrag zu einem Konferenzband
Publikationsstatus
Published
Erschienen in
2017 IEEE Symposium Series on Computational Intelligence (SSCI). Piscataway, New Jersey, USA: IEEE, 2017, pp. 51-58. ISBN 978-1-5386-4058-6. Available under: doi: 10.1109/SSCI.2017.8285181
Zusammenfassung

This work builds up on previous research by Sievers and Helmert, who developed an Monte Carlo Tree Search based doppelkopf agent. This four player card game features a larger state space than skat due to the unknown cards of the contestants. Additionally, players face the unique problem of not knowing their teammates at the start of the game. Figuring out the player parties is a key feature of this card game and demands differing play styles depending on the current knowledge of the game state. In this work we enhance the Monte Carlo Tree Search agent created by Sievers and Helmert with a decision heuristic. Our goal is to improve the quality of playouts, by suggesting high quality moves and predicting enemy moves based on a neural network classifier. This classifier is trained on an extensive history of expert player moves recorded during official doppelkopf tournaments. Different network architectures are discussed and evaluated based on their prediction accuracy. The best performing network was tested in a direct comparison with the previous Monte Carlo Tree Search agent by Sievers and Helmert. We show that high quality predictions increase the quality of playouts. Overall, our simulations show that adding the decision heuristic increased the strength of play under comparable computational effort.

Zusammenfassung in einer weiteren Sprache
Fachgebiet (DDC)
004 Informatik
Schlagwörter
Konferenz
2017 IEEE Symposium Series on Computational Intelligence (SSCI), 27. Nov. 2017 - 1. Dez. 2017, Honolulu, Hawaii, USA
Rezension
undefined / . - undefined, undefined
Zitieren
ISO 690DOCKHORN, Alexander, Christoph DOELL, Matthias HEWELT, Rudolf KRUSE, 2017. A decision heuristic for Monte Carlo tree search doppelkopf agents. 2017 IEEE Symposium Series on Computational Intelligence (SSCI). Honolulu, Hawaii, USA, 27. Nov. 2017 - 1. Dez. 2017. In: 2017 IEEE Symposium Series on Computational Intelligence (SSCI). Piscataway, New Jersey, USA: IEEE, 2017, pp. 51-58. ISBN 978-1-5386-4058-6. Available under: doi: 10.1109/SSCI.2017.8285181
BibTex
@inproceedings{Dockhorn2017decis-44702,
  year={2017},
  doi={10.1109/SSCI.2017.8285181},
  title={A decision heuristic for Monte Carlo tree search doppelkopf agents},
  isbn={978-1-5386-4058-6},
  publisher={IEEE},
  address={Piscataway, New Jersey, USA},
  booktitle={2017 IEEE Symposium Series on Computational Intelligence (SSCI)},
  pages={51--58},
  author={Dockhorn, Alexander and Doell, Christoph and Hewelt, Matthias and Kruse, Rudolf}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/44702">
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2019-01-23T16:49:28Z</dcterms:available>
    <dc:contributor>Hewelt, Matthias</dc:contributor>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <dcterms:title>A decision heuristic for Monte Carlo tree search doppelkopf agents</dcterms:title>
    <dc:contributor>Kruse, Rudolf</dc:contributor>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2019-01-23T16:49:28Z</dc:date>
    <dcterms:issued>2017</dcterms:issued>
    <bibo:uri rdf:resource="https://kops.uni-konstanz.de/handle/123456789/44702"/>
    <dc:creator>Doell, Christoph</dc:creator>
    <dcterms:abstract xml:lang="eng">This work builds up on previous research by Sievers and Helmert, who developed an Monte Carlo Tree Search based doppelkopf agent. This four player card game features a larger state space than skat due to the unknown cards of the contestants. Additionally, players face the unique problem of not knowing their teammates at the start of the game. Figuring out the player parties is a key feature of this card game and demands differing play styles depending on the current knowledge of the game state. In this work we enhance the Monte Carlo Tree Search agent created by Sievers and Helmert with a decision heuristic. Our goal is to improve the quality of playouts, by suggesting high quality moves and predicting enemy moves based on a neural network classifier. This classifier is trained on an extensive history of expert player moves recorded during official doppelkopf tournaments. Different network architectures are discussed and evaluated based on their prediction accuracy. The best performing network was tested in a direct comparison with the previous Monte Carlo Tree Search agent by Sievers and Helmert. We show that high quality predictions increase the quality of playouts. Overall, our simulations show that adding the decision heuristic increased the strength of play under comparable computational effort.</dcterms:abstract>
    <dc:contributor>Dockhorn, Alexander</dc:contributor>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:language>eng</dc:language>
    <dc:contributor>Doell, Christoph</dc:contributor>
    <dc:creator>Kruse, Rudolf</dc:creator>
    <dc:creator>Dockhorn, Alexander</dc:creator>
    <dc:creator>Hewelt, Matthias</dc:creator>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/36"/>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
  </rdf:Description>
</rdf:RDF>
Interner Vermerk
xmlui.Submission.submit.DescribeStep.inputForms.label.kops_note_fromSubmitter
Kontakt
URL der Originalveröffentl.
Prüfdatum der URL
Prüfungsdatum der Dissertation
Finanzierungsart
Kommentar zur Publikation
Allianzlizenz
Corresponding Authors der Uni Konstanz vorhanden
Internationale Co-Autor:innen
Universitätsbibliographie
Ja
Begutachtet
Diese Publikation teilen