Developing a finite-state morphological analyzer for Urdu and Hindi

Date
2007
Publication type
Contribution to conference proceedings
HANNEFORTH, Thomas, ed., Kay-Michael WÜRZNER, ed. Finite-state methods and natural language processing : 6th International Workshop, FSMNLP 2007 Potsdam, Germany, September 14 - 16 ; revised papers. Potsdam: Universitätsverlag, 2007, pp. 86-96. ISBN 978-3-940793-57-7
Abstract

We introduce and discuss a number of issues that arise in the process of building a finite-state morphological analyzer for Urdu, in particular issues with potential ambiguity and non-concatenative morphology. Our approach allows for an underlyingly similar treatment of both Urdu and Hindi via a cascade of finite-state transducers that transliterates the very different scripts into a common ASCII transcription system. As this transliteration system is based on the XFST tools that the Urdu/Hindi common morphological analyzer is also implemented in, no compatibility problems arise.

Subject (DDC)
400 Language, Linguistics
Conference
FSMNLP - Finite-state methods and natural language processing, 14 Sep. 2007 - 16 Sep. 2007, Potsdam, Germany
Cite
ISO 690
BÖGEL, Tina, Miriam BUTT, Annette HAUTLI-JANISZ, Sebastian SULGER, 2007. Developing a finite-state morphological analyzer for Urdu and Hindi. FSMNLP - Finite-state methods and natural language processing. Potsdam, Germany, 14 Sep. 2007 - 16 Sep. 2007. In: HANNEFORTH, Thomas, ed., Kay-Michael WÜRZNER, ed. Finite-state methods and natural language processing : 6th International Workshop, FSMNLP 2007 Potsdam, Germany, September 14 - 16 ; revised papers. Potsdam: Universitätsverlag, 2007, pp. 86-96. ISBN 978-3-940793-57-7
BibTeX
@inproceedings{Bogel2007Devel-17648,
  year={2007},
  title={Developing a finite-state morphological analyzer for Urdu and Hindi},
  isbn={978-3-940793-57-7},
  publisher={Universitätsverlag},
  address={Potsdam},
  booktitle={Finite-state methods and natural language processing : 6th International Workshop, FSMNLP 2007 Potsdam, Germany, September 14 - 16 ; revised papers},
  pages={86--96},
  editor={Hanneforth, Thomas and Würzner, Kay-Michael},
  author={Bögel, Tina and Butt, Miriam and Hautli-Janisz, Annette and Sulger, Sebastian}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/17648">
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:contributor>Hautli-Janisz, Annette</dc:contributor>
    <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/17648"/>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dc:contributor>Bögel, Tina</dc:contributor>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:language>eng</dc:language>
    <dc:creator>Bögel, Tina</dc:creator>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2012-02-01T15:52:43Z</dcterms:available>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <dc:rights>terms-of-use</dc:rights>
    <dcterms:issued>2007</dcterms:issued>
    <dcterms:bibliographicCitation>Publ. in: Finite-state methods and natural language processing : 6th International Workshop, FSMNLP 2007 Potsdam, Germany, september 14 - 16 ; revised papers / Thomas Hanneforth, Kay-Michael Würzner (eds.). - Potsdam : Universitätsverl., 2008. - S. 86-96. - ISBN 978-3-940793-57-7</dcterms:bibliographicCitation>
    <dcterms:abstract xml:lang="eng">We introduce and discuss a number of issues that arise in the process of building a finite-state morphological analyzer for Urdu, in particular issues with potential ambiguity and non-concatenative morphology. Our approach allows for an underlyingly similar treatment of both Urdu and Hindi via a cascade of finite-state transducers that transliterates the very different scripts into a common ASCII transcription system. As this transliteration system is based on the XFST tools that the Urdu/Hindi common morphological analyzer is also implemented in, no compatibility problems arise.</dcterms:abstract>
    <dc:creator>Butt, Miriam</dc:creator>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2012-02-01T15:52:43Z</dc:date>
    <dcterms:title>Developing a finite-state morphological analyzer for Urdu and Hindi</dcterms:title>
    <dc:contributor>Butt, Miriam</dc:contributor>
    <dc:creator>Hautli-Janisz, Annette</dc:creator>
    <dc:creator>Sulger, Sebastian</dc:creator>
    <dc:contributor>Sulger, Sebastian</dc:contributor>
  </rdf:Description>
</rdf:RDF>
University bibliography
Yes