Publikation:

Transliterating Urdu for a Broad-Coverage Urdu/Hindi LFG Grammar

Lade...
Vorschaubild

Dateien

Kamran_0-280261.pdf
Kamran_0-280261.pdfGröße: 284.08 KBDownloads: 937

Datum

2010

Autor:innen

Kamran Malik, Muhammad
Gulzar, Atif
Hussain, Sarmad

Herausgeber:innen

Kontakt

ISSN der Zeitschrift

Electronic ISSN

ISBN

Bibliografische Daten

Verlag

Schriftenreihe

Auflagebezeichnung

DOI (zitierfähiger Link)
ArXiv-ID

Internationale Patentnummer

Angaben zur Forschungsförderung

Projekt

Open Access-Veröffentlichung
Open Access Green
Core Facility der Universität Konstanz

Gesperrt bis

Titel in einer weiteren Sprache

Publikationstyp
Beitrag zu einem Konferenzband
Publikationsstatus
Published

Erschienen in

NICOLETTA CALZOLARI ..., , ed.. Proceedings of LREC 2010 : Seventh International Conference on Language Resources and Evaluation, May 17-23, 2010, Valletta, Malta. European Language Resources Association (ELRA), 2010, pp. 2921-2927. ISBN 2-9517408-6-7

Zusammenfassung

In this paper, we present a system for transliterating the Arabic-based script of Urdu to a Roman transliteration scheme. The system is integrated into a larger system consisting of a morphology module, implemented via finite state technologies, and a computational LFG grammar of Urdu that was developed with the grammar development platform XLE (Crouch et al. 2008). Our long-term goal is to handle Hindi alongside Urdu; the two languages are very similar with respect to syntax and lexicon and hence, one grammar can be used to cover both languages. However, they are not similar concerning the script – Hindi is written in Devanagari, while Urdu uses an Arabic-based script. By abstracting away to a common Roman transliteration scheme in the respective transliterators, our system can be enabled to handle both languages in parallel. In this paper, we discuss the pipeline architecture of the Urdu-Roman transliterator, mention several linguistic and orthographic issues and present the integration of the transliterator into the LFG parsing system.

Zusammenfassung in einer weiteren Sprache

Fachgebiet (DDC)
400 Sprachwissenschaft, Linguistik

Schlagwörter

Konferenz

LREC 2010, Seventh International Conference on Language Resources and Evaluation, 17. Mai 2010 - 23. Mai 2010, Valletta, Malta
Rezension
undefined / . - undefined, undefined

Forschungsvorhaben

Organisationseinheiten

Zeitschriftenheft

Zugehörige Datensätze in KOPS

Zitieren

ISO 690KAMRAN MALIK, Muhammad, Tafseer AHMED, Sebastian SULGER, Tina BÖGEL, Atif GULZAR, Ghulam RAZA, Sarmad HUSSAIN, Miriam BUTT, 2010. Transliterating Urdu for a Broad-Coverage Urdu/Hindi LFG Grammar. LREC 2010, Seventh International Conference on Language Resources and Evaluation. Valletta, Malta, 17. Mai 2010 - 23. Mai 2010. In: NICOLETTA CALZOLARI ..., , ed.. Proceedings of LREC 2010 : Seventh International Conference on Language Resources and Evaluation, May 17-23, 2010, Valletta, Malta. European Language Resources Association (ELRA), 2010, pp. 2921-2927. ISBN 2-9517408-6-7
BibTex
@inproceedings{KamranMalik2010Trans-30044,
  year={2010},
  title={Transliterating Urdu for a Broad-Coverage Urdu/Hindi LFG Grammar},
  url={http://www.lrec-conf.org/proceedings/lrec2010/pdf/194_Paper.pdf},
  isbn={2-9517408-6-7},
  publisher={European Language Resources Association (ELRA)},
  booktitle={Proceedings of LREC 2010 : Seventh International Conference on Language Resources and Evaluation, May 17-23, 2010, Valletta, Malta},
  pages={2921--2927},
  editor={Nicoletta Calzolari ...},
  author={Kamran Malik, Muhammad and Ahmed, Tafseer and Sulger, Sebastian and Bögel, Tina and Gulzar, Atif and Raza, Ghulam and Hussain, Sarmad and Butt, Miriam}
}
RDF
<rdf:RDF
    xmlns:dcterms="http://purl.org/dc/terms/"
    xmlns:dc="http://purl.org/dc/elements/1.1/"
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:bibo="http://purl.org/ontology/bibo/"
    xmlns:dspace="http://digital-repositories.org/ontologies/dspace/0.1.0#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/"
    xmlns:void="http://rdfs.org/ns/void#"
    xmlns:xsd="http://www.w3.org/2001/XMLSchema#" > 
  <rdf:Description rdf:about="https://kops.uni-konstanz.de/server/rdf/resource/123456789/30044">
    <dc:creator>Ahmed, Tafseer</dc:creator>
    <dc:creator>Kamran Malik, Muhammad</dc:creator>
    <dc:contributor>Bögel, Tina</dc:contributor>
    <bibo:uri rdf:resource="http://kops.uni-konstanz.de/handle/123456789/30044"/>
    <dc:date rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2015-02-25T10:29:01Z</dc:date>
    <dspace:isPartOfCollection rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <dcterms:hasPart rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/30044/1/Kamran_0-280261.pdf"/>
    <dc:creator>Bögel, Tina</dc:creator>
    <dc:creator>Butt, Miriam</dc:creator>
    <dc:contributor>Kamran Malik, Muhammad</dc:contributor>
    <void:sparqlEndpoint rdf:resource="http://localhost/fuseki/dspace/sparql"/>
    <dc:language>eng</dc:language>
    <dc:contributor>Sulger, Sebastian</dc:contributor>
    <dcterms:abstract xml:lang="eng">In this paper, we present a system for transliterating the Arabic-based script of Urdu to a Roman transliteration scheme. The system is integrated into a larger system consisting of a morphology module, implemented via finite state technologies, and a computational LFG grammar of Urdu that was developed with the grammar development platform XLE (Crouch et al. 2008). Our long-term goal is to handle Hindi alongside Urdu; the two languages are very similar with respect to syntax and lexicon and hence, one grammar can be used to cover both languages. However, they are not similar concerning the script – Hindi is written in Devanagari, while Urdu uses an Arabic-based script. By abstracting away to a common Roman transliteration scheme in the respective transliterators, our system can be enabled to handle both languages in parallel. In this paper, we discuss the pipeline architecture of the Urdu-Roman transliterator, mention several linguistic and orthographic issues and present the integration of the transliterator into the LFG parsing system.</dcterms:abstract>
    <dc:contributor>Raza, Ghulam</dc:contributor>
    <dc:rights>terms-of-use</dc:rights>
    <dc:contributor>Butt, Miriam</dc:contributor>
    <dcterms:isPartOf rdf:resource="https://kops.uni-konstanz.de/server/rdf/resource/123456789/45"/>
    <dc:creator>Raza, Ghulam</dc:creator>
    <dcterms:available rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2015-02-25T10:29:01Z</dcterms:available>
    <dcterms:rights rdf:resource="https://rightsstatements.org/page/InC/1.0/"/>
    <dcterms:title>Transliterating Urdu for a Broad-Coverage Urdu/Hindi LFG Grammar</dcterms:title>
    <dspace:hasBitstream rdf:resource="https://kops.uni-konstanz.de/bitstream/123456789/30044/1/Kamran_0-280261.pdf"/>
    <dc:creator>Sulger, Sebastian</dc:creator>
    <foaf:homepage rdf:resource="http://localhost:8080/"/>
    <dc:contributor>Ahmed, Tafseer</dc:contributor>
    <dcterms:issued>2010</dcterms:issued>
    <dc:creator>Gulzar, Atif</dc:creator>
    <dc:contributor>Gulzar, Atif</dc:contributor>
    <dc:contributor>Hussain, Sarmad</dc:contributor>
    <dc:creator>Hussain, Sarmad</dc:creator>
  </rdf:Description>
</rdf:RDF>

Interner Vermerk

xmlui.Submission.submit.DescribeStep.inputForms.label.kops_note_fromSubmitter

Kontakt

Prüfdatum der URL

2015-02-25

Prüfungsdatum der Dissertation

Finanzierungsart

Kommentar zur Publikation

Allianzlizenz
Corresponding Authors der Uni Konstanz vorhanden
Internationale Co-Autor:innen
Universitätsbibliographie
Ja
Begutachtet
Diese Publikation teilen