Encoding and preliminary annotation of a multilingual spoken corpus of interpreted telephone conversations for the study of face-threatening acts

Marcelo Yuji Himoro; Antonio Pareja-Lora

doi:10.25267/Pragmalinguistica.2022.i30.19

Info

No. 30 (2022)

Monographic section

Pages 413-432

Published: 01-12-2022

Authors

Marcelo Yuji Himoro (ES) Universidad Nacional de Educación a Distancia
https://orcid.org/0000-0002-7591-0354
Antonio Pareja-Lora (ES) Universidad de Alcalá
https://orcid.org/0000-0001-5804-4119

Abstract

Growing and increasingly consolidated demand for telephone interpreting is drawing more research attention to it. The main aim of this study is to build a spoken corpus of interpreter-mediated telephone conversations designed for the study of face-threatening acts (FTAs). Each conversation involves Spanish and a second language: German, Chinese, French, English, or Russian. First, we briefly review the milestones reached in previous work, namely the compilation of the recorded and anonymised conversations, their initial processing, and their transcription and translation. Next, we describe in detail the conversion of the transcriptions to the EXMARaLDA format and their synchronisation with the recordings. Finally, we discuss the limitations and challenges encountered in these conversion and synchronisation processes.

Keywords

telephone interpreting

spoken corpora

pragmatic annotation

annotation tools

EXMARaLDA

Supporting agencies

R&D project "Análisis de ataques contra la imagen en interpretación telefónica" (CM/JIN/2019-040), Comunidad de Madrid (Spain); R&D project "Pragmática de corpus e interpretación telefónica: análisis de ataques contra la imagen (PRAGMACOR)" (PID2021-127196NA-I00), Ministerio de Ciencia e Innovación (Spain); R&D project "LITHME (Language in the Human-Machine Era, COST Action)" (CA19102), Horizon 2020 Framework Programme

How to cite

Himoro, M. Y., & Pareja-Lora, A. (2022). Codificación y anotación preliminar de un corpus oral multilingüe de conversaciones telefónicas interpretadas para el estudio de los ataques a la imagen [Encoding and preliminary annotation of a multilingual spoken corpus of interpreted telephone conversations for the study of face-threatening acts]. Pragmalingüística, (30), 413–432. https://doi.org/10.25267/Pragmalinguistica.2022.i30.19

License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
