T01

Transforming text across media

PI(s): Prof. Dr. Manfred Stede & Prof. Dr. Tatjana Scheffler

In collaboration with two language-technology companies, the project addresses the task of tailoring text to the conventions and requirements of different target media, in particular written blogs and microblogs (Twitter), and spoken podcasts. The focus is on the discourse-level phenomena of coreference, particle use, and coherence relations. Variation on these dimensions will be correlated with human processing ease in reading/listening experiments. The findings will then be translated into models and algorithms for identifying text passages that should be adapted, and experiments with automatically performing such adaptations will be carried out.

Members

Prof. Dr. Tatjana Scheffler
Ruhr-Universität Bochum, Fakultät für Philologie, GB 5/157
(+49) 234 32-21471 tatjana.scheffler@rub.de

Hannah Seemann
Ruhr-Universität Bochum, Fakultät für Philologie, GB 5/156

Prof. Dr. Manfred Stede
Universität Potsdam, Campus Golm, Haus 14, Raum 2.31
(+49) 331 977-2691 manfred.stede@uni-potsdam.de

Publications

  • Peer-Reviewed: Papers, Journals, Books, Articles of the CRC
  • Talk or Presentation: Talks, Presentations, Posters of the CRC
  • SFB-Related: Publications not produced in connection with the CRC, but thematically relevant
  • Other: Papers, Journals, Books, Articles of the CRC, but not peer-reviewed
Author(s) | Title | Year | Published in | Links
Scheffler, T., Kern, L.-A., & Seemann, H. J. | The medium is not the message: Individual level register variation in blogs vs. tweets. | 2022 | Register Studies, 4(2), 171–201. | DOI: 10.1075/rs.22009.sch
Seemann, H. J., & Scheffler, T. | Differentiating Social Media Texts via Clustering. | 2022 | In F. Karsdorp, A. Lassche, & K. Nielbo (Eds.), Proceedings of the Computational Humanities Research Conference 2022 (pp. 177–188). Antwerp, Belgium. |
Shahmohammadi, S., Seemann, H. J., Stede, M., & Scheffler, T. | Encoding discourse structure: comparison of RST and QUD. | 2023 | In M. Strube, C. Braud, C. Hardmeier, J. J. Li, S. Loáiciga, & A. Zeldes (Eds.), Proceedings of the 4th Workshop on Computational Approaches to Discourse (CODI 2023) (pp. 89–98). Toronto, Canada: Association for Computational Linguistics. | DOI: 10.18653/v1/2023.codi-1.11
Seemann, H. J., & Maršík, A. | Expressing the degree of confidence and attitude in Czech and German. | 2023 | Poster presented at the 10th International Contrastive Linguistics Conference (ICLC-10), University of Mannheim, Mannheim, Germany, 18–21 July. |
Shahmohammadi, S., & Stede, M. | Discourse Parsing for German with new RST Corpora. | 2024 | Proceedings of the 20th Conference on Natural Language Processing (KONVENS 2024), pp. 65–74. Vienna, Austria: Association for Computational Linguistics. |
Seemann, H. J., Shahmohammadi, S., Stede, M., & Scheffler, T. | Spoken vs. Written Computer-Mediated Communication. | 2024 | Proceedings of the 11th Conference on CMC and Social Media Corpora for the Humanities (CMC 2024), pp. 70–74. Université Côte d'Azur, Nice, France. |
Seemann, H. J., Shahmohammadi, S., Stede, M., & Scheffler, T. | Discourse-Level Features in Spoken and Written Communication. | 2024 | Proceedings of the 20th Conference on Natural Language Processing (KONVENS 2024), pp. 292–302. Vienna, Austria: Association for Computational Linguistics. |