Project T01
Transforming text across media
PI(s): Prof. Dr. Manfred Stede & Prof. Dr. Tatjana Scheffler
In collaboration with two language-technology companies, the project addresses the task of tailoring text to the conventions and requirements of different target media, in particular written blogs and microblogs (Twitter), and spoken podcasts. The focus is on the discourse-level phenomena of coreference, particle use, and coherence relations. Variation on these dimensions will be correlated with human processing ease in reading/listening experiments. The findings will then be translated into models and algorithms for identifying text passages that should be adapted, and experiments with automatically performing such adaptations will be carried out.
Members

Prof. Dr. Manfred Stede (+49) 331 977-2691 manfred.stede@uni-potsdam.de Homepage University ResearchGate ORCID

Prof. Dr. Tatjana Scheffler (+49) 234 32-21471 tatjana.scheffler@rub.de Homepage GoogleScholar ORCID


Papers
Author(s) | Title | Year | Published in | Links |
---|---|---|---|---|
Scheffler, T., Kern, L.-A., & Seemann, H. J. | The medium is not the message: Individual level register variation in blogs vs. tweets. | 2022 | Register Studies, 4(2), 171-201. DOI: 10.1075/rs.22009.sch | |
Seemann, H. J., & Scheffler, T. | Differentiating Social Media Texts via Clustering. | 2022 | In F. Karsdorp, A. Lassche, & K. Nielbo (Eds.), Proceedings of the Computational Humanities Research Conference 2022 (pp. 177-188). Antwerp, Belgium. | |
Shahmohammadi, S., Seemann, H. J., Stede, M., & Scheffler, T. | Encoding discourse structure: comparison of RST and QUD. | 2023 | In M. Strube, C. Braud, C. Hardmeier, J. J. Li, S. Loáiciga, & A. Zeldes (Eds.), Proceedings of the 4th Workshop on Computational Approaches to Discourse (CODI 2023) (pp. 89–98). Toronto, Canada: Association for Computational Linguistics. DOI: 10.18653/v1/2023.codi-1.11 | |
Shahmohammadi, S., & Stede, M. | Discourse Parsing for German with new RST Corpora. | 2024 | Proceedings of the 20th Conference on Natural Language Processing (KONVENS 2024), pp. 65-74. Vienna, Austria: Association for Computational Linguistics. | |
Seemann, H. J., Shahmohammadi, S., Stede, M., & Scheffler, T. | Spoken vs. Written Computer-Mediated Communication. | 2024 | Proceedings of the 11th Conference on CMC and Social Media Corpora for the Humanities (CMC 2024), pp. 70-74. Université Côte d'Azur, Nice, France. | |
Seemann, H. J., Shahmohammadi, S., Stede, M., & Scheffler, T. | Discourse-Level Features in Spoken and Written Communication. | 2024 | Proceedings of the 20th Conference on Natural Language Processing (KONVENS 2024), pp. 292–302. Vienna, Austria: Association for Computational Linguistics. | |
Seemann, H., & Scheffler, T. | German Modal Particles as Discourse Signals. | 2025 | Dialogue & Discourse, 16(1), 1-30. DOI: 10.5210/dad.2025.101 | |
Seemann, H. | Modalpartikel. | 2024 | In D. Gutzmann, K. Turgay, & T. E. Zimmermann (Eds.), Semantik und Pragmatik. Wörterbücher zur Sprach- und Kommunikationswissenschaft (WSK) Online. Berlin: De Gruyter. | |