research
Publications and talks.
More recent information, as well as information in Polish, may be available on my institutional website
Publications
Papers
-
Šeļa, A., Nagy, B., Byszuk, J., Hernández-Lorenzo, L., Szemes, B. and Eder, M. (forthcoming). From stage to page: language independent bootstrap measures of distinctiveness in fictional speech, [pre-print].
-
Hernández-Lorenzo, L., and Byszuk J. (2023). Challenging stylometry: the authorship of the baroque play La Segunda Celestina, Digital Scholarship in the Humanities, advance access: https://doi.org/10.1093/llc/fqac063.
-
Byszuk, J. (2023). On Computers in text analysis. In: J. O’Sullivan (ed.), The Bloomsbury Handbook to the Digital Humanities, 159–68. London: Bloomsbury.
-
Herrmann, J. B., Byszuk, J. and Grisot, G. (2022). Using word embeddings for validation and enhancement of spatial entity lists. Digital Humanities 2022: Conference Abstracts. Tokyo: University of Tokyo, pp. 239–41, https://dh2022.dhii.asia/dh2022bookofabsts.pdf.
-
Byszuk, J. and Dombrowski, Q. (2022). Stylometric investigations into translationese: The Baby-Sitters Club across languages. In Misuraca, M., Scepi, G. and Spano, M. (eds), Proceedings of the 16th International Conference on Statistical Analysis of Textual Data, vol. 1. Naples, pp. 188–96, http://lexicometrica.univ-paris3.fr/jadt/JADT2022/VOL1.pdf.
-
Škorić, M., Stanković, R., Ikonić Nešić, M., Byszuk, J. and Eder, M. (2022). Parallel stylometric document embeddings with deep learning based language models in literary authorship attribution, Mathematics, 10(5), 838 https://doi.org/10.3390/math10050838.
- Idziak, J., Šeļa, A., Woźniak, M., Leśniak, A., Byszuk, J. and Eder, M. (2021). Scalable handwritten text recognition system for lexicographic sources of under-resourced languages and alphabets. Computational Science – ICCS 2021, vol. 1. (LNCS 12742). Springer, pp. 137–50, [pre-print]. .
-
Byszuk, J. (2020). The voices of Doctor Who – how stylometry can be useful in revealing new information about TV series. Digital Humanities Quarterly, 14(4).
-
Byszuk, J., Woźniak, M., Kestemont, M., Leśniak, A., Łukasik, W., Šeļa, A. and Eder, M. (2020). Detecting direct speech in multilingual collection of 19th century novels. Proceedings of the LREC 2020. Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2020). Marseille, pp. 100–04.
- Franzini, G., Kestemont, M., Rotari, G., Jander, M., Ochab, J. K., Franzini, E., Byszuk, J. and Rybicki, J. (2018). Attributing authorship in the noisy digitized correspondence of Jacob and Wilhelm Grimm. Frontiers in Digital Humanities, 5(4), doi:10.3389/fdigh.2018.00004.
Other
- Byszuk, J. (2023) What is Authorship Attribution? in Schöch, C., Dudar, J., & Fileva, E. (2023). CLS INFRA D3.2: Series of Five Short Survey Papers on Methodological Issues (= Survey of Methods in Computational Literary Studies) (v1.1.0). URL: https://methods.clsinfra.io, DOI: 10.5281/zenodo.7892112.
- Byszuk, J. (2023) Analysis in Authorship Attribution in Schöch, C., Dudar, J., & Fileva, E. (2023). CLS INFRA D3.2: Series of Five Short Survey Papers on Methodological Issues (= Survey of Methods in Computational Literary Studies) (v1.1.0). URL: https://methods.clsinfra.io, DOI: 10.5281/zenodo.7892112.
- Byszuk, J. (2023) Evaluation in Authorship Attribution in Schöch, C., Dudar, J., & Fileva, E. (2023). CLS INFRA D3.2: Series of Five Short Survey Papers on Methodological Issues (= Survey of Methods in Computational Literary Studies) (v1.1.0). URL: https://methods.clsinfra.io, DOI: 10.5281/zenodo.7892112.
- Four articles promoting new digital humanities publications for “Nowości Badawcze NCK” 1/2021.
- Mischke, D. Choiński, M., Byszuk, J, Göbel, M. Network Analysis and Spatial Stylometry in American Drama Studies, DH2020 Book of Abstracts, Ottawa, ADHO, 2020
- Byszuk, J., Stylometry Expertise Conclusion: Assessing authorship of an anonymous Persian qasida in: Khismatulin, A. “Amir Mu‘izzi Nishapuri. The Siyasat-nama/Siyar al-muluk: A Fabrication Ascribed to Nizam al-Mulk”. The series: THE PERSIAN MIRRORS FOR PRINCES WRITTEN IN THE SALJUQ PERIOD: ORIGINALS AND FABRICATIONS (I). St. Petersburg: Peterburgskoe Vostokovedenie; Moscow: Sadra, 2020. pp. 176-178 online access: Stylometry Expertise Conclusion on Assessing authorship of an anonymous Persian qasida
- Frontini, F., Brando C., Byszuk J., Galleron I., Santos D. & Stanković R. Named Entity Recognition for Distant Reading in ELTeC. CLARIN Annual Conference 2020 Proceedings, p. 37-41, (online).
- A book review of Reading beyond the female: The relationship between perception of author gender and literary quality. In: Socjolingiwistyka, 33. 2019. (online)
- Hernandez Lorenzo, L., Byszuk, J. (2019) Challenging Stylometry: The Authorship of the Baroque Play La Segunda Celestina. In Digital Humanities 2019: Book of Abstracts. University of Utrecht.
- Eder M., Byszuk, J. (2019) Feature Selection in Authorship Attribution: Ordering the Wordlist. In: Digital Humanities 2019: Book of Abstracts. University of Utrecht.
- Ochab, J.K., Byszuk, J., Pielström, S. and Eder, M. (2019) Identifying Similarities in Text Analysis: Hierarchical Clustering (Linkage) versus Network Clustering (Community Detection). In: Digital Humanities 2019: Book of Abstracts. University of Utrecht. Utrecht.
- blog post: Using ‘Stylo’ with languages other than English
- Byszuk, J. (2018) Tracing Showrunners’ Impact. Book of Abstracts AIUCD 2018. 190-192.
Talks
Invited talks
- “Towards multimodal stylometry – possibilities and challenges of new approach to film and TV series analysis” w ramach Brave New Humanities? A Novel Perceptions Symposium on Computational Literary Studies, 8 IV 2022, online (link).
- “Towards multimodal stylometry – possibilities and challenges of new approach to film and TV series analysis” for Natural Language Processing Seminar 2021–2022 at the Institute of Computer Science, Polish Academy of Sciences, 6 XII 2021, online.
- “What can be measured with stylometry? On language, creativity and numbers.”, Bielefeld colloquium “digital linguistics / computational linguistics”, 12 X 2021, Bielefeld.
- “The Voices of Doctor Who – How Stylometry Can be Useful in Revealing New Information About TV Series”, for the seminars of the Centre for Digital Humanities at the University of Groningen, 9 III 2021, online.
- “Direct speech for multilingual corpora some problems and one possible solution”, SIG_DLS Workshop: Tool Criticism 3.0. Present, past, and future methods in Digital Literary Stylistics, ADHO Special Interest Group for Digital Literary Studies, 20 VII 2020, online.
- “Literary Studies: Textual analysis and stylometry with WebSty”, CLARIN Café III CLARIN for Researchers, 8 VII 2020, online.
- “Stylometry in textual analysis and beyond”, Colloquium in Digital Cultural Heritage, 22 I 2020, Köln.
- “AI in Computational Linguistics and Humanities”, Giersch Symposium “AI for Science”, 18-22 XI 2019, Frankfurt am Main.
Peer-reviewed conference talks (selection):
-
Herrmann, J. B., Byszuk, J. and Grisot, G. (2022). Using word embeddings for validation and enhancement of spatial entity lists. Digital Humanities 2022: Conference Abstracts. Tokyo: University of Tokyo, pp. 239–41, https://dh2022.dhii.asia/dh2022bookofabsts.pdf
-
Byszuk, J., Kunda, B. (2022) Coping Strategies Used by Male Young Adults in Contemporary TV Series, Discourses of Fictional (Digital) TV Series
-
Byszuk, J. (2022) What Makes a Captain: Quantitative Analysis of Discourses of Power across Star Trek Series, Discourses of Fictional (Digital) TV Series
-
Šeļa, Artjoms, Ben Nagy, Joanna Byszuk, Laura Hernández-Lorenzo, Botond Szemes, and Maciej Eder (2022) From stage to page: language independent bootstrap measures of distinctiveness in fictional speech, Workshop on Computational Drama Analysis: Achievements and Opportunities
-
Artjoms Šeļa, Joanna Byszuk, Bartlomiej Kunda, Laura Hernández-Lorenzo, Botond Szemes, Maciej Eder. (2022). Imagined differences: approaches to variation in fictional character voices in literary history. @ Closing conference of the COST Action Distant Reading for European Literary History.
-
Ranka Stanković, Diana Santos, Carmen Brando, Gábor Palkó, Joanna Byszuk. (2022). Distant Reading of ELTeC text collection through Named Entities. @ Closing conference of the COST Action Distant Reading for European Literary History
- “Challenging Stylometry: The Authorship of the Baroque Play La Segunda Celestina” with Laura Hernandez Lorenzo (Universidad de Sevilla), DH 2019, 9-12 VII 2019, Utrecht. (Abstract, Repository with code and extra materials).
- “Feature Selection in Authorship Attribution: Ordering the Wordlist” with Maciej Eder, DH 2019, 9-12 VII 2019, Utrecht. (Abstract)
- “Identifying Similarities in Text Analysis: Hierarchical Clustering (Linkage) versus Network Clustering (Community Detection)” with Jeremi K. Ochab, Maciej Eder and Steffen Pielström (University of Würzburg), DH 2019, 9-12 VII 2019, Utrecht. (Abstract, Repository with code and extra materials).
- “Attribution of Authorship for Medieval Persian Quasidas with Stylometry” with Alexey Khismatulin (Institute of Oriental Manuscripts, Russian Academy of Sciences), #Right2Left Workshop, 8 VI 2019, Victoria BC. link to presentation
- “Older language learner: a comparative corpus study of FL performance and learning materials”, poster with Karolina Czopek (Uniwersytet Warszawski), 4th CLARe Conference, 28 II - 2 III 2019, Helsinki.
- “Enhanced digital editions: retrieving POS tags from pre-digital word indexes”, EADH Conference, 7-9 XII 2018, Galway. Link to presentation
- “Who is the author? Modeling creative relations in television writing”, EADH Conference, 7-9 XII 2018, Galway. Link to presentation
- “Analysis of cross-lingual semantic change in professional discourse with quantitative methods”, Qualico 2018, Wrocław. Link to poster
- “Zipf’s law and subsets of lexis”, long paper with Maciej Eder, Rafał L. Górski, Qualico 2018, Wrocław. Link to presentation
- “Tracing Showrunner’s Impact”, 7th AIUCD Conference, 31 I – 2 II 2018, Bari.
- “The Voices of Doctor Who”, April Conference Fourteen, 20-22 IV 2017, Kraków.
- “Jak (po angielsku) pisze polski programista?”, 11. Studenckie Warsztaty Tłumaczeniowe, 9-11 V 2017, Kraków.