Publication: Transformers analyzing poetry: multilingual metrical pattern prediction with transfomer-based language models
dc.contributor.author | Rosa, Javier de la | |
dc.contributor.author | Pérez Pozo, Álvaro | |
dc.contributor.author | Sisto, Mirella de | |
dc.contributor.author | Hernández, Laura | |
dc.contributor.author | Díaz, Aitor | |
dc.contributor.author | Ros, Salvador | |
dc.contributor.author | González Blanco, Elena | |
dc.contributor.funder | Horizon 2020 Framework Programme | |
dc.contributor.funder | European Commission | |
dc.contributor.ror | https://ror.org/02jjdwm75 | |
dc.date.accessioned | 2024-07-08T13:15:03Z | |
dc.date.available | 2024-07-08T13:15:03Z | |
dc.date.issued | 2023 | |
dc.description.abstract | The splitting of words into stressed and unstressed syllables is the foundation for the scansion of poetry,a process that aims at determining the metrical pattern of a line of verse within a poem. Intricate language rules and their exceptions,as well as poetic licenses exerted by the authors,make calculating these patterns a nontrivial task. Some rhetorical devices shrink the metrical length,while others might extend it. This opens the door for interpretation and further complicates the creation of automated scansion algorithms useful for automatically analyzing corpora on a distant reading fashion. In this paper,we compare the automated metrical pattern identification systems available for Spanish,English,and German,against fine-tuned monolingual and multilingual language models trained on the same task. Despite being initially conceived as models suitable for semantic tasks,our results suggest that transformers-based models retain enough structural information to perform reasonably well for Spanish on a monolingual setting,and outperforms both for English and German when using a model trained on the three languages,showing evidence of the benefits of cross-lingual transfer between the languages. © 2021,The Author(s). | |
dc.description.fundingtype | See http://postdata.linhd.uned.es/ . Starting Grant research project Poetry Standardization and Linked Open Data: POSTDATA (ERC-2015-STG-679528) funded by the European Research Council (https://erc.europa.eu) (ERC) under the research and innovation program Horizon2020 of the European Union. This research was supported by the project Poetry Standardization and Linked Open Data (POSTDATA) (ERC-2015-STG-679528) obtained by Elena González-Blanco and funded by an European Research Council ( https://erc.europa.eu ) Starting Grant under the Horizon2020 Program of the European Union. This research was supported by the project Poetry Standardization and Linked Open Data (POSTDATA) (ERC-2015-STG-679528) obtained by Elena González-Blanco and funded by an European Research Council (https://erc.europa.eu) Starting Grant under the Horizon2020 Program of the European Union. | |
dc.description.keyword | Digital humanities | |
dc.description.keyword | Language models | |
dc.description.keyword | Natural language processing | |
dc.description.keyword | Poetry | |
dc.format | application/pdf | |
dc.identifier.citation | De la Rosa, J., Pérez, Á., De Sisto, M., Hernández, L., Díaz, A., Ros, S., & González-Blanco, E. (2023). Transformers analyzing poetry: multilingual metrical pattern prediction with transfomer-based language models. Neural Computing and Applications, 1-6. | |
dc.identifier.doi | https://doi.org/10.1007/s00521-021-06692-2 | |
dc.identifier.issn | 9410643 | |
dc.identifier.officialurl | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85119484453&doi=10.1007%2fs00521-021-06692-2&partnerID=40&md5=841b8f12c5044537de5920989364e7c2 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14417/3136 | |
dc.issue.number | 25 | |
dc.journal.title | Neural Computing and Applications | |
dc.language.iso | eng | |
dc.page.final | 18176 | |
dc.page.initial | 18171 | |
dc.page.total | 23 | |
dc.publisher | Springer Science and Business Media Deutschland GmbH | |
dc.relation.entity | IE University | |
dc.relation.projectID | H2020: 679528 | |
dc.relation.school | IE School of Science & Technology | |
dc.rights | Attribution 4,0 International | |
dc.rights.accessRights | info:eu-repo/semantics/openAccess | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.subject | Digital humanities; Language models; Natural language processing; Poetry | |
dc.subject.keyword | Digital humanities | |
dc.subject.keyword | Language models | |
dc.subject.keyword | Natural language processing | |
dc.subject.keyword | Poetry | |
dc.subject.other | Computational linguistics | |
dc.subject.other | Natural language processing systems | |
dc.subject.other | Distant readings | |
dc.subject.other | Language processing | |
dc.subject.other | Natural languages | |
dc.subject.other | Non-trivial tasks | |
dc.subject.other | Pattern identification | |
dc.subject.other | Splittings | |
dc.subject.other | Semantics | |
dc.title | Transformers analyzing poetry: multilingual metrical pattern prediction with transfomer-based language models | |
dc.type | info:eu-repo/semantics/article | |
dc.version.type | info:eu-repo/semantics/publishedVersion | |
dc.volume.number | 35 | |
dspace.entity.type | Publication | |
person.identifier.scopus-author-id | 55203236600 | |
person.identifier.scopus-author-id | 57222025738 | |
person.identifier.scopus-author-id | 57224541472 | |
person.identifier.scopus-author-id | 57222031867 | |
person.identifier.scopus-author-id | 57348530700 | |
person.identifier.scopus-author-id | 7005158730 | |
person.identifier.scopus-author-id | 57190153324 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Transformers analyzing poetry multilingual metrical pattern prediction with transfomer-based language models.pdf
- Size:
- 234.85 KB
- Format:
- Adobe Portable Document Format