TomoCPT: a generalizable model for 3D particle detection and localization in cryo-electron tomograms
| dc.contributor.author | Shah, Pranav | |
| dc.contributor.author | Sánchez García, Ruben | |
| dc.contributor.author | Stuart, David | |
| dc.contributor.funder | UK Medical Research Council | |
| dc.contributor.funder | Wellcome Trust | |
| dc.contributor.ror | https://ror.org/02jjdwm75 | |
| dc.date.accessioned | 2026-03-05T11:35:40Z | |
| dc.date.issued | 2025-02 | |
| dc.description.abstract | Cryo-electron tomography is a rapidly developing field for studying macromolecular complexes in their native environments and has the potential to revolutionize our understanding of protein function. However, fast and accurate identification of particles in cryo-tomograms is challenging and represents a significant bottleneck in downstream processes such as subtomogram averaging. Here, we present tomoCPT (Tomogram Centroid Prediction Tool), a transformer-based solution that reformulates particle detection as a centroid-prediction task using Gaussian labels. Our approach, which is built upon the SwinUNETR architecture, demonstrates superior performance compared with both conventional binary labelling strategies and template matching. We show that tomoCPT effectively generalizes to novel particle types through zero-shot inference and can be significantly enhanced through fine-tuning with limited data. The efficacy of tomoCPT is validated using three case studies: apoferritin, achieving a resolution of 3.0 Å compared with 3.3 Å using template matching, SARS-CoV-2 spike proteins on cell surfaces, yielding an 18.3 Å resolution map where template matching proved unsuccessful, and rubisco molecules within carboxysomes, reaching 8.0 Å resolution. These results demonstrate the ability of tomoCPT to handle varied scenarios, including densely packed environments and membrane-bound proteins. The implementation of the tool as a command-line program, coupled with its minimal data requirements for fine-tuning, makes it a practical solution for high-throughput cryo-ET data-processing workflows. | |
| dc.description.peerreviewed | Yes | |
| dc.description.sponsorship | Computation used the Oxford Biomedical Research Computing (BMRC) facility, a joint development between the Centre for Human Genetics and the Big Data Institute supported by Health Data Research UK and the National Institute for Health (NIHR) Oxford Biomedical Research Centre. Financial support was provided by the Wellcome Trust Core Award Grant No. 03141/Z/16/Z. The OPIC electron microscopy facility was founded by a Wellcome Trust JIF award (060208/Z/00/Z). This work was supported by the UK Medical Research Council grant to DIS (MR/N00065X/1). | |
| dc.description.status | Published | |
| dc.format | application/pdf | |
| dc.identifier.citation | Shah, P. N., Sanchez-Garcia, R., & Stuart, D. I. (2025). TomoCPT: a generalizable model for 3D particle detection and localization in cryo-electron tomograms. Biological Crystallography, 81(2). https://doi.org/10.1107/S2059798325000865 | |
| dc.identifier.doi | https://doi.org/10.1107/S2059798325000865 | |
| dc.identifier.issn | 2059-7983 | |
| dc.identifier.officialurl | https://journals.iucr.org/d/issues/2025/02/00/sor5001/index.html | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14417/4224 | |
| dc.journal.title | Acta Crystallographica Section D - Structural Biology | |
| dc.language.iso | eng | |
| dc.page.final | 76 | |
| dc.page.initial | 63 | |
| dc.page.total | 13 | |
| dc.publisher | International Union of Crystallography | |
| dc.relation.department | Sci Tech (Data Science) | |
| dc.relation.entity | IE University | |
| dc.relation.projectid | MR/N00065X/1 | |
| dc.relation.projectid | 03141/Z/16/Z | |
| dc.relation.projectid | 060208/Z/00/Z | |
| dc.relation.school | IE School of Science & Technology | |
| dc.rights | Attribution 4.0 International | |
| dc.rights.accessRights | info:eu-repo/semantics/openAccess | |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject.keywords | cryo-ET | |
| dc.subject.keywords | particle picking | |
| dc.subject.keywords | subtomogram averaging. | |
| dc.subject.ods | ODS 3 - Salud y bienestar | |
| dc.subject.ods | ODS 9 - Industria, innovación e infraestructura | |
| dc.subject.unesco | 24 Ciencias de la Vida::2406 Biofísica | |
| dc.title | TomoCPT: a generalizable model for 3D particle detection and localization in cryo-electron tomograms | |
| dc.type | info:eu-repo/semantics/article | |
| dc.version.type | info:eu-repo/semantics/publishedVersion | |
| dc.volume.number | 81 | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 911e42ad-19b7-4b92-9890-9c8213aa65bd | |
| relation.isAuthorOfPublication.latestForDiscovery | 911e42ad-19b7-4b92-9890-9c8213aa65bd |
