Deep Generative Model Driven Protein Folding Simulations

dc.contributor.authorMa, Heng
dc.contributor.authorBhowmik, Debsindhu
dc.contributor.authorLee, Hyungro
dc.contributor.authorTurilli, Matteo
dc.contributor.authorYoung, Michael
dc.contributor.authorJha, Shantenu
dc.contributor.authorRamanathan, Arvind
dc.contributor.rorhttps://ror.org/02jjdwm75
dc.date.accessioned2026-02-26T17:50:54Z
dc.date.issued2020
dc.description.abstractSignificant progress in computer hardware and software have enabled molecular dynamics (MD) simulations to model complex biological phenomena such as protein folding. However, enabling MD simulations to access biologically relevant timescales (e.g., beyond milliseconds) still remains challenging. These limitations include (1) quantifying which set of states have already been (sufficiently) sampled in an ensemble of MD runs, and (2) identifying novel states from which simulations can be initiated to sample rare events (e.g., sampling folding events). With the recent success of deep learning and artificial intelligence techniques in analyzing large datasets, we posit that these techniques can also be used to adaptively guide MD simulations to model such complex biological phenomena. Leveraging our recently developed unsupervised deep learning technique to cluster protein folding trajectories into partially folded intermediates, we build an iterative workflow that enables our generative model to be coupled with all-atom MD simulations to fold small protein systems on emerging high performance computing platforms. We demonstrate our approach in folding Fs-peptide and the β-β- α (BBA) fold, FSD-EY. Our adaptive workflow enables us to achieve an overall root-mean squared deviation (RMSD) to the native state of 1.6 Å and 4.4 Å respectively for Fs-peptide and FSD-EY. We also highlight some emerging challenges in the context of designing scalable workflows when data intensive deep learning techniques are coupled to compute intensive MD simulations.
dc.description.peerreviewedYes
dc.description.statusPublished
dc.formatapplication/pdf
dc.identifier.citationMa, H., Bhowmik, D., Lee, H., Turilli, M., Young, M. T., Jha, S., & Ramanathan, A. (2019). Deep generative model driven protein folding simulation. https://doi.org/10.3233/APC200023
dc.identifier.doihttps://doi.org/10.3233/APC200023
dc.identifier.isbn978-1-64368-071-2
dc.identifier.officialurlhttps://ebooks.iospress.nl/volumearticle/53902
dc.identifier.publicationParallel Computing: Technology Trends
dc.identifier.urihttps://hdl.handle.net/20.500.14417/4180
dc.language.isoeng
dc.page.total11
dc.publisherIOS Press
dc.relation.departmentSci Tech (Data Science)
dc.relation.entityIE University
dc.relation.schoolIE School of Science & Technology
dc.rightsAttribution-NonCommercial 4.0 International
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/
dc.subject.odsODS 9 - Industria, innovación e infraestructura
dc.subject.unesco33 Ciencias Tecnológicas::3306 Ingeniería y tecnología eléctricas
dc.titleDeep Generative Model Driven Protein Folding Simulations
dc.typeinfo:eu-repo/semantics/bookPart
dc.version.typeinfo:eu-repo/semantics/publishedVersion
dspace.entity.typePublication
relation.isAuthorOfPublication4c105a3d-3b6f-4801-b0a6-7593ef9017d2
relation.isAuthorOfPublication.latestForDiscovery4c105a3d-3b6f-4801-b0a6-7593ef9017d2

Bloque original

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
APC-36-APC200023.pdf
Tamaño:
859.84 KB
Formato:
Adobe Portable Document Format

Bloque de licencias

Mostrando 1 - 1 de 1
Cargando...
Miniatura
Nombre:
license.txt
Tamaño:
1.71 KB
Formato:
Item-specific license agreed to upon submission
Descripción: