TY - JOUR
T1 - ChromaFold predicts the 3D contact map from single-cell chromatin accessibility
AU - Gao, Vianne R.
AU - Yang, Rui
AU - Das, Arnav
AU - Luo, Renhe
AU - Luo, Hanzhi
AU - McNally, Dylan R.
AU - Karagiannidis, Ioannis
AU - Rivas, Martin A.
AU - Wang, Zhong-Min
AU - Barisic, Darko
AU - Karbalayghareh, Alireza
AU - Wong, Wilfred
AU - Zhan, Yingqian A.
AU - Chin, Christopher R.
AU - Noble, William S.
AU - Bilmes, Jeff A.
AU - Apostolou, Effie
AU - Kharas, Michael G.
AU - Beguelin, Wendy
AU - Viny, Aaron D.
AU - Huangfu, Danwei
AU - Rudensky, Alexander Y.
AU - Melnick, Ari M.
AU - Leslie, Christina S.
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/11/1
Y1 - 2024/11/1
N2 - Identifying cell-type-specific 3D chromatin interactions between regulatory elements can help decipher gene regulation and interpret disease-associated non-coding variants. However, achieving this resolution with current 3D genomics technologies is often infeasible given limited input cell numbers. We therefore present ChromaFold, a deep learning model that predicts 3D contact maps, including regulatory interactions, from single-cell ATAC sequencing (scATAC-seq) data alone. ChromaFold uses pseudobulk chromatin accessibility, co-accessibility across metacells, and a CTCF motif track as inputs and employs a lightweight architecture to train on standard GPUs. Trained on paired scATAC-seq and Hi-C data in human samples, ChromaFold accurately predicts the 3D contact map and peak-level interactions across diverse human and mouse test cell types. Compared to leading contact map prediction models that use ATAC-seq and CTCF ChIP-seq, ChromaFold achieves state-of-the-art performance using only scATAC-seq. Finally, fine-tuning ChromaFold on paired scATAC-seq and Hi-C in a complex tissue enables deconvolution of chromatin interactions across cell subpopulations.
AB - Identifying cell-type-specific 3D chromatin interactions between regulatory elements can help decipher gene regulation and interpret disease-associated non-coding variants. However, achieving this resolution with current 3D genomics technologies is often infeasible given limited input cell numbers. We therefore present ChromaFold, a deep learning model that predicts 3D contact maps, including regulatory interactions, from single-cell ATAC sequencing (scATAC-seq) data alone. ChromaFold uses pseudobulk chromatin accessibility, co-accessibility across metacells, and a CTCF motif track as inputs and employs a lightweight architecture to train on standard GPUs. Trained on paired scATAC-seq and Hi-C data in human samples, ChromaFold accurately predicts the 3D contact map and peak-level interactions across diverse human and mouse test cell types. Compared to leading contact map prediction models that use ATAC-seq and CTCF ChIP-seq, ChromaFold achieves state-of-the-art performance using only scATAC-seq. Finally, fine-tuning ChromaFold on paired scATAC-seq and Hi-C in a complex tissue enables deconvolution of chromatin interactions across cell subpopulations.
KW - Animals
KW - CCCTC-Binding Factor/metabolism
KW - Chromatin Immunoprecipitation Sequencing/methods
KW - Chromatin/metabolism
KW - Deep Learning
KW - Humans
KW - Mice
KW - Single-Cell Analysis/methods
UR - https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=purepublist2023&SrcAuth=WosAPI&KeyUT=WOS:001346598500013&DestLinkType=FullRecord&DestApp=WOS_CPL
UR - http://www.scopus.com/inward/record.url?scp=85208290714&partnerID=8YFLogxK
U2 - 10.1038/s41467-024-53628-0
DO - 10.1038/s41467-024-53628-0
M3 - Article
C2 - 39487131
SN - 2041-1723
VL - 15
SP - 9432
JO - Nature Communications
JF - Nature Communications
IS - 1
M1 - 9432
ER -