Усилитель | Группа изоляторов Цзясин

Nature Genetics, том 54, страницы 1919–1932 (2022 г.) Процитировать эту статью

20 тысяч доступов

31 цитат

54 Альтметрика

Подробности о метриках

Остается неясным, почему острое истощение CTCF (CCCTC-связывающего фактора) и когезина лишь незначительно влияет на экспрессию большинства генов, несмотря на существенное нарушение трехмерного (3D) сворачивания генома на уровне доменов и структурных петель. Чтобы решить эту загадку, мы использовали Micro-C высокого разрешения и профилирование зарождающихся транскриптов в эмбриональных стволовых клетках мыши. Мы обнаружили, что взаимодействия энхансер-промотор (E-P) в значительной степени нечувствительны к острому (3-часовому) истощению CTCF, когезина или WAPL. YY1 был предложен в качестве структурного регулятора петель E-P, но острое истощение YY1 также оказывает минимальное влияние на петли E-P, транскрипцию и сворачивание трехмерного генома. Поразительно, что визуализация одиночных молекул живых клеток показала, что истощение когезина снижает связывание фактора транскрипции (TF) с хроматином. Таким образом, хотя CTCF, cohesin, WAPL или YY1 не требуются для кратковременного поддержания большинства взаимодействий E-P и экспрессии генов, наши результаты показывают, что cohesin может способствовать более эффективному поиску и связыванию TF мишеней.

Высокопроизводительные анализы на основе захвата хромосомной конформации (Hi-C) изменили наше понимание трехмерного сворачивания генома1,2. На основании таких исследований можно выделить как минимум три уровня трехмерной складки генома. Во-первых, геном разделен на компартменты A и B, которые в основном соответствуют активным и неактивным сегментам хроматина соответственно и выглядят как плед-подобный паттерн на картах контактов Hi-C3. Во-вторых, белки CTCF и cohesin помогают складывать геном в топологически ассоциированные домены (TADs)4,5 и структурные петли хроматина6, вероятно, посредством экструзии петель ДНК7,8. В-третьих, в гораздо более тонком масштабе транскрипционные элементы участвуют в дальнодействующих взаимодействиях хроматина, таких как взаимодействия E-P и промотор-промотор (P-P), с образованием локальных доменов9,10,11.

Элегантные эксперименты, сочетающие острое истощение белка CTCF, когезина и когезин-регуляторных белков с подходами Hi-C или визуализации, выявили роль CTCF и когезина в регуляции первых двух уровней: TADs и компартментов12,13,14,15,16. Однако Hi-C неэффективен для захвата третьего уровня сворачивания 3D-генома: мелкомасштабных транскрипционно важных взаимодействий E-P/P-P9,17,18. Наше понимание роли CTCF и cohesin в регуляции экспрессии генов в основном пришло из генетических экспериментов, сосредоточенных на нескольких локусах развития19,20,21. Таким образом, оставалось неясным, регулирует ли, когда, где и как CTCF/cohesin взаимодействия E-P/P-P и экспрессию генов.

Недавно мы сообщили, что Micro-C может эффективно решать сверхтонкие 3D-складки генома с разрешением нуклеосомы22,23, включая взаимодействия E-P/P-P9,17. В настоящем исследовании мы использовали Micro-C, секвенирование иммунопреципитации хроматина (ChIP-seq), общее секвенирование РНК (RNA-seq) и зарождающуюся РНК-seq24, чтобы систематически исследовать, как резко истощаются CTCF, RAD21 (субъединица когезина), WAPL ( разгрузчик когезина) или YY1 (предполагаемый структурный белок25) влияет на регуляторные взаимодействия генов с хроматином и транскрипцию в эмбриональных стволовых клетках мыши (mESCs). Наконец, сосредоточение внимания на динамике YY1 выявило неожиданную роль cohesin в облегчении связывания TF.

Наше предыдущее исследование использовало Micro-C, чтобы показать, что мелкомасштабная 3D-структура генома хорошо коррелирует с транскрипционной активностью, образуя «точки» или «петли» (см. «Методы терминологии») на пересечениях E-P и P-P9. В настоящем исследовании мы идентифицировали более 75 000 статистически значимых петель в мЭСК, используя недавно разработанную программу вызова петель Mustache26 (рис. 1a) или Chromosight27 (расширенные данные, рис. 1a), что примерно в 2,5 раза больше, чем в нашем предыдущем отчете9,26 и около 4 × больше, чем у Hi-C26,28 (расширенные данные, рис. 1б). Путем анализа локального состояния хроматина в якорях петель (расширенные данные, рис. 1c, d) мы разделили эти петли на петли когезина (~ 13 735), петли E-P (~ 20 369), петли P-P (~ 7 433) и поликомб. -ассоциированные контакты (~700) (рис. 1а,б) со средним размером ~160 т.п.н. для петель когезина и ~100 т.п.н. для петель E-P/P-P (расширенные данные, рис. 1e).

75,190 chromatin dots/loops, subclassified into four primary types (Mustache loop caller26; see Methods and Supplementary Note). b, Probability distribution of loop strength for cohesin, E–P, P–P and random loops. Chromatin loop numbers are shown on the left. The box plot indicates the quartiles for the loop strength score distribution (min. = lower end of line, Q1 = lower bound of box, Q2 = line in box, Q3 = higher bound of box and max. = higher end of line). Genome-wide averaged contact signals (aggregate peak analysis (APA)) are plotted on the right. The contact map was normalized by matrix balancing and distance (Obs/Exp), with positive enrichment in red and negative signal in blue, shown as the diverging color map with the gradient of normalized contact enrichment in log10. The ratio of contact enrichment for the center pixels is annotated within each plot. This color scheme and normalization method are used for normalized matrices throughout the manuscript unless otherwise mentioned. Loop anchors are annotated as ‘C’ for CTCF/cohesin, ‘P’ for promoter and ‘E’ for enhancer. Asterisks denote a P < 10−16 using two-sided Wilcoxon’s signed-rank test. The data are presented in the same format and color scheme throughout the manuscript unless otherwise indicated (n = 37 biological replicates)9. c, Genome-wide averaged transcript counts for nascent transcript profiling. Genes are grouped into high, medium and low expression levels based on nascent RNA-seq data (gene body) and rescaled to the same length from TSS (transcription start site) to poly(adenylation) cleavage site (PAS) or TES (transcription end site) on the x axis. d, Rank-ordered distribution of loop strength against gene expression for cohesin, E–P and P–P loops. Gene expression levels for the corresponding chromatin loop were calculated by averaging the genes with TSSs located ±5 kb around the loop anchors. Loop strength was obtained from the same analysis shown in b. The distribution for each loop type was fitted and smoothed by LOESS (locally estimated scatterplot smoothing) regression. Error bands indicate fitted curve ± s.e.m. with 95% confidence interval (CI). e, APAs are plotted by paired E–P/P–P loops and sorted by the level of nascent transcription into high, mid and low levels./p>90% of CTCF peaks and 60% of cohesin peaks are significantly decreased on loss of CTCF (Padj < 0.05; Fig. 3e and Extended Data Fig. 3g). Despite the substantial loss of cohesin peaks, biochemical fractionation experiments show that the fraction of RAD21 associated with chromatin remains fairly constant 3 h after CTCF degradation (Extended Data Fig. 2f, green box). Thus, our results are in line with the widely accepted conclusion that CTCF positions cohesin43. On the other hand, loss of cohesin affects a subset of CTCF binding (Fig. 3c,d)13, resulting in ~20% reduction in the number of CTCF peaks (Fig. 3e) and a slight decrease in its global chromatin association (Extended Data Fig. 2f, blue box)./p> 0.1 µm2 s−1), which can be separated further into slow (Dslow ~0.1–2 µm2 s−1) and fast moving (Dfast > 2 µm2 s−1). Scale bar, 1 μm. f, Aggregate likelihood of diffusive YY1 molecules. Top, bar graph showing fractions of YY1 binned into bound, slow- and fast-diffusing subpopulations. Bottom, YY1 diffusion coefficient estimation by regular Brownian motion with marginalized localization errors. g, Western blots of cytoplasmic (Cyt) and nuclear proteins dissociating from chromatin at increasing salt concentrations (Extended Data Fig. 2b). A subpopulation (~30%) of YY1 stays on chromatin, resisting 1 M washes. Ins, insoluble pellet after sonication; Son, sonicated, solubilized chromatin. Percentage of total shows the signal intensity of the indicated fractions divided by the total signal intensity. Anti-histone 2B controls for chromatin integrity during fractionation. h, FRAP analysis of YY1 bleached with a square spot. Error bars are fitted curve ± s.e.m. with 95% CI. i, Slow-SPT measuring YY1 residence time. Individual molecules were tracked at 100-ms exposure time to blur fast-moving molecules into the background and capture stable binding. The unbinding rate is obtained by fitting a model to the molecules’ survival curve. Each datapoint indicates the unbinding rate of YY1 molecules in a single cell. The box plot shows quartiles of data. Error bars are mean ± s.d. j. Slow-SPT measures YY1’s residence time at multiple exposure times./p>90% depletion after 3 h of IAA treatment (Fig. 7a and Extended Data Fig. 9a). Despite the high degradation efficiency, neither YY1’s nuclear distribution nor its clustering was strongly affected after acute loss of CTCF and cohesin in either live or fixed cells (Fig. 7b,c and Extended Data Fig. 9b). This suggests that the maintenance of YY1 hubs is independent of CTCF and cohesin./p>82% of these loci were associated with promoter regions (Fig. 7f and Extended Data Fig. 9d,e). In contrast, both CTCF and WAPL depletion had a negligible effect on YY1 occupancy (Fig. 7f and Extended Data Fig. 9d,e). In biochemical fractionation analysis, we also observed a similar, though less pronounced, reduction in YY1 chromatin association after RAD21 depletion (Extended Data Fig. 9f). To test whether cohesin facilitates the target search of TFs in general, we performed spaSPT on additional TFs. We thus generated RAD21–AID cell lines stably expressing either HaloTag-conjugated SOX2 or KLF4 and found that the bound fraction of both TFs was reduced by ~20% after 3-h cohesin degradation (Extended Data Fig. 9g). These results suggest that cohesin probably facilitates chromatin binding of TFs in general./p>20% of E–P/P–P loops can cross TAD boundaries and retain high contact probability and transcriptional activity (Fig. 2)18,35; (2) only a very small handful of genes showed altered expression levels after CTCF, cohesin or WAPL depletion (Fig. 3)12,13,14,15,16; (3) CTCF and cohesin loops are both rare (~5% of the time) and dynamic (median lifetime ~10–30 min)34; (4) most of the E–P/P–P loops persist after depletion of these structural proteins (Fig. 4)39,63; (5) CTCF/cohesin generally does not colocalize with transcription loci67; and (6) E–P loops and transcription can be established before CTCF/cohesin interactions on mitotic exit71, in some cases even with no CTCF/cohesin expression36,65,66. Second, YY1 was proposed to be a master structural regulator of E–P interactions25 (Fig. 8, Model 2). However, our Micro-C data are inconsistent with this model, because acute YY1 depletion has little effect on E–P/P–P interactions or gene expression. It is still possible that YY1 specifically connects development-related chromatin loops during neural lineage commitment47, but is less important in the pluripotent state. In summary, we conclude that, in mESCs, CTCF, cohesin, WAPL or YY1 is not generally required for the short-term maintenance of most E–P interactions and the subsequent expression of most genes after acute depletion and loss of function./p>

2. Full lists of DEGs are available in Supplementary Table 11./p>2). Full lists of DEGs are available in Supplementary Table 12./p> 100 & intensity > 100 & sigma < 220 & uncertainty_xy < 50; (2) merge: Max distance = 10 & Max frame off = 1 & Max frames = 0; and (3) remove duplicates enabled. This setting combines the blinking molecules into one and removes the multiple localizations in a frame./p>

20 kb). b. Micro-C reproducibility tests. Top: pairwise similarity scores measured by GenomeDisco between UT vs. IAA and UT vs. UT samples using 10-kb resolution of Micro-C matrices. Bottom: similarity scores measured by QuASAR between replicates (light lines) or comparing the UT and IAA-treated samples (dark lines) using Micro-C matrices at 250-kb, 50-kb, 25-kb, and 10-kb resolutions. c. Genome-wide contact decaying P(s) analysis (bottom) and slope distributions of the P(s) curves (top) for UT cells. d. Micro-C contact maps at specific regions or at genome-wide scale across multiple resolutions in the UT and IAA-treated cells. Left to right: examples of Pearson’s correlation matrices showing plaid-like chromosome compartments; saddle plots showing overall compartment strength (A-A: bottom-right; B-B: top left); differential saddle plots showing changes in compartment strength; contact matrices showing TADs along the diagonal; ADA showing all TADs; differential ADA showing TAD strength changes. e. Slope distribution of P(s) curves for UT and IAA-treated cells. Dashed lines highlight the range of genome distances affected by CTCF, RAD21, or WAPL depletion. CTCF depletion had minimal impact on overall interactions across the genome. RAD21 depletion reduced contact frequencies in the range of 10–200 kb but increased interactions at 300 kb – 5 Mb. WAPL depletion showed the opposite trend, with increased contacts at 70–700 kb but reduced contacts at 1–5 Mb. f. Scatter plot of cohesin loops scores in UT and IAA-treated cells. The overlaid heatmap indicates dot density (red: highest, blue: lowest). Dashed lines along the diagonal delimit unchanged loops. g. Loop numbers called by Mustache for UT and IAA-treated cells. The additional loops (n = 5764) identified after WAPL depletion show longer lengths, with a 570-kb median. h. APA for loops across multiple ranges of genomic distance in UT and IAA-treated cells./p> 10), suggesting that while CTCF and cohesin are required for the transcriptional maintenance of only a small subset of genes, those genes tend to require the presence of both factors. Statistical test: Fisher’s exact test. g. Snapshots of Micro-C maps comparing chromatin interactions in the UT (top-right) and IAA-treated (bottom-left) cells surrounding Klf4 locus. Contact maps are annotated with gene boxes and 1D chromatin tracks showing the ChIP-seq signal enrichment in the same region./p>20 kb) interactions. j. Genome-wide contact decaying P(s) analysis (bottom) and slope distributions of the P(s) curves (top) for UT cells. k. MA plot of total RNA-seq and nascent RNA-seq for YY1 degron 3 to 24 hours after IAA treatment. l. Scatter plots of loop scores (quantified using 2-kb-resolution Micro-C data) plotted for E-P or P-P loops in UT and IAA-treated cells. APA for YY1, E-P, or P-P anchored loops plotted for the ΔYY1 degron cell line in UT and IAA-treated cells. m. Micro-C maps comparing chromatin interactions in UT and IAA-treated ΔYY1 cells surrounding Nes gene./p>