Type 2 Diabetes PBMC Ancestry Paper
Focus
This project integrates multi-modal PBMC data — genotyping, scRNA-seq, and scATAC-seq — to identify immune features associated with type 2 diabetes and to trace those features to their genetic origins via ancestry-aware analysis.
Ancestry separation is itself based on genotyping data, so ancestry-associated immune differences can be resolved to specific genetic markers — bridging from cell-state phenotypes down to the variants that may drive them. The project asks which T2D-associated differences in PBMC composition, chromatin accessibility, and gene expression carry a genetic signal that varies with ancestry.
Working Research Question
How do PBMC immune profiles — across chromatin accessibility, gene expression, and genotype — differ in type 2 diabetes, and which of those differences carry a genetic signal that can be resolved through ancestry-aware analysis rather than reflecting cohort composition, environment, disease severity, medication exposure, or technical confounders?
Paper Goals
- Identify reproducible PBMC immune changes reported in T2D across modalities (chromatin accessibility, transcription, genotype).
- Distinguish cell-composition shifts from per-cell transcriptional or functional changes.
- Use genotyping data to anchor ancestry separation and resolve ancestry-associated immune differences to specific genetic markers.
- Track whether ancestry-associated differences are directly tested, inferred indirectly, or not evaluated.
- Separate biological ancestry (genetic markers) from social, environmental, and healthcare-access variables in the argument.
- Build a source-backed outline with citations attached to each claim.
Core Pages
- Manuscript Folder
- Introduction
- Methods
- Results
- Discussion
- Research Results Folder
- PBMC Immune Changes in Type 2 Diabetes
- Single-Cell PBMC Profiling in Type 2 Diabetes
- Ancestry-Associated Immune Differences
- Paper Evidence Map: T2D PBMC Ancestry
- Gu et al. 2024 Single-Cell PBMCs in T2D
- Tkachenko et al. 2024 Deciphering the Transcriptomic Landscape of T2D
- Tkachenko et al. 2025 Cross-Study Blood Transcriptomes in T2D
- Blood Transcriptome Meta-Analysis in Type 2 Diabetes
- Cross-Study Heterogeneity in T2D Blood Transcriptomics
- T2D Blood Transcriptomics Biomarker Evidence
- Li et al. 2025 Immunometabolic Alterations in T2D
- T2D T-Cell Immunometabolic Subtypes
- T-Cell-Monocyte Communication in T2D
- T2D Transcription Factor Activity
- T2D Drug Enrichment From Single-Cell Subtypes
- GSE244515
- Literature Note Template
- Huang et al. 2022 Exploring Biomarkers in T2D
- Tang et al. 2026 Identification of Signature Genes in T2D
- Markelova et al. 2025 Genetic Heterogeneity of T2D Across Russian Ancestry Groups (our prior work, same cohort)
- Partitioned Polygenic Scores in T2D
- Ancestry-Specific T2D Genetic Mechanisms
- East Asian T2D β-Cell Dysfunction Paradigm
- T2D Ethnic Differences in Incretin Action
- Zhao et al. 2025 Single-Cell PBMCs in T2D
- NF-kB and IFN-Gamma Signaling
- GSE255566
Ingested Literature
- Gu et al. 2024 — Korean PBMC scRNA-seq, scTCR-seq, and scBCR-seq study reporting inflammatory monocyte states, cytotoxic T-cell expansion, and B-cell differentiation changes in T2D.
- Tkachenko et al. 2024 — medRxiv preprint from a Russian cohort generating bulk blood RNA-seq (n=18) and PBMC scRNA-seq (n=4) from T2D and control participants. Reports NK cell depletion in T2D PBMCs (contrasting with Gu et al. 2024), increased CD4+ TCM/naive cells, and 146 bulk blood DEGs with PCA showing disease status does not explain most transcriptomic variance. Data accessions: GSE280401 (scRNA-seq), GSE280402 (bulk RNA-seq).
- Tkachenko et al. 2025 — cross-study meta-analysis of eight T2D blood RNA-seq datasets reporting low individual-study concordance, 2065 meta-analysis DEGs, and pathway themes involving neutrophil effector biology, ERAD, mTOR, oxidative stress, and RNA splicing.
- Li et al. 2025 — PBMC scRNA-seq reanalysis using GSE268210 T2D samples and GSE244515 controls, reporting T-cell metabolic subtypes, stronger T-cell-monocyte communication, TF activity patterns, machine-learning subtype classifiers, and drug-enrichment hypotheses.
- Huang et al. 2022 — islet RNA-seq and pancreatic scRNA-seq study identifying SLC2A2, SERPINF1, RASGRP1, and CHL1 as T2D diagnostic biomarkers, with a pancreatic fibroblast SERPINF1-NR2F2 regulatory axis. Uses pancreatic tissue, not PBMCs.
- Tang et al. 2026 — pancreatic islet scRNA-seq + LASSO study identifying PNLIP, BUB1, CTSB, and NAMPT as T2DM signature genes, with qRT-PCR validation in peripheral blood from a Chinese cohort. Uses islet tissue as primary discovery and blood for validation; does not test ancestry effects.
- Markelova et al. 2025 — our prior genotyping study of the same cohort (Chechen, Tatar, Yakut) demonstrating ancestry-specific distributions of T2D genetic clusters using partitioned polygenic scores. Yakuts show beta-cell dysfunction dominance, while Chechens and Tatars show obesity/insulin-resistance patterns. Provides the pre-existing genetic backbone for the current PBMC project, enabling within-person integration of pPGS with PBMC immune features.
- Yabe et al. 2015
- Zhao and Fang 2025 — Frontiers in Immunology PBMC scRNA-seq study from a small Chinese cohort (3 T2DM, 3 healthy controls), reporting T-cell and monocyte DEG/pathway findings involving TNF/NF-kB, interferon-gamma response, T-cell receptor signaling, chemokine signaling, and TNFRSF1A-centered network interactions. Data accession: GSE255566. Does not test ancestry effects. — review establishing T2D in East Asians as β-cell-dysfunction-dominant with lower insulin resistance, contrasting with the insulin-resistance-dominant paradigm in Europeans. Documents East-Asian-specific T2D genetic loci (KCNQ1, UBE2E2, C2CD4A/B, PTPRD, SRR, SPRY2, CDC123) and greater incretin-based therapy efficacy in Asians. Provides the pathophysiological framework for ancestry-specific T2D mechanisms that directly aligns with our same-cohort pPGS findings from Markelova et al. 2025.
Manuscript Draft
The manuscript/ folder contains the actual paper draft text, split into section notes. Use concepts/, references/, and synthesis/ as supporting material, but keep paper-ready prose in manuscript/.
Research Results
The research-results/ folder contains findings generated from this project’s own data analyses. Use it for analysis summaries, QC summaries, model outputs, figure/table result notes, and data-derived observations before they are converted into polished manuscript prose in Results.
Source Intake Workflow
- Put PDFs, abstracts, exported citations, or rough notes in
_raw/. - Ingest each source into
references/using the literature note template. - Promote cross-source conclusions into
synthesis/only when supported by multiple references. - Keep uncertainty explicit when ancestry is a proxy for unmeasured variables.
Claim Discipline
- Use
Evidence:lines with wikilinks to source notes for every manuscript claim. - Mark claims as
established,mixed,single-study, orhypothesisuntil enough sources are reviewed. - Do not collapse ancestry, race, ethnicity, and geography into one construct unless a source does so and the limitation is stated.
Open Questions
- Which PBMC cell types show the most consistent T2D-associated differences?
- Does the NK cell depletion in T2D PBMCs observed by Tkachenko et al. 2024 (Russian cohort, n=2/group scRNA-seq) replicate in this project’s scRNA-seq data, or does it follow the Gu et al. 2024 NK trend? This discrepancy could serve as a useful replication test.
- Are observed immune differences driven by altered cell proportions, activation states, gene expression, cytokine production, or metabolic rewiring?
- Which studies include ancestry-aware design or ancestry-stratified analysis?
- Which results replicate across ancestries, and which appear ancestry-specific?
- What covariates are consistently controlled: age, sex, BMI, glycemia, medication, infection status, diet, socioeconomic variables, and batch?
- Which ancestry-associated PBMC signals in this project overlap cross-study T2D blood transcriptomic themes versus appearing project-specific?
- How should the manuscript distinguish whole-blood neutrophil signals from PBMC-intrinsic immune signatures?
- Which Li et al. 2025 immunometabolic subtype signals replicate in independent T2D PBMC cohorts rather than reflecting reuse of GSE268210?
- Do ancestry-associated findings in this project align with T-cell metabolic subtype differences, or are they independent axes of variation?
- Which T2D-associated chromatin accessibility peaks (scATAC-seq) colocalize with ancestry-informative genetic variants?
- Do expression quantitative trait loci (eQTL) or chromatin QTLs in PBMCs explain ancestry-stratified T2D immune differences?
- Are there genetic markers (SNPs, haplotypes) — beyond admixture proportion — that directly associate with T2D PBMC cell-state variation?
- Do ancestry-specific T2D genetic mechanisms (beta-cell vs. obesity/insulin-resistance dominant) correlate with specific PBMC immune profiles within the same cohort?
- Are pPGS distributions from our prior analysis consistent when the same genotyping-based ancestry framework is applied alongside PBMC immune data? (Ancestry inference is from genotyping, not PBMC data — the question is whether the genetic backbone remains stable when integrated with the new multi-omic layer.)
- Could the pPGS approach be extended to partition PBMC immune features by their genetic architecture, analogous to Smith et al.’s T2D cluster framework?
- Do the TNF/NF-kB, interferon-gamma, T-cell receptor, and chemokine pathway themes from Zhao and Fang 2025 appear in this project’s ancestry-aware PBMC scRNA-seq or scATAC-seq analyses?