Unveiling inflammatory and prehypertrophic cell populations as key contributors to knee cartilage degeneration in osteoarthritis using multi-omics data integration

Objectives Single-cell and spatial transcriptomics analysis of human knee articular cartilage tissue to present a comprehensive transcriptome landscape and osteoarthritis (OA)-critical cell populations. Methods Single-cell RNA sequencing and spatially resolved transcriptomic technology were applied to characterise the cellular heterogeneity of human knee articular cartilage collected from 8 OA donors and 3 non-OA control donors (19 samples in total). The novel chondrocyte populations and marker genes of interest were validated by immunohistochemistry staining, quantitative real-time PCR, etc. The OA-critical cell populations were validated through integrative analyses of publicly available bulk RNA sequencing data and large-scale genome-wide association studies. Results We identified 33 cell population-specific marker genes that define 11 chondrocyte populations, including 9 known populations and 2 new populations, that is, the pre-inflammatory chondrocyte population (preInfC) and the inflammatory chondrocyte population (InfC). The novel findings include: (1) the novel InfC activates the mediator MIF-CD74; (2) the prehypertrophic chondrocyte (preHTC) and hypertrophic chondrocyte (HTC) are potentially OA-critical cell populations; (3) most OA-associated differentially expressed genes reside in the articular surface and superficial zone; (4) the prefibrocartilage chondrocyte (preFC) population is a major contributor to the stratification of patients with OA, resulting in both an inflammatory-related subtype and a non-inflammatory-related subtype. Conclusions Our results highlight InfC, preHTC, preFC and HTC as potential cell populations to target for therapy. We also conclude that profiling of those cell populations in patients might be used to stratify patient populations for defining cohorts for clinical trials and precision medicine.


Osteoarthritis
Traditional multi-omics studies have been primarily conducted to understand the molecular processes that are potentially involved in disease progression at the population level. 12 For example, genomic data were used to identify OA-associated genetic variants, 13 14 such as rs375575359 and rs11335718; transcriptomic data were used to detect key molecular differences or characterise the subtypes of patients with OA, 15 such as MMP13, MMP2 and CYTL1; epigenomic data were used to reflect how epigenetic modifications regulate gene expression; 16 and proteomic data were used to determine the protein-level consequences of changes in genomics, transcriptomics or epigenomics, 17 18 for example, CRTAC1 protein in plasma was considered a promising candidate biomarker for OA. However, conventional multi-omics sequencing technologies are only capable of measuring the average expression level of hundreds of thousands of cells together, 19 20 largely ignoring the cellular heterogeneity of chondrocytes in articular cartilage tissues.
With the emergence of single-cell sequencing technologies 21 22 and spatially resolved technologies, 23 unbiased and high-throughput profiling of tissue architecture at single-cell and spatial resolution is now feasible. This presents an unprecedented opportunity to uncover the cellular or spatial heterogeneity of cartilage tissue and delineate the differentiation trajectories of chondrocytes, thereby greatly facilitating the identification of novel or rare chondrocyte populations for maintaining cartilage homeostasis and preventing the development and progression of OA. Recently, several single-cell omics studies have been conducted to unravel the cellular heterogeneity of cartilage tissues, 10 24 25 revealing that chondrocytes in cartilage can exhibit diverse compositions, contrary to the long-held view that cartilage consists of only a single dominant cell type. Of note, the role of chondrocytes depends on the type of cell population and on the spatial architecture of cartilage, 26 27 which comprises the articular surface (AS), superficial zone (SZ), middle zone (MZ) and deep zone (DZ). Therefore, a comprehensive characterisation of chondrocyte composition and the spatial landscape of cartilage tissue in a high-resolution, untargeted and unbiased manner is an important step towards understanding the molecular mechanisms of chondrocyte differentiation, proliferation, 28 senescence, 29 autophagy 30 and apoptosis 31 in patients with OA. In light of this, a comprehensive transcriptional landscape of chondrocytes is urgently needed to systematically investigate cellular heterogeneity and spatial heterogeneity.
Meanwhile, numerous large-scale traditional multi-omics studies involving genome-wide association studies (GWAS) and bulk RNA sequencing (RNA-seq) studies have been performed, identifying >100 variants 14 32-34 and a plethora of OA-associated differentially expressed (DE) genes 11 that potentially relate to the pathogenic mechanisms and drug targets of OA. However, due to cellular heterogeneity and interpatient variance, it remains unknown which molecular subtypes of chondrocyte populations represent the reproducible stratification of patients with OA, and which chondrocyte populations may essentially contribute to the development and progression of OA.
Here, we collected 19 cartilage samples from 8 patients with OA and 3 normal controls and profiled a total of 135 896 chondrocytes by the droplet-based single-cell RNA sequencing (scRNA-seq) protocol (ie, 10x Genomics 35 ) to investigate the cellular heterogeneity of human knee articular cartilage tissue. Additionally, 124 spots from 9 cartilage samples from 4 patients with OA and 1 normal control were subjected to spatial transcriptomic analysis by laser microdissection-based RNA sequencing (Geo-seq 36 ) to investigate the spatial heterogeneity of human knee articular cartilage tissue. We performed an integrative analysis of scRNA-seq data and Geo-seq data to define the spatial landscape of chondrocytes, to identify zone-specific genes, to detect zone-specific DE genes and to enrich the functional pathways associated with markers and DE genes. Next, we performed an integrative analysis of scRNA-seq data and two bulk RNA-seq datasets to stratify patients with OA into two distinct subtypes. Finally, we combined the scRNA-seq data with two large-scale GWAS to infer the OA-critical chondrocyte populations related to genetic variants and the chondrocyte populations that confer the risk of OA. Overall, our findings revealed the prominent role of critical cell populations and tissue architectures in the pathophysiology of OA, which will greatly expand our understanding of the populations and cellular heterogeneity of cartilage tissue from patients with OA and non-OA controls, and help stratify the subtypes of patients with OA for the design of clinical trials.

Single-cell transcriptional landscape of human knee cartilage in patients with OA and non-OA controls
To comprehensively present the cellular landscape of knee cartilage tissue, we measured the single-cell transcriptomic profiles of chondrocytes from a total of 19 cartilage samples (online supplemental table S1; n=135 896 chondrocytes after stringent quality control, online supplemental table S2), which were collected from 8 patients with OA (n=119 389 chondrocytes) and 3 non-OA controls (n=16 507 chondrocytes). In particular, the OA samples were split into weight-bearing (WB) cartilage (n=62 611 chondrocytes) from the medial condyle of the femur and non-weight-bearing (NWB) cartilage (n=56 778 chondrocytes) from the intact lateral condyle of the femur to serve as internal controls (figure 1A). The transcriptomic profiling of chondrocytes was performed using 10x Genomics 35 (see 'Materials and methods' section).
We performed an integrative analysis of multiple scRNA-seq datasets to investigate the cellular heterogeneity of chondrocytes from multiple samples (online supplemental figure S1A) and finally classified the chondrocytes into a total of 11 putative cell populations (figure 1B). Specifically, we observed that four cell populations (ie, FC, InfC, preInfC and preHTC) were dominant in the patients with OA (ie, WB and NWB), while one cell population (ie, HomC) was dominant in non-OA controls (figure 1C). Notably, we found that two novel chondrocyte populations, pre-inflammatory chondrocytes (preInfC, 533 cells; IFI16 and IFI27) and inflammatory chondrocytes (InfC, 1499 cells; CD74, CXCL8 and GPR183), were strongly enriched in both WB and NWB samples (WB>NWB; figure 1C), indicating that these two populations may potentially contribute to the pathogenesis of OA.
Next, we performed a trajectory inference analysis to examine the differentiation relationship of these populations (see 'Materials and methods' section). We found that all chondrocytes from patients with OA and non-OA controls were distributed along a single lineage trajectory (online supplemental figure S1B). Consistent with previous studies, 10 ProCs were initially located at the start of the trajectory, followed by HTCs, ECs and HomCs, whereas FCs, preInfCs and InfCs were mainly distributed at the end of the trajectory, indicating that chondrocytes exhibited fibrosis and inflammation at the differentiated destination, which is one of the hallmarks of OA. 37 38

Figure 1 The single cell landscape of human knee cartilage in WB, NWB and non-OA controls. (A) Schematic of workflows for transcriptomic profiling of human articular cartilage using scRNA-seq. The scRNA-seq data (n=11, 5 females and 6 males) were assayed by 10x Genomics protocols, including 8 paired WB and NWB samples, and 3 non-OA controls. (B) The top panel shows the visualisation of chondrocyte populations using UMAP, where a total of 135 896 chondrocytes were analysed by the integrative analysis of 8 paired WB and NWB samples and 3 non-OA controls. There is a total of 11 populations, that is, RegC (regulator chondrocytes, green), ProC (proliferation chondrocytes, yellow), HomC (homeostasis chondrocytes, purple), preHTC (azure), EC (effector chondrocytes, red), FC (fibrocartilage chondrocytes, blue), HTC (orange), RepC (reparative chondrocytes, aqua), preFC (pink), preInfC (crimson) and InfC (brown). Each cell population was annotated by a given marker gene list explored in this study.

Cell population-specific marker genes for annotating chondrocytes of cartilage tissue
The chondrocytes of knee cartilage generally demonstrated a high correlation within populations (online supplemental figure S2A) but a low correlation between samples (online supplemental figure S2B), which makes it highly challenging to identify cell population-specific marker genes for annotating the putative chondrocyte populations. As far as we know, apart from the canonical markers, for example, COL10A1 for HTC and COL1A1 and COL1A2 for FC, there have not been any marker genes available that can reliably distinguish putative chondrocyte populations. With the large-scale and multisource scRNA-seq data, we finally identified 33 marker genes from the top 50 population-specific marker genes (online supplemental table S3), which were applicable for annotating cell populations under different cartilage statuses (online supplemental figure S1C).
Specifically, we found that the genes C11orf96, BMP2 and HMGA1 exhibited distinct expression levels for ProC (figure 1D). ProC was enriched in the expression of genes related to chondrocyte differentiation, wound healing and cytoplasmic translation pathways (online supplemental figure S2C; online supplemental table S4) and marked by the transcription factors FOSL1 and RELB (online supplemental figure S2D). To validate the presence of ProCs in cartilage tissue, we performed BMP2 IHC staining on patients with OA and non-OA controls. BMP2, as a member of the transforming growth factor-β (pro-inflammatory cytokines) superfamily and an anabolic factor, was positively observed throughout all zones of knee articular cartilage (figure 1E), implying that these chondrocytes were effective in promoting cartilage repair and regeneration. 39 Although COL10A1 is a canonical marker for HTC, we also added two further markers, IBSP and SPP1, as candidates for HTC; these two markers have been demonstrated to be highly expressed in terminal hypertrophic chondrocytes. 40 HTC was enriched in the expression of genes related to extracellular matrix (ECM) organisation and ossification (online supplemental figure S2C; online supplemental table S4). The IHC staining confirmed that IBSP was highly expressed in the DZ of patients with OA and non-OA controls (figure 1F). Meanwhile, we identified both CHI3L1 and CHI3L2 as markers for RegC, which was related to catabolic processes such as the amino sugar catabolic process (online supplemental figure S2C; online supplemental table S4). To validate the expression levels of CHI3L1, we also performed IHC staining on patients with OA and non-OA controls (figure 1G).
In addition, we defined new markers for preFC, that is, COL27A1, WWP2 and PLCG2. COL27A1 (Chr9:116,909,146 T/A, p=3.4e-13) was genetically associated with knee pain in large-scale GWAS, 41 and WWP2 is essential for maintaining cartilage homeostasis and protecting cartilage against OA. 42 Therefore, preFC may potentially be a critical cell population in alleviating cartilage lesions.
Notably, the other two novel cell populations, that is, preInfC and InfC, were uniquely identified in the patients with OA. Subsequently, we performed IFI27 IHC staining on patients with OA and non-OA controls, demonstrating that IFI27 is highly expressed in the SZ of patients with OA (p=4.0e-4, figure 1H).
In summary, we identified 33 representative marker genes for annotating putative chondrocyte populations (table 1). To validate the efficacy of these candidate markers in annotating cell populations, we applied them to another two publicly available scRNA-seq datasets. 10 25 The results demonstrated the effectiveness of the marker gene set in accurately annotating cell populations across distinct sequencing technologies, including Smart-seq2 10 (online supplemental figure S3A) and 10x Genomics 25 (online supplemental figure S3B). Notably, the previously defined marker genes from studies 10 25 did not exhibit consistent and accurate annotation of scRNA-seq data in this research (online supplemental figures S3C and S3D).
Together, we found that four cell populations, that is, preHTC, FC, preInfC and InfC, were dominant in the patients with OA. We then asked the following questions: (1) how these chondrocytes are spatially distributed in the cartilage tissue of both patients with OA and non-OA controls; (2) which chondrocyte populations are OA-critical populations that may contribute to the degeneration of cartilage tissue.

Spatially resolved transcriptional landscape of human knee cartilage in patients with OA and non-OA controls
Knee articular cartilage is divided into different zones (AS, SZ, MZ and DZ) based on the organisation of the tissue and the degree of alignment of the collagen fibers 27 (online supplemental figure S4A). To comprehensively explore the spatial landscape of chondrocytes on human knee articular cartilage, we carried out laser capture microdissection (LCM) 36 coupled with full-length messenger RNA (mRNA) sequencing (Geo-seq; see 'Materials and methods' section) on nine cartilage samples from four patients with OA (ie, paired WB and NWB) and one non-OA control (figure 2A). The details of sample collection from each zone can be found in online supplemental table S5.
To illustrate the relationship among the four zones, we computed the cosine similarity between distinct zones across the different cartilage statuses. Notably, we observed that AS and SZ were strongly correlated with each other within both NWB and WB. Furthermore, AS and SZ each showed substantial correlation between NWB and WB (figure 2B). In contrast, the four zones from the non-OA controls showed overall distinct gene expression profiles, indicating their distinctive roles in preserving cartilage homeostasis.
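As a minimal illustration of the cosine similarity used to compare zones, the following Python sketch computes pairwise similarities between zone-level expression profiles. The profile values and the function names (`cosine_similarity`, `similarity_matrix`) are hypothetical and are not taken from the study's data or code; how expression was summarised per zone before the comparison is described in the 'Materials and methods' section and is not reproduced here.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two expression vectors of equal length."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine similarity is undefined for an all-zero vector")
    return dot / (norm_u * norm_v)


def similarity_matrix(profiles):
    """Pairwise cosine similarity between all named profiles."""
    names = list(profiles)
    return {
        (a, b): cosine_similarity(profiles[a], profiles[b])
        for a in names
        for b in names
    }


# Hypothetical mean expression of four genes per zone (toy values, not study data).
zone_profiles = {
    "AS": [5.0, 1.0, 0.0, 2.0],
    "SZ": [4.0, 1.5, 0.5, 2.0],
    "DZ": [0.0, 3.0, 6.0, 1.0],
}

sims = similarity_matrix(zone_profiles)
```

In this toy example, the AS and SZ profiles point in nearly the same direction and therefore score close to 1, whereas AS and DZ score much lower; cosine similarity ignores overall magnitude, so it compares expression patterns rather than total expression.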
Figure 2 The spatial transcriptomic landscape of human knee cartilage in patients with OA and non-OA controls. (A) Schematic of workflows for transcriptomic profiling of human articular cartilage using Geo-seq (n=5, 2 females and 3 males), including 4 paired WB (53 spots) and NWB (50 spots) and a non-OA control (21 spots). Articular cartilages are separated into four zones, that is, AS, SZ, MZ and DZ based on the organisation and degree of alignment of the collagen fibers. (B) The heatmap shows the transcriptional similarity among four zones from different cartilage conditions. Unlike control samples, AS and SZ zones from OA samples are highly correlated. (C) Cell compositions for each zone mapped to the spatial landscape by CIBERSORTx. The dot plot shows the cell composition difference between patients with OA and non-OA controls. Gene signature matrices were constructed based on scRNA-seq data. The dot size represents the proportion of each chondrocyte population in a particular zone and the colour represents the −log10 (p value) (permutation t-test). I(k) represents an indicator function, where I(k)=1 if a chondrocyte population is abundant in the patient with OA and I(k)=−1 if a chondrocyte population is abundant in the non-OA controls. The dot which represents the highest cell proportion for each chondrocyte population across four zones is highlighted by dotted circles. (D) The heatmap displays the expression levels of the top 10 zone-specific genes in AS (red), SZ (purple), MZ (orange) and DZ (green) for non-OA control (upper panel) or OA samples (bottom panel). The Gene Ontology terms were enriched by the zone-specific genes of non-OA controls (upper panel) or patients with OA (bottom panel). Zone-specific genes of MZ and DZ were not enriched in any OA-related meaningful pathways. AS, articular surface; DZ, deep zone; Geo-seq, laser microdissection-based RNA sequencing; InfC, inflammatory chondrocyte; LCM, laser capture microdissection; MZ, middle zone; NWB, non-weight-bearing; OA, osteoarthritis; preFC, prefibrocartilage chondrocyte; preHTC, prehypertrophic chondrocyte; preInfC, pre-inflammatory chondrocyte; scRNA-seq, single-cell RNA sequencing; SZ, superficial zone; WB, weight-bearing.

Since the Geo-seq data do not achieve single-cell resolution, to comprehensively resolve the distribution of chondrocytes in the four different zones, we performed a deconvolution analysis of the Geo-seq data using CIBERSORTx 46 (see 'Materials and methods' section). As a result, we observed that preHTCs were mainly located in the AS of OA samples but in the DZ of non-OA controls; HTCs were mainly located in the DZ of all samples, which was consistent with the aforementioned IHC staining results (figure 1F); and InfCs were mainly located in the MZ of OA samples but were absent in non-OA controls (figure 2C). Furthermore, we noted that the compositions of preHTC, HTC, InfC and RegC exhibited great changes between the OA group and the non-OA control group. Specifically, the proportion of preHTC for patients with OA and non-OA controls in AS was 30.3% and 7.74%, respectively (p=4.76e-3 using permutation t-test); the proportion of HTC in DZ was 3.01% and 8.51% (p=9.62e-3); the proportion of RegC in AS was 14.9% and 41.7% (p=0.027) and the proportion of InfC in MZ was 6.07% and 0% (p=0.024).
In contrast, we noted that the proportions of preFC in the AS and SZ exhibited substantial variations between WB and NWB (online supplemental figure S4B). In particular, the cell proportions of preFC in the AS were 18.8% and 31.7%, respectively (p=0.013), while those in the SZ were 20.4% and 41.7%, respectively (p=4.25e-5).
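The group comparisons of cell proportions above rely on a permutation test. A minimal Python sketch of a two-sample permutation test is given below under explicit assumptions: it uses the absolute difference in group means as the test statistic (a simplification, not necessarily the exact t statistic used in the study), the function name `permutation_test` is hypothetical, and the input proportions are toy values rather than study data.

```python
import random


def permutation_test(x, y, n_permutations=10000, seed=0):
    """Two-sided permutation p value for the difference in group means."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    observed = abs(sum(x) / len(x) - sum(y) / len(y))
    pooled = list(x) + list(y)
    n_x = len(x)
    extreme = 0
    for _ in range(n_permutations):
        # Randomly reassign group labels by shuffling the pooled values.
        rng.shuffle(pooled)
        perm_x, perm_y = pooled[:n_x], pooled[n_x:]
        diff = abs(sum(perm_x) / len(perm_x) - sum(perm_y) / len(perm_y))
        # Small tolerance guards against floating-point rounding in the sums.
        if diff >= observed - 1e-12:
            extreme += 1
    # Add-one correction so the estimated p value is never exactly zero.
    return (extreme + 1) / (n_permutations + 1)


# Hypothetical per-sample proportions of one population in one zone (toy values).
oa_props = [0.31, 0.28, 0.35, 0.30, 0.27]
control_props = [0.08, 0.07, 0.10, 0.06, 0.09]
p_value = permutation_test(oa_props, control_props)
```

Because the labels are reassigned at random, the p value is an estimate whose precision depends on the number of permutations; with very small groups, the smallest attainable p value is bounded by the number of distinct label assignments.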

The zone-specific genes of human knee cartilage in patients with OA and non-OA controls
Next, we sought to comprehensively examine the zone-specific genes of the four zones of human knee articular cartilage. To this end, we performed zone-specific gene detection using DESeq2 47 (see 'Materials and methods' section; log2FC≥1 and adjusted p<0.05). As a result, we identified 561, 190, 245 and 737 zone-specific genes for AS, SZ, MZ and DZ of non-OA controls, respectively, and 2777, 694, 1418 and 691 zone-specific genes for AS, SZ, MZ and DZ of patients with OA, respectively (online supplemental table S6).
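The selection rule stated above (log2FC≥1 and adjusted p<0.05) can be sketched in Python as follows. This is an illustrative sketch, not the study's pipeline: it assumes Benjamini-Hochberg adjustment of raw p values (the default adjustment in DESeq2), and the helper names (`benjamini_hochberg`, `zone_specific_genes`), gene labels and numbers are all hypothetical.

```python
def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted p values, returned in the input order."""
    n = len(p_values)
    order = sorted(range(n), key=lambda i: p_values[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p value down, enforcing monotonicity.
    for rank in range(n, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * n / rank)
        adjusted[idx] = running_min
    return adjusted


def zone_specific_genes(results, lfc_cutoff=1.0, alpha=0.05):
    """Genes passing both the fold-change and the adjusted p value cutoffs."""
    padj = benjamini_hochberg([r["pvalue"] for r in results])
    return [
        r["gene"]
        for r, q in zip(results, padj)
        if r["log2fc"] >= lfc_cutoff and q < alpha
    ]


# Hypothetical test results for one zone versus the others (toy values).
results = [
    {"gene": "GENE_A", "log2fc": 2.1, "pvalue": 0.001},
    {"gene": "GENE_B", "log2fc": 0.4, "pvalue": 0.0005},
    {"gene": "GENE_C", "log2fc": 1.5, "pvalue": 0.2},
    {"gene": "GENE_D", "log2fc": 1.0, "pvalue": 0.02},
]

selected = zone_specific_genes(results)
```

Here GENE_B is excluded despite its small p value because its fold change is below the cutoff, and GENE_C is excluded because its adjusted p value is too large; both conditions must hold for a gene to be called zone-specific.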
In particular, we observed that the zone-specific genes showed distinct expression patterns between patients with OA and non-OA controls (figure 2D) and were enriched in biologically meaningful Gene Ontology (GO) terms (online supplemental table S7). For example, glycoprotein-related genes, that is, COMP, CLU and BGN, were commonly highly expressed in the AS of both OA and non-OA controls. In addition, the protein folding-related genes (eg, HSPA8, HSPB8) were also highly expressed in the AS of both OA and non-OA controls, and were commonly enriched in the cellular response to unfolded protein pathway, suggesting that the chondrocytes in the AS are essential for ECM formation. However, chondrocyte proliferation-associated genes such as S100A4 and CDKN1A, and hypoxia-associated genes such as VEGFA and EGR1, together with PRG4, were uniquely highly expressed in the AS of patients with OA, suggesting a higher proliferative activity and stress in OA cartilage samples. CRTAC1, a candidate biomarker for OA, was also found to be highly expressed in the AS of patients with OA. Moreover, the SZ-specific genes of non-OA controls contributed to energy metabolism, whereas the SZ-specific genes of patients with OA were involved in cell cycle-associated pathways, suggesting that the chondrocytes in the SZ entered a proliferative cell cycle to compensate for damage in the AS. Finally, MZ-specific and DZ-specific markers of both patients with OA and non-OA controls were not enriched in any biologically meaningful pathways. Therefore, AS and SZ are molecularly unique among all zones, whereas MZ and DZ are more ubiquitous. These results are consistent with a previous zone-specific gene expression analysis of articular cartilage. 48
Next, we examined the zone-specific genes for WB and NWB. We identified 359, 704, 1847 and 230 genes for AS, SZ, MZ and DZ of NWB, respectively, and 3539, 514, 729 and 782 genes for AS, SZ, MZ and DZ of WB, respectively (online supplemental table S6). Specifically, we found that AS-specific genes from WB and NWB were enriched in similar pathways, including protein folding and hypoxia-related pathways (online supplemental figure S4C). In addition, AS-specific genes from WB samples were further involved in apoptotic signalling pathways, with highly expressed MAPK9, IFI6 and BCL10, etc. Moreover, SZ-specific genes from NWB were mainly enriched in cell cycle-associated pathways, while SZ-specific genes from WB were mainly enriched in autophagy-related pathways (online supplemental table S7). Similarly, we observed that MZ-specific and DZ-specific markers of both NWB and WB samples were not enriched in any biologically meaningful pathways.

Cell population InfC is uniquely identified in patients with OA
With the results from the scRNA-seq and Geo-seq data, we observed that the InfC population specifically expressed the marker genes CD74, GPR183 and CXCL8 and was spatially distributed in the MZ of patients with OA (figure 2C). We performed both CD74 and GPR183 IHC staining on patients with OA and non-OA controls to validate their expression levels. As a result, we noted that these genes were significantly expressed in the SZ and MZ of patients with OA (figure 3A). To examine the origin of such novel chondrocytes, we performed trajectory inference 49 to infer the pseudotime along chondrocyte differentiation (see 'Materials and methods' section). We found that preInfCs and InfCs were located at the termination point of the lineage (online supplemental figure S1B), indicating that InfCs may be differentiated from the preInfCs.
To further confirm the differentiation trajectory of InfCs, we statistically grouped the chondrocytes into four clusters based on transcriptional changes along the pseudotime axis (online supplemental figure S5A). Specifically, cluster 1 exhibited significant expression of marker genes for ProC (eg, HMGA1, BMP2, etc) and marker genes for HomC (eg, DDIT3, HSPA6, etc) as well as cell proliferation and cell cycle-associated genes such as CDKN1A and G0S2. Cluster 2 displayed upregulation of marker genes associated with RegC, HTC, RepC and EC, primarily linked to ECM organisation, cartilage development and tissue homeostasis. In contrast, the marker genes for preHTC (eg, CRTAC1 and ABI3BP) and preFC (eg, COL27A1) were upregulated in cluster 3, which were associated with ECM organisation, glycosaminoglycan and ion channels. Finally, the markers of preInfC (eg, IFI27), InfC (eg, CD74 and GPR183) and FC (eg, COL1A1) were highly expressed in cluster 4, which were enriched in angiogenesis and inflammatory response pathways, such as neutrophil activation and leucocyte migration (online supplemental figure S5B). Overall, these results further confirmed that the inflammatory signature was mainly present in the chondrocytes that were located at the termination of the differentiation process.

InfC-induced MIF-CD74 signalling may be an important regulator in the pathogenesis of OA
Figure 3 […] where the AS and SZ were hardly distinguished during IHC staining; therefore, the AS and SZ in Geo-seq results were merged and presented only as SZ in IHC staining. (B) A heatmap shows cell-cell communications in patients with OA. The grey bar on the y-axis denotes the relative strength of the predicted signalling pathway in the overall network while the colour bar on the x-axis represents the relative contribution of the chondrocyte population to the overall signalling milieu (top x-axis) for patients with OA. MIF was ranked as the strongest signalling pathway that interacted with chondrocyte populations in OA samples. (C) Quantitative real-time PCR was performed to measure the relative mRNA levels of MIF, CD74, IL-1β and TNF in patients with OA and controls (replicates, n=6). (D) Quantitative real-time PCR was performed to measure the relative mRNA levels of CD74, MIF, IL-1β and TNF after CD74 overexpression with LV (replicates, n=6). (E) The visualisation of the subpopulations of preInfC using UMAP projections, that is, preInfC-1 (brown), preInfC-2 (blue) and preInfC-3 (orange), where a total of 533 chondrocytes were analysed together by the integrative analysis of patients with OA. The marker genes of each subpopulation were also annotated. (F) The visualisation of the subpopulations of InfC using UMAP projections, that is, InfC-1 (cyan), InfC-2 (crimson), InfC-3 (dark green) and InfC-4 (green), where a total of 1499 chondrocytes were analysed together by the integrative analysis of patients with OA. (G) The violin plot shows the inflammatory score for the subpopulations of preInfC and InfC. The inflammatory scores for each chondrocyte were calculated through the average expression of a predefined inflammatory-associated gene set. The inflammatory scores gradually increased from preInfC to InfC. AS, articular surface; DZ, deep zone; IHC, immunohistochemistry; IL, interleukin; InfC, inflammatory chondrocyte; MIF, macrophage migration inhibitory factor; mRNA, messenger RNA; MZ, middle zone; NS, not significant; OA, osteoarthritis; preInfC, pre-inflammatory chondrocyte; SZ, superficial zone; TNF, tumour necrosis factor; UMAP, uniform manifold approximation and projection. *p<0.05, **p<0.01, ***p<0.001, ****p<0.0001.

To systematically investigate the cell-cell communications of chondrocytes, we then performed ligand-receptor (L-R) interaction analyses using CellChat 50 (see 'Materials and methods' section). This analysis unveiled 15 potential L-R interactions relevant to patients with OA (figure 3B) and identified 11 potential L-R interactions linked to non-OA controls (online supplemental figure S6A). Additionally, our investigation revealed that multiple inflammatory signalling pathways were only involved in the OA samples. In particular, preInfC and InfC were identified as both sender and receiver through chemokine signalling pathways, for example, CCL and CXCL (figure 3B). The analysis of L-R interactions revealed that MIF triggered pro-inflammatory effects via two distinct pairs, MIF-CD74+CD44 and MIF-CD74+CXCR4, particularly observed in InfC communicating with other chondrocyte populations (online supplemental figure S6B). Previous studies have shown elevated MIF protein levels in OA cartilage, and MIF deletion has been associated with reduced OA severity. 52 Notably, our findings emphasised that the interaction of MIF with CD74 expressed by the InfC population may contribute to the pathogenesis of OA. Furthermore, we observed that the chondrocyte populations in non-OA controls also engaged in the MIF pathway, with preHTC acting as the primary receiver population (online supplemental figure S6A). However, this interaction used a distinct L-R pair, MIF-ACKR3 (online supplemental figure S6C). The activation of the PI3K-Akt pathway by MIF-ACKR3 further supported chondrocyte proliferation and hindered hypertrophic differentiation, 53 54 reinforcing the pro-inflammatory role of the InfC and preInfC populations in the cellular milieu of cartilage tissue in patients with OA.
Moreover, for additional confirmation, we pursued two distinct validation approaches. First, we measured the relative mRNA levels of marker genes for InfCs using quantitative real-time PCR (qRT-PCR), revealing a significant upregulation of MIF and CD74 in OA chondrocytes (figure 3C). Second, we investigated the biological function of CD74+InfCs by overexpressing CD74 in the C28/I2 human chondrocyte cell line (see 'Materials and methods' section). Interestingly, this manipulation led to an increase in the expression of the well-known pro-inflammatory cytokines IL-1β and TNF 55 56 (figure 3D).

The subpopulations of preInfC and InfC demonstrate diverse inflammatory scores
Next, we sought to explore both InfC-specific and preInfC-specific gene expression changes that may further explain the role of inflammatory-associated chondrocytes in OA. To do so, we performed a subclustering analysis, delving deeper into the compositions of both populations (see 'Materials and methods' section). Notably, three distinct subpopulations of preInfC were identified (figure 3E). The preInfC-1 cells exhibited elevated expression of inflammatory chemokine-associated genes (eg, CCL14 and ACKR1) and FC markers such as COL1A1, suggesting an intermediary state between FC and InfC. The preInfC-2 cells demonstrated heightened expression of CXCL12, potentially linked to chondrocyte apoptosis. 57 The preInfC-3 cells exhibited high expression levels of MMP3, ECRG4 and DCN, indicative of cartilage degradation. Furthermore, we identified four hypothetical subpopulations of InfC (figure 3F). The InfC-1 cells highly expressed the classic macrophage markers CD68 and CD163 and several complement pathway-related genes (ie, C1QA, C1QB and C1QC). The InfC-2 cells highly expressed MMP3, CHI3L2 and C11orf96. Finally, the InfC-3 and InfC-4 cells highly expressed dendritic cell markers (eg, FCER1A and CD1C) and T/NK cell markers (eg, NKG7 and CD3D), respectively.
Additionally, we sought to investigate the potential contribution to inflammation of each subpopulation. To this end, we initially established an inflammatory score for individual chondrocytes, determined by the expression of a curated set of genes related to inflammation. Subsequently, these scores served as indicators to evaluate the inflammatory profile of each subpopulation 58 (see 'Materials and methods' section). In examining the subpopulations of preInfC, we observed that the inflammatory scores of preInfC-1 (median value=0.016) and preInfC-2 (median value=0.012) were higher than that of preInfC-3 (median value=−0.002). Furthermore, all InfC subpopulations exhibited elevated inflammatory scores (figure 3G), that is, median value=0.067 for InfC-1; 0.028 for InfC-2; 0.074 for InfC-3 and 0.005 for InfC-4. Lastly, we noted a substantial correlation between the inflammatory score of chondrocytes and the pseudotime depicted in online supplemental figure S1B (Spearman's r=0.527, p<2.2e-16). This observation provides further evidence supporting the potential for chondrogenic differentiation from preInfC to InfC.
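To make the scoring and correlation steps concrete, the Python sketch below computes a per-cell gene-set score as the plain mean expression of the genes in a set and then relates it to pseudotime with Spearman's rank correlation. It is a simplified sketch under stated assumptions: the study's exact scoring procedure is given in its 'Materials and methods' section and may differ (for example, through background correction), and the gene labels, expression values, pseudotime values and function names here are hypothetical.

```python
import math


def gene_set_score(cell_expression, gene_set):
    """Mean expression over the genes of the set that are measured in the cell."""
    values = [cell_expression[g] for g in gene_set if g in cell_expression]
    if not values:
        raise ValueError("none of the genes in the set are present")
    return sum(values) / len(values)


def average_ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman's rank correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical normalised expression per cell and a hypothetical gene set.
cells = [
    {"G1": 0.0, "G2": 0.2, "G3": 5.0},
    {"G1": 0.5, "G2": 0.3, "G3": 4.0},
    {"G1": 1.0, "G2": 0.8, "G3": 1.0},
    {"G1": 2.0, "G2": 1.6, "G3": 0.5},
]
inflammatory_set = ["G1", "G2"]
pseudotime = [0.1, 0.4, 0.7, 0.9]

scores = [gene_set_score(cell, inflammatory_set) for cell in cells]
rho = spearman(scores, pseudotime)
```

Spearman's coefficient depends only on the ordering of the values, so it captures a monotonic increase of the score along pseudotime without assuming that the relationship is linear.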

The preHTC displays the major changes between patients with OA and non-OA controls
To further quantify the compositional differences between patients with OA and the non-OA controls within the shared chondrocyte populations, we performed a cell population-specific composition shift analysis using Cacoa 59 (see 'Materials and methods' section). As a result, we observed that the most significant composition change between patients with OA and the non-OA controls was in preHTC (20.4% vs 8.9%, p=2.0e-4), followed by RepC (8.46% vs 5.54%, p=2.0e-4; online supplemental figure S7A).
Next, to identify OA-associated genes, we performed differential expression (DE) analysis using DESeq2 (see 'Materials and methods' section) on pseudobulk data, 60 that is, data obtained by aggregating (ie, summing) the raw gene expression counts of all chondrocytes within one population. Among the DE genes across all shared cell populations (log2FC ≥ 1 and adjusted p<0.05; online supplemental table S8), we identified a total of 23 common upregulated DE genes (online supplemental figure S7B), which were associated with collagen-containing ECM (eg, COL3A1, COL1A2 and COL6A3) and matrix-degrading enzymes (eg, ADAMTS6 and HTRA1). Notably, we observed that the upregulated DE genes across all chondrocyte populations participated in analogous pathways (online supplemental figure S7C), indicating a shared transcriptomic characteristic among OA chondrocytes, specifically dysregulated ECM metabolism, cell proliferation and differentiation. Broadly, a greater number of DE genes between patients with OA and non-OA controls were detected in preHTC and RepC (figure 4A). Specifically, the DE genes unique to preHTC included osteogenic-related genes such as POSTN, CCN4 and TGFB3 (online supplemental figure S7B), which were enriched in the bone development pathway. In contrast, the unique DE genes of the other cell populations were not enriched in any significant pathways.
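The pseudobulk aggregation can be sketched as follows. This is a minimal Python illustration, not the pipeline used in this study: it assumes that raw counts are summed separately for each sample within a population, so that each sample contributes one replicate to the downstream DE test, and the `pseudobulk` function, sample identifiers and count values are hypothetical.

```python
from collections import defaultdict

def pseudobulk(cells):
    """Sum raw counts per (sample, population) pair.

    `cells` is an iterable of (sample_id, population, {gene: raw_count}) tuples.
    Returns {(sample_id, population): {gene: summed_count}}.
    """
    totals = defaultdict(lambda: defaultdict(int))
    for sample_id, population, counts in cells:
        for gene, count in counts.items():
            totals[(sample_id, population)][gene] += count
    return {key: dict(genes) for key, genes in totals.items()}

# Toy, made-up single-cell raw counts (not data from this study)
cells = [
    ("OA1", "preHTC", {"POSTN": 3, "COL3A1": 1}),
    ("OA1", "preHTC", {"POSTN": 2}),
    ("OA1", "RepC", {"POSTN": 1}),
    ("Ctrl1", "preHTC", {"COL3A1": 4}),
]

profiles = pseudobulk(cells)
```

Summing raw counts, rather than averaging normalised values, keeps the data on the integer count scale that count-based DE tools such as DESeq2 expect as input.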
Additionally, we performed gene-gene network analysis (see 'Materials and methods' section), which combines information from DE analysis with protein-protein interaction networks to identify statistically significant OA-associated subnetworks. This analysis further confirmed the connection of preHTC DE genes to the p38 MAPK cascade pathway (figure 4B), which potentially contributed to the apoptosis of chondrocytes. 61

The preHTC exhibits the greatest expression changes between WB and NWB
The aforementioned results demonstrated that preHTC exhibited the major compositional changes between patients with OA and non-OA controls. Therefore, we then asked how gene expression in preHTC changes between WB and NWB. To do so, we first performed expression shift analysis using Cacoa. Notably, we observed that the cell composition differed only slightly between WB and NWB (online supplemental figure S8A). Moreover, only a few DE genes (<10 genes) between WB and NWB were detected across most cell populations (adjusted p<0.05 and log2FC ≥ 0.5), suggesting minor gene expression variations between WB and NWB (online supplemental figure S8B).
In contrast, the three cell populations with the most DE genes were EC, preHTC and RepC, with a total of 20, 35 and 34 DE genes, respectively. Specifically, we noted that these chondrocytes highly expressed ECM-related genes, such as TGFBI, FN1, COL1A2, COL1A1 and LUM. Meanwhile, preHTC uniquely expressed the matrix-degrading enzyme ADAMTS2 and the MMP inhibitor TIMP3, along with the inflammatory-associated gene THY1 (online supplemental figure S8B). The accumulating evidence suggests that preHTC may play a critical role in OA pathogenesis. Therefore, to further investigate the cellular heterogeneity of preHTC, we performed subclustering analysis, resulting in four putative subpopulations (figure 4C and online supplemental figure S8C). Specifically, preHTC-1 highly expressed marker genes related to metal and cadmium ions, such as MT1G and MT1M. This subpopulation was also reported as the metallothionein chondrocytes (MTC) by previous studies. 25 The preHTC-2 was abundant in patients with OA and highly enriched in the ossification pathway (p=1.36e-5; COL1A1, MMP2 and SPARC). In contrast, preHTC-3 highly expressed APOD and CHI3L1, which potentially play a role in protecting cartilage from ECM degradation. 64 The preHTC-4 was abundant in patients with OA and highly expressed mitochondrial genes, such as MT-ATP6, MT-ND4 and MT-CO3, which may further contribute to cartilage degeneration 65 (figure 4D).

The AS and SZ exhibit the most significant alterations in gene expression
To investigate the zone-specific changes in function, we performed zone-specific DE analysis of the Geo-seq data using DESeq2 (see 'Materials and methods' section). As a result, we identified 3177, 1831, 409 and 102 upregulated DE genes (log2FC ≥ 1 and adjusted p<0.05) for the AS, SZ, MZ and DZ, respectively, when comparing patients with OA with non-OA controls (figure 4; online supplemental table S9). In particular, we found that ECM-associated genes (eg, FN1, COL1A1 and COMP) were consistently expressed in both the AS and SZ of OA samples. Meanwhile, the upregulated genes in the AS were enriched in immune response-related pathways, for example, neutrophil activation involved in immune response (IL1R2, MAPK14 and CD9). The upregulated genes in the SZ were enriched in transcription and protein synthesis-related pathways (figure 4; online supplemental table S10). The gene-gene network analysis also identified a significant response-to-chemokine subnetwork in the AS and a significant regulation-of-transcription subnetwork in the SZ, further confirming the enhanced inflammatory response and transcription in OA (figure 4G). In contrast, despite the detection of numerous upregulated DE genes in the MZ and DZ between patients with OA and non-OA controls, these genes were not enriched in meaningful pathways. Furthermore, we identified 478, 261, 154 and 498 downregulated DE genes between patients with OA and non-OA controls for the AS, SZ, MZ and DZ, respectively (figure 4; online supplemental table S9). However, those genes were not enriched in any biologically meaningful pathways.
Because the small sample size of the non-OA controls in the Geo-seq data potentially introduced unwanted variation, we performed three additional lines of validation. We first curated a total of 2886 OA-associated genes from PubMed with a given search criterion (see 'Materials and methods' section). As a result, we observed that the upregulated DE genes for AS and DZ were enriched in the given gene set in a range of proportions (online supplemental figure S9A). We next collected a total of 6552 OA-associated pathways with a given search criterion (see 'Materials and methods' section). As a result, we found that 60.2% and 52.0% of pathways overlapped with DE genes from the AS and SZ, respectively (online supplemental figure S9B). Finally, we conducted BAG1 and CD9 IHC staining on samples from patients with OA and non-OA controls. Notably, the protein folding-related gene BAG1 and the immune response regulator CD9 were highly expressed in the SZ (figure 4H).
In addition, we performed DE analysis between WB and NWB, resulting in a total of 628, 1734, 590 and 2072 upregulated DE genes (log2FC ≥ 1 and adjusted p<0.05) for the AS, SZ, MZ and DZ, respectively (online supplemental figure S10A; online supplemental table S11), and a total of 15, 914, 1156 and 362 downregulated DE genes (log2FC ≤ −1 and adjusted p<0.05) for the AS, SZ, MZ and DZ, respectively. Specifically, we observed that the fewest DE genes between WB and NWB were detected in the AS, suggesting only small gene expression alterations between WB and NWB in this zone. The highest number of DE genes was identified in the SZ, and these genes were enriched in protein synthesis and ECM-related pathways (online supplemental figure S10B; online supplemental table S12). Notably, a large number of DE genes were detected in the MZ and DZ; however, these genes were not enriched in biologically meaningful pathways.

Cellular composition and expression signature of preHTC
To further confirm the critical role of preHTC in large-scale population studies, we estimated the cell composition of bulk samples by performing deconvolution analysis on two publicly available bulk RNA-seq datasets using CIBERSORTx. 46 Here, the cell population-specific gene signature matrices were curated from the scRNA-seq data in this study (see 'Materials and methods' section).
In the first bulk RNA-seq dataset, consisting of 38 samples (20 patients with OA and 18 controls), we observed that InfC and FC were present only in patients with OA (the proportion: 0% vs 0.11% for non-OA vs OA; p=0.048 for InfC, and 0% vs 8.75%; p=8.29e-5 for FC). preHTC (20.0% vs 32.1%; p=4.88e-5) was significantly more abundant in OA samples. In contrast, HomC was significantly more abundant in non-OA controls (2.41% vs 0.046%; p=5.7e-4, figure 5A). This result was consistent with the cell compositions in the integrative analysis of scRNA-seq data and Geo-seq data.
To identify cell subpopulations in the single-cell data whose transcriptional patterns are associated with OA status, we applied Scissor 66 to this bulk RNA-seq dataset. We classified chondrocytes as positively associated (Scissor + cells), negatively associated (Scissor − cells) or uncorrelated (background cells) with OA pathology (see 'Materials and methods' section). As a result, we found that ProC was significantly associated with OA (42.6% Scissor + cells and 2.82% Scissor − cells), followed by FC (32.2% Scissor + cells and 3.28% Scissor − cells) and preHTC (30.5% Scissor + cells and 9.19% Scissor − cells; figure 5B), supporting the association of preHTC with OA status.
The second bulk RNA-seq dataset comprised eight damaged distal medial condyles (DMC, equivalent to WB) and eight matched intact posterior lateral condyles (PLC, equivalent to NWB), which served as internal controls (see 'Materials and methods' section). Deconvolution analysis revealed that preFC was predominant in PLC, constituting 30.5% compared with 18.0% in DMC (p=6.84e-3; online supplemental figure S11A). The lower proportion of preFC in DMC aligned with the cell compositions observed in the Geo-seq data when comparing WB and NWB. Scissor analysis also demonstrated that preHTC was significantly associated with DMC (24.0% Scissor + cells and 0.48% Scissor − cells; online supplemental figure S11B).
To further explore the role of preHTC, we applied Scissor to the integrative analysis of the four preHTC subpopulations and the two bulk RNA-seq datasets. Notably, we observed that preHTC-2 was strongly associated with both OA status (21.8% Scissor + cells for preHTC-1, 63.0% for preHTC-2, 15.6% for preHTC-3 and 21.7% for preHTC-4) (online supplemental figure S11C) and DMC (18.7% Scissor + cells for preHTC-1, 74.5% for preHTC-2, 6.2% for preHTC-3 and 29.7% for preHTC-4) (online supplemental figure S11D). These results highlight the critical role of the preHTC-2 subpopulation (MMP2 and COL1A1 as marker genes) in the pathogenesis of OA. 67 Finally, we evaluated the capability of the cell population-specific DE genes to predict OA status. To do so, we employed a logistic regression-based prediction model on the first bulk RNA-seq dataset. With the leave-one-out cross-validation (LOOCV) strategy, we found that the DE genes of preHTC exhibited superior performance in predicting OA status (area under the curve (AUC)=0.99; online supplemental figure S12).
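The LOOCV evaluation of a logistic regression classifier can be sketched in plain Python as below. This is a simplified illustration under stated assumptions rather than the model used in this study: it uses a single hypothetical feature per sample (for example, a summary score of preHTC DE genes), fits the model by batch gradient descent with a small L2 penalty, and computes the AUC from the held-out probabilities. All function names, hyperparameters and data values are made up for the example.

```python
import math

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def fit_logistic(X, y, lr=0.1, epochs=500, l2=0.01):
    """Fit weights (bias stored last) by batch gradient descent; L2 applies to weights only."""
    n_features = len(X[0])
    w = [0.0] * (n_features + 1)
    for _ in range(epochs):
        grad = [0.0] * (n_features + 1)
        for xi, yi in zip(X, y):
            # zip stops at the shorter sequence, so the bias w[-1] is added separately
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + w[-1]) - yi
            for j, xj in enumerate(xi):
                grad[j] += err * xj
            grad[-1] += err
        for j in range(n_features):
            w[j] -= lr * (grad[j] / len(X) + l2 * w[j])
        w[-1] -= lr * grad[-1] / len(X)
    return w

def predict_proba(w, x):
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + w[-1])

def loocv_probabilities(X, y):
    """Hold out each sample once, train on the rest, predict the held-out sample."""
    probs = []
    for i in range(len(X)):
        w = fit_logistic(X[:i] + X[i + 1:], y[:i] + y[i + 1:])
        probs.append(predict_proba(w, X[i]))
    return probs

def auc(y_true, scores):
    """Fraction of (positive, negative) pairs ranked correctly; ties count as 0.5."""
    pos = [s for s, t in zip(scores, y_true) if t == 1]
    neg = [s for s, t in zip(scores, y_true) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Toy, made-up feature values: 1 = OA, 0 = non-OA control
X = [[2.0], [1.5], [1.8], [-1.9], [-1.4], [-2.1]]
y = [1, 1, 1, 0, 0, 0]
held_out_probs = loocv_probabilities(X, y)
loocv_auc = auc(y, held_out_probs)
```

Because every prediction is made by a model that never saw the corresponding sample, the resulting AUC is less optimistic than one computed on the training data, which matters for cohorts as small as a few dozen samples.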

The preFC is the main contributor to subtyping patients with OA
According to the aforementioned results, the patients with OA displayed diverse phenotypes. Therefore, to investigate the subtypes of patients with OA, we applied deconvolution analysis to two bulk RNA-seq datasets. 15 68 The first bulk RNA-seq dataset contains 131 patients with OA. We then performed a hierarchical clustering analysis on the deconvolution results, resulting in two major subtypes (figure 5C), namely group A (n=43) and group B (n=88). The main differences in cell compositions between group A and group B were the preFC (7.39% vs 15.5%, p=3.17e-9) and the FC (23.6% vs 5.2%; p=3.96e-13). Furthermore, we performed DE analysis between group A and group B. As a result, we identified a total of 457 DE genes (|log2FC|>1 and adjusted p<0.05), including 342 upregulated genes and 115 downregulated genes (figure 5; online supplemental table S13).
Subsequently, we performed an enrichment analysis of the DE genes, finding that the upregulated DE genes in group A were strongly enriched in immune-related signalling pathways (expressed: CXCL12, CD163, CCL2 and CD74), while the downregulated DE genes were enriched in chondrocyte proliferation-related signalling pathways (expressed: GDF10, CYTL1 and CHRDL2; figure 5E). In addition, this bulk RNA-seq dataset also contained the joint space narrowing (JSN) score, 69 which measured the grade of knee OA severity for each patient. We found that the JSN score was significantly higher in group A than in group B (median scores for group A vs group B were 3.75 vs 3.02, p=0.01, Wilcoxon rank-sum test), indicating more severe cartilage lesions in group A. The proportion of preFC in each patient was also negatively correlated with the JSN score across the 131 patients with OA (Spearman's r=−0.239, p=5.93e-3). The stratification of patients with OA was largely consistent with the result in the Research Arthritis and Articular Cartilage (RAAK) study. 68 Our result suggested that the preFC population was a main contributor to subtyping patients with OA.
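For readers less familiar with the rank-based correlation reported above, the following Python sketch computes Spearman's coefficient as the Pearson correlation of average ranks, which handles tied values such as repeated JSN scores. It is a generic illustration, not the analysis code of this study; the function names and the five toy value pairs are assumptions.

```python
def average_ranks(values):
    """Return 1-based ranks; tied values receive the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current run of tied values
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(x, y):
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman's rank correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))

# Toy, made-up values: preFC proportion and JSN score for five patients
prefc_proportion = [0.30, 0.22, 0.15, 0.08, 0.05]
jsn_score = [2.5, 3.0, 3.0, 3.5, 4.0]
rho = spearman(prefc_proportion, jsn_score)
```

In this toy example the coefficient is strongly negative because the proportions fall as the scores rise; the value observed in the 131-patient cohort (r=−0.239) indicates a much weaker, though still significant, monotonic trend.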
The second bulk RNA-seq dataset contains 20 patients with OA. We then performed a hierarchical clustering analysis on the deconvolution results, resulting in two distinct subtypes of OA (online supplemental figure S13A), namely group A (n=7) and group B (n=13). The cell population that differed most between group A and group B was preFC (proportion: 7.91% vs 37.1%; p=1.29e-5). Furthermore, we performed DE analysis between group A and group B, detecting a total of 3789 DE genes (|log2FC|>1 and adjusted p<0.05), of which 2125 were upregulated genes in group A and 1664 were downregulated genes (online supplemental figure S13B; online supplemental table S14). In addition, we performed the enrichment analysis, finding that the DE genes in group A were enriched in immune-related signalling pathways (expressed: CXCL12, CCL2 and CD74), while the DE genes from group B were enriched in chondrocyte proliferation-related signalling pathways (expressed: SOX5 and BMP6; online supplemental figure S13C).

The HTC is genetically related to the GWAS variants in OA
To systematically investigate the cell population-specific GWAS variants that potentially contribute to the pathogenesis of OA, we performed an integrative analysis of scRNA-seq data and GWAS summary data using CELLECT 70 (see 'Materials and methods' section). Since our scRNA-seq samples were all from East Asia (ie, China), we stratified the UK Biobank data to include only individuals of East Asian ancestry. There was a total of 1979 East Asian ancestry participants (1172 individuals with knee OA and 807 controls) for whom we obtained GWAS summary statistics (online supplemental figure S14A) using GMMAT 71 (see 'Materials and methods' section). Next, we used CELLECT to determine the potential genetically relevant chondrocyte populations (see 'Materials and methods' section). Specifically, we observed that the strongest cell population-specific association was with HTC (p=0.089) for East Asian populations (figure 5F), reinforcing the notion that HTC may be an OA-critical chondrocyte population.
Furthermore, we performed a similar analysis on European ancestry. Specifically, GWAS summary statistics for knee pain associations from the UK Biobank, 72 including 406 241 individuals of European ancestry (21 684 knee OA samples and 384 458 controls), were obtained from Neale's laboratory 73 (online supplemental figure S14B). We then integrated these GWAS summary statistics with the scRNA-seq data obtained from European samples. 74 We observed that both ProC (p=0.016) and HTC (p=0.017) were strongly associated with OA status (figure 5F).
Taken together, we systematically investigated the relationship between OA-associated GWAS variants and chondrocyte populations of knee articular cartilage, finding HTC to be the most relevant chondrocyte population in both East Asian and European populations. Our results provided new insight into the genetic architecture of OA and potential therapeutic targets for OA-modifying drugs.

DISCUSSION
In the knee articular cartilage tissue, chondrocytes reside in a three-dimensional spatial structure, exhibiting cellular and spatial heterogeneity within individuals and among different patients. To gain a comprehensive understanding of the transcriptomic landscape of human knee articular cartilage, we performed extensive integration analyses using scRNA-seq and spatially resolved transcriptomic technologies for both patients with OA and non-OA controls. We identified 33 marker genes that were used for annotating 11 putative chondrocyte populations, including 8 populations shared between patients with OA and non-OA controls, and 3 OA cartilage-specific populations, including 2 newly defined populations: preInfC and InfC. Our analysis revealed that chondrocytes in patients with OA exhibited high interpatient heterogeneity and represented more diverse phenotypic variations compared with controls, highlighting the need for precision medicine approaches in OA treatment.
While inflammatory mediators have been implicated in the regulation of cartilage damage, 9 the role of chondrocytes in the immune response remained unclear. Recently, a scRNA-seq study dissecting the heterogeneity of articular cartilage detected CD74, CD80, CD86 and HLA-DPA1 as highly expressed in a small proportion of chondrocytes, indicating that these chondrocytes may have immune cell functions. 10 However, these chondrocytes were mixed together with the RegC population, presumably due to the limited number of chondrocytes profiled by Smart-seq2 technology 75 (ie, 60-120 chondrocytes were profiled per sample). Another study used cytometry by time of flight (cyTOF) profiling to map the inflammatory landscape, further confirming the existence of the inflammatory populations in the articular cartilage of OA. 24 However, how cross-talk or cell communication among different chondrocyte populations occurs, and which key genes or signalling pathways might contribute to OA, remained unclear, presumably because of the preselected panel of 33 markers. In this study, we applied 10x Genomics scRNA-seq protocols to profile 135 896 chondrocytes, representing a comprehensive survey of the chondrocytes in OA status. Of note, we identified preInfC and InfC as newly expanded chondrocyte populations (a total of 1499 chondrocytes, 1.1% of all chondrocytes for InfC, and a total of 533 chondrocytes, 0.39% of all chondrocytes for preInfC) that may potentially contribute to perpetuating cartilage degradation in OA (figure 6A). The InfCs highly expressed CD74, CXCL8 and GPR183, and were enriched for the MIF-CD74 and MIF-CXCR2/4 L-R pairs, which act as major upstream inflammatory regulators of p38 and/or JNK MAP kinase signalling. MIF-CD74 and MIF-CXCR2/4 are essential for promoting the transcription of proinflammatory cytokines and complement systems (figure 6B). Therefore, targeting the MIF-CD74 or MIF-CXCR2/4 pathways is a potential therapeutic strategy for treating OA in the future.
Besides the single-cell transcriptomic landscape, we systematically characterised the transcriptomic landscape of the spatial organisation of knee articular cartilage, and revealed that the AS was the most transcriptionally active zone in both patients with OA and non-OA controls. Notably, many previously reported targets of candidate drugs for treating OA were detected in the AS. For example, PRG4, known as lubricin, may contribute to smooth joint motion 76 ; CRTAC1 has been considered a potential biomarker for identifying individuals at risk of OA earlier and for monitoring the disease course. 17 As we expected, the relatively quiescent articular chondrocytes reside in the MZ and DZ of patients with OA. Overall, the integrative analysis of scRNA-seq and Geo-seq data demonstrated that most of the chondrocyte populations were located in the MZ and DZ, while preHTCs and preFCs were mainly distributed in the AS and SZ of patients with OA, suggesting that these chondrocyte populations were potential disease-modifying candidates for treating OA.

We further showed that the preHTC contributes to OA and that preFC potentially prevents cartilage degradation. We found that preHTC was significantly more abundant in OA compared with non-OA controls, and preFC was abundant in the AS and SZ of WB compared with NWB. These findings were also validated by deconvolution analysis of two bulk RNA-seq datasets. Specifically, we found that the preHTC population was the major population distinguishing patients with OA from non-OA controls, suggesting that the marker genes of this chondrocyte population are potential biomarkers for OA status. On the other hand, the preFC population is the major contributor to subtyping patients with OA (figure 6C). As a result, the preFC-depleted group was related to the inflammation-related pathways, while the preFC-abundant group was related to cartilage-related pathways. Of note, these observations need to be further validated with large-scale OA studies with comprehensive clinical features in the future. Furthermore, we performed enrichment analysis for chondrocyte population-specific marker genes in GWAS variants in OA to investigate the OA-critical chondrocyte populations. We found that HTC was a key mediator in both European and East Asian ancestry populations. Importantly, the integrative analysis of scRNA-seq data with GWAS summary data enhances the comprehension of disease mechanisms. It also offers a more precise methodology for unravelling how genetic variants contribute to disease risk and identifying the pertinent cell types involved (figure 6D).
While vast numbers of single-cell molecular profiles have been generated, a standardised terminology for annotating cell types of knee articular cartilage tissues remains lacking. Defining cell types is challenging, 77 probably due to factors such as unwanted variations, cellular heterogeneity, genetics, cell lineage and cell cycle phases. Recent studies have highlighted the impact of the senescent cell population (expressed: CDKN1A) 25 74 78 and the stem/progenitor cell population (expressed: CD44, ICAM1 and VCAM1) 79 in the OA cartilage tissue. In particular, we could also identify these cells in ProC, but could not define them as new cell populations, presumably due to the small number of chondrocytes that were assayed by scRNA-seq protocols. We could also identify the recently defined MTC population 25 as a subpopulation of preHTC, namely preHTC-1, which highly expressed MT1M, MT1A and MT1G (figure 4C). In summary, we provide a spatially aware annotation of cell populations in human knee articular cartilage and perform independent validation using publicly available scRNA-seq data. However, a holistic view of cell populations in human knee articular cartilage remains elusive. We emphasise the importance of comprehensive screening and experimental validation to investigate marker gene candidates that are generally applicable for annotating cell populations.

Figure 6 The summary of the findings. (A) The cell fraction difference between patients with OA and non-OA controls. The InfC and preInfC are newly identified chondrocyte populations, which are largely absent in non-OA controls. (B) The MIF expressed by other chondrocytes interacts with the CD74-CXCR2/4-CD44 complex expressed by InfCs and activates the MAPK signalling pathway, which increases the expression of the activator protein 1 (AP-1) transcription factor and further promotes the transcription of pro-inflammatory cytokines. This figure was created by BioRender. (C) An integrative analysis of scRNA-seq data and bulk RNA-seq data demonstrates that two major subtypes of patients with OA are stratified by preFC, resulting in an inflammatory-related subtype (group A) and a non-inflammatory-related subtype of OA (group B). (D) An integrative analysis of scRNA-seq data and GWAS summary data shows that HTC is the key chondrocyte population that drives the susceptibility to the pathogenesis of OA. CXCR2/4, C-X-C motif chemokine receptor 2/4; ECM, extracellular matrix; GWAS, genome-wide association studies; HTC, hypertrophic chondrocyte; InfC, inflammatory chondrocyte; MIF, macrophage migration inhibitory factor; MAPK, mitogen-activated protein kinase; OA, osteoarthritis; preFC, prefibrocartilage chondrocyte; preInfC, pre-inflammatory chondrocyte; scRNA-seq, single-cell RNA sequencing.

Acknowledging the limitation of the small sample size in the non-OA control spatial data, we recognise the potential bias in identifying non-representative DE genes. Despite this constraint, we increased the assayed spots in each zone and conducted additional validations to ensure the biological meaningfulness of these DE genes. Together, our study presents a transcriptional landscape of chondrocytes in knee articular cartilage at both single-cell and zone-level spatial resolutions in patients with OA and patients without OA. The new findings have the potential to enhance diagnosis and treatment, provide insights into key molecular players in OA and uncover promising drugs for personalised medicine strategies in OA.

Cartilage sample collection
We recruited eight patients with OA who met the Modified Outerbridge Classification criteria and underwent knee arthroplasty surgery, and obtained the articular cartilage tissue. We also recruited three normal controls who had undergone amputation and had no history of joint injury or disease. The knee cartilage samples were obtained from the discarded tissue after knee arthroplasty. Subjects were excluded if they had or had previously suffered from any other osteoarticular disease (eg, rheumatoid arthritis, gout or skeletal fluorosis), any type of macrosomia, osteochondrodysplasia or chronic disease (eg, hypertension, diabetes or coronary heart disease) or had received any treatment in the previous 6 months. The basic physiological characteristics (six females and six males; 64.33±7.6 years for patients with OA and 51.67±4.9 years for non-OA controls) of the recruited samples are shown in online supplemental table S1. In addition, six patients with OA and six non-OA controls were used for immunohistochemical validation (online supplemental table S15).

Chondrocyte isolation and library construction for 10x Genomics protocol
Cartilage was harvested within an hour after surgery, washed twice with phosphate-buffered saline (PBS) containing penicillin and streptomycin, and cut into 1 mm³ pieces. Samples were then subjected to pretreatment with 0.25% trypsin (Thermo Fisher Scientific, Waltham, Massachusetts, USA) at 37°C in an incubator with 5% CO2 for 30 min. The cartilage fragments were then centrifuged at 350×g for 5 min and the supernatant was discarded completely, followed by digestion in basal Dulbecco's Modified Eagle Medium/F12 media (Gibco, Grand Island, New York, USA) supplemented with 0.2% type II collagenase (Gibco) at 37°C for 10-12 hours. Cell count and viability were estimated using a fluorescence Cell Analyzer (Countstar Rigel S2) with Acridine Orange/Propidium Iodide (AO/PI) reagent after the removal of erythrocytes (Miltenyi 130-094-183). Debris and dead cell removal was performed according to the recommended manuals (Miltenyi 130-109-398/130-090-101). For the 10x Genomics protocol, isolated chondrocytes were filtered through a 70 μm cell strainer, washed twice with PBS buffer and directly subjected to complementary DNA (cDNA) library construction. 10 80 Articular cartilage tissue was dissociated into single cells by enzymatic methods according to standard procedures. Briefly, the cells were washed with PBS and resuspended in 500 μL PBS, and single-cell RNA-seq libraries were prepared using the Chromium Single Cell 3' Library V.3 Reagent Kit (10x Genomics, Pleasanton, California, USA). Sequencing was performed on an Illumina NovaSeq 6000.

Processing of scRNA-seq data
Unique molecular identifier (UMI) counts were obtained by aligning FASTQ files to the human reference genome (GRCh38 3.0.0) through Cell Ranger (V.4.0) software from the 10x Genomics website (https://support.10xgenomics.com/single-cell-gene-expression/software/pipelines/latest/advanced/references). Gene expression raw count matrices were used for downstream analyses. Specifically, for each scRNA-seq dataset, the quality control steps included low-quality cell removal and mitochondrial count filtering. The quality control for each scRNA-seq dataset is summarised in online supplemental table S2. We performed the doublet removal step using DoubletFinder (V.2.0.3) 81 when the number of captured cells was >10 000, with pK (the principal component neighbourhood size used to calculate pANN) set to 0.09, nEXP (the total number of doublet predictions produced) set to 0.075 and other parameters at their default settings.
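The mitochondrial count filtering step can be illustrated with the short Python sketch below. The thresholds actually applied to each dataset are those summarised in online supplemental table S2; the `min_genes` and `max_mito_fraction` defaults here are hypothetical placeholders, as are the function names and the toy counts.

```python
def mito_fraction(counts):
    """Fraction of a cell's UMI counts that come from mitochondrial genes.

    Human mitochondrial gene symbols carry the 'MT-' prefix (eg, MT-ATP6);
    metallothionein genes such as MT1G lack the hyphen and are not matched.
    """
    total = sum(counts.values())
    if total == 0:
        return 0.0
    mito = sum(c for gene, c in counts.items() if gene.startswith("MT-"))
    return mito / total

def passes_qc(counts, min_genes=200, max_mito_fraction=0.10):
    """Keep a cell if it has enough detected genes and a low mitochondrial fraction.

    Both thresholds are hypothetical defaults chosen for illustration only.
    """
    n_genes = sum(1 for c in counts.values() if c > 0)
    return n_genes >= min_genes and mito_fraction(counts) <= max_mito_fraction

# Toy, made-up UMI counts for two cells
healthy_cell = {"COL2A1": 60, "ACAN": 30, "MT1G": 5, "MT-ND4": 5}
stressed_cell = {"COL2A1": 50, "ACAN": 30, "MT-ATP6": 10, "MT-CO3": 10}

keep_healthy = passes_qc(healthy_cell, min_genes=3)
keep_stressed = passes_qc(stressed_cell, min_genes=3)
```

A high mitochondrial fraction is commonly taken as a sign of a damaged or dying cell, which is why such cells are usually removed before clustering.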

Geo-sequencing
Fresh cartilage tissue obtained from patients with OA and non-OA controls was placed in cold RNase-free PBS. The tissue was then gently transferred to precooled optimum cutting temperature (OCT) compound, ensuring that the entire tissue was embedded. The tissue was immediately frozen in liquid nitrogen or dry ice and transferred to −80°C. The OCT-embedded samples were cryosectioned at −20°C at 14 μm thickness. The sections were then mounted on polyethylene-terephthalate-coated slides, fixed immediately with ethanol and stained with 1% cresyl violet acetate dissolved in 75% ethanol solution. Afterwards, targeted regions of sections were collected by LCM, and transcriptome analyses of those collected samples were performed by Geo-seq. 36 The cDNA library was prepared using the TruePrep DNA Library Prep Kit V2 for Illumina (Vazyme #TD501-503) and sequenced on an Illumina NovaSeq 6000 in PE150 mode. We classified the zones of knee articular cartilage based on the organisation and degree of alignment of the collagen fibres, which could be summarised as the following criteria (online supplemental figure S4A): (1) the hyaline superficial cartilage of the articular cartilage was recognised as the AS; (2) the thin zone of the remaining cartilage, where the collagen fibres are densely packed and aligned parallel to the AS, and the chondrocytes are flattened, was recognised as the SZ; (3) the zone where the collagen fibres are obliquely organised and the chondrocytes are spherical and of low density was recognised as the MZ; (4) the zone where the collagen fibres are arranged perpendicular to the AS and the chondrocytes are hypertrophic was recognised as the DZ. To avoid overlap between zones, regions of approximately 200 μm between the SZ and MZ and between the MZ and DZ were discarded.

Integrative analysis of scRNA-seq datasets
To assess the reproducibility of chondrocyte annotations across 19 scRNA-seq datasets from 11 individuals, we performed an in-depth integrative analysis. We performed integrative analyses using Harmony, 84 LIGER 85 and Seurat CCA 86 to provide independent validations of chondrocyte annotations. In particular, we observed a similar number and composition of chondrocyte populations in all scenarios, regardless of the number of latent dimensions (online supplemental figures S15A-S15C) or the choice of integrative analysis method (online supplemental figures S15C and S15D). To remove dataset-specific batch effects, we evaluated all integrated results according to an existing benchmark pipeline. 87 We compared different integration methods (ie, Harmony, LIGER and Seurat) across a range of numbers of PCs and clustering resolutions. In the first step, the robustness of the clustering was tested by comparing the results under different resolutions; resolutions that guaranteed that all chondrocyte populations were detected as separate clusters were selected. In the second step, different integration methods were benchmarked through a pipeline according to a previous study. 87 Finally, the optimal integration method (ie, Harmony) was selected for the downstream analysis with PCs=30 and resolution=0.9, to produce an integrated embedding of all chondrocytes (online supplemental figure S15A).
Specifically, we performed integrative analyses following the Harmony pipeline84 (https://htmlpreview.github.io/?https://github.com/immunogenomics/harmony/blob/master/doc/quickstart.html), the Seurat pipeline22 88 (https://satijalab.org/seurat/articles/integration_introduction.html) and the LIGER pipeline (http://htmlpreview.github.io/?https://github.com/welch-lab/liger/blob/master/vignettes/Integrating_multi_scRNA_data.html). For the Harmony pipeline (V.1.0.3), scRNA-seq data from different samples were merged into one Seurat object, then normalised using the NormalizeData function and scaled on the 2000 most variable features using the FindVariableFeatures and ScaleData functions. Principal component analysis (PCA) was performed using the RunPCA function, followed by the RunHarmony function to correct for batch effects across samples. For the Seurat pipeline, we followed the routine integration pipeline (V.4.2.0), that is, the log-normalisation was performed using the NormalizeData function; selection of highly variable genes was performed using the FindVariableFeatures function (VST method); correction of batch effects across samples was performed using the FindIntegrationAnchors and IntegrateData functions; dimensionality reduction was performed using the RunPCA and RunUMAP functions; neighbour network construction was performed using the FindNeighbors function. For the LIGER pipeline, the log-normalisation was performed using the normalize function; selection of highly variable genes was performed using the selectGenes function; batch effects correction across samples was performed using the optimizeALS and quantile_norm functions; clustering was performed using the louvainCluster function. For all pipelines, the marker genes for each population were identified using the FindMarkers function with the Wilcoxon rank-sum test.
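As a rough illustration of the log-normalisation step that opens each pipeline, the sketch below applies the transformation that Seurat's NormalizeData uses by default: each count is divided by the cell's total count, multiplied by a scale factor of 10 000 and passed through log1p. It is written in plain Python rather than the R code used in the study, and the function name and toy counts are hypothetical.

```python
import math

def log_normalise(counts, scale_factor=10_000):
    """Log-normalise one cell's UMI counts: log1p(count / total * scale_factor)."""
    total = sum(counts.values())
    if total == 0:
        # A cell with no counts stays at zero for every gene.
        return {gene: 0.0 for gene in counts}
    return {gene: math.log1p(c / total * scale_factor) for gene, c in counts.items()}

# Hypothetical toy cell (gene symbol -> UMI count); not data from the study.
cell = {"COL2A1": 30, "ACAN": 10, "CD74": 0}
normalised = log_normalise(cell)
```

Because the transformation is applied per cell, it removes differences in sequencing depth between cells before variable-feature selection and scaling.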

Cell compositional analysis
We ran Cacoa (V.0.4.0)59 to assess the changes in cell composition between patients with OA and non-OA controls for each chondrocyte population. The statistical significance of the difference in cell composition was tested using the estimateCellLoadings function. To test the statistical significance of cell composition changes in the deconvolution results of Geo-seq data and bulk RNA-seq data, we performed a permutation t-test implemented in the exactRankTests R package (V.0.8.35).
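The idea behind a permutation test on cell proportions can be sketched as follows. This is a minimal exact two-sample permutation test on the difference in group means, written in plain Python for illustration only; it is not the exactRankTests implementation, and the function name and proportions are hypothetical.

```python
from itertools import combinations
from statistics import mean

def permutation_test(group_a, group_b):
    """Exact two-sided permutation test on the difference in group means."""
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    observed = abs(mean(group_a) - mean(group_b))
    extreme = 0
    total = 0
    # Enumerate every way of relabelling n_a of the pooled samples as group A.
    for idx in combinations(range(len(pooled)), n_a):
        chosen = set(idx)
        a = [pooled[i] for i in range(len(pooled)) if i in chosen]
        b = [pooled[i] for i in range(len(pooled)) if i not in chosen]
        total += 1
        # Small tolerance guards against floating-point ties.
        if abs(mean(a) - mean(b)) >= observed - 1e-12:
            extreme += 1
    return extreme / total

# Hypothetical proportions of one chondrocyte population per sample.
oa = [0.30, 0.28, 0.35, 0.32]
control = [0.10, 0.12, 0.08]
p_value = permutation_test(oa, control)
```

With small groups the smallest attainable p value is bounded by the number of possible relabellings (35 in this toy example), which is worth keeping in mind when only three control donors are available.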

Trajectory inference analysis
We ran monocle2 (V.2.18.0)89 on the raw counts of the scRNA-seq data. After calculating size factors and estimating dispersions using the estimateSizeFactors and estimateDispersions functions, we identified 13 124 DE genes along the lineage using a trajectory-based differential gene test. We selected the top 3000 genes ranked by the q-value generated by the differentialGeneTest function. Dimensionality reduction analysis was performed using the DDRTree method. After cell ordering, the trajectory was visualised using the plot_cell_trajectory function with default parameter settings.

Cell-cell communication analysis
Cell-cell interaction analyses were performed on patients with OA and non-OA controls using CellChat (V.1.4.0).50 Specifically, CellChat was run on each group separately, using the log-normalised expression data and the cell annotation information as input. We followed the standard pipeline (https://github.com/sqjin/CellChat/blob/master/tutorial/CellChat-vignette.html) to perform all analyses using the default parameters and the human ligand-receptor database.

Deconvolution analysis
We performed the deconvolution analysis using CIBERSORTx (https://cibersortx.stanford.edu/)46 for two different scenarios: (1) integration of the scRNA-seq data with Geo-seq data to determine the chondrocyte composition on the spatial landscape of cartilage tissue and (2) integration of the scRNA-seq data with traditional bulk RNA-seq data to decipher the chondrocyte composition of patients with OA. Specifically, we first constructed the gene signature matrix using the top marker genes (log2FC>0.25 and adjusted p<0.05) for all chondrocyte populations. The deconvolution analysis was performed in S-mode due to the possibility of high technical variance. A p value was generated for each sample, and only samples with p<0.05 were included in the downstream analysis. Finally, a permutation test59 was performed to test whether the cell proportions were significantly different between patients with OA and non-OA controls.

Differential expression analysis between patients with OA and non-OA controls
We identified DE genes between patients with OA and non-OA controls for each common cell population. Briefly, we first aggregated UMI counts within a given chondrocyte population to generate pseudobulk RNA-seq data. We then ran DESeq2,47 as implemented in Cacoa,59 to identify DE genes using a likelihood ratio test. We considered a gene with |log2FC|>1 and adjusted p<0.05 (Benjamini-Hochberg correction for multiple testing) as a DE gene. We excluded InfC, preInfC and FC when performing DE analysis between patients with OA and non-OA controls because they were absent in non-OA samples. We also used DESeq2 to identify DE genes between patients with OA and non-OA controls for Geo-seq data.
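The pseudobulk aggregation and the DE gene thresholds can be sketched as below. This is a plain-Python illustration under stated assumptions: the p values and fold changes are hypothetical stand-ins for DESeq2 output (the likelihood ratio test itself is not reproduced), and all function names, sample labels and gene names are invented for the example.

```python
from collections import defaultdict

def pseudobulk(cells):
    """Sum UMI counts per (sample, population) to form pseudobulk profiles."""
    totals = defaultdict(lambda: defaultdict(int))
    for cell in cells:
        key = (cell["sample"], cell["population"])
        for gene, count in cell["counts"].items():
            totals[key][gene] += count
    return {key: dict(genes) for key, genes in totals.items()}

def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p values in the input order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

def call_de_genes(results, lfc_cutoff=1.0, alpha=0.05):
    """Keep genes with |log2FC| > lfc_cutoff and adjusted p < alpha."""
    genes = list(results)
    padj = benjamini_hochberg([results[g]["p"] for g in genes])
    return [g for g, q in zip(genes, padj)
            if abs(results[g]["log2fc"]) > lfc_cutoff and q < alpha]

# Hypothetical cells and test results; not data from the study.
cells = [
    {"sample": "OA1", "population": "preHTC", "counts": {"GENE_A": 3, "GENE_B": 1}},
    {"sample": "OA1", "population": "preHTC", "counts": {"GENE_A": 2}},
    {"sample": "OA1", "population": "HTC", "counts": {"GENE_A": 1}},
]
bulk = pseudobulk(cells)
results = {
    "GENE_A": {"log2fc": 2.1, "p": 0.001},
    "GENE_B": {"log2fc": 0.4, "p": 0.002},
    "GENE_C": {"log2fc": -1.5, "p": 0.03},
    "GENE_D": {"log2fc": 1.8, "p": 0.6},
}
de_genes = call_de_genes(results)
```

Aggregating per sample before testing treats donors, not individual cells, as the replicates, which is the motivation for the pseudobulk design.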

Gene set enrichment analysis
Gene set enrichment analyses were conducted using the enrichGO function implemented in the clusterProfiler R package (V.3.18.1).90 Specifically, the genes retained after QC were used as background genes, and the minimum and maximum sizes of genes annotated with GO terms for testing were set to 10 and 500, respectively. The significant GO terms were selected with adjusted p<0.05 (Benjamini-Hochberg correction for multiple testing).
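enrichGO performs an over-representation test based on the hypergeometric distribution. The sketch below shows that calculation for a single GO term in plain Python, together with the 10 to 500 term-size filter (the minGSSize and maxGSSize arguments of enrichGO). The function names and the toy numbers are hypothetical and chosen only to illustrate the arithmetic.

```python
from math import comb

def hypergeom_enrichment_p(n_background, n_in_term, n_query, n_overlap):
    """P(X >= n_overlap) when n_query genes are drawn without replacement
    from n_background genes, n_in_term of which are annotated to the term."""
    upper = min(n_in_term, n_query)
    denom = comb(n_background, n_query)
    tail = sum(
        comb(n_in_term, k) * comb(n_background - n_in_term, n_query - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / denom

def term_is_testable(term_size, min_size=10, max_size=500):
    """Apply the term-size filter used before testing."""
    return min_size <= term_size <= max_size

# Hypothetical example: 100 background genes, a term with 10 annotated genes,
# 5 DE genes in the query, 2 of which are annotated to the term.
p = hypergeom_enrichment_p(n_background=100, n_in_term=10, n_query=5, n_overlap=2)
```

The choice of background matters: restricting it to genes retained after QC, as described above, avoids inflating significance with genes that could never have been detected.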

Subpopulation analysis of chondrocytes
We performed the subclustering analysis of the InfC, preInfC and preHTC populations using the FindSubCluster function implemented in Seurat.22 88 Specifically, we performed an integrative analysis of InfCs and preInfCs from the 16 OA samples and an integrative analysis of preHTCs from all 19 samples, and then applied dimensionality reduction (RunPCA, k=30) and clustering (FindClusters, Louvain algorithm, resolution=0.1) to determine the preInfC, InfC and preHTC subpopulations.

Defining inflammatory score of chondrocytes
The inflammatory score was used to evaluate the degree of inflammation of individual chondrocytes on the basis of their expression of a predefined gene set obtained from a previous study.58 Such scores were calculated using the AddModuleScore function in Seurat with default parameter settings. We used the median inflammatory score of a given chondrocyte subpopulation as its representative score.
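A simplified version of this scoring can be sketched as follows. Seurat's AddModuleScore subtracts the average expression of control genes drawn from matching expression bins; the sketch below skips the binning and takes a fixed, hypothetical control set, so it only approximates the idea. Gene names, subpopulation labels and expression values are placeholders.

```python
from statistics import mean, median

def module_score(expr, signature, control):
    """Mean signature expression minus mean control expression for one cell."""
    return mean(expr[g] for g in signature) - mean(expr[g] for g in control)

def representative_scores(cells, signature, control):
    """Median module score per subpopulation."""
    by_group = {}
    for cell in cells:
        score = module_score(cell["expr"], signature, control)
        by_group.setdefault(cell["subpopulation"], []).append(score)
    return {group: median(scores) for group, scores in by_group.items()}

# Hypothetical signature, control genes and cells; not data from the study.
signature = ["SIG1", "SIG2"]
control = ["CTRL1", "CTRL2"]
cells = [
    {"subpopulation": "InfC-1",
     "expr": {"SIG1": 2.0, "SIG2": 3.0, "CTRL1": 1.0, "CTRL2": 1.0}},
    {"subpopulation": "InfC-1",
     "expr": {"SIG1": 1.0, "SIG2": 1.0, "CTRL1": 1.0, "CTRL2": 1.0}},
    {"subpopulation": "InfC-1",
     "expr": {"SIG1": 3.0, "SIG2": 3.0, "CTRL1": 1.0, "CTRL2": 1.0}},
    {"subpopulation": "preInfC-1",
     "expr": {"SIG1": 1.0, "SIG2": 2.0, "CTRL1": 1.0, "CTRL2": 1.0}},
]
scores = representative_scores(cells, signature, control)
```

Using the median rather than the mean keeps the representative score robust to a few cells with extreme expression.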

Identifying OA status-associated chondrocytes
To investigate OA-associated chondrocytes, we performed the Scissor analysis (V.2.0.0)66 on the scRNA-seq data and two bulk RNA-seq datasets (ie, GSE114007 and E-MTAB-4304), and set the parameter that balances the L1 norm and the network-based penalties to α=0.02. The chondrocytes that negatively and positively correlated with the phenotype were labelled as Scissor− and Scissor+, respectively.

Classification using logistic regression models
We fitted a logistic regression model to predict OA status using the LogisticRegression function implemented in sklearn (https://scikit-learn.org/stable/). The parameter settings for model training were penalty='elasticnet' and solver='saga'. The discriminative genes were the DE genes of each chondrocyte population identified by pseudobulk analysis. Leave-one-out cross-validation (LOOCV) was used for model training. The performance of the classification models was assessed by the area under the receiver operating characteristic curve (AUC).
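The evaluation scheme, LOOCV followed by an AUC computed on the pooled held-out scores, can be sketched in plain Python as below. To keep the example dependency-free, the classifier is replaced by a deliberately trivial nearest-centroid stand-in; it is not the elastic-net logistic regression used in the study, and all names and feature values are hypothetical.

```python
from statistics import mean

def centroid_score(train_x, train_y, test_x):
    """Stand-in scorer: higher when test_x is closer to the class-1 centroid."""
    def centroid(label):
        rows = [x for x, y in zip(train_x, train_y) if y == label]
        return [mean(col) for col in zip(*rows)]
    def sq_dist(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))
    return sq_dist(test_x, centroid(0)) - sq_dist(test_x, centroid(1))

def loocv_scores(x, y, scorer):
    """Score each sample with a model fitted on all other samples."""
    scores = []
    for i in range(len(x)):
        train_x = x[:i] + x[i + 1:]
        train_y = y[:i] + y[i + 1:]
        scores.append(scorer(train_x, train_y, x[i]))
    return scores

def auc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly,
    counting ties as half."""
    pos = [s for s, y in zip(scores, y_true) if y == 1]
    neg = [s for s, y in zip(scores, y_true) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical pseudobulk features (two genes) for three OA (1) and
# three control (0) samples.
x = [[2.0, 0.1], [1.8, 0.3], [2.2, 0.2], [0.2, 1.9], [0.1, 2.1], [0.3, 1.7]]
y = [1, 1, 1, 0, 0, 0]
held_out = loocv_scores(x, y, centroid_score)
performance = auc(y, held_out)
```

Any scorer with the same three-argument signature can be passed to loocv_scores, so the stand-in can be swapped for a fitted logistic regression without changing the evaluation code.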

The validation of DE genes in PubMed
We used the RISmed R package (V.2.3.0) to validate the detected DE genes and enriched pathways between patients with OA and non-OA controls in Geo-seq data. First, we searched PubMed with the term 'osteoarthritis, cartilage, chondrocyte' and the gene symbol of each analysed gene from 1 January 2000 to 12 December 2023, selecting for primary research studies. We used the function EUtilsQuery to extract bibliographic content from PubMed. Out of the 14 062 genes, a total of 4710 had at least one publication. We considered those genes with ≥1 publication (the 50% quantile of the number of publications) as OA-associated genes, resulting in a total of 2886 genes.
We defined OA-related pathways with a similar approach. Out of the 22 874 pathways obtained from the MSigDB C5 collection, 12 990 had at least one publication. We considered pathways with ≥12 publications (the 50% quantile of the number of publications) as OA-associated pathways, resulting in a total of 6552 pathways.

Lentivirus construction and amplification
CD74 overexpressing lentivirus was directly synthesised and amplified by Tsingke (China). In brief, the coding sequence of CD74 was cloned into the lentiviral vector PDS279_pL-CMV-GFP-ccdB-puro (Tsingke, China). The virus particles were constructed and amplified in HEK293T cells and collected by ultracentrifugation. The supernatant containing the lentiviral particles was collected at 48 and 60 hours after transfection and mixed before ultracentrifugation at 19 400g for 2.5 hours at 4°C. We seeded the human chondrocyte cell line C28/I2 on a 12-well plate and infected the cells with CD74 overexpressing lentivirus or control lentivirus at a multiplicity of infection of 100. Cells were harvested at 72 hours after infection for subsequent analysis. The results of qRT-PCR were plotted using GraphPad PRISM V.6 (GraphPad, San Diego, California, USA).
Figure 1 The single-cell landscape of human knee cartilage in WB, NWB and non-OA controls. (A) Schematic of workflows for transcriptomic profiling of human articular cartilage using scRNA-seq. The scRNA-seq data (n=11, 5 females and 6 males) were assayed by 10x Genomics protocols, including 8 paired WB and NWB samples, and 3 non-OA controls. (B) The top panel shows the visualisation of chondrocyte populations using UMAP, where a total of 135 896 chondrocytes were analysed by the integrative analysis of 8 paired WB and NWB samples and 3 non-OA controls. There is a total of 11 populations, that is, RegC (regulator chondrocytes, green), ProC (proliferation chondrocytes, yellow), HomC (homeostasis chondrocytes, purple), preHTC (azure), EC (effector chondrocytes, red), FC (fibrocartilage chondrocytes, blue), HTC (orange), RepC (reparative chondrocytes, aqua), preFC (pink), preInfC (crimson) and InfC (brown). Each cell population was annotated by a given marker gene list explored in this study. The preInfC and InfC are new chondrocyte populations. The bar plot in the bottom panel shows the corresponding cell compositions among WB, NWB and non-OA control samples. The cell composition is similar between WB and NWB, while it differs markedly from that of the non-OA control samples. Proportions were scaled by the total number of cells assayed. (C) The cell density plot shows the visualisation of the average cell density within the non-OA control (upper panel), NWB (middle panel) and WB (bottom panel). Regions with higher density are represented by darker colours. The preHTC, InfC, preInfC and FC are strongly enriched in NWB and WB. (D) The dot plot shows the mean expression of the marker genes for the 11 major cell populations. The colour represents the average scaled gene expression level (z-score) and the dot size represents the percentage of cells in which the marker gene was detected in each cell population. (E) The marker gene of ProC, BMP2, is validated by IHC staining. (F) The marker gene of HTC, IBSP, is validated by IHC. (G) The marker gene of RegC, CHI3L1, is validated by IHC. (H) The marker gene of preInfC, IFI27, is validated by IHC. The quantification of positive cells from different zones (ie, SZ, MZ and DZ) is displayed by bar plots (replicates n=6). Because the AS and SZ could hardly be distinguished during IHC staining, the AS and SZ of the Geo-seq results were merged and presented as SZ in the IHC quantification. The scale bar: left, 500 μm; right, 50 μm. AS, articular surface; DZ, deep zone; Geo-seq, laser microdissection-based RNA sequencing; IHC, immunohistochemistry; InfC, inflammatory chondrocyte; MZ, middle zone; NS, not significant; NWB, non-weight-bearing; OA, osteoarthritis; preFC, prefibrocartilage chondrocyte; preHTC, prehypertrophic chondrocyte; preInfC, pre-inflammatory chondrocyte; scRNA-seq, single-cell RNA sequencing; SZ, superficial zone; UMAP, uniform manifold approximation and projection; WB, weight-bearing. *p<0.05, ***p<0.001.

Figure 3 The roles of the InfC and preInfC populations in patients with OA and non-OA controls. (A) The marker genes (ie, CD74 and GPR183) for the InfC population are validated by IHC. The quantification of positive cells from different zones (ie, SZ, MZ and DZ) is displayed by bar plots (replicates n=6). Because the AS and SZ could hardly be distinguished during IHC staining, the AS and SZ of the Geo-seq results were merged and presented as SZ in the IHC quantification. (B) A heatmap shows cell-cell communications in patients with OA. The grey bar on the y-axis denotes the relative strength of the predicted signalling pathway in the overall network, while the colour bar on the x-axis represents the relative contribution of the chondrocyte population to the overall signalling milieu (top x-axis) for patients with OA. MIF was ranked as the strongest signalling pathway that interacted with chondrocyte populations in OA samples. (C) Quantitative real-time PCR was performed to measure the relative mRNA levels of MIF, CD74, IL-1β and TNF in patients with OA and controls (replicates, n=6). (D) Quantitative real-time PCR was performed to measure the relative mRNA levels of CD74, MIF, IL-1β and TNF after CD74 overexpression with lentivirus (LV) (replicates, n=6). (E) The visualisation of the subpopulations of preInfC using UMAP projections, that is, preInfC-1 (brown), preInfC-2 (blue) and preInfC-3 (orange), where a total of 533 chondrocytes were analysed together by the integrative analysis of patients with OA. The marker genes of each subpopulation were also annotated. (F) The visualisation of the subpopulations of InfC using UMAP projections, that is, InfC-1 (cyan), InfC-2 (crimson), InfC-3 (dark green) and InfC-4 (green), where a total of 1499 chondrocytes were analysed together by the integrative analysis of patients with OA. (G) The violin plot shows the inflammatory score for the subpopulations of preInfC and InfC. The inflammatory score of each chondrocyte was calculated from the average expression of a predefined inflammatory-associated gene set. The inflammatory scores increased gradually from preInfC to InfC. AS, articular surface; DZ, deep zone; IHC, immunohistochemistry; IL, interleukin; InfC, inflammatory chondrocyte; MIF, macrophage migration inhibitory factor; mRNA, messenger RNA; MZ, middle zone; NS, not significant; OA, osteoarthritis; preInfC, pre-inflammatory chondrocyte; SZ, superficial zone; TNF, tumour necrosis factor; UMAP, uniform manifold approximation and projection. *p<0.05, **p<0.01, ***p<0.001, ****p<0.0001.

Figure 4 The OA-associated changes between patients with OA and non-OA controls. (A) The Manhattan-like plot demonstrates the population-specific DE genes between patients with OA and non-OA controls. The numbers on the top panel denote the number of DE genes for each population. The dot size represents the value of −log10 (adjusted p). (B) The subnetworks identified by PhenomeExpress. The DE genes of preHTC were enriched in a pMAPK38 cascade subnetwork. (C) The visualisation of subpopulations of preHTC using UMAP projections, that is, preHTC-1 (blue), preHTC-2 (green), preHTC-3 (brown) and preHTC-4 (red), where a total of 25 778 chondrocytes were analysed by the integrative analysis of all 19 samples. (D) The bar plot shows the significant GO terms for each subpopulation. (E) The Manhattan-like plot shows the zone-specific DE genes between patients with OA and non-OA controls, that is, AS (red), SZ (yellow), MZ (blue) and DZ (green). The numbers on the top panel or bottom panel represent the number of significantly upregulated or downregulated DE genes for each zone. (F) The bar plot shows the significant GO terms that were enriched by the zone-specific DE genes. Notably, the MZ-specific and DZ-specific DE genes were not enriched in any biologically meaningful pathways. (G) The subnetworks identified by PhenomeExpress. The DE genes of AS were enriched in the response to chemokine subnetwork and those of SZ were enriched in the regulation of transcription subnetwork. (H) The top-ranked DE gene of AS or SZ between patients with OA and non-OA controls, BAG1, was validated by IHC staining. (I) The top-ranked DE gene of AS or SZ between patients with OA and non-OA controls, CD9, was validated by IHC staining. The quantification of positive cells from different zones (ie, SZ, MZ and DZ) is displayed by bar plots (replicates n=6). Because the AS and SZ could hardly be distinguished during IHC staining, the AS and SZ of the Geo-seq results were merged and presented as SZ in the IHC quantification. The scale bar: left, 500 μm; right, 50 μm. AS, articular surface; DZ, deep zone; GO, Gene Ontology; IHC, immunohistochemistry; InfC, inflammatory chondrocyte; MZ, middle zone; NS, not significant; OA, osteoarthritis; preInfC, pre-inflammatory chondrocyte; preFC, prefibrocartilage chondrocyte; preHTC, prehypertrophic chondrocyte; SZ, superficial zone; UMAP, uniform manifold approximation and projection. *p<0.05, **p<0.01, ***p<0.001.

Figure 5 The OA-critical cell populations in bulk RNA-seq data and GWAS associations. (A) The heatmap shows cell compositions of 18 healthy controls and 20 patients with OA from GSE114007. The values in brackets represent the Spearman's correlation (r) between patients with OA and non-OA controls. FC and preHTC are highly correlated with OA status. (B) The bar plot depicts the fraction of chondrocytes that are related to OA, where Scissor+ denotes positively correlated chondrocytes (pink), Scissor− denotes negatively correlated chondrocytes (green) and the others are uncorrelated chondrocytes (grey). The top three populations, ProC, FC and preHTC, are positively correlated with OA status. (C) The heatmap illustrates the two subtypes of 131 OA samples identified by hierarchical clustering with the ward.D2 method. The 131 samples are clustered into 2 subtypes: group A (n=43), an inflammation-related subtype with a smaller fraction of preFCs, and group B (n=88), a non-inflammatory-related subtype with a larger fraction of preFCs. (D) The volcano plot shows the differentially expressed genes between group A and group B. (E) The significant Gene Ontology terms enriched by the upregulated/downregulated DE genes. The upregulated DE genes in group A (inflammation-related subtype) were mainly involved in inflammatory and immune response-associated pathways, while the upregulated DE genes in group B (non-inflammatory-related subtype) were mainly involved in chondrocyte proliferation pathways. (F) The scatter plot shows the correlation between the cell population contribution to OA as determined from GWAS of individuals with Asian ancestry (n=1979) and the corresponding contribution derived from GWAS of individuals with European ancestry (n=406 241). The HTC population is significantly associated with OA in individuals with European ancestry and has the smallest p value in individuals with East Asian ancestry. GWAS, genome-wide association studies; InfC, inflammatory chondrocyte; OA, osteoarthritis; preFC, prefibrocartilage chondrocyte; preHTC, prehypertrophic chondrocyte.

Table 1 The 33 marker genes used for annotating 11 distinct chondrocyte populations
Some of the marker genes were reported by previous studies, and the newly defined marker genes are highlighted in bold. ECM, extracellular matrix; preFC, prefibrocartilage chondrocyte; preHTC, prehypertrophic chondrocyte; preInfC, pre-inflammatory chondrocyte; TGF, transforming growth factor.