Data acquisition
RNA-sequencing data for different lesion
types in brain white matter in patients with progressive multiple sclerosis
were used for gene expression deconvolution [16]. The lesion types examined
included chronic active (n = 17), active (n = 16), inactive (n= 14), NAWM (n=
21), and remyelinating lesions (n= 5). White matter (n= 25) was used as normal
controls. The corresponding raw count matric was retrieved from Gene Expression
Omnibus GSE138614. A total of 98 samples that had complete data for
RNA-sequencing were used. Raw count data was converted to TPM space before
downstream analyses.
Lesion purity estimation
The ESTIMATE algorithm was used to
estimate lesion purity (defined broadly as the proportion of lesion cells in
the tissue sample). The ESTIMATE formula and pipeline uses known mRNA
expression signatures of stromal and immune cells to infer lesion purity
(Supplementary Figure 1) [29]
Lesion and stroma gene expression
deconvolution
The total mRNA expression
for a specific gene in a bulk lesion sample s
can be modeled as follows where
represents the
estimated lesion purity in sample s,
represents the mean expression for the gene in
the lesion compartment and
denotes the mean expression for the gene in
the stroma compartment [30]:
Therefore, the stroma and lesion compartment
expression levels were estimated using non-negative least-squares regression,
assuming that these expression levels are constant across the lesion samples.
Bootstrapping was used to derive 95% confidence intervals for the lesion and
stromal point estimates. TPM RNA-sequencing data was log2 transformed
before regression.
Ligand-receptor Interaction scoring
A combined database of 1380 ligand-receptor
pairs, previously curated by Ramilowski et al. and Ghoshdastider et
al. were used to annotate the inferred compartmental gene expression output [30,31].To quantify the ligand-receptor interactions
from the deconvolved data, the ligand-receptor crosstalk (RC) metric was used
[31]. Thus, the molar concentration of ligand-receptor [LR]
interaction complexes in equilibrium can be modeled with:where [L]
and [R] represent the molar concentrations of individual ligand L and
receptor R, respectively, along with the dissociation constant kD-1.
As molar concentrations were not available, mRNA expression levels were treated
as reasonable proxies. Thus, if the following conditions are assumed: (1) the
inferred mRNA expression values are reasonable proxies for the ligand and
receptor concentrations (2) ligand-receptor kinetics are constant across all
samples (3) assumptions of Law of Mass Action are met, then the following
Relative Crosstalk (RC) score can be applied (example given below for specific
lesion-lesion ligand-receptor interaction):where
the numerator represents the ligand receptor complex of interest and the
denominator represents all possible directions of ligand-receptor interactions.
This simplifies to:and
since the dissociation constant is cancelled, it does not need to be accounted
for in the downstream analysis. Therefore, the relative crosstalk score for an
example lesion ligand and lesion receptor interaction can be modeled as
follows:In
summary, for a given ligand-receptor pair, this score
estimates the relative changes of the unidirectional ligand-receptor binding
complex between the compartments of interest compared to all other possible
directions between the two compartments. Also, it accounts for interactions in
matched normal white matter control tissue.