This page provides an alphabetical catalog of enclone variables, along with some general information.
enclone variables are complicated because there are different types and different ways that they can be used. This pages specifies the system, but also provides an example, and for a first pass, you may wish to just read that.
enclone allows variables to be used in the following ways:
All variables are in one of two classes:
CVARS
or CVARSP
, these need to be suffixed by the chain
number.All variables are specified at one of the following levels:
This table shows how the level is reflected in the output:
level | visual output | parseable output | notes | ||
---|---|---|---|---|---|
exact | cell | exact | cell | ||
cell | no | yes | no | yes | |
cell-exact | yes | yes | yes | yes | |
exact | yes | no | yes | yes | |
clono | yes | no | yes | yes | |
group | no | no | yes | yes | not available at all for visual output |
dataset | no | no | no | no | currently only accessible via DVARS |
global | no | no | no | no | currently only accessible via GVARS |
Here are some rules about how this works (and please see the example below, which may be more instructive):
PER_CELL
, then visual output is in a "split" mode,
appearing both at the exact and cell levels.PCELL
, then parseable output is instead displayed
at the cell level. Moreover, every cell-exact variable has a special _cell
version
that can be used in this mode. This allows you to use both the exact version and the cell
version.The following example illustrates how this applies to the cell-exact variable u
:
[1] GROUP = 1 CLONOTYPES = 6 CELLS
[1.1] CLONOTYPE = 6 CELLS
βββββββββββββ¬ββββββββββββββββββββββββββββββββββββββββ¬ββββββββββββββββββββββββββββββ
β β CHAIN 1 β CHAIN 2 β
β β 98|IGHV2-5 β 13|IGHD2-15 β 57|IGHJ5 β 349|IGLV3-1 β 311|IGLJ2 β
β βββββββββββββββββββββββββββββββββββββββββΌββββββββββββββββββββββββββββββ€
β β 11111111111111111111 β 11111111111 β
β β 8 11111222222222233333 β 6 00000111111 β
β β 5 56789012345678901234 β 2 56789012345 β
β β ββββββββCDR3ββββββββ β ββββCDR3βββ β
βreference β S β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦W β V CQAWDβ¦β¦β¦β¦β¦β¦ β
βdonor ref β S β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦W β V CQAWDβ¦β¦β¦β¦β¦β¦ β
βββββββββββββΌββββββββββββββββββββββββββββββββββββββββΌββββββββββββββββββββββββββββββ€
β# n β x ..x.......xx........ u const β x ........... u constβ
β1 3 β S CALMGTYCSGDNCYSWFDPW 592 IGHM β V CQAWDSSVVVF 2995 IGLC2β
β2 1 β S CALMGTYCSGDNCYSWFDPW 6218 IGHG1 β V CQAWDSSVVVF 15182 IGLC2β
β3 1 β T CALMGTYCSGDNCYSWFDPW 4033 IGHG1 β V CQAWDSSVVVF 6777 IGLC2β
β4 1 β S CAHMGTYCSGGSCYSWFDPW 18 IGHG1 β V CQAWDSSVVVF 592 IGLC2β
βββββββββββββ΄ββββββββββββββββββββββββββββββββββββββββ΄ββββββββββββββββββββββββββββββ
barcodes u1
CGTTCTGGTAGCCTCG-1,GCGCCAAGTATTACCG-1,TGGCCAGGTTACGCGC-1 592
GACCTGGAGCGTGAAC-1 6218
CTCCTAGCAAGGACAC-1 4033
AGCTTGACAATGGAGC-1 18
|
[1] GROUP = 1 CLONOTYPES = 6 CELLS
[1.1] CLONOTYPE = 6 CELLS
ββββββββββββββββββββββββββββ¬ββββββββββββββββββββββββββββββββββββββββ¬ββββββββββββββββββββββββββββββ
β β CHAIN 1 β CHAIN 2 β
β β 98|IGHV2-5 β 13|IGHD2-15 β 57|IGHJ5 β 349|IGLV3-1 β 311|IGLJ2 β
β βββββββββββββββββββββββββββββββββββββββββΌββββββββββββββββββββββββββββββ€
β β 11111111111111111111 β 11111111111 β
β β 8 11111222222222233333 β 6 00000111111 β
β β 5 56789012345678901234 β 2 56789012345 β
β β ββββββββCDR3ββββββββ β ββββCDR3βββ β
βreference β S β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦W β V CQAWDβ¦β¦β¦β¦β¦β¦ β
βdonor ref β S β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦W β V CQAWDβ¦β¦β¦β¦β¦β¦ β
ββββββββββββββββββββββββββββΌββββββββββββββββββββββββββββββββββββββββΌββββββββββββββββββββββββββββββ€
β# barcode n β x ..x.......xx........ u const β x ........... u constβ
β1 3 β S CALMGTYCSGDNCYSWFDPW 592 IGHM β V CQAWDSSVVVF 2995 IGLC2β
β CGTTCTGGTAGCCTCG-1 β 592 β 2995 β
β GCGCCAAGTATTACCG-1 β 104 β 172 β
β TGGCCAGGTTACGCGC-1 β 6617 β 12097 β
β2 1 β S CALMGTYCSGDNCYSWFDPW 6218 IGHG1 β V CQAWDSSVVVF 15182 IGLC2β
β GACCTGGAGCGTGAAC-1 β 6218 β 15182 β
β3 1 β T CALMGTYCSGDNCYSWFDPW 4033 IGHG1 β V CQAWDSSVVVF 6777 IGLC2β
β CTCCTAGCAAGGACAC-1 β 4033 β 6777 β
β4 1 β S CAHMGTYCSGGSCYSWFDPW 18 IGHG1 β V CQAWDSSVVVF 592 IGLC2β
β AGCTTGACAATGGAGC-1 β 18 β 592 β
ββββββββββββββββββββββββββββ΄ββββββββββββββββββββββββββββββββββββββββ΄ββββββββββββββββββββββββββββββ
barcode u1 u_cell1
CGTTCTGGTAGCCTCG-1 592 592
GCGCCAAGTATTACCG-1 592 104
TGGCCAGGTTACGCGC-1 592 6617
GACCTGGAGCGTGAAC-1 6218 6218
CTCCTAGCAAGGACAC-1 4033 4033
AGCTTGACAATGGAGC-1 18 18
|
VAR_DEF=...
, like
this:
enclone BCR=86237 GEX=85679 VAR_DEF="sum:CD19_ab + CD25_ab" LVARSP=CD19_ab,CD25_ab,sum
CDR3=CARSFFGDTAMVMFQAFDPW PER_CELL FOLD_HEADERS
[1] GROUP = 1 CLONOTYPES = 3 CELLS
[1.1] CLONOTYPE = 3 CELLS
ββββββββββββββββββββββββββββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββ¬ββββββββββββββββββββββββ
β β CHAIN 1 β CHAIN 2 β
β β 194|IGHV4-59 β 349|IGLV3-1 β
β β β 29|IGHD5-18 β β 311|IGLJ2 β
β β β 57|IGHJ5 β β
β ββββββββββββββββββββββββββββββββββββΌββββββββββββββββββββββββ€
β β 11111111111111111111 β 11111111111 β
β β 11111112222222222333 β 00000111111 β
β β 34567890123456789012 β 56789012345 β
β β ββββββββCDR3ββββββββ β ββββCDR3βββ β
βreference β β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦W β CQAWDβ¦β¦β¦β¦β¦β¦ β
βdonor ref β β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦β¦W β CQAWDβ¦β¦β¦β¦β¦β¦ β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββΌβββββββββββββββββββββββββββββββββββΌββββββββββββββββββββββββ€
β# barcode n CD19_ab CD25_ab sum β .................... u const β ........... u constβ
β1 1 1210 102 1312.0 β CARSFFGDTAMVMFQAFDPW 2 IGHD β CQAWDSSTVVF 8 IGLC2β
β GCTGGGTTCAACCAAC-1 1210 102 1312.0 β 2 β 8 β
β2 2 952 53 1004.5 β β CQAWDSSTVVF 4 IGLC2β
β ACGTCAAAGTGGTAGC-1 850 80 930.0 β β 6 β
β ATGTGTGAGAGTACCG-1 1054 25 1079.0 β β 2 β
ββββββββββββββββββββββββββββββββββββββββββββββββββββββ΄βββββββββββββββββββββββββββββββββββ΄ββββββββββββββββββββββββ
Note that spaces need to be placed around operators! We require this because some variable
names can have operators in them, for example because gene names can include -
.
Multiple invocations of VAR_DEF
are allowed.
We could easily add the ability to include calls to standard functions, if we had a list of such. Please let us know if this would be useful to you.
variable | name of the variable |
limits | footnote to things missing from the implementation (which in general are deliberate) |
class | lvar or cvar |
level | cell or exact or clono or group or global |
numeric | does the variable have numeric values* |
help page | page where variable is documented (both a link and an enclone command) |
brief description | summary, see help page for details |
*We allow for the null value (the empty string). For example, this would be used if a chain is not present for an exact subclonotype.
variable | limits | class | level | numeric | help page | brief description |
---|---|---|---|---|---|---|
(from BC or META/bc) | lvar | cell | sometimes | enclone help input |
user defined variable | |
(from INFO) | lvar | exact | sometimes | enclone help input |
user defined variable | |
<feature> |
lvar | cell-exact | yes | enclone help lvars |
count for a gene expression or antibody feature | |
<feature>_% |
lvar | cell-exact | yes | enclone help lvars |
percent of total expression for a particular gene | |
<feature>_cellular_r |
lvar | dataset | yes | enclone help display |
percent of feature reads in cells, for a given dataset | |
<feature>_cellular_u |
lvar | dataset | yes | enclone help display |
percent of feature UMIs in cells, for a given dataset | |
<feature>_max |
lvar | exact | yes | enclone help lvars |
maximum count for a feature | |
<feature>_mean |
lvar | exact | yes | enclone help lvars |
mean count for a feature (same with ΞΌ for mean ) | |
<feature>_min |
lvar | exact | yes | enclone help lvars |
minimum count for a feature | |
<feature>_sum |
lvar | exact | yes | enclone help lvars |
sum of counts for a feature (same with Ξ£ for sum ) | |
<feature>_Ξ£ |
lvar | exact | yes | enclone help lvars |
sum of counts for a feature (same with sum for Ξ£ ) | |
<feature>_ΞΌ |
lvar | exact | yes | enclone help lvars |
mean count for a feature (same with mean for ΞΌ ) | |
<dataset>_barcode |
1 | lvar | cell | no | enclone help parseable
|
barcode from the given dataset (or null) |
<dataset>_barcodes |
1 | lvar | exact | no | enclone help parseable
|
barcodes from the given dataset |
aa% |
cvar | exact | yes | enclone help cvars |
amino acid identity with donor reference | |
barcode |
2 | lvar | cell | no | enclone help parseable
|
barcode of the cell |
barcodes |
1 | lvar | exact | no | enclone help parseable
|
barcodes for the exact subclonotype |
1: only implemented for parseable output |
2: only implemented for parseable output, and as an automatic field for clonotype tables
in PER_CELL mode |
variable | limits | class | level | numeric | help page | brief description |
---|---|---|---|---|---|---|
cdiff |
cvar | exact | no | enclone help cvars |
differences of const region with universal reference | |
cdr*_aa |
cvar | exact | no | enclone help cvars |
CDR* amino acid sequence | |
cdr*_aa_L_R_ext |
cvar | exact | no | enclone help cvars |
CDR* region with specified extension length | |
cdr*_aa_north |
cvar | exact | no | enclone help cvars |
North version of CDR* amino acid sequence | |
cdr*_aa_ref |
cvar | clono | no | enclone help cvars |
CDR* amino acid sequence for universal reference | |
cdr*_dna |
cvar | exact | no | enclone help cvars |
CDR* nucleotide sequence | |
cdr*_dna_ref |
cvar | clono | no | enclone help cvars |
CDR* nucleotide sequence for universal reference | |
cdr*_len |
cvar | exact | yes | enclone help cvars |
length of CDR* amino acid sequence | |
cdr3_aa_conp |
cvar | clono | no | enclone help cvars |
CDR3 amino acid consensus, symbols at variants | |
cdr3_aa_conx |
cvar | clono | no | enclone help cvars |
CDR3 amino acid clonotype consensus, Xs at variants | |
cdr3_start |
cvar | exact | yes | enclone help parseable
|
nucleotide start of CDR3 sequence on full sequence | |
clen |
cvar | exact | yes | enclone help cvars |
length of observed constant region | |
clonotype_id |
1 | lvar | clono | yes | enclone help parseable
|
identifier of clonotype within clonotype group |
clonotype_ncells |
lvar | clono | yes | enclone help parseable
|
number of cells in the clonotype | |
comp |
cvar | exact | yes | enclone help cvars |
CDR3 complexity number | |
const |
cvar | exact | no | enclone help cvars |
constant region name | |
const_id |
cvar | exact | yes | enclone help parseable
|
numerical identifier of constant region (or null) | |
count_* |
lvar | exact | yes | enclone help lvars |
count amino acid motifs | |
cred |
lvar | cell-exact | yes | enclone help lvars |
credibility assessed using GEX data |
1: only implemented for parseable output |
variable | limits | class | level | numeric | help page | brief description |
---|---|---|---|---|---|---|
d1_name |
cvar | exact | no | enclone help cvars |
optimal D segment name | |
d2_name |
cvar | exact | no | enclone help cvars |
second best D segment name | |
d1_score |
cvar | exact | yes | enclone help cvars |
score of optimal D segment | |
d2_score |
cvar | exact | yes | enclone help cvars |
score of second best D segment | |
datasets |
lvar | cell-exact | no | enclone help lvars |
dataset names | |
dna% |
cvar | exact | yes | enclone help cvars |
nucleotide identity with donor reference | |
donors |
lvar | cell-exact | no | enclone help lvars |
donor names | |
dref |
lvar | exact | yes | enclone help lvars |
nucleotide distance to donor reference | |
dref_aa |
lvar | exact | yes | enclone help lvars |
amino acid distance to donor reference | |
d_delta |
cvar | exact | yes | enclone help cvars |
score delta between top two D gene assignments | |
d_donor |
cvar | exact | yes | enclone help cvars |
distance from donor reference | |
d_frame |
cvar | exact | yes | enclone help parseable
|
reading frame of D segment (0, 1, 2 or null) | |
d_id |
cvar | clono | yes | enclone help parseable
|
D region id | |
d_inconsistent_% |
lvar | global | yes | D genes |
inconsistency percent for D gene assignment | |
d_inconsistent_n |
lvar | global | yes | D genes |
sample size for computation of d_inconsistent_% | |
d_name |
cvar | clono | no | enclone help parseable
|
D region name | |
d_start |
cvar | exact | yes | enclone help parseable
|
start of D on full nucleotide sequence (or null) | |
d_univ |
cvar | exact | yes | enclone help cvars |
distance from universal reference | |
d_Ξ |
cvar | exact | yes | enclone help cvars |
score delta between top two D gene assignments |
variable | limits | class | level | numeric | help page | brief description |
---|---|---|---|---|---|---|
edit |
cvar | exact | no | enclone help cvars |
edit to get from reference CDR3 | |
exact_subclonotype_id |
lvar | exact | yes | enclone help cvars |
identifier of exact subclonotype | |
filter |
lvar | cell | no | enclone help lvars |
name of filter that would be applied (if filters off) | |
fwr*_aa |
cvar | exact | no | enclone help cvars |
FWR* amino acid sequence | |
fwr*_aa_ref |
cvar | clono | no | enclone help cvars |
FWR* amino acid seq for universal reference | |
fwr*_dna |
cvar | exact | no | enclone help cvars |
FWR* nucleotide sequence | |
fwr*_dna_ref |
cvar | clono | no | enclone help cvars |
FWR* nucleotide seq for universal reference | |
fwr*_len |
cvar | exact | yes | enclone help cvars |
length of FWR* amino acid sequence |
variable | limits | class | level | numeric | help page | brief description |
---|---|---|---|---|---|---|
g<d> |
lvar | exact | yes | enclone help lvars |
exact subclonotype group, by Hamming distance | |
gex |
lvar | cell-exact | yes | enclone help lvars |
number of GEX UMIs | |
gex_max |
lvar | exact | yes | enclone help lvars |
maximum number of GEX UMIs across exact subclonotype | |
gex_mean |
lvar | exact | yes | enclone help lvars |
mean of GEX UMIs across exact subclonotype (= gex_ΞΌ ) | |
gex_min |
lvar | exact | yes | enclone help lvars |
minimum number of GEX UMIs across exact subclonotype | |
gex_sum |
lvar | exact | yes | enclone help lvars |
sum of GEX UMIs across exact subclonotype (= gex_Ξ£ ) | |
gex_Ξ£ |
lvar | exact | yes | enclone help lvars |
sum of GEX UMIs across exact subclonotype (= gex_sum ) | |
gex_ΞΌ |
lvar | exact | yes | enclone help lvars |
mean of GEX UMIs across exact subclonotype (= gex_mean ) | |
group_id |
1 | lvar | group | yes | enclone help parseable
|
identifier of clonotype group |
group_ncells |
1 | lvar | group | yes | enclone help parseable
|
number of cells in clonotype group |
inkt |
lvar | exact | no | enclone help lvars |
evidence for iNKT cell | |
j_id |
cvar | clono | yes | enclone help parseable
|
J region id | |
j_name |
cvar | clono | no | enclone help parseable
|
J region name | |
mait |
lvar | exact | no | enclone help lvars |
evidence for MAIT cell |
1: only implemented for parseable output |
variable | limits | class | level | numeric | help page | brief description |
---|---|---|---|---|---|---|
n |
3 | lvar | cell-exact | yes | enclone help lvars |
number of cells |
n_<name> |
3 | lvar | cell-exact | yes | enclone help lvars |
number of cells associated to the given name |
n_gex |
lvar | cell-exact | yes | enclone help lvars |
number of cells seen by GEX pipeline | |
nbc |
lvar | cell | yes | enclone help lvars |
numerically encoded barcode | |
nchains |
lvar | clono | yes | enclone help lvars |
number of chains in the clonotype | |
nchains_present |
lvar | exact | yes | enclone help lvars |
number of chains present in an exact subclonotype | |
ndiff<n>vj |
cvar | exact | yes | enclone help cvars |
number of base differences with exact subclonotype n | |
near |
lvar | exact | yes | enclone help lvars |
Hamming distance to nearest neighbor | |
notes |
cvar | exact | no | enclone help cvars |
notes for exact subclonotype | |
origins |
lvar | cell-exact | no | enclone help lvars |
origin names | |
q<n>_ |
cvar | exact | no | enclone help parseable
|
read quality scores at position n | |
r |
cvar | cell-exact | yes | enclone help cvars |
number of reads supporting chain | |
r_max |
cvar | exact | yes | enclone help cvars |
maximum chain read count across exact subclonotype | |
r_mean |
cvar | exact | yes | enclone help cvars |
mean chain reads across exact subclonotype (= r_mean ) | |
r_min |
cvar | exact | yes | enclone help cvars |
minimum chain read count across exact subclonotype | |
r_sum |
cvar | exact | yes | enclone help cvars |
sum of chain read counts across exact subclonotype (= r_Ξ£ ) | |
r_Ξ£ |
cvar | exact | yes | enclone help cvars |
sum of chain reads across exact subclonotype (= r_sum ) | |
r_ΞΌ |
cvar | exact | yes | enclone help cvars |
mean chain read count across exact subclonotype (= r_ΞΌ ) |
3: PER_CELL values are not displayed because they are all 1 ,
which would be distracting; also there is no _cell version |
variable | limits | class | level | numeric | help page | brief description |
---|---|---|---|---|---|---|
seq |
1 | cvar | exact | no | enclone help parseable
|
full nucleotide sequence of exact subclonotype |
share_indices_aa |
1 | cvar | clono | no | enclone help parseable
|
shared amino acid positions |
share_indices_dna |
1 | cvar | clono | no | enclone help parseable
|
shared nucleotide positions |
u |
cvar | cell-exact | yes | enclone help cvars |
number of UMIs supporting chain | |
u_max |
cvar | exact | yes | enclone help cvars |
maximum chain UMIs across exact subclonotype | |
u_mean |
cvar | exact | yes | enclone help cvars |
mean chain UMIs across exact subclonotype (= u_ΞΌ ) | |
u_min |
cvar | exact | yes | enclone help cvars |
minimum chain UMIs across exact subclonotype | |
u_sum |
cvar | exact | yes | enclone help cvars |
sum of chain UMIs for exact subclonotype (= u_Ξ£ ) | |
u_Ξ£ |
cvar | exact | yes | enclone help cvars |
sum of chain UMIs across exact subclonotype (= u_sum ) | |
u_ΞΌ |
cvar | exact | yes | enclone help cvars |
mean chain UMIs for exact subclonotype (= u_mean ) | |
udiff |
cvar | exact | no | enclone help cvars |
differences of 5'-UTR region with universal reference | |
ulen |
cvar | exact | yes | enclone help cvars |
length of observed 5'-UTR sequence | |
utr_id |
cvar | exact | yes | enclone help parseable
|
numerical identifier of 5'-UTR region (or null) | |
utr_name |
cvar | exact | yes | enclone help parseable
|
name 5'-UTR region (or null) |
1: only implemented for parseable output |
variable | limits | class | level | numeric | help page | brief description |
---|---|---|---|---|---|---|
v_id |
cvar | clono | yes | enclone help parseable
|
V region id | |
v_name |
cvar | clono | no | enclone help parseable
|
V region name | |
v_start |
cvar | exact | yes | enclone help parseable
|
start of V on full nucleotide sequence | |
var |
cvar | clono | no | enclone help cvars |
bases at position in chain that vary across the clonotype | |
var_aa |
1 | cvar | exact | no | enclone help parseable
|
variant residue indices in clonotype (including synonymous) |
var_indices_aa |
1 | cvar | clono | no | enclone help parseable
|
variable amino acid positions |
var_indices_dna |
1 | cvar | clono | no | enclone help parseable
|
variable nucleotide positions |
vj_aa |
1 | cvar | exact | no | enclone help parseable
|
amino acid sequence of V..J |
vj_aa_nl |
1 | cvar | exact | no | enclone help parseable
|
amino acid sequence of V..J, excluding leader |
vj_seq |
1 | cvar | exact | no | enclone help parseable
|
nucleotide sequence of V..J |
vj_seq_nl |
1 | cvar | exact | no | enclone help parseable
|
nucleotide sequence of V..J, excluding leader |
vjlen |
cvar | exact | yes | enclone help cvars |
length in bases of V..J |
1: only implemented for parseable output |