Shown below is the output of the command:
enclone BCR=123089 CDR3=CARRYFGVVADAFDIW
[1] GROUP = 1 CLONOTYPES = 13 CELLS
[1.1] CLONOTYPE = 13 CELLS
┌───────────┬────────────────────────────────────────┬──────────────────────────────┐
│ │ CHAIN 1 │ CHAIN 2 │
│ │ 740.1.2|IGHV4-30-4 ◆ 53|IGHJ3 │ 253|IGKV1D-39 ◆ 217|IGKJ5 │
│ ├────────────────────────────────────────┼──────────────────────────────┤
│ │ 1 1111111111111111 │ 1 111111111111 │
│ │ 256890 1111122222222223 │ 0 011111111112 │
│ │ 010148 5678901234567890 │ 6 901234567890 │
│ │ ══════CDR3══════ │ ════CDR3════ │
│reference │ LDPPSA ◦◦◦◦◦◦◦◦◦◦◦◦◦◦◦W │ T CQQ◦◦◦◦◦◦◦◦◦ │
│donor ref │ VGHPSA ◦◦◦◦◦◦◦◦◦◦◦◦◦◦◦W │ T CQQ◦◦◦◦◦◦◦◦◦ │
├───────────┼────────────────────────────────────────┼──────────────────────────────┤
│# n │ ...x.. ..............x. u const │ x ......x..... u const│
│1 10 │ VGHPSA CARRYFGVVADAFDIW 3953 IGHM │ T CQQSYSTPPITF 10648 IGKC │
│2 2 │ VGHSSA CARRYFGVVADAFDIW 2626 IGHM │ A CQQSYSPPPITF 9766 IGKC │
│3 1 │ VGHPSA CARRYFGVVADAFDIW 5 IGHM │ │
└───────────┴────────────────────────────────────────┴──────────────────────────────┘
This shows an invocation of enclone that takes one dataset as input and exhibits
all clonotypes for which some chain has the given CDR3 sequence.
What you see here is a compressed view of the entire information encoded in the
full length transcripts of the 13 cells comprising this clonotype: every base!
There is a lot to explain about the compression, so please read carefully.
• Clonotypes are grouped. Here we see just one group having one clonotype in it.
• This clonotype has three exact subclonotypes in it, the first of which has 10 cells.
• This clonotype has two chains. The reference segments for them are shown at the top.
• The notation 740.1.2 says that this V reference sequence is an alternate allele
derived from the universal reference sequence (contig in the reference file)
numbered 181, that is from donor 1 ("740.1") and is alternate allele 2 for that donor.
• Sometimes chains are missing from exact subclonotypes.
• Amino acids are assigned different colors depending on which codon they represent.
• Numbered columns show the state of particular amino acids, e.g. the first column is for amino
acid 20 in chain 1 (where 0 is the start codon). The numbers read vertically, downward!
• Universal ref: state for the contig in the reference file.
• Donor ref: state for the inferred donor germline sequence.
• ◦s are "holes" in the recombined region where the reference doesn't make sense.
• The "dot and x" line has xs where there's a difference *within* the clonotype.
• Amino acids are shown if they differ from the universal reference or are in the CDR3.
• u = median UMI count for a chain in the exact subclonotype.
• const = const region name for a chain in the exact subclonotype.
The view you see here is configurable: see the documentation at enclone help lvars
and enclone
help cvars.