enclone banner

History

This page provides a selective history of what was changed in enclone and when.

We show changes that affect users like new features, changes to results, and the like. This log starts with with initial public availability. The complete history may be seen by cloning the enclone repo and typing git log.

Breaking changes are shown in red.

Please be aware that our workflow when we make changes is to automatically update the GitHub site, including all the website pages and this page too. This happens in advance of actually making a release (which might follow in a couple days). This means that the website may describe features that are not yet available in a release (although they will be in the source code that's available). We apologize for this asynchrony! Note however that the command-line help that comes with your copy of enclone will always match its behavior.


5/9/22: Add ability to create enclone_visual sessions from the command line. Please see enclone_visual.

5/8/22: enclone now runs under Windows, with some limitations, see here.

3/31/22:

3/26/22: Add lvar jun_ins, please see enclone help lvars.

3/15/22: Add DATASET filter, please see enclone help filter.

3/14/22:

3/11/22:

3/4/22: Add ability to select functional groups by donor, see enclone help display.

3/3/22:

3/2/22:

For all of the above, please see enclone help display.

2/28/22: Add an argument BC_JOINT=filename, which functions like BC, but uses only a single file. Please see enclone help input.

2/25/22:

2/18/22: Introducing enclone_visual! This is alpha software for a graphic user interface (GUI) for enclone, that is implemented initially for Macs. Please see enclone_visual.

2/16/22: Add new grouping options, please see enclone help display.

2/14/22:

2/7/22: Remove some genes from the reference that were identical except for extension on the 3' end.

2/4/22:

1/26/22: Add ability to define variables as expressions involving other variables, using an argument VAR_DEF=.... Please see variables.

1/13/22: Make three small changes to BCR clonotyping:

1/5/22:

12/27/21:

12/24/21: Add filter option NMAX to allow barcodes having more than four contigs. Also, this is now turned on by NALL and NALL_CELL. This does not affect default behavior.

12/13/21: This version includes three changes that substantially improve clonotyping specificity. See below and enclone help how.

11/9/21:

10/26/21:

For all of the above, please see enclone help filter.

10/18/21: Allow coloring of honeycomb plots by dataset. Please see enclone plots.

10/12/21:

10/4/21: Add argument MIN_GROUP_DONORS=... to set a lower bound on the number of donors contributing to a group, for it to be printed. Please see enclone help display.

9/21/21:

9/20/21: Add a new default filter called signature filtering that under certain rare circumstances will decompose complex clonotypes that are incorrectly glued together. This filter can dramatically affect certain datasets, but has almost no effect on typical data (less than one per million clonotypes tested). Please see enclone default filters.

9/17/21:

9/15/21: Add control over PNG resolution for cell coloring by variable.

9/14/21: Fix some issues with cell coloring, affecting cell coloring by variable and SPLIT_PLOT_BY_ORIGIN.

9/11/21: Add the ability to color cells in a honeycomb plot using the values of a variable. Please see enclone plots.

8/27/21: Add new argument DVARS=... that can display dataset-level variables, for now very limited, as described at see enclone help display. In addition, values for GVARS now appear inside the summary.

8/24/21:

8/17/21: KEEP_CELL_IF now allows feature variables to appear in constraints, see enclone help special.

7/22/21:

7/14/21: Add new arguments HONEY_OUT and HONEY_IN to allow preservation of cell positions between two honeycomb plots of the same data. Please see enclone plots.

6/30/21: Add cvar cigar that is the CIGAR string of the V..J contig sequence versus the universal reference. See enclone help cvars.

6/24/21: Now enclone UPDATE updates to the latest version of enclone.

6/17/21: Add SIM_MAT_PLOT, to display the pairwise cosine similarity between a list of variables, see enclone plots.

6/4/21: Tweak the order of default filters so as to increase the likelihood that clonotypes that are held together tenuously (or not at all) will be pulled apart. This has a tiny effect on our datasets (changing about 1 in 10^5 clonotypes), but could affect some user datasets disproportionally.

6/3/21: Fix bug in the computation of datasets_cell.

6/2/21:

5/6/21: Clonotype grouping has been completely reworked. Please see enclone help display. The pre-existing options are still present but now undocumented, and their behavior might have changed slightly. We have not optimized the new options for run time, so let us know if there's a problem and we'll see what we can do.

5/4/21: Several changes that involve D gene assignment, and which allow for the VDDJ case:

Additionally, the main page on this site now provides links to previous releases of enclone that match particular releases of Cell Ranger. And there is a new filter MAX_EXACTS.

4/8/21:

4/5/21: Add page enclone default filters that describes the order of the filters that enclone uses, and provides technical details about some of the filters, which was previously on the heuristics page.

4/2/21:

3/23/21:

3/17/21:

3/16/21:

3/4/21:

2/14/21: A VDJ reference file can now be tested for defects using a command of the form enclone REF=<fasta file name>.

2/13/21:

2/6/21: Fix the computations of median of a vector. For a vector x of length n, these were computed as x[n/2], which is incorrect in the case where x has an even number of elements: it should be the mean of the innermost two elements. However, in cases where the vector elements are integers, we now round this mean up to the nearest integer, yielding a "rounded median", as the advantage of presenting the exact median is outweighed by the loss of readability resulting from adding .5 or .0 to the display.

2/4/21: Add option SUPPRESS_ISOTYPE_LEGEND.

2/3/21: Add chain variable cdr3_aa_conx that is the consensus of the CDR3 amino acid sequences across the clonotype, showing X for each variant residue, and cdr3_aa_conp, which instead of X shows an amino acid property symbol. See enclone help cvars. We thank Albert Vilella for suggesting these features.


1/19/21: Add lead variables count_cdr1_<regex> and similar, see enclone help lvars.

1/14/21: Add chain variables corresponding to extended or trimmed versions of CDR sequences, and North versions. Please see enclone help cvars.

1/13/21:

1/9/21: Following a suggestion of Ganesh Phad, single-cell clonotypes in honeycomb plots are now localized by color.

1/7/21:

1/4/21:


12/14/20: Add option METAX that allows meta information to be provided on the command line, rather than through a separate file.

12/9/20:

12/5/20:

12/2/20: CDR3 sequences are now recomputed by enclone, rather than taken from the value in all_contig_annotations.json. This will result in changed values at a very low frequency, and only if the CDR3 calculation algorithm was changed between the time of the relevant cellranger release and the time of the enclone release.


11/24/20:

11/20/20:

11/19/20:

11/4/20:


10/20/20: If a variable is of the form abbr:name, and POUT is set to a file name, then the field label that is shown is now abbr, rather than abbr:name. This is really a bug fix.

10/9/20: Add parseable variables for the start and frame of the D region.


9/17/20: Clarify documentation for the case where GEX=(antibody data) and fix bug in handling of n_gex for that.

9/16/20:

9/4/20: Add the capability of seeing which default filters would have been applied, if the filters are turned off. See the lead variable "filter" at enclone help lvars.

9/2/20:

9/1/20:

8/31/20: add FWR4 support

8/28/20: add new filters CONST_IGH and CONST_IGKL that allow filtering of cells by isotype. Please see enclone help special.

8/27/20:

  1. Fix bugs in reporting of insertions in the notes field.
  2. Now CDR1, CDR2, FWR1, FWR2 and FWR3 can be can be computed and displayed. Please see enclone help amino and enclone help cvars for how to specify these, and this page for how we calculate these features.

8/24/20:

  1. Add lead variable dref_aa, the number of amino acid differences with the donor reference.
  2. Add estimated doublet rate to summary stats.
  3. We are deprecating the lead variables sec and mem that measure UMIs supporting secreted versus membrane-bound antibody transcripts. Because the exon boundaries that determine this are far from the 5' end, there is rarely enough signal to be informative. We removed the documentation.

8/20/20:

  1. Allow "Levenshtein distance patterns" for CDR3 filter specification. For more information, type or click here: enclone help filter.
  2. Change what the weak onesies filter does: unless NWEAK_ONESIES is used, now, rather than delete certain dubious onesie clonotypes, such clonotypes are disintegrated into single cell clonotypes.
  3. Change what the improper filter does: unless NIMPROPER is used, now, clonotypes having more than one chain but all of the same type are deleted.
  4. Add a new filter that prevents merges of onesie clonotypes into other clonotypes; this filter may be turned off with a new argument NMERGE_ONESIES. This behavior is mostly (but not completely) a splitting out of previous functionality so as to simplify and clarify.
Importantly, the last three changes have the effect of increasing the number of cells in clonotypes; the extra cells are mostly in onesie clonotypes. Precise definitions may be found by typing (or clicking here): enclone help special.

8/17/20: deprecate KEEP_IMPROPER; the new name is NIMPROPER.

8/10/20: tweak the definition of the weak onesies filter so that it does not delete single-cell clonotypes. Superceded by subsequent change on 8/18/20.

8/7/20: add the ability to analyze alternate splicing using GEX data to characterize UMIs as secreted or membrane, and display this information using lvars "sec" and "mem". See enclone help lvars for limitations. This is highly experimental.

8/5/20:

  1. A reference file is now required as part of the Cell Ranger outs directory, if Cell Ranger version 4.0 or greater was used.
  2. Deprecate MAX_SCORE and replace by MAX_LOG_SCORE, the base 10 logarithm of it.
  3. Add cvar cdr3_len
  4. Add and document knobs that allow nearly all clonotype join filtering to be turned off. Please see "enclone help how" and the end of "enclone help faq".


6/24/20:

  1. Add support for iNKT and MAIT cells.
  2. Make enclone faster. This is most noticeable in cases where many GEX datasets are provided as input.

6/19/20:

  1. We now use a data hierarchy of donor (top), origin, dataset (bottom), where an origin is a set of 1 or more datasets from the same source (tube of cells, tissue, timepoint, etc.). (This breaks previous invocations of META.)
  2. Improve the alternate (faster) internal storage structure for the GEX matrix created using the option NH5 as described using the command enclone help input. This will speed things up, particularly for the case where several datasets are combined. If you have already used the NH5 option for a given dataset, then the next time you run enclone on it, the file will be automatically rewritten. This would also apply to some datasets obtained as part of the large download.

6/17/20:

  1. Now TREE=const can be used to show a tree with heavy chain constant region names attached to the leaves.
  2. SEG and SEGN are now cumulative, so that multiple instances may be used to progressively filter.
  3. The clonotype joining heuristic parameters MAX_SCORE and MAX_CDR3_DIFFS are now accessible.

6/10/20:

  1. Add a complex of features for generating phylogenetic trees from clonotypes, see here.
  2. New "single button" installation procedure.
  3. Change the default value for PRE to ~/enclone/datasets,~/enclone/datasets2.
  4. Add argument NALL to turn off all filters.
  5. Add new lead variables nd<k> that display the number of cells in the top datasets for a given clonotype.
  6. Add new lead variable dref that shows the distance of V..J from the reference outside the region of recombination.
  7. Add argument COMPLETE to remove exact subclonotypes that do not have all chains.
  8. Test for consistency between VDJ and GEX barcodes, and exit if this is not the case.
  9. Add option COLOR=property to color amino acids by their properties.
  10. Add option FCELL to allow filtering by cells.


5/29/20: Add new "UMI ratio" filter that further reduces noise in certain cases. This can be turned off using the argument NUMI_RATIO.

5/22/20: Add major new "UMI" filter that greatly reduces noise in certain cases. This can be turned off using the argument NUMI.

5/12/20: Add PLOT_BY_ISOTYPE to generate honeycomb plots colored by isotype.

5/1/20: Change the definition of the fields "edit" and "comp" to be based on alignment from the beginning of the CDR3 up to the end of the J, rather than stopping at the end of the CDR3. The intention is to capture the full region of recombination, which may not have been done before.


4/30/20: First release.