Tcrklass: a new k-string–based algorithm for human and mouse tcr repertoire characterization

X Yang, D Liu, N Lv, F Zhao, F Liu, J Zou… - The Journal of …, 2015 - journals.aai.org
X Yang, D Liu, N Lv, F Zhao, F Liu, J Zou, Y Chen, X Xiao, J Wu, P Liu, J Gao, Y Hu, Y Shi…
The Journal of Immunology, 2015journals.aai.org
The next-generation sequencing technology has promoted the study on human TCR
repertoire, which is essential for the adaptive immunity. To decipher the complexity of TCR
repertoire, we developed an integrated pipeline, TCRklass, using K-string–based algorithm
that has significantly improved the accuracy and performance over existing tools. We tested
TCRklass using manually curated short read datasets in comparison with in silico datasets; it
showed higher precision and recall rates on CDR3 identification. We applied TCRklass on …
The next-generation sequencing technology has promoted the study on human TCR repertoire, which is essential for the adaptive immunity. To decipher the complexity of TCR repertoire, we developed an integrated pipeline, TCRklass, using K-string–based algorithm that has significantly improved the accuracy and performance over existing tools. We tested TCRklass using manually curated short read datasets in comparison with in silico datasets; it showed higher precision and recall rates on CDR3 identification. We applied TCRklass on large datasets of two human and three mouse TCR repertoires; it demonstrated higher reliability on CDR3 identification and much less biased V/J profiling, which are the two components contributing the diversity of the repertoire. Because of the sequencing cost, short paired-end reads generated by next-generation sequencing technology are and will remain the main source of data, and we believe that the TCRklass is a useful and reliable toolkit for TCR repertoire analysis.
journals.aai.org