Welcome to the HLi Lab

The general theme of our research is to tackle biological problems with advanced computational and statistical methods. We develop sophisticated algorithms for sequence alignment (e.g. bwa, minimap2 and miniprot), sequence assembly (e.g. miniasm and hifiasm), variant calling (e.g. longcallR and longcallD) and pangenome analysis (e.g. minigraph and pangene). We design widely used formats (e.g. SAM and GFA) and also work on full-text indexing (e.g. ropebwt3) and population genetics (e.g. PSMC). Some of our tools are essential to the applications of high-throughput sequence data and among the most widely used in the field of bioinformatics. Please see the Software page for more tools we developed.

We are part of the department of Biomedical Informatics of Harvard Medical School and the department of Data Science of Dana-Farber Cancer Institute.