Read alignment: approximate matching

Reads frequently do not match the reference genome exactly, either because of natural genomic variation or because of sequencing errors. In these cases, an approximate matching algorithm can still find matches between the query and reference sequences by tolerating a limited number of differences, which allows the read alignment process to proceed. Approximate matching is also useful when aligning reads to the genome of a closely related organism, since the two genomes will have diverged to some degree.
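To make the idea concrete, here is a minimal sketch of approximate matching that tolerates a bounded number of substitutions (Hamming distance). The function names `hamming_distance` and `naive_approximate_match` and the `max_mismatches` parameter are illustrative assumptions, not names used by this guide. The sketch ignores insertions and deletions, which require edit distance instead.

```python
def hamming_distance(a, b):
    """Count positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("Hamming distance needs strings of equal length")
    return sum(x != y for x, y in zip(a, b))


def naive_approximate_match(read, reference, max_mismatches=2):
    """Return every offset where read matches reference with at most
    max_mismatches substitutions (hypothetical helper, brute force)."""
    hits = []
    # Slide the read across every possible start position in the reference.
    for offset in range(len(reference) - len(read) + 1):
        window = reference[offset:offset + len(read)]
        if hamming_distance(read, window) <= max_mismatches:
            hits.append(offset)
    return hits


if __name__ == "__main__":
    reference = "ACGTACGTTACG"
    read = "ACGA"  # differs from "ACGT" by one substitution
    print(naive_approximate_match(read, reference, max_mismatches=0))  # []
    print(naive_approximate_match(read, reference, max_mismatches=1))  # [0, 4]
```

An exact search finds no occurrence of this read, while allowing one mismatch recovers both places where it plausibly originated. The brute-force scan compares the read against every window, so it is only practical for small inputs.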

In this section you will learn how approximate matching algorithms work.

Approximate matching, Hamming and edit distance

Pigeonhole principle

Practical: Implementing the pigeonhole principle

Solving the edit distance problem

Using dynamic programming for edit distance

Practical: Implementing dynamic programming for edit distance

Edit distance for approximate matching

Problems to solve

Try to solve these problems after completing the section.

If these were too easy for you, try unlocking the following set of advanced problems.


If you made it here, then congratulations! You have successfully completed this section. Move to the next portion of the guide with the arrow buttons below.
