# Read alignment: exact matching

Read alignment, also known as sequence alignment, refers to the process of mapping or aligning short sequence reads obtained from high-throughput sequencing technologies to a reference genome or transcriptome. The goal is to determine where each read likely originated from in the reference sequence. This information is important for many downstream applications, such as variant calling, i.e. identifying differences between the reads and the reference genome,  and transcriptome analysis, i.e. identifying which genes are being transcribed and at what levels.

There are numerous algorithms designed for aligning reads to a genome. This section introduces fundamental exact matching algorithms, which specifically locate exact occurrences of a given string of characters within a larger body of text or identify patterns in text.

### Read alignment and why it's hard

{% embed url="<https://www.youtube.com/watch?v=PMGstYcBgTY&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=15>" %}

### Naive exact matching

{% embed url="<https://www.youtube.com/watch?v=KUbsdGm3G7s&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=16>" %}

### Practical: Matching artificial reads

{% embed url="<https://www.youtube.com/watch?v=ep91JWd6fs0&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=17>" %}

### Practical: Matching real reads

{% embed url="<https://www.youtube.com/watch?v=SFYpw87lHWQ&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=18>" %} <br>
{% endembed %}

### Boyer-Moore basics

{% embed url="<https://www.youtube.com/watch?v=4Xyhb72LCX4&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=19>" %}

### Boyer-Moore: putting it all together

{% embed url="<https://www.youtube.com/watch?v=Wj606N0IAsw&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=20>" %}

### Diversion: Repetitive elements

{% embed url="<https://www.youtube.com/watch?v=ZwHCRb_y7vA&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=21>" %}

### Practical: Implementing Boyer-Moore

{% embed url="<https://www.youtube.com/watch?v=CT1lQN73UMs&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=22>" %}

## Problems to solve

Try to solve these problems after completing the section.\
Some of them may be challenging :)

1. [Finding a Motif in DNA](https://rosalind.info/problems/subs/)
2. [Open Reading Frames](https://rosalind.info/problems/orf/)
3. [RNA Splicing](https://rosalind.info/problems/splc/)

## Congratulations !

If you made it here, then congratulations! You have successfully completed this section. Move to the next portion of the guide with the arrow buttons below.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://edu.abi.am/bioinformatics-algorithms/read-alignment-exact-matching.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
