> For the complete documentation index, see [llms.txt](https://edu.abi.am/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://edu.abi.am/bioinformatics-algorithms/read-alignment-exact-matching.md).

# Read alignment: exact matching

Read alignment, also known as sequence alignment, refers to the process of mapping or aligning short sequence reads obtained from high-throughput sequencing technologies to a reference genome or transcriptome. The goal is to determine where each read likely originated from in the reference sequence. This information is important for many downstream applications, such as variant calling, i.e. identifying differences between the reads and the reference genome,  and transcriptome analysis, i.e. identifying which genes are being transcribed and at what levels.

There are numerous algorithms designed for aligning reads to a genome. This section introduces fundamental exact matching algorithms, which specifically locate exact occurrences of a given string of characters within a larger body of text or identify patterns in text.

### Read alignment and why it's hard

{% embed url="<https://www.youtube.com/watch?v=PMGstYcBgTY&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=15>" %}

### Naive exact matching

{% embed url="<https://www.youtube.com/watch?v=KUbsdGm3G7s&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=16>" %}

### Practical: Matching artificial reads

{% embed url="<https://www.youtube.com/watch?v=ep91JWd6fs0&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=17>" %}

### Practical: Matching real reads

{% embed url="<https://www.youtube.com/watch?v=SFYpw87lHWQ&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=18>" %} <br>
{% endembed %}

### Boyer-Moore basics

{% embed url="<https://www.youtube.com/watch?v=4Xyhb72LCX4&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=19>" %}

### Boyer-Moore: putting it all together

{% embed url="<https://www.youtube.com/watch?v=Wj606N0IAsw&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=20>" %}

### Diversion: Repetitive elements

{% embed url="<https://www.youtube.com/watch?v=ZwHCRb_y7vA&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=21>" %}

### Practical: Implementing Boyer-Moore

{% embed url="<https://www.youtube.com/watch?v=CT1lQN73UMs&list=PL2mpR0RYFQsBiCWVJSvVAO3OJ2t7DzoHA&index=22>" %}

## Problems to solve

Try to solve these problems after completing the section.\
Some of them may be challenging :)

1. [Finding a Motif in DNA](https://rosalind.info/problems/subs/)
2. [Open Reading Frames](https://rosalind.info/problems/orf/)
3. [RNA Splicing](https://rosalind.info/problems/splc/)

## Congratulations !

If you made it here, then congratulations! You have successfully completed this section. Move to the next portion of the guide with the arrow buttons below.


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://edu.abi.am/bioinformatics-algorithms/read-alignment-exact-matching.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
