In DNA sequencing, a read refers to the DNA sequence from one fragment, which is a small section of DNA). In next-generation sequencing (NGS), the genome is fragmented prior to sequencing, and each sequenced fragment produces a read. The length of the read and how many are produced will depend on fragment size and the type of technology being used. After sequencing, the regions of overlap between reads are used to assemble and align the reads to a reference genome, reconstructing the full DNA sequence. The length of the read is an important factor in sequencing, as it determines the amount of information that can be obtained from a single sequencing run. Longer reads allow for more sequence overlap, making them useful for de novo assembly and resolving repetitive areas of the genome with greater confidence. Shorter reads are sufficient and more cost-effective than longer ones for some applications, such as expression profiling or counting studies.
Some key terms related to sequencing reads include:
-
Read depth: The number of times a given nucleotide in a DNA sequence is read during sequencing. Higher read depth can increase the accuracy of the sequencing results.
-
Read length: The number of base pairs (bp) sequenced from a DNA fragment.
-
Paired-end sequencing: A sequencing method where, after a DNA fragment is read from one end, the process starts again in the other direction. This method produces twice the number of sequencing reads and enables more accurate read alignment and detection of structural rearrangements.
-
Single-read sequencing: A sequencing method where DNA fragments are sequenced from one end to the other. This method is useful for some applications, such as small RNA sequencing, and can be a fast and economical option.
-
Long-read sequencing: A next-generation sequencing method that produces very long reads, up to 10,000 bp. Long reads are capable of resolving the ordering of repeat regions, although they have a high error rate).
In summary, a read in sequencing refers to the DNA sequence from one fragment, which is a small section of DNA. The length of the read and how many are produced depend on fragment size and the type of technology being used. After sequencing, the regions of overlap between reads are used to assemble and align the reads to a reference genome, reconstructing the full DNA sequence. The length of the read is an important factor in sequencing, as it determines the amount of information that can be obtained from a single sequencing run.