Skip to content

Read Splitting

Dorado performs read splitting automatically but it can be disabled with the --disable-read-splitting argument.

When a single input read contains multiple concatenated reads, Dorado basecaller will split the original input read into separate subreads. This operation is performed by default for both DNA and RNA.

Each subread has a new read id that is assigned by Dorado.

The following tags can be used to associate a subread to its parent:

Tag Description
pi:Z The parent read id that this subread was generated from.
sp:i Maps the start of the subread's signal data to the corresponding location in the parent read's signal data.
ns:i The number of samples corresponding to the subread after splitting.
ts:i The number samples trimmed from the start of subread's signal after splitting.