SlaPPy (Squiggle and sequence Plotter in Python) is a browser based GUI to map sequence to Squiggle out of a fast5 files created by sequencing with Oxford Nanopore Technology (ONT). It also provides the possibility to visualize the probability for basecalling of different bases at a certain time or sequence position.
git clone [email protected]:Fabianexe/SlaPPy
python3 setup.py install
starting SlaPPy:
slappy
Then copy the http-link into your desired browser.
First choose a fast5 file by entering the file path in the upper left window.
Then you can select manually a read out of the fast5 file.
It is also possible to filter reads using the read-ID.
Please choose the ONT Guppy basecall group for further investigation of the reads.
You can activate stack traces under Graph Options to display the probabilities for the different nucleotides. Also it is possible to normalize the value of current to 1. Negative raw values are converted to positive numbers.
There are two different view options of the squiggle and the related traces and called nucleotides: "signal scaled" and "base scaled".
In the signal scaled mode, the Squiggle can be seen unchanged with respect to duration.
In the base scaled mode, the distances between the nucleotides are normalized to the same length and the squiggle is either stretched or compressed.
In both modes, red vertical lines mark the positions where a certain base was called. The most likely base is written above the red line. Squiggle and sequence are shown from 5' to 3' end.
In both modes you can zoom in and get a closer look at certain areas by choosing the desired area manual. You can also, when Stack Traces is activated, display how likely it is to call which Base at which time. This shows the size of the area in the corresponding color, which can be seen in the legend on the right.
It is also possible by double clicking on a certain nucleotide in the legend to show single traces.
You can select Baseprobability to show a sequence logo representing the probability for each base on a certain position.
The sequence logo is displayed with as subsequence of 200 bases. Use the slider or search to determine the shown range.
There are three different options für displaying the sequence logo of base probabilities:
- "Up next" shows the mean probabilities for the bases between two called bases.
- "At call" shows the probabilites for the bases at the time when the most probable base is called.
You can search for a certain subsequence, mark the position, where the subsequence occured and apply, so the visible subsequence is adjusted in the screen.
Since only a part of the sequence can be displayed as sequence logo, the corresponding area should be selected in the sequence logo beforehand, so that the correct sub-sequence is displayed.
You can activate the help function in the lower left corner.