Assignment 2 in "Software Tools in Bioinformatics" at the University of Guelph
- primary script by Noah Zeidenberg
- major edits by Sameh Mohamed include:
  - function for performing quality checks, data cleaning, and filtering
  - function for calculating k-mer frequencies
  - density plot of the probabilities of 1-, 2-, and 3-mer frequencies
- minor edits include:
  - formatting fixes
  - filtering out unusually long sequences (treated as outliers)
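
For readers unfamiliar with the term, a k-mer frequency is the share of all overlapping length-k substrings of a sequence that match a given k-mer. The sketch below illustrates the idea only; it is not the assignment's actual code, and the function name `kmer_frequencies`, its parameters, and the choice to skip windows containing non-ACGT characters are all assumptions made for this example.

```python
from collections import Counter


def kmer_frequencies(sequence, k):
    """Return the relative frequency of each overlapping k-mer in a DNA sequence.

    Hypothetical helper for illustration: windows containing characters
    other than A, C, G, or T (for example N) are skipped.
    """
    seq = sequence.upper()
    valid = set("ACGT")
    # Slide a window of width k across the sequence, one base at a time.
    counts = Counter(
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if set(seq[i:i + k]) <= valid
    )
    total = sum(counts.values())
    # Convert raw counts to proportions; return an empty dict if nothing was counted.
    return {kmer: n / total for kmer, n in counts.items()} if total else {}


if __name__ == "__main__":
    example = "ACGTACGTNN"  # made-up sequence, not data from the assignment
    for k in (1, 2, 3):
        print(k, kmer_frequencies(example, k))
```

Running the function for k = 1, 2, and 3 gives the three sets of frequencies that a density plot like the one listed above could be drawn from.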