- Remove reads containing adapters using Cutadapt
- Remove reads containing N > 10% (N represents base that could not be determined)
- Remove reads where the Qscore (Quality value) of over 90% bases of the read is <= 10
3. Align reads to a reference using STAR
4. Count the number of reads assigned to each contig/gene using featureCounts (a part of RSubread package)
- Extract counts and store in a matrix
- Create column metadata (sampleinfo) table