Method |
Small RNA high throughput sequencing data and degradome sequencing data was retrieved using NCBI GEO database and NCBI SRA database.
For small RNA sequencing data, each dataset was processed to fetch unique sequences with their respective cloning frequencies such that their cloning frequency should be greater than 9 and the length of sequences must fall between 14-100 nt.
Due to scarcity of processed data, degradome sequencing data was taken. Raw reads were processed by removing the adapter sequences from RNA data using Cutadapt v1.14 tool and only those sequences were acquired that had length between 14-100 nt, frequency greater than 100 and Phred score must be greater than 28.
tRNA information of specified plant species were generated using tRNAscan-SE 2.0 . Using tRNAscan-SE results and reference genome, precursor tRNAs were extracted 40bp upstream and downstream each for both negative (-) and positive (+) strand using in-house developed scripts. Negative (-) strands were converted to positive (+) strands. Since, tRNA nucleotidyltransferases (CCA-adding enzymes) are responsible for maturation of the functional 3' end of tRNAs so, 'CCA' was added to tRNA sequences procured from tRNAscan-SE to make mature tRNAs. BLAST database was created for combined sequences of tRNAs and precursors tRNAs for each species individually using NCBI blast-2.6.0.