I need a simple ANOVA analysis in R, with the data taken from a published article. An explanation should be provided along with the extracted data.
1. Find sequences closely related to the gag gene of the HIV genome in Homo sapiens using BLAST.
2. Select the top 20 sequences from the BLAST output for further analysis.
3. Construct a phylogenetic tree for the selected sequences using multiple sequence alignment to view the evolutionary pattern and the evolutionary distances.
4. Normalize the data and find the distribution that best fits it using the fitdistrplus package in R.
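
A minimal sketch of step 4 and the ANOVA part, assuming the evolutionary distances from the tree have already been exported as numbers and that each sequence can be assigned to a group. The values below are simulated placeholders, not data from any article. The column names (clade, dist, scaled), the three-group layout, the choice of dividing by the maximum as the normalization, and the list of candidate distributions are all assumptions made for illustration.

```r
# PLACEHOLDER DATA (assumption): simulated distances for three hypothetical
# groups. Replace with the values extracted from the published article.
set.seed(42)
distances <- data.frame(
  clade = factor(rep(c("A", "B", "C"), each = 10)),
  dist  = c(rlnorm(10, meanlog = -2.0, sdlog = 0.3),
            rlnorm(10, meanlog = -1.8, sdlog = 0.3),
            rlnorm(10, meanlog = -1.5, sdlog = 0.3))
)

# Normalization (assumption): scale by the maximum so values lie in (0, 1].
# This keeps every value strictly positive, which the lognormal, gamma and
# Weibull fits below require.
distances$scaled <- distances$dist / max(distances$dist)

# One-way ANOVA: does the mean scaled distance differ between groups?
fit_aov <- aov(scaled ~ clade, data = distances)
print(summary(fit_aov))

# Distribution fitting with fitdistrplus, skipped if the package is missing.
if (requireNamespace("fitdistrplus", quietly = TRUE)) {
  candidates <- c("norm", "lnorm", "gamma", "weibull")

  # Fit each candidate distribution by maximum likelihood (fitdist default).
  fits <- lapply(candidates, function(d) {
    fitdistrplus::fitdist(distances$scaled, d)
  })
  names(fits) <- candidates

  # Compare the fits by AIC; the lowest AIC is the preferred candidate.
  aic <- sapply(fits, function(f) f$aic)
  print(sort(aic))
  best <- names(which.min(aic))
  cat("Lowest AIC among the candidates:", best, "\n")
}
```

In the summary(fit_aov) table, a small p-value in the Pr(>F) column for clade suggests that at least one group mean differs from the others. AIC is only one way to rank the fits. fitdistrplus also provides gofstat() and plotting methods for fitted objects, which can be used to check the chosen distribution visually.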