Sed: an editor fasta using labels of phylogeny and associated traces

International Journal of Development Research

Volume:

Article ID:

17335

4 pages

Research Article

Marta Barreiros and Élcio Leal

Received 08th August, 2019; Received in revised form 11th September, 2019; Accepted 16th October, 2019; Published online 20th November, 2019

Copyright © 2019, Marta de Oliveira Barreiros and Élcio Souza Leal. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract:

Phylogenies are used to treatvarious biological functions, such as the relationships between tree topology and traits of observed species. Currently, tools are available that allow viewing and editing on phylogenetic trees, as well as creating and editing alignments. However, there is a need for a simplified tool that automatically selects sequences from an alignment based on phylogenetic information. Here we created a tool called SED (data editing software), which uses phylogeny-based files to perform statistical summaries and generate subsets of alignments. SED automatically generates an analysis of previously marked data in phylogeny clades, defined by the user from sequencesets. Its input is a tree file (Nexus format), an alignment (Fasta format) and numeric data in a spreadsheet (CSV format) for statistics. Using labels marked a priori on the tree file, SED files will be generated (Fasta format) containing selected labels, it also compares the traits (CSV file) of the selected sequences with those of the entire alignment. Efficiently, SED was tested by analyzing 261 samples from the vif gene of HIV-1 subtype B with TG mutation detection (tryptophan) in eight samples (8/261 ~ 3.06). SED is a tool used to generate subset sequences from a Fasta file and generate analyzes with tagged taxa.

Download PDF:

17335.pdf