Sed: an editor fasta using labels of phylogeny and associated traces

International Journal of Development Research

Volume: 
09
Article ID: 
17335
4 pages
Research Article

Sed: an editor fasta using labels of phylogeny and associated traces

Marta Barreiros and Élcio Leal

Abstract: 

Phylogenies are used to treatvarious biological functions, such as the relationships between tree topology and traits of observed species. Currently, tools are available that allow viewing and editing on phylogenetic trees, as well as creating and editing alignments. However, there is a need for a simplified tool that automatically selects sequences from an alignment based on phylogenetic information. Here we created a tool called SED (data editing software), which uses phylogeny-based files to perform statistical summaries and generate subsets of alignments. SED automatically generates an analysis of previously marked data in phylogeny clades, defined by the user from sequencesets. Its input is a tree file (Nexus format), an alignment (Fasta format) and numeric data in a spreadsheet (CSV format) for statistics. Using labels marked a priori on the tree file, SED files will be generated (Fasta format) containing selected labels, it also compares the traits (CSV file) of the selected sequences with those of the entire alignment. Efficiently, SED was tested by analyzing 261 samples from the vif gene of HIV-1 subtype B with TG mutation detection (tryptophan) in eight samples (8/261 ~ 3.06). SED is a tool used to generate subset sequences from a Fasta file and generate analyzes with tagged taxa.

Download PDF: