Bios2mds: An R Package for Comparing Orthologous Protein Families by Metric Multidimensional Scaling
dc.contributor.ISNI | 0000 0001 1648 3631 (Abdi, H.) | |
dc.contributor.LCNA | 94060324 (Abdi) | |
dc.contributor.author | Pele, Julien | en_US |
dc.contributor.author | Becu, Jean-Michel | en_US |
dc.contributor.author | Abdi, Hervé | en_US |
dc.contributor.author | Chabbert, Marie | en_US |
dc.date.accessioned | 2013-10-04T19:40:43Z | |
dc.date.available | 2013-10-04T19:40:43Z | |
dc.date.created | 2012-06-15 | |
dc.description.abstract | Background: The distance matrix computed from multiple alignments of homologous sequences is widely used by distance-based phylogenetic methods to provide information on the evolution of protein families. This matrix can also be visualized in a low dimensional space by metric multidimensional scaling (MDS). Applied to protein families, MDS provides information complementary to the information derived from tree-based methods. Moreover, MDS gives a unique opportunity to compare orthologous sequence sets because it can add supplementary elements to a reference space.Results: The R package bios2mds (from BIOlogical Sequences to MultiDimensional Scaling) has been designed to analyze multiple sequence alignments by MDS. Bios2mds starts with a sequence alignment, builds a matrix of distances between the aligned sequences, and represents this matrix by MDS to visualize a sequence space. This package also offers the possibility of performing K-means clustering in the MDS derived sequence space. Most importantly, bios2mds includes a function that projects supplementary elements (a.k.a. " out of sample" elements) onto the space defined by reference or " active" elements. Orthologous sequence sets can thus be compared in a straightforward way. The data analysis and visualization tools have been specifically designed for an easy monitoring of the evolutionary drift of protein sub-families.Conclusions: The bios2mds package provides the tools for a complete integrated pipeline aimed at the MDS analysis of multiple sets of orthologous sequences in the R statistical environment. In addition, as the analysis can be carried out from user provided matrices, the projection function can be widely used on any kind of data. | en_US |
dc.identifier.bibliographicCitation | Pele, Julien, Jean-Michel Becu, Hervé Abdi, and Marie Chabbert. 2012. "Bios2mds: An R package for comparing orthologous protein families by metric multidimensional scaling." BMC Bioinformatics 13(133). | en_US |
dc.identifier.issn | 1471-2105 | en_US |
dc.identifier.issue | 1 | en_US |
dc.identifier.uri | http://hdl.handle.net/10735.1/2864 | |
dc.identifier.volume | 13 | en_US |
dc.relation.uri | http://dx.doi.org/10.1186/1471-2105-13-133 | en_US |
dc.rights | CC BY 2.0 (Attribution) © 2012 Pelé et al.; e BioMed Central Ltd. | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by/2.0/ | en_US |
dc.source.journal | BMC Bioinformatics | en_US |
dc.subject | Evolution | en_US |
dc.subject | Phylogeny | en_US |
dc.subject | Proteins | en_US |
dc.subject | R (Computer program language) | en_US |
dc.title | Bios2mds: An R Package for Comparing Orthologous Protein Families by Metric Multidimensional Scaling | en_US |
dc.type | Text | en_US |
dc.type.genre | article | en_US |
Files
Original bundle
1 - 1 of 1