10.18419/darus-802
Orlando, Marco0000-0002-5914-3052(University of Milano Bicocca)
GraphML files for protein sequence networks of glycoside hydrolase 19 homologues
DaRUS
2020
doi:10.18419/darus-802/1doi:10.18419/darus-802/2doi:10.18419/darus-802/3
GraphML files for undirected weighted graphs with nodes that represent protein sequences of glycoside hydrolase 19 homologues. Protein sequences were clustered by a threshold of 90% sequence identity to derive representative sequences. Pairwise sequence identity between two sequences was derived from global Needleman-Wunsch alignment. Protein sequence networks were generated with edge weights of pairwise sequence identity, filtered by a predefined threshold. Metadata of the nodes (e.g. annotations) and of the edges (the edge weights) were summarized in GraphML files.
Pleiss, Jürgen(Universität Stuttgart)