11 to 15 of 15 Results
Unknown - 20.3 KB -
MD5: 1961bf93199d823fa0fb953e5e120100
FASTA headers comprise (from left to right): number of the seed sequence; sequence identifier (sid) in the GH19ED database; protein identifier (pid) in the GH19ED database; Uniprot or NCBI accession. |
Jun 1, 2020
Orlando, Marco, 2020, "GraphML files for protein sequence networks of glycoside hydrolase 19 homologues", https://doi.org/10.18419/darus-802, DaRUS, V1
GraphML files for undirected weighted graphs with nodes that represent protein sequences of glycoside hydrolase 19 homologues. Protein sequences were clustered by a threshold of 90% sequence identity to derive representative sequences. Pairwise sequence identity between two seque... |
GraphML Network Data - 14.3 MB -
MD5: a314980fa659092d23cfda60fb45929a
Protein sequence network for the chitinase domains from the Glycoside Hydrolase 19 Engineering Database.
The GraphML file contains representative nodes (clustered by 90% in USEARCH) connected by at least 60% pairwise sequence identity (edge weights derived from Needleman-Wunsch... |
GraphML Network Data - 5.6 MB -
MD5: f255b9e12d219a65533643ae3b529282
Protein sequence network for the endolysin domains from the Glycoside Hydrolase 19 Engineering Database.
The GraphML file contains representative nodes (clustered by 90% in USEARCH) connected by at least 60% pairwise sequence identity (edge weights derived from Needleman-Wunsch... |
GraphML Network Data - 72.1 MB -
MD5: 1ef54be08ab8b4dbe1f5b4a0407fac75
Protein sequence network for representative GH19 domains (corresponding to Pfam’s GH19 profile HMM: PF00182) from the Glycoside Hydrolase 19 Engineering Database. The GraphML file contains representative nodes (clustered by 90% in USEARCH) connected by at least 40% pairwise seque... |