Just performing the length validation it becomes obvious whether data from a certain database have been curated or not.
The databases and species on which we perform our tests are:
- Solenopsis invicta & Pogonomyrmex barbatus
- Honey bee
- Nasonia vitripennis (wasp)
- Some common genes (e.g INS) for various species - Uniprot
- Brachypodium sylvaticum plant
Other databases I did't touch yet: for all kind of species [1] and plants [2].
[1] ftp://ftp.ensembl.org/pub/release-71/fasta/
[2] http://plantgdb.org/
0 comments:
Post a Comment