DOGMA is a program for fast and easy quality assessment of transcriptome and proteome data based on conserved protein domains. The source code can be obtained at https://zivgitlab.uni-muenster.de/domain-world/DOGMA and a webserver to run your analysis directly in the browser without installation is available here.
The figure shows schematically the method of DOGMA for example proteome data. The output of DOGMA gives you a completeness score measuring the quality of your proteome or transcriptome data from 0 - 100%.
On this website you can find additional data needed for evaluating your proteome/transcriptome. You will need the core sets (unless you want to create your own) and eventually our fast domain annotation tool RADIANT instead of PfamScan.
DOGMA is written in python and runs with any version from 2.7 on (including python 3). For help and instructions on how to use DOGMA please check the UserManual.
To run DOGMA you need a core set with conserved protein domains you can compare your proteome/transcriptome data to. We provide several precomputed core sets for different clades that can be downloaded here for the newest DOGMA version:
|core sets||size (unzipped)||pfam version||comment|
|pfam35.tbz||96 MB (117 MB)||pfam v35||Contains 11 core sets for pfam version 35 and has to be unpacked into the DOGMA folder.|
|pfam34.tbz||91 MB (111 MB)||pfam v34||Contains 11 core sets for pfam version 34 and has to be unpacked into the DOGMA folder.|
|pfam33.1.tbz||90 MB (109 MB)||pfam v33.1||Contains 11 core sets for pfam version 33.1 and has to be unpacked into the DOGMA folder.|
|pfam32.tbz||91 MB (108 MB)||pfam v32||Contains 11 core sets for pfam version 32 and has to be unpacked into the DOGMA folder.|
|pfam31.tbz||88 MB (104 MB)||pfam v31||Contains 11 core sets for pfam version 31 and has to be unpacked into the DOGMA folder.|
For core sets of older DOGMA and pfam versions please check here. Currently core sets for the following clades are included: eukaryotes, archaea, bacteria, arthropods, insects, vertebrates, mammals, fungi, plants, monocots and eudicots. Alternatively, you can create your own core sets. For more information about this or for information about the included species in the precomputed core sets please check the UserManual.
If you find a problem, have questions or any kind of comment please contact us (domainworld[@]uni-muenster.de).
If you used DOGMA in your project please cite our publication:
Elias Dohmen, Lukas P.M. Kremer, Erich Bornberg-Bauer, and Carsten Kemena, DOGMA: domain-based transcriptome and proteome quality assessment, Bioinformatics (2016) 32 (17): 2577-2581. doi:10.1093/bioinformatics/btw231 http://bioinformatics.oxfordjournals.org/content/32/17/2577
Carsten Kemena, Elias Dohmen, Erich Bornberg-Bauer, DOGMA: a web server for proteome and transcriptome quality assessment, Nucleic Acids Research, 2019, doi:10.1093/nar/gkz366 https://academic.oup.com/nar/advance-article/doi/10.1093/nar/gkz366/5488015