Amino acid dipeptide frequency for Rhodobacter veldkampii DSM 11550

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
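The counting described above can be illustrated with a minimal Python sketch. It is not the code used to build this table: the function names (`dipeptide_per_mille`, `classify`), the use of one-letter codes with `X` for Xaa, and the choice to count overlapping adjacent pairs within each protein are assumptions made for illustration.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (assumed mapping)

# Uniform expectation for 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Overlapping adjacent pairs are counted inside each protein only,
    so no pair spans two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything that is not a standard residue to X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_EXPECTATION:
        return "over"
    if freq_per_mille < UNIFORM_EXPECTATION:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["AAAA", "AC"])  # toy input, not real data
    print(freqs["AA"], freqs["AC"])              # 750.0 250.0
    print(classify(22.379), classify(1.208))     # over under
```

With the table values, AlaAla (22.379‰) would be labelled "over" and AlaCys (1.208‰) "under" by this rule.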

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
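As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def fold_over_uniform(value_per_mille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation 1/n_dipeptides."""
    return per_mille_to_fraction(value_per_mille) * n_dipeptides


if __name__ == "__main__":
    ala_ala = 22.379  # AlaAla, in per mille, from the table
    print(per_mille_to_fraction(ala_ala))  # about 0.0224, i.e. about 2.24%
    print(fold_over_uniform(ala_ala))      # about 8.95 times the uniform expectation
```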

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 22.379 ± 0.258
AlaCys: 1.208 ± 0.036
AlaAsp: 7.735 ± 0.1
AlaGlu: 9.526 ± 0.129
AlaPhe: 4.403 ± 0.085
AlaGly: 12.676 ± 0.156
AlaHis: 2.568 ± 0.055
AlaIle: 5.648 ± 0.079
AlaLys: 3.372 ± 0.072
AlaLeu: 16.344 ± 0.186
AlaMet: 3.949 ± 0.065
AlaAsn: 2.619 ± 0.054
AlaPro: 7.653 ± 0.112
AlaGln: 4.977 ± 0.075
AlaArg: 11.849 ± 0.151
AlaSer: 5.393 ± 0.081
AlaThr: 6.632 ± 0.087
AlaVal: 9.598 ± 0.13
AlaTrp: 1.515 ± 0.039
AlaTyr: 2.446 ± 0.061
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.148 ± 0.038
CysCys: 0.112 ± 0.012
CysAsp: 0.562 ± 0.022
CysGlu: 0.441 ± 0.021
CysPhe: 0.269 ± 0.019
CysGly: 0.924 ± 0.038
CysHis: 0.246 ± 0.019
CysIle: 0.399 ± 0.021
CysLys: 0.174 ± 0.013
CysLeu: 0.894 ± 0.032
CysMet: 0.159 ± 0.013
CysAsn: 0.191 ± 0.015
CysPro: 0.549 ± 0.027
CysGln: 0.229 ± 0.014
CysArg: 0.632 ± 0.027
CysSer: 0.382 ± 0.019
CysThr: 0.436 ± 0.024
CysVal: 0.52 ± 0.026
CysTrp: 0.124 ± 0.011
CysTyr: 0.175 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.11 ± 0.088
AspCys: 0.514 ± 0.028
AspAsp: 3.042 ± 0.079
AspGlu: 3.042 ± 0.071
AspPhe: 2.178 ± 0.051
AspGly: 4.787 ± 0.075
AspHis: 1.383 ± 0.036
AspIle: 2.694 ± 0.055
AspLys: 1.39 ± 0.04
AspLeu: 7.043 ± 0.103
AspMet: 1.496 ± 0.036
AspAsn: 1.033 ± 0.033
AspPro: 3.82 ± 0.063
AspGln: 1.81 ± 0.044
AspArg: 4.762 ± 0.069
AspSer: 1.805 ± 0.046
AspThr: 2.639 ± 0.052
AspVal: 3.614 ± 0.062
AspTrp: 1.334 ± 0.038
AspTyr: 1.469 ± 0.039
AspXaa: 0.0 ± 0.0
Glu
GluAla: 9.223 ± 0.123
GluCys: 0.365 ± 0.018
GluAsp: 2.958 ± 0.069
GluGlu: 2.823 ± 0.071
GluPhe: 1.745 ± 0.051
GluGly: 4.837 ± 0.074
GluHis: 0.936 ± 0.033
GluIle: 3.277 ± 0.066
GluLys: 1.777 ± 0.048
GluLeu: 4.597 ± 0.074
GluMet: 1.666 ± 0.043
GluAsn: 1.35 ± 0.036
GluPro: 2.612 ± 0.058
GluGln: 1.699 ± 0.046
GluArg: 4.353 ± 0.075
GluSer: 1.777 ± 0.046
GluThr: 3.718 ± 0.062
GluVal: 4.504 ± 0.062
GluTrp: 0.609 ± 0.025
GluTyr: 0.889 ± 0.032
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.79 ± 0.077
PheCys: 0.416 ± 0.022
PheAsp: 2.695 ± 0.052
PheGlu: 1.914 ± 0.052
PhePhe: 1.264 ± 0.043
PheGly: 3.636 ± 0.064
PheHis: 0.713 ± 0.028
PheIle: 1.347 ± 0.043
PheLys: 0.792 ± 0.035
PheLeu: 3.364 ± 0.071
PheMet: 0.728 ± 0.025
PheAsn: 0.911 ± 0.034
PhePro: 1.509 ± 0.043
PheGln: 0.892 ± 0.032
PheArg: 2.379 ± 0.054
PheSer: 1.855 ± 0.043
PheThr: 1.995 ± 0.043
PheVal: 2.535 ± 0.053
PheTrp: 0.546 ± 0.024
PheTyr: 0.811 ± 0.03
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.566 ± 0.152
GlyCys: 0.907 ± 0.035
GlyAsp: 4.368 ± 0.077
GlyGlu: 4.292 ± 0.072
GlyPhe: 3.664 ± 0.063
GlyGly: 7.527 ± 0.122
GlyHis: 2.021 ± 0.046
GlyIle: 4.148 ± 0.065
GlyLys: 2.867 ± 0.06
GlyLeu: 10.132 ± 0.131
GlyMet: 2.547 ± 0.051
GlyAsn: 1.849 ± 0.049
GlyPro: 4.077 ± 0.066
GlyGln: 3.346 ± 0.058
GlyArg: 6.952 ± 0.096
GlySer: 3.836 ± 0.057
GlyThr: 4.331 ± 0.081
GlyVal: 6.49 ± 0.085
GlyTrp: 1.734 ± 0.043
GlyTyr: 2.12 ± 0.05
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.41 ± 0.057
HisCys: 0.194 ± 0.015
HisAsp: 1.296 ± 0.038
HisGlu: 0.961 ± 0.036
HisPhe: 0.767 ± 0.028
HisGly: 1.866 ± 0.048
HisHis: 0.53 ± 0.025
HisIle: 0.885 ± 0.026
HisLys: 0.446 ± 0.023
HisLeu: 2.221 ± 0.057
HisMet: 0.469 ± 0.024
HisAsn: 0.415 ± 0.02
HisPro: 1.446 ± 0.038
HisGln: 0.581 ± 0.029
HisArg: 1.471 ± 0.041
HisSer: 0.853 ± 0.031
HisThr: 0.748 ± 0.03
HisVal: 1.493 ± 0.041
HisTrp: 0.387 ± 0.023
HisTyr: 0.549 ± 0.026
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.987 ± 0.099
IleCys: 0.503 ± 0.023
IleAsp: 3.272 ± 0.061
IleGlu: 3.253 ± 0.059
IlePhe: 1.577 ± 0.044
IleGly: 4.571 ± 0.078
IleHis: 0.867 ± 0.029
IleIle: 1.824 ± 0.046
IleLys: 1.092 ± 0.039
IleLeu: 4.406 ± 0.076
IleMet: 0.898 ± 0.036
IleAsn: 1.152 ± 0.038
IlePro: 2.122 ± 0.052
IleGln: 0.907 ± 0.028
IleArg: 3.403 ± 0.061
IleSer: 2.569 ± 0.055
IleThr: 2.714 ± 0.058
IleVal: 3.565 ± 0.056
IleTrp: 0.727 ± 0.024
IleTyr: 1.015 ± 0.032
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.734 ± 0.063
LysCys: 0.178 ± 0.014
LysAsp: 1.464 ± 0.047
LysGlu: 1.19 ± 0.037
LysPhe: 0.784 ± 0.034
LysGly: 2.459 ± 0.059
LysHis: 0.504 ± 0.024
LysIle: 1.414 ± 0.039
LysLys: 1.047 ± 0.04
LysLeu: 2.636 ± 0.055
LysMet: 0.764 ± 0.031
LysAsn: 0.633 ± 0.028
LysPro: 1.68 ± 0.052
LysGln: 0.727 ± 0.03
LysArg: 2.009 ± 0.047
LysSer: 1.513 ± 0.04
LysThr: 1.722 ± 0.049
LysVal: 2.094 ± 0.053
LysTrp: 0.326 ± 0.017
LysTyr: 0.558 ± 0.027
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 16.601 ± 0.202
LeuCys: 0.903 ± 0.032
LeuAsp: 5.802 ± 0.079
LeuGlu: 5.23 ± 0.079
LeuPhe: 3.323 ± 0.064
LeuGly: 9.015 ± 0.128
LeuHis: 1.996 ± 0.05
LeuIle: 5.244 ± 0.076
LeuLys: 2.923 ± 0.06
LeuLeu: 8.786 ± 0.131
LeuMet: 2.792 ± 0.055
LeuAsn: 2.393 ± 0.047
LeuPro: 6.355 ± 0.097
LeuGln: 2.393 ± 0.053
LeuArg: 8.322 ± 0.112
LeuSer: 5.778 ± 0.082
LeuThr: 6.195 ± 0.082
LeuVal: 7.076 ± 0.093
LeuTrp: 1.446 ± 0.047
LeuTyr: 1.896 ± 0.047
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.758 ± 0.065
MetCys: 0.148 ± 0.011
MetAsp: 1.198 ± 0.034
MetGlu: 1.209 ± 0.038
MetPhe: 0.727 ± 0.032
MetGly: 2.213 ± 0.046
MetHis: 0.404 ± 0.02
MetIle: 1.445 ± 0.04
MetLys: 0.908 ± 0.029
MetLeu: 2.51 ± 0.053
MetMet: 0.711 ± 0.028
MetAsn: 0.757 ± 0.025
MetPro: 1.517 ± 0.042
MetGln: 0.909 ± 0.031
MetArg: 1.957 ± 0.047
MetSer: 1.389 ± 0.038
MetThr: 2.043 ± 0.046
MetVal: 1.802 ± 0.043
MetTrp: 0.214 ± 0.014
MetTyr: 0.28 ± 0.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.8 ± 0.052
AsnCys: 0.237 ± 0.017
AsnAsp: 1.194 ± 0.04
AsnGlu: 0.942 ± 0.031
AsnPhe: 0.781 ± 0.032
AsnGly: 1.935 ± 0.048
AsnHis: 0.436 ± 0.021
AsnIle: 1.126 ± 0.032
AsnLys: 0.559 ± 0.027
AsnLeu: 2.327 ± 0.05
AsnMet: 0.499 ± 0.025
AsnAsn: 0.51 ± 0.024
AsnPro: 1.76 ± 0.042
AsnGln: 0.557 ± 0.026
AsnArg: 1.752 ± 0.042
AsnSer: 0.893 ± 0.036
AsnThr: 1.146 ± 0.035
AsnVal: 1.511 ± 0.04
AsnTrp: 0.386 ± 0.023
AsnTyr: 0.491 ± 0.024
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.737 ± 0.102
ProCys: 0.375 ± 0.019
ProAsp: 4.069 ± 0.068
ProGlu: 4.559 ± 0.066
ProPhe: 1.941 ± 0.04
ProGly: 5.38 ± 0.088
ProHis: 1.025 ± 0.035
ProIle: 2.124 ± 0.047
ProLys: 1.554 ± 0.042
ProLeu: 4.798 ± 0.084
ProMet: 1.363 ± 0.035
ProAsn: 1.057 ± 0.034
ProPro: 2.724 ± 0.068
ProGln: 1.723 ± 0.049
ProArg: 3.422 ± 0.062
ProSer: 2.172 ± 0.046
ProThr: 2.37 ± 0.053
ProVal: 4.717 ± 0.075
ProTrp: 0.741 ± 0.028
ProTyr: 1.082 ± 0.033
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.499 ± 0.076
GlnCys: 0.174 ± 0.015
GlnAsp: 1.411 ± 0.034
GlnGlu: 1.265 ± 0.037
GlnPhe: 0.998 ± 0.03
GlnGly: 2.767 ± 0.062
GlnHis: 0.536 ± 0.024
GlnIle: 1.966 ± 0.05
GlnLys: 0.89 ± 0.032
GlnLeu: 2.604 ± 0.05
GlnMet: 1.022 ± 0.031
GlnAsn: 0.705 ± 0.031
GlnPro: 1.765 ± 0.041
GlnGln: 0.894 ± 0.036
GlnArg: 2.291 ± 0.052
GlnSer: 1.484 ± 0.042
GlnThr: 1.651 ± 0.048
GlnVal: 2.451 ± 0.054
GlnTrp: 0.365 ± 0.022
GlnTyr: 0.475 ± 0.02
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.845 ± 0.118
ArgCys: 0.503 ± 0.026
ArgAsp: 4.363 ± 0.064
ArgGlu: 3.831 ± 0.073
ArgPhe: 3.014 ± 0.058
ArgGly: 5.516 ± 0.079
ArgHis: 1.769 ± 0.042
ArgIle: 4.064 ± 0.077
ArgLys: 2.192 ± 0.055
ArgLeu: 9.274 ± 0.132
ArgMet: 2.058 ± 0.051
ArgAsn: 1.627 ± 0.045
ArgPro: 4.247 ± 0.078
ArgGln: 2.438 ± 0.049
ArgArg: 6.131 ± 0.102
ArgSer: 3.187 ± 0.056
ArgThr: 2.999 ± 0.055
ArgVal: 5.277 ± 0.079
ArgTrp: 1.115 ± 0.036
ArgTyr: 1.552 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.669 ± 0.075
SerCys: 0.411 ± 0.023
SerAsp: 2.736 ± 0.053
SerGlu: 2.285 ± 0.055
SerPhe: 1.934 ± 0.051
SerGly: 4.829 ± 0.074
SerHis: 0.952 ± 0.033
SerIle: 1.936 ± 0.047
SerLys: 1.156 ± 0.034
SerLeu: 4.646 ± 0.072
SerMet: 1.043 ± 0.032
SerAsn: 0.968 ± 0.034
SerPro: 2.375 ± 0.046
SerGln: 1.278 ± 0.037
SerArg: 3.083 ± 0.067
SerSer: 2.034 ± 0.05
SerThr: 2.097 ± 0.051
SerVal: 3.353 ± 0.058
SerTrp: 0.598 ± 0.025
SerTyr: 1.077 ± 0.035
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.691 ± 0.082
ThrCys: 0.475 ± 0.025
ThrAsp: 2.949 ± 0.053
ThrGlu: 2.994 ± 0.057
ThrPhe: 1.741 ± 0.041
ThrGly: 5.597 ± 0.082
ThrHis: 1.046 ± 0.032
ThrIle: 2.5 ± 0.052
ThrLys: 1.292 ± 0.043
ThrLeu: 5.923 ± 0.072
ThrMet: 1.101 ± 0.04
ThrAsn: 1.104 ± 0.039
ThrPro: 3.384 ± 0.05
ThrGln: 1.401 ± 0.04
ThrArg: 3.738 ± 0.065
ThrSer: 2.173 ± 0.054
ThrThr: 2.694 ± 0.056
ThrVal: 3.948 ± 0.072
ThrTrp: 0.64 ± 0.026
ThrTyr: 1.033 ± 0.038
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.278 ± 0.128
ValCys: 0.611 ± 0.024
ValAsp: 3.661 ± 0.072
ValGlu: 4.294 ± 0.075
ValPhe: 2.682 ± 0.057
ValGly: 5.213 ± 0.079
ValHis: 1.256 ± 0.039
ValIle: 3.955 ± 0.076
ValLys: 2.043 ± 0.056
ValLeu: 7.973 ± 0.106
ValMet: 2.051 ± 0.051
ValAsn: 1.721 ± 0.046
ValPro: 3.639 ± 0.064
ValGln: 2.228 ± 0.051
ValArg: 4.606 ± 0.084
ValSer: 3.664 ± 0.073
ValThr: 4.646 ± 0.074
ValVal: 5.776 ± 0.093
ValTrp: 1.013 ± 0.031
ValTyr: 1.38 ± 0.04
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.723 ± 0.044
TrpCys: 0.135 ± 0.011
TrpAsp: 0.743 ± 0.027
TrpGlu: 0.598 ± 0.027
TrpPhe: 0.521 ± 0.023
TrpGly: 1.101 ± 0.036
TrpHis: 0.355 ± 0.02
TrpIle: 0.604 ± 0.031
TrpLys: 0.421 ± 0.023
TrpLeu: 1.797 ± 0.046
TrpMet: 0.365 ± 0.021
TrpAsn: 0.328 ± 0.018
TrpPro: 0.749 ± 0.032
TrpGln: 0.633 ± 0.026
TrpArg: 1.327 ± 0.038
TrpSer: 0.749 ± 0.028
TrpThr: 0.667 ± 0.027
TrpVal: 1.052 ± 0.028
TrpTrp: 0.241 ± 0.018
TrpTyr: 0.266 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.542 ± 0.056
TyrCys: 0.197 ± 0.013
TyrAsp: 1.446 ± 0.035
TyrGlu: 1.064 ± 0.039
TyrPhe: 0.778 ± 0.03
TyrGly: 1.925 ± 0.044
TyrHis: 0.444 ± 0.02
TyrIle: 0.822 ± 0.033
TyrLys: 0.541 ± 0.025
TyrLeu: 2.194 ± 0.047
TyrMet: 0.417 ± 0.02
TyrAsn: 0.518 ± 0.023
TyrPro: 0.954 ± 0.032
TyrGln: 0.592 ± 0.028
TyrArg: 1.564 ± 0.041
TyrSer: 0.958 ± 0.036
TyrThr: 0.983 ± 0.032
TyrVal: 1.315 ± 0.04
TyrTrp: 0.313 ± 0.022
TyrTyr: 0.483 ± 0.025
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3115 proteins (961833 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
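For readers unfamiliar with the procedure, a protein-level bootstrap can be sketched as follows. This is an illustration only, not the database's own code: the function name `bootstrap_error`, the fixed seed, and the choice to report the standard deviation of the replicate estimates as the error are assumptions.

```python
import random
import statistics


def count_dipeptide(seq, dipeptide):
    """Count overlapping occurrences of a two-letter dipeptide in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == dipeptide)


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error (in per mille) of a dipeptide frequency.

    Whole proteins are resampled with replacement, so the resampling unit
    is the protein, not the individual residue or dipeptide.
    """
    rng = random.Random(seed)
    # Precompute (hits, number of dipeptide positions) for every protein.
    per_protein = [(count_dipeptide(s, dipeptide), max(len(s) - 1, 0))
                   for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        if total > 0:
            estimates.append(1000.0 * hits / total)
    if len(estimates) < 2:
        return 0.0
    # Assumption: the spread of the replicates is reported as the error.
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["AAAA", "CCCC", "ACAC", "AACC"]  # toy sequences, not real data
    print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than single dipeptides keeps the dependence between neighbouring residues within a protein intact in every replicate.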

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski