Amino acid dipeptide frequency for Desulfovibrio marinus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteome.
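As an illustration of what such a frequency means, here is a minimal sketch that counts overlapping adjacent pairs in a list of protein sequences and reports them in per mille. It assumes one-letter residue codes (the table on this page uses three-letter codes), and the function name `dipeptide_frequencies` and the toy sequences are made up for the example; this is not necessarily how the database computes its values.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return per-mille frequencies of ordered adjacent residue pairs.

    Assumption: pairs are counted with an overlapping window of width 2
    inside each protein, and never across protein boundaries.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome: "MAAL" gives MA, AA, AL; "MAL" gives MA, AL.
toy_proteome = ["MAAL", "MAL"]
freqs = dipeptide_frequencies(toy_proteome)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

By construction the frequencies of all observed pairs sum to 1000 ‰.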

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
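The arithmetic behind the expected value can be sketched as follows. The three observed values are taken from the table on this page; the `classify` helper is hypothetical and assumes a plain comparison against the uniform 2.5‰ baseline, since the exact colouring rule used by the page is not stated.

```python
# Number of ordered pairs for 20 standard amino acids, and with Xaa as a 21st.
n_pairs = 20 ** 2        # 400
n_pairs_xaa = 21 ** 2    # 441

# Uniform expectation for the 20-letter alphabet, in per mille.
expected_permille = 1000.0 / n_pairs   # 2.5 per mille = 0.25 %


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
observed = {"AlaAla": 12.393, "LeuLeu": 10.737, "CysCys": 0.247}
labels = {pair: classify(value) for pair, value in observed.items()}
print(labels)
```

A uniform baseline ignores the fact that single amino acids are themselves unevenly abundant, so part of the deviation simply reflects overall composition.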

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
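A short sketch of the unit conversion, using the AlaAla value from the table. The rough count at the end is only an estimate: it assumes that dipeptides are counted as overlapping pairs within each protein, so that the total number of dipeptides is the number of amino acids minus the number of proteins reported on this page.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction."""
    return value_permille * 1e-3


ala_ala_fraction = permille_to_fraction(12.393)   # about 0.0124
ala_ala_percent = ala_ala_fraction * 100.0        # about 1.24 %

# Proteome size as reported on this page.
n_proteins = 4594
n_residues = 1459414

# Assumption: each protein of length L contributes L - 1 overlapping pairs.
n_dipeptides = n_residues - n_proteins
approx_ala_ala_count = round(ala_ala_fraction * n_dipeptides)
print(ala_ala_fraction, ala_ala_percent, approx_ala_ala_count)
```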

For more information, see the sequence space article on Wikipedia.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 12.393 ± 0.144 | 1.393 ± 0.029 | 5.271 ± 0.069 | 6.993 ± 0.083 | 3.719 ± 0.056 | 8.85 ± 0.09 | 2.036 ± 0.034 | 4.957 ± 0.059 | 4.054 ± 0.071 | 11.163 ± 0.1 | 3.536 ± 0.052 | 2.594 ± 0.045 | 4.483 ± 0.066 | 3.53 ± 0.054 | 6.525 ± 0.068 | 5.54 ± 0.063 | 4.839 ± 0.068 | 8.162 ± 0.09 | 1.264 ± 0.032 | 2.461 ± 0.043 | 0.001 ± 0.001
Cys | 1.167 ± 0.028 | 0.247 ± 0.016 | 0.623 ± 0.021 | 0.63 ± 0.019 | 0.49 ± 0.019 | 1.274 ± 0.033 | 0.361 ± 0.026 | 0.665 ± 0.021 | 0.42 ± 0.016 | 1.22 ± 0.027 | 0.347 ± 0.017 | 0.352 ± 0.016 | 0.825 ± 0.026 | 0.341 ± 0.016 | 0.861 ± 0.022 | 0.727 ± 0.02 | 0.594 ± 0.023 | 0.874 ± 0.024 | 0.146 ± 0.009 | 0.299 ± 0.015 | 0.0 ± 0.0
Asp | 5.953 ± 0.076 | 0.6 ± 0.022 | 3.217 ± 0.067 | 4.217 ± 0.054 | 2.28 ± 0.041 | 4.444 ± 0.075 | 1.144 ± 0.028 | 3.532 ± 0.054 | 2.424 ± 0.046 | 5.593 ± 0.056 | 1.668 ± 0.035 | 1.584 ± 0.03 | 3.179 ± 0.05 | 1.62 ± 0.034 | 3.223 ± 0.051 | 2.934 ± 0.048 | 2.867 ± 0.048 | 4.18 ± 0.056 | 0.756 ± 0.022 | 1.607 ± 0.031 | 0.0 ± 0.0
Glu | 7.218 ± 0.085 | 0.66 ± 0.023 | 3.699 ± 0.054 | 5.147 ± 0.083 | 2.195 ± 0.037 | 4.251 ± 0.062 | 1.665 ± 0.033 | 3.646 ± 0.052 | 3.392 ± 0.051 | 7.056 ± 0.073 | 1.851 ± 0.038 | 2.267 ± 0.04 | 3.068 ± 0.062 | 2.774 ± 0.053 | 4.779 ± 0.063 | 3.815 ± 0.056 | 3.497 ± 0.048 | 4.355 ± 0.066 | 0.726 ± 0.021 | 1.792 ± 0.041 | 0.0 ± 0.0
Phe | 3.612 ± 0.051 | 0.537 ± 0.018 | 2.337 ± 0.037 | 2.343 ± 0.041 | 1.854 ± 0.041 | 2.995 ± 0.051 | 0.922 ± 0.025 | 2.067 ± 0.04 | 1.376 ± 0.036 | 4.162 ± 0.06 | 1.1 ± 0.029 | 1.129 ± 0.026 | 1.786 ± 0.04 | 1.135 ± 0.026 | 2.191 ± 0.042 | 2.591 ± 0.037 | 2.254 ± 0.039 | 2.728 ± 0.047 | 0.532 ± 0.021 | 1.185 ± 0.03 | 0.0 ± 0.0
Gly | 7.278 ± 0.083 | 1.147 ± 0.033 | 4.041 ± 0.074 | 5.14 ± 0.067 | 3.298 ± 0.05 | 6.344 ± 0.111 | 1.737 ± 0.037 | 4.154 ± 0.057 | 3.752 ± 0.06 | 8.388 ± 0.102 | 2.613 ± 0.047 | 2.181 ± 0.054 | 3.164 ± 0.044 | 2.481 ± 0.045 | 5.068 ± 0.063 | 4.563 ± 0.062 | 3.726 ± 0.059 | 6.385 ± 0.076 | 1.076 ± 0.027 | 2.489 ± 0.039 | 0.0 ± 0.0
His | 2.141 ± 0.04 | 0.321 ± 0.016 | 1.255 ± 0.031 | 1.416 ± 0.029 | 0.939 ± 0.024 | 2.006 ± 0.044 | 0.578 ± 0.022 | 1.122 ± 0.025 | 0.78 ± 0.024 | 2.008 ± 0.041 | 0.576 ± 0.019 | 0.595 ± 0.022 | 1.338 ± 0.032 | 0.614 ± 0.02 | 1.246 ± 0.032 | 1.135 ± 0.026 | 1.168 ± 0.029 | 1.513 ± 0.035 | 0.282 ± 0.014 | 0.652 ± 0.024 | 0.0 ± 0.0
Ile | 5.482 ± 0.066 | 0.665 ± 0.022 | 2.774 ± 0.042 | 3.249 ± 0.056 | 2.067 ± 0.042 | 3.873 ± 0.054 | 1.216 ± 0.028 | 2.944 ± 0.056 | 1.946 ± 0.04 | 5.386 ± 0.064 | 1.281 ± 0.031 | 1.664 ± 0.044 | 2.701 ± 0.048 | 1.512 ± 0.037 | 3.223 ± 0.044 | 3.038 ± 0.05 | 2.836 ± 0.051 | 4.013 ± 0.057 | 0.526 ± 0.02 | 1.341 ± 0.031 | 0.0 ± 0.0
Lys | 4.332 ± 0.069 | 0.327 ± 0.018 | 2.635 ± 0.044 | 3.042 ± 0.053 | 1.14 ± 0.034 | 2.995 ± 0.057 | 0.842 ± 0.025 | 2.043 ± 0.042 | 2.31 ± 0.055 | 3.661 ± 0.058 | 0.989 ± 0.027 | 1.436 ± 0.036 | 2.185 ± 0.042 | 1.457 ± 0.034 | 2.794 ± 0.046 | 2.238 ± 0.038 | 2.201 ± 0.036 | 2.641 ± 0.057 | 0.358 ± 0.017 | 1.02 ± 0.028 | 0.0 ± 0.0
Leu | 12.121 ± 0.105 | 1.338 ± 0.032 | 6.702 ± 0.06 | 7.082 ± 0.08 | 4.091 ± 0.063 | 8.477 ± 0.095 | 2.128 ± 0.042 | 4.435 ± 0.062 | 3.971 ± 0.062 | 10.737 ± 0.124 | 2.45 ± 0.044 | 2.802 ± 0.047 | 5.293 ± 0.073 | 3.175 ± 0.052 | 6.274 ± 0.077 | 6.415 ± 0.072 | 5.374 ± 0.066 | 7.532 ± 0.087 | 1.052 ± 0.029 | 2.525 ± 0.048 | 0.0 ± 0.0
Met | 3.24 ± 0.047 | 0.255 ± 0.013 | 1.923 ± 0.035 | 1.943 ± 0.041 | 0.82 ± 0.03 | 2.177 ± 0.044 | 0.576 ± 0.021 | 1.158 ± 0.028 | 1.071 ± 0.026 | 2.862 ± 0.045 | 0.586 ± 0.024 | 0.925 ± 0.025 | 1.549 ± 0.036 | 1.052 ± 0.024 | 1.926 ± 0.038 | 1.634 ± 0.03 | 1.577 ± 0.034 | 1.894 ± 0.041 | 0.197 ± 0.011 | 0.512 ± 0.018 | 0.0 ± 0.0
Asn | 3.149 ± 0.056 | 0.352 ± 0.015 | 1.494 ± 0.039 | 1.688 ± 0.034 | 1.047 ± 0.029 | 2.341 ± 0.044 | 0.606 ± 0.02 | 1.788 ± 0.037 | 1.105 ± 0.029 | 3.013 ± 0.049 | 0.787 ± 0.024 | 0.83 ± 0.024 | 1.947 ± 0.038 | 0.918 ± 0.027 | 1.775 ± 0.034 | 1.412 ± 0.034 | 1.491 ± 0.032 | 2.108 ± 0.038 | 0.353 ± 0.015 | 0.831 ± 0.03 | 0.0 ± 0.0
Pro | 5.077 ± 0.079 | 0.589 ± 0.019 | 3.512 ± 0.056 | 4.481 ± 0.062 | 2.01 ± 0.041 | 4.33 ± 0.062 | 1.054 ± 0.028 | 2.048 ± 0.04 | 1.863 ± 0.039 | 4.653 ± 0.06 | 1.298 ± 0.027 | 1.25 ± 0.034 | 2.43 ± 0.041 | 1.688 ± 0.035 | 2.622 ± 0.047 | 2.611 ± 0.044 | 2.204 ± 0.035 | 3.796 ± 0.051 | 0.639 ± 0.02 | 1.367 ± 0.031 | 0.0 ± 0.0
Gln | 3.817 ± 0.054 | 0.383 ± 0.017 | 1.888 ± 0.039 | 2.473 ± 0.041 | 1.028 ± 0.028 | 2.672 ± 0.044 | 0.693 ± 0.025 | 1.568 ± 0.036 | 1.457 ± 0.033 | 2.925 ± 0.047 | 0.858 ± 0.025 | 1.094 ± 0.03 | 1.594 ± 0.036 | 1.376 ± 0.036 | 2.229 ± 0.045 | 1.887 ± 0.043 | 1.674 ± 0.037 | 2.223 ± 0.038 | 0.396 ± 0.016 | 0.844 ± 0.026 | 0.0 ± 0.0
Arg | 5.558 ± 0.067 | 0.769 ± 0.025 | 3.423 ± 0.049 | 4.724 ± 0.054 | 2.687 ± 0.051 | 4.117 ± 0.068 | 1.452 ± 0.034 | 3.99 ± 0.053 | 2.866 ± 0.046 | 7.028 ± 0.083 | 1.817 ± 0.04 | 2.009 ± 0.036 | 2.754 ± 0.048 | 2.311 ± 0.043 | 4.726 ± 0.075 | 3.465 ± 0.056 | 3.353 ± 0.047 | 4.235 ± 0.061 | 0.74 ± 0.025 | 1.888 ± 0.037 | 0.0 ± 0.0
Ser | 5.192 ± 0.078 | 0.76 ± 0.026 | 2.824 ± 0.047 | 3.355 ± 0.054 | 2.391 ± 0.042 | 5.159 ± 0.079 | 1.231 ± 0.029 | 3.159 ± 0.046 | 2.121 ± 0.044 | 6.222 ± 0.066 | 1.906 ± 0.034 | 1.459 ± 0.029 | 2.791 ± 0.045 | 1.89 ± 0.038 | 3.854 ± 0.06 | 3.429 ± 0.062 | 2.814 ± 0.043 | 3.901 ± 0.057 | 0.742 ± 0.025 | 1.589 ± 0.033 | 0.0 ± 0.0
Thr | 5.378 ± 0.065 | 0.569 ± 0.021 | 2.559 ± 0.044 | 2.748 ± 0.043 | 2.009 ± 0.034 | 4.539 ± 0.067 | 1.067 ± 0.027 | 2.885 ± 0.048 | 1.771 ± 0.035 | 5.612 ± 0.06 | 1.456 ± 0.034 | 1.382 ± 0.032 | 3.242 ± 0.051 | 1.508 ± 0.034 | 3.107 ± 0.044 | 2.772 ± 0.043 | 2.653 ± 0.046 | 3.997 ± 0.059 | 0.6 ± 0.021 | 1.321 ± 0.031 | 0.0 ± 0.0
Val | 7.138 ± 0.075 | 0.98 ± 0.026 | 4.502 ± 0.056 | 4.818 ± 0.058 | 3.101 ± 0.053 | 5.087 ± 0.059 | 1.577 ± 0.034 | 3.756 ± 0.055 | 2.472 ± 0.053 | 8.14 ± 0.09 | 1.784 ± 0.035 | 2.23 ± 0.038 | 3.277 ± 0.056 | 2.424 ± 0.045 | 4.87 ± 0.061 | 4.418 ± 0.053 | 3.956 ± 0.06 | 5.916 ± 0.077 | 0.786 ± 0.024 | 1.885 ± 0.035 | 0.0 ± 0.0
Trp | 0.974 ± 0.027 | 0.162 ± 0.01 | 0.661 ± 0.024 | 0.716 ± 0.023 | 0.496 ± 0.02 | 0.798 ± 0.027 | 0.232 ± 0.012 | 0.55 ± 0.02 | 0.514 ± 0.018 | 1.338 ± 0.038 | 0.323 ± 0.016 | 0.423 ± 0.016 | 0.589 ± 0.019 | 0.466 ± 0.018 | 0.891 ± 0.027 | 0.679 ± 0.02 | 0.633 ± 0.021 | 0.762 ± 0.027 | 0.197 ± 0.011 | 0.304 ± 0.014 | 0.0 ± 0.0
Tyr | 2.542 ± 0.045 | 0.41 ± 0.016 | 1.677 ± 0.038 | 1.648 ± 0.036 | 1.176 ± 0.029 | 2.297 ± 0.042 | 0.569 ± 0.021 | 1.271 ± 0.031 | 0.982 ± 0.027 | 2.807 ± 0.043 | 0.648 ± 0.022 | 0.83 ± 0.025 | 1.328 ± 0.03 | 0.807 ± 0.024 | 1.729 ± 0.038 | 1.488 ± 0.033 | 1.464 ± 0.034 | 1.9 ± 0.038 | 0.34 ± 0.016 | 0.865 ± 0.026 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.001 | 0.0 ± 0.0
Statistics based on 4594 proteins (1459414 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
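One way to read "bootstrapping at the protein level" is that whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those estimates is reported. The sketch below follows that reading with 100 rounds. It assumes the reported error is the standard deviation of the bootstrap estimates, which the page does not state, and the helper names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Per-mille frequency of one ordered pair (overlapping window, per protein)."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the pair frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter codes.
toy_proteome = ["MAAL", "MAL", "MKAAL", "MAAAL"]
print(pair_permille(toy_proteome, "AA"), bootstrap_error(toy_proteome, "AA"))
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so the error reflects variation between proteins.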

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski