Amino acid dipeptide frequency for Pseudoglutamicibacter albus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be expected at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red in the table, and underrepresented ones are marked in blue.
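The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by Proteome-pI: the function names `dipeptide_per_mille` and `classify`, the one-letter residue codes (the table uses three-letter codes), and the toy sequences are all assumptions made for the example. It counts overlapping residue pairs within each protein and compares a frequency against the uniform expectation of 1/400.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Pairs are counted within each protein, never across protein boundaries.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a dipeptide relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


toy_proteome = ["MAAALG", "MAALV"]  # made-up sequences, for illustration only
freqs = dipeptide_per_mille(toy_proteome)
```

With the table values, `classify(15.935)` (AlaAla) gives "over" and `classify(0.72)` (AlaCys) gives "under", which corresponds to the red and blue marking described above.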

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
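As a small worked example of the unit conversion, the sketch below turns the AlaAla value from the table into a plain fraction and compares it with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is hypothetical and chosen only for this illustration.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala = per_mille_to_fraction(15.935)  # AlaAla value taken from the table
uniform = 1 / 400                        # uniform expectation as a fraction
ratio = ala_ala / uniform                # fold difference versus the uniform expectation
```

Here `ala_ala` is about 0.0159 (1.59%), roughly 6.4 times the uniform expectation of 0.0025.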

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.935 ± 0.25
AlaCys: 0.72 ± 0.039
AlaAsp: 7.215 ± 0.127
AlaGlu: 8.51 ± 0.172
AlaPhe: 3.542 ± 0.086
AlaGly: 10.653 ± 0.152
AlaHis: 2.492 ± 0.059
AlaIle: 5.179 ± 0.09
AlaLys: 4.024 ± 0.108
AlaLeu: 11.909 ± 0.166
AlaMet: 2.777 ± 0.073
AlaAsn: 2.751 ± 0.066
AlaPro: 5.218 ± 0.111
AlaGln: 4.609 ± 0.097
AlaArg: 7.529 ± 0.133
AlaSer: 7.093 ± 0.127
AlaThr: 6.715 ± 0.125
AlaVal: 10.157 ± 0.151
AlaTrp: 1.738 ± 0.061
AlaTyr: 2.268 ± 0.06
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.632 ± 0.03
CysCys: 0.045 ± 0.011
CysAsp: 0.343 ± 0.022
CysGlu: 0.331 ± 0.026
CysPhe: 0.182 ± 0.018
CysGly: 0.673 ± 0.037
CysHis: 0.138 ± 0.015
CysIle: 0.294 ± 0.023
CysLys: 0.125 ± 0.014
CysLeu: 0.497 ± 0.036
CysMet: 0.11 ± 0.014
CysAsn: 0.128 ± 0.013
CysPro: 0.307 ± 0.028
CysGln: 0.161 ± 0.018
CysArg: 0.312 ± 0.021
CysSer: 0.422 ± 0.028
CysThr: 0.323 ± 0.021
CysVal: 0.505 ± 0.028
CysTrp: 0.073 ± 0.011
CysTyr: 0.13 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.24 ± 0.147
AspCys: 0.25 ± 0.022
AspAsp: 3.439 ± 0.076
AspGlu: 4.647 ± 0.099
AspPhe: 1.748 ± 0.059
AspGly: 5.335 ± 0.119
AspHis: 1.397 ± 0.048
AspIle: 2.647 ± 0.073
AspLys: 1.759 ± 0.066
AspLeu: 5.227 ± 0.09
AspMet: 1.132 ± 0.037
AspAsn: 1.238 ± 0.051
AspPro: 3.737 ± 0.092
AspGln: 1.885 ± 0.056
AspArg: 3.321 ± 0.081
AspSer: 2.916 ± 0.071
AspThr: 2.95 ± 0.066
AspVal: 5.525 ± 0.093
AspTrp: 0.733 ± 0.038
AspTyr: 1.282 ± 0.054
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.315 ± 0.158
GluCys: 0.356 ± 0.024
GluAsp: 4.148 ± 0.101
GluGlu: 4.448 ± 0.097
GluPhe: 1.958 ± 0.057
GluGly: 4.663 ± 0.11
GluHis: 1.906 ± 0.065
GluIle: 3.069 ± 0.089
GluLys: 2.554 ± 0.069
GluLeu: 6.846 ± 0.117
GluMet: 1.227 ± 0.041
GluAsn: 2.01 ± 0.064
GluPro: 3.318 ± 0.105
GluGln: 2.77 ± 0.075
GluArg: 5.058 ± 0.108
GluSer: 3.308 ± 0.085
GluThr: 3.722 ± 0.092
GluVal: 4.811 ± 0.12
GluTrp: 0.933 ± 0.04
GluTyr: 1.41 ± 0.056
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.571 ± 0.075
PheCys: 0.249 ± 0.021
PheAsp: 2.083 ± 0.064
PheGlu: 1.964 ± 0.066
PhePhe: 1.108 ± 0.054
PheGly: 3.207 ± 0.074
PheHis: 0.552 ± 0.032
PheIle: 1.516 ± 0.058
PheLys: 0.868 ± 0.042
PheLeu: 2.703 ± 0.072
PheMet: 0.752 ± 0.035
PheAsn: 0.898 ± 0.035
PhePro: 1.386 ± 0.046
PheGln: 0.887 ± 0.04
PheArg: 1.607 ± 0.057
PheSer: 2.01 ± 0.049
PheThr: 2.148 ± 0.061
PheVal: 2.468 ± 0.07
PheTrp: 0.409 ± 0.026
PheTyr: 0.661 ± 0.03
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.033 ± 0.141
GlyCys: 0.583 ± 0.031
GlyAsp: 4.213 ± 0.091
GlyGlu: 5.441 ± 0.102
GlyPhe: 3.235 ± 0.065
GlyGly: 6.733 ± 0.132
GlyHis: 1.894 ± 0.065
GlyIle: 4.432 ± 0.087
GlyLys: 3.332 ± 0.096
GlyLeu: 8.211 ± 0.147
GlyMet: 2.005 ± 0.058
GlyAsn: 2.278 ± 0.076
GlyPro: 3.576 ± 0.087
GlyGln: 2.905 ± 0.072
GlyArg: 5.368 ± 0.101
GlySer: 5.301 ± 0.093
GlyThr: 5.046 ± 0.1
GlyVal: 7.152 ± 0.112
GlyTrp: 1.482 ± 0.053
GlyTyr: 2.102 ± 0.058
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.422 ± 0.064
HisCys: 0.148 ± 0.015
HisAsp: 1.348 ± 0.049
HisGlu: 1.378 ± 0.048
HisPhe: 0.585 ± 0.033
HisGly: 1.985 ± 0.063
HisHis: 0.624 ± 0.034
HisIle: 0.973 ± 0.042
HisLys: 0.554 ± 0.028
HisLeu: 1.878 ± 0.054
HisMet: 0.465 ± 0.026
HisAsn: 0.541 ± 0.031
HisPro: 1.462 ± 0.051
HisGln: 0.755 ± 0.038
HisArg: 1.496 ± 0.05
HisSer: 1.167 ± 0.048
HisThr: 1.348 ± 0.049
HisVal: 1.828 ± 0.06
HisTrp: 0.281 ± 0.02
HisTyr: 0.481 ± 0.029
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.14 ± 0.102
IleCys: 0.275 ± 0.024
IleAsp: 3.184 ± 0.074
IleGlu: 3.301 ± 0.09
IlePhe: 1.335 ± 0.053
IleGly: 4.151 ± 0.096
IleHis: 0.859 ± 0.038
IleIle: 2.151 ± 0.074
IleLys: 1.556 ± 0.053
IleLeu: 3.811 ± 0.09
IleMet: 0.986 ± 0.047
IleAsn: 1.319 ± 0.051
IlePro: 2.712 ± 0.064
IleGln: 1.267 ± 0.048
IleArg: 2.598 ± 0.074
IleSer: 2.746 ± 0.061
IleThr: 2.993 ± 0.084
IleVal: 4.307 ± 0.095
IleTrp: 0.473 ± 0.028
IleTyr: 0.898 ± 0.036
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.218 ± 0.113
LysCys: 0.132 ± 0.016
LysAsp: 2.297 ± 0.08
LysGlu: 2.005 ± 0.067
LysPhe: 0.79 ± 0.038
LysGly: 2.317 ± 0.069
LysHis: 0.777 ± 0.035
LysIle: 1.599 ± 0.055
LysLys: 1.8 ± 0.078
LysLeu: 2.981 ± 0.07
LysMet: 0.801 ± 0.033
LysAsn: 1.196 ± 0.042
LysPro: 2.229 ± 0.084
LysGln: 1.334 ± 0.046
LysArg: 2.232 ± 0.064
LysSer: 1.721 ± 0.052
LysThr: 1.99 ± 0.07
LysVal: 2.731 ± 0.081
LysTrp: 0.37 ± 0.024
LysTyr: 0.699 ± 0.034
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.063 ± 0.191
LeuCys: 0.526 ± 0.031
LeuAsp: 5.833 ± 0.1
LeuGlu: 5.683 ± 0.108
LeuPhe: 2.656 ± 0.071
LeuGly: 8.016 ± 0.149
LeuHis: 1.743 ± 0.057
LeuIle: 4.694 ± 0.104
LeuLys: 3.178 ± 0.081
LeuLeu: 8.416 ± 0.156
LeuMet: 2.138 ± 0.069
LeuAsn: 2.692 ± 0.065
LeuPro: 4.772 ± 0.082
LeuGln: 2.429 ± 0.063
LeuArg: 6.058 ± 0.102
LeuSer: 5.594 ± 0.115
LeuThr: 6.138 ± 0.112
LeuVal: 7.798 ± 0.138
LeuTrp: 1.054 ± 0.045
LeuTyr: 1.654 ± 0.056
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.525 ± 0.076
MetCys: 0.119 ± 0.014
MetAsp: 1.093 ± 0.043
MetGlu: 0.993 ± 0.042
MetPhe: 0.712 ± 0.038
MetGly: 1.727 ± 0.066
MetHis: 0.468 ± 0.028
MetIle: 1.139 ± 0.048
MetLys: 0.751 ± 0.03
MetLeu: 2.206 ± 0.064
MetMet: 0.432 ± 0.027
MetAsn: 0.684 ± 0.035
MetPro: 1.17 ± 0.04
MetGln: 0.736 ± 0.036
MetArg: 1.618 ± 0.05
MetSer: 1.763 ± 0.052
MetThr: 1.721 ± 0.052
MetVal: 1.675 ± 0.052
MetTrp: 0.314 ± 0.023
MetTyr: 0.395 ± 0.024
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.313 ± 0.077
AsnCys: 0.127 ± 0.016
AsnAsp: 1.465 ± 0.052
AsnGlu: 1.548 ± 0.051
AsnPhe: 0.762 ± 0.041
AsnGly: 2.314 ± 0.07
AsnHis: 0.548 ± 0.03
AsnIle: 1.306 ± 0.045
AsnLys: 1.011 ± 0.047
AsnLeu: 2.304 ± 0.071
AsnMet: 0.642 ± 0.033
AsnAsn: 0.863 ± 0.037
AsnPro: 1.998 ± 0.064
AsnGln: 1.03 ± 0.045
AsnArg: 1.644 ± 0.058
AsnSer: 1.415 ± 0.057
AsnThr: 1.514 ± 0.053
AsnVal: 2.218 ± 0.066
AsnTrp: 0.374 ± 0.022
AsnTyr: 0.556 ± 0.03
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.052 ± 0.117
ProCys: 0.214 ± 0.019
ProAsp: 3.311 ± 0.086
ProGlu: 4.785 ± 0.105
ProPhe: 1.517 ± 0.05
ProGly: 4.677 ± 0.11
ProHis: 1.296 ± 0.05
ProIle: 1.919 ± 0.054
ProLys: 1.584 ± 0.057
ProLeu: 4.465 ± 0.094
ProMet: 0.999 ± 0.041
ProAsn: 1.384 ± 0.053
ProPro: 1.716 ± 0.076
ProGln: 1.976 ± 0.065
ProArg: 2.993 ± 0.08
ProSer: 3.1 ± 0.084
ProThr: 3.09 ± 0.104
ProVal: 4.382 ± 0.087
ProTrp: 0.751 ± 0.034
ProTyr: 1.038 ± 0.041
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.385 ± 0.093
GlnCys: 0.151 ± 0.017
GlnAsp: 1.748 ± 0.06
GlnGlu: 2.034 ± 0.062
GlnPhe: 0.921 ± 0.041
GlnGly: 2.442 ± 0.062
GlnHis: 0.908 ± 0.042
GlnIle: 1.532 ± 0.048
GlnLys: 1.29 ± 0.052
GlnLeu: 3.488 ± 0.082
GlnMet: 0.725 ± 0.035
GlnAsn: 0.933 ± 0.045
GlnPro: 1.943 ± 0.064
GlnGln: 1.769 ± 0.072
GlnArg: 2.879 ± 0.082
GlnSer: 1.503 ± 0.045
GlnThr: 1.758 ± 0.055
GlnVal: 2.593 ± 0.066
GlnTrp: 0.45 ± 0.031
GlnTyr: 0.689 ± 0.042
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.993 ± 0.143
ArgCys: 0.377 ± 0.028
ArgAsp: 3.558 ± 0.091
ArgGlu: 4.669 ± 0.108
ArgPhe: 2.052 ± 0.055
ArgGly: 4.954 ± 0.099
ArgHis: 1.37 ± 0.049
ArgIle: 3.475 ± 0.082
ArgLys: 2.253 ± 0.064
ArgLeu: 5.943 ± 0.124
ArgMet: 1.516 ± 0.051
ArgAsn: 1.798 ± 0.06
ArgPro: 2.939 ± 0.078
ArgGln: 2.195 ± 0.063
ArgArg: 5.313 ± 0.123
ArgSer: 3.469 ± 0.088
ArgThr: 3.706 ± 0.085
ArgVal: 5.082 ± 0.099
ArgTrp: 1.061 ± 0.047
ArgTyr: 1.54 ± 0.05
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.83 ± 0.121
SerCys: 0.275 ± 0.02
SerAsp: 3.236 ± 0.078
SerGlu: 3.751 ± 0.082
SerPhe: 1.943 ± 0.055
SerGly: 5.488 ± 0.102
SerHis: 1.124 ± 0.042
SerIle: 2.559 ± 0.071
SerLys: 2.003 ± 0.068
SerLeu: 5.036 ± 0.101
SerMet: 1.519 ± 0.046
SerAsn: 1.526 ± 0.051
SerPro: 2.856 ± 0.066
SerGln: 2.01 ± 0.059
SerArg: 3.475 ± 0.07
SerSer: 3.699 ± 0.097
SerThr: 3.543 ± 0.075
SerVal: 4.711 ± 0.089
SerTrp: 0.814 ± 0.032
SerTyr: 1.228 ± 0.046
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.596 ± 0.118
ThrCys: 0.346 ± 0.025
ThrAsp: 3.418 ± 0.085
ThrGlu: 3.68 ± 0.096
ThrPhe: 1.985 ± 0.058
ThrGly: 5.428 ± 0.095
ThrHis: 1.405 ± 0.045
ThrIle: 2.738 ± 0.077
ThrLys: 1.85 ± 0.064
ThrLeu: 5.571 ± 0.098
ThrMet: 1.266 ± 0.044
ThrAsn: 1.599 ± 0.058
ThrPro: 3.846 ± 0.105
ThrGln: 2.037 ± 0.061
ThrArg: 3.345 ± 0.076
ThrSer: 3.407 ± 0.079
ThrThr: 3.691 ± 0.09
ThrVal: 5.504 ± 0.115
ThrTrp: 0.837 ± 0.041
ThrTyr: 1.443 ± 0.052
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.045 ± 0.154
ValCys: 0.585 ± 0.035
ValAsp: 5.326 ± 0.103
ValGlu: 5.558 ± 0.117
ValPhe: 2.746 ± 0.074
ValGly: 6.513 ± 0.132
ValHis: 1.594 ± 0.053
ValIle: 4.281 ± 0.086
ValLys: 2.599 ± 0.082
ValLeu: 8.151 ± 0.147
ValMet: 1.964 ± 0.062
ValAsn: 2.14 ± 0.054
ValPro: 4.302 ± 0.086
ValGln: 2.166 ± 0.051
ValArg: 5.087 ± 0.102
ValSer: 5.066 ± 0.097
ValThr: 5.694 ± 0.096
ValVal: 8.071 ± 0.159
ValTrp: 1.087 ± 0.053
ValTyr: 1.474 ± 0.047
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.5 ± 0.048
TrpCys: 0.123 ± 0.015
TrpAsp: 0.882 ± 0.034
TrpGlu: 0.755 ± 0.037
TrpPhe: 0.536 ± 0.032
TrpGly: 1.04 ± 0.043
TrpHis: 0.32 ± 0.024
TrpIle: 0.744 ± 0.031
TrpLys: 0.432 ± 0.025
TrpLeu: 1.543 ± 0.058
TrpMet: 0.396 ± 0.024
TrpAsn: 0.426 ± 0.029
TrpPro: 0.617 ± 0.032
TrpGln: 0.47 ± 0.028
TrpArg: 0.869 ± 0.043
TrpSer: 0.673 ± 0.034
TrpThr: 0.713 ± 0.033
TrpVal: 1.235 ± 0.044
TrpTrp: 0.338 ± 0.026
TrpTyr: 0.275 ± 0.023
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.227 ± 0.063
TyrCys: 0.133 ± 0.015
TyrAsp: 1.267 ± 0.047
TyrGlu: 1.345 ± 0.056
TyrPhe: 0.734 ± 0.032
TyrGly: 2.006 ± 0.062
TyrHis: 0.304 ± 0.023
TyrIle: 0.817 ± 0.037
TyrLys: 0.692 ± 0.036
TyrLeu: 1.948 ± 0.055
TyrMet: 0.385 ± 0.022
TyrAsn: 0.548 ± 0.036
TyrPro: 1.145 ± 0.047
TyrGln: 0.751 ± 0.034
TyrArg: 1.462 ± 0.053
TyrSer: 1.257 ± 0.046
TyrThr: 1.188 ± 0.047
TyrVal: 1.695 ± 0.052
TyrTrp: 0.317 ± 0.022
TyrTyr: 0.479 ± 0.03
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 1813 proteins (615517 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
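One way to picture bootstrapping at the protein level is the sketch below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is an assumption-laden illustration, not the Proteome-pI implementation: the function names, the fixed seed, the toy sequences, and the use of the sample standard deviation as the reported "±" value are all choices made for this example only.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(sequences, dipeptide, n_boot=100, seed=0):
    """Estimate the spread (in per mille) of one dipeptide frequency.

    Whole proteins are resampled with replacement, so the resampling unit
    is the protein rather than the individual residue pair.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the spread is summarized as the sample standard deviation.
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteome = ["MAAALG", "MAALV", "MKAAG", "MLLAA"]  # made-up sequences
mean_ppt, err_ppt = bootstrap_error(toy_proteome, "AA")
```

Resampling proteins instead of individual dipeptides keeps pairs from the same protein together, so the error reflects variation between proteins.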

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski