Amino acid dipeptide frequency for Pseudomonas sp.

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
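A minimal sketch of how such a frequency could be computed is given here. It assumes overlapping pairs counted within each protein, one-letter residue codes, and that any non-standard letter is mapped to X (Xaa); the function name `dipeptide_permille` is hypothetical, and the exact procedure used by the database is not stated on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Assumption: pairs never span two proteins, and every residue that is
    not one of the 20 standard ones is treated as X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows: positions (0, 1), (1, 2), ...
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}
```

Called on a list of protein sequences, the function returns values on the same per mille scale as the table, so they sum to 1000 over all observed dipeptides.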

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
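The expectation above can be written down in a few lines. This is only a sketch: the names `expected_permille` and `classify` are hypothetical, and it assumes the comparison is made against the uniform baseline of 1/400 without taking the reported error into account, which the page does not confirm.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation in per mille: 1000 / n^2 (2.5 for n = 20)."""
    return 1000.0 / n_residue_types ** 2

def classify(observed_permille, n_residue_types=20):
    """Label a dipeptide relative to the uniform expectation."""
    expected = expected_permille(n_residue_types)
    if observed_permille > expected:
        return "over"   # would be marked in red
    if observed_permille < expected:
        return "under"  # would be marked in blue
    return "expected"
```

With the values from the table, `classify(12.612)` (AlaAla) gives "over" and `classify(0.168)` (CysCys) gives "under". Passing `n_residue_types=21` switches to the 441-dipeptide baseline of roughly 2.27‰.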

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
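As a small worked example of this unit conversion (the helper names are hypothetical): 12.612‰ corresponds to a fraction of 0.012612, or about 1.26%.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0
```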

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 12.612 ± 0.306
AlaCys: 1.041 ± 0.066
AlaAsp: 5.796 ± 0.152
AlaGlu: 6.267 ± 0.187
AlaPhe: 3.722 ± 0.134
AlaGly: 8.247 ± 0.246
AlaHis: 2.272 ± 0.1
AlaIle: 5.834 ± 0.169
AlaLys: 4.724 ± 0.152
AlaLeu: 12.328 ± 0.26
AlaMet: 2.985 ± 0.11
AlaAsn: 3.344 ± 0.137
AlaPro: 4.069 ± 0.124
AlaGln: 4.767 ± 0.117
AlaArg: 6.871 ± 0.164
AlaSer: 7.0 ± 0.197
AlaThr: 5.69 ± 0.176
AlaVal: 7.865 ± 0.177
AlaTrp: 1.465 ± 0.092
AlaTyr: 2.549 ± 0.09
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.006 ± 0.064
CysCys: 0.168 ± 0.03
CysAsp: 0.511 ± 0.042
CysGlu: 0.627 ± 0.052
CysPhe: 0.296 ± 0.035
CysGly: 0.76 ± 0.057
CysHis: 0.214 ± 0.032
CysIle: 0.44 ± 0.04
CysLys: 0.335 ± 0.039
CysLeu: 0.904 ± 0.066
CysMet: 0.226 ± 0.03
CysAsn: 0.203 ± 0.026
CysPro: 0.335 ± 0.04
CysGln: 0.288 ± 0.034
CysArg: 0.682 ± 0.05
CysSer: 0.709 ± 0.056
CysThr: 0.495 ± 0.047
CysVal: 0.686 ± 0.049
CysTrp: 0.125 ± 0.031
CysTyr: 0.21 ± 0.031
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.107 ± 0.15
AspCys: 0.53 ± 0.047
AspAsp: 2.853 ± 0.111
AspGlu: 3.64 ± 0.119
AspPhe: 2.264 ± 0.099
AspGly: 4.295 ± 0.137
AspHis: 1.372 ± 0.074
AspIle: 2.822 ± 0.11
AspLys: 2.21 ± 0.098
AspLeu: 5.784 ± 0.168
AspMet: 1.111 ± 0.059
AspAsn: 1.528 ± 0.092
AspPro: 2.572 ± 0.122
AspGln: 2.284 ± 0.103
AspArg: 3.438 ± 0.123
AspSer: 2.728 ± 0.114
AspThr: 2.323 ± 0.102
AspVal: 3.827 ± 0.122
AspTrp: 0.857 ± 0.056
AspTyr: 1.485 ± 0.087
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.279 ± 0.195
GluCys: 0.401 ± 0.039
GluAsp: 2.639 ± 0.099
GluGlu: 3.309 ± 0.163
GluPhe: 1.843 ± 0.081
GluGly: 3.905 ± 0.133
GluHis: 1.407 ± 0.077
GluIle: 3.301 ± 0.136
GluLys: 2.814 ± 0.128
GluLeu: 6.131 ± 0.169
GluMet: 1.415 ± 0.078
GluAsn: 1.766 ± 0.086
GluPro: 2.401 ± 0.094
GluGln: 2.876 ± 0.117
GluArg: 4.821 ± 0.169
GluSer: 3.114 ± 0.104
GluThr: 2.884 ± 0.097
GluVal: 4.268 ± 0.167
GluTrp: 0.674 ± 0.054
GluTyr: 1.185 ± 0.076
GluXaa: 0.0 ± 0.0

Phe
PheAla: 4.01 ± 0.136
PheCys: 0.437 ± 0.041
PheAsp: 2.557 ± 0.108
PheGlu: 2.147 ± 0.091
PhePhe: 1.501 ± 0.087
PheGly: 3.083 ± 0.13
PheHis: 0.721 ± 0.06
PheIle: 1.785 ± 0.094
PheLys: 1.602 ± 0.082
PheLeu: 2.888 ± 0.106
PheMet: 0.815 ± 0.055
PheAsn: 1.469 ± 0.074
PhePro: 1.387 ± 0.082
PheGln: 1.083 ± 0.069
PheArg: 1.668 ± 0.076
PheSer: 2.572 ± 0.095
PheThr: 2.042 ± 0.09
PheVal: 2.763 ± 0.096
PheTrp: 0.577 ± 0.052
PheTyr: 1.009 ± 0.054
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 6.961 ± 0.22
GlyCys: 0.717 ± 0.053
GlyAsp: 3.901 ± 0.126
GlyGlu: 3.87 ± 0.142
GlyPhe: 3.188 ± 0.107
GlyGly: 5.585 ± 0.244
GlyHis: 1.836 ± 0.086
GlyIle: 4.338 ± 0.139
GlyLys: 3.59 ± 0.131
GlyLeu: 7.947 ± 0.192
GlyMet: 2.245 ± 0.104
GlyAsn: 2.483 ± 0.149
GlyPro: 2.296 ± 0.111
GlyGln: 3.079 ± 0.115
GlyArg: 4.77 ± 0.167
GlySer: 4.833 ± 0.161
GlyThr: 4.498 ± 0.148
GlyVal: 5.686 ± 0.171
GlyTrp: 1.177 ± 0.07
GlyTyr: 2.276 ± 0.098
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.288 ± 0.092
HisCys: 0.218 ± 0.029
HisAsp: 1.274 ± 0.072
HisGlu: 1.212 ± 0.064
HisPhe: 0.908 ± 0.06
HisGly: 1.754 ± 0.082
HisHis: 0.748 ± 0.065
HisIle: 0.986 ± 0.057
HisLys: 0.674 ± 0.05
HisLeu: 2.389 ± 0.101
HisMet: 0.495 ± 0.046
HisAsn: 0.53 ± 0.051
HisPro: 1.271 ± 0.076
HisGln: 0.916 ± 0.057
HisArg: 1.567 ± 0.08
HisSer: 1.306 ± 0.067
HisThr: 1.08 ± 0.07
HisVal: 1.617 ± 0.088
HisTrp: 0.327 ± 0.033
HisTyr: 0.627 ± 0.059
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.263 ± 0.203
IleCys: 0.468 ± 0.042
IleAsp: 3.742 ± 0.132
IleGlu: 3.929 ± 0.124
IlePhe: 1.551 ± 0.09
IleGly: 4.466 ± 0.143
IleHis: 0.959 ± 0.056
IleIle: 2.175 ± 0.114
IleLys: 2.319 ± 0.109
IleLeu: 4.01 ± 0.143
IleMet: 0.885 ± 0.053
IleAsn: 1.968 ± 0.098
IlePro: 2.467 ± 0.106
IleGln: 1.797 ± 0.083
IleArg: 3.009 ± 0.108
IleSer: 3.613 ± 0.122
IleThr: 2.845 ± 0.113
IleVal: 3.757 ± 0.133
IleTrp: 0.514 ± 0.05
IleTyr: 1.259 ± 0.07
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.704 ± 0.113
LysCys: 0.288 ± 0.037
LysAsp: 2.194 ± 0.111
LysGlu: 2.222 ± 0.096
LysPhe: 1.239 ± 0.071
LysGly: 2.705 ± 0.114
LysHis: 0.912 ± 0.063
LysIle: 2.101 ± 0.086
LysLys: 2.011 ± 0.095
LysLeu: 4.198 ± 0.15
LysMet: 1.029 ± 0.067
LysAsn: 1.329 ± 0.081
LysPro: 2.311 ± 0.102
LysGln: 1.575 ± 0.088
LysArg: 2.954 ± 0.109
LysSer: 2.833 ± 0.119
LysThr: 2.498 ± 0.112
LysVal: 3.239 ± 0.118
LysTrp: 0.475 ± 0.039
LysTyr: 0.967 ± 0.066
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 12.098 ± 0.272
LeuCys: 1.068 ± 0.076
LeuAsp: 5.671 ± 0.162
LeuGlu: 5.784 ± 0.173
LeuPhe: 3.418 ± 0.121
LeuGly: 7.612 ± 0.213
LeuHis: 2.311 ± 0.101
LeuIle: 5.375 ± 0.16
LeuLys: 4.63 ± 0.154
LeuLeu: 10.161 ± 0.269
LeuMet: 2.553 ± 0.083
LeuAsn: 3.434 ± 0.117
LeuPro: 4.891 ± 0.139
LeuGln: 3.862 ± 0.13
LeuArg: 6.875 ± 0.193
LeuSer: 7.358 ± 0.166
LeuThr: 5.753 ± 0.141
LeuVal: 7.269 ± 0.178
LeuTrp: 1.048 ± 0.068
LeuTyr: 2.206 ± 0.105
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.876 ± 0.114
MetCys: 0.183 ± 0.03
MetAsp: 1.142 ± 0.063
MetGlu: 1.095 ± 0.071
MetPhe: 0.854 ± 0.063
MetGly: 1.703 ± 0.085
MetHis: 0.464 ± 0.047
MetIle: 1.224 ± 0.073
MetLys: 1.263 ± 0.077
MetLeu: 2.479 ± 0.105
MetMet: 0.526 ± 0.053
MetAsn: 0.916 ± 0.057
MetPro: 1.329 ± 0.067
MetGln: 0.998 ± 0.061
MetArg: 1.571 ± 0.074
MetSer: 1.727 ± 0.081
MetThr: 1.711 ± 0.077
MetVal: 1.754 ± 0.081
MetTrp: 0.214 ± 0.026
MetTyr: 0.448 ± 0.04
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.687 ± 0.136
AsnCys: 0.331 ± 0.034
AsnAsp: 1.773 ± 0.093
AsnGlu: 1.699 ± 0.079
AsnPhe: 1.072 ± 0.073
AsnGly: 2.759 ± 0.144
AsnHis: 0.702 ± 0.052
AsnIle: 1.543 ± 0.076
AsnLys: 1.099 ± 0.076
AsnLeu: 3.36 ± 0.111
AsnMet: 0.713 ± 0.049
AsnAsn: 1.107 ± 0.079
AsnPro: 1.933 ± 0.1
AsnGln: 1.368 ± 0.084
AsnArg: 2.034 ± 0.1
AsnSer: 1.921 ± 0.11
AsnThr: 1.707 ± 0.087
AsnVal: 2.338 ± 0.099
AsnTrp: 0.425 ± 0.037
AsnTyr: 0.908 ± 0.064
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 5.043 ± 0.154
ProCys: 0.374 ± 0.042
ProAsp: 2.603 ± 0.09
ProGlu: 2.783 ± 0.098
ProPhe: 1.664 ± 0.071
ProGly: 3.512 ± 0.139
ProHis: 0.928 ± 0.064
ProIle: 2.342 ± 0.109
ProLys: 1.766 ± 0.089
ProLeu: 4.466 ± 0.126
ProMet: 1.177 ± 0.076
ProAsn: 1.575 ± 0.086
ProPro: 1.925 ± 0.109
ProGln: 1.684 ± 0.083
ProArg: 2.432 ± 0.111
ProSer: 2.896 ± 0.108
ProThr: 2.436 ± 0.121
ProVal: 3.625 ± 0.131
ProTrp: 0.639 ± 0.047
ProTyr: 0.994 ± 0.064
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.463 ± 0.138
GlnCys: 0.362 ± 0.041
GlnAsp: 1.668 ± 0.075
GlnGlu: 2.116 ± 0.106
GlnPhe: 1.22 ± 0.062
GlnGly: 3.204 ± 0.135
GlnHis: 0.873 ± 0.06
GlnIle: 2.346 ± 0.101
GlnLys: 1.504 ± 0.084
GlnLeu: 4.517 ± 0.145
GlnMet: 1.099 ± 0.07
GlnAsn: 1.083 ± 0.066
GlnPro: 1.953 ± 0.092
GlnGln: 2.015 ± 0.096
GlnArg: 2.997 ± 0.12
GlnSer: 2.448 ± 0.106
GlnThr: 2.077 ± 0.093
GlnVal: 2.993 ± 0.117
GlnTrp: 0.616 ± 0.052
GlnTyr: 0.955 ± 0.07
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 6.392 ± 0.166
ArgCys: 0.444 ± 0.041
ArgAsp: 3.469 ± 0.111
ArgGlu: 4.049 ± 0.156
ArgPhe: 2.572 ± 0.091
ArgGly: 3.905 ± 0.129
ArgHis: 1.672 ± 0.086
ArgIle: 3.445 ± 0.113
ArgLys: 2.635 ± 0.108
ArgLeu: 7.284 ± 0.211
ArgMet: 1.758 ± 0.088
ArgAsn: 2.218 ± 0.093
ArgPro: 2.915 ± 0.105
ArgGln: 3.095 ± 0.113
ArgArg: 4.778 ± 0.146
ArgSer: 4.385 ± 0.133
ArgThr: 3.18 ± 0.106
ArgVal: 4.478 ± 0.142
ArgTrp: 1.06 ± 0.063
ArgTyr: 1.875 ± 0.089
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.973 ± 0.201
SerCys: 0.616 ± 0.049
SerAsp: 3.157 ± 0.121
SerGlu: 3.282 ± 0.131
SerPhe: 2.413 ± 0.107
SerGly: 5.343 ± 0.161
SerHis: 1.458 ± 0.081
SerIle: 3.43 ± 0.122
SerLys: 2.451 ± 0.102
SerLeu: 6.715 ± 0.176
SerMet: 1.676 ± 0.065
SerAsn: 2.058 ± 0.109
SerPro: 3.102 ± 0.098
SerGln: 2.475 ± 0.096
SerArg: 4.209 ± 0.123
SerSer: 4.447 ± 0.166
SerThr: 3.921 ± 0.128
SerVal: 4.693 ± 0.152
SerTrp: 0.931 ± 0.061
SerTyr: 1.649 ± 0.087
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.784 ± 0.175
ThrCys: 0.44 ± 0.046
ThrAsp: 2.841 ± 0.113
ThrGlu: 2.666 ± 0.106
ThrPhe: 2.046 ± 0.099
ThrGly: 4.49 ± 0.144
ThrHis: 1.204 ± 0.068
ThrIle: 2.693 ± 0.117
ThrLys: 1.793 ± 0.085
ThrLeu: 5.912 ± 0.163
ThrMet: 1.009 ± 0.058
ThrAsn: 1.746 ± 0.093
ThrPro: 3.024 ± 0.132
ThrGln: 2.132 ± 0.092
ThrArg: 3.387 ± 0.116
ThrSer: 3.328 ± 0.123
ThrThr: 3.18 ± 0.126
ThrVal: 4.494 ± 0.15
ThrTrp: 0.702 ± 0.069
ThrTyr: 1.399 ± 0.079
ThrXaa: 0.0 ± 0.0

Val
ValAla: 8.228 ± 0.17
ValCys: 0.795 ± 0.045
ValAsp: 4.209 ± 0.154
ValGlu: 4.747 ± 0.143
ValPhe: 2.818 ± 0.119
ValGly: 5.102 ± 0.174
ValHis: 1.376 ± 0.084
ValIle: 4.081 ± 0.143
ValLys: 2.752 ± 0.112
ValLeu: 7.542 ± 0.209
ValMet: 1.929 ± 0.075
ValAsn: 2.358 ± 0.105
ValPro: 3.188 ± 0.115
ValGln: 2.658 ± 0.095
ValArg: 4.599 ± 0.136
ValSer: 5.195 ± 0.14
ValThr: 4.026 ± 0.129
ValVal: 5.932 ± 0.163
ValTrp: 0.947 ± 0.056
ValTyr: 1.504 ± 0.085
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.247 ± 0.078
TrpCys: 0.121 ± 0.021
TrpAsp: 0.608 ± 0.049
TrpGlu: 0.616 ± 0.045
TrpPhe: 0.6 ± 0.047
TrpGly: 0.787 ± 0.068
TrpHis: 0.3 ± 0.033
TrpIle: 0.635 ± 0.058
TrpLys: 0.674 ± 0.046
TrpLeu: 1.727 ± 0.08
TrpMet: 0.37 ± 0.035
TrpAsn: 0.561 ± 0.053
TrpPro: 0.503 ± 0.049
TrpGln: 0.62 ± 0.05
TrpArg: 1.048 ± 0.067
TrpSer: 0.826 ± 0.059
TrpThr: 0.725 ± 0.055
TrpVal: 0.9 ± 0.064
TrpTrp: 0.273 ± 0.032
TrpTyr: 0.32 ± 0.032
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.432 ± 0.079
TyrCys: 0.218 ± 0.028
TyrAsp: 1.423 ± 0.092
TyrGlu: 1.372 ± 0.071
TyrPhe: 0.928 ± 0.069
TyrGly: 2.058 ± 0.095
TyrHis: 0.46 ± 0.046
TyrIle: 1.099 ± 0.073
TyrLys: 0.939 ± 0.066
TyrLeu: 2.635 ± 0.118
TyrMet: 0.456 ± 0.048
TyrAsn: 0.807 ± 0.055
TyrPro: 0.99 ± 0.06
TyrGln: 0.939 ± 0.06
TyrArg: 1.898 ± 0.088
TyrSer: 1.758 ± 0.09
TyrThr: 1.2 ± 0.073
TyrVal: 1.801 ± 0.08
TyrTrp: 0.413 ± 0.042
TyrTyr: 0.647 ± 0.052
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 1027 proteins (256579 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
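A protein-level bootstrap of this kind might look roughly like the sketch below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates; none of these details are confirmed by this page, and the function names are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement, so residues from one protein always stay together.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling proteins rather than individual residues keeps the within-protein composition intact, so the spread reflects variation between proteins in the proteome.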

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski