Amino acid dipepetide frequency for Pseudomonas phage vB_PaeM_PS119XW

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.997AlaAla: 5.997 ± 0.388
0.556AlaCys: 0.556 ± 0.092
3.635AlaAsp: 3.635 ± 0.237
4.789AlaGlu: 4.789 ± 0.264
2.566AlaPhe: 2.566 ± 0.17
4.704AlaGly: 4.704 ± 0.339
1.315AlaHis: 1.315 ± 0.114
5.003AlaIle: 5.003 ± 0.21
3.945AlaLys: 3.945 ± 0.267
6.104AlaLeu: 6.104 ± 0.267
2.405AlaMet: 2.405 ± 0.18
3.474AlaAsn: 3.474 ± 0.194
2.341AlaPro: 2.341 ± 0.193
2.437AlaGln: 2.437 ± 0.195
2.972AlaArg: 2.972 ± 0.179
4.266AlaSer: 4.266 ± 0.201
4.554AlaThr: 4.554 ± 0.239
5.164AlaVal: 5.164 ± 0.242
0.919AlaTrp: 0.919 ± 0.111
2.983AlaTyr: 2.983 ± 0.165
0.0AlaXaa: 0.0 ± 0.0
Cys
0.438CysAla: 0.438 ± 0.073
0.192CysCys: 0.192 ± 0.04
0.535CysAsp: 0.535 ± 0.083
0.663CysGlu: 0.663 ± 0.082
0.363CysPhe: 0.363 ± 0.064
0.62CysGly: 0.62 ± 0.091
0.203CysHis: 0.203 ± 0.048
0.738CysIle: 0.738 ± 0.098
0.62CysLys: 0.62 ± 0.093
0.695CysLeu: 0.695 ± 0.092
0.225CysMet: 0.225 ± 0.048
0.406CysAsn: 0.406 ± 0.066
0.331CysPro: 0.331 ± 0.064
0.331CysGln: 0.331 ± 0.06
0.524CysArg: 0.524 ± 0.075
0.438CysSer: 0.438 ± 0.08
0.588CysThr: 0.588 ± 0.08
0.738CysVal: 0.738 ± 0.096
0.214CysTrp: 0.214 ± 0.045
0.299CysTyr: 0.299 ± 0.055
0.0CysXaa: 0.0 ± 0.0
Asp
3.966AspAla: 3.966 ± 0.229
0.545AspCys: 0.545 ± 0.085
4.223AspAsp: 4.223 ± 0.231
4.565AspGlu: 4.565 ± 0.242
2.662AspPhe: 2.662 ± 0.168
4.458AspGly: 4.458 ± 0.266
1.208AspHis: 1.208 ± 0.115
4.383AspIle: 4.383 ± 0.243
3.849AspLys: 3.849 ± 0.251
5.848AspLeu: 5.848 ± 0.246
1.561AspMet: 1.561 ± 0.125
3.186AspAsn: 3.186 ± 0.157
3.4AspPro: 3.4 ± 0.198
2.288AspGln: 2.288 ± 0.167
3.079AspArg: 3.079 ± 0.193
3.025AspSer: 3.025 ± 0.172
3.645AspThr: 3.645 ± 0.193
4.886AspVal: 4.886 ± 0.244
0.812AspTrp: 0.812 ± 0.085
2.598AspTyr: 2.598 ± 0.193
0.0AspXaa: 0.0 ± 0.0
Glu
5.302GluAla: 5.302 ± 0.315
0.524GluCys: 0.524 ± 0.078
3.923GluAsp: 3.923 ± 0.238
5.025GluGlu: 5.025 ± 0.413
3.036GluPhe: 3.036 ± 0.172
3.656GluGly: 3.656 ± 0.214
1.614GluHis: 1.614 ± 0.12
4.565GluIle: 4.565 ± 0.247
3.528GluLys: 3.528 ± 0.198
7.195GluLeu: 7.195 ± 0.296
2.159GluMet: 2.159 ± 0.166
3.143GluAsn: 3.143 ± 0.195
2.117GluPro: 2.117 ± 0.203
3.164GluGln: 3.164 ± 0.196
3.474GluArg: 3.474 ± 0.223
3.571GluSer: 3.571 ± 0.235
4.148GluThr: 4.148 ± 0.305
5.089GluVal: 5.089 ± 0.227
1.165GluTrp: 1.165 ± 0.102
2.641GluTyr: 2.641 ± 0.171
0.0GluXaa: 0.0 ± 0.0
Phe
2.352PheAla: 2.352 ± 0.165
0.385PheCys: 0.385 ± 0.07
3.057PheAsp: 3.057 ± 0.21
2.641PheGlu: 2.641 ± 0.192
1.518PhePhe: 1.518 ± 0.122
2.78PheGly: 2.78 ± 0.203
0.951PheHis: 0.951 ± 0.114
2.897PheIle: 2.897 ± 0.165
2.737PheLys: 2.737 ± 0.178
2.715PheLeu: 2.715 ± 0.164
1.058PheMet: 1.058 ± 0.098
2.662PheAsn: 2.662 ± 0.179
1.529PhePro: 1.529 ± 0.116
1.4PheGln: 1.4 ± 0.124
2.127PheArg: 2.127 ± 0.153
2.576PheSer: 2.576 ± 0.183
2.534PheThr: 2.534 ± 0.154
3.068PheVal: 3.068 ± 0.172
0.406PheTrp: 0.406 ± 0.072
1.839PheTyr: 1.839 ± 0.14
0.0PheXaa: 0.0 ± 0.0
Gly
3.752GlyAla: 3.752 ± 0.374
0.588GlyCys: 0.588 ± 0.068
4.715GlyAsp: 4.715 ± 0.434
4.447GlyGlu: 4.447 ± 0.29
2.534GlyPhe: 2.534 ± 0.154
4.511GlyGly: 4.511 ± 0.332
1.187GlyHis: 1.187 ± 0.111
4.244GlyIle: 4.244 ± 0.221
4.148GlyLys: 4.148 ± 0.187
4.768GlyLeu: 4.768 ± 0.244
2.063GlyMet: 2.063 ± 0.178
3.603GlyAsn: 3.603 ± 0.194
1.817GlyPro: 1.817 ± 0.194
2.256GlyGln: 2.256 ± 0.172
3.303GlyArg: 3.303 ± 0.208
3.827GlySer: 3.827 ± 0.196
4.148GlyThr: 4.148 ± 0.275
4.404GlyVal: 4.404 ± 0.23
1.112GlyTrp: 1.112 ± 0.099
2.822GlyTyr: 2.822 ± 0.189
0.0GlyXaa: 0.0 ± 0.0
His
1.197HisAla: 1.197 ± 0.108
0.267HisCys: 0.267 ± 0.049
1.326HisAsp: 1.326 ± 0.122
1.379HisGlu: 1.379 ± 0.127
0.919HisPhe: 0.919 ± 0.129
1.55HisGly: 1.55 ± 0.123
0.492HisHis: 0.492 ± 0.082
1.486HisIle: 1.486 ± 0.122
1.037HisLys: 1.037 ± 0.12
1.614HisLeu: 1.614 ± 0.13
0.502HisMet: 0.502 ± 0.08
0.866HisAsn: 0.866 ± 0.088
1.187HisPro: 1.187 ± 0.118
0.652HisGln: 0.652 ± 0.081
1.39HisArg: 1.39 ± 0.136
0.919HisSer: 0.919 ± 0.102
1.08HisThr: 1.08 ± 0.115
1.379HisVal: 1.379 ± 0.129
0.353HisTrp: 0.353 ± 0.064
0.962HisTyr: 0.962 ± 0.109
0.0HisXaa: 0.0 ± 0.0
Ile
5.003IleAla: 5.003 ± 0.23
0.513IleCys: 0.513 ± 0.091
4.971IleAsp: 4.971 ± 0.209
4.982IleGlu: 4.982 ± 0.245
2.063IlePhe: 2.063 ± 0.154
3.891IleGly: 3.891 ± 0.212
1.272IleHis: 1.272 ± 0.121
3.763IleIle: 3.763 ± 0.199
3.581IleLys: 3.581 ± 0.214
4.725IleLeu: 4.725 ± 0.21
1.4IleMet: 1.4 ± 0.124
3.506IleAsn: 3.506 ± 0.169
3.474IlePro: 3.474 ± 0.207
2.363IleGln: 2.363 ± 0.151
3.849IleArg: 3.849 ± 0.209
4.105IleSer: 4.105 ± 0.231
4.212IleThr: 4.212 ± 0.222
4.212IleVal: 4.212 ± 0.238
0.812IleTrp: 0.812 ± 0.1
2.288IleTyr: 2.288 ± 0.187
0.0IleXaa: 0.0 ± 0.0
Lys
5.046LysAla: 5.046 ± 0.261
0.396LysCys: 0.396 ± 0.076
3.378LysAsp: 3.378 ± 0.211
4.747LysGlu: 4.747 ± 0.256
2.641LysPhe: 2.641 ± 0.151
3.998LysGly: 3.998 ± 0.437
1.443LysHis: 1.443 ± 0.13
3.196LysIle: 3.196 ± 0.156
3.357LysLys: 3.357 ± 0.264
5.27LysLeu: 5.27 ± 0.238
1.978LysMet: 1.978 ± 0.133
2.502LysAsn: 2.502 ± 0.143
2.159LysPro: 2.159 ± 0.155
1.743LysGln: 1.743 ± 0.144
2.897LysArg: 2.897 ± 0.171
3.004LysSer: 3.004 ± 0.18
3.164LysThr: 3.164 ± 0.203
4.415LysVal: 4.415 ± 0.205
0.855LysTrp: 0.855 ± 0.087
2.32LysTyr: 2.32 ± 0.15
0.0LysXaa: 0.0 ± 0.0
Leu
5.933LeuAla: 5.933 ± 0.266
0.909LeuCys: 0.909 ± 0.097
5.869LeuAsp: 5.869 ± 0.28
5.901LeuGlu: 5.901 ± 0.293
3.325LeuPhe: 3.325 ± 0.216
5.099LeuGly: 5.099 ± 0.233
1.796LeuHis: 1.796 ± 0.139
5.003LeuIle: 5.003 ± 0.239
4.95LeuLys: 4.95 ± 0.24
5.944LeuLeu: 5.944 ± 0.311
2.117LeuMet: 2.117 ± 0.135
4.308LeuAsn: 4.308 ± 0.238
3.955LeuPro: 3.955 ± 0.189
2.78LeuGln: 2.78 ± 0.155
5.025LeuArg: 5.025 ± 0.263
5.356LeuSer: 5.356 ± 0.234
5.399LeuThr: 5.399 ± 0.223
5.623LeuVal: 5.623 ± 0.231
0.951LeuTrp: 0.951 ± 0.104
3.079LeuTyr: 3.079 ± 0.199
0.0LeuXaa: 0.0 ± 0.0
Met
2.459MetAla: 2.459 ± 0.161
0.235MetCys: 0.235 ± 0.051
1.646MetAsp: 1.646 ± 0.129
1.753MetGlu: 1.753 ± 0.153
1.379MetPhe: 1.379 ± 0.111
1.807MetGly: 1.807 ± 0.159
0.609MetHis: 0.609 ± 0.083
1.636MetIle: 1.636 ± 0.131
1.582MetLys: 1.582 ± 0.122
1.988MetLeu: 1.988 ± 0.151
0.845MetMet: 0.845 ± 0.157
1.443MetAsn: 1.443 ± 0.13
1.133MetPro: 1.133 ± 0.112
0.984MetGln: 0.984 ± 0.118
1.497MetArg: 1.497 ± 0.145
2.405MetSer: 2.405 ± 0.152
1.753MetThr: 1.753 ± 0.14
2.063MetVal: 2.063 ± 0.135
0.246MetTrp: 0.246 ± 0.055
1.155MetTyr: 1.155 ± 0.118
0.0MetXaa: 0.0 ± 0.0
Asn
3.656AsnAla: 3.656 ± 0.251
0.513AsnCys: 0.513 ± 0.074
2.801AsnAsp: 2.801 ± 0.137
3.207AsnGlu: 3.207 ± 0.185
1.839AsnPhe: 1.839 ± 0.159
4.18AsnGly: 4.18 ± 0.28
1.005AsnHis: 1.005 ± 0.103
3.325AsnIle: 3.325 ± 0.186
3.025AsnLys: 3.025 ± 0.156
3.966AsnLeu: 3.966 ± 0.193
1.433AsnMet: 1.433 ± 0.121
3.4AsnAsn: 3.4 ± 0.246
2.94AsnPro: 2.94 ± 0.208
1.732AsnGln: 1.732 ± 0.136
2.63AsnArg: 2.63 ± 0.162
3.196AsnSer: 3.196 ± 0.207
3.079AsnThr: 3.079 ± 0.216
3.613AsnVal: 3.613 ± 0.201
0.791AsnTrp: 0.791 ± 0.082
1.999AsnTyr: 1.999 ± 0.128
0.0AsnXaa: 0.0 ± 0.0
Pro
2.94ProAla: 2.94 ± 0.219
0.385ProCys: 0.385 ± 0.059
2.673ProAsp: 2.673 ± 0.194
3.303ProGlu: 3.303 ± 0.218
1.86ProPhe: 1.86 ± 0.119
2.48ProGly: 2.48 ± 0.167
0.674ProHis: 0.674 ± 0.088
2.523ProIle: 2.523 ± 0.129
2.245ProLys: 2.245 ± 0.146
3.229ProLeu: 3.229 ± 0.197
1.09ProMet: 1.09 ± 0.085
2.063ProAsn: 2.063 ± 0.153
1.368ProPro: 1.368 ± 0.124
1.358ProGln: 1.358 ± 0.132
1.946ProArg: 1.946 ± 0.148
2.363ProSer: 2.363 ± 0.164
2.951ProThr: 2.951 ± 0.178
3.389ProVal: 3.389 ± 0.247
0.449ProTrp: 0.449 ± 0.079
1.443ProTyr: 1.443 ± 0.12
0.0ProXaa: 0.0 ± 0.0
Gln
2.673GlnAla: 2.673 ± 0.219
0.396GlnCys: 0.396 ± 0.061
1.646GlnAsp: 1.646 ± 0.145
2.288GlnGlu: 2.288 ± 0.18
1.764GlnPhe: 1.764 ± 0.138
2.202GlnGly: 2.202 ± 0.186
0.845GlnHis: 0.845 ± 0.108
2.106GlnIle: 2.106 ± 0.155
1.604GlnLys: 1.604 ± 0.144
3.913GlnLeu: 3.913 ± 0.175
1.101GlnMet: 1.101 ± 0.12
1.978GlnAsn: 1.978 ± 0.139
1.208GlnPro: 1.208 ± 0.14
1.689GlnGln: 1.689 ± 0.197
2.149GlnArg: 2.149 ± 0.134
1.657GlnSer: 1.657 ± 0.155
1.839GlnThr: 1.839 ± 0.154
2.331GlnVal: 2.331 ± 0.171
0.545GlnTrp: 0.545 ± 0.081
1.593GlnTyr: 1.593 ± 0.144
0.0GlnXaa: 0.0 ± 0.0
Arg
3.389ArgAla: 3.389 ± 0.213
0.428ArgCys: 0.428 ± 0.073
3.635ArgAsp: 3.635 ± 0.2
3.207ArgGlu: 3.207 ± 0.208
2.448ArgPhe: 2.448 ± 0.16
2.876ArgGly: 2.876 ± 0.17
0.93ArgHis: 0.93 ± 0.114
3.41ArgIle: 3.41 ± 0.204
3.432ArgLys: 3.432 ± 0.21
4.886ArgLeu: 4.886 ± 0.257
1.518ArgMet: 1.518 ± 0.127
2.844ArgAsn: 2.844 ± 0.158
1.849ArgPro: 1.849 ± 0.141
1.999ArgGln: 1.999 ± 0.13
3.1ArgArg: 3.1 ± 0.185
3.122ArgSer: 3.122 ± 0.2
2.683ArgThr: 2.683 ± 0.156
3.998ArgVal: 3.998 ± 0.201
0.898ArgTrp: 0.898 ± 0.094
2.245ArgTyr: 2.245 ± 0.181
0.0ArgXaa: 0.0 ± 0.0
Ser
3.795SerAla: 3.795 ± 0.243
0.577SerCys: 0.577 ± 0.094
3.613SerAsp: 3.613 ± 0.22
3.239SerGlu: 3.239 ± 0.204
2.598SerPhe: 2.598 ± 0.189
3.923SerGly: 3.923 ± 0.205
1.144SerHis: 1.144 ± 0.126
4.319SerIle: 4.319 ± 0.291
3.581SerLys: 3.581 ± 0.168
4.95SerLeu: 4.95 ± 0.211
1.753SerMet: 1.753 ± 0.151
3.25SerAsn: 3.25 ± 0.202
2.234SerPro: 2.234 ± 0.137
1.817SerGln: 1.817 ± 0.166
3.025SerArg: 3.025 ± 0.168
3.571SerSer: 3.571 ± 0.226
3.688SerThr: 3.688 ± 0.21
4.191SerVal: 4.191 ± 0.193
0.887SerTrp: 0.887 ± 0.119
2.106SerTyr: 2.106 ± 0.161
0.0SerXaa: 0.0 ± 0.0
Thr
4.244ThrAla: 4.244 ± 0.233
0.492ThrCys: 0.492 ± 0.074
3.923ThrAsp: 3.923 ± 0.187
4.094ThrGlu: 4.094 ± 0.256
2.683ThrPhe: 2.683 ± 0.155
4.362ThrGly: 4.362 ± 0.263
1.037ThrHis: 1.037 ± 0.122
3.945ThrIle: 3.945 ± 0.221
3.025ThrLys: 3.025 ± 0.183
5.078ThrLeu: 5.078 ± 0.233
1.443ThrMet: 1.443 ± 0.123
2.961ThrAsn: 2.961 ± 0.202
3.015ThrPro: 3.015 ± 0.194
2.32ThrGln: 2.32 ± 0.18
3.1ThrArg: 3.1 ± 0.17
3.496ThrSer: 3.496 ± 0.206
3.752ThrThr: 3.752 ± 0.247
4.629ThrVal: 4.629 ± 0.193
0.941ThrTrp: 0.941 ± 0.094
2.373ThrTyr: 2.373 ± 0.155
0.0ThrXaa: 0.0 ± 0.0
Val
4.715ValAla: 4.715 ± 0.227
0.641ValCys: 0.641 ± 0.093
5.206ValAsp: 5.206 ± 0.215
5.345ValGlu: 5.345 ± 0.314
2.865ValPhe: 2.865 ± 0.145
4.105ValGly: 4.105 ± 0.258
1.433ValHis: 1.433 ± 0.14
4.864ValIle: 4.864 ± 0.24
4.928ValLys: 4.928 ± 0.253
5.42ValLeu: 5.42 ± 0.232
2.288ValMet: 2.288 ± 0.161
3.913ValAsn: 3.913 ± 0.237
2.854ValPro: 2.854 ± 0.194
2.384ValGln: 2.384 ± 0.166
3.667ValArg: 3.667 ± 0.194
4.298ValSer: 4.298 ± 0.244
4.511ValThr: 4.511 ± 0.236
4.896ValVal: 4.896 ± 0.257
0.716ValTrp: 0.716 ± 0.097
2.715ValTyr: 2.715 ± 0.19
0.0ValXaa: 0.0 ± 0.0
Trp
0.78TrpAla: 0.78 ± 0.083
0.15TrpCys: 0.15 ± 0.04
0.738TrpAsp: 0.738 ± 0.094
0.941TrpGlu: 0.941 ± 0.096
0.674TrpPhe: 0.674 ± 0.091
0.695TrpGly: 0.695 ± 0.072
0.299TrpHis: 0.299 ± 0.05
0.93TrpIle: 0.93 ± 0.095
0.951TrpLys: 0.951 ± 0.109
1.326TrpLeu: 1.326 ± 0.12
0.428TrpMet: 0.428 ± 0.07
0.716TrpAsn: 0.716 ± 0.084
0.353TrpPro: 0.353 ± 0.064
0.374TrpGln: 0.374 ± 0.061
0.727TrpArg: 0.727 ± 0.086
0.802TrpSer: 0.802 ± 0.086
0.93TrpThr: 0.93 ± 0.089
1.219TrpVal: 1.219 ± 0.111
0.139TrpTrp: 0.139 ± 0.038
0.545TrpTyr: 0.545 ± 0.078
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.491TyrAla: 2.491 ± 0.171
0.47TyrCys: 0.47 ± 0.067
2.683TyrAsp: 2.683 ± 0.179
2.523TyrGlu: 2.523 ± 0.19
1.497TyrPhe: 1.497 ± 0.126
2.352TyrGly: 2.352 ± 0.157
1.101TyrHis: 1.101 ± 0.129
2.651TyrIle: 2.651 ± 0.196
2.384TyrLys: 2.384 ± 0.151
3.688TyrLeu: 3.688 ± 0.224
1.24TyrMet: 1.24 ± 0.12
2.106TyrAsn: 2.106 ± 0.157
1.465TyrPro: 1.465 ± 0.121
1.454TyrGln: 1.454 ± 0.117
2.416TyrArg: 2.416 ± 0.177
2.266TyrSer: 2.266 ± 0.175
2.256TyrThr: 2.256 ± 0.164
2.459TyrVal: 2.459 ± 0.176
0.502TyrTrp: 0.502 ± 0.078
1.86TyrTyr: 1.86 ± 0.167
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 389 proteins (93542 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski