Amino acid dipepetide frequency for Erwinia phage vB_EamM_Yoloswag

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.566AlaAla: 8.566 ± 0.536
0.923AlaCys: 0.923 ± 0.097
6.148AlaAsp: 6.148 ± 0.367
5.225AlaGlu: 5.225 ± 0.267
3.074AlaPhe: 3.074 ± 0.194
5.091AlaGly: 5.091 ± 0.302
1.385AlaHis: 1.385 ± 0.121
4.423AlaIle: 4.423 ± 0.227
5.249AlaLys: 5.249 ± 0.416
6.78AlaLeu: 6.78 ± 0.305
2.576AlaMet: 2.576 ± 0.176
3.961AlaAsn: 3.961 ± 0.264
2.94AlaPro: 2.94 ± 0.2
3.718AlaGln: 3.718 ± 0.211
5.067AlaArg: 5.067 ± 0.264
5.383AlaSer: 5.383 ± 0.255
5.334AlaThr: 5.334 ± 0.346
6.306AlaVal: 6.306 ± 0.296
0.729AlaTrp: 0.729 ± 0.079
2.576AlaTyr: 2.576 ± 0.175
0.0AlaXaa: 0.0 ± 0.0
Cys
0.96CysAla: 0.96 ± 0.094
0.219CysCys: 0.219 ± 0.057
0.656CysAsp: 0.656 ± 0.087
0.644CysGlu: 0.644 ± 0.095
0.668CysPhe: 0.668 ± 0.095
0.948CysGly: 0.948 ± 0.113
0.243CysHis: 0.243 ± 0.061
0.583CysIle: 0.583 ± 0.093
0.571CysLys: 0.571 ± 0.083
0.948CysLeu: 0.948 ± 0.111
0.316CysMet: 0.316 ± 0.065
0.352CysAsn: 0.352 ± 0.077
0.571CysPro: 0.571 ± 0.089
0.328CysGln: 0.328 ± 0.06
0.753CysArg: 0.753 ± 0.098
0.753CysSer: 0.753 ± 0.103
0.535CysThr: 0.535 ± 0.098
0.899CysVal: 0.899 ± 0.114
0.122CysTrp: 0.122 ± 0.039
0.352CysTyr: 0.352 ± 0.063
0.0CysXaa: 0.0 ± 0.0
Asp
5.273AspAla: 5.273 ± 0.277
0.887AspCys: 0.887 ± 0.103
5.638AspAsp: 5.638 ± 0.541
5.638AspGlu: 5.638 ± 0.809
2.77AspPhe: 2.77 ± 0.171
4.241AspGly: 4.241 ± 0.329
1.264AspHis: 1.264 ± 0.123
3.39AspIle: 3.39 ± 0.193
3.22AspLys: 3.22 ± 0.209
6.075AspLeu: 6.075 ± 0.255
1.762AspMet: 1.762 ± 0.174
2.746AspAsn: 2.746 ± 0.183
2.685AspPro: 2.685 ± 0.203
2.625AspGln: 2.625 ± 0.194
3.098AspArg: 3.098 ± 0.19
4.52AspSer: 4.52 ± 0.258
3.973AspThr: 3.973 ± 0.219
5.03AspVal: 5.03 ± 0.25
1.057AspTrp: 1.057 ± 0.124
2.6AspTyr: 2.6 ± 0.151
0.0AspXaa: 0.0 ± 0.0
Glu
4.253GluAla: 4.253 ± 0.286
0.583GluCys: 0.583 ± 0.087
4.799GluAsp: 4.799 ± 0.731
4.253GluGlu: 4.253 ± 0.554
2.539GluPhe: 2.539 ± 0.168
3.062GluGly: 3.062 ± 0.157
1.458GluHis: 1.458 ± 0.129
3.888GluIle: 3.888 ± 0.201
2.685GluLys: 2.685 ± 0.244
5.954GluLeu: 5.954 ± 0.309
1.397GluMet: 1.397 ± 0.121
2.406GluAsn: 2.406 ± 0.179
2.248GluPro: 2.248 ± 0.171
3.305GluGln: 3.305 ± 0.249
3.536GluArg: 3.536 ± 0.221
3.512GluSer: 3.512 ± 0.235
3.244GluThr: 3.244 ± 0.211
3.9GluVal: 3.9 ± 0.212
0.693GluTrp: 0.693 ± 0.086
1.932GluTyr: 1.932 ± 0.161
0.0GluXaa: 0.0 ± 0.0
Phe
3.013PheAla: 3.013 ± 0.177
0.474PheCys: 0.474 ± 0.081
3.803PheAsp: 3.803 ± 0.205
2.697PheGlu: 2.697 ± 0.185
1.045PhePhe: 1.045 ± 0.119
2.697PheGly: 2.697 ± 0.193
0.535PheHis: 0.535 ± 0.08
2.09PheIle: 2.09 ± 0.172
2.041PheLys: 2.041 ± 0.152
2.94PheLeu: 2.94 ± 0.182
0.948PheMet: 0.948 ± 0.1
1.762PheAsn: 1.762 ± 0.153
1.373PhePro: 1.373 ± 0.131
1.13PheGln: 1.13 ± 0.13
2.053PheArg: 2.053 ± 0.167
2.491PheSer: 2.491 ± 0.193
2.467PheThr: 2.467 ± 0.168
2.868PheVal: 2.868 ± 0.172
0.413PheTrp: 0.413 ± 0.072
1.142PheTyr: 1.142 ± 0.104
0.0PheXaa: 0.0 ± 0.0
Gly
4.666GlyAla: 4.666 ± 0.274
0.51GlyCys: 0.51 ± 0.073
3.779GlyAsp: 3.779 ± 0.211
3.39GlyGlu: 3.39 ± 0.188
2.588GlyPhe: 2.588 ± 0.184
4.326GlyGly: 4.326 ± 0.333
1.227GlyHis: 1.227 ± 0.12
3.572GlyIle: 3.572 ± 0.217
3.973GlyLys: 3.973 ± 0.271
4.812GlyLeu: 4.812 ± 0.27
1.592GlyMet: 1.592 ± 0.129
2.673GlyAsn: 2.673 ± 0.198
2.09GlyPro: 2.09 ± 0.224
3.159GlyGln: 3.159 ± 0.178
3.536GlyArg: 3.536 ± 0.215
4.484GlySer: 4.484 ± 0.312
4.872GlyThr: 4.872 ± 0.277
4.253GlyVal: 4.253 ± 0.236
0.972GlyTrp: 0.972 ± 0.102
2.309GlyTyr: 2.309 ± 0.169
0.0GlyXaa: 0.0 ± 0.0
His
1.397HisAla: 1.397 ± 0.142
0.316HisCys: 0.316 ± 0.061
1.264HisAsp: 1.264 ± 0.115
1.142HisGlu: 1.142 ± 0.132
0.535HisPhe: 0.535 ± 0.086
1.227HisGly: 1.227 ± 0.122
0.522HisHis: 0.522 ± 0.094
1.142HisIle: 1.142 ± 0.142
0.996HisLys: 0.996 ± 0.112
1.665HisLeu: 1.665 ± 0.151
0.595HisMet: 0.595 ± 0.084
1.118HisAsn: 1.118 ± 0.143
0.765HisPro: 0.765 ± 0.099
0.693HisGln: 0.693 ± 0.096
1.3HisArg: 1.3 ± 0.157
1.142HisSer: 1.142 ± 0.139
1.081HisThr: 1.081 ± 0.111
1.847HisVal: 1.847 ± 0.145
0.243HisTrp: 0.243 ± 0.053
0.632HisTyr: 0.632 ± 0.088
0.0HisXaa: 0.0 ± 0.0
Ile
4.641IleAla: 4.641 ± 0.244
0.632IleCys: 0.632 ± 0.077
4.994IleAsp: 4.994 ± 0.246
4.07IleGlu: 4.07 ± 0.248
1.58IlePhe: 1.58 ± 0.145
3.062IleGly: 3.062 ± 0.212
0.972IleHis: 0.972 ± 0.099
2.479IleIle: 2.479 ± 0.171
3.244IleLys: 3.244 ± 0.237
3.621IleLeu: 3.621 ± 0.201
1.337IleMet: 1.337 ± 0.132
3.013IleAsn: 3.013 ± 0.227
2.382IlePro: 2.382 ± 0.173
2.236IleGln: 2.236 ± 0.187
3.22IleArg: 3.22 ± 0.192
3.73IleSer: 3.73 ± 0.195
3.487IleThr: 3.487 ± 0.185
4.253IleVal: 4.253 ± 0.244
0.413IleTrp: 0.413 ± 0.066
1.47IleTyr: 1.47 ± 0.131
0.0IleXaa: 0.0 ± 0.0
Lys
4.921LysAla: 4.921 ± 0.434
0.425LysCys: 0.425 ± 0.083
2.539LysAsp: 2.539 ± 0.212
2.552LysGlu: 2.552 ± 0.185
2.199LysPhe: 2.199 ± 0.183
3.305LysGly: 3.305 ± 0.269
1.118LysHis: 1.118 ± 0.12
3.098LysIle: 3.098 ± 0.202
4.617LysLys: 4.617 ± 0.544
4.727LysLeu: 4.727 ± 0.272
1.361LysMet: 1.361 ± 0.125
2.418LysAsn: 2.418 ± 0.185
2.527LysPro: 2.527 ± 0.22
3.001LysGln: 3.001 ± 0.23
3.098LysArg: 3.098 ± 0.238
3.682LysSer: 3.682 ± 0.239
3.73LysThr: 3.73 ± 0.244
3.354LysVal: 3.354 ± 0.217
0.62LysTrp: 0.62 ± 0.091
1.981LysTyr: 1.981 ± 0.171
0.0LysXaa: 0.0 ± 0.0
Leu
7.023LeuAla: 7.023 ± 0.244
0.996LeuCys: 0.996 ± 0.118
5.711LeuAsp: 5.711 ± 0.273
4.666LeuGlu: 4.666 ± 0.276
3.013LeuPhe: 3.013 ± 0.216
4.933LeuGly: 4.933 ± 0.239
1.847LeuHis: 1.847 ± 0.15
4.216LeuIle: 4.216 ± 0.249
4.471LeuLys: 4.471 ± 0.307
6.015LeuLeu: 6.015 ± 0.318
2.199LeuMet: 2.199 ± 0.188
4.386LeuAsn: 4.386 ± 0.261
3.949LeuPro: 3.949 ± 0.216
2.77LeuGln: 2.77 ± 0.183
5.067LeuArg: 5.067 ± 0.239
5.978LeuSer: 5.978 ± 0.273
5.261LeuThr: 5.261 ± 0.298
5.395LeuVal: 5.395 ± 0.219
0.814LeuTrp: 0.814 ± 0.106
2.479LeuTyr: 2.479 ± 0.194
0.0LeuXaa: 0.0 ± 0.0
Met
2.09MetAla: 2.09 ± 0.192
0.389MetCys: 0.389 ± 0.066
1.094MetAsp: 1.094 ± 0.12
1.045MetGlu: 1.045 ± 0.115
1.373MetPhe: 1.373 ± 0.12
1.13MetGly: 1.13 ± 0.118
0.559MetHis: 0.559 ± 0.093
1.665MetIle: 1.665 ± 0.138
1.58MetLys: 1.58 ± 0.154
2.6MetLeu: 2.6 ± 0.183
0.656MetMet: 0.656 ± 0.103
1.166MetAsn: 1.166 ± 0.144
1.495MetPro: 1.495 ± 0.113
1.519MetGln: 1.519 ± 0.152
1.64MetArg: 1.64 ± 0.141
2.114MetSer: 2.114 ± 0.161
1.883MetThr: 1.883 ± 0.155
1.118MetVal: 1.118 ± 0.119
0.328MetTrp: 0.328 ± 0.057
0.948MetTyr: 0.948 ± 0.105
0.0MetXaa: 0.0 ± 0.0
Asn
4.083AsnAla: 4.083 ± 0.222
0.559AsnCys: 0.559 ± 0.094
2.357AsnAsp: 2.357 ± 0.193
2.296AsnGlu: 2.296 ± 0.155
1.956AsnPhe: 1.956 ± 0.152
3.499AsnGly: 3.499 ± 0.226
0.802AsnHis: 0.802 ± 0.095
2.758AsnIle: 2.758 ± 0.196
2.467AsnLys: 2.467 ± 0.211
3.912AsnLeu: 3.912 ± 0.234
1.166AsnMet: 1.166 ± 0.125
2.43AsnAsn: 2.43 ± 0.273
2.224AsnPro: 2.224 ± 0.158
1.908AsnGln: 1.908 ± 0.166
2.454AsnArg: 2.454 ± 0.174
3.22AsnSer: 3.22 ± 0.241
3.159AsnThr: 3.159 ± 0.237
3.281AsnVal: 3.281 ± 0.215
0.474AsnTrp: 0.474 ± 0.083
1.677AsnTyr: 1.677 ± 0.171
0.0AsnXaa: 0.0 ± 0.0
Pro
3.572ProAla: 3.572 ± 0.268
0.45ProCys: 0.45 ± 0.071
2.88ProAsp: 2.88 ± 0.194
2.637ProGlu: 2.637 ± 0.173
1.58ProPhe: 1.58 ± 0.141
2.102ProGly: 2.102 ± 0.169
0.899ProHis: 0.899 ± 0.107
2.236ProIle: 2.236 ± 0.167
2.722ProLys: 2.722 ± 0.213
2.588ProLeu: 2.588 ± 0.159
0.863ProMet: 0.863 ± 0.102
1.908ProAsn: 1.908 ± 0.142
1.179ProPro: 1.179 ± 0.129
2.029ProGln: 2.029 ± 0.145
2.296ProArg: 2.296 ± 0.174
2.418ProSer: 2.418 ± 0.184
2.953ProThr: 2.953 ± 0.216
3.597ProVal: 3.597 ± 0.205
0.292ProTrp: 0.292 ± 0.055
1.409ProTyr: 1.409 ± 0.132
0.0ProXaa: 0.0 ± 0.0
Gln
4.034GlnAla: 4.034 ± 0.245
0.328GlnCys: 0.328 ± 0.066
2.442GlnAsp: 2.442 ± 0.152
1.932GlnGlu: 1.932 ± 0.156
1.798GlnPhe: 1.798 ± 0.132
2.503GlnGly: 2.503 ± 0.191
1.166GlnHis: 1.166 ± 0.125
2.649GlnIle: 2.649 ± 0.203
1.798GlnLys: 1.798 ± 0.152
3.694GlnLeu: 3.694 ± 0.236
1.397GlnMet: 1.397 ± 0.144
1.75GlnAsn: 1.75 ± 0.15
1.883GlnPro: 1.883 ± 0.16
2.673GlnGln: 2.673 ± 0.183
2.953GlnArg: 2.953 ± 0.207
2.564GlnSer: 2.564 ± 0.191
3.135GlnThr: 3.135 ± 0.227
2.673GlnVal: 2.673 ± 0.186
0.535GlnTrp: 0.535 ± 0.072
1.798GlnTyr: 1.798 ± 0.14
0.0GlnXaa: 0.0 ± 0.0
Arg
4.848ArgAla: 4.848 ± 0.239
0.802ArgCys: 0.802 ± 0.116
3.682ArgAsp: 3.682 ± 0.235
3.098ArgGlu: 3.098 ± 0.213
2.126ArgPhe: 2.126 ± 0.146
3.378ArgGly: 3.378 ± 0.252
1.227ArgHis: 1.227 ± 0.114
3.354ArgIle: 3.354 ± 0.244
3.341ArgLys: 3.341 ± 0.238
4.982ArgLeu: 4.982 ± 0.261
1.713ArgMet: 1.713 ± 0.135
2.673ArgAsn: 2.673 ± 0.164
2.066ArgPro: 2.066 ± 0.166
2.588ArgGln: 2.588 ± 0.175
3.56ArgArg: 3.56 ± 0.296
3.524ArgSer: 3.524 ± 0.231
3.086ArgThr: 3.086 ± 0.225
4.07ArgVal: 4.07 ± 0.217
0.535ArgTrp: 0.535 ± 0.068
2.236ArgTyr: 2.236 ± 0.18
0.0ArgXaa: 0.0 ± 0.0
Ser
5.456SerAla: 5.456 ± 0.327
0.887SerCys: 0.887 ± 0.123
4.872SerAsp: 4.872 ± 0.264
4.01SerGlu: 4.01 ± 0.275
2.467SerPhe: 2.467 ± 0.186
5.334SerGly: 5.334 ± 0.269
1.069SerHis: 1.069 ± 0.118
3.742SerIle: 3.742 ± 0.242
3.366SerLys: 3.366 ± 0.222
5.261SerLeu: 5.261 ± 0.255
1.81SerMet: 1.81 ± 0.154
3.183SerAsn: 3.183 ± 0.213
2.418SerPro: 2.418 ± 0.157
2.467SerGln: 2.467 ± 0.185
3.536SerArg: 3.536 ± 0.233
4.799SerSer: 4.799 ± 0.316
4.07SerThr: 4.07 ± 0.253
5.322SerVal: 5.322 ± 0.26
0.729SerTrp: 0.729 ± 0.085
2.296SerTyr: 2.296 ± 0.205
0.0SerXaa: 0.0 ± 0.0
Thr
6.561ThrAla: 6.561 ± 0.374
0.535ThrCys: 0.535 ± 0.081
4.107ThrAsp: 4.107 ± 0.248
3.609ThrGlu: 3.609 ± 0.202
2.333ThrPhe: 2.333 ± 0.17
4.569ThrGly: 4.569 ± 0.302
1.118ThrHis: 1.118 ± 0.126
3.609ThrIle: 3.609 ± 0.242
3.001ThrLys: 3.001 ± 0.245
5.31ThrLeu: 5.31 ± 0.258
1.58ThrMet: 1.58 ± 0.141
2.71ThrAsn: 2.71 ± 0.222
2.418ThrPro: 2.418 ± 0.18
2.394ThrGln: 2.394 ± 0.221
2.625ThrArg: 2.625 ± 0.22
4.326ThrSer: 4.326 ± 0.22
4.119ThrThr: 4.119 ± 0.271
6.172ThrVal: 6.172 ± 0.38
0.571ThrTrp: 0.571 ± 0.084
2.102ThrTyr: 2.102 ± 0.259
0.0ThrXaa: 0.0 ± 0.0
Val
6.124ValAla: 6.124 ± 0.307
0.753ValCys: 0.753 ± 0.093
4.714ValAsp: 4.714 ± 0.249
4.593ValGlu: 4.593 ± 0.273
2.612ValPhe: 2.612 ± 0.194
4.228ValGly: 4.228 ± 0.275
1.239ValHis: 1.239 ± 0.132
3.56ValIle: 3.56 ± 0.191
3.742ValLys: 3.742 ± 0.218
5.857ValLeu: 5.857 ± 0.273
1.689ValMet: 1.689 ± 0.14
3.803ValAsn: 3.803 ± 0.206
3.803ValPro: 3.803 ± 0.245
3.098ValGln: 3.098 ± 0.182
4.52ValArg: 4.52 ± 0.279
5.273ValSer: 5.273 ± 0.265
4.641ValThr: 4.641 ± 0.342
5.528ValVal: 5.528 ± 0.348
0.802ValTrp: 0.802 ± 0.098
2.625ValTyr: 2.625 ± 0.19
0.0ValXaa: 0.0 ± 0.0
Trp
0.863TrpAla: 0.863 ± 0.11
0.182TrpCys: 0.182 ± 0.045
0.474TrpAsp: 0.474 ± 0.077
0.559TrpGlu: 0.559 ± 0.079
0.437TrpPhe: 0.437 ± 0.07
0.559TrpGly: 0.559 ± 0.086
0.292TrpHis: 0.292 ± 0.063
0.62TrpIle: 0.62 ± 0.09
0.547TrpLys: 0.547 ± 0.071
0.923TrpLeu: 0.923 ± 0.105
0.462TrpMet: 0.462 ± 0.072
0.462TrpAsn: 0.462 ± 0.073
0.535TrpPro: 0.535 ± 0.076
0.644TrpGln: 0.644 ± 0.088
0.583TrpArg: 0.583 ± 0.083
0.936TrpSer: 0.936 ± 0.116
0.693TrpThr: 0.693 ± 0.101
0.705TrpVal: 0.705 ± 0.091
0.085TrpTrp: 0.085 ± 0.028
0.279TrpTyr: 0.279 ± 0.061
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.111TyrAla: 3.111 ± 0.2
0.571TyrCys: 0.571 ± 0.079
2.539TyrAsp: 2.539 ± 0.165
1.592TyrGlu: 1.592 ± 0.132
1.288TyrPhe: 1.288 ± 0.126
2.588TyrGly: 2.588 ± 0.172
0.68TyrHis: 0.68 ± 0.109
1.762TyrIle: 1.762 ± 0.163
1.64TyrLys: 1.64 ± 0.132
2.527TyrLeu: 2.527 ± 0.19
0.96TyrMet: 0.96 ± 0.114
1.762TyrAsn: 1.762 ± 0.171
1.057TyrPro: 1.057 ± 0.117
1.422TyrGln: 1.422 ± 0.153
1.993TyrArg: 1.993 ± 0.156
2.248TyrSer: 2.248 ± 0.158
2.041TyrThr: 2.041 ± 0.187
2.649TyrVal: 2.649 ± 0.182
0.413TyrTrp: 0.413 ± 0.075
1.276TyrTyr: 1.276 ± 0.128
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 333 proteins (82302 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski