Amino acid dipeptide frequency for Gordonia phage RedWattleHog

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
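
As an illustration, dipeptide frequencies could be computed along the following lines. This is a minimal sketch and not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the choice to count overlapping pairs within each protein only, and the normalization by the total number of pairs are all assumptions made for the example.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (hypothetical, for illustration): pairs are overlapping,
    never span two proteins, and any non-standard letter maps to Xaa.
    """
    counts = Counter()
    for seq in proteins:
        codes = [THREE_LETTER.get(res, "Xaa") for res in seq.upper()]
        # Walk over adjacent residues: (1st, 2nd), (2nd, 3rd), ...
        for first, second in zip(codes, codes[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the RedWattleHog proteome.
freqs = dipeptide_per_mille(["MAAK", "MAC"])
print(freqs["MetAla"])  # 2 of the 5 pairs
```

The toy input contains five pairs in total (MetAla twice, AlaAla, AlaLys, AlaCys), so the frequencies sum to 1000‰.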

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
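
The uniform baseline and the over/under classification can be sketched as follows. The function `classify` and the constant names are hypothetical. The page does not say whether the colouring uses a tolerance or takes the error estimates into account, so the simple comparison against the baseline is an assumption.

```python
# Uniform expectation in per mille: every dipeptide equally likely.
EXPECTED_20 = 1000.0 / (20 * 20)  # 400 dipeptides -> 2.5 per mille
EXPECTED_21 = 1000.0 / (21 * 21)  # 441 dipeptides (with Xaa)


def classify(observed_per_mille, expected=EXPECTED_20):
    """Label a dipeptide relative to the uniform baseline (assumed rule)."""
    if observed_per_mille > expected:
        return "over-represented"
    if observed_per_mille < expected:
        return "under-represented"
    return "as expected"


# Two values taken from the table on this page.
print(classify(11.32))  # AlaAla
print(classify(0.139))  # CysCys
```

With these inputs, AlaAla (11.32‰) falls above the 2.5‰ baseline and CysCys (0.139‰) falls below it.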

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
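
A short sketch of the unit conversion, using the AlaAla value from the table. The helper name `per_mille_to_fraction` is made up for this example.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala = per_mille_to_fraction(11.32)  # AlaAla from the table
print(f"{ala_ala:.5f}")         # as a fraction
print(f"{ala_ala * 100:.3f}%")  # as a percentage
```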

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.32 ± 0.9
AlaCys: 0.744 ± 0.124
AlaAsp: 6.857 ± 0.519
AlaGlu: 7.136 ± 0.444
AlaPhe: 2.813 ± 0.254
AlaGly: 6.811 ± 0.528
AlaHis: 1.906 ± 0.217
AlaIle: 4.045 ± 0.34
AlaLys: 3.835 ± 0.3
AlaLeu: 7.833 ± 0.568
AlaMet: 2.348 ± 0.215
AlaAsn: 3.44 ± 0.308
AlaPro: 4.579 ± 0.35
AlaGln: 4.161 ± 0.388
AlaArg: 6.415 ± 0.494
AlaSer: 5.858 ± 0.465
AlaThr: 5.393 ± 0.499
AlaVal: 6.415 ± 0.429
AlaTrp: 1.511 ± 0.225
AlaTyr: 2.231 ± 0.209
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.628 ± 0.112
CysCys: 0.139 ± 0.054
CysAsp: 0.767 ± 0.121
CysGlu: 0.488 ± 0.104
CysPhe: 0.186 ± 0.059
CysGly: 1.418 ± 0.221
CysHis: 0.256 ± 0.079
CysIle: 0.232 ± 0.07
CysLys: 0.349 ± 0.082
CysLeu: 0.628 ± 0.137
CysMet: 0.209 ± 0.064
CysAsn: 0.488 ± 0.115
CysPro: 0.721 ± 0.135
CysGln: 0.232 ± 0.079
CysArg: 0.814 ± 0.146
CysSer: 0.651 ± 0.139
CysThr: 0.511 ± 0.11
CysVal: 0.488 ± 0.106
CysTrp: 0.209 ± 0.069
CysTyr: 0.488 ± 0.123
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.462 ± 0.449
AspCys: 0.697 ± 0.126
AspAsp: 5.532 ± 0.558
AspGlu: 5.439 ± 0.463
AspPhe: 2.185 ± 0.177
AspGly: 5.788 ± 0.44
AspHis: 1.72 ± 0.214
AspIle: 3.766 ± 0.329
AspLys: 1.743 ± 0.182
AspLeu: 5.718 ± 0.412
AspMet: 1.348 ± 0.176
AspAsn: 2.046 ± 0.209
AspPro: 5.369 ± 0.303
AspGln: 2.115 ± 0.231
AspArg: 4.812 ± 0.375
AspSer: 3.161 ± 0.302
AspThr: 3.882 ± 0.292
AspVal: 4.998 ± 0.334
AspTrp: 1.348 ± 0.217
AspTyr: 2.603 ± 0.238
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.927 ± 0.476
GluCys: 0.744 ± 0.146
GluAsp: 4.393 ± 0.458
GluGlu: 4.812 ± 0.424
GluPhe: 2.441 ± 0.233
GluGly: 4.533 ± 0.272
GluHis: 1.441 ± 0.212
GluIle: 3.208 ± 0.288
GluLys: 2.441 ± 0.283
GluLeu: 5.927 ± 0.442
GluMet: 1.953 ± 0.236
GluAsn: 1.883 ± 0.233
GluPro: 3.161 ± 0.324
GluGln: 2.836 ± 0.285
GluArg: 4.881 ± 0.413
GluSer: 3.231 ± 0.365
GluThr: 3.673 ± 0.339
GluVal: 5.718 ± 0.338
GluTrp: 1.697 ± 0.24
GluTyr: 2.046 ± 0.203
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.673 ± 0.325
PheCys: 0.302 ± 0.073
PheAsp: 2.557 ± 0.288
PheGlu: 2.138 ± 0.209
PhePhe: 0.814 ± 0.168
PheGly: 2.534 ± 0.209
PheHis: 0.581 ± 0.115
PheIle: 1.162 ± 0.172
PheLys: 0.907 ± 0.159
PheLeu: 1.813 ± 0.223
PheMet: 0.883 ± 0.133
PheAsn: 0.79 ± 0.14
PhePro: 1.743 ± 0.234
PheGln: 0.907 ± 0.156
PheArg: 1.72 ± 0.232
PheSer: 1.953 ± 0.218
PheThr: 1.976 ± 0.291
PheVal: 1.883 ± 0.241
PheTrp: 0.302 ± 0.076
PheTyr: 1.0 ± 0.163
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.067 ± 0.476
GlyCys: 0.837 ± 0.146
GlyAsp: 5.625 ± 0.279
GlyGlu: 4.835 ± 0.342
GlyPhe: 2.394 ± 0.233
GlyGly: 7.461 ± 0.906
GlyHis: 1.813 ± 0.224
GlyIle: 3.208 ± 0.295
GlyLys: 3.487 ± 0.342
GlyLeu: 5.532 ± 0.344
GlyMet: 2.371 ± 0.261
GlyAsn: 2.51 ± 0.257
GlyPro: 3.37 ± 0.28
GlyGln: 3.045 ± 0.29
GlyArg: 5.3 ± 0.368
GlySer: 5.044 ± 0.498
GlyThr: 6.044 ± 0.709
GlyVal: 5.067 ± 0.317
GlyTrp: 2.069 ± 0.257
GlyTyr: 3.277 ± 0.259
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.906 ± 0.225
HisCys: 0.256 ± 0.074
HisAsp: 1.348 ± 0.184
HisGlu: 1.302 ± 0.189
HisPhe: 0.883 ± 0.163
HisGly: 1.488 ± 0.199
HisHis: 0.628 ± 0.124
HisIle: 0.651 ± 0.136
HisLys: 0.628 ± 0.124
HisLeu: 1.511 ± 0.207
HisMet: 0.372 ± 0.093
HisAsn: 0.837 ± 0.159
HisPro: 1.627 ± 0.229
HisGln: 0.93 ± 0.137
HisArg: 1.813 ± 0.233
HisSer: 1.325 ± 0.198
HisThr: 1.139 ± 0.199
HisVal: 1.697 ± 0.233
HisTrp: 0.488 ± 0.102
HisTyr: 0.721 ± 0.106
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.091 ± 0.263
IleCys: 0.465 ± 0.101
IleAsp: 3.556 ± 0.316
IleGlu: 3.835 ± 0.345
IlePhe: 0.744 ± 0.131
IleGly: 3.649 ± 0.328
IleHis: 0.814 ± 0.131
IleIle: 1.581 ± 0.187
IleLys: 1.674 ± 0.225
IleLeu: 2.371 ± 0.246
IleMet: 0.79 ± 0.152
IleAsn: 1.581 ± 0.211
IlePro: 2.394 ± 0.275
IleGln: 1.697 ± 0.224
IleArg: 2.557 ± 0.244
IleSer: 2.673 ± 0.331
IleThr: 2.929 ± 0.326
IleVal: 2.952 ± 0.301
IleTrp: 0.628 ± 0.126
IleTyr: 1.116 ± 0.175
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.998 ± 0.303
LysCys: 0.302 ± 0.089
LysAsp: 2.255 ± 0.253
LysGlu: 2.231 ± 0.238
LysPhe: 0.79 ± 0.17
LysGly: 3.115 ± 0.273
LysHis: 0.814 ± 0.161
LysIle: 1.278 ± 0.17
LysLys: 1.906 ± 0.301
LysLeu: 2.882 ± 0.241
LysMet: 1.046 ± 0.163
LysAsn: 1.162 ± 0.194
LysPro: 2.394 ± 0.238
LysGln: 1.139 ± 0.173
LysArg: 2.534 ± 0.206
LysSer: 2.278 ± 0.237
LysThr: 2.092 ± 0.225
LysVal: 3.347 ± 0.293
LysTrp: 0.604 ± 0.114
LysTyr: 1.116 ± 0.153
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.368 ± 0.537
LeuCys: 0.883 ± 0.125
LeuAsp: 5.951 ± 0.416
LeuGlu: 3.952 ± 0.287
LeuPhe: 2.138 ± 0.241
LeuGly: 5.718 ± 0.416
LeuHis: 1.511 ± 0.225
LeuIle: 3.254 ± 0.307
LeuLys: 2.743 ± 0.277
LeuLeu: 4.905 ± 0.378
LeuMet: 1.86 ± 0.218
LeuAsn: 2.859 ± 0.288
LeuPro: 3.673 ± 0.293
LeuGln: 2.999 ± 0.248
LeuArg: 6.485 ± 0.366
LeuSer: 5.555 ± 0.389
LeuThr: 5.021 ± 0.443
LeuVal: 5.486 ± 0.35
LeuTrp: 0.767 ± 0.151
LeuTyr: 2.092 ± 0.26
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.348 ± 0.251
MetCys: 0.256 ± 0.067
MetAsp: 1.581 ± 0.206
MetGlu: 1.534 ± 0.179
MetPhe: 0.721 ± 0.117
MetGly: 1.418 ± 0.151
MetHis: 0.418 ± 0.082
MetIle: 1.046 ± 0.169
MetLys: 1.023 ± 0.149
MetLeu: 1.395 ± 0.178
MetMet: 0.628 ± 0.132
MetAsn: 1.092 ± 0.194
MetPro: 1.395 ± 0.224
MetGln: 0.814 ± 0.131
MetArg: 1.976 ± 0.216
MetSer: 2.138 ± 0.21
MetThr: 2.162 ± 0.25
MetVal: 1.65 ± 0.186
MetTrp: 0.349 ± 0.103
MetTyr: 0.697 ± 0.13
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.138 ± 0.278
AsnCys: 0.395 ± 0.12
AsnAsp: 2.348 ± 0.243
AsnGlu: 1.72 ± 0.18
AsnPhe: 1.092 ± 0.181
AsnGly: 3.556 ± 0.352
AsnHis: 0.558 ± 0.106
AsnIle: 1.162 ± 0.163
AsnLys: 1.139 ± 0.161
AsnLeu: 2.999 ± 0.298
AsnMet: 0.418 ± 0.106
AsnAsn: 1.046 ± 0.153
AsnPro: 2.743 ± 0.269
AsnGln: 1.395 ± 0.235
AsnArg: 2.371 ± 0.229
AsnSer: 2.115 ± 0.236
AsnThr: 2.138 ± 0.245
AsnVal: 2.022 ± 0.205
AsnTrp: 0.744 ± 0.162
AsnTyr: 1.185 ± 0.187
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.951 ± 0.382
ProCys: 0.511 ± 0.109
ProAsp: 4.858 ± 0.292
ProGlu: 4.998 ± 0.411
ProPhe: 1.348 ± 0.182
ProGly: 4.626 ± 0.497
ProHis: 1.023 ± 0.155
ProIle: 2.231 ± 0.277
ProLys: 2.092 ± 0.228
ProLeu: 3.626 ± 0.287
ProMet: 1.092 ± 0.149
ProAsn: 2.301 ± 0.267
ProPro: 2.882 ± 0.398
ProGln: 1.743 ± 0.205
ProArg: 2.882 ± 0.236
ProSer: 3.719 ± 0.325
ProThr: 3.766 ± 0.462
ProVal: 4.695 ± 0.422
ProTrp: 0.767 ± 0.133
ProTyr: 1.581 ± 0.199
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.37 ± 0.316
GlnCys: 0.209 ± 0.072
GlnAsp: 2.022 ± 0.251
GlnGlu: 2.231 ± 0.204
GlnPhe: 1.464 ± 0.189
GlnGly: 2.324 ± 0.213
GlnHis: 0.744 ± 0.142
GlnIle: 1.511 ± 0.175
GlnLys: 1.418 ± 0.16
GlnLeu: 3.44 ± 0.318
GlnMet: 1.116 ± 0.169
GlnAsn: 1.162 ± 0.187
GlnPro: 1.976 ± 0.216
GlnGln: 1.557 ± 0.202
GlnArg: 3.161 ± 0.322
GlnSer: 2.348 ± 0.25
GlnThr: 1.953 ± 0.214
GlnVal: 2.743 ± 0.218
GlnTrp: 0.953 ± 0.179
GlnTyr: 0.976 ± 0.138
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.765 ± 0.403
ArgCys: 0.744 ± 0.144
ArgAsp: 4.138 ± 0.376
ArgGlu: 4.602 ± 0.425
ArgPhe: 1.79 ± 0.244
ArgGly: 4.44 ± 0.35
ArgHis: 2.255 ± 0.256
ArgIle: 2.952 ± 0.27
ArgLys: 3.254 ± 0.327
ArgLeu: 5.323 ± 0.353
ArgMet: 1.767 ± 0.222
ArgAsn: 2.138 ± 0.188
ArgPro: 3.603 ± 0.35
ArgGln: 2.882 ± 0.345
ArgArg: 4.788 ± 0.416
ArgSer: 4.44 ± 0.395
ArgThr: 3.882 ± 0.379
ArgVal: 5.393 ± 0.396
ArgTrp: 1.325 ± 0.203
ArgTyr: 2.603 ± 0.289
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.834 ± 0.386
SerCys: 0.604 ± 0.129
SerAsp: 4.207 ± 0.336
SerGlu: 4.37 ± 0.321
SerPhe: 1.464 ± 0.212
SerGly: 5.904 ± 0.459
SerHis: 1.209 ± 0.223
SerIle: 2.696 ± 0.257
SerLys: 2.557 ± 0.236
SerLeu: 4.254 ± 0.319
SerMet: 1.836 ± 0.187
SerAsn: 2.417 ± 0.278
SerPro: 3.277 ± 0.296
SerGln: 2.464 ± 0.282
SerArg: 3.463 ± 0.336
SerSer: 4.44 ± 0.343
SerThr: 3.975 ± 0.401
SerVal: 4.951 ± 0.369
SerTrp: 1.139 ± 0.141
SerTyr: 2.069 ± 0.201
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.764 ± 0.688
ThrCys: 0.581 ± 0.116
ThrAsp: 3.952 ± 0.302
ThrGlu: 3.905 ± 0.274
ThrPhe: 2.441 ± 0.267
ThrGly: 5.021 ± 0.413
ThrHis: 1.371 ± 0.222
ThrIle: 2.789 ± 0.263
ThrLys: 1.906 ± 0.211
ThrLeu: 4.649 ± 0.288
ThrMet: 1.325 ± 0.177
ThrAsn: 1.883 ± 0.237
ThrPro: 4.765 ± 0.367
ThrGln: 1.929 ± 0.227
ThrArg: 3.184 ± 0.298
ThrSer: 3.835 ± 0.454
ThrThr: 4.858 ± 0.616
ThrVal: 4.881 ± 0.327
ThrTrp: 0.883 ± 0.149
ThrTyr: 2.58 ± 0.256
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.88 ± 0.448
ValCys: 0.697 ± 0.134
ValAsp: 4.858 ± 0.32
ValGlu: 5.276 ± 0.429
ValPhe: 1.743 ± 0.204
ValGly: 5.369 ± 0.335
ValHis: 1.534 ± 0.191
ValIle: 3.44 ± 0.275
ValLys: 2.417 ± 0.205
ValLeu: 5.951 ± 0.427
ValMet: 1.836 ± 0.202
ValAsn: 2.836 ± 0.321
ValPro: 3.766 ± 0.368
ValGln: 2.603 ± 0.267
ValArg: 5.509 ± 0.419
ValSer: 4.858 ± 0.292
ValThr: 4.928 ± 0.343
ValVal: 5.672 ± 0.435
ValTrp: 1.418 ± 0.165
ValTyr: 2.394 ± 0.248
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.674 ± 0.192
TrpCys: 0.139 ± 0.062
TrpAsp: 1.418 ± 0.22
TrpGlu: 0.86 ± 0.17
TrpPhe: 0.395 ± 0.119
TrpGly: 1.069 ± 0.205
TrpHis: 0.279 ± 0.08
TrpIle: 0.628 ± 0.107
TrpLys: 0.744 ± 0.167
TrpLeu: 1.627 ± 0.189
TrpMet: 0.604 ± 0.118
TrpAsn: 0.767 ± 0.121
TrpPro: 0.79 ± 0.144
TrpGln: 0.814 ± 0.133
TrpArg: 1.325 ± 0.21
TrpSer: 1.371 ± 0.174
TrpThr: 1.395 ± 0.166
TrpVal: 1.185 ± 0.169
TrpTrp: 0.511 ± 0.106
TrpTyr: 0.604 ± 0.119
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.487 ± 0.253
TyrCys: 0.395 ± 0.12
TyrAsp: 2.51 ± 0.213
TyrGlu: 2.278 ± 0.223
TyrPhe: 0.79 ± 0.162
TyrGly: 3.115 ± 0.314
TyrHis: 0.674 ± 0.127
TyrIle: 1.255 ± 0.183
TyrLys: 1.023 ± 0.189
TyrLeu: 2.836 ± 0.234
TyrMet: 0.814 ± 0.179
TyrAsn: 1.046 ± 0.142
TyrPro: 1.464 ± 0.207
TyrGln: 1.418 ± 0.154
TyrArg: 2.115 ± 0.255
TyrSer: 2.138 ± 0.242
TyrThr: 1.79 ± 0.225
TyrVal: 2.743 ± 0.229
TyrTrp: 0.488 ± 0.099
TyrTyr: 1.046 ± 0.162
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 212 proteins (43022 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
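
A protein-level bootstrap of this kind might look like the sketch below. It is illustrative only. The page gives the number of replicates (100) and the resampling unit (proteins) but not the exact procedure, so taking the sample standard deviation of the replicate estimates as the error is an assumption, as are the function names and the use of one-letter codes.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide, given in one-letter codes."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency across resampled proteomes (assumed method).

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling happens at the protein level, not the residue level.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome, not the RedWattleHog data.
toy = ["MAAK", "MAC", "MKAAAL", "MGG"]
print(pair_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which matters when a few proteins contribute most of the occurrences of a given dipeptide.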

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski