Amino acid dipeptide frequency for Gordonia phage RogerDodger

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
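As a rough illustration of the calculation, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. The function name, the toy sequences, and the use of one-letter codes are assumptions made for the example; the source does not state exactly how Proteome-pI counts pairs.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: pairs are counted with a sliding window of width 2 inside
    each protein, and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome (one-letter codes), not taken from the phage.
toy_proteome = ["MAAAK", "MAK"]
freqs = dipeptide_frequencies(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The order of the two residues matters, so "AK" and "KA" are counted separately, just as AlaCys and CysAla are separate entries in the table.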

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, with all 400 equally likely, each would have a frequency of 0.25%. This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones in blue.
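The comparison against a uniform expectation can be sketched as follows. The simple greater-than / less-than rule is an assumption for illustration; the source does not say how large a deviation is needed before a cell is coloured. The three example values are copied from the table.

```python
def expected_per_mille(alphabet_size):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed, expected):
    """Label a value relative to the uniform expectation (assumed rule)."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


expected_20 = expected_per_mille(20)  # 400 dipeptides: 0.25% = 2.5 per mille
expected_21 = expected_per_mille(21)  # 441 dipeptides when Xaa is included

# Values in per mille taken from the table.
examples = {"AlaAla": 17.348, "CysCys": 0.053, "LysVal": 2.116}
labels = {name: classify(value, expected_20) for name, value in examples.items()}
print(expected_20, round(expected_21, 3), labels)
```

Under this rule AlaAla would be flagged as overrepresented, while CysCys and LysVal would be flagged as underrepresented.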

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
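A small worked conversion may help when comparing the table with the 0.25% expectation quoted earlier. The helper names are hypothetical; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 17.348  # AlaAla frequency in per mille, from the table
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # the same value in percent

# The uniform expectation of 0.25% corresponds to 2.5 per mille.
uniform_percent = per_mille_to_percent(2.5)
```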

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first amino acid. Within each group the second amino acid runs in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.348 ± 1.598
AlaCys: 1.216 ± 0.289
AlaAsp: 8.674 ± 0.849
AlaGlu: 8.462 ± 0.815
AlaPhe: 3.068 ± 0.363
AlaGly: 9.573 ± 0.844
AlaHis: 2.327 ± 0.491
AlaIle: 5.553 ± 0.634
AlaLys: 3.967 ± 0.547
AlaLeu: 10.79 ± 1.147
AlaMet: 3.121 ± 0.342
AlaAsn: 3.173 ± 0.395
AlaPro: 6.876 ± 0.777
AlaGln: 4.496 ± 0.539
AlaArg: 8.674 ± 0.779
AlaSer: 5.765 ± 0.707
AlaThr: 7.352 ± 0.681
AlaVal: 7.881 ± 0.874
AlaTrp: 2.592 ± 0.308
AlaTyr: 2.645 ± 0.371
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.899 ± 0.227
CysCys: 0.053 ± 0.051
CysAsp: 0.952 ± 0.254
CysGlu: 0.264 ± 0.129
CysPhe: 0.212 ± 0.107
CysGly: 1.111 ± 0.293
CysHis: 0.212 ± 0.101
CysIle: 0.159 ± 0.118
CysLys: 0.212 ± 0.108
CysLeu: 0.212 ± 0.1
CysMet: 0.106 ± 0.061
CysAsn: 0.264 ± 0.114
CysPro: 0.793 ± 0.257
CysGln: 0.106 ± 0.082
CysArg: 0.688 ± 0.231
CysSer: 0.74 ± 0.208
CysThr: 0.582 ± 0.189
CysVal: 0.317 ± 0.131
CysTrp: 0.159 ± 0.123
CysTyr: 0.159 ± 0.106
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.515 ± 0.64
AspCys: 0.846 ± 0.231
AspAsp: 5.765 ± 0.672
AspGlu: 5.236 ± 0.604
AspPhe: 1.745 ± 0.319
AspGly: 6.03 ± 0.697
AspHis: 1.692 ± 0.359
AspIle: 1.851 ± 0.305
AspLys: 1.375 ± 0.288
AspLeu: 6.453 ± 0.628
AspMet: 1.111 ± 0.262
AspAsn: 1.904 ± 0.379
AspPro: 5.765 ± 0.589
AspGln: 2.221 ± 0.304
AspArg: 5.025 ± 0.668
AspSer: 3.279 ± 0.569
AspThr: 4.231 ± 0.482
AspVal: 4.972 ± 0.606
AspTrp: 1.798 ± 0.251
AspTyr: 2.274 ± 0.362
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.664 ± 0.55
GluCys: 0.423 ± 0.155
GluAsp: 3.173 ± 0.458
GluGlu: 2.274 ± 0.472
GluPhe: 1.745 ± 0.271
GluGly: 3.967 ± 0.618
GluHis: 0.688 ± 0.202
GluIle: 2.539 ± 0.315
GluLys: 1.904 ± 0.333
GluLeu: 5.924 ± 0.691
GluMet: 1.058 ± 0.259
GluAsn: 1.481 ± 0.26
GluPro: 3.491 ± 0.654
GluGln: 2.697 ± 0.398
GluArg: 3.808 ± 0.52
GluSer: 2.803 ± 0.342
GluThr: 3.068 ± 0.51
GluVal: 4.601 ± 0.491
GluTrp: 1.798 ± 0.367
GluTyr: 1.428 ± 0.267
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.121 ± 0.418
PheCys: 0.106 ± 0.077
PheAsp: 2.274 ± 0.319
PheGlu: 1.269 ± 0.272
PhePhe: 0.899 ± 0.297
PheGly: 1.64 ± 0.288
PheHis: 0.846 ± 0.233
PheIle: 1.005 ± 0.196
PheLys: 0.476 ± 0.17
PheLeu: 1.534 ± 0.28
PheMet: 0.317 ± 0.152
PheAsn: 0.952 ± 0.381
PhePro: 1.164 ± 0.257
PheGln: 0.74 ± 0.218
PheArg: 2.274 ± 0.313
PheSer: 1.64 ± 0.287
PheThr: 1.851 ± 0.357
PheVal: 1.798 ± 0.309
PheTrp: 0.37 ± 0.131
PheTyr: 0.212 ± 0.104
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.092 ± 1.099
GlyCys: 0.74 ± 0.202
GlyAsp: 5.871 ± 0.612
GlyGlu: 4.654 ± 0.488
GlyPhe: 2.592 ± 0.404
GlyGly: 9.52 ± 1.804
GlyHis: 1.851 ± 0.472
GlyIle: 4.02 ± 0.395
GlyLys: 3.015 ± 0.43
GlyLeu: 6.506 ± 0.567
GlyMet: 1.904 ± 0.329
GlyAsn: 3.226 ± 0.429
GlyPro: 3.702 ± 0.508
GlyGln: 3.649 ± 0.569
GlyArg: 6.03 ± 0.611
GlySer: 3.967 ± 0.481
GlyThr: 5.236 ± 0.876
GlyVal: 7.934 ± 0.696
GlyTrp: 1.64 ± 0.272
GlyTyr: 1.64 ± 0.271
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.798 ± 0.328
HisCys: 0.159 ± 0.095
HisAsp: 1.481 ± 0.307
HisGlu: 0.952 ± 0.218
HisPhe: 0.423 ± 0.139
HisGly: 1.692 ± 0.256
HisHis: 0.793 ± 0.228
HisIle: 1.216 ± 0.24
HisLys: 0.159 ± 0.09
HisLeu: 1.851 ± 0.334
HisMet: 0.264 ± 0.123
HisAsn: 0.37 ± 0.122
HisPro: 1.481 ± 0.286
HisGln: 0.846 ± 0.285
HisArg: 1.957 ± 0.44
HisSer: 0.793 ± 0.2
HisThr: 1.587 ± 0.399
HisVal: 1.375 ± 0.31
HisTrp: 0.635 ± 0.162
HisTyr: 0.635 ± 0.178
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.501 ± 0.587
IleCys: 0.106 ± 0.084
IleAsp: 3.914 ± 0.446
IleGlu: 2.539 ± 0.475
IlePhe: 0.793 ± 0.25
IleGly: 3.649 ± 0.62
IleHis: 0.688 ± 0.189
IleIle: 1.481 ± 0.39
IleLys: 0.899 ± 0.267
IleLeu: 2.909 ± 0.412
IleMet: 0.529 ± 0.14
IleAsn: 0.793 ± 0.187
IlePro: 2.856 ± 0.341
IleGln: 1.269 ± 0.361
IleArg: 3.385 ± 0.462
IleSer: 1.851 ± 0.295
IleThr: 3.332 ± 0.459
IleVal: 3.755 ± 0.482
IleTrp: 0.582 ± 0.195
IleTyr: 0.899 ± 0.228
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.544 ± 0.493
LysCys: 0.106 ± 0.071
LysAsp: 1.851 ± 0.476
LysGlu: 1.269 ± 0.314
LysPhe: 0.688 ± 0.238
LysGly: 1.692 ± 0.333
LysHis: 0.317 ± 0.143
LysIle: 0.846 ± 0.21
LysLys: 1.428 ± 0.332
LysLeu: 3.015 ± 0.375
LysMet: 0.37 ± 0.134
LysAsn: 0.899 ± 0.213
LysPro: 1.745 ± 0.333
LysGln: 0.793 ± 0.21
LysArg: 2.75 ± 0.369
LysSer: 1.428 ± 0.294
LysThr: 2.486 ± 0.373
LysVal: 2.116 ± 0.258
LysTrp: 0.529 ± 0.146
LysTyr: 0.582 ± 0.185
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.737 ± 0.786
LeuCys: 0.582 ± 0.176
LeuAsp: 7.034 ± 0.629
LeuGlu: 4.284 ± 0.516
LeuPhe: 1.64 ± 0.327
LeuGly: 7.458 ± 1.133
LeuHis: 1.64 ± 0.327
LeuIle: 3.332 ± 0.394
LeuLys: 1.692 ± 0.343
LeuLeu: 6.188 ± 0.55
LeuMet: 1.111 ± 0.251
LeuAsn: 1.692 ± 0.28
LeuPro: 4.073 ± 0.416
LeuGln: 3.173 ± 0.641
LeuArg: 6.135 ± 0.744
LeuSer: 4.178 ± 0.459
LeuThr: 6.347 ± 0.624
LeuVal: 5.501 ± 0.542
LeuTrp: 2.38 ± 0.357
LeuTyr: 1.692 ± 0.314
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.221 ± 0.323
MetCys: 0.317 ± 0.118
MetAsp: 0.846 ± 0.194
MetGlu: 0.74 ± 0.261
MetPhe: 0.423 ± 0.141
MetGly: 1.375 ± 0.232
MetHis: 0.212 ± 0.113
MetIle: 1.058 ± 0.221
MetLys: 0.582 ± 0.152
MetLeu: 1.375 ± 0.294
MetMet: 0.264 ± 0.124
MetAsn: 0.635 ± 0.19
MetPro: 1.798 ± 0.277
MetGln: 0.582 ± 0.183
MetArg: 1.164 ± 0.251
MetSer: 1.745 ± 0.268
MetThr: 3.068 ± 0.448
MetVal: 1.216 ± 0.26
MetTrp: 0.37 ± 0.123
MetTyr: 0.423 ± 0.157
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.544 ± 0.486
AsnCys: 0.37 ± 0.149
AsnAsp: 1.745 ± 0.308
AsnGlu: 1.216 ± 0.245
AsnPhe: 0.582 ± 0.155
AsnGly: 2.909 ± 0.382
AsnHis: 0.635 ± 0.192
AsnIle: 1.058 ± 0.209
AsnLys: 0.846 ± 0.197
AsnLeu: 2.327 ± 0.37
AsnMet: 0.635 ± 0.158
AsnAsn: 0.635 ± 0.185
AsnPro: 2.697 ± 0.312
AsnGln: 0.793 ± 0.214
AsnArg: 1.692 ± 0.255
AsnSer: 1.375 ± 0.269
AsnThr: 2.063 ± 0.34
AsnVal: 1.851 ± 0.296
AsnTrp: 0.317 ± 0.119
AsnTyr: 0.529 ± 0.192
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.304 ± 0.9
ProCys: 0.317 ± 0.135
ProAsp: 5.077 ± 0.689
ProGlu: 4.02 ± 0.62
ProPhe: 1.481 ± 0.295
ProGly: 5.712 ± 0.614
ProHis: 1.481 ± 0.376
ProIle: 3.068 ± 0.322
ProLys: 1.534 ± 0.24
ProLeu: 3.438 ± 0.345
ProMet: 1.005 ± 0.216
ProAsn: 1.587 ± 0.279
ProPro: 4.866 ± 0.768
ProGln: 1.745 ± 0.287
ProArg: 3.808 ± 0.541
ProSer: 3.491 ± 0.468
ProThr: 4.601 ± 0.496
ProVal: 4.813 ± 0.466
ProTrp: 1.164 ± 0.284
ProTyr: 0.846 ± 0.201
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.601 ± 0.535
GlnCys: 0.317 ± 0.122
GlnAsp: 1.692 ± 0.355
GlnGlu: 1.587 ± 0.324
GlnPhe: 0.74 ± 0.172
GlnGly: 2.697 ± 0.409
GlnHis: 0.899 ± 0.22
GlnIle: 1.587 ± 0.303
GlnLys: 1.111 ± 0.289
GlnLeu: 3.332 ± 0.389
GlnMet: 1.216 ± 0.226
GlnAsn: 0.74 ± 0.217
GlnPro: 2.01 ± 0.347
GlnGln: 1.745 ± 0.291
GlnArg: 2.486 ± 0.387
GlnSer: 1.957 ± 0.35
GlnThr: 2.116 ± 0.354
GlnVal: 2.697 ± 0.339
GlnTrp: 0.74 ± 0.185
GlnTyr: 0.74 ± 0.198
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.467 ± 0.861
ArgCys: 0.635 ± 0.182
ArgAsp: 5.765 ± 0.675
ArgGlu: 3.967 ± 0.53
ArgPhe: 1.64 ± 0.346
ArgGly: 5.818 ± 0.502
ArgHis: 1.957 ± 0.44
ArgIle: 2.962 ± 0.332
ArgLys: 3.385 ± 0.467
ArgLeu: 4.707 ± 0.523
ArgMet: 2.274 ± 0.326
ArgAsn: 2.116 ± 0.407
ArgPro: 3.173 ± 0.487
ArgGln: 3.121 ± 0.448
ArgArg: 7.51 ± 0.865
ArgSer: 3.702 ± 0.471
ArgThr: 4.178 ± 0.596
ArgVal: 4.549 ± 0.68
ArgTrp: 1.851 ± 0.384
ArgTyr: 1.375 ± 0.27
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.823 ± 0.696
SerCys: 0.212 ± 0.157
SerAsp: 3.808 ± 0.331
SerGlu: 2.169 ± 0.305
SerPhe: 1.005 ± 0.182
SerGly: 5.553 ± 0.66
SerHis: 0.688 ± 0.232
SerIle: 2.486 ± 0.377
SerLys: 1.216 ± 0.251
SerLeu: 4.337 ± 0.535
SerMet: 1.322 ± 0.243
SerAsn: 1.269 ± 0.285
SerPro: 3.015 ± 0.421
SerGln: 1.269 ± 0.23
SerArg: 3.226 ± 0.408
SerSer: 2.962 ± 0.48
SerThr: 4.76 ± 0.527
SerVal: 3.438 ± 0.404
SerTrp: 1.216 ± 0.277
SerTyr: 1.375 ± 0.292
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.256 ± 1.138
ThrCys: 0.635 ± 0.173
ThrAsp: 4.549 ± 0.459
ThrGlu: 3.914 ± 0.458
ThrPhe: 1.745 ± 0.244
ThrGly: 6.453 ± 0.649
ThrHis: 1.111 ± 0.246
ThrIle: 3.385 ± 0.421
ThrLys: 1.957 ± 0.261
ThrLeu: 5.501 ± 0.552
ThrMet: 1.375 ± 0.248
ThrAsn: 1.904 ± 0.299
ThrPro: 5.606 ± 0.494
ThrGln: 1.481 ± 0.334
ThrArg: 5.183 ± 0.527
ThrSer: 4.02 ± 0.543
ThrThr: 5.289 ± 0.589
ThrVal: 4.919 ± 0.546
ThrTrp: 1.164 ± 0.197
ThrTyr: 1.164 ± 0.269
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.039 ± 0.544
ValCys: 0.635 ± 0.176
ValAsp: 5.183 ± 0.479
ValGlu: 4.284 ± 0.431
ValPhe: 1.745 ± 0.348
ValGly: 5.236 ± 0.818
ValHis: 1.269 ± 0.259
ValIle: 3.121 ± 0.358
ValLys: 1.745 ± 0.274
ValLeu: 6.558 ± 0.64
ValMet: 1.269 ± 0.243
ValAsn: 2.486 ± 0.394
ValPro: 4.813 ± 0.601
ValGln: 2.38 ± 0.417
ValArg: 5.236 ± 0.601
ValSer: 4.39 ± 0.53
ValThr: 5.236 ± 0.462
ValVal: 6.03 ± 0.499
ValTrp: 2.063 ± 0.321
ValTyr: 1.164 ± 0.275
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.75 ± 0.496
TrpCys: 0.317 ± 0.138
TrpAsp: 1.005 ± 0.206
TrpGlu: 0.899 ± 0.204
TrpPhe: 0.688 ± 0.199
TrpGly: 1.216 ± 0.233
TrpHis: 0.582 ± 0.159
TrpIle: 0.793 ± 0.218
TrpLys: 0.582 ± 0.179
TrpLeu: 2.38 ± 0.374
TrpMet: 0.635 ± 0.172
TrpAsn: 1.005 ± 0.251
TrpPro: 1.216 ± 0.329
TrpGln: 1.164 ± 0.242
TrpArg: 1.375 ± 0.278
TrpSer: 1.269 ± 0.312
TrpThr: 1.851 ± 0.28
TrpVal: 1.745 ± 0.253
TrpTrp: 0.793 ± 0.221
TrpTyr: 0.37 ± 0.165
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.327 ± 0.437
TyrCys: 0.106 ± 0.071
TyrAsp: 1.428 ± 0.279
TyrGlu: 1.534 ± 0.358
TyrPhe: 0.635 ± 0.153
TyrGly: 2.486 ± 0.379
TyrHis: 0.582 ± 0.2
TyrIle: 0.212 ± 0.098
TyrLys: 0.37 ± 0.146
TyrLeu: 1.534 ± 0.275
TyrMet: 0.476 ± 0.148
TyrAsn: 0.688 ± 0.199
TyrPro: 1.216 ± 0.229
TyrGln: 0.74 ± 0.164
TyrArg: 1.745 ± 0.34
TyrSer: 0.793 ± 0.199
TyrThr: 1.428 ± 0.277
TyrVal: 1.375 ± 0.3
TyrTrp: 0.476 ± 0.148
TyrTyr: 0.635 ± 0.184
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 97 proteins (18908 amino acids)
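Each table entry carries a label such as `AlaAla: 17.348 ± 1.598`, which makes the listing straightforward to load for further analysis. The following is a minimal parsing sketch; the regular expression and function name are assumptions for this example, and the sample lines are copied from the table.

```python
import re

# Two three-letter residue codes, a colon, the value, "±", and the error.
CELL = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([\d.]+)\s*±\s*([\d.]+)")


def parse_cells(lines):
    """Map (first, second) residue codes to (value, error) in per mille.

    Lines without a dipeptide label, such as a row header, are skipped.
    """
    table = {}
    for line in lines:
        match = CELL.search(line)
        if match:
            first, second, value, error = match.groups()
            table[(first, second)] = (float(value), float(error))
    return table


sample = [
    "Ala",
    "AlaAla: 17.348 ± 1.598",
    "AlaCys: 1.216 ± 0.289",
    "CysAla: 0.899 ± 0.227",
]
table = parse_cells(sample)
print(table[("Ala", "Cys")], table[("Cys", "Ala")])
```

Keying the dictionary by an ordered pair keeps AlaCys and CysAla apart, which matters because their frequencies differ.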

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
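One plausible reading of protein-level bootstrapping is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. Reporting the mean and sample standard deviation is an assumption, as are the function names and the toy sequences; the source does not describe the exact procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Resample proteins with replacement; return (mean, std) of replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(resampled, pair))
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not taken from the phage.
toy_proteome = ["MAAAK", "MAK", "MKAAL", "MGGAA"]
mean_aa, err_aa = bootstrap_error(toy_proteome, "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so adjacent pairs are never broken up. A dipeptide that never occurs yields 0.0 in every replicate, which is consistent with the 0.0 ± 0.0 entries for Xaa.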

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski