Amino acid dipeptide frequencies for Aeromonas phage SD04

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
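The counting itself can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function name dipeptide_frequencies, the toy sequences, the use of one-letter codes (the table uses three-letter codes), and the choice of normalising by the number of counted dipeptides are all assumptions.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Hypothetical helper: overlapping adjacent pairs are counted within each
    protein and normalised by the total number of pairs (an assumption; the
    database may normalise differently).
    """
    counts = Counter()
    for seq in proteins:
        # Pairs are taken inside one protein only, never across two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the Aeromonas phage SD04 proteome.
toy_proteins = ["MKAAL", "AAK"]
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

With real data, the input would be the list of protein sequences of the proteome, and the keys would be mapped to three-letter codes for display.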

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all 20 amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
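The arithmetic behind the expected value can be checked with a short Python sketch. The function names and the plain greater-than/less-than comparison are assumptions made for illustration; the page does not state the exact rule used for the red and blue marking.

```python
def uniform_expectation_permille(n_residue_types=20):
    """Expected frequency (in per mille) of each dipeptide if all
    n_residue_types**2 dipeptides were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_permille, expected_permille):
    """Hypothetical rule: compare the observed value with the expectation."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


expected_20 = uniform_expectation_permille(20)  # 1000 / 400
expected_21 = uniform_expectation_permille(21)  # 1000 / 441, with Xaa included

# Two values taken from the table: AlaAla and CysCys.
print(expected_20, classify(13.1, expected_20), classify(0.418, expected_20))
```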

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.1 ± 2.332
AlaCys: 1.045 ± 0.259
AlaAsp: 4.529 ± 0.623
AlaGlu: 6.132 ± 1.226
AlaPhe: 2.718 ± 0.593
AlaGly: 6.271 ± 0.782
AlaHis: 1.394 ± 0.282
AlaIle: 4.947 ± 0.651
AlaLys: 6.132 ± 0.741
AlaLeu: 7.735 ± 0.775
AlaMet: 3.972 ± 0.873
AlaAsn: 3.902 ± 0.526
AlaPro: 4.181 ± 0.817
AlaGln: 5.435 ± 1.182
AlaArg: 4.529 ± 0.653
AlaSer: 5.575 ± 0.944
AlaThr: 5.853 ± 1.078
AlaVal: 5.435 ± 0.522
AlaTrp: 1.533 ± 0.285
AlaTyr: 2.857 ± 0.483
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.627 ± 0.181
CysCys: 0.418 ± 0.202
CysAsp: 1.324 ± 0.303
CysGlu: 0.766 ± 0.25
CysPhe: 0.139 ± 0.091
CysGly: 1.185 ± 0.388
CysHis: 0.418 ± 0.192
CysIle: 0.557 ± 0.207
CysLys: 0.976 ± 0.284
CysLeu: 0.906 ± 0.28
CysMet: 0.139 ± 0.085
CysAsn: 0.906 ± 0.27
CysPro: 0.557 ± 0.157
CysGln: 0.766 ± 0.275
CysArg: 0.348 ± 0.182
CysSer: 0.488 ± 0.179
CysThr: 0.557 ± 0.21
CysVal: 0.766 ± 0.261
CysTrp: 0.139 ± 0.096
CysTyr: 0.348 ± 0.166
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.714 ± 0.618
AspCys: 0.627 ± 0.218
AspAsp: 3.345 ± 0.524
AspGlu: 3.763 ± 0.466
AspPhe: 2.369 ± 0.396
AspGly: 5.156 ± 0.818
AspHis: 1.463 ± 0.305
AspIle: 3.414 ± 0.459
AspLys: 3.275 ± 0.429
AspLeu: 4.599 ± 0.477
AspMet: 1.394 ± 0.349
AspAsn: 2.369 ± 0.417
AspPro: 2.787 ± 0.453
AspGln: 1.951 ± 0.351
AspArg: 2.439 ± 0.424
AspSer: 2.857 ± 0.466
AspThr: 2.648 ± 0.403
AspVal: 3.554 ± 0.416
AspTrp: 0.976 ± 0.309
AspTyr: 1.951 ± 0.43
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.177 ± 0.946
GluCys: 0.766 ± 0.2
GluAsp: 4.181 ± 0.55
GluGlu: 4.042 ± 0.667
GluPhe: 2.369 ± 0.346
GluGly: 4.529 ± 0.524
GluHis: 0.836 ± 0.269
GluIle: 4.111 ± 0.631
GluLys: 4.042 ± 0.696
GluLeu: 5.365 ± 0.545
GluMet: 1.533 ± 0.418
GluAsn: 3.066 ± 0.38
GluPro: 2.09 ± 0.483
GluGln: 3.972 ± 0.512
GluArg: 3.275 ± 0.559
GluSer: 2.299 ± 0.436
GluThr: 2.927 ± 0.52
GluVal: 3.763 ± 0.507
GluTrp: 0.906 ± 0.238
GluTyr: 2.021 ± 0.399
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.205 ± 0.52
PheCys: 0.627 ± 0.294
PheAsp: 2.23 ± 0.425
PheGlu: 2.09 ± 0.412
PhePhe: 1.185 ± 0.279
PheGly: 1.951 ± 0.366
PheHis: 0.697 ± 0.21
PheIle: 2.23 ± 0.39
PheLys: 2.439 ± 0.346
PheLeu: 2.578 ± 0.377
PheMet: 0.697 ± 0.186
PheAsn: 2.578 ± 0.488
PhePro: 1.394 ± 0.317
PheGln: 1.115 ± 0.297
PheArg: 1.812 ± 0.354
PheSer: 2.23 ± 0.335
PheThr: 1.812 ± 0.422
PheVal: 2.648 ± 0.589
PheTrp: 0.209 ± 0.105
PheTyr: 1.045 ± 0.278
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.271 ± 0.678
GlyCys: 1.115 ± 0.276
GlyAsp: 3.623 ± 0.376
GlyGlu: 4.808 ± 0.675
GlyPhe: 2.718 ± 0.507
GlyGly: 5.714 ± 0.668
GlyHis: 1.185 ± 0.331
GlyIle: 4.111 ± 0.652
GlyLys: 4.669 ± 0.584
GlyLeu: 6.271 ± 0.712
GlyMet: 2.648 ± 0.489
GlyAsn: 3.136 ± 0.479
GlyPro: 2.09 ± 0.317
GlyGln: 2.787 ± 0.529
GlyArg: 3.763 ± 0.418
GlySer: 4.599 ± 0.621
GlyThr: 4.46 ± 0.546
GlyVal: 5.993 ± 0.621
GlyTrp: 1.812 ± 0.311
GlyTyr: 3.066 ± 0.486
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.463 ± 0.295
HisCys: 0.279 ± 0.15
HisAsp: 1.185 ± 0.314
HisGlu: 1.185 ± 0.332
HisPhe: 0.488 ± 0.172
HisGly: 0.766 ± 0.173
HisHis: 0.418 ± 0.188
HisIle: 0.697 ± 0.288
HisLys: 0.906 ± 0.254
HisLeu: 1.324 ± 0.342
HisMet: 0.488 ± 0.157
HisAsn: 0.836 ± 0.283
HisPro: 0.906 ± 0.223
HisGln: 0.836 ± 0.267
HisArg: 0.418 ± 0.15
HisSer: 1.394 ± 0.254
HisThr: 1.324 ± 0.29
HisVal: 0.976 ± 0.316
HisTrp: 0.348 ± 0.138
HisTyr: 0.488 ± 0.198
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.714 ± 0.428
IleCys: 0.906 ± 0.287
IleAsp: 2.996 ± 0.402
IleGlu: 4.181 ± 0.586
IlePhe: 1.672 ± 0.374
IleGly: 4.111 ± 0.62
IleHis: 0.766 ± 0.219
IleIle: 3.484 ± 0.421
IleLys: 3.136 ± 0.431
IleLeu: 3.066 ± 0.349
IleMet: 1.951 ± 0.275
IleAsn: 3.484 ± 0.536
IlePro: 2.648 ± 0.349
IleGln: 1.742 ± 0.356
IleArg: 2.927 ± 0.461
IleSer: 3.414 ± 0.429
IleThr: 3.902 ± 0.436
IleVal: 4.669 ± 0.709
IleTrp: 0.557 ± 0.225
IleTyr: 1.881 ± 0.312
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.48 ± 0.981
LysCys: 0.697 ± 0.194
LysAsp: 3.832 ± 0.52
LysGlu: 4.947 ± 0.784
LysPhe: 2.718 ± 0.401
LysGly: 4.46 ± 0.339
LysHis: 1.394 ± 0.309
LysIle: 3.623 ± 0.461
LysLys: 3.554 ± 0.633
LysLeu: 4.251 ± 0.655
LysMet: 1.881 ± 0.459
LysAsn: 2.578 ± 0.377
LysPro: 2.509 ± 0.426
LysGln: 2.369 ± 0.355
LysArg: 2.578 ± 0.499
LysSer: 2.927 ± 0.482
LysThr: 3.066 ± 0.448
LysVal: 4.947 ± 0.589
LysTrp: 0.906 ± 0.293
LysTyr: 1.951 ± 0.354
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.898 ± 1.119
LeuCys: 0.906 ± 0.286
LeuAsp: 3.832 ± 0.588
LeuGlu: 4.46 ± 0.624
LeuPhe: 1.951 ± 0.45
LeuGly: 5.365 ± 0.563
LeuHis: 1.324 ± 0.284
LeuIle: 4.878 ± 0.649
LeuLys: 4.46 ± 0.535
LeuLeu: 4.39 ± 0.657
LeuMet: 1.951 ± 0.391
LeuAsn: 5.017 ± 0.665
LeuPro: 3.902 ± 0.605
LeuGln: 3.832 ± 0.534
LeuArg: 4.878 ± 0.479
LeuSer: 4.46 ± 0.511
LeuThr: 4.738 ± 0.471
LeuVal: 4.599 ± 0.666
LeuTrp: 0.697 ± 0.215
LeuTyr: 2.299 ± 0.49
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.23 ± 0.322
MetCys: 0.139 ± 0.096
MetAsp: 1.324 ± 0.305
MetGlu: 2.09 ± 0.407
MetPhe: 0.697 ± 0.188
MetGly: 2.787 ± 0.515
MetHis: 0.348 ± 0.149
MetIle: 1.463 ± 0.34
MetLys: 2.021 ± 0.35
MetLeu: 2.578 ± 0.464
MetMet: 0.766 ± 0.239
MetAsn: 0.976 ± 0.188
MetPro: 1.533 ± 0.32
MetGln: 1.672 ± 0.347
MetArg: 1.812 ± 0.469
MetSer: 2.439 ± 0.465
MetThr: 2.16 ± 0.413
MetVal: 1.324 ± 0.327
MetTrp: 0.139 ± 0.084
MetTyr: 1.115 ± 0.262
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.46 ± 0.669
AsnCys: 0.557 ± 0.164
AsnAsp: 3.484 ± 0.479
AsnGlu: 2.857 ± 0.455
AsnPhe: 1.881 ± 0.297
AsnGly: 5.365 ± 0.804
AsnHis: 1.045 ± 0.339
AsnIle: 2.509 ± 0.35
AsnLys: 3.623 ± 0.511
AsnLeu: 3.484 ± 0.487
AsnMet: 1.533 ± 0.3
AsnAsn: 3.136 ± 0.46
AsnPro: 2.648 ± 0.453
AsnGln: 1.533 ± 0.373
AsnArg: 2.23 ± 0.395
AsnSer: 1.951 ± 0.444
AsnThr: 2.718 ± 0.379
AsnVal: 3.693 ± 0.411
AsnTrp: 0.836 ± 0.173
AsnTyr: 1.324 ± 0.243
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.414 ± 0.616
ProCys: 0.209 ± 0.107
ProAsp: 2.578 ± 0.382
ProGlu: 3.136 ± 0.426
ProPhe: 1.881 ± 0.459
ProGly: 3.066 ± 0.434
ProHis: 0.557 ± 0.181
ProIle: 1.672 ± 0.322
ProLys: 2.648 ± 0.422
ProLeu: 3.066 ± 0.568
ProMet: 1.742 ± 0.358
ProAsn: 2.09 ± 0.439
ProPro: 1.881 ± 0.46
ProGln: 1.812 ± 0.466
ProArg: 1.533 ± 0.348
ProSer: 3.205 ± 0.446
ProThr: 2.16 ± 0.409
ProVal: 3.972 ± 0.763
ProTrp: 0.557 ± 0.296
ProTyr: 1.324 ± 0.337
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.878 ± 0.947
GlnCys: 0.627 ± 0.214
GlnAsp: 2.509 ± 0.364
GlnGlu: 1.812 ± 0.333
GlnPhe: 1.672 ± 0.359
GlnGly: 3.066 ± 0.511
GlnHis: 0.836 ± 0.274
GlnIle: 2.369 ± 0.446
GlnLys: 2.299 ± 0.436
GlnLeu: 4.042 ± 0.501
GlnMet: 1.533 ± 0.305
GlnAsn: 2.439 ± 0.471
GlnPro: 3.066 ± 0.846
GlnGln: 4.042 ± 1.463
GlnArg: 2.857 ± 0.438
GlnSer: 2.23 ± 0.436
GlnThr: 2.439 ± 0.358
GlnVal: 2.857 ± 0.389
GlnTrp: 0.836 ± 0.247
GlnTyr: 1.185 ± 0.341
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.763 ± 0.613
ArgCys: 0.348 ± 0.15
ArgAsp: 2.787 ± 0.388
ArgGlu: 3.414 ± 0.572
ArgPhe: 1.672 ± 0.288
ArgGly: 3.484 ± 0.482
ArgHis: 0.488 ± 0.195
ArgIle: 2.369 ± 0.395
ArgLys: 3.066 ± 0.562
ArgLeu: 4.111 ± 0.514
ArgMet: 1.672 ± 0.415
ArgAsn: 1.881 ± 0.362
ArgPro: 1.812 ± 0.38
ArgGln: 2.299 ± 0.277
ArgArg: 2.299 ± 0.425
ArgSer: 2.369 ± 0.454
ArgThr: 2.718 ± 0.438
ArgVal: 3.205 ± 0.465
ArgTrp: 1.324 ± 0.337
ArgTyr: 2.299 ± 0.331
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.505 ± 0.758
SerCys: 0.697 ± 0.211
SerAsp: 2.578 ± 0.479
SerGlu: 2.927 ± 0.427
SerPhe: 2.09 ± 0.346
SerGly: 4.947 ± 0.562
SerHis: 0.766 ± 0.211
SerIle: 2.996 ± 0.39
SerLys: 2.996 ± 0.46
SerLeu: 4.46 ± 0.652
SerMet: 1.394 ± 0.385
SerAsn: 2.857 ± 0.476
SerPro: 1.742 ± 0.293
SerGln: 2.509 ± 0.483
SerArg: 2.439 ± 0.476
SerSer: 2.718 ± 0.376
SerThr: 2.996 ± 0.458
SerVal: 3.623 ± 0.486
SerTrp: 1.603 ± 0.305
SerTyr: 1.812 ± 0.311
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.853 ± 0.907
ThrCys: 0.976 ± 0.288
ThrAsp: 3.345 ± 0.426
ThrGlu: 2.857 ± 0.391
ThrPhe: 2.718 ± 0.422
ThrGly: 4.39 ± 0.473
ThrHis: 0.836 ± 0.212
ThrIle: 4.251 ± 0.646
ThrLys: 3.136 ± 0.467
ThrLeu: 4.042 ± 0.5
ThrMet: 1.742 ± 0.319
ThrAsn: 2.718 ± 0.396
ThrPro: 2.578 ± 0.511
ThrGln: 2.996 ± 0.502
ThrArg: 2.439 ± 0.413
ThrSer: 3.136 ± 0.567
ThrThr: 3.275 ± 0.589
ThrVal: 5.017 ± 0.708
ThrTrp: 0.836 ± 0.234
ThrTyr: 1.603 ± 0.34
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.132 ± 0.77
ValCys: 0.766 ± 0.269
ValAsp: 4.32 ± 0.481
ValGlu: 5.156 ± 0.774
ValPhe: 2.369 ± 0.424
ValGly: 4.251 ± 0.509
ValHis: 1.324 ± 0.361
ValIle: 4.878 ± 0.622
ValLys: 5.156 ± 0.589
ValLeu: 4.738 ± 0.496
ValMet: 1.672 ± 0.322
ValAsn: 3.832 ± 0.487
ValPro: 2.439 ± 0.371
ValGln: 3.554 ± 0.567
ValArg: 2.09 ± 0.475
ValSer: 2.648 ± 0.363
ValThr: 6.062 ± 0.636
ValVal: 6.202 ± 0.738
ValTrp: 0.836 ± 0.217
ValTyr: 2.439 ± 0.451
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.533 ± 0.299
TrpCys: 0.139 ± 0.099
TrpAsp: 1.045 ± 0.239
TrpGlu: 0.766 ± 0.284
TrpPhe: 0.279 ± 0.128
TrpGly: 1.463 ± 0.354
TrpHis: 0.07 ± 0.077
TrpIle: 0.488 ± 0.171
TrpLys: 1.115 ± 0.319
TrpLeu: 1.463 ± 0.234
TrpMet: 0.418 ± 0.17
TrpAsn: 0.557 ± 0.185
TrpPro: 0.627 ± 0.182
TrpGln: 0.906 ± 0.237
TrpArg: 0.906 ± 0.311
TrpSer: 0.557 ± 0.169
TrpThr: 0.906 ± 0.265
TrpVal: 1.185 ± 0.277
TrpTrp: 0.279 ± 0.123
TrpTyr: 0.976 ± 0.224
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.857 ± 0.515
TyrCys: 0.488 ± 0.166
TyrAsp: 1.603 ± 0.372
TyrGlu: 1.881 ± 0.392
TyrPhe: 1.185 ± 0.285
TyrGly: 2.509 ± 0.392
TyrHis: 0.557 ± 0.199
TyrIle: 2.23 ± 0.381
TyrLys: 2.021 ± 0.398
TyrLeu: 2.369 ± 0.441
TyrMet: 0.418 ± 0.158
TyrAsn: 2.509 ± 0.391
TyrPro: 1.115 ± 0.305
TyrGln: 1.463 ± 0.32
TyrArg: 1.812 ± 0.468
TyrSer: 2.16 ± 0.361
TyrThr: 2.021 ± 0.424
TyrVal: 2.439 ± 0.451
TyrTrp: 0.348 ± 0.131
TyrTyr: 1.533 ± 0.314
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 67 proteins (14352 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
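Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the statistic on each resampled set. The Python sketch below illustrates the idea for a single dipeptide. It is not the database's implementation: the function names, the toy sequences, the fixed seed, and the use of the sample standard deviation of the resampled estimates as the reported error are all assumptions.

```python
import random
from statistics import stdev


def dipeptide_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) over all adjacent pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Hypothetical error estimate: resample proteins (not residues) with
    replacement n_resamples times and take the standard deviation of the
    recomputed frequencies."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    return stdev(estimates)


# Toy input, not taken from the Aeromonas phage SD04 proteome.
toy_proteins = ["MKAAL", "AAK", "GAAG", "MLKV"]
value = dipeptide_permille(toy_proteins, "AA")
error = bootstrap_error(toy_proteins, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.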

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski