Amino acid dipeptide frequency for Gordonia phage EpicDab

Apart from single amino acid frequencies, one can also calculate amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent amino acids occurs.
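As an illustration, dipeptide frequencies could be computed roughly as follows. This is a minimal sketch, not the code used to build the table: the function name `dipeptide_per_mille`, the toy sequences and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions, and one-letter codes are used for brevity.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping adjacent pairs inside one protein (assumed convention).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy sequences, not EpicDab proteins.
toy_proteins = ["MAAG", "AAGP"]
freqs = dipeptide_per_mille(toy_proteins)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

The frequencies over all observed dipeptides sum to 1000 per mille by construction.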

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
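The comparison against the uniform expectation can be sketched as below. The helper names and the labels "over"/"under" are hypothetical, and whether the table is coloured against 1/400 or 1/441 is not stated here, so the alphabet size is left as a parameter. The three sample values are taken from the table.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"    # would be shown in red
    if value_per_mille < expected:
        return "under"   # would be shown in blue
    return "expected"


sample = {"AlaAla": 23.737, "AlaCys": 0.574, "GlyPro": 10.911}
labels = {pair: classify(value) for pair, value in sample.items()}
print(expected_per_mille(), labels)
```

With 20 amino acids the threshold is 2.5‰; with Xaa included (21) it drops to about 2.27‰.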

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
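For example, the unit conversion for the AlaAla value from the table could look like this (a small sketch; the function names are hypothetical):

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


ala_ala = 23.737  # AlaAla frequency in per mille, from the table
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
print(f"{fraction:.6f} {percent:.4f}%")
```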

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 23.737 ± 2.712
AlaCys: 0.574 ± 0.365
AlaAsp: 7.466 ± 1.024
AlaGlu: 8.231 ± 1.397
AlaPhe: 2.871 ± 0.902
AlaGly: 17.611 ± 2.434
AlaHis: 3.254 ± 0.993
AlaIle: 5.551 ± 0.836
AlaLys: 2.871 ± 0.537
AlaLeu: 11.294 ± 1.402
AlaMet: 2.68 ± 0.737
AlaAsn: 2.106 ± 0.627
AlaPro: 5.551 ± 0.872
AlaGln: 6.508 ± 1.396
AlaArg: 11.103 ± 1.342
AlaSer: 5.36 ± 1.488
AlaThr: 9.763 ± 1.692
AlaVal: 10.911 ± 2.417
AlaTrp: 2.297 ± 0.646
AlaTyr: 4.211 ± 0.826
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.106 ± 0.896
CysCys: 0.0 ± 0.0
CysAsp: 0.191 ± 0.17
CysGlu: 0.383 ± 0.235
CysPhe: 0.0 ± 0.0
CysGly: 1.34 ± 0.448
CysHis: 0.383 ± 0.273
CysIle: 0.191 ± 0.201
CysLys: 0.191 ± 0.17
CysLeu: 0.574 ± 0.271
CysMet: 0.0 ± 0.0
CysAsn: 0.191 ± 0.23
CysPro: 0.383 ± 0.266
CysGln: 0.191 ± 0.216
CysArg: 0.957 ± 0.442
CysSer: 0.0 ± 0.0
CysThr: 0.191 ± 0.184
CysVal: 0.574 ± 0.275
CysTrp: 0.383 ± 0.237
CysTyr: 0.191 ± 0.205
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.997 ± 1.733
AspCys: 1.531 ± 0.577
AspAsp: 2.871 ± 0.772
AspGlu: 2.489 ± 0.683
AspPhe: 1.723 ± 0.572
AspGly: 5.168 ± 1.12
AspHis: 1.34 ± 0.479
AspIle: 1.723 ± 0.537
AspLys: 1.914 ± 1.077
AspLeu: 4.977 ± 0.833
AspMet: 1.914 ± 0.848
AspAsn: 1.914 ± 0.435
AspPro: 5.551 ± 0.788
AspGln: 0.957 ± 0.337
AspArg: 4.786 ± 1.09
AspSer: 2.489 ± 0.456
AspThr: 4.977 ± 1.519
AspVal: 5.934 ± 1.288
AspTrp: 2.106 ± 0.659
AspTyr: 1.531 ± 0.714
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.551 ± 1.204
GluCys: 0.0 ± 0.0
GluAsp: 2.871 ± 0.594
GluGlu: 1.149 ± 0.551
GluPhe: 1.531 ± 0.457
GluGly: 4.403 ± 1.01
GluHis: 0.766 ± 0.294
GluIle: 0.766 ± 0.37
GluLys: 1.149 ± 0.444
GluLeu: 6.126 ± 1.022
GluMet: 0.383 ± 0.24
GluAsn: 1.723 ± 0.446
GluPro: 4.403 ± 0.963
GluGln: 2.106 ± 0.605
GluArg: 3.446 ± 0.935
GluSer: 2.106 ± 0.505
GluThr: 2.106 ± 0.626
GluVal: 5.168 ± 0.812
GluTrp: 1.531 ± 0.559
GluTyr: 0.957 ± 0.405
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.02 ± 0.871
PheCys: 0.0 ± 0.0
PheAsp: 3.254 ± 0.617
PheGlu: 1.149 ± 0.51
PhePhe: 0.766 ± 0.307
PheGly: 4.403 ± 0.99
PheHis: 0.574 ± 0.271
PheIle: 1.34 ± 0.596
PheLys: 0.191 ± 0.181
PheLeu: 1.531 ± 0.747
PheMet: 0.191 ± 0.173
PheAsn: 0.383 ± 0.231
PhePro: 1.34 ± 0.477
PheGln: 0.574 ± 0.295
PheArg: 1.531 ± 0.47
PheSer: 1.149 ± 0.48
PheThr: 2.297 ± 0.549
PheVal: 1.531 ± 0.626
PheTrp: 0.383 ± 0.402
PheTyr: 0.574 ± 0.283
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.04 ± 1.239
GlyCys: 0.191 ± 0.193
GlyAsp: 6.891 ± 2.917
GlyGlu: 5.743 ± 0.864
GlyPhe: 2.297 ± 0.479
GlyGly: 6.508 ± 1.106
GlyHis: 3.063 ± 0.817
GlyIle: 4.02 ± 1.132
GlyLys: 1.723 ± 0.516
GlyLeu: 6.891 ± 0.966
GlyMet: 1.531 ± 0.54
GlyAsn: 2.68 ± 0.647
GlyPro: 10.911 ± 5.69
GlyGln: 3.254 ± 0.872
GlyArg: 6.126 ± 1.097
GlySer: 4.977 ± 0.834
GlyThr: 7.274 ± 1.14
GlyVal: 5.36 ± 1.06
GlyTrp: 2.871 ± 0.82
GlyTyr: 1.723 ± 0.543
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.871 ± 0.684
HisCys: 0.0 ± 0.0
HisAsp: 1.723 ± 0.535
HisGlu: 1.723 ± 0.528
HisPhe: 0.191 ± 0.173
HisGly: 1.531 ± 0.553
HisHis: 0.957 ± 0.368
HisIle: 0.766 ± 0.331
HisLys: 0.574 ± 0.356
HisLeu: 2.106 ± 0.687
HisMet: 0.383 ± 0.217
HisAsn: 0.574 ± 0.26
HisPro: 2.297 ± 0.753
HisGln: 0.957 ± 0.483
HisArg: 2.106 ± 0.619
HisSer: 0.957 ± 0.4
HisThr: 2.489 ± 0.679
HisVal: 1.149 ± 0.469
HisTrp: 0.191 ± 0.185
HisTyr: 0.191 ± 0.164
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.126 ± 0.946
IleCys: 0.383 ± 0.253
IleAsp: 4.02 ± 0.797
IleGlu: 3.446 ± 0.762
IlePhe: 0.191 ± 0.164
IleGly: 3.254 ± 0.936
IleHis: 1.723 ± 0.383
IleIle: 1.531 ± 0.49
IleLys: 0.766 ± 0.306
IleLeu: 2.489 ± 0.483
IleMet: 0.383 ± 0.229
IleAsn: 1.723 ± 0.643
IlePro: 2.297 ± 0.721
IleGln: 0.766 ± 0.369
IleArg: 3.446 ± 0.898
IleSer: 1.149 ± 0.443
IleThr: 3.063 ± 0.852
IleVal: 3.637 ± 0.601
IleTrp: 0.766 ± 0.403
IleTyr: 0.766 ± 0.35
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.637 ± 0.699
LysCys: 0.574 ± 0.326
LysAsp: 0.574 ± 0.297
LysGlu: 0.766 ± 0.341
LysPhe: 0.766 ± 0.32
LysGly: 4.211 ± 2.707
LysHis: 0.383 ± 0.271
LysIle: 0.766 ± 0.302
LysLys: 0.766 ± 0.472
LysLeu: 1.531 ± 0.436
LysMet: 0.383 ± 0.236
LysAsn: 0.574 ± 0.285
LysPro: 1.723 ± 0.474
LysGln: 0.574 ± 0.529
LysArg: 1.149 ± 0.396
LysSer: 0.766 ± 0.336
LysThr: 0.957 ± 0.251
LysVal: 1.531 ± 0.549
LysTrp: 0.766 ± 0.356
LysTyr: 0.191 ± 0.157
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.017 ± 1.8
LeuCys: 1.723 ± 0.641
LeuAsp: 4.403 ± 1.009
LeuGlu: 3.637 ± 0.859
LeuPhe: 1.914 ± 0.565
LeuGly: 5.168 ± 1.132
LeuHis: 2.871 ± 0.75
LeuIle: 4.594 ± 0.844
LeuLys: 1.531 ± 0.562
LeuLeu: 5.36 ± 1.054
LeuMet: 0.957 ± 0.515
LeuAsn: 1.914 ± 0.681
LeuPro: 4.403 ± 0.72
LeuGln: 1.34 ± 0.404
LeuArg: 4.977 ± 1.71
LeuSer: 4.02 ± 0.577
LeuThr: 6.508 ± 1.28
LeuVal: 5.36 ± 1.038
LeuTrp: 0.957 ± 0.429
LeuTyr: 1.149 ± 0.52
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.106 ± 0.59
MetCys: 0.0 ± 0.0
MetAsp: 0.383 ± 0.285
MetGlu: 0.383 ± 0.259
MetPhe: 0.766 ± 0.298
MetGly: 1.531 ± 0.62
MetHis: 0.191 ± 0.208
MetIle: 0.383 ± 0.236
MetLys: 0.383 ± 0.242
MetLeu: 0.957 ± 0.373
MetMet: 0.574 ± 0.387
MetAsn: 0.383 ± 0.352
MetPro: 1.34 ± 0.496
MetGln: 0.574 ± 0.321
MetArg: 1.531 ± 0.625
MetSer: 2.489 ± 0.534
MetThr: 1.723 ± 0.739
MetVal: 1.531 ± 0.68
MetTrp: 0.191 ± 0.192
MetTyr: 0.191 ± 0.181
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.786 ± 0.92
AsnCys: 0.0 ± 0.0
AsnAsp: 1.149 ± 0.4
AsnGlu: 0.383 ± 0.242
AsnPhe: 0.383 ± 0.255
AsnGly: 2.489 ± 0.68
AsnHis: 0.766 ± 0.423
AsnIle: 0.574 ± 0.355
AsnLys: 0.574 ± 0.3
AsnLeu: 2.106 ± 0.622
AsnMet: 0.0 ± 0.0
AsnAsn: 0.191 ± 0.157
AsnPro: 2.871 ± 0.567
AsnGln: 0.383 ± 0.24
AsnArg: 1.531 ± 0.323
AsnSer: 1.914 ± 0.503
AsnThr: 1.149 ± 0.602
AsnVal: 1.723 ± 0.533
AsnTrp: 0.766 ± 0.462
AsnTyr: 0.957 ± 0.492
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 12.443 ± 1.815
ProCys: 0.383 ± 0.41
ProAsp: 6.508 ± 1.122
ProGlu: 3.446 ± 0.854
ProPhe: 1.723 ± 0.428
ProGly: 6.317 ± 1.107
ProHis: 1.34 ± 0.475
ProIle: 3.637 ± 0.701
ProLys: 2.297 ± 1.396
ProLeu: 3.446 ± 0.903
ProMet: 1.34 ± 0.494
ProAsn: 1.149 ± 0.46
ProPro: 4.594 ± 0.75
ProGln: 1.34 ± 0.608
ProArg: 4.403 ± 1.036
ProSer: 3.446 ± 0.797
ProThr: 6.7 ± 1.299
ProVal: 4.786 ± 0.67
ProTrp: 1.531 ± 0.455
ProTyr: 1.149 ± 0.375
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.168 ± 1.02
GlnCys: 0.0 ± 0.0
GlnAsp: 0.957 ± 0.381
GlnGlu: 1.34 ± 0.453
GlnPhe: 2.106 ± 0.707
GlnGly: 2.297 ± 0.604
GlnHis: 0.574 ± 0.399
GlnIle: 1.149 ± 0.444
GlnLys: 0.383 ± 0.243
GlnLeu: 3.063 ± 0.73
GlnMet: 0.574 ± 0.28
GlnAsn: 0.574 ± 0.367
GlnPro: 2.68 ± 0.623
GlnGln: 0.766 ± 0.562
GlnArg: 2.297 ± 0.676
GlnSer: 1.149 ± 0.349
GlnThr: 2.871 ± 0.575
GlnVal: 1.914 ± 0.711
GlnTrp: 0.957 ± 0.418
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.528 ± 1.202
ArgCys: 0.574 ± 0.298
ArgAsp: 3.828 ± 0.97
ArgGlu: 3.446 ± 0.589
ArgPhe: 2.489 ± 0.823
ArgGly: 5.551 ± 1.006
ArgHis: 2.297 ± 0.639
ArgIle: 3.637 ± 0.961
ArgLys: 2.489 ± 0.766
ArgLeu: 6.7 ± 0.848
ArgMet: 2.106 ± 0.666
ArgAsn: 2.68 ± 0.686
ArgPro: 4.977 ± 1.307
ArgGln: 3.254 ± 0.848
ArgArg: 7.274 ± 1.557
ArgSer: 2.489 ± 0.692
ArgThr: 2.68 ± 0.707
ArgVal: 4.977 ± 1.131
ArgTrp: 2.489 ± 0.575
ArgTyr: 0.766 ± 0.293
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.743 ± 1.116
SerCys: 0.574 ± 0.341
SerAsp: 4.211 ± 0.743
SerGlu: 2.871 ± 0.712
SerPhe: 0.574 ± 0.399
SerGly: 2.871 ± 0.765
SerHis: 0.574 ± 0.265
SerIle: 1.531 ± 0.495
SerLys: 0.574 ± 0.343
SerLeu: 4.594 ± 0.838
SerMet: 0.574 ± 0.391
SerAsn: 1.34 ± 0.47
SerPro: 2.106 ± 0.441
SerGln: 1.723 ± 0.501
SerArg: 4.403 ± 1.111
SerSer: 1.34 ± 0.428
SerThr: 3.063 ± 0.697
SerVal: 3.637 ± 0.626
SerTrp: 1.34 ± 0.587
SerTyr: 1.34 ± 0.422
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.763 ± 1.167
ThrCys: 0.766 ± 0.393
ThrAsp: 4.977 ± 0.873
ThrGlu: 2.68 ± 0.734
ThrPhe: 3.063 ± 0.614
ThrGly: 7.848 ± 2.984
ThrHis: 0.574 ± 0.314
ThrIle: 3.446 ± 0.722
ThrLys: 1.531 ± 0.544
ThrLeu: 5.551 ± 1.531
ThrMet: 1.531 ± 0.432
ThrAsn: 1.914 ± 0.785
ThrPro: 6.317 ± 1.058
ThrGln: 1.34 ± 0.546
ThrArg: 5.934 ± 1.102
ThrSer: 3.063 ± 0.649
ThrThr: 5.168 ± 0.918
ThrVal: 3.446 ± 1.07
ThrTrp: 1.723 ± 0.462
ThrTyr: 1.34 ± 0.819
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.806 ± 1.244
ValCys: 0.957 ± 0.335
ValAsp: 6.508 ± 1.13
ValGlu: 2.871 ± 0.549
ValPhe: 1.914 ± 0.552
ValGly: 6.126 ± 0.885
ValHis: 1.149 ± 0.432
ValIle: 4.403 ± 0.696
ValLys: 1.531 ± 0.418
ValLeu: 3.637 ± 0.735
ValMet: 0.957 ± 0.485
ValAsn: 1.531 ± 0.478
ValPro: 5.168 ± 1.096
ValGln: 3.446 ± 0.658
ValArg: 4.786 ± 0.857
ValSer: 4.02 ± 0.794
ValThr: 5.743 ± 1.093
ValVal: 6.317 ± 1.553
ValTrp: 0.957 ± 0.629
ValTyr: 1.34 ± 0.525
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.446 ± 0.882
TrpCys: 0.191 ± 0.164
TrpAsp: 0.957 ± 0.393
TrpGlu: 0.957 ± 0.386
TrpPhe: 0.957 ± 0.452
TrpGly: 1.531 ± 0.535
TrpHis: 0.574 ± 0.395
TrpIle: 0.957 ± 0.408
TrpLys: 0.574 ± 0.26
TrpLeu: 1.723 ± 0.588
TrpMet: 0.766 ± 0.39
TrpAsn: 0.957 ± 0.369
TrpPro: 2.297 ± 0.881
TrpGln: 0.191 ± 0.17
TrpArg: 1.723 ± 0.567
TrpSer: 0.957 ± 0.324
TrpThr: 1.723 ± 0.565
TrpVal: 1.531 ± 0.613
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.574 ± 0.283
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.063 ± 0.845
TyrCys: 0.0 ± 0.0
TyrAsp: 1.149 ± 0.713
TyrGlu: 1.149 ± 0.425
TyrPhe: 0.957 ± 0.315
TyrGly: 1.914 ± 0.587
TyrHis: 0.383 ± 0.254
TyrIle: 0.957 ± 0.465
TyrLys: 0.574 ± 0.277
TyrLeu: 1.149 ± 0.476
TyrMet: 0.191 ± 0.188
TyrAsn: 0.191 ± 0.187
TyrPro: 0.766 ± 0.352
TyrGln: 0.574 ± 0.27
TyrArg: 1.723 ± 0.508
TyrSer: 1.149 ± 0.428
TyrThr: 1.531 ± 0.371
TyrVal: 1.149 ± 0.447
TyrTrp: 0.574 ± 0.317
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (5225 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
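A protein-level bootstrap of this kind might look like the sketch below. It is an assumed procedure rather than the database's actual code: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the sample standard deviation over 100 rounds is reported as the error. The function names, the toy sequences and the use of the standard deviation are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over a list of proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement, same number as the original set.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Made-up toy sequences (one-letter codes), not EpicDab proteins.
toy = ["MAAGAA", "MKKLAA", "MGPGPA", "MAAAAK"]
err = bootstrap_error(toy, "AA")
print(f"AA: {pair_per_mille(toy, 'AA'):.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins.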

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski