Amino acid dipepetide frequency for Marinobacter phage PS3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.256AlaAla: 11.256 ± 1.681
0.798AlaCys: 0.798 ± 0.243
6.945AlaAsp: 6.945 ± 0.845
7.184AlaGlu: 7.184 ± 0.963
3.991AlaPhe: 3.991 ± 0.671
8.781AlaGly: 8.781 ± 1.304
1.118AlaHis: 1.118 ± 0.294
5.748AlaIle: 5.748 ± 1.065
4.47AlaLys: 4.47 ± 0.526
7.903AlaLeu: 7.903 ± 0.747
2.315AlaMet: 2.315 ± 0.344
3.113AlaAsn: 3.113 ± 0.53
3.592AlaPro: 3.592 ± 0.525
3.672AlaGln: 3.672 ± 0.649
5.588AlaArg: 5.588 ± 0.72
7.264AlaSer: 7.264 ± 0.802
5.269AlaThr: 5.269 ± 0.748
5.748AlaVal: 5.748 ± 0.929
2.395AlaTrp: 2.395 ± 0.449
2.395AlaTyr: 2.395 ± 0.44
0.0AlaXaa: 0.0 ± 0.0
Cys
0.639CysAla: 0.639 ± 0.212
0.08CysCys: 0.08 ± 0.079
0.718CysAsp: 0.718 ± 0.212
0.639CysGlu: 0.639 ± 0.189
0.479CysPhe: 0.479 ± 0.184
0.559CysGly: 0.559 ± 0.26
0.16CysHis: 0.16 ± 0.11
0.639CysIle: 0.639 ± 0.245
0.319CysLys: 0.319 ± 0.16
0.718CysLeu: 0.718 ± 0.266
0.239CysMet: 0.239 ± 0.135
0.718CysAsn: 0.718 ± 0.262
0.798CysPro: 0.798 ± 0.297
0.16CysGln: 0.16 ± 0.131
0.399CysArg: 0.399 ± 0.153
0.399CysSer: 0.399 ± 0.178
0.559CysThr: 0.559 ± 0.233
0.639CysVal: 0.639 ± 0.23
0.0CysTrp: 0.0 ± 0.0
0.08CysTyr: 0.08 ± 0.078
0.0CysXaa: 0.0 ± 0.0
Asp
6.067AspAla: 6.067 ± 0.71
0.798AspCys: 0.798 ± 0.278
4.231AspAsp: 4.231 ± 0.655
4.47AspGlu: 4.47 ± 0.528
1.916AspPhe: 1.916 ± 0.358
5.987AspGly: 5.987 ± 0.773
0.878AspHis: 0.878 ± 0.297
3.273AspIle: 3.273 ± 0.495
2.076AspLys: 2.076 ± 0.458
5.907AspLeu: 5.907 ± 0.83
1.517AspMet: 1.517 ± 0.388
2.076AspAsn: 2.076 ± 0.442
3.353AspPro: 3.353 ± 0.548
3.832AspGln: 3.832 ± 0.549
2.954AspArg: 2.954 ± 0.564
4.71AspSer: 4.71 ± 0.569
3.912AspThr: 3.912 ± 0.866
3.832AspVal: 3.832 ± 0.616
0.958AspTrp: 0.958 ± 0.277
2.155AspTyr: 2.155 ± 0.414
0.0AspXaa: 0.0 ± 0.0
Glu
6.227GluAla: 6.227 ± 0.919
0.479GluCys: 0.479 ± 0.192
3.592GluAsp: 3.592 ± 0.543
3.433GluGlu: 3.433 ± 0.508
1.996GluPhe: 1.996 ± 0.416
5.029GluGly: 5.029 ± 0.539
0.559GluHis: 0.559 ± 0.222
4.55GluIle: 4.55 ± 0.708
3.592GluLys: 3.592 ± 0.64
6.706GluLeu: 6.706 ± 0.747
2.076GluMet: 2.076 ± 0.566
2.235GluAsn: 2.235 ± 0.364
2.475GluPro: 2.475 ± 0.484
2.714GluGln: 2.714 ± 0.556
4.311GluArg: 4.311 ± 0.683
5.907GluSer: 5.907 ± 0.684
3.193GluThr: 3.193 ± 0.577
3.912GluVal: 3.912 ± 0.547
1.277GluTrp: 1.277 ± 0.354
2.315GluTyr: 2.315 ± 0.475
0.0GluXaa: 0.0 ± 0.0
Phe
2.554PheAla: 2.554 ± 0.559
0.559PheCys: 0.559 ± 0.19
2.554PheAsp: 2.554 ± 0.541
2.714PheGlu: 2.714 ± 0.386
0.718PhePhe: 0.718 ± 0.214
3.512PheGly: 3.512 ± 0.65
0.798PheHis: 0.798 ± 0.315
1.357PheIle: 1.357 ± 0.329
1.357PheLys: 1.357 ± 0.35
2.475PheLeu: 2.475 ± 0.43
0.559PheMet: 0.559 ± 0.166
1.676PheAsn: 1.676 ± 0.31
1.357PhePro: 1.357 ± 0.424
1.836PheGln: 1.836 ± 0.42
1.916PheArg: 1.916 ± 0.323
2.794PheSer: 2.794 ± 0.434
3.033PheThr: 3.033 ± 0.507
2.155PheVal: 2.155 ± 0.345
0.479PheTrp: 0.479 ± 0.187
0.798PheTyr: 0.798 ± 0.23
0.0PheXaa: 0.0 ± 0.0
Gly
6.865GlyAla: 6.865 ± 0.876
0.639GlyCys: 0.639 ± 0.299
5.508GlyAsp: 5.508 ± 0.644
6.227GlyGlu: 6.227 ± 0.626
3.273GlyPhe: 3.273 ± 0.461
7.584GlyGly: 7.584 ± 0.768
1.437GlyHis: 1.437 ± 0.308
3.832GlyIle: 3.832 ± 0.706
2.714GlyLys: 2.714 ± 0.389
7.344GlyLeu: 7.344 ± 1.205
1.676GlyMet: 1.676 ± 0.366
3.033GlyAsn: 3.033 ± 0.439
3.113GlyPro: 3.113 ± 0.673
4.231GlyGln: 4.231 ± 0.454
4.311GlyArg: 4.311 ± 0.687
6.785GlySer: 6.785 ± 1.014
5.827GlyThr: 5.827 ± 0.725
6.386GlyVal: 6.386 ± 0.851
1.197GlyTrp: 1.197 ± 0.355
3.193GlyTyr: 3.193 ± 0.437
0.0GlyXaa: 0.0 ± 0.0
His
1.197HisAla: 1.197 ± 0.276
0.399HisCys: 0.399 ± 0.211
1.517HisAsp: 1.517 ± 0.324
1.437HisGlu: 1.437 ± 0.35
0.878HisPhe: 0.878 ± 0.231
0.878HisGly: 0.878 ± 0.305
0.399HisHis: 0.399 ± 0.207
1.118HisIle: 1.118 ± 0.331
0.479HisLys: 0.479 ± 0.176
1.756HisLeu: 1.756 ± 0.419
0.559HisMet: 0.559 ± 0.217
0.718HisAsn: 0.718 ± 0.276
0.639HisPro: 0.639 ± 0.247
0.319HisGln: 0.319 ± 0.159
1.277HisArg: 1.277 ± 0.292
0.878HisSer: 0.878 ± 0.299
0.878HisThr: 0.878 ± 0.215
1.118HisVal: 1.118 ± 0.416
0.399HisTrp: 0.399 ± 0.148
0.319HisTyr: 0.319 ± 0.175
0.0HisXaa: 0.0 ± 0.0
Ile
5.748IleAla: 5.748 ± 1.125
0.479IleCys: 0.479 ± 0.243
3.752IleAsp: 3.752 ± 0.74
3.113IleGlu: 3.113 ± 0.506
0.958IlePhe: 0.958 ± 0.242
3.991IleGly: 3.991 ± 0.815
1.118IleHis: 1.118 ± 0.425
1.517IleIle: 1.517 ± 0.302
1.756IleLys: 1.756 ± 0.401
3.752IleLeu: 3.752 ± 0.539
0.399IleMet: 0.399 ± 0.177
2.155IleAsn: 2.155 ± 0.434
2.395IlePro: 2.395 ± 0.413
1.676IleGln: 1.676 ± 0.297
2.954IleArg: 2.954 ± 0.428
3.273IleSer: 3.273 ± 0.59
4.071IleThr: 4.071 ± 0.662
2.954IleVal: 2.954 ± 0.445
0.718IleTrp: 0.718 ± 0.211
1.517IleTyr: 1.517 ± 0.353
0.0IleXaa: 0.0 ± 0.0
Lys
5.029LysAla: 5.029 ± 0.808
0.399LysCys: 0.399 ± 0.176
3.113LysAsp: 3.113 ± 0.528
2.634LysGlu: 2.634 ± 0.486
1.836LysPhe: 1.836 ± 0.342
2.395LysGly: 2.395 ± 0.429
0.878LysHis: 0.878 ± 0.266
1.437LysIle: 1.437 ± 0.322
1.756LysLys: 1.756 ± 0.386
3.991LysLeu: 3.991 ± 0.492
0.878LysMet: 0.878 ± 0.267
1.597LysAsn: 1.597 ± 0.399
2.076LysPro: 2.076 ± 0.534
1.996LysGln: 1.996 ± 0.397
2.395LysArg: 2.395 ± 0.541
1.916LysSer: 1.916 ± 0.407
3.033LysThr: 3.033 ± 0.473
3.273LysVal: 3.273 ± 0.603
0.958LysTrp: 0.958 ± 0.234
1.118LysTyr: 1.118 ± 0.372
0.0LysXaa: 0.0 ± 0.0
Leu
9.499LeuAla: 9.499 ± 0.991
0.798LeuCys: 0.798 ± 0.256
5.109LeuAsp: 5.109 ± 0.713
7.264LeuGlu: 7.264 ± 0.788
3.273LeuPhe: 3.273 ± 0.407
6.386LeuGly: 6.386 ± 0.828
1.357LeuHis: 1.357 ± 0.381
3.672LeuIle: 3.672 ± 0.595
3.912LeuLys: 3.912 ± 0.541
6.466LeuLeu: 6.466 ± 0.876
2.235LeuMet: 2.235 ± 0.454
4.47LeuAsn: 4.47 ± 0.635
3.113LeuPro: 3.113 ± 0.493
3.672LeuGln: 3.672 ± 0.559
3.991LeuArg: 3.991 ± 0.587
6.067LeuSer: 6.067 ± 0.678
5.508LeuThr: 5.508 ± 0.703
6.067LeuVal: 6.067 ± 0.578
1.197LeuTrp: 1.197 ± 0.397
1.597LeuTyr: 1.597 ± 0.377
0.0LeuXaa: 0.0 ± 0.0
Met
2.235MetAla: 2.235 ± 0.333
0.239MetCys: 0.239 ± 0.126
1.197MetAsp: 1.197 ± 0.278
1.437MetGlu: 1.437 ± 0.304
0.399MetPhe: 0.399 ± 0.186
1.277MetGly: 1.277 ± 0.366
0.399MetHis: 0.399 ± 0.164
0.718MetIle: 0.718 ± 0.272
0.958MetLys: 0.958 ± 0.318
1.597MetLeu: 1.597 ± 0.37
0.559MetMet: 0.559 ± 0.231
1.038MetAsn: 1.038 ± 0.27
0.958MetPro: 0.958 ± 0.266
0.559MetGln: 0.559 ± 0.183
1.676MetArg: 1.676 ± 0.392
1.836MetSer: 1.836 ± 0.33
2.155MetThr: 2.155 ± 0.443
1.277MetVal: 1.277 ± 0.318
0.08MetTrp: 0.08 ± 0.072
0.239MetTyr: 0.239 ± 0.129
0.0MetXaa: 0.0 ± 0.0
Asn
3.672AsnAla: 3.672 ± 0.676
0.239AsnCys: 0.239 ± 0.154
1.836AsnAsp: 1.836 ± 0.427
2.155AsnGlu: 2.155 ± 0.404
1.357AsnPhe: 1.357 ± 0.382
3.991AsnGly: 3.991 ± 0.637
0.798AsnHis: 0.798 ± 0.333
1.357AsnIle: 1.357 ± 0.271
1.517AsnLys: 1.517 ± 0.328
3.512AsnLeu: 3.512 ± 0.483
0.559AsnMet: 0.559 ± 0.187
1.676AsnAsn: 1.676 ± 0.44
1.756AsnPro: 1.756 ± 0.374
1.836AsnGln: 1.836 ± 0.407
2.315AsnArg: 2.315 ± 0.352
3.033AsnSer: 3.033 ± 0.514
2.395AsnThr: 2.395 ± 0.412
2.235AsnVal: 2.235 ± 0.506
0.559AsnTrp: 0.559 ± 0.185
0.559AsnTyr: 0.559 ± 0.193
0.0AsnXaa: 0.0 ± 0.0
Pro
4.71ProAla: 4.71 ± 0.729
0.399ProCys: 0.399 ± 0.203
3.353ProAsp: 3.353 ± 0.621
3.193ProGlu: 3.193 ± 0.532
1.437ProPhe: 1.437 ± 0.363
5.508ProGly: 5.508 ± 0.795
0.798ProHis: 0.798 ± 0.266
1.916ProIle: 1.916 ± 0.365
1.756ProLys: 1.756 ± 0.484
3.193ProLeu: 3.193 ± 0.636
1.277ProMet: 1.277 ± 0.23
1.197ProAsn: 1.197 ± 0.282
2.554ProPro: 2.554 ± 0.433
1.277ProGln: 1.277 ± 0.252
2.076ProArg: 2.076 ± 0.571
2.634ProSer: 2.634 ± 0.478
2.315ProThr: 2.315 ± 0.406
3.512ProVal: 3.512 ± 0.633
0.718ProTrp: 0.718 ± 0.272
1.197ProTyr: 1.197 ± 0.434
0.0ProXaa: 0.0 ± 0.0
Gln
4.311GlnAla: 4.311 ± 0.572
0.399GlnCys: 0.399 ± 0.232
2.076GlnAsp: 2.076 ± 0.463
2.475GlnGlu: 2.475 ± 0.519
1.517GlnPhe: 1.517 ± 0.308
2.794GlnGly: 2.794 ± 0.464
0.878GlnHis: 0.878 ± 0.24
2.395GlnIle: 2.395 ± 0.484
2.634GlnLys: 2.634 ± 0.435
3.592GlnLeu: 3.592 ± 0.522
0.798GlnMet: 0.798 ± 0.333
1.597GlnAsn: 1.597 ± 0.393
2.315GlnPro: 2.315 ± 0.368
2.155GlnGln: 2.155 ± 0.518
3.353GlnArg: 3.353 ± 0.568
2.155GlnSer: 2.155 ± 0.425
2.475GlnThr: 2.475 ± 0.431
2.794GlnVal: 2.794 ± 0.426
0.319GlnTrp: 0.319 ± 0.207
1.038GlnTyr: 1.038 ± 0.339
0.0GlnXaa: 0.0 ± 0.0
Arg
5.428ArgAla: 5.428 ± 0.751
0.639ArgCys: 0.639 ± 0.232
2.954ArgAsp: 2.954 ± 0.494
3.433ArgGlu: 3.433 ± 0.608
2.315ArgPhe: 2.315 ± 0.386
4.231ArgGly: 4.231 ± 0.474
1.277ArgHis: 1.277 ± 0.404
2.954ArgIle: 2.954 ± 0.424
2.954ArgLys: 2.954 ± 0.613
5.748ArgLeu: 5.748 ± 0.738
1.277ArgMet: 1.277 ± 0.331
1.996ArgAsn: 1.996 ± 0.429
2.475ArgPro: 2.475 ± 0.649
1.916ArgGln: 1.916 ± 0.381
4.151ArgArg: 4.151 ± 0.765
4.79ArgSer: 4.79 ± 0.727
2.554ArgThr: 2.554 ± 0.41
4.231ArgVal: 4.231 ± 0.508
0.878ArgTrp: 0.878 ± 0.292
1.996ArgTyr: 1.996 ± 0.332
0.0ArgXaa: 0.0 ± 0.0
Ser
7.504SerAla: 7.504 ± 0.978
0.399SerCys: 0.399 ± 0.188
4.869SerAsp: 4.869 ± 0.615
4.311SerGlu: 4.311 ± 0.713
2.235SerPhe: 2.235 ± 0.463
7.663SerGly: 7.663 ± 1.055
0.958SerHis: 0.958 ± 0.288
3.273SerIle: 3.273 ± 0.678
2.554SerLys: 2.554 ± 0.44
5.748SerLeu: 5.748 ± 0.614
1.118SerMet: 1.118 ± 0.322
2.554SerAsn: 2.554 ± 0.5
3.273SerPro: 3.273 ± 0.473
3.193SerGln: 3.193 ± 0.504
3.512SerArg: 3.512 ± 0.566
4.63SerSer: 4.63 ± 1.092
3.752SerThr: 3.752 ± 0.501
5.269SerVal: 5.269 ± 0.8
1.357SerTrp: 1.357 ± 0.385
1.756SerTyr: 1.756 ± 0.395
0.0SerXaa: 0.0 ± 0.0
Thr
7.025ThrAla: 7.025 ± 0.797
0.08ThrCys: 0.08 ± 0.078
3.672ThrAsp: 3.672 ± 0.541
3.193ThrGlu: 3.193 ± 0.467
2.076ThrPhe: 2.076 ± 0.45
6.785ThrGly: 6.785 ± 0.917
1.118ThrHis: 1.118 ± 0.281
3.193ThrIle: 3.193 ± 0.594
2.874ThrLys: 2.874 ± 0.53
5.748ThrLeu: 5.748 ± 0.805
0.639ThrMet: 0.639 ± 0.225
1.996ThrAsn: 1.996 ± 0.341
3.273ThrPro: 3.273 ± 0.692
2.315ThrGln: 2.315 ± 0.422
3.033ThrArg: 3.033 ± 0.551
3.592ThrSer: 3.592 ± 0.529
3.991ThrThr: 3.991 ± 0.724
4.391ThrVal: 4.391 ± 0.766
0.718ThrTrp: 0.718 ± 0.251
2.155ThrTyr: 2.155 ± 0.464
0.0ThrXaa: 0.0 ± 0.0
Val
6.546ValAla: 6.546 ± 0.717
0.639ValCys: 0.639 ± 0.282
4.79ValAsp: 4.79 ± 0.7
4.391ValGlu: 4.391 ± 0.513
2.714ValPhe: 2.714 ± 0.469
4.869ValGly: 4.869 ± 0.661
1.517ValHis: 1.517 ± 0.369
3.512ValIle: 3.512 ± 0.522
2.954ValLys: 2.954 ± 0.584
5.109ValLeu: 5.109 ± 0.662
1.277ValMet: 1.277 ± 0.321
2.395ValAsn: 2.395 ± 0.467
3.273ValPro: 3.273 ± 0.729
2.315ValGln: 2.315 ± 0.428
5.109ValArg: 5.109 ± 0.836
4.71ValSer: 4.71 ± 0.668
4.071ValThr: 4.071 ± 0.85
4.55ValVal: 4.55 ± 0.646
0.639ValTrp: 0.639 ± 0.223
2.076ValTyr: 2.076 ± 0.407
0.0ValXaa: 0.0 ± 0.0
Trp
0.399TrpAla: 0.399 ± 0.173
0.16TrpCys: 0.16 ± 0.104
1.597TrpAsp: 1.597 ± 0.359
0.878TrpGlu: 0.878 ± 0.221
0.639TrpPhe: 0.639 ± 0.21
0.958TrpGly: 0.958 ± 0.318
0.16TrpHis: 0.16 ± 0.117
0.718TrpIle: 0.718 ± 0.271
0.958TrpLys: 0.958 ± 0.331
1.996TrpLeu: 1.996 ± 0.391
0.16TrpMet: 0.16 ± 0.106
0.479TrpAsn: 0.479 ± 0.178
0.878TrpPro: 0.878 ± 0.314
0.958TrpGln: 0.958 ± 0.235
1.038TrpArg: 1.038 ± 0.318
0.718TrpSer: 0.718 ± 0.281
1.118TrpThr: 1.118 ± 0.275
1.277TrpVal: 1.277 ± 0.358
0.16TrpTrp: 0.16 ± 0.113
0.319TrpTyr: 0.319 ± 0.14
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.634TyrAla: 2.634 ± 0.359
0.319TyrCys: 0.319 ± 0.141
1.517TyrAsp: 1.517 ± 0.362
1.996TyrGlu: 1.996 ± 0.474
1.197TyrPhe: 1.197 ± 0.395
1.996TyrGly: 1.996 ± 0.305
0.559TyrHis: 0.559 ± 0.21
1.118TyrIle: 1.118 ± 0.379
1.118TyrLys: 1.118 ± 0.256
2.395TyrLeu: 2.395 ± 0.518
0.639TyrMet: 0.639 ± 0.181
0.639TyrAsn: 0.639 ± 0.202
1.357TyrPro: 1.357 ± 0.344
1.517TyrGln: 1.517 ± 0.305
1.836TyrArg: 1.836 ± 0.393
1.836TyrSer: 1.836 ± 0.31
1.756TyrThr: 1.756 ± 0.497
1.836TyrVal: 1.836 ± 0.355
0.559TyrTrp: 0.559 ± 0.182
0.718TyrTyr: 0.718 ± 0.304
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (12528 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski