Amino acid dipepetide frequency for Streptococcus phage phiARI0462

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.617AlaAla: 2.617 ± 0.815
0.181AlaCys: 0.181 ± 0.125
6.137AlaAsp: 6.137 ± 0.561
7.22AlaGlu: 7.22 ± 0.75
3.43AlaPhe: 3.43 ± 0.696
5.866AlaGly: 5.866 ± 0.944
0.632AlaHis: 0.632 ± 0.239
5.054AlaIle: 5.054 ± 1.048
5.686AlaLys: 5.686 ± 0.818
6.679AlaLeu: 6.679 ± 0.987
2.166AlaMet: 2.166 ± 0.499
3.7AlaAsn: 3.7 ± 0.735
1.986AlaPro: 1.986 ± 0.384
2.708AlaGln: 2.708 ± 0.594
2.978AlaArg: 2.978 ± 0.689
2.617AlaSer: 2.617 ± 0.905
4.513AlaThr: 4.513 ± 0.632
4.693AlaVal: 4.693 ± 0.902
1.444AlaTrp: 1.444 ± 0.423
1.805AlaTyr: 1.805 ± 0.4
0.0AlaXaa: 0.0 ± 0.0
Cys
0.361CysAla: 0.361 ± 0.169
0.271CysCys: 0.271 ± 0.165
0.451CysAsp: 0.451 ± 0.209
0.451CysGlu: 0.451 ± 0.17
0.271CysPhe: 0.271 ± 0.159
0.632CysGly: 0.632 ± 0.337
0.09CysHis: 0.09 ± 0.091
0.271CysIle: 0.271 ± 0.2
0.812CysLys: 0.812 ± 0.254
0.451CysLeu: 0.451 ± 0.175
0.09CysMet: 0.09 ± 0.123
0.09CysAsn: 0.09 ± 0.091
0.181CysPro: 0.181 ± 0.136
0.451CysGln: 0.451 ± 0.223
0.451CysArg: 0.451 ± 0.19
0.181CysSer: 0.181 ± 0.132
0.271CysThr: 0.271 ± 0.173
0.0CysVal: 0.0 ± 0.0
0.181CysTrp: 0.181 ± 0.111
0.271CysTyr: 0.271 ± 0.164
0.0CysXaa: 0.0 ± 0.0
Asp
4.152AspAla: 4.152 ± 0.625
0.722AspCys: 0.722 ± 0.306
3.43AspAsp: 3.43 ± 0.863
4.422AspGlu: 4.422 ± 1.179
2.888AspPhe: 2.888 ± 0.521
5.596AspGly: 5.596 ± 0.672
0.451AspHis: 0.451 ± 0.222
4.783AspIle: 4.783 ± 0.57
5.505AspLys: 5.505 ± 0.823
5.596AspLeu: 5.596 ± 0.703
2.166AspMet: 2.166 ± 0.459
2.708AspAsn: 2.708 ± 0.482
2.256AspPro: 2.256 ± 0.58
1.715AspGln: 1.715 ± 0.385
2.617AspArg: 2.617 ± 0.52
3.43AspSer: 3.43 ± 0.497
3.7AspThr: 3.7 ± 0.48
3.971AspVal: 3.971 ± 0.501
1.173AspTrp: 1.173 ± 0.365
3.52AspTyr: 3.52 ± 0.672
0.0AspXaa: 0.0 ± 0.0
Glu
6.318GluAla: 6.318 ± 1.037
0.451GluCys: 0.451 ± 0.19
4.061GluAsp: 4.061 ± 0.846
6.318GluGlu: 6.318 ± 1.363
3.791GluPhe: 3.791 ± 0.726
3.52GluGly: 3.52 ± 0.569
1.264GluHis: 1.264 ± 0.33
5.686GluIle: 5.686 ± 0.604
6.408GluLys: 6.408 ± 1.207
9.206GluLeu: 9.206 ± 1.186
1.625GluMet: 1.625 ± 0.474
4.242GluAsn: 4.242 ± 0.562
1.805GluPro: 1.805 ± 0.421
3.971GluGln: 3.971 ± 0.806
3.971GluArg: 3.971 ± 0.501
5.054GluSer: 5.054 ± 0.532
3.791GluThr: 3.791 ± 0.6
5.957GluVal: 5.957 ± 0.754
1.083GluTrp: 1.083 ± 0.281
2.347GluTyr: 2.347 ± 0.498
0.0GluXaa: 0.0 ± 0.0
Phe
2.708PheAla: 2.708 ± 0.733
0.181PheCys: 0.181 ± 0.147
4.783PheAsp: 4.783 ± 0.601
3.61PheGlu: 3.61 ± 0.638
1.173PhePhe: 1.173 ± 0.326
1.986PheGly: 1.986 ± 0.652
0.181PheHis: 0.181 ± 0.144
1.986PheIle: 1.986 ± 0.423
3.43PheLys: 3.43 ± 0.499
2.076PheLeu: 2.076 ± 0.327
1.083PheMet: 1.083 ± 0.355
2.888PheAsn: 2.888 ± 0.618
0.451PhePro: 0.451 ± 0.208
1.805PheGln: 1.805 ± 0.391
1.264PheArg: 1.264 ± 0.314
3.52PheSer: 3.52 ± 0.731
2.798PheThr: 2.798 ± 0.493
1.986PheVal: 1.986 ± 0.439
0.451PheTrp: 0.451 ± 0.192
1.895PheTyr: 1.895 ± 0.342
0.0PheXaa: 0.0 ± 0.0
Gly
3.249GlyAla: 3.249 ± 0.448
0.181GlyCys: 0.181 ± 0.108
4.061GlyAsp: 4.061 ± 0.617
4.964GlyGlu: 4.964 ± 0.885
2.437GlyPhe: 2.437 ± 0.697
5.325GlyGly: 5.325 ± 1.462
0.812GlyHis: 0.812 ± 0.268
3.791GlyIle: 3.791 ± 0.744
4.603GlyLys: 4.603 ± 0.636
6.047GlyLeu: 6.047 ± 1.109
2.076GlyMet: 2.076 ± 0.334
3.791GlyAsn: 3.791 ± 0.566
0.903GlyPro: 0.903 ± 0.28
4.152GlyGln: 4.152 ± 0.471
3.43GlyArg: 3.43 ± 0.598
4.332GlySer: 4.332 ± 0.751
3.43GlyThr: 3.43 ± 0.587
4.242GlyVal: 4.242 ± 0.932
1.083GlyTrp: 1.083 ± 0.522
3.159GlyTyr: 3.159 ± 0.58
0.0GlyXaa: 0.0 ± 0.0
His
0.993HisAla: 0.993 ± 0.371
0.09HisCys: 0.09 ± 0.114
0.812HisAsp: 0.812 ± 0.282
1.264HisGlu: 1.264 ± 0.364
0.542HisPhe: 0.542 ± 0.232
0.632HisGly: 0.632 ± 0.274
0.09HisHis: 0.09 ± 0.098
0.361HisIle: 0.361 ± 0.286
0.993HisLys: 0.993 ± 0.267
1.264HisLeu: 1.264 ± 0.386
0.09HisMet: 0.09 ± 0.088
0.903HisAsn: 0.903 ± 0.322
0.722HisPro: 0.722 ± 0.212
0.271HisGln: 0.271 ± 0.155
0.361HisArg: 0.361 ± 0.157
1.264HisSer: 1.264 ± 0.469
0.451HisThr: 0.451 ± 0.223
0.632HisVal: 0.632 ± 0.221
0.451HisTrp: 0.451 ± 0.234
0.542HisTyr: 0.542 ± 0.217
0.0HisXaa: 0.0 ± 0.0
Ile
5.596IleAla: 5.596 ± 0.916
0.451IleCys: 0.451 ± 0.16
3.881IleAsp: 3.881 ± 0.611
6.679IleGlu: 6.679 ± 0.756
2.798IlePhe: 2.798 ± 0.651
4.242IleGly: 4.242 ± 1.052
0.722IleHis: 0.722 ± 0.296
2.798IleIle: 2.798 ± 0.515
6.047IleLys: 6.047 ± 0.663
3.43IleLeu: 3.43 ± 0.649
1.173IleMet: 1.173 ± 0.326
4.242IleAsn: 4.242 ± 0.603
1.715IlePro: 1.715 ± 0.345
2.076IleGln: 2.076 ± 0.482
2.708IleArg: 2.708 ± 0.714
5.144IleSer: 5.144 ± 0.846
4.874IleThr: 4.874 ± 0.513
2.617IleVal: 2.617 ± 0.614
0.542IleTrp: 0.542 ± 0.291
1.715IleTyr: 1.715 ± 0.564
0.0IleXaa: 0.0 ± 0.0
Lys
6.137LysAla: 6.137 ± 0.962
0.451LysCys: 0.451 ± 0.217
6.408LysAsp: 6.408 ± 0.688
6.408LysGlu: 6.408 ± 0.769
2.798LysPhe: 2.798 ± 0.571
4.332LysGly: 4.332 ± 0.569
0.993LysHis: 0.993 ± 0.269
5.776LysIle: 5.776 ± 0.841
7.581LysLys: 7.581 ± 1.016
6.859LysLeu: 6.859 ± 0.833
3.159LysMet: 3.159 ± 0.559
5.054LysAsn: 5.054 ± 0.518
2.527LysPro: 2.527 ± 0.73
4.061LysGln: 4.061 ± 0.788
3.52LysArg: 3.52 ± 0.526
4.513LysSer: 4.513 ± 0.811
4.152LysThr: 4.152 ± 0.517
6.408LysVal: 6.408 ± 0.695
1.264LysTrp: 1.264 ± 0.501
3.7LysTyr: 3.7 ± 0.554
0.0LysXaa: 0.0 ± 0.0
Leu
6.859LeuAla: 6.859 ± 1.132
0.542LeuCys: 0.542 ± 0.299
6.498LeuAsp: 6.498 ± 0.757
7.22LeuGlu: 7.22 ± 0.929
2.166LeuPhe: 2.166 ± 0.421
5.866LeuGly: 5.866 ± 1.424
1.264LeuHis: 1.264 ± 0.288
4.332LeuIle: 4.332 ± 0.561
7.04LeuLys: 7.04 ± 0.889
6.949LeuLeu: 6.949 ± 0.919
1.715LeuMet: 1.715 ± 0.426
3.791LeuAsn: 3.791 ± 0.708
2.076LeuPro: 2.076 ± 0.571
2.888LeuGln: 2.888 ± 0.58
5.054LeuArg: 5.054 ± 0.78
5.235LeuSer: 5.235 ± 1.145
4.422LeuThr: 4.422 ± 0.865
3.61LeuVal: 3.61 ± 0.462
0.451LeuTrp: 0.451 ± 0.192
2.437LeuTyr: 2.437 ± 0.343
0.0LeuXaa: 0.0 ± 0.0
Met
1.895MetAla: 1.895 ± 0.451
0.0MetCys: 0.0 ± 0.0
1.264MetAsp: 1.264 ± 0.276
1.534MetGlu: 1.534 ± 0.362
1.173MetPhe: 1.173 ± 0.321
1.444MetGly: 1.444 ± 0.465
0.361MetHis: 0.361 ± 0.23
1.895MetIle: 1.895 ± 0.386
2.888MetLys: 2.888 ± 0.676
1.625MetLeu: 1.625 ± 0.274
0.271MetMet: 0.271 ± 0.168
1.083MetAsn: 1.083 ± 0.369
0.903MetPro: 0.903 ± 0.271
1.173MetGln: 1.173 ± 0.393
0.812MetArg: 0.812 ± 0.296
1.354MetSer: 1.354 ± 0.4
1.805MetThr: 1.805 ± 0.425
1.173MetVal: 1.173 ± 0.259
0.271MetTrp: 0.271 ± 0.165
0.812MetTyr: 0.812 ± 0.226
0.0MetXaa: 0.0 ± 0.0
Asn
4.693AsnAla: 4.693 ± 1.059
0.361AsnCys: 0.361 ± 0.164
2.527AsnAsp: 2.527 ± 0.497
3.7AsnGlu: 3.7 ± 0.633
1.986AsnPhe: 1.986 ± 0.485
4.242AsnGly: 4.242 ± 0.693
1.173AsnHis: 1.173 ± 0.377
3.249AsnIle: 3.249 ± 0.473
4.152AsnLys: 4.152 ± 0.65
3.7AsnLeu: 3.7 ± 0.762
1.083AsnMet: 1.083 ± 0.314
2.076AsnAsn: 2.076 ± 0.478
1.986AsnPro: 1.986 ± 0.379
2.978AsnGln: 2.978 ± 0.645
2.076AsnArg: 2.076 ± 0.51
3.069AsnSer: 3.069 ± 0.629
3.52AsnThr: 3.52 ± 0.581
3.52AsnVal: 3.52 ± 0.441
0.993AsnTrp: 0.993 ± 0.23
1.444AsnTyr: 1.444 ± 0.287
0.0AsnXaa: 0.0 ± 0.0
Pro
2.256ProAla: 2.256 ± 0.525
0.181ProCys: 0.181 ± 0.139
2.437ProAsp: 2.437 ± 0.608
2.708ProGlu: 2.708 ± 0.421
1.354ProPhe: 1.354 ± 0.422
0.812ProGly: 0.812 ± 0.211
0.361ProHis: 0.361 ± 0.143
1.354ProIle: 1.354 ± 0.514
3.069ProLys: 3.069 ± 0.546
1.264ProLeu: 1.264 ± 0.291
0.271ProMet: 0.271 ± 0.173
1.444ProAsn: 1.444 ± 0.435
0.812ProPro: 0.812 ± 0.322
1.083ProGln: 1.083 ± 0.394
1.534ProArg: 1.534 ± 0.413
0.993ProSer: 0.993 ± 0.38
0.542ProThr: 0.542 ± 0.209
1.715ProVal: 1.715 ± 0.455
0.451ProTrp: 0.451 ± 0.203
1.083ProTyr: 1.083 ± 0.403
0.0ProXaa: 0.0 ± 0.0
Gln
3.43GlnAla: 3.43 ± 0.487
0.361GlnCys: 0.361 ± 0.179
1.895GlnAsp: 1.895 ± 0.394
4.513GlnGlu: 4.513 ± 0.774
1.354GlnPhe: 1.354 ± 0.395
2.617GlnGly: 2.617 ± 0.399
0.271GlnHis: 0.271 ± 0.148
3.881GlnIle: 3.881 ± 0.599
3.971GlnLys: 3.971 ± 0.617
3.339GlnLeu: 3.339 ± 0.481
0.632GlnMet: 0.632 ± 0.199
1.715GlnAsn: 1.715 ± 0.389
1.083GlnPro: 1.083 ± 0.433
2.166GlnGln: 2.166 ± 0.581
1.986GlnArg: 1.986 ± 0.501
2.708GlnSer: 2.708 ± 0.368
2.978GlnThr: 2.978 ± 0.549
3.7GlnVal: 3.7 ± 0.599
0.632GlnTrp: 0.632 ± 0.237
1.083GlnTyr: 1.083 ± 0.383
0.0GlnXaa: 0.0 ± 0.0
Arg
3.249ArgAla: 3.249 ± 0.405
0.181ArgCys: 0.181 ± 0.096
2.166ArgAsp: 2.166 ± 0.472
3.43ArgGlu: 3.43 ± 0.656
1.715ArgPhe: 1.715 ± 0.474
2.166ArgGly: 2.166 ± 0.533
0.361ArgHis: 0.361 ± 0.178
3.61ArgIle: 3.61 ± 0.604
3.971ArgLys: 3.971 ± 0.748
5.054ArgLeu: 5.054 ± 0.828
1.625ArgMet: 1.625 ± 0.395
2.437ArgAsn: 2.437 ± 0.583
1.083ArgPro: 1.083 ± 0.251
2.527ArgGln: 2.527 ± 0.51
2.076ArgArg: 2.076 ± 0.576
2.076ArgSer: 2.076 ± 0.469
2.347ArgThr: 2.347 ± 0.714
2.256ArgVal: 2.256 ± 0.354
0.542ArgTrp: 0.542 ± 0.198
1.534ArgTyr: 1.534 ± 0.414
0.0ArgXaa: 0.0 ± 0.0
Ser
4.603SerAla: 4.603 ± 1.089
0.271SerCys: 0.271 ± 0.171
3.159SerAsp: 3.159 ± 0.449
4.152SerGlu: 4.152 ± 0.605
2.347SerPhe: 2.347 ± 0.447
5.415SerGly: 5.415 ± 0.891
1.173SerHis: 1.173 ± 0.411
3.7SerIle: 3.7 ± 0.637
4.783SerLys: 4.783 ± 0.789
4.603SerLeu: 4.603 ± 0.722
1.173SerMet: 1.173 ± 0.48
2.798SerAsn: 2.798 ± 0.523
1.173SerPro: 1.173 ± 0.265
3.069SerGln: 3.069 ± 0.548
3.249SerArg: 3.249 ± 0.677
3.249SerSer: 3.249 ± 0.748
4.603SerThr: 4.603 ± 0.508
3.069SerVal: 3.069 ± 0.776
0.903SerTrp: 0.903 ± 0.412
3.159SerTyr: 3.159 ± 0.673
0.0SerXaa: 0.0 ± 0.0
Thr
4.874ThrAla: 4.874 ± 1.023
0.181ThrCys: 0.181 ± 0.143
4.693ThrAsp: 4.693 ± 0.557
3.61ThrGlu: 3.61 ± 0.508
2.978ThrPhe: 2.978 ± 0.744
3.881ThrGly: 3.881 ± 0.661
0.632ThrHis: 0.632 ± 0.227
4.693ThrIle: 4.693 ± 0.647
4.242ThrLys: 4.242 ± 0.659
5.054ThrLeu: 5.054 ± 0.579
0.903ThrMet: 0.903 ± 0.34
3.52ThrAsn: 3.52 ± 0.489
1.173ThrPro: 1.173 ± 0.365
2.978ThrGln: 2.978 ± 0.61
1.264ThrArg: 1.264 ± 0.274
4.152ThrSer: 4.152 ± 0.691
4.783ThrThr: 4.783 ± 0.965
3.61ThrVal: 3.61 ± 0.86
1.354ThrTrp: 1.354 ± 0.348
2.708ThrTyr: 2.708 ± 0.587
0.0ThrXaa: 0.0 ± 0.0
Val
5.054ValAla: 5.054 ± 0.741
0.09ValCys: 0.09 ± 0.091
3.339ValAsp: 3.339 ± 0.51
5.144ValGlu: 5.144 ± 0.781
2.437ValPhe: 2.437 ± 0.472
4.693ValGly: 4.693 ± 0.772
0.812ValHis: 0.812 ± 0.401
2.888ValIle: 2.888 ± 0.432
5.776ValLys: 5.776 ± 0.616
3.339ValLeu: 3.339 ± 0.534
0.993ValMet: 0.993 ± 0.323
3.881ValAsn: 3.881 ± 0.838
1.715ValPro: 1.715 ± 0.271
1.625ValGln: 1.625 ± 0.37
2.256ValArg: 2.256 ± 0.373
4.603ValSer: 4.603 ± 0.744
5.144ValThr: 5.144 ± 0.67
4.693ValVal: 4.693 ± 0.744
0.632ValTrp: 0.632 ± 0.276
2.617ValTyr: 2.617 ± 0.593
0.0ValXaa: 0.0 ± 0.0
Trp
1.264TrpAla: 1.264 ± 0.351
0.361TrpCys: 0.361 ± 0.165
0.542TrpAsp: 0.542 ± 0.294
0.542TrpGlu: 0.542 ± 0.243
1.354TrpPhe: 1.354 ± 0.566
0.903TrpGly: 0.903 ± 0.272
0.0TrpHis: 0.0 ± 0.0
0.812TrpIle: 0.812 ± 0.329
1.173TrpLys: 1.173 ± 0.458
0.903TrpLeu: 0.903 ± 0.336
0.451TrpMet: 0.451 ± 0.237
1.083TrpAsn: 1.083 ± 0.327
0.0TrpPro: 0.0 ± 0.0
0.722TrpGln: 0.722 ± 0.338
0.903TrpArg: 0.903 ± 0.337
0.632TrpSer: 0.632 ± 0.266
0.903TrpThr: 0.903 ± 0.312
0.993TrpVal: 0.993 ± 0.28
0.09TrpTrp: 0.09 ± 0.071
0.722TrpTyr: 0.722 ± 0.639
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.805TyrAla: 1.805 ± 0.405
0.722TyrCys: 0.722 ± 0.339
2.256TyrAsp: 2.256 ± 0.433
2.978TyrGlu: 2.978 ± 0.561
1.264TyrPhe: 1.264 ± 0.302
1.895TyrGly: 1.895 ± 0.381
1.083TyrHis: 1.083 ± 0.278
2.617TyrIle: 2.617 ± 0.492
3.971TyrLys: 3.971 ± 0.785
2.978TyrLeu: 2.978 ± 0.561
0.903TyrMet: 0.903 ± 0.369
1.083TyrAsn: 1.083 ± 0.31
1.173TyrPro: 1.173 ± 0.404
1.895TyrGln: 1.895 ± 0.479
1.895TyrArg: 1.895 ± 0.531
2.527TyrSer: 2.527 ± 0.584
2.256TyrThr: 2.256 ± 0.372
2.888TyrVal: 2.888 ± 0.564
0.361TyrTrp: 0.361 ± 0.212
1.715TyrTyr: 1.715 ± 0.601
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (11081 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski