Amino acid dipepetide frequency for Streptococcus phage IPP63

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.196AlaAla: 4.196 ± 1.298
0.477AlaCys: 0.477 ± 0.226
5.627AlaAsp: 5.627 ± 0.973
5.818AlaGlu: 5.818 ± 0.591
3.433AlaPhe: 3.433 ± 0.572
3.815AlaGly: 3.815 ± 0.732
0.381AlaHis: 0.381 ± 0.212
5.436AlaIle: 5.436 ± 1.289
4.864AlaLys: 4.864 ± 0.782
7.725AlaLeu: 7.725 ± 1.032
2.003AlaMet: 2.003 ± 0.444
4.196AlaAsn: 4.196 ± 0.922
1.431AlaPro: 1.431 ± 0.328
3.529AlaGln: 3.529 ± 0.763
2.67AlaArg: 2.67 ± 0.42
2.957AlaSer: 2.957 ± 0.508
5.246AlaThr: 5.246 ± 1.095
4.292AlaVal: 4.292 ± 0.659
1.049AlaTrp: 1.049 ± 0.467
2.194AlaTyr: 2.194 ± 0.574
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.095CysCys: 0.095 ± 0.095
0.191CysAsp: 0.191 ± 0.151
0.668CysGlu: 0.668 ± 0.233
0.286CysPhe: 0.286 ± 0.24
0.286CysGly: 0.286 ± 0.118
0.095CysHis: 0.095 ± 0.102
0.381CysIle: 0.381 ± 0.239
0.381CysLys: 0.381 ± 0.2
0.381CysLeu: 0.381 ± 0.176
0.095CysMet: 0.095 ± 0.105
0.0CysAsn: 0.0 ± 0.0
0.095CysPro: 0.095 ± 0.104
0.191CysGln: 0.191 ± 0.124
0.477CysArg: 0.477 ± 0.247
0.286CysSer: 0.286 ± 0.194
0.286CysThr: 0.286 ± 0.169
0.191CysVal: 0.191 ± 0.126
0.0CysTrp: 0.0 ± 0.0
0.858CysTyr: 0.858 ± 0.325
0.0CysXaa: 0.0 ± 0.0
Asp
4.101AspAla: 4.101 ± 0.688
0.095AspCys: 0.095 ± 0.109
3.529AspAsp: 3.529 ± 0.835
3.91AspGlu: 3.91 ± 1.128
2.957AspPhe: 2.957 ± 0.583
5.055AspGly: 5.055 ± 0.726
1.144AspHis: 1.144 ± 0.304
5.627AspIle: 5.627 ± 0.641
4.101AspLys: 4.101 ± 0.445
4.578AspLeu: 4.578 ± 0.635
1.049AspMet: 1.049 ± 0.374
3.052AspAsn: 3.052 ± 0.82
1.431AspPro: 1.431 ± 0.429
1.621AspGln: 1.621 ± 0.371
1.907AspArg: 1.907 ± 0.56
3.529AspSer: 3.529 ± 0.602
2.384AspThr: 2.384 ± 0.445
3.147AspVal: 3.147 ± 0.601
1.144AspTrp: 1.144 ± 0.406
3.052AspTyr: 3.052 ± 0.553
0.0AspXaa: 0.0 ± 0.0
Glu
6.199GluAla: 6.199 ± 1.203
0.572GluCys: 0.572 ± 0.251
3.91GluAsp: 3.91 ± 0.735
6.772GluGlu: 6.772 ± 1.161
3.72GluPhe: 3.72 ± 0.521
2.957GluGly: 2.957 ± 0.689
0.668GluHis: 0.668 ± 0.293
6.581GluIle: 6.581 ± 0.894
7.153GluLys: 7.153 ± 1.045
9.537GluLeu: 9.537 ± 1.243
3.147GluMet: 3.147 ± 0.834
5.341GluAsn: 5.341 ± 0.728
1.431GluPro: 1.431 ± 0.487
3.624GluGln: 3.624 ± 0.842
3.338GluArg: 3.338 ± 0.725
4.483GluSer: 4.483 ± 0.687
4.578GluThr: 4.578 ± 0.801
4.101GluVal: 4.101 ± 0.734
1.144GluTrp: 1.144 ± 0.376
2.48GluTyr: 2.48 ± 0.5
0.0GluXaa: 0.0 ± 0.0
Phe
2.384PheAla: 2.384 ± 0.657
0.095PheCys: 0.095 ± 0.088
3.338PheAsp: 3.338 ± 0.591
2.957PheGlu: 2.957 ± 0.592
1.907PhePhe: 1.907 ± 0.734
2.289PheGly: 2.289 ± 0.549
0.286PheHis: 0.286 ± 0.136
3.243PheIle: 3.243 ± 0.793
4.959PheLys: 4.959 ± 0.587
3.052PheLeu: 3.052 ± 0.657
1.335PheMet: 1.335 ± 0.472
3.91PheAsn: 3.91 ± 0.741
1.049PhePro: 1.049 ± 0.353
1.526PheGln: 1.526 ± 0.519
1.717PheArg: 1.717 ± 0.293
2.861PheSer: 2.861 ± 0.799
2.194PheThr: 2.194 ± 0.325
2.384PheVal: 2.384 ± 0.451
0.763PheTrp: 0.763 ± 0.277
1.621PheTyr: 1.621 ± 0.477
0.0PheXaa: 0.0 ± 0.0
Gly
4.387GlyAla: 4.387 ± 0.72
0.095GlyCys: 0.095 ± 0.093
3.433GlyAsp: 3.433 ± 0.624
3.243GlyGlu: 3.243 ± 0.388
2.67GlyPhe: 2.67 ± 0.635
3.72GlyGly: 3.72 ± 0.437
1.144GlyHis: 1.144 ± 0.358
4.769GlyIle: 4.769 ± 0.756
4.578GlyLys: 4.578 ± 0.477
4.769GlyLeu: 4.769 ± 0.688
1.335GlyMet: 1.335 ± 0.361
3.147GlyAsn: 3.147 ± 0.683
0.954GlyPro: 0.954 ± 0.211
2.575GlyGln: 2.575 ± 0.697
3.91GlyArg: 3.91 ± 0.701
3.624GlySer: 3.624 ± 0.735
3.147GlyThr: 3.147 ± 0.571
3.624GlyVal: 3.624 ± 0.475
1.431GlyTrp: 1.431 ± 0.532
2.289GlyTyr: 2.289 ± 0.573
0.0GlyXaa: 0.0 ± 0.0
His
0.572HisAla: 0.572 ± 0.203
0.286HisCys: 0.286 ± 0.191
0.668HisAsp: 0.668 ± 0.293
1.335HisGlu: 1.335 ± 0.368
0.858HisPhe: 0.858 ± 0.281
0.763HisGly: 0.763 ± 0.267
0.191HisHis: 0.191 ± 0.119
1.526HisIle: 1.526 ± 0.325
0.572HisLys: 0.572 ± 0.262
1.621HisLeu: 1.621 ± 0.422
0.858HisMet: 0.858 ± 0.287
0.668HisAsn: 0.668 ± 0.23
0.477HisPro: 0.477 ± 0.215
0.572HisGln: 0.572 ± 0.291
0.477HisArg: 0.477 ± 0.248
1.144HisSer: 1.144 ± 0.472
0.763HisThr: 0.763 ± 0.32
1.144HisVal: 1.144 ± 0.355
0.095HisTrp: 0.095 ± 0.104
0.572HisTyr: 0.572 ± 0.261
0.0HisXaa: 0.0 ± 0.0
Ile
6.104IleAla: 6.104 ± 0.764
0.381IleCys: 0.381 ± 0.215
3.624IleAsp: 3.624 ± 0.737
5.341IleGlu: 5.341 ± 0.712
2.384IlePhe: 2.384 ± 0.533
3.529IleGly: 3.529 ± 0.536
0.763IleHis: 0.763 ± 0.204
4.101IleIle: 4.101 ± 0.802
6.676IleLys: 6.676 ± 0.873
6.199IleLeu: 6.199 ± 0.847
1.049IleMet: 1.049 ± 0.223
3.72IleAsn: 3.72 ± 0.51
2.003IlePro: 2.003 ± 0.323
2.575IleGln: 2.575 ± 0.475
3.529IleArg: 3.529 ± 0.616
6.295IleSer: 6.295 ± 0.889
4.578IleThr: 4.578 ± 0.578
3.624IleVal: 3.624 ± 0.672
0.191IleTrp: 0.191 ± 0.16
1.907IleTyr: 1.907 ± 0.511
0.0IleXaa: 0.0 ± 0.0
Lys
6.009LysAla: 6.009 ± 0.994
0.191LysCys: 0.191 ± 0.176
5.436LysAsp: 5.436 ± 0.749
6.676LysGlu: 6.676 ± 0.783
2.67LysPhe: 2.67 ± 0.576
4.578LysGly: 4.578 ± 0.844
2.289LysHis: 2.289 ± 0.467
4.483LysIle: 4.483 ± 0.831
5.436LysLys: 5.436 ± 0.975
6.676LysLeu: 6.676 ± 0.743
2.098LysMet: 2.098 ± 0.425
4.673LysAsn: 4.673 ± 0.56
2.861LysPro: 2.861 ± 0.608
3.338LysGln: 3.338 ± 0.536
4.387LysArg: 4.387 ± 0.789
4.769LysSer: 4.769 ± 0.601
5.913LysThr: 5.913 ± 1.099
4.673LysVal: 4.673 ± 0.774
1.144LysTrp: 1.144 ± 0.381
2.48LysTyr: 2.48 ± 0.51
0.0LysXaa: 0.0 ± 0.0
Leu
5.341LeuAla: 5.341 ± 0.87
0.286LeuCys: 0.286 ± 0.181
5.341LeuAsp: 5.341 ± 0.729
8.488LeuGlu: 8.488 ± 1.21
3.529LeuPhe: 3.529 ± 0.553
5.722LeuGly: 5.722 ± 0.655
1.144LeuHis: 1.144 ± 0.326
4.483LeuIle: 4.483 ± 0.727
7.344LeuLys: 7.344 ± 0.775
6.295LeuLeu: 6.295 ± 1.167
2.194LeuMet: 2.194 ± 0.616
4.769LeuAsn: 4.769 ± 0.631
2.48LeuPro: 2.48 ± 0.511
3.052LeuGln: 3.052 ± 0.831
5.818LeuArg: 5.818 ± 0.795
6.295LeuSer: 6.295 ± 1.009
5.15LeuThr: 5.15 ± 0.526
4.006LeuVal: 4.006 ± 0.583
0.858LeuTrp: 0.858 ± 0.29
3.338LeuTyr: 3.338 ± 0.453
0.0LeuXaa: 0.0 ± 0.0
Met
1.907MetAla: 1.907 ± 0.399
0.095MetCys: 0.095 ± 0.088
0.954MetAsp: 0.954 ± 0.321
2.384MetGlu: 2.384 ± 0.778
1.144MetPhe: 1.144 ± 0.444
1.144MetGly: 1.144 ± 0.268
0.095MetHis: 0.095 ± 0.101
1.431MetIle: 1.431 ± 0.488
2.289MetLys: 2.289 ± 0.46
1.907MetLeu: 1.907 ± 0.507
0.477MetMet: 0.477 ± 0.219
1.907MetAsn: 1.907 ± 0.503
0.668MetPro: 0.668 ± 0.259
0.763MetGln: 0.763 ± 0.257
1.335MetArg: 1.335 ± 0.449
1.335MetSer: 1.335 ± 0.41
2.003MetThr: 2.003 ± 0.464
1.812MetVal: 1.812 ± 0.466
0.095MetTrp: 0.095 ± 0.1
0.381MetTyr: 0.381 ± 0.199
0.0MetXaa: 0.0 ± 0.0
Asn
4.196AsnAla: 4.196 ± 0.818
0.381AsnCys: 0.381 ± 0.238
2.67AsnAsp: 2.67 ± 0.542
4.196AsnGlu: 4.196 ± 0.508
3.147AsnPhe: 3.147 ± 0.524
4.387AsnGly: 4.387 ± 0.734
1.526AsnHis: 1.526 ± 0.306
3.433AsnIle: 3.433 ± 0.514
4.864AsnLys: 4.864 ± 0.574
4.673AsnLeu: 4.673 ± 0.744
0.572AsnMet: 0.572 ± 0.3
2.194AsnAsn: 2.194 ± 0.404
2.289AsnPro: 2.289 ± 0.516
3.72AsnGln: 3.72 ± 0.579
2.67AsnArg: 2.67 ± 0.453
3.72AsnSer: 3.72 ± 0.754
2.575AsnThr: 2.575 ± 0.431
4.673AsnVal: 4.673 ± 0.743
0.763AsnTrp: 0.763 ± 0.305
1.049AsnTyr: 1.049 ± 0.343
0.0AsnXaa: 0.0 ± 0.0
Pro
1.812ProAla: 1.812 ± 0.341
0.286ProCys: 0.286 ± 0.164
2.098ProAsp: 2.098 ± 0.544
2.67ProGlu: 2.67 ± 0.513
0.858ProPhe: 0.858 ± 0.347
0.954ProGly: 0.954 ± 0.406
0.572ProHis: 0.572 ± 0.216
2.289ProIle: 2.289 ± 0.554
2.194ProLys: 2.194 ± 0.466
2.098ProLeu: 2.098 ± 0.53
0.191ProMet: 0.191 ± 0.127
0.668ProAsn: 0.668 ± 0.309
0.381ProPro: 0.381 ± 0.257
1.335ProGln: 1.335 ± 0.494
1.24ProArg: 1.24 ± 0.407
1.717ProSer: 1.717 ± 0.488
1.717ProThr: 1.717 ± 0.508
1.907ProVal: 1.907 ± 0.28
0.286ProTrp: 0.286 ± 0.145
1.144ProTyr: 1.144 ± 0.412
0.0ProXaa: 0.0 ± 0.0
Gln
4.196GlnAla: 4.196 ± 0.992
0.0GlnCys: 0.0 ± 0.0
1.717GlnAsp: 1.717 ± 0.43
4.387GlnGlu: 4.387 ± 0.708
1.812GlnPhe: 1.812 ± 0.401
2.194GlnGly: 2.194 ± 0.341
0.286GlnHis: 0.286 ± 0.188
2.003GlnIle: 2.003 ± 0.424
3.529GlnLys: 3.529 ± 0.571
2.861GlnLeu: 2.861 ± 0.483
1.431GlnMet: 1.431 ± 0.487
2.67GlnAsn: 2.67 ± 0.409
1.335GlnPro: 1.335 ± 0.448
1.144GlnGln: 1.144 ± 0.365
2.289GlnArg: 2.289 ± 0.648
3.624GlnSer: 3.624 ± 0.462
2.48GlnThr: 2.48 ± 0.4
2.575GlnVal: 2.575 ± 0.471
0.286GlnTrp: 0.286 ± 0.203
0.858GlnTyr: 0.858 ± 0.303
0.0GlnXaa: 0.0 ± 0.0
Arg
3.624ArgAla: 3.624 ± 0.62
0.286ArgCys: 0.286 ± 0.158
2.289ArgAsp: 2.289 ± 0.457
4.387ArgGlu: 4.387 ± 0.547
2.098ArgPhe: 2.098 ± 0.601
1.812ArgGly: 1.812 ± 0.326
0.668ArgHis: 0.668 ± 0.299
3.147ArgIle: 3.147 ± 0.499
3.91ArgLys: 3.91 ± 0.715
4.769ArgLeu: 4.769 ± 0.815
1.144ArgMet: 1.144 ± 0.305
3.624ArgAsn: 3.624 ± 0.458
1.717ArgPro: 1.717 ± 0.437
2.384ArgGln: 2.384 ± 0.825
2.384ArgArg: 2.384 ± 0.643
1.812ArgSer: 1.812 ± 0.417
3.052ArgThr: 3.052 ± 1.031
2.48ArgVal: 2.48 ± 0.549
0.858ArgTrp: 0.858 ± 0.285
2.289ArgTyr: 2.289 ± 0.548
0.0ArgXaa: 0.0 ± 0.0
Ser
5.818SerAla: 5.818 ± 0.939
0.286SerCys: 0.286 ± 0.149
3.624SerAsp: 3.624 ± 0.522
5.532SerGlu: 5.532 ± 0.855
3.147SerPhe: 3.147 ± 0.518
4.578SerGly: 4.578 ± 0.758
0.954SerHis: 0.954 ± 0.326
3.338SerIle: 3.338 ± 0.524
4.483SerLys: 4.483 ± 0.723
4.387SerLeu: 4.387 ± 0.639
1.717SerMet: 1.717 ± 0.704
3.624SerAsn: 3.624 ± 0.674
1.907SerPro: 1.907 ± 0.386
2.289SerGln: 2.289 ± 0.561
3.529SerArg: 3.529 ± 0.511
3.433SerSer: 3.433 ± 0.704
4.292SerThr: 4.292 ± 0.757
3.338SerVal: 3.338 ± 0.654
0.763SerTrp: 0.763 ± 0.19
1.907SerTyr: 1.907 ± 0.425
0.0SerXaa: 0.0 ± 0.0
Thr
4.959ThrAla: 4.959 ± 1.025
0.191ThrCys: 0.191 ± 0.142
3.815ThrAsp: 3.815 ± 0.548
4.196ThrGlu: 4.196 ± 0.592
3.624ThrPhe: 3.624 ± 0.984
4.292ThrGly: 4.292 ± 0.745
0.954ThrHis: 0.954 ± 0.342
4.578ThrIle: 4.578 ± 0.699
4.769ThrLys: 4.769 ± 0.665
5.722ThrLeu: 5.722 ± 0.769
1.144ThrMet: 1.144 ± 0.401
2.67ThrAsn: 2.67 ± 0.407
0.954ThrPro: 0.954 ± 0.253
2.384ThrGln: 2.384 ± 0.733
2.575ThrArg: 2.575 ± 0.401
4.101ThrSer: 4.101 ± 0.822
3.91ThrThr: 3.91 ± 0.779
4.673ThrVal: 4.673 ± 0.856
0.572ThrTrp: 0.572 ± 0.233
1.907ThrTyr: 1.907 ± 0.688
0.0ThrXaa: 0.0 ± 0.0
Val
2.861ValAla: 2.861 ± 0.535
0.477ValCys: 0.477 ± 0.197
3.052ValAsp: 3.052 ± 0.57
5.436ValGlu: 5.436 ± 0.597
1.621ValPhe: 1.621 ± 0.324
4.769ValGly: 4.769 ± 0.715
1.144ValHis: 1.144 ± 0.329
4.006ValIle: 4.006 ± 0.623
5.818ValLys: 5.818 ± 0.728
4.006ValLeu: 4.006 ± 0.918
1.049ValMet: 1.049 ± 0.336
3.243ValAsn: 3.243 ± 0.41
1.049ValPro: 1.049 ± 0.376
2.575ValGln: 2.575 ± 0.519
1.907ValArg: 1.907 ± 0.533
3.91ValSer: 3.91 ± 0.703
5.627ValThr: 5.627 ± 1.145
4.864ValVal: 4.864 ± 0.543
1.049ValTrp: 1.049 ± 0.393
1.24ValTyr: 1.24 ± 0.487
0.0ValXaa: 0.0 ± 0.0
Trp
0.572TrpAla: 0.572 ± 0.233
0.095TrpCys: 0.095 ± 0.102
0.381TrpAsp: 0.381 ± 0.172
0.954TrpGlu: 0.954 ± 0.358
0.572TrpPhe: 0.572 ± 0.309
0.763TrpGly: 0.763 ± 0.286
0.191TrpHis: 0.191 ± 0.122
1.24TrpIle: 1.24 ± 0.548
0.763TrpLys: 0.763 ± 0.337
0.858TrpLeu: 0.858 ± 0.265
0.381TrpMet: 0.381 ± 0.155
1.812TrpAsn: 1.812 ± 0.438
0.191TrpPro: 0.191 ± 0.122
0.668TrpGln: 0.668 ± 0.297
0.477TrpArg: 0.477 ± 0.189
0.954TrpSer: 0.954 ± 0.261
0.763TrpThr: 0.763 ± 0.227
0.668TrpVal: 0.668 ± 0.247
0.191TrpTrp: 0.191 ± 0.118
0.763TrpTyr: 0.763 ± 0.633
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.098TyrAla: 2.098 ± 0.437
0.572TyrCys: 0.572 ± 0.245
1.717TyrAsp: 1.717 ± 0.398
2.384TyrGlu: 2.384 ± 0.443
1.431TyrPhe: 1.431 ± 0.518
1.24TyrGly: 1.24 ± 0.373
0.668TyrHis: 0.668 ± 0.211
2.48TyrIle: 2.48 ± 0.418
2.098TyrLys: 2.098 ± 0.446
3.815TyrLeu: 3.815 ± 0.592
0.763TyrMet: 0.763 ± 0.335
1.812TyrAsn: 1.812 ± 0.45
1.717TyrPro: 1.717 ± 0.356
1.812TyrGln: 1.812 ± 0.525
2.194TyrArg: 2.194 ± 0.519
2.098TyrSer: 2.098 ± 0.491
1.335TyrThr: 1.335 ± 0.292
1.526TyrVal: 1.526 ± 0.301
0.572TyrTrp: 0.572 ± 0.217
2.194TyrTyr: 2.194 ± 0.819
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (10486 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski