Amino acid dipepetide frequency for Staphylococcus phage phiSa2wa_st22

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.325AlaAla: 2.325 ± 0.691
0.249AlaCys: 0.249 ± 0.136
2.74AlaAsp: 2.74 ± 0.397
3.902AlaGlu: 3.902 ± 0.721
2.574AlaPhe: 2.574 ± 0.68
4.068AlaGly: 4.068 ± 0.701
0.581AlaHis: 0.581 ± 0.184
4.318AlaIle: 4.318 ± 0.693
4.567AlaLys: 4.567 ± 0.563
4.65AlaLeu: 4.65 ± 0.608
1.744AlaMet: 1.744 ± 0.441
3.57AlaAsn: 3.57 ± 0.594
1.411AlaPro: 1.411 ± 0.295
2.906AlaGln: 2.906 ± 0.571
2.408AlaArg: 2.408 ± 0.46
3.321AlaSer: 3.321 ± 0.481
3.404AlaThr: 3.404 ± 0.634
2.906AlaVal: 2.906 ± 0.537
0.747AlaTrp: 0.747 ± 0.233
2.076AlaTyr: 2.076 ± 0.357
0.0AlaXaa: 0.0 ± 0.0
Cys
0.249CysAla: 0.249 ± 0.165
0.083CysCys: 0.083 ± 0.08
0.0CysAsp: 0.0 ± 0.0
0.332CysGlu: 0.332 ± 0.15
0.083CysPhe: 0.083 ± 0.08
0.249CysGly: 0.249 ± 0.13
0.083CysHis: 0.083 ± 0.082
0.83CysIle: 0.83 ± 0.232
0.415CysLys: 0.415 ± 0.193
0.332CysLeu: 0.332 ± 0.166
0.0CysMet: 0.0 ± 0.0
0.332CysAsn: 0.332 ± 0.153
0.166CysPro: 0.166 ± 0.116
0.166CysGln: 0.166 ± 0.118
0.0CysArg: 0.0 ± 0.0
0.332CysSer: 0.332 ± 0.177
0.249CysThr: 0.249 ± 0.128
0.083CysVal: 0.083 ± 0.073
0.083CysTrp: 0.083 ± 0.079
0.249CysTyr: 0.249 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
2.906AspAla: 2.906 ± 0.454
0.332AspCys: 0.332 ± 0.151
4.234AspAsp: 4.234 ± 0.775
5.895AspGlu: 5.895 ± 1.048
3.487AspPhe: 3.487 ± 0.583
4.899AspGly: 4.899 ± 0.919
0.83AspHis: 0.83 ± 0.242
4.733AspIle: 4.733 ± 0.563
5.314AspLys: 5.314 ± 0.717
5.48AspLeu: 5.48 ± 0.664
1.661AspMet: 1.661 ± 0.31
4.068AspAsn: 4.068 ± 0.641
1.827AspPro: 1.827 ± 0.367
0.747AspGln: 0.747 ± 0.198
2.574AspArg: 2.574 ± 0.607
3.072AspSer: 3.072 ± 0.548
3.57AspThr: 3.57 ± 0.51
3.736AspVal: 3.736 ± 0.642
0.747AspTrp: 0.747 ± 0.219
2.906AspTyr: 2.906 ± 0.526
0.0AspXaa: 0.0 ± 0.0
Glu
4.401GluAla: 4.401 ± 0.847
0.249GluCys: 0.249 ± 0.15
3.736GluAsp: 3.736 ± 0.624
6.31GluGlu: 6.31 ± 1.048
2.823GluPhe: 2.823 ± 0.462
2.491GluGly: 2.491 ± 0.463
1.162GluHis: 1.162 ± 0.287
6.559GluIle: 6.559 ± 0.958
7.639GluLys: 7.639 ± 1.015
7.805GluLeu: 7.805 ± 1.064
2.325GluMet: 2.325 ± 0.449
5.563GluAsn: 5.563 ± 0.748
1.245GluPro: 1.245 ± 0.274
3.238GluGln: 3.238 ± 0.813
4.567GluArg: 4.567 ± 0.817
4.151GluSer: 4.151 ± 0.569
3.985GluThr: 3.985 ± 0.904
4.318GluVal: 4.318 ± 0.47
0.747GluTrp: 0.747 ± 0.218
3.819GluTyr: 3.819 ± 0.635
0.0GluXaa: 0.0 ± 0.0
Phe
2.159PheAla: 2.159 ± 0.6
0.166PheCys: 0.166 ± 0.124
2.657PheAsp: 2.657 ± 0.424
3.902PheGlu: 3.902 ± 0.619
1.079PhePhe: 1.079 ± 0.276
1.993PheGly: 1.993 ± 0.384
0.498PheHis: 0.498 ± 0.203
3.902PheIle: 3.902 ± 0.546
4.484PheLys: 4.484 ± 0.638
2.491PheLeu: 2.491 ± 0.436
1.411PheMet: 1.411 ± 0.373
4.401PheAsn: 4.401 ± 0.521
0.913PhePro: 0.913 ± 0.287
0.664PheGln: 0.664 ± 0.203
1.245PheArg: 1.245 ± 0.313
2.989PheSer: 2.989 ± 0.728
2.574PheThr: 2.574 ± 0.511
2.076PheVal: 2.076 ± 0.555
0.166PheTrp: 0.166 ± 0.124
1.411PheTyr: 1.411 ± 0.335
0.0PheXaa: 0.0 ± 0.0
Gly
3.072GlyAla: 3.072 ± 0.598
0.166GlyCys: 0.166 ± 0.108
3.487GlyAsp: 3.487 ± 0.63
3.487GlyGlu: 3.487 ± 0.715
2.491GlyPhe: 2.491 ± 0.555
4.484GlyGly: 4.484 ± 1.008
1.495GlyHis: 1.495 ± 0.447
4.234GlyIle: 4.234 ± 0.799
5.646GlyLys: 5.646 ± 0.799
5.065GlyLeu: 5.065 ± 0.917
1.245GlyMet: 1.245 ± 0.3
3.819GlyAsn: 3.819 ± 0.661
1.411GlyPro: 1.411 ± 0.357
1.744GlyGln: 1.744 ± 0.495
2.906GlyArg: 2.906 ± 0.58
2.657GlySer: 2.657 ± 0.391
2.491GlyThr: 2.491 ± 0.491
3.736GlyVal: 3.736 ± 0.683
1.079GlyTrp: 1.079 ± 0.268
2.657GlyTyr: 2.657 ± 0.511
0.0GlyXaa: 0.0 ± 0.0
His
1.162HisAla: 1.162 ± 0.321
0.0HisCys: 0.0 ± 0.0
1.079HisAsp: 1.079 ± 0.291
1.245HisGlu: 1.245 ± 0.337
1.079HisPhe: 1.079 ± 0.283
0.747HisGly: 0.747 ± 0.326
0.249HisHis: 0.249 ± 0.151
1.91HisIle: 1.91 ± 0.328
1.411HisLys: 1.411 ± 0.347
1.245HisLeu: 1.245 ± 0.265
0.083HisMet: 0.083 ± 0.082
0.996HisAsn: 0.996 ± 0.304
0.498HisPro: 0.498 ± 0.184
0.664HisGln: 0.664 ± 0.299
0.415HisArg: 0.415 ± 0.182
1.079HisSer: 1.079 ± 0.23
0.913HisThr: 0.913 ± 0.31
1.245HisVal: 1.245 ± 0.408
0.332HisTrp: 0.332 ± 0.155
0.913HisTyr: 0.913 ± 0.253
0.0HisXaa: 0.0 ± 0.0
Ile
5.48IleAla: 5.48 ± 0.766
0.249IleCys: 0.249 ± 0.115
5.231IleAsp: 5.231 ± 0.647
6.725IleGlu: 6.725 ± 0.599
2.906IlePhe: 2.906 ± 0.548
3.985IleGly: 3.985 ± 0.608
1.827IleHis: 1.827 ± 0.461
4.234IleIle: 4.234 ± 0.696
10.13IleLys: 10.13 ± 0.73
4.567IleLeu: 4.567 ± 0.518
1.578IleMet: 1.578 ± 0.307
6.725IleAsn: 6.725 ± 1.154
1.91IlePro: 1.91 ± 0.397
3.57IleGln: 3.57 ± 0.537
2.491IleArg: 2.491 ± 0.458
5.148IleSer: 5.148 ± 0.541
4.567IleThr: 4.567 ± 0.542
5.231IleVal: 5.231 ± 0.556
1.079IleTrp: 1.079 ± 0.503
2.491IleTyr: 2.491 ± 0.507
0.0IleXaa: 0.0 ± 0.0
Lys
5.812LysAla: 5.812 ± 0.571
0.166LysCys: 0.166 ± 0.112
6.642LysAsp: 6.642 ± 0.639
7.722LysGlu: 7.722 ± 0.947
3.321LysPhe: 3.321 ± 0.451
4.982LysGly: 4.982 ± 1.016
1.245LysHis: 1.245 ± 0.343
6.725LysIle: 6.725 ± 0.972
8.22LysLys: 8.22 ± 1.04
7.307LysLeu: 7.307 ± 0.645
2.242LysMet: 2.242 ± 0.415
5.48LysAsn: 5.48 ± 0.923
2.906LysPro: 2.906 ± 0.521
4.982LysGln: 4.982 ± 0.668
4.65LysArg: 4.65 ± 0.621
5.065LysSer: 5.065 ± 0.612
6.061LysThr: 6.061 ± 0.776
5.812LysVal: 5.812 ± 0.737
1.411LysTrp: 1.411 ± 0.326
4.733LysTyr: 4.733 ± 0.752
0.0LysXaa: 0.0 ± 0.0
Leu
2.823LeuAla: 2.823 ± 0.479
0.498LeuCys: 0.498 ± 0.176
5.231LeuAsp: 5.231 ± 0.518
7.722LeuGlu: 7.722 ± 0.878
3.321LeuPhe: 3.321 ± 0.461
3.653LeuGly: 3.653 ± 0.673
1.245LeuHis: 1.245 ± 0.216
6.31LeuIle: 6.31 ± 0.926
7.971LeuLys: 7.971 ± 0.907
6.31LeuLeu: 6.31 ± 0.867
1.91LeuMet: 1.91 ± 0.442
5.978LeuAsn: 5.978 ± 0.808
2.325LeuPro: 2.325 ± 0.513
3.072LeuGln: 3.072 ± 0.422
3.238LeuArg: 3.238 ± 0.528
5.148LeuSer: 5.148 ± 0.734
4.816LeuThr: 4.816 ± 0.722
4.068LeuVal: 4.068 ± 0.674
0.581LeuTrp: 0.581 ± 0.233
2.823LeuTyr: 2.823 ± 0.627
0.0LeuXaa: 0.0 ± 0.0
Met
1.079MetAla: 1.079 ± 0.346
0.083MetCys: 0.083 ± 0.073
1.744MetAsp: 1.744 ± 0.384
1.079MetGlu: 1.079 ± 0.3
1.079MetPhe: 1.079 ± 0.268
1.328MetGly: 1.328 ± 0.564
0.166MetHis: 0.166 ± 0.115
2.242MetIle: 2.242 ± 0.446
1.661MetLys: 1.661 ± 0.475
2.076MetLeu: 2.076 ± 0.315
0.664MetMet: 0.664 ± 0.284
1.744MetAsn: 1.744 ± 0.379
0.996MetPro: 0.996 ± 0.339
1.162MetGln: 1.162 ± 0.277
1.495MetArg: 1.495 ± 0.272
1.661MetSer: 1.661 ± 0.312
1.411MetThr: 1.411 ± 0.396
1.578MetVal: 1.578 ± 0.342
0.415MetTrp: 0.415 ± 0.169
0.664MetTyr: 0.664 ± 0.24
0.0MetXaa: 0.0 ± 0.0
Asn
4.318AsnAla: 4.318 ± 0.552
0.083AsnCys: 0.083 ± 0.089
4.567AsnAsp: 4.567 ± 0.677
4.816AsnGlu: 4.816 ± 0.933
2.159AsnPhe: 2.159 ± 0.634
5.231AsnGly: 5.231 ± 0.471
0.996AsnHis: 0.996 ± 0.326
4.733AsnIle: 4.733 ± 0.688
7.473AsnLys: 7.473 ± 0.875
4.65AsnLeu: 4.65 ± 0.505
1.744AsnMet: 1.744 ± 0.334
5.148AsnAsn: 5.148 ± 0.714
3.404AsnPro: 3.404 ± 0.337
3.653AsnGln: 3.653 ± 0.651
3.072AsnArg: 3.072 ± 0.509
3.819AsnSer: 3.819 ± 0.567
3.155AsnThr: 3.155 ± 0.42
4.484AsnVal: 4.484 ± 0.855
0.83AsnTrp: 0.83 ± 0.356
3.487AsnTyr: 3.487 ± 0.668
0.0AsnXaa: 0.0 ± 0.0
Pro
0.913ProAla: 0.913 ± 0.333
0.0ProCys: 0.0 ± 0.0
1.411ProAsp: 1.411 ± 0.283
2.491ProGlu: 2.491 ± 0.369
1.578ProPhe: 1.578 ± 0.387
0.913ProGly: 0.913 ± 0.242
0.581ProHis: 0.581 ± 0.198
2.408ProIle: 2.408 ± 0.432
3.072ProLys: 3.072 ± 0.548
1.661ProLeu: 1.661 ± 0.381
0.83ProMet: 0.83 ± 0.207
1.328ProAsn: 1.328 ± 0.302
0.996ProPro: 0.996 ± 0.229
0.498ProGln: 0.498 ± 0.167
1.162ProArg: 1.162 ± 0.262
1.91ProSer: 1.91 ± 0.409
0.996ProThr: 0.996 ± 0.312
1.91ProVal: 1.91 ± 0.331
0.166ProTrp: 0.166 ± 0.097
1.245ProTyr: 1.245 ± 0.254
0.0ProXaa: 0.0 ± 0.0
Gln
3.238GlnAla: 3.238 ± 0.44
0.415GlnCys: 0.415 ± 0.181
2.242GlnAsp: 2.242 ± 0.555
2.74GlnGlu: 2.74 ± 0.527
1.162GlnPhe: 1.162 ± 0.247
2.159GlnGly: 2.159 ± 0.434
0.913GlnHis: 0.913 ± 0.305
3.155GlnIle: 3.155 ± 0.397
2.823GlnLys: 2.823 ± 0.529
3.155GlnLeu: 3.155 ± 0.473
0.747GlnMet: 0.747 ± 0.236
3.487GlnAsn: 3.487 ± 0.607
0.664GlnPro: 0.664 ± 0.19
2.159GlnGln: 2.159 ± 0.526
2.076GlnArg: 2.076 ± 0.352
2.74GlnSer: 2.74 ± 0.388
2.159GlnThr: 2.159 ± 0.454
1.827GlnVal: 1.827 ± 0.343
0.498GlnTrp: 0.498 ± 0.191
1.245GlnTyr: 1.245 ± 0.312
0.0GlnXaa: 0.0 ± 0.0
Arg
2.325ArgAla: 2.325 ± 0.386
0.083ArgCys: 0.083 ± 0.084
2.989ArgAsp: 2.989 ± 0.471
3.487ArgGlu: 3.487 ± 0.655
1.91ArgPhe: 1.91 ± 0.428
1.91ArgGly: 1.91 ± 0.404
0.664ArgHis: 0.664 ± 0.229
3.985ArgIle: 3.985 ± 0.618
3.072ArgLys: 3.072 ± 0.573
3.819ArgLeu: 3.819 ± 0.523
0.83ArgMet: 0.83 ± 0.221
3.238ArgAsn: 3.238 ± 0.395
0.498ArgPro: 0.498 ± 0.19
1.411ArgGln: 1.411 ± 0.381
2.076ArgArg: 2.076 ± 0.362
1.661ArgSer: 1.661 ± 0.372
2.574ArgThr: 2.574 ± 0.587
2.325ArgVal: 2.325 ± 0.438
0.747ArgTrp: 0.747 ± 0.236
3.072ArgTyr: 3.072 ± 0.684
0.0ArgXaa: 0.0 ± 0.0
Ser
2.989SerAla: 2.989 ± 0.449
0.332SerCys: 0.332 ± 0.17
4.234SerAsp: 4.234 ± 0.79
5.48SerGlu: 5.48 ± 0.785
2.989SerPhe: 2.989 ± 0.695
3.155SerGly: 3.155 ± 0.773
1.411SerHis: 1.411 ± 0.317
5.895SerIle: 5.895 ± 0.853
5.48SerLys: 5.48 ± 0.861
3.902SerLeu: 3.902 ± 0.594
1.245SerMet: 1.245 ± 0.252
4.733SerAsn: 4.733 ± 0.627
0.581SerPro: 0.581 ± 0.224
2.574SerGln: 2.574 ± 0.428
1.91SerArg: 1.91 ± 0.391
3.238SerSer: 3.238 ± 0.472
3.321SerThr: 3.321 ± 0.63
2.574SerVal: 2.574 ± 0.433
0.166SerTrp: 0.166 ± 0.108
2.076SerTyr: 2.076 ± 0.501
0.0SerXaa: 0.0 ± 0.0
Thr
3.487ThrAla: 3.487 ± 0.562
0.249ThrCys: 0.249 ± 0.143
3.404ThrAsp: 3.404 ± 1.0
3.072ThrGlu: 3.072 ± 0.482
2.242ThrPhe: 2.242 ± 0.379
3.902ThrGly: 3.902 ± 0.724
1.495ThrHis: 1.495 ± 0.404
4.816ThrIle: 4.816 ± 0.533
4.484ThrLys: 4.484 ± 0.591
4.234ThrLeu: 4.234 ± 0.452
0.996ThrMet: 0.996 ± 0.231
3.072ThrAsn: 3.072 ± 0.59
2.325ThrPro: 2.325 ± 0.388
1.91ThrGln: 1.91 ± 0.444
2.491ThrArg: 2.491 ± 0.453
4.151ThrSer: 4.151 ± 0.717
3.57ThrThr: 3.57 ± 0.733
3.404ThrVal: 3.404 ± 0.398
0.83ThrTrp: 0.83 ± 0.248
2.989ThrTyr: 2.989 ± 0.607
0.0ThrXaa: 0.0 ± 0.0
Val
3.238ValAla: 3.238 ± 0.606
0.332ValCys: 0.332 ± 0.154
4.151ValAsp: 4.151 ± 0.588
2.906ValGlu: 2.906 ± 0.487
1.993ValPhe: 1.993 ± 0.356
3.487ValGly: 3.487 ± 0.482
0.996ValHis: 0.996 ± 0.345
4.068ValIle: 4.068 ± 0.598
6.393ValLys: 6.393 ± 0.717
5.314ValLeu: 5.314 ± 0.695
1.578ValMet: 1.578 ± 0.314
4.151ValAsn: 4.151 ± 0.494
1.162ValPro: 1.162 ± 0.27
2.242ValGln: 2.242 ± 0.532
1.661ValArg: 1.661 ± 0.311
2.906ValSer: 2.906 ± 0.514
4.068ValThr: 4.068 ± 0.512
3.487ValVal: 3.487 ± 0.584
0.747ValTrp: 0.747 ± 0.283
1.91ValTyr: 1.91 ± 0.283
0.0ValXaa: 0.0 ± 0.0
Trp
0.249TrpAla: 0.249 ± 0.137
0.0TrpCys: 0.0 ± 0.0
0.747TrpAsp: 0.747 ± 0.215
0.913TrpGlu: 0.913 ± 0.208
0.913TrpPhe: 0.913 ± 0.256
0.913TrpGly: 0.913 ± 0.268
0.0TrpHis: 0.0 ± 0.0
1.411TrpIle: 1.411 ± 0.277
0.83TrpLys: 0.83 ± 0.216
1.411TrpLeu: 1.411 ± 0.346
0.415TrpMet: 0.415 ± 0.162
0.913TrpAsn: 0.913 ± 0.249
0.166TrpPro: 0.166 ± 0.104
0.498TrpGln: 0.498 ± 0.183
0.415TrpArg: 0.415 ± 0.177
0.83TrpSer: 0.83 ± 0.3
0.415TrpThr: 0.415 ± 0.133
0.415TrpVal: 0.415 ± 0.132
0.083TrpTrp: 0.083 ± 0.081
0.415TrpTyr: 0.415 ± 0.128
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.242TyrAla: 2.242 ± 0.309
0.581TyrCys: 0.581 ± 0.216
2.491TyrAsp: 2.491 ± 0.619
2.906TyrGlu: 2.906 ± 0.53
1.993TyrPhe: 1.993 ± 0.455
2.823TyrGly: 2.823 ± 0.557
0.83TyrHis: 0.83 ± 0.264
3.653TyrIle: 3.653 ± 0.616
4.401TyrLys: 4.401 ± 0.518
3.57TyrLeu: 3.57 ± 0.518
1.079TyrMet: 1.079 ± 0.324
3.072TyrAsn: 3.072 ± 0.528
0.664TyrPro: 0.664 ± 0.241
1.744TyrGln: 1.744 ± 0.36
1.827TyrArg: 1.827 ± 0.43
2.408TyrSer: 2.408 ± 0.447
2.823TyrThr: 2.823 ± 0.392
1.578TyrVal: 1.578 ± 0.384
0.498TyrTrp: 0.498 ± 0.167
0.747TyrTyr: 0.747 ± 0.267
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (12045 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski