Amino acid dipeptide frequency for Staphylococcus phage SA97

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
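As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and expresses them in per mille. It assumes one-letter amino acid codes and normalisation by the total number of dipeptides; the exact normalisation used for the table is not stated here, and the function name and toy sequences are invented for the example.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille for a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy sequences, invented for illustration only.
freqs = dipeptide_permille(["MKKLA", "MKLA"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

Because the windows overlap, a protein of length n contributes n - 1 dipeptides.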

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely to occur in proteins, each would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
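The arithmetic behind the expected value can be sketched as follows. The comparison against a flat 2.5‰ baseline is an assumption made for illustration; the page does not state the exact rule used for colouring, and the `classify` helper is hypothetical.

```python
N_STANDARD = 20

# Uniform expectation in per mille: 1000 / 400 = 2.5
expected_permille = 1000.0 / N_STANDARD ** 2
# With Xaa as a 21st residue: 1000 / 441, roughly 2.27
expected_with_xaa = 1000.0 / (N_STANDARD + 1) ** 2

def classify(value_permille, expected=expected_permille):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"

# Values taken from the table: LysLys = 8.248, ProPro = 0.417
print(classify(8.248))
print(classify(0.417))
```

This simple comparison ignores the bootstrap error reported for each value, so frequencies close to 2.5‰ should be interpreted with care.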

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
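A minimal sketch of the unit conversion, using the LysLys value from the table as an example (the helper names are made up for illustration):

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0

lyslys = 8.248  # LysLys frequency from the table, in per mille
print(permille_to_fraction(lyslys))
print(permille_to_percent(lyslys))
```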

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.666AlaAla: 1.666 ± 0.527
0.333AlaCys: 0.333 ± 0.152
3.416AlaAsp: 3.416 ± 0.496
4.249AlaGlu: 4.249 ± 0.665
2.749AlaPhe: 2.749 ± 0.478
3.166AlaGly: 3.166 ± 0.519
1.083AlaHis: 1.083 ± 0.262
5.332AlaIle: 5.332 ± 0.921
5.332AlaLys: 5.332 ± 0.806
4.166AlaLeu: 4.166 ± 0.704
1.75AlaMet: 1.75 ± 0.45
4.749AlaAsn: 4.749 ± 0.551
1.25AlaPro: 1.25 ± 0.282
2.249AlaGln: 2.249 ± 0.481
2.666AlaArg: 2.666 ± 0.493
3.416AlaSer: 3.416 ± 0.668
3.582AlaThr: 3.582 ± 0.618
3.916AlaVal: 3.916 ± 0.644
0.916AlaTrp: 0.916 ± 0.274
2.416AlaTyr: 2.416 ± 0.335
0.0AlaXaa: 0.0 ± 0.0
Cys
0.167CysAla: 0.167 ± 0.122
0.0CysCys: 0.0 ± 0.0
0.167CysAsp: 0.167 ± 0.135
0.25CysGlu: 0.25 ± 0.137
0.25CysPhe: 0.25 ± 0.13
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.083CysIle: 0.083 ± 0.081
0.583CysLys: 0.583 ± 0.231
0.167CysLeu: 0.167 ± 0.11
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.167CysPro: 0.167 ± 0.094
0.333CysGln: 0.333 ± 0.165
0.417CysArg: 0.417 ± 0.173
0.25CysSer: 0.25 ± 0.157
0.25CysThr: 0.25 ± 0.123
0.083CysVal: 0.083 ± 0.085
0.167CysTrp: 0.167 ± 0.119
0.333CysTyr: 0.333 ± 0.182
0.0CysXaa: 0.0 ± 0.0
Asp
3.999AspAla: 3.999 ± 0.566
0.167AspCys: 0.167 ± 0.117
4.499AspAsp: 4.499 ± 0.861
4.749AspGlu: 4.749 ± 0.882
3.999AspPhe: 3.999 ± 0.577
3.249AspGly: 3.249 ± 0.52
0.167AspHis: 0.167 ± 0.139
5.415AspIle: 5.415 ± 0.614
6.415AspLys: 6.415 ± 0.74
5.499AspLeu: 5.499 ± 0.799
1.5AspMet: 1.5 ± 0.344
3.749AspAsn: 3.749 ± 0.515
1.5AspPro: 1.5 ± 0.361
1.333AspGln: 1.333 ± 0.298
2.083AspArg: 2.083 ± 0.385
3.916AspSer: 3.916 ± 0.566
3.582AspThr: 3.582 ± 0.638
3.832AspVal: 3.832 ± 0.491
0.916AspTrp: 0.916 ± 0.302
2.916AspTyr: 2.916 ± 0.59
0.0AspXaa: 0.0 ± 0.0
Glu
5.749GluAla: 5.749 ± 0.798
0.333GluCys: 0.333 ± 0.174
3.416GluAsp: 3.416 ± 0.626
5.332GluGlu: 5.332 ± 0.748
3.166GluPhe: 3.166 ± 0.595
3.333GluGly: 3.333 ± 0.552
1.833GluHis: 1.833 ± 0.357
4.999GluIle: 4.999 ± 0.827
5.999GluLys: 5.999 ± 0.888
6.832GluLeu: 6.832 ± 0.939
2.0GluMet: 2.0 ± 0.449
5.082GluAsn: 5.082 ± 0.625
1.583GluPro: 1.583 ± 0.379
3.916GluGln: 3.916 ± 0.71
3.832GluArg: 3.832 ± 0.589
3.999GluSer: 3.999 ± 0.61
3.333GluThr: 3.333 ± 0.49
5.582GluVal: 5.582 ± 0.754
0.833GluTrp: 0.833 ± 0.268
4.582GluTyr: 4.582 ± 0.74
0.0GluXaa: 0.0 ± 0.0
Phe
1.916PheAla: 1.916 ± 0.352
0.25PheCys: 0.25 ± 0.124
3.166PheAsp: 3.166 ± 0.534
3.916PheGlu: 3.916 ± 0.578
1.416PhePhe: 1.416 ± 0.291
2.916PheGly: 2.916 ± 0.452
0.833PheHis: 0.833 ± 0.285
2.999PheIle: 2.999 ± 0.417
4.249PheLys: 4.249 ± 0.575
2.249PheLeu: 2.249 ± 0.344
1.083PheMet: 1.083 ± 0.328
3.416PheAsn: 3.416 ± 0.497
1.083PhePro: 1.083 ± 0.307
1.416PheGln: 1.416 ± 0.371
1.333PheArg: 1.333 ± 0.297
3.166PheSer: 3.166 ± 0.527
3.166PheThr: 3.166 ± 0.479
2.333PheVal: 2.333 ± 0.477
0.25PheTrp: 0.25 ± 0.129
1.75PheTyr: 1.75 ± 0.369
0.0PheXaa: 0.0 ± 0.0
Gly
2.999GlyAla: 2.999 ± 0.524
0.167GlyCys: 0.167 ± 0.102
3.749GlyAsp: 3.749 ± 0.554
3.499GlyGlu: 3.499 ± 0.561
2.916GlyPhe: 2.916 ± 0.416
3.832GlyGly: 3.832 ± 0.547
1.583GlyHis: 1.583 ± 0.442
4.666GlyIle: 4.666 ± 0.541
4.166GlyLys: 4.166 ± 0.571
5.582GlyLeu: 5.582 ± 0.834
1.583GlyMet: 1.583 ± 0.337
2.999GlyAsn: 2.999 ± 0.445
0.5GlyPro: 0.5 ± 0.229
2.083GlyGln: 2.083 ± 0.39
2.166GlyArg: 2.166 ± 0.378
3.333GlySer: 3.333 ± 0.589
3.999GlyThr: 3.999 ± 0.577
4.582GlyVal: 4.582 ± 0.59
0.75GlyTrp: 0.75 ± 0.3
2.666GlyTyr: 2.666 ± 0.428
0.0GlyXaa: 0.0 ± 0.0
His
1.666HisAla: 1.666 ± 0.382
0.0HisCys: 0.0 ± 0.0
1.083HisAsp: 1.083 ± 0.284
0.916HisGlu: 0.916 ± 0.29
0.833HisPhe: 0.833 ± 0.277
1.5HisGly: 1.5 ± 0.321
0.5HisHis: 0.5 ± 0.193
1.75HisIle: 1.75 ± 0.553
0.916HisLys: 0.916 ± 0.306
0.916HisLeu: 0.916 ± 0.243
0.333HisMet: 0.333 ± 0.194
0.833HisAsn: 0.833 ± 0.28
0.5HisPro: 0.5 ± 0.17
0.417HisGln: 0.417 ± 0.228
0.5HisArg: 0.5 ± 0.203
1.25HisSer: 1.25 ± 0.293
0.833HisThr: 0.833 ± 0.246
1.083HisVal: 1.083 ± 0.297
0.083HisTrp: 0.083 ± 0.075
0.75HisTyr: 0.75 ± 0.411
0.0HisXaa: 0.0 ± 0.0
Ile
5.499IleAla: 5.499 ± 0.82
0.0IleCys: 0.0 ± 0.0
5.082IleAsp: 5.082 ± 0.579
5.999IleGlu: 5.999 ± 0.696
2.333IlePhe: 2.333 ± 0.555
3.916IleGly: 3.916 ± 0.594
1.583IleHis: 1.583 ± 0.346
4.332IleIle: 4.332 ± 0.731
7.165IleLys: 7.165 ± 0.653
4.582IleLeu: 4.582 ± 0.546
2.166IleMet: 2.166 ± 0.356
4.749IleAsn: 4.749 ± 0.617
2.499IlePro: 2.499 ± 0.395
2.166IleGln: 2.166 ± 0.468
3.999IleArg: 3.999 ± 0.573
4.249IleSer: 4.249 ± 0.544
5.499IleThr: 5.499 ± 0.765
4.499IleVal: 4.499 ± 0.575
1.583IleTrp: 1.583 ± 0.578
3.166IleTyr: 3.166 ± 0.52
0.0IleXaa: 0.0 ± 0.0
Lys
5.582LysAla: 5.582 ± 0.607
0.0LysCys: 0.0 ± 0.0
6.082LysAsp: 6.082 ± 0.746
7.248LysGlu: 7.248 ± 1.041
3.416LysPhe: 3.416 ± 0.557
5.165LysGly: 5.165 ± 0.68
1.333LysHis: 1.333 ± 0.375
6.165LysIle: 6.165 ± 0.635
8.248LysLys: 8.248 ± 0.818
7.082LysLeu: 7.082 ± 0.825
2.249LysMet: 2.249 ± 0.504
6.082LysAsn: 6.082 ± 0.714
3.249LysPro: 3.249 ± 0.573
4.832LysGln: 4.832 ± 0.653
3.832LysArg: 3.832 ± 0.566
5.749LysSer: 5.749 ± 0.792
5.415LysThr: 5.415 ± 0.705
5.999LysVal: 5.999 ± 0.722
0.75LysTrp: 0.75 ± 0.227
4.249LysTyr: 4.249 ± 0.825
0.0LysXaa: 0.0 ± 0.0
Leu
3.166LeuAla: 3.166 ± 0.69
0.25LeuCys: 0.25 ± 0.221
4.332LeuAsp: 4.332 ± 0.646
6.332LeuGlu: 6.332 ± 0.594
3.749LeuPhe: 3.749 ± 0.606
4.166LeuGly: 4.166 ± 0.542
0.833LeuHis: 0.833 ± 0.259
4.666LeuIle: 4.666 ± 0.526
7.082LeuLys: 7.082 ± 0.734
5.415LeuLeu: 5.415 ± 0.714
1.5LeuMet: 1.5 ± 0.321
5.415LeuAsn: 5.415 ± 0.511
2.499LeuPro: 2.499 ± 0.297
3.166LeuGln: 3.166 ± 0.637
2.999LeuArg: 2.999 ± 0.582
4.832LeuSer: 4.832 ± 0.542
4.999LeuThr: 4.999 ± 0.676
4.332LeuVal: 4.332 ± 0.761
0.667LeuTrp: 0.667 ± 0.233
3.499LeuTyr: 3.499 ± 0.562
0.0LeuXaa: 0.0 ± 0.0
Met
1.5MetAla: 1.5 ± 0.315
0.25MetCys: 0.25 ± 0.15
1.083MetAsp: 1.083 ± 0.249
1.833MetGlu: 1.833 ± 0.446
0.833MetPhe: 0.833 ± 0.277
0.916MetGly: 0.916 ± 0.283
0.417MetHis: 0.417 ± 0.184
1.583MetIle: 1.583 ± 0.344
1.666MetLys: 1.666 ± 0.343
2.499MetLeu: 2.499 ± 0.361
0.583MetMet: 0.583 ± 0.209
1.833MetAsn: 1.833 ± 0.342
0.833MetPro: 0.833 ± 0.216
1.25MetGln: 1.25 ± 0.328
0.916MetArg: 0.916 ± 0.315
1.666MetSer: 1.666 ± 0.416
2.749MetThr: 2.749 ± 0.538
0.75MetVal: 0.75 ± 0.261
0.5MetTrp: 0.5 ± 0.2
1.083MetTyr: 1.083 ± 0.292
0.0MetXaa: 0.0 ± 0.0
Asn
4.999AsnAla: 4.999 ± 0.721
0.083AsnCys: 0.083 ± 0.088
4.332AsnAsp: 4.332 ± 0.683
5.415AsnGlu: 5.415 ± 0.812
2.916AsnPhe: 2.916 ± 0.508
4.832AsnGly: 4.832 ± 0.636
0.667AsnHis: 0.667 ± 0.238
4.416AsnIle: 4.416 ± 0.467
6.998AsnLys: 6.998 ± 0.671
4.999AsnLeu: 4.999 ± 0.588
1.583AsnMet: 1.583 ± 0.349
4.832AsnAsn: 4.832 ± 0.888
2.249AsnPro: 2.249 ± 0.53
2.083AsnGln: 2.083 ± 0.301
2.249AsnArg: 2.249 ± 0.444
2.999AsnSer: 2.999 ± 0.49
4.666AsnThr: 4.666 ± 0.569
4.082AsnVal: 4.082 ± 0.504
0.667AsnTrp: 0.667 ± 0.266
3.083AsnTyr: 3.083 ± 0.547
0.0AsnXaa: 0.0 ± 0.0
Pro
0.916ProAla: 0.916 ± 0.291
0.167ProCys: 0.167 ± 0.121
1.5ProAsp: 1.5 ± 0.3
2.749ProGlu: 2.749 ± 0.46
0.833ProPhe: 0.833 ± 0.247
2.166ProGly: 2.166 ± 0.497
0.333ProHis: 0.333 ± 0.172
1.75ProIle: 1.75 ± 0.278
2.749ProLys: 2.749 ± 0.787
1.25ProLeu: 1.25 ± 0.278
1.083ProMet: 1.083 ± 0.269
2.499ProAsn: 2.499 ± 0.438
0.417ProPro: 0.417 ± 0.266
0.916ProGln: 0.916 ± 0.268
1.25ProArg: 1.25 ± 0.404
1.333ProSer: 1.333 ± 0.348
2.0ProThr: 2.0 ± 0.381
1.583ProVal: 1.583 ± 0.383
0.167ProTrp: 0.167 ± 0.135
1.25ProTyr: 1.25 ± 0.395
0.0ProXaa: 0.0 ± 0.0
Gln
2.166GlnAla: 2.166 ± 0.502
0.5GlnCys: 0.5 ± 0.215
2.666GlnAsp: 2.666 ± 0.492
2.583GlnGlu: 2.583 ± 0.566
1.416GlnPhe: 1.416 ± 0.328
1.833GlnGly: 1.833 ± 0.281
0.5GlnHis: 0.5 ± 0.198
2.916GlnIle: 2.916 ± 0.52
3.333GlnLys: 3.333 ± 0.482
2.916GlnLeu: 2.916 ± 0.577
1.583GlnMet: 1.583 ± 0.359
2.083GlnAsn: 2.083 ± 0.378
1.083GlnPro: 1.083 ± 0.346
1.75GlnGln: 1.75 ± 0.483
1.916GlnArg: 1.916 ± 0.413
2.249GlnSer: 2.249 ± 0.443
2.083GlnThr: 2.083 ± 0.423
2.083GlnVal: 2.083 ± 0.437
0.333GlnTrp: 0.333 ± 0.166
1.333GlnTyr: 1.333 ± 0.337
0.0GlnXaa: 0.0 ± 0.0
Arg
1.75ArgAla: 1.75 ± 0.367
0.25ArgCys: 0.25 ± 0.137
1.833ArgAsp: 1.833 ± 0.507
3.333ArgGlu: 3.333 ± 0.561
1.833ArgPhe: 1.833 ± 0.385
2.833ArgGly: 2.833 ± 0.442
1.083ArgHis: 1.083 ± 0.323
3.582ArgIle: 3.582 ± 0.494
4.249ArgLys: 4.249 ± 0.537
3.333ArgLeu: 3.333 ± 0.47
0.75ArgMet: 0.75 ± 0.24
2.999ArgAsn: 2.999 ± 0.621
0.75ArgPro: 0.75 ± 0.257
1.583ArgGln: 1.583 ± 0.377
2.083ArgArg: 2.083 ± 0.479
1.916ArgSer: 1.916 ± 0.302
2.249ArgThr: 2.249 ± 0.495
2.499ArgVal: 2.499 ± 0.431
0.417ArgTrp: 0.417 ± 0.233
1.916ArgTyr: 1.916 ± 0.454
0.0ArgXaa: 0.0 ± 0.0
Ser
3.832SerAla: 3.832 ± 0.719
0.25SerCys: 0.25 ± 0.135
5.165SerAsp: 5.165 ± 0.788
3.499SerGlu: 3.499 ± 0.587
2.666SerPhe: 2.666 ± 0.513
3.749SerGly: 3.749 ± 0.672
1.0SerHis: 1.0 ± 0.394
5.165SerIle: 5.165 ± 0.619
5.082SerLys: 5.082 ± 0.748
3.499SerLeu: 3.499 ± 0.544
1.5SerMet: 1.5 ± 0.347
3.999SerAsn: 3.999 ± 0.413
1.416SerPro: 1.416 ± 0.363
2.249SerGln: 2.249 ± 0.403
2.249SerArg: 2.249 ± 0.388
3.416SerSer: 3.416 ± 0.462
3.999SerThr: 3.999 ± 0.347
3.083SerVal: 3.083 ± 0.518
0.167SerTrp: 0.167 ± 0.111
2.333SerTyr: 2.333 ± 0.438
0.0SerXaa: 0.0 ± 0.0
Thr
3.916ThrAla: 3.916 ± 0.529
0.0ThrCys: 0.0 ± 0.0
4.499ThrAsp: 4.499 ± 0.724
3.999ThrGlu: 3.999 ± 0.661
3.166ThrPhe: 3.166 ± 0.508
4.082ThrGly: 4.082 ± 0.593
1.083ThrHis: 1.083 ± 0.251
5.999ThrIle: 5.999 ± 1.233
6.915ThrLys: 6.915 ± 0.709
4.499ThrLeu: 4.499 ± 0.556
1.083ThrMet: 1.083 ± 0.378
3.749ThrAsn: 3.749 ± 0.606
1.833ThrPro: 1.833 ± 0.382
2.666ThrGln: 2.666 ± 0.536
2.083ThrArg: 2.083 ± 0.329
3.333ThrSer: 3.333 ± 0.686
4.332ThrThr: 4.332 ± 0.756
4.915ThrVal: 4.915 ± 0.783
0.75ThrTrp: 0.75 ± 0.273
2.083ThrTyr: 2.083 ± 0.492
0.0ThrXaa: 0.0 ± 0.0
Val
3.832ValAla: 3.832 ± 0.845
0.5ValCys: 0.5 ± 0.206
5.165ValAsp: 5.165 ± 0.728
5.249ValGlu: 5.249 ± 0.723
2.166ValPhe: 2.166 ± 0.504
2.499ValGly: 2.499 ± 0.679
0.417ValHis: 0.417 ± 0.161
5.499ValIle: 5.499 ± 0.77
5.999ValLys: 5.999 ± 0.623
4.082ValLeu: 4.082 ± 0.646
1.25ValMet: 1.25 ± 0.324
4.666ValAsn: 4.666 ± 0.628
2.083ValPro: 2.083 ± 0.492
1.0ValGln: 1.0 ± 0.277
2.166ValArg: 2.166 ± 0.46
3.999ValSer: 3.999 ± 0.747
4.082ValThr: 4.082 ± 0.591
3.749ValVal: 3.749 ± 0.568
0.916ValTrp: 0.916 ± 0.372
2.666ValTyr: 2.666 ± 0.506
0.0ValXaa: 0.0 ± 0.0
Trp
1.166TrpAla: 1.166 ± 0.435
0.083TrpCys: 0.083 ± 0.087
0.167TrpAsp: 0.167 ± 0.136
0.5TrpGlu: 0.5 ± 0.212
0.333TrpPhe: 0.333 ± 0.173
0.583TrpGly: 0.583 ± 0.382
0.25TrpHis: 0.25 ± 0.143
1.0TrpIle: 1.0 ± 0.279
0.833TrpLys: 0.833 ± 0.288
0.833TrpLeu: 0.833 ± 0.355
0.083TrpMet: 0.083 ± 0.078
1.833TrpAsn: 1.833 ± 0.992
0.167TrpPro: 0.167 ± 0.124
0.333TrpGln: 0.333 ± 0.195
0.333TrpArg: 0.333 ± 0.157
0.916TrpSer: 0.916 ± 0.31
1.333TrpThr: 1.333 ± 0.279
0.583TrpVal: 0.583 ± 0.238
0.167TrpTrp: 0.167 ± 0.108
0.583TrpTyr: 0.583 ± 0.263
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.916TyrAla: 1.916 ± 0.347
0.083TyrCys: 0.083 ± 0.081
2.083TyrAsp: 2.083 ± 0.439
4.166TyrGlu: 4.166 ± 0.755
1.916TyrPhe: 1.916 ± 0.524
3.083TyrGly: 3.083 ± 0.536
1.166TyrHis: 1.166 ± 0.324
2.749TyrIle: 2.749 ± 0.444
4.749TyrLys: 4.749 ± 0.591
3.499TyrLeu: 3.499 ± 0.465
1.0TyrMet: 1.0 ± 0.289
2.583TyrAsn: 2.583 ± 0.418
1.25TyrPro: 1.25 ± 0.386
1.583TyrGln: 1.583 ± 0.361
2.166TyrArg: 2.166 ± 0.463
2.333TyrSer: 2.333 ± 0.391
2.833TyrThr: 2.833 ± 0.431
2.416TyrVal: 2.416 ± 0.479
1.083TyrTrp: 1.083 ± 0.465
1.25TyrTyr: 1.25 ± 0.244
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (12004 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
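Protein-level bootstrapping can be sketched roughly as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates serves as the error. This is a sketch under assumptions, not the database's actual code; in particular, using the standard deviation as the reported error and the function names are choices made for this example.

```python
import random
from collections import Counter
from statistics import mean, stdev

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_permille(proteins, pair, n_boot=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide by resampling proteins."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)

# Toy sequences, invented for illustration only.
avg, err = bootstrap_permille(["MKKLA", "MKLA", "MAKKK"], "KK")
print(f"KK: {avg:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual residues keeps each protein's composition intact, so the error reflects variation between proteins in the proteome.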

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski