Amino acid dipepetide frequency for Staphylococcus phage SAP40

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.66AlaAla: 1.66 ± 0.573
0.528AlaCys: 0.528 ± 0.18
2.943AlaAsp: 2.943 ± 0.478
3.622AlaGlu: 3.622 ± 0.572
3.018AlaPhe: 3.018 ± 0.497
2.792AlaGly: 2.792 ± 0.406
0.754AlaHis: 0.754 ± 0.205
5.131AlaIle: 5.131 ± 1.106
5.357AlaLys: 5.357 ± 0.738
4.15AlaLeu: 4.15 ± 0.706
2.037AlaMet: 2.037 ± 0.514
3.848AlaAsn: 3.848 ± 0.512
1.584AlaPro: 1.584 ± 0.307
2.943AlaGln: 2.943 ± 0.529
3.093AlaArg: 3.093 ± 0.509
3.471AlaSer: 3.471 ± 0.683
4.15AlaThr: 4.15 ± 0.641
3.395AlaVal: 3.395 ± 0.681
0.83AlaTrp: 0.83 ± 0.33
2.565AlaTyr: 2.565 ± 0.378
0.0AlaXaa: 0.0 ± 0.0
Cys
0.151CysAla: 0.151 ± 0.12
0.075CysCys: 0.075 ± 0.069
0.075CysAsp: 0.075 ± 0.071
0.302CysGlu: 0.302 ± 0.136
0.226CysPhe: 0.226 ± 0.139
0.151CysGly: 0.151 ± 0.106
0.0CysHis: 0.0 ± 0.0
0.075CysIle: 0.075 ± 0.089
0.302CysLys: 0.302 ± 0.157
0.226CysLeu: 0.226 ± 0.125
0.0CysMet: 0.0 ± 0.0
0.377CysAsn: 0.377 ± 0.203
0.226CysPro: 0.226 ± 0.12
0.151CysGln: 0.151 ± 0.119
0.226CysArg: 0.226 ± 0.155
0.453CysSer: 0.453 ± 0.228
0.377CysThr: 0.377 ± 0.158
0.075CysVal: 0.075 ± 0.08
0.151CysTrp: 0.151 ± 0.098
0.302CysTyr: 0.302 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
3.923AspAla: 3.923 ± 0.569
0.075AspCys: 0.075 ± 0.083
4.225AspAsp: 4.225 ± 0.768
5.432AspGlu: 5.432 ± 0.867
3.395AspPhe: 3.395 ± 0.584
4.074AspGly: 4.074 ± 0.581
0.377AspHis: 0.377 ± 0.159
4.678AspIle: 4.678 ± 0.597
5.96AspLys: 5.96 ± 0.805
4.602AspLeu: 4.602 ± 0.761
1.584AspMet: 1.584 ± 0.383
4.904AspAsn: 4.904 ± 0.635
0.981AspPro: 0.981 ± 0.315
1.207AspGln: 1.207 ± 0.313
2.716AspArg: 2.716 ± 0.484
3.471AspSer: 3.471 ± 0.569
3.923AspThr: 3.923 ± 0.644
3.697AspVal: 3.697 ± 0.56
0.754AspTrp: 0.754 ± 0.252
2.641AspTyr: 2.641 ± 0.477
0.0AspXaa: 0.0 ± 0.0
Glu
5.432GluAla: 5.432 ± 0.721
0.226GluCys: 0.226 ± 0.116
3.395GluAsp: 3.395 ± 0.661
6.564GluGlu: 6.564 ± 0.861
3.471GluPhe: 3.471 ± 0.564
3.018GluGly: 3.018 ± 0.481
1.283GluHis: 1.283 ± 0.321
6.187GluIle: 6.187 ± 1.051
6.036GluLys: 6.036 ± 1.012
7.168GluLeu: 7.168 ± 0.905
1.962GluMet: 1.962 ± 0.451
5.281GluAsn: 5.281 ± 0.803
1.886GluPro: 1.886 ± 0.364
4.301GluGln: 4.301 ± 0.59
3.244GluArg: 3.244 ± 0.67
4.451GluSer: 4.451 ± 0.543
2.943GluThr: 2.943 ± 0.418
5.432GluVal: 5.432 ± 0.728
0.83GluTrp: 0.83 ± 0.23
4.451GluTyr: 4.451 ± 0.663
0.0GluXaa: 0.0 ± 0.0
Phe
2.037PheAla: 2.037 ± 0.406
0.302PheCys: 0.302 ± 0.139
3.471PheAsp: 3.471 ± 0.57
3.999PheGlu: 3.999 ± 0.522
1.735PhePhe: 1.735 ± 0.363
2.49PheGly: 2.49 ± 0.475
0.453PheHis: 0.453 ± 0.229
3.244PheIle: 3.244 ± 0.514
4.451PheLys: 4.451 ± 0.533
2.414PheLeu: 2.414 ± 0.404
0.83PheMet: 0.83 ± 0.262
3.999PheAsn: 3.999 ± 0.534
0.754PhePro: 0.754 ± 0.315
1.358PheGln: 1.358 ± 0.434
1.283PheArg: 1.283 ± 0.346
2.867PheSer: 2.867 ± 0.56
2.565PheThr: 2.565 ± 0.425
3.093PheVal: 3.093 ± 0.589
0.302PheTrp: 0.302 ± 0.143
1.509PheTyr: 1.509 ± 0.36
0.0PheXaa: 0.0 ± 0.0
Gly
2.641GlyAla: 2.641 ± 0.422
0.075GlyCys: 0.075 ± 0.085
3.848GlyAsp: 3.848 ± 0.567
2.641GlyGlu: 2.641 ± 0.494
2.49GlyPhe: 2.49 ± 0.543
2.867GlyGly: 2.867 ± 0.495
1.358GlyHis: 1.358 ± 0.417
4.753GlyIle: 4.753 ± 0.542
4.301GlyLys: 4.301 ± 0.602
4.98GlyLeu: 4.98 ± 0.749
1.207GlyMet: 1.207 ± 0.32
3.32GlyAsn: 3.32 ± 0.549
0.604GlyPro: 0.604 ± 0.297
1.886GlyGln: 1.886 ± 0.375
2.565GlyArg: 2.565 ± 0.457
2.716GlySer: 2.716 ± 0.521
3.848GlyThr: 3.848 ± 0.53
4.074GlyVal: 4.074 ± 0.69
1.207GlyTrp: 1.207 ± 0.286
1.886GlyTyr: 1.886 ± 0.311
0.0GlyXaa: 0.0 ± 0.0
His
1.207HisAla: 1.207 ± 0.316
0.075HisCys: 0.075 ± 0.069
1.056HisAsp: 1.056 ± 0.272
1.283HisGlu: 1.283 ± 0.368
0.83HisPhe: 0.83 ± 0.256
0.981HisGly: 0.981 ± 0.312
0.453HisHis: 0.453 ± 0.192
1.132HisIle: 1.132 ± 0.315
0.83HisLys: 0.83 ± 0.255
0.754HisLeu: 0.754 ± 0.238
0.453HisMet: 0.453 ± 0.231
0.83HisAsn: 0.83 ± 0.288
0.604HisPro: 0.604 ± 0.194
0.528HisGln: 0.528 ± 0.237
0.377HisArg: 0.377 ± 0.175
1.207HisSer: 1.207 ± 0.257
0.83HisThr: 0.83 ± 0.232
1.132HisVal: 1.132 ± 0.315
0.075HisTrp: 0.075 ± 0.071
0.453HisTyr: 0.453 ± 0.307
0.0HisXaa: 0.0 ± 0.0
Ile
5.131IleAla: 5.131 ± 0.836
0.0IleCys: 0.0 ± 0.0
5.508IleAsp: 5.508 ± 0.631
6.64IleGlu: 6.64 ± 0.823
2.414IlePhe: 2.414 ± 0.51
3.848IleGly: 3.848 ± 0.576
1.358IleHis: 1.358 ± 0.348
4.376IleIle: 4.376 ± 0.603
7.243IleLys: 7.243 ± 0.789
4.98IleLeu: 4.98 ± 0.617
1.962IleMet: 1.962 ± 0.388
5.508IleAsn: 5.508 ± 0.928
2.037IlePro: 2.037 ± 0.441
2.792IleGln: 2.792 ± 0.59
2.792IleArg: 2.792 ± 0.509
3.848IleSer: 3.848 ± 0.556
5.432IleThr: 5.432 ± 0.727
3.999IleVal: 3.999 ± 0.663
1.358IleTrp: 1.358 ± 0.696
3.32IleTyr: 3.32 ± 0.54
0.0IleXaa: 0.0 ± 0.0
Lys
5.583LysAla: 5.583 ± 0.549
0.226LysCys: 0.226 ± 0.131
5.734LysAsp: 5.734 ± 0.738
7.394LysGlu: 7.394 ± 1.006
2.943LysPhe: 2.943 ± 0.477
5.583LysGly: 5.583 ± 0.832
1.132LysHis: 1.132 ± 0.32
6.338LysIle: 6.338 ± 0.952
8.752LysLys: 8.752 ± 0.793
7.092LysLeu: 7.092 ± 0.76
2.037LysMet: 2.037 ± 0.401
6.64LysAsn: 6.64 ± 0.798
2.867LysPro: 2.867 ± 0.55
5.281LysGln: 5.281 ± 0.666
3.923LysArg: 3.923 ± 0.603
5.734LysSer: 5.734 ± 0.681
5.432LysThr: 5.432 ± 0.611
5.734LysVal: 5.734 ± 0.744
0.754LysTrp: 0.754 ± 0.244
3.999LysTyr: 3.999 ± 0.732
0.0LysXaa: 0.0 ± 0.0
Leu
2.792LeuAla: 2.792 ± 0.496
0.226LeuCys: 0.226 ± 0.121
4.527LeuAsp: 4.527 ± 0.607
6.79LeuGlu: 6.79 ± 0.693
4.074LeuPhe: 4.074 ± 0.685
3.622LeuGly: 3.622 ± 0.545
0.981LeuHis: 0.981 ± 0.29
5.508LeuIle: 5.508 ± 0.704
7.998LeuLys: 7.998 ± 0.65
5.659LeuLeu: 5.659 ± 0.747
1.735LeuMet: 1.735 ± 0.329
5.583LeuAsn: 5.583 ± 0.565
2.263LeuPro: 2.263 ± 0.383
3.772LeuGln: 3.772 ± 0.584
3.622LeuArg: 3.622 ± 0.633
4.602LeuSer: 4.602 ± 0.625
5.281LeuThr: 5.281 ± 0.709
4.904LeuVal: 4.904 ± 0.714
0.453LeuTrp: 0.453 ± 0.223
2.943LeuTyr: 2.943 ± 0.535
0.0LeuXaa: 0.0 ± 0.0
Met
1.509MetAla: 1.509 ± 0.32
0.0MetCys: 0.0 ± 0.0
1.207MetAsp: 1.207 ± 0.282
1.886MetGlu: 1.886 ± 0.364
0.754MetPhe: 0.754 ± 0.278
1.132MetGly: 1.132 ± 0.299
0.302MetHis: 0.302 ± 0.179
1.132MetIle: 1.132 ± 0.263
1.962MetLys: 1.962 ± 0.35
2.792MetLeu: 2.792 ± 0.385
0.679MetMet: 0.679 ± 0.205
1.358MetAsn: 1.358 ± 0.306
0.905MetPro: 0.905 ± 0.288
1.434MetGln: 1.434 ± 0.364
0.981MetArg: 0.981 ± 0.33
1.207MetSer: 1.207 ± 0.315
2.414MetThr: 2.414 ± 0.599
1.283MetVal: 1.283 ± 0.255
0.604MetTrp: 0.604 ± 0.294
0.905MetTyr: 0.905 ± 0.321
0.0MetXaa: 0.0 ± 0.0
Asn
6.036AsnAla: 6.036 ± 0.737
0.226AsnCys: 0.226 ± 0.183
4.829AsnAsp: 4.829 ± 0.729
5.659AsnGlu: 5.659 ± 0.798
3.093AsnPhe: 3.093 ± 0.513
4.678AsnGly: 4.678 ± 0.721
0.905AsnHis: 0.905 ± 0.27
4.15AsnIle: 4.15 ± 0.452
7.847AsnLys: 7.847 ± 0.809
5.357AsnLeu: 5.357 ± 0.707
1.509AsnMet: 1.509 ± 0.336
4.301AsnAsn: 4.301 ± 0.721
2.716AsnPro: 2.716 ± 0.512
2.49AsnGln: 2.49 ± 0.462
2.49AsnArg: 2.49 ± 0.413
3.018AsnSer: 3.018 ± 0.49
4.15AsnThr: 4.15 ± 0.527
3.999AsnVal: 3.999 ± 0.553
0.604AsnTrp: 0.604 ± 0.221
2.792AsnTyr: 2.792 ± 0.546
0.0AsnXaa: 0.0 ± 0.0
Pro
1.207ProAla: 1.207 ± 0.301
0.226ProCys: 0.226 ± 0.13
1.207ProAsp: 1.207 ± 0.278
2.037ProGlu: 2.037 ± 0.347
1.132ProPhe: 1.132 ± 0.252
1.886ProGly: 1.886 ± 0.561
0.302ProHis: 0.302 ± 0.169
2.641ProIle: 2.641 ± 0.396
2.49ProLys: 2.49 ± 0.578
1.207ProLeu: 1.207 ± 0.301
0.754ProMet: 0.754 ± 0.265
2.113ProAsn: 2.113 ± 0.358
0.604ProPro: 0.604 ± 0.281
1.132ProGln: 1.132 ± 0.301
1.283ProArg: 1.283 ± 0.391
1.358ProSer: 1.358 ± 0.32
1.66ProThr: 1.66 ± 0.347
1.735ProVal: 1.735 ± 0.325
0.151ProTrp: 0.151 ± 0.099
1.283ProTyr: 1.283 ± 0.35
0.0ProXaa: 0.0 ± 0.0
Gln
2.49GlnAla: 2.49 ± 0.487
0.453GlnCys: 0.453 ± 0.203
3.169GlnAsp: 3.169 ± 0.564
3.018GlnGlu: 3.018 ± 0.597
1.509GlnPhe: 1.509 ± 0.353
1.962GlnGly: 1.962 ± 0.331
0.377GlnHis: 0.377 ± 0.148
3.32GlnIle: 3.32 ± 0.484
2.943GlnLys: 2.943 ± 0.396
3.772GlnLeu: 3.772 ± 0.642
1.434GlnMet: 1.434 ± 0.297
2.867GlnAsn: 2.867 ± 0.565
1.132GlnPro: 1.132 ± 0.307
2.188GlnGln: 2.188 ± 0.622
1.735GlnArg: 1.735 ± 0.401
2.49GlnSer: 2.49 ± 0.397
2.263GlnThr: 2.263 ± 0.423
2.113GlnVal: 2.113 ± 0.408
0.453GlnTrp: 0.453 ± 0.181
1.283GlnTyr: 1.283 ± 0.333
0.0GlnXaa: 0.0 ± 0.0
Arg
1.886ArgAla: 1.886 ± 0.422
0.151ArgCys: 0.151 ± 0.128
2.188ArgAsp: 2.188 ± 0.532
2.414ArgGlu: 2.414 ± 0.398
2.113ArgPhe: 2.113 ± 0.462
2.263ArgGly: 2.263 ± 0.478
0.905ArgHis: 0.905 ± 0.264
3.32ArgIle: 3.32 ± 0.593
4.15ArgLys: 4.15 ± 0.767
4.225ArgLeu: 4.225 ± 0.602
0.679ArgMet: 0.679 ± 0.22
2.867ArgAsn: 2.867 ± 0.516
0.679ArgPro: 0.679 ± 0.226
1.283ArgGln: 1.283 ± 0.323
2.037ArgArg: 2.037 ± 0.445
2.339ArgSer: 2.339 ± 0.447
1.886ArgThr: 1.886 ± 0.395
2.716ArgVal: 2.716 ± 0.569
0.604ArgTrp: 0.604 ± 0.23
2.339ArgTyr: 2.339 ± 0.511
0.0ArgXaa: 0.0 ± 0.0
Ser
3.923SerAla: 3.923 ± 0.627
0.302SerCys: 0.302 ± 0.13
4.602SerAsp: 4.602 ± 0.744
3.848SerGlu: 3.848 ± 0.568
2.716SerPhe: 2.716 ± 0.481
2.716SerGly: 2.716 ± 0.561
1.509SerHis: 1.509 ± 0.381
4.376SerIle: 4.376 ± 0.591
5.659SerLys: 5.659 ± 0.639
4.15SerLeu: 4.15 ± 0.505
1.358SerMet: 1.358 ± 0.354
4.225SerAsn: 4.225 ± 0.478
1.283SerPro: 1.283 ± 0.324
1.886SerGln: 1.886 ± 0.378
2.414SerArg: 2.414 ± 0.399
2.943SerSer: 2.943 ± 0.5
4.15SerThr: 4.15 ± 0.477
2.867SerVal: 2.867 ± 0.461
0.453SerTrp: 0.453 ± 0.185
1.962SerTyr: 1.962 ± 0.324
0.0SerXaa: 0.0 ± 0.0
Thr
3.848ThrAla: 3.848 ± 0.499
0.151ThrCys: 0.151 ± 0.111
3.395ThrAsp: 3.395 ± 0.665
3.999ThrGlu: 3.999 ± 0.621
2.263ThrPhe: 2.263 ± 0.488
3.697ThrGly: 3.697 ± 0.568
0.981ThrHis: 0.981 ± 0.233
5.81ThrIle: 5.81 ± 1.333
6.338ThrLys: 6.338 ± 0.618
5.055ThrLeu: 5.055 ± 0.629
0.905ThrMet: 0.905 ± 0.308
3.999ThrAsn: 3.999 ± 0.599
1.735ThrPro: 1.735 ± 0.391
3.169ThrGln: 3.169 ± 0.571
2.49ThrArg: 2.49 ± 0.393
3.999ThrSer: 3.999 ± 0.73
4.527ThrThr: 4.527 ± 0.82
4.602ThrVal: 4.602 ± 0.736
0.679ThrTrp: 0.679 ± 0.255
2.037ThrTyr: 2.037 ± 0.49
0.0ThrXaa: 0.0 ± 0.0
Val
3.546ValAla: 3.546 ± 1.052
0.302ValCys: 0.302 ± 0.141
4.904ValAsp: 4.904 ± 0.716
5.131ValGlu: 5.131 ± 0.66
2.867ValPhe: 2.867 ± 0.509
2.263ValGly: 2.263 ± 0.435
0.604ValHis: 0.604 ± 0.178
4.602ValIle: 4.602 ± 0.62
5.281ValLys: 5.281 ± 0.642
4.753ValLeu: 4.753 ± 0.696
1.962ValMet: 1.962 ± 0.415
4.602ValAsn: 4.602 ± 0.628
2.188ValPro: 2.188 ± 0.507
1.132ValGln: 1.132 ± 0.303
1.811ValArg: 1.811 ± 0.423
3.622ValSer: 3.622 ± 0.594
4.225ValThr: 4.225 ± 0.594
4.301ValVal: 4.301 ± 0.743
1.056ValTrp: 1.056 ± 0.317
3.018ValTyr: 3.018 ± 0.562
0.0ValXaa: 0.0 ± 0.0
Trp
1.207TrpAla: 1.207 ± 0.37
0.151TrpCys: 0.151 ± 0.108
0.151TrpAsp: 0.151 ± 0.108
0.754TrpGlu: 0.754 ± 0.27
0.226TrpPhe: 0.226 ± 0.161
0.679TrpGly: 0.679 ± 0.329
0.302TrpHis: 0.302 ± 0.156
0.754TrpIle: 0.754 ± 0.242
0.905TrpLys: 0.905 ± 0.295
1.056TrpLeu: 1.056 ± 0.286
0.226TrpMet: 0.226 ± 0.134
1.584TrpAsn: 1.584 ± 1.01
0.151TrpPro: 0.151 ± 0.106
0.302TrpGln: 0.302 ± 0.198
0.226TrpArg: 0.226 ± 0.137
0.981TrpSer: 0.981 ± 0.306
0.905TrpThr: 0.905 ± 0.313
0.604TrpVal: 0.604 ± 0.215
0.0TrpTrp: 0.0 ± 0.0
0.905TrpTyr: 0.905 ± 0.302
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.66TyrAla: 1.66 ± 0.267
0.151TyrCys: 0.151 ± 0.105
2.188TyrAsp: 2.188 ± 0.466
4.15TyrGlu: 4.15 ± 0.613
1.886TyrPhe: 1.886 ± 0.393
2.565TyrGly: 2.565 ± 0.625
0.754TyrHis: 0.754 ± 0.271
3.018TyrIle: 3.018 ± 0.446
4.451TyrLys: 4.451 ± 0.615
2.943TyrLeu: 2.943 ± 0.482
1.056TyrMet: 1.056 ± 0.281
2.641TyrAsn: 2.641 ± 0.4
1.207TyrPro: 1.207 ± 0.312
1.735TyrGln: 1.735 ± 0.429
1.735TyrArg: 1.735 ± 0.41
2.565TyrSer: 2.565 ± 0.52
2.716TyrThr: 2.716 ± 0.498
2.414TyrVal: 2.414 ± 0.496
0.754TyrTrp: 0.754 ± 0.271
1.886TyrTyr: 1.886 ± 0.412
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (13255 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski