Amino acid dipeptide frequency for Staphylococcus virus X2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
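As a rough illustration of how such a frequency can be computed, the sketch below counts overlapping adjacent residue pairs in a list of protein sequences. The function name `dipeptide_frequencies`, the one-letter input format, the mapping of non-standard residues to X (Xaa), and the normalization by the total number of counted pairs are all assumptions made for this example; the database's own procedure may differ.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all adjacent residue pairs.

    Hypothetical helper: normalizing by the number of counted pairs
    is an assumption, not necessarily the database's method.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


# Toy sequences, not taken from the proteome.
freqs = dipeptide_frequencies(["MKKLE", "MKUA"])
```

In this toy input the second sequence contains a non-standard residue (U), which ends up in the pairs KX and XA.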

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each one would have a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
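The expected value and the over/under comparison can be written out in a few lines. This is only a sketch: the helper names are hypothetical, and comparing against the uniform expectation of 1/400 is taken from the text above rather than from the database's source code. The two observed values are the LysGlu and CysCys entries of the table, which are listed in per mille.

```python
def expected_uniform_frequency(alphabet_size=20):
    """Frequency of each dipeptide if all ordered pairs were equally likely."""
    return 1.0 / alphabet_size ** 2


def classify(observed, expected):
    """Label a dipeptide relative to the expected frequency (hypothetical helper)."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "as expected"


expected = expected_uniform_frequency()  # 1/400 = 0.0025, i.e. 0.25%

# Table values are in per mille, so divide by 1000 to get fractions.
lys_glu = 8.457 / 1000
cys_cys = 0.0 / 1000

lys_glu_label = classify(lys_glu, expected)
cys_cys_label = classify(cys_cys, expected)
```

With Xaa counted as a 21st residue, `expected_uniform_frequency(21)` gives 1/441 instead.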

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.43 ± 0.157
AlaCys: 0.287 ± 0.136
AlaAsp: 2.795 ± 0.439
AlaGlu: 3.727 ± 0.417
AlaPhe: 2.365 ± 0.52
AlaGly: 3.082 ± 0.622
AlaHis: 0.86 ± 0.296
AlaIle: 5.232 ± 0.538
AlaLys: 5.447 ± 0.636
AlaLeu: 4.443 ± 0.707
AlaMet: 1.648 ± 0.429
AlaAsn: 3.153 ± 0.405
AlaPro: 2.007 ± 0.378
AlaGln: 2.867 ± 0.397
AlaArg: 2.365 ± 0.359
AlaSer: 3.727 ± 0.667
AlaThr: 4.085 ± 0.589
AlaVal: 3.512 ± 0.74
AlaTrp: 0.573 ± 0.258
AlaTyr: 2.795 ± 0.524
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.215 ± 0.115
CysCys: 0.0 ± 0.0
CysAsp: 0.143 ± 0.107
CysGlu: 0.287 ± 0.134
CysPhe: 0.43 ± 0.199
CysGly: 0.215 ± 0.118
CysHis: 0.0 ± 0.0
CysIle: 0.358 ± 0.184
CysLys: 0.573 ± 0.204
CysLeu: 0.358 ± 0.182
CysMet: 0.215 ± 0.128
CysAsn: 0.43 ± 0.185
CysPro: 0.287 ± 0.17
CysGln: 0.287 ± 0.138
CysArg: 0.287 ± 0.147
CysSer: 0.287 ± 0.189
CysThr: 0.287 ± 0.172
CysVal: 0.358 ± 0.158
CysTrp: 0.072 ± 0.071
CysTyr: 0.358 ± 0.168
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.798 ± 0.565
AspCys: 0.072 ± 0.068
AspAsp: 4.085 ± 0.714
AspGlu: 5.017 ± 0.672
AspPhe: 2.723 ± 0.463
AspGly: 4.085 ± 0.68
AspHis: 0.502 ± 0.191
AspIle: 4.802 ± 0.686
AspLys: 5.375 ± 0.708
AspLeu: 4.945 ± 0.564
AspMet: 1.863 ± 0.357
AspAsn: 4.085 ± 0.5
AspPro: 1.147 ± 0.293
AspGln: 0.932 ± 0.246
AspArg: 2.15 ± 0.395
AspSer: 3.583 ± 0.492
AspThr: 3.225 ± 0.481
AspVal: 3.44 ± 0.519
AspTrp: 0.717 ± 0.219
AspTyr: 3.727 ± 0.462
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.013 ± 0.529
GluCys: 0.573 ± 0.197
GluAsp: 4.228 ± 0.625
GluGlu: 5.375 ± 0.735
GluPhe: 3.01 ± 0.45
GluGly: 2.365 ± 0.386
GluHis: 1.863 ± 0.358
GluIle: 5.734 ± 0.8
GluLys: 5.304 ± 0.704
GluLeu: 7.525 ± 0.996
GluMet: 1.72 ± 0.401
GluAsn: 4.013 ± 0.456
GluPro: 1.72 ± 0.291
GluGln: 4.013 ± 0.577
GluArg: 3.727 ± 0.605
GluSer: 3.297 ± 0.502
GluThr: 4.587 ± 0.638
GluVal: 5.232 ± 0.453
GluTrp: 1.362 ± 0.348
GluTyr: 4.085 ± 0.689
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.648 ± 0.319
PheCys: 0.502 ± 0.178
PheAsp: 3.655 ± 0.358
PheGlu: 3.082 ± 0.517
PhePhe: 1.433 ± 0.362
PheGly: 2.867 ± 0.603
PheHis: 0.788 ± 0.217
PheIle: 3.583 ± 0.528
PheLys: 4.443 ± 0.54
PheLeu: 2.652 ± 0.398
PheMet: 0.573 ± 0.196
PheAsn: 3.082 ± 0.479
PhePro: 0.86 ± 0.262
PheGln: 0.932 ± 0.257
PheArg: 1.362 ± 0.306
PheSer: 2.078 ± 0.411
PheThr: 3.01 ± 0.566
PheVal: 3.153 ± 0.44
PheTrp: 0.43 ± 0.172
PheTyr: 2.078 ± 0.397
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.87 ± 0.601
GlyCys: 0.358 ± 0.148
GlyAsp: 3.512 ± 0.573
GlyGlu: 3.01 ± 0.537
GlyPhe: 2.58 ± 0.454
GlyGly: 2.938 ± 0.432
GlyHis: 1.362 ± 0.303
GlyIle: 4.228 ± 0.543
GlyLys: 3.87 ± 0.531
GlyLeu: 4.443 ± 0.748
GlyMet: 1.362 ± 0.273
GlyAsn: 2.867 ± 0.409
GlyPro: 0.358 ± 0.176
GlyGln: 2.437 ± 0.387
GlyArg: 2.222 ± 0.411
GlySer: 2.58 ± 0.386
GlyThr: 4.3 ± 0.665
GlyVal: 4.587 ± 0.751
GlyTrp: 1.218 ± 0.317
GlyTyr: 3.225 ± 0.483
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.86 ± 0.229
HisCys: 0.072 ± 0.079
HisAsp: 0.573 ± 0.23
HisGlu: 1.147 ± 0.288
HisPhe: 0.717 ± 0.166
HisGly: 1.218 ± 0.281
HisHis: 0.43 ± 0.17
HisIle: 1.29 ± 0.357
HisLys: 1.218 ± 0.315
HisLeu: 1.147 ± 0.277
HisMet: 0.358 ± 0.143
HisAsn: 1.218 ± 0.31
HisPro: 0.645 ± 0.236
HisGln: 0.932 ± 0.308
HisArg: 0.645 ± 0.223
HisSer: 1.218 ± 0.273
HisThr: 1.505 ± 0.281
HisVal: 1.075 ± 0.329
HisTrp: 0.072 ± 0.07
HisTyr: 0.788 ± 0.289
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.658 ± 0.672
IleCys: 0.43 ± 0.181
IleAsp: 5.304 ± 0.755
IleGlu: 5.877 ± 0.816
IlePhe: 2.938 ± 0.563
IleGly: 4.945 ± 0.843
IleHis: 0.86 ± 0.259
IleIle: 5.304 ± 0.717
IleLys: 6.952 ± 0.777
IleLeu: 3.225 ± 0.397
IleMet: 2.222 ± 0.339
IleAsn: 4.658 ± 0.639
IlePro: 2.293 ± 0.331
IleGln: 3.153 ± 0.513
IleArg: 3.583 ± 0.673
IleSer: 4.874 ± 0.587
IleThr: 5.662 ± 0.557
IleVal: 3.727 ± 0.435
IleTrp: 0.86 ± 0.302
IleTyr: 2.58 ± 0.521
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.874 ± 0.616
LysCys: 0.287 ± 0.156
LysAsp: 5.017 ± 0.512
LysGlu: 8.457 ± 1.014
LysPhe: 3.225 ± 0.458
LysGly: 5.519 ± 0.672
LysHis: 1.792 ± 0.329
LysIle: 5.949 ± 0.615
LysLys: 7.095 ± 0.937
LysLeu: 6.737 ± 0.804
LysMet: 2.652 ± 0.416
LysAsn: 5.59 ± 0.795
LysPro: 2.795 ± 0.434
LysGln: 4.802 ± 0.503
LysArg: 4.3 ± 0.595
LysSer: 5.662 ± 0.564
LysThr: 5.662 ± 0.65
LysVal: 5.375 ± 0.512
LysTrp: 0.573 ± 0.22
LysTyr: 3.153 ± 0.492
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.228 ± 0.749
LeuCys: 0.358 ± 0.174
LeuAsp: 5.304 ± 0.65
LeuGlu: 5.16 ± 0.54
LeuPhe: 3.368 ± 0.436
LeuGly: 2.508 ± 0.362
LeuHis: 1.433 ± 0.368
LeuIle: 5.089 ± 0.586
LeuLys: 7.884 ± 0.785
LeuLeu: 5.304 ± 0.586
LeuMet: 2.078 ± 0.391
LeuAsn: 5.447 ± 0.545
LeuPro: 2.652 ± 0.414
LeuGln: 2.795 ± 0.4
LeuArg: 2.795 ± 0.603
LeuSer: 4.3 ± 0.498
LeuThr: 5.734 ± 0.73
LeuVal: 4.515 ± 0.579
LeuTrp: 0.645 ± 0.222
LeuTyr: 2.938 ± 0.524
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.577 ± 0.412
MetCys: 0.072 ± 0.079
MetAsp: 1.72 ± 0.391
MetGlu: 1.863 ± 0.366
MetPhe: 1.003 ± 0.248
MetGly: 1.075 ± 0.237
MetHis: 0.143 ± 0.101
MetIle: 1.648 ± 0.314
MetLys: 2.58 ± 0.447
MetLeu: 2.078 ± 0.38
MetMet: 1.003 ± 0.311
MetAsn: 1.505 ± 0.351
MetPro: 1.218 ± 0.263
MetGln: 1.792 ± 0.395
MetArg: 1.29 ± 0.29
MetSer: 2.007 ± 0.421
MetThr: 2.007 ± 0.384
MetVal: 1.075 ± 0.299
MetTrp: 0.287 ± 0.173
MetTyr: 0.932 ± 0.248
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.87 ± 0.489
AsnCys: 0.502 ± 0.24
AsnAsp: 4.228 ± 0.631
AsnGlu: 4.372 ± 0.626
AsnPhe: 2.938 ± 0.541
AsnGly: 4.228 ± 0.577
AsnHis: 0.86 ± 0.332
AsnIle: 4.157 ± 0.662
AsnLys: 6.594 ± 0.698
AsnLeu: 3.727 ± 0.478
AsnMet: 1.72 ± 0.273
AsnAsn: 4.802 ± 0.624
AsnPro: 2.58 ± 0.411
AsnGln: 2.222 ± 0.331
AsnArg: 2.508 ± 0.42
AsnSer: 3.368 ± 0.477
AsnThr: 3.655 ± 0.536
AsnVal: 4.443 ± 0.656
AsnTrp: 1.218 ± 0.272
AsnTyr: 2.437 ± 0.403
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.147 ± 0.259
ProCys: 0.0 ± 0.0
ProAsp: 1.362 ± 0.319
ProGlu: 1.863 ± 0.332
ProPhe: 1.72 ± 0.338
ProGly: 1.362 ± 0.359
ProHis: 0.717 ± 0.212
ProIle: 1.863 ± 0.341
ProLys: 3.44 ± 0.55
ProLeu: 1.577 ± 0.322
ProMet: 0.932 ± 0.284
ProAsn: 2.007 ± 0.418
ProPro: 0.717 ± 0.262
ProGln: 0.932 ± 0.201
ProArg: 1.218 ± 0.307
ProSer: 2.365 ± 0.474
ProThr: 1.433 ± 0.346
ProVal: 2.293 ± 0.445
ProTrp: 0.143 ± 0.105
ProTyr: 1.433 ± 0.269
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.153 ± 0.528
GlnCys: 0.287 ± 0.18
GlnAsp: 1.792 ± 0.323
GlnGlu: 2.795 ± 0.459
GlnPhe: 2.222 ± 0.304
GlnGly: 2.078 ± 0.391
GlnHis: 0.932 ± 0.225
GlnIle: 3.082 ± 0.394
GlnLys: 2.508 ± 0.493
GlnLeu: 3.512 ± 0.506
GlnMet: 1.577 ± 0.393
GlnAsn: 3.153 ± 0.381
GlnPro: 1.72 ± 0.409
GlnGln: 2.078 ± 0.372
GlnArg: 2.007 ± 0.303
GlnSer: 2.293 ± 0.374
GlnThr: 2.222 ± 0.508
GlnVal: 2.58 ± 0.451
GlnTrp: 0.287 ± 0.125
GlnTyr: 1.505 ± 0.333
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.863 ± 0.416
ArgCys: 0.43 ± 0.173
ArgAsp: 2.15 ± 0.42
ArgGlu: 3.583 ± 0.452
ArgPhe: 1.935 ± 0.358
ArgGly: 2.723 ± 0.449
ArgHis: 0.788 ± 0.217
ArgIle: 2.867 ± 0.525
ArgLys: 3.87 ± 0.579
ArgLeu: 3.44 ± 0.574
ArgMet: 0.932 ± 0.252
ArgAsn: 2.938 ± 0.506
ArgPro: 0.86 ± 0.224
ArgGln: 2.007 ± 0.357
ArgArg: 1.577 ± 0.344
ArgSer: 2.007 ± 0.517
ArgThr: 2.437 ± 0.523
ArgVal: 2.867 ± 0.424
ArgTrp: 0.86 ± 0.279
ArgTyr: 2.365 ± 0.499
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.443 ± 0.592
SerCys: 0.43 ± 0.225
SerAsp: 4.443 ± 0.598
SerGlu: 3.368 ± 0.532
SerPhe: 2.508 ± 0.431
SerGly: 3.655 ± 0.582
SerHis: 0.645 ± 0.185
SerIle: 4.874 ± 0.819
SerLys: 4.945 ± 0.678
SerLeu: 4.372 ± 0.527
SerMet: 1.648 ± 0.361
SerAsn: 3.727 ± 0.611
SerPro: 1.218 ± 0.281
SerGln: 2.867 ± 0.556
SerArg: 2.437 ± 0.344
SerSer: 3.583 ± 0.602
SerThr: 3.583 ± 0.4
SerVal: 4.658 ± 0.581
SerTrp: 0.717 ± 0.201
SerTyr: 2.58 ± 0.467
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.727 ± 0.529
ThrCys: 0.072 ± 0.068
ThrAsp: 3.727 ± 0.45
ThrGlu: 4.228 ± 0.656
ThrPhe: 2.723 ± 0.487
ThrGly: 3.87 ± 0.694
ThrHis: 1.433 ± 0.297
ThrIle: 5.089 ± 0.586
ThrLys: 4.802 ± 0.65
ThrLeu: 5.232 ± 0.485
ThrMet: 1.29 ± 0.345
ThrAsn: 3.87 ± 0.486
ThrPro: 1.935 ± 0.302
ThrGln: 2.58 ± 0.404
ThrArg: 2.508 ± 0.495
ThrSer: 5.375 ± 0.782
ThrThr: 4.658 ± 0.478
ThrVal: 4.085 ± 0.571
ThrTrp: 1.003 ± 0.287
ThrTyr: 2.795 ± 0.507
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.372 ± 0.578
ValCys: 0.143 ± 0.096
ValAsp: 4.013 ± 0.602
ValGlu: 5.734 ± 0.765
ValPhe: 2.15 ± 0.361
ValGly: 3.583 ± 0.529
ValHis: 0.717 ± 0.215
ValIle: 4.587 ± 0.594
ValLys: 6.952 ± 0.682
ValLeu: 5.16 ± 0.711
ValMet: 1.72 ± 0.355
ValAsn: 4.3 ± 0.654
ValPro: 2.365 ± 0.349
ValGln: 1.218 ± 0.358
ValArg: 2.58 ± 0.368
ValSer: 4.228 ± 0.536
ValThr: 4.085 ± 0.531
ValVal: 4.085 ± 0.693
ValTrp: 0.645 ± 0.255
ValTyr: 2.365 ± 0.473
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.788 ± 0.231
TrpCys: 0.072 ± 0.071
TrpAsp: 0.43 ± 0.197
TrpGlu: 1.218 ± 0.242
TrpPhe: 0.502 ± 0.186
TrpGly: 0.717 ± 0.353
TrpHis: 0.287 ± 0.137
TrpIle: 0.932 ± 0.277
TrpLys: 1.075 ± 0.277
TrpLeu: 1.218 ± 0.351
TrpMet: 0.143 ± 0.102
TrpAsn: 0.573 ± 0.21
TrpPro: 0.0 ± 0.0
TrpGln: 0.788 ± 0.24
TrpArg: 0.358 ± 0.163
TrpSer: 0.86 ± 0.28
TrpThr: 0.717 ± 0.204
TrpVal: 0.932 ± 0.239
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.645 ± 0.23
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.863 ± 0.406
TyrCys: 0.502 ± 0.22
TyrAsp: 1.863 ± 0.429
TyrGlu: 3.583 ± 0.569
TyrPhe: 1.72 ± 0.42
TyrGly: 2.293 ± 0.373
TyrHis: 0.717 ± 0.247
TyrIle: 3.44 ± 0.504
TyrLys: 4.587 ± 0.603
TyrLeu: 3.44 ± 0.545
TyrMet: 1.147 ± 0.268
TyrAsn: 3.01 ± 0.465
TyrPro: 1.147 ± 0.3
TyrGln: 2.15 ± 0.363
TyrArg: 2.58 ± 0.563
TyrSer: 3.01 ± 0.539
TyrThr: 2.15 ± 0.396
TyrVal: 3.01 ± 0.431
TyrTrp: 0.573 ± 0.202
TyrTyr: 2.437 ± 0.457
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (13954 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level, i.e. by resampling whole proteins rather than individual residues.
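A protein-level bootstrap of this kind can be sketched as follows: whole proteins are drawn with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of those estimates is reported as the error. The function names, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions for illustration only; the source does not say which statistic the database reports.

```python
import random
import statistics
from collections import Counter


def pair_fraction(proteins, pair):
    """Fraction of all adjacent residue pairs that equal `pair`."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the pair fraction over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_fraction(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not taken from the proteome.
proteins = ["MKKLEAK", "MAKEL", "MKEKK", "MLLAKE"]
err = bootstrap_error(proteins, "KE")
```

Resampling at the protein level keeps each sequence intact, so the estimate reflects variation between proteins rather than between individual positions.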

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski