Amino acid dipepetide frequency for Staphylococcus virus 52a

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.305AlaAla: 0.305 ± 0.153
0.229AlaCys: 0.229 ± 0.126
2.44AlaAsp: 2.44 ± 0.389
3.507AlaGlu: 3.507 ± 0.454
2.287AlaPhe: 2.287 ± 0.579
3.888AlaGly: 3.888 ± 0.638
1.22AlaHis: 1.22 ± 0.322
5.26AlaIle: 5.26 ± 0.622
5.794AlaLys: 5.794 ± 0.785
4.422AlaLeu: 4.422 ± 0.637
1.372AlaMet: 1.372 ± 0.418
4.269AlaAsn: 4.269 ± 0.525
2.135AlaPro: 2.135 ± 0.444
2.592AlaGln: 2.592 ± 0.495
3.049AlaArg: 3.049 ± 0.438
3.354AlaSer: 3.354 ± 0.679
4.422AlaThr: 4.422 ± 0.667
4.193AlaVal: 4.193 ± 0.739
0.991AlaTrp: 0.991 ± 0.321
2.211AlaTyr: 2.211 ± 0.434
0.0AlaXaa: 0.0 ± 0.0
Cys
0.305CysAla: 0.305 ± 0.152
0.076CysCys: 0.076 ± 0.074
0.305CysAsp: 0.305 ± 0.158
0.381CysGlu: 0.381 ± 0.172
0.305CysPhe: 0.305 ± 0.134
0.457CysGly: 0.457 ± 0.207
0.0CysHis: 0.0 ± 0.0
0.229CysIle: 0.229 ± 0.129
0.305CysLys: 0.305 ± 0.139
0.229CysLeu: 0.229 ± 0.139
0.076CysMet: 0.076 ± 0.078
0.229CysAsn: 0.229 ± 0.131
0.305CysPro: 0.305 ± 0.167
0.229CysGln: 0.229 ± 0.121
0.61CysArg: 0.61 ± 0.275
0.534CysSer: 0.534 ± 0.221
0.152CysThr: 0.152 ± 0.148
0.305CysVal: 0.305 ± 0.158
0.152CysTrp: 0.152 ± 0.11
0.305CysTyr: 0.305 ± 0.147
0.0CysXaa: 0.0 ± 0.0
Asp
4.117AspAla: 4.117 ± 0.766
0.152AspCys: 0.152 ± 0.116
4.498AspAsp: 4.498 ± 0.826
4.422AspGlu: 4.422 ± 0.666
2.897AspPhe: 2.897 ± 0.438
4.193AspGly: 4.193 ± 0.646
0.381AspHis: 0.381 ± 0.163
4.65AspIle: 4.65 ± 0.645
5.794AspLys: 5.794 ± 0.977
5.184AspLeu: 5.184 ± 0.546
1.601AspMet: 1.601 ± 0.373
3.583AspAsn: 3.583 ± 0.551
1.22AspPro: 1.22 ± 0.284
1.144AspGln: 1.144 ± 0.302
1.982AspArg: 1.982 ± 0.374
3.659AspSer: 3.659 ± 0.463
3.354AspThr: 3.354 ± 0.602
4.193AspVal: 4.193 ± 0.753
0.534AspTrp: 0.534 ± 0.201
2.745AspTyr: 2.745 ± 0.443
0.0AspXaa: 0.0 ± 0.0
Glu
4.041GluAla: 4.041 ± 0.619
0.991GluCys: 0.991 ± 0.25
4.117GluAsp: 4.117 ± 0.745
6.099GluGlu: 6.099 ± 1.066
2.821GluPhe: 2.821 ± 0.413
2.973GluGly: 2.973 ± 0.4
1.525GluHis: 1.525 ± 0.379
5.337GluIle: 5.337 ± 0.71
5.718GluLys: 5.718 ± 0.902
7.243GluLeu: 7.243 ± 1.066
2.058GluMet: 2.058 ± 0.44
4.727GluAsn: 4.727 ± 0.59
2.058GluPro: 2.058 ± 0.343
4.879GluGln: 4.879 ± 0.719
3.202GluArg: 3.202 ± 0.522
3.202GluSer: 3.202 ± 0.545
3.431GluThr: 3.431 ± 0.404
5.337GluVal: 5.337 ± 0.542
1.067GluTrp: 1.067 ± 0.245
4.117GluTyr: 4.117 ± 0.639
0.0GluXaa: 0.0 ± 0.0
Phe
1.83PheAla: 1.83 ± 0.37
0.305PheCys: 0.305 ± 0.128
4.269PheAsp: 4.269 ± 0.474
3.202PheGlu: 3.202 ± 0.569
0.915PhePhe: 0.915 ± 0.269
2.745PheGly: 2.745 ± 0.691
0.61PheHis: 0.61 ± 0.212
3.507PheIle: 3.507 ± 0.527
4.422PheLys: 4.422 ± 0.542
2.897PheLeu: 2.897 ± 0.493
1.372PheMet: 1.372 ± 0.394
3.507PheAsn: 3.507 ± 0.621
0.762PhePro: 0.762 ± 0.27
0.991PheGln: 0.991 ± 0.291
1.22PheArg: 1.22 ± 0.265
2.516PheSer: 2.516 ± 0.464
3.659PheThr: 3.659 ± 0.564
2.44PheVal: 2.44 ± 0.592
0.381PheTrp: 0.381 ± 0.197
1.677PheTyr: 1.677 ± 0.426
0.0PheXaa: 0.0 ± 0.0
Gly
4.574GlyAla: 4.574 ± 0.668
0.457GlyCys: 0.457 ± 0.151
3.049GlyAsp: 3.049 ± 0.518
2.897GlyGlu: 2.897 ± 0.505
3.278GlyPhe: 3.278 ± 0.515
2.745GlyGly: 2.745 ± 0.563
1.982GlyHis: 1.982 ± 0.429
4.269GlyIle: 4.269 ± 0.668
5.032GlyLys: 5.032 ± 0.492
4.117GlyLeu: 4.117 ± 0.739
1.525GlyMet: 1.525 ± 0.345
3.354GlyAsn: 3.354 ± 0.514
0.534GlyPro: 0.534 ± 0.208
2.821GlyGln: 2.821 ± 0.465
2.287GlyArg: 2.287 ± 0.442
3.354GlySer: 3.354 ± 0.519
4.346GlyThr: 4.346 ± 0.556
4.879GlyVal: 4.879 ± 0.738
0.991GlyTrp: 0.991 ± 0.408
2.668GlyTyr: 2.668 ± 0.567
0.0GlyXaa: 0.0 ± 0.0
His
1.372HisAla: 1.372 ± 0.298
0.076HisCys: 0.076 ± 0.078
0.61HisAsp: 0.61 ± 0.218
1.067HisGlu: 1.067 ± 0.32
0.839HisPhe: 0.839 ± 0.254
0.915HisGly: 0.915 ± 0.276
0.534HisHis: 0.534 ± 0.255
1.449HisIle: 1.449 ± 0.306
1.067HisLys: 1.067 ± 0.246
1.601HisLeu: 1.601 ± 0.355
0.229HisMet: 0.229 ± 0.109
0.762HisAsn: 0.762 ± 0.254
1.067HisPro: 1.067 ± 0.309
0.991HisGln: 0.991 ± 0.327
0.61HisArg: 0.61 ± 0.244
1.22HisSer: 1.22 ± 0.318
1.982HisThr: 1.982 ± 0.472
1.067HisVal: 1.067 ± 0.266
0.076HisTrp: 0.076 ± 0.095
0.915HisTyr: 0.915 ± 0.36
0.0HisXaa: 0.0 ± 0.0
Ile
4.193IleAla: 4.193 ± 0.567
0.152IleCys: 0.152 ± 0.13
4.727IleAsp: 4.727 ± 0.582
6.709IleGlu: 6.709 ± 0.942
3.126IlePhe: 3.126 ± 0.562
5.032IleGly: 5.032 ± 0.864
0.991IleHis: 0.991 ± 0.279
3.812IleIle: 3.812 ± 0.638
8.386IleLys: 8.386 ± 0.978
3.812IleLeu: 3.812 ± 0.528
2.363IleMet: 2.363 ± 0.373
5.26IleAsn: 5.26 ± 0.847
2.211IlePro: 2.211 ± 0.367
3.049IleGln: 3.049 ± 0.46
3.278IleArg: 3.278 ± 0.605
4.346IleSer: 4.346 ± 0.583
4.65IleThr: 4.65 ± 0.529
3.431IleVal: 3.431 ± 0.471
0.991IleTrp: 0.991 ± 0.334
2.592IleTyr: 2.592 ± 0.587
0.0IleXaa: 0.0 ± 0.0
Lys
5.26LysAla: 5.26 ± 0.51
0.457LysCys: 0.457 ± 0.173
6.48LysAsp: 6.48 ± 0.803
8.31LysGlu: 8.31 ± 1.009
3.049LysPhe: 3.049 ± 0.429
5.26LysGly: 5.26 ± 0.668
1.753LysHis: 1.753 ± 0.362
6.099LysIle: 6.099 ± 0.666
7.014LysLys: 7.014 ± 0.978
7.014LysLeu: 7.014 ± 0.605
2.897LysMet: 2.897 ± 0.494
4.955LysAsn: 4.955 ± 0.683
2.44LysPro: 2.44 ± 0.447
4.803LysGln: 4.803 ± 0.657
4.879LysArg: 4.879 ± 0.649
4.193LysSer: 4.193 ± 0.585
6.023LysThr: 6.023 ± 0.842
5.565LysVal: 5.565 ± 0.728
0.762LysTrp: 0.762 ± 0.213
4.269LysTyr: 4.269 ± 0.585
0.0LysXaa: 0.0 ± 0.0
Leu
4.574LeuAla: 4.574 ± 0.688
0.229LeuCys: 0.229 ± 0.165
4.955LeuAsp: 4.955 ± 0.589
5.642LeuGlu: 5.642 ± 0.841
3.507LeuPhe: 3.507 ± 0.51
3.354LeuGly: 3.354 ± 0.518
1.372LeuHis: 1.372 ± 0.377
4.346LeuIle: 4.346 ± 0.534
6.938LeuLys: 6.938 ± 0.697
5.184LeuLeu: 5.184 ± 0.619
1.906LeuMet: 1.906 ± 0.429
5.108LeuAsn: 5.108 ± 0.605
2.363LeuPro: 2.363 ± 0.476
3.354LeuGln: 3.354 ± 0.495
2.745LeuArg: 2.745 ± 0.582
4.574LeuSer: 4.574 ± 0.536
5.413LeuThr: 5.413 ± 0.727
3.964LeuVal: 3.964 ± 0.605
0.686LeuTrp: 0.686 ± 0.274
3.126LeuTyr: 3.126 ± 0.646
0.0LeuXaa: 0.0 ± 0.0
Met
1.296MetAla: 1.296 ± 0.601
0.076MetCys: 0.076 ± 0.095
1.372MetAsp: 1.372 ± 0.288
1.449MetGlu: 1.449 ± 0.319
1.449MetPhe: 1.449 ± 0.365
1.144MetGly: 1.144 ± 0.311
0.534MetHis: 0.534 ± 0.204
1.83MetIle: 1.83 ± 0.383
2.363MetLys: 2.363 ± 0.511
2.363MetLeu: 2.363 ± 0.413
0.457MetMet: 0.457 ± 0.159
1.677MetAsn: 1.677 ± 0.351
1.067MetPro: 1.067 ± 0.246
1.982MetGln: 1.982 ± 0.506
0.686MetArg: 0.686 ± 0.234
1.753MetSer: 1.753 ± 0.417
1.677MetThr: 1.677 ± 0.37
1.144MetVal: 1.144 ± 0.264
0.457MetTrp: 0.457 ± 0.185
1.144MetTyr: 1.144 ± 0.337
0.0MetXaa: 0.0 ± 0.0
Asn
5.184AsnAla: 5.184 ± 0.574
0.457AsnCys: 0.457 ± 0.205
3.812AsnAsp: 3.812 ± 0.5
5.794AsnGlu: 5.794 ± 0.622
2.973AsnPhe: 2.973 ± 0.549
4.269AsnGly: 4.269 ± 0.656
0.762AsnHis: 0.762 ± 0.276
4.422AsnIle: 4.422 ± 0.547
7.014AsnLys: 7.014 ± 0.836
3.431AsnLeu: 3.431 ± 0.489
1.525AsnMet: 1.525 ± 0.278
5.642AsnAsn: 5.642 ± 1.157
2.592AsnPro: 2.592 ± 0.398
2.668AsnGln: 2.668 ± 0.496
2.211AsnArg: 2.211 ± 0.395
3.278AsnSer: 3.278 ± 0.436
3.431AsnThr: 3.431 ± 0.404
3.126AsnVal: 3.126 ± 0.49
0.915AsnTrp: 0.915 ± 0.264
2.516AsnTyr: 2.516 ± 0.351
0.0AsnXaa: 0.0 ± 0.0
Pro
1.449ProAla: 1.449 ± 0.34
0.076ProCys: 0.076 ± 0.077
1.525ProAsp: 1.525 ± 0.32
1.83ProGlu: 1.83 ± 0.442
1.525ProPhe: 1.525 ± 0.4
1.83ProGly: 1.83 ± 0.459
0.534ProHis: 0.534 ± 0.211
2.135ProIle: 2.135 ± 0.359
2.973ProLys: 2.973 ± 0.534
1.449ProLeu: 1.449 ± 0.268
0.762ProMet: 0.762 ± 0.233
2.211ProAsn: 2.211 ± 0.526
0.457ProPro: 0.457 ± 0.168
0.991ProGln: 0.991 ± 0.253
0.915ProArg: 0.915 ± 0.243
2.44ProSer: 2.44 ± 0.431
1.677ProThr: 1.677 ± 0.34
1.449ProVal: 1.449 ± 0.377
0.076ProTrp: 0.076 ± 0.09
1.677ProTyr: 1.677 ± 0.431
0.0ProXaa: 0.0 ± 0.0
Gln
3.507GlnAla: 3.507 ± 0.569
0.381GlnCys: 0.381 ± 0.191
1.83GlnAsp: 1.83 ± 0.417
3.049GlnGlu: 3.049 ± 0.584
1.906GlnPhe: 1.906 ± 0.338
2.592GlnGly: 2.592 ± 0.462
1.144GlnHis: 1.144 ± 0.227
3.202GlnIle: 3.202 ± 0.404
3.049GlnLys: 3.049 ± 0.511
2.821GlnLeu: 2.821 ± 0.458
1.601GlnMet: 1.601 ± 0.38
2.668GlnAsn: 2.668 ± 0.42
1.525GlnPro: 1.525 ± 0.436
2.44GlnGln: 2.44 ± 0.567
1.83GlnArg: 1.83 ± 0.398
2.592GlnSer: 2.592 ± 0.398
2.135GlnThr: 2.135 ± 0.391
3.278GlnVal: 3.278 ± 0.655
0.152GlnTrp: 0.152 ± 0.106
1.83GlnTyr: 1.83 ± 0.444
0.0GlnXaa: 0.0 ± 0.0
Arg
1.22ArgAla: 1.22 ± 0.283
0.305ArgCys: 0.305 ± 0.13
2.973ArgAsp: 2.973 ± 0.492
2.897ArgGlu: 2.897 ± 0.508
2.211ArgPhe: 2.211 ± 0.441
2.668ArgGly: 2.668 ± 0.472
1.144ArgHis: 1.144 ± 0.244
3.278ArgIle: 3.278 ± 0.636
3.659ArgLys: 3.659 ± 0.568
4.117ArgLeu: 4.117 ± 0.679
0.991ArgMet: 0.991 ± 0.228
2.973ArgAsn: 2.973 ± 0.469
0.991ArgPro: 0.991 ± 0.264
1.372ArgGln: 1.372 ± 0.341
0.991ArgArg: 0.991 ± 0.271
1.83ArgSer: 1.83 ± 0.332
1.525ArgThr: 1.525 ± 0.387
2.363ArgVal: 2.363 ± 0.391
0.534ArgTrp: 0.534 ± 0.198
2.211ArgTyr: 2.211 ± 0.407
0.0ArgXaa: 0.0 ± 0.0
Ser
4.193SerAla: 4.193 ± 0.531
0.229SerCys: 0.229 ± 0.131
3.583SerAsp: 3.583 ± 0.615
3.659SerGlu: 3.659 ± 0.556
2.745SerPhe: 2.745 ± 0.555
4.041SerGly: 4.041 ± 0.653
0.991SerHis: 0.991 ± 0.253
4.574SerIle: 4.574 ± 0.631
5.565SerLys: 5.565 ± 0.671
3.431SerLeu: 3.431 ± 0.513
1.525SerMet: 1.525 ± 0.307
4.346SerAsn: 4.346 ± 0.618
0.839SerPro: 0.839 ± 0.313
2.516SerGln: 2.516 ± 0.486
2.211SerArg: 2.211 ± 0.334
3.659SerSer: 3.659 ± 0.575
3.278SerThr: 3.278 ± 0.466
3.583SerVal: 3.583 ± 0.61
0.686SerTrp: 0.686 ± 0.195
1.906SerTyr: 1.906 ± 0.357
0.0SerXaa: 0.0 ± 0.0
Thr
3.507ThrAla: 3.507 ± 0.609
0.152ThrCys: 0.152 ± 0.1
3.583ThrAsp: 3.583 ± 0.442
4.574ThrGlu: 4.574 ± 0.502
3.126ThrPhe: 3.126 ± 0.59
3.888ThrGly: 3.888 ± 0.625
1.601ThrHis: 1.601 ± 0.409
5.718ThrIle: 5.718 ± 0.792
4.879ThrLys: 4.879 ± 0.672
4.727ThrLeu: 4.727 ± 0.555
0.686ThrMet: 0.686 ± 0.264
4.041ThrAsn: 4.041 ± 0.622
1.753ThrPro: 1.753 ± 0.356
2.592ThrGln: 2.592 ± 0.53
2.516ThrArg: 2.516 ± 0.448
4.422ThrSer: 4.422 ± 1.002
4.041ThrThr: 4.041 ± 0.657
3.583ThrVal: 3.583 ± 0.539
0.915ThrTrp: 0.915 ± 0.366
2.592ThrTyr: 2.592 ± 0.415
0.0ThrXaa: 0.0 ± 0.0
Val
4.498ValAla: 4.498 ± 0.737
0.229ValCys: 0.229 ± 0.138
4.269ValAsp: 4.269 ± 0.687
5.184ValGlu: 5.184 ± 0.732
2.135ValPhe: 2.135 ± 0.349
3.736ValGly: 3.736 ± 0.6
0.305ValHis: 0.305 ± 0.149
5.032ValIle: 5.032 ± 0.541
6.328ValLys: 6.328 ± 0.571
5.032ValLeu: 5.032 ± 0.601
1.677ValMet: 1.677 ± 0.41
3.126ValAsn: 3.126 ± 0.506
2.363ValPro: 2.363 ± 0.479
1.296ValGln: 1.296 ± 0.368
1.906ValArg: 1.906 ± 0.361
3.583ValSer: 3.583 ± 0.58
4.117ValThr: 4.117 ± 0.646
4.65ValVal: 4.65 ± 0.524
0.915ValTrp: 0.915 ± 0.259
2.287ValTyr: 2.287 ± 0.474
0.0ValXaa: 0.0 ± 0.0
Trp
0.457TrpAla: 0.457 ± 0.195
0.076TrpCys: 0.076 ± 0.074
0.305TrpAsp: 0.305 ± 0.13
1.067TrpGlu: 1.067 ± 0.283
0.762TrpPhe: 0.762 ± 0.217
1.144TrpGly: 1.144 ± 0.347
0.229TrpHis: 0.229 ± 0.126
0.61TrpIle: 0.61 ± 0.21
1.067TrpLys: 1.067 ± 0.279
0.839TrpLeu: 0.839 ± 0.281
0.229TrpMet: 0.229 ± 0.124
0.686TrpAsn: 0.686 ± 0.227
0.0TrpPro: 0.0 ± 0.0
0.686TrpGln: 0.686 ± 0.233
0.534TrpArg: 0.534 ± 0.184
0.686TrpSer: 0.686 ± 0.262
1.144TrpThr: 1.144 ± 0.211
1.067TrpVal: 1.067 ± 0.283
0.076TrpTrp: 0.076 ± 0.071
0.534TrpTyr: 0.534 ± 0.229
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.058TyrAla: 2.058 ± 0.439
0.381TyrCys: 0.381 ± 0.207
1.677TyrAsp: 1.677 ± 0.403
3.202TyrGlu: 3.202 ± 0.453
1.372TyrPhe: 1.372 ± 0.347
2.287TyrGly: 2.287 ± 0.508
0.839TyrHis: 0.839 ± 0.291
3.354TyrIle: 3.354 ± 0.542
4.117TyrLys: 4.117 ± 0.535
3.431TyrLeu: 3.431 ± 0.475
0.991TyrMet: 0.991 ± 0.324
2.897TyrAsn: 2.897 ± 0.488
1.296TyrPro: 1.296 ± 0.342
2.135TyrGln: 2.135 ± 0.364
2.592TyrArg: 2.592 ± 0.543
2.363TyrSer: 2.363 ± 0.404
2.44TyrThr: 2.44 ± 0.434
3.049TyrVal: 3.049 ± 0.661
0.839TyrTrp: 0.839 ± 0.219
2.211TyrTyr: 2.211 ± 0.431
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (13118 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski