Amino acid dipepetide frequency for Streptococcus virus ALQ132

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.779AlaAla: 5.779 ± 2.412
0.275AlaCys: 0.275 ± 0.18
4.77AlaAsp: 4.77 ± 0.824
4.586AlaGlu: 4.586 ± 0.845
3.027AlaPhe: 3.027 ± 1.274
5.687AlaGly: 5.687 ± 1.38
0.826AlaHis: 0.826 ± 0.324
6.237AlaIle: 6.237 ± 1.723
5.32AlaLys: 5.32 ± 0.65
6.237AlaLeu: 6.237 ± 1.205
2.477AlaMet: 2.477 ± 1.219
4.219AlaAsn: 4.219 ± 0.809
2.293AlaPro: 2.293 ± 0.495
2.752AlaGln: 2.752 ± 1.034
3.119AlaArg: 3.119 ± 0.725
6.329AlaSer: 6.329 ± 1.671
4.219AlaThr: 4.219 ± 1.095
3.853AlaVal: 3.853 ± 1.181
0.459AlaTrp: 0.459 ± 0.149
2.11AlaTyr: 2.11 ± 0.46
0.0AlaXaa: 0.0 ± 0.0
Cys
0.183CysAla: 0.183 ± 0.13
0.459CysCys: 0.459 ± 0.249
0.734CysAsp: 0.734 ± 0.309
0.55CysGlu: 0.55 ± 0.213
0.183CysPhe: 0.183 ± 0.137
0.55CysGly: 0.55 ± 0.237
0.275CysHis: 0.275 ± 0.166
0.459CysIle: 0.459 ± 0.24
0.459CysLys: 0.459 ± 0.252
0.367CysLeu: 0.367 ± 0.214
0.0CysMet: 0.0 ± 0.0
0.183CysAsn: 0.183 ± 0.139
0.183CysPro: 0.183 ± 0.143
0.092CysGln: 0.092 ± 0.098
0.459CysArg: 0.459 ± 0.249
0.642CysSer: 0.642 ± 0.34
0.0CysThr: 0.0 ± 0.0
0.183CysVal: 0.183 ± 0.128
0.092CysTrp: 0.092 ± 0.088
0.275CysTyr: 0.275 ± 0.182
0.0CysXaa: 0.0 ± 0.0
Asp
3.027AspAla: 3.027 ± 0.522
0.826AspCys: 0.826 ± 0.246
4.861AspAsp: 4.861 ± 0.726
3.761AspGlu: 3.761 ± 0.649
3.669AspPhe: 3.669 ± 0.784
5.87AspGly: 5.87 ± 1.079
0.55AspHis: 0.55 ± 0.292
3.486AspIle: 3.486 ± 0.716
4.861AspLys: 4.861 ± 0.925
4.036AspLeu: 4.036 ± 0.582
1.559AspMet: 1.559 ± 0.423
4.953AspAsn: 4.953 ± 0.811
0.734AspPro: 0.734 ± 0.339
1.284AspGln: 1.284 ± 0.336
3.027AspArg: 3.027 ± 0.588
4.128AspSer: 4.128 ± 0.654
3.853AspThr: 3.853 ± 0.7
4.036AspVal: 4.036 ± 0.531
1.192AspTrp: 1.192 ± 0.469
3.21AspTyr: 3.21 ± 0.684
0.0AspXaa: 0.0 ± 0.0
Glu
4.128GluAla: 4.128 ± 0.692
0.275GluCys: 0.275 ± 0.152
1.559GluAsp: 1.559 ± 0.451
3.669GluGlu: 3.669 ± 0.822
3.027GluPhe: 3.027 ± 0.596
3.394GluGly: 3.394 ± 0.625
1.009GluHis: 1.009 ± 0.377
4.861GluIle: 4.861 ± 0.822
4.311GluLys: 4.311 ± 0.892
7.246GluLeu: 7.246 ± 1.218
1.743GluMet: 1.743 ± 0.426
4.495GluAsn: 4.495 ± 0.739
1.926GluPro: 1.926 ± 0.67
2.385GluGln: 2.385 ± 0.341
3.853GluArg: 3.853 ± 0.635
2.752GluSer: 2.752 ± 0.817
3.761GluThr: 3.761 ± 0.805
5.412GluVal: 5.412 ± 0.851
0.917GluTrp: 0.917 ± 0.278
3.027GluTyr: 3.027 ± 0.69
0.0GluXaa: 0.0 ± 0.0
Phe
2.66PheAla: 2.66 ± 0.499
0.275PheCys: 0.275 ± 0.205
3.21PheAsp: 3.21 ± 0.5
3.394PheGlu: 3.394 ± 0.679
1.101PhePhe: 1.101 ± 0.366
3.853PheGly: 3.853 ± 0.637
0.367PheHis: 0.367 ± 0.162
2.752PheIle: 2.752 ± 0.532
4.953PheLys: 4.953 ± 0.748
2.752PheLeu: 2.752 ± 0.716
0.367PheMet: 0.367 ± 0.157
3.302PheAsn: 3.302 ± 0.567
0.55PhePro: 0.55 ± 0.24
1.101PheGln: 1.101 ± 0.314
1.376PheArg: 1.376 ± 0.322
3.577PheSer: 3.577 ± 0.902
3.21PheThr: 3.21 ± 0.705
2.293PheVal: 2.293 ± 0.417
0.55PheTrp: 0.55 ± 0.269
1.284PheTyr: 1.284 ± 0.376
0.0PheXaa: 0.0 ± 0.0
Gly
5.045GlyAla: 5.045 ± 1.154
0.092GlyCys: 0.092 ± 0.097
3.577GlyAsp: 3.577 ± 0.571
3.119GlyGlu: 3.119 ± 0.579
2.935GlyPhe: 2.935 ± 0.559
3.486GlyGly: 3.486 ± 0.559
0.459GlyHis: 0.459 ± 0.207
6.421GlyIle: 6.421 ± 1.74
6.513GlyLys: 6.513 ± 1.097
6.971GlyLeu: 6.971 ± 1.089
1.559GlyMet: 1.559 ± 0.716
3.944GlyAsn: 3.944 ± 0.536
1.284GlyPro: 1.284 ± 0.569
2.935GlyGln: 2.935 ± 0.63
2.935GlyArg: 2.935 ± 0.606
5.045GlySer: 5.045 ± 0.939
4.586GlyThr: 4.586 ± 0.83
4.953GlyVal: 4.953 ± 0.66
1.192GlyTrp: 1.192 ± 0.467
3.486GlyTyr: 3.486 ± 0.561
0.0GlyXaa: 0.0 ± 0.0
His
0.917HisAla: 0.917 ± 0.251
0.092HisCys: 0.092 ± 0.097
0.917HisAsp: 0.917 ± 0.291
0.826HisGlu: 0.826 ± 0.262
0.367HisPhe: 0.367 ± 0.17
1.101HisGly: 1.101 ± 0.337
0.367HisHis: 0.367 ± 0.18
1.009HisIle: 1.009 ± 0.34
0.734HisLys: 0.734 ± 0.232
0.55HisLeu: 0.55 ± 0.267
0.183HisMet: 0.183 ± 0.115
0.55HisAsn: 0.55 ± 0.245
0.367HisPro: 0.367 ± 0.201
0.275HisGln: 0.275 ± 0.172
0.642HisArg: 0.642 ± 0.261
0.734HisSer: 0.734 ± 0.33
0.917HisThr: 0.917 ± 0.294
1.192HisVal: 1.192 ± 0.38
0.092HisTrp: 0.092 ± 0.086
0.55HisTyr: 0.55 ± 0.213
0.0HisXaa: 0.0 ± 0.0
Ile
5.687IleAla: 5.687 ± 1.161
0.734IleCys: 0.734 ± 0.273
4.953IleAsp: 4.953 ± 0.642
4.036IleGlu: 4.036 ± 0.571
1.835IlePhe: 1.835 ± 0.381
4.953IleGly: 4.953 ± 1.09
1.284IleHis: 1.284 ± 0.312
4.219IleIle: 4.219 ± 0.922
6.421IleLys: 6.421 ± 0.763
3.027IleLeu: 3.027 ± 0.519
2.293IleMet: 2.293 ± 0.425
3.027IleAsn: 3.027 ± 0.694
2.844IlePro: 2.844 ± 0.68
2.385IleGln: 2.385 ± 0.497
3.21IleArg: 3.21 ± 0.671
5.595IleSer: 5.595 ± 1.631
4.128IleThr: 4.128 ± 0.707
4.036IleVal: 4.036 ± 0.651
0.734IleTrp: 0.734 ± 0.247
3.21IleTyr: 3.21 ± 0.68
0.0IleXaa: 0.0 ± 0.0
Lys
7.43LysAla: 7.43 ± 1.116
0.55LysCys: 0.55 ± 0.268
4.678LysAsp: 4.678 ± 0.669
6.879LysGlu: 6.879 ± 1.111
2.293LysPhe: 2.293 ± 0.519
5.687LysGly: 5.687 ± 0.649
0.734LysHis: 0.734 ± 0.21
4.678LysIle: 4.678 ± 0.752
6.054LysLys: 6.054 ± 1.149
5.962LysLeu: 5.962 ± 0.829
2.11LysMet: 2.11 ± 0.53
3.761LysAsn: 3.761 ± 0.874
2.844LysPro: 2.844 ± 0.605
2.293LysGln: 2.293 ± 0.522
4.953LysArg: 4.953 ± 0.846
4.219LysSer: 4.219 ± 0.599
6.054LysThr: 6.054 ± 1.098
3.853LysVal: 3.853 ± 0.565
0.734LysTrp: 0.734 ± 0.258
4.036LysTyr: 4.036 ± 1.143
0.0LysXaa: 0.0 ± 0.0
Leu
6.237LeuAla: 6.237 ± 1.092
0.367LeuCys: 0.367 ± 0.205
5.504LeuAsp: 5.504 ± 0.748
5.962LeuGlu: 5.962 ± 1.097
2.935LeuPhe: 2.935 ± 0.329
5.137LeuGly: 5.137 ± 0.909
0.55LeuHis: 0.55 ± 0.2
4.495LeuIle: 4.495 ± 0.723
6.146LeuLys: 6.146 ± 0.996
4.861LeuLeu: 4.861 ± 0.859
2.293LeuMet: 2.293 ± 0.424
5.87LeuAsn: 5.87 ± 0.681
2.568LeuPro: 2.568 ± 0.565
2.568LeuGln: 2.568 ± 0.436
3.21LeuArg: 3.21 ± 0.627
4.678LeuSer: 4.678 ± 0.699
6.237LeuThr: 6.237 ± 0.92
3.853LeuVal: 3.853 ± 0.64
0.826LeuTrp: 0.826 ± 0.327
2.477LeuTyr: 2.477 ± 0.484
0.0LeuXaa: 0.0 ± 0.0
Met
2.935MetAla: 2.935 ± 0.867
0.183MetCys: 0.183 ± 0.144
1.009MetAsp: 1.009 ± 0.302
1.009MetGlu: 1.009 ± 0.273
2.018MetPhe: 2.018 ± 0.472
1.284MetGly: 1.284 ± 0.322
0.367MetHis: 0.367 ± 0.185
1.101MetIle: 1.101 ± 0.398
2.201MetLys: 2.201 ± 0.502
1.835MetLeu: 1.835 ± 0.377
1.101MetMet: 1.101 ± 0.451
0.734MetAsn: 0.734 ± 0.236
0.459MetPro: 0.459 ± 0.193
1.468MetGln: 1.468 ± 0.425
1.009MetArg: 1.009 ± 0.253
2.844MetSer: 2.844 ± 0.618
1.101MetThr: 1.101 ± 0.331
2.477MetVal: 2.477 ± 0.477
0.0MetTrp: 0.0 ± 0.0
0.459MetTyr: 0.459 ± 0.191
0.0MetXaa: 0.0 ± 0.0
Asn
3.944AsnAla: 3.944 ± 0.442
0.275AsnCys: 0.275 ± 0.153
4.128AsnAsp: 4.128 ± 0.681
3.761AsnGlu: 3.761 ± 0.781
2.201AsnPhe: 2.201 ± 0.444
5.87AsnGly: 5.87 ± 1.115
1.101AsnHis: 1.101 ± 0.446
2.935AsnIle: 2.935 ± 0.611
4.403AsnLys: 4.403 ± 0.775
4.403AsnLeu: 4.403 ± 0.684
1.376AsnMet: 1.376 ± 0.335
3.944AsnAsn: 3.944 ± 0.955
2.66AsnPro: 2.66 ± 0.377
1.376AsnGln: 1.376 ± 0.367
2.293AsnArg: 2.293 ± 0.518
3.486AsnSer: 3.486 ± 0.681
3.21AsnThr: 3.21 ± 0.57
2.844AsnVal: 2.844 ± 0.386
1.376AsnTrp: 1.376 ± 0.386
2.018AsnTyr: 2.018 ± 0.635
0.0AsnXaa: 0.0 ± 0.0
Pro
1.284ProAla: 1.284 ± 0.357
0.092ProCys: 0.092 ± 0.092
1.743ProAsp: 1.743 ± 0.534
1.926ProGlu: 1.926 ± 0.532
1.101ProPhe: 1.101 ± 0.265
1.192ProGly: 1.192 ± 0.36
0.275ProHis: 0.275 ± 0.123
2.11ProIle: 2.11 ± 0.443
3.027ProLys: 3.027 ± 0.55
2.018ProLeu: 2.018 ± 0.49
0.275ProMet: 0.275 ± 0.151
2.018ProAsn: 2.018 ± 0.522
1.101ProPro: 1.101 ± 0.295
1.743ProGln: 1.743 ± 0.464
1.101ProArg: 1.101 ± 0.377
2.293ProSer: 2.293 ± 0.38
1.651ProThr: 1.651 ± 0.493
1.926ProVal: 1.926 ± 0.537
0.275ProTrp: 0.275 ± 0.138
0.917ProTyr: 0.917 ± 0.343
0.0ProXaa: 0.0 ± 0.0
Gln
3.853GlnAla: 3.853 ± 0.919
0.275GlnCys: 0.275 ± 0.171
1.468GlnAsp: 1.468 ± 0.367
2.935GlnGlu: 2.935 ± 0.596
2.201GlnPhe: 2.201 ± 0.506
2.935GlnGly: 2.935 ± 0.866
0.459GlnHis: 0.459 ± 0.239
1.926GlnIle: 1.926 ± 0.66
2.293GlnLys: 2.293 ± 0.537
3.21GlnLeu: 3.21 ± 0.488
1.284GlnMet: 1.284 ± 0.284
1.376GlnAsn: 1.376 ± 0.25
0.826GlnPro: 0.826 ± 0.239
1.284GlnGln: 1.284 ± 0.345
1.284GlnArg: 1.284 ± 0.368
2.752GlnSer: 2.752 ± 0.578
2.752GlnThr: 2.752 ± 0.587
2.477GlnVal: 2.477 ± 0.596
0.275GlnTrp: 0.275 ± 0.138
1.284GlnTyr: 1.284 ± 0.429
0.0GlnXaa: 0.0 ± 0.0
Arg
3.21ArgAla: 3.21 ± 0.55
0.367ArgCys: 0.367 ± 0.169
2.568ArgAsp: 2.568 ± 0.434
2.935ArgGlu: 2.935 ± 0.588
1.835ArgPhe: 1.835 ± 0.431
2.66ArgGly: 2.66 ± 0.444
0.459ArgHis: 0.459 ± 0.239
3.577ArgIle: 3.577 ± 0.692
3.486ArgLys: 3.486 ± 0.675
4.678ArgLeu: 4.678 ± 0.771
1.743ArgMet: 1.743 ± 0.392
1.926ArgAsn: 1.926 ± 0.449
0.917ArgPro: 0.917 ± 0.24
1.468ArgGln: 1.468 ± 0.267
1.559ArgArg: 1.559 ± 0.46
2.385ArgSer: 2.385 ± 0.451
1.743ArgThr: 1.743 ± 0.509
2.568ArgVal: 2.568 ± 0.587
0.734ArgTrp: 0.734 ± 0.308
2.568ArgTyr: 2.568 ± 0.514
0.0ArgXaa: 0.0 ± 0.0
Ser
6.329SerAla: 6.329 ± 3.158
0.183SerCys: 0.183 ± 0.123
4.495SerAsp: 4.495 ± 0.677
3.761SerGlu: 3.761 ± 0.581
2.935SerPhe: 2.935 ± 0.394
4.77SerGly: 4.77 ± 0.694
0.917SerHis: 0.917 ± 0.383
5.595SerIle: 5.595 ± 0.777
4.495SerLys: 4.495 ± 0.809
4.77SerLeu: 4.77 ± 0.8
1.559SerMet: 1.559 ± 0.335
3.761SerAsn: 3.761 ± 0.518
1.468SerPro: 1.468 ± 0.291
4.128SerGln: 4.128 ± 1.14
2.568SerArg: 2.568 ± 0.429
4.128SerSer: 4.128 ± 0.914
4.495SerThr: 4.495 ± 0.839
5.595SerVal: 5.595 ± 0.747
1.009SerTrp: 1.009 ± 0.247
1.651SerTyr: 1.651 ± 0.375
0.0SerXaa: 0.0 ± 0.0
Thr
4.403ThrAla: 4.403 ± 1.418
0.275ThrCys: 0.275 ± 0.162
3.944ThrAsp: 3.944 ± 0.765
3.027ThrGlu: 3.027 ± 0.535
3.944ThrPhe: 3.944 ± 0.608
4.678ThrGly: 4.678 ± 0.912
1.192ThrHis: 1.192 ± 0.277
5.045ThrIle: 5.045 ± 1.037
5.962ThrLys: 5.962 ± 0.729
5.32ThrLeu: 5.32 ± 0.847
1.468ThrMet: 1.468 ± 0.807
2.752ThrAsn: 2.752 ± 0.75
1.743ThrPro: 1.743 ± 0.478
2.844ThrGln: 2.844 ± 0.538
2.11ThrArg: 2.11 ± 0.519
3.486ThrSer: 3.486 ± 0.686
4.495ThrThr: 4.495 ± 0.728
5.045ThrVal: 5.045 ± 0.588
0.092ThrTrp: 0.092 ± 0.077
2.752ThrTyr: 2.752 ± 0.805
0.0ThrXaa: 0.0 ± 0.0
Val
4.403ValAla: 4.403 ± 0.923
0.183ValCys: 0.183 ± 0.129
5.32ValAsp: 5.32 ± 1.034
4.311ValGlu: 4.311 ± 0.768
3.119ValPhe: 3.119 ± 0.582
4.128ValGly: 4.128 ± 0.782
0.642ValHis: 0.642 ± 0.227
4.586ValIle: 4.586 ± 0.7
4.953ValLys: 4.953 ± 0.557
4.219ValLeu: 4.219 ± 0.509
0.917ValMet: 0.917 ± 0.282
4.036ValAsn: 4.036 ± 0.905
2.385ValPro: 2.385 ± 0.383
2.385ValGln: 2.385 ± 0.729
1.835ValArg: 1.835 ± 0.35
5.595ValSer: 5.595 ± 0.719
4.77ValThr: 4.77 ± 0.604
4.953ValVal: 4.953 ± 0.606
0.826ValTrp: 0.826 ± 0.209
1.559ValTyr: 1.559 ± 0.439
0.0ValXaa: 0.0 ± 0.0
Trp
0.367TrpAla: 0.367 ± 0.171
0.092TrpCys: 0.092 ± 0.092
0.642TrpAsp: 0.642 ± 0.3
0.917TrpGlu: 0.917 ± 0.279
0.367TrpPhe: 0.367 ± 0.184
1.009TrpGly: 1.009 ± 0.318
0.183TrpHis: 0.183 ± 0.11
0.55TrpIle: 0.55 ± 0.188
0.917TrpLys: 0.917 ± 0.219
0.917TrpLeu: 0.917 ± 0.341
0.183TrpMet: 0.183 ± 0.139
0.826TrpAsn: 0.826 ± 0.257
0.092TrpPro: 0.092 ± 0.097
0.642TrpGln: 0.642 ± 0.217
0.459TrpArg: 0.459 ± 0.222
1.468TrpSer: 1.468 ± 0.706
0.826TrpThr: 0.826 ± 0.335
0.917TrpVal: 0.917 ± 0.303
0.275TrpTrp: 0.275 ± 0.182
0.367TrpTyr: 0.367 ± 0.305
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.752TyrAla: 2.752 ± 0.378
0.459TyrCys: 0.459 ± 0.185
2.935TyrAsp: 2.935 ± 0.708
2.385TyrGlu: 2.385 ± 0.606
1.835TyrPhe: 1.835 ± 0.458
2.293TyrGly: 2.293 ± 0.486
0.367TyrHis: 0.367 ± 0.177
2.568TyrIle: 2.568 ± 0.558
2.293TyrLys: 2.293 ± 0.513
3.394TyrLeu: 3.394 ± 0.679
1.009TyrMet: 1.009 ± 0.289
1.926TyrAsn: 1.926 ± 0.464
0.826TyrPro: 0.826 ± 0.268
1.743TyrGln: 1.743 ± 0.363
2.385TyrArg: 2.385 ± 0.663
2.477TyrSer: 2.477 ± 0.514
2.568TyrThr: 2.568 ± 0.723
2.66TyrVal: 2.66 ± 0.623
0.367TyrTrp: 0.367 ± 0.152
2.018TyrTyr: 2.018 ± 0.577
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (10903 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski