Amino acid dipeptide frequency for Staphylococcus virus pSp_SNUABM-S

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
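The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by the database: the function names, the one-letter residue codes (the table uses three-letter codes), the choice to count overlapping pairs within each protein only, and the toy sequences are all assumptions. The baseline of 2.5‰ is the uniform expectation stated in the text.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        seq = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        # Overlapping windows of length 2, counted within each protein only.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    # Report all 21 x 21 = 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def classify(freq):
    """Label a frequency relative to the uniform 2.5 per mille baseline."""
    if freq > UNIFORM_EXPECTATION:
        return "over"
    if freq < UNIFORM_EXPECTATION:
        return "under"
    return "expected"


# Toy input, not real proteome data.
toy = ["MKKLLA", "MKAX"]
freqs = dipeptide_per_mille(toy)
```

With the toy input there are eight dipeptide windows in total, so a pair seen twice (MK) comes out at 250‰. Applied to the table values, classify(7.992) would label IleLys as overrepresented and classify(0.396) would label AlaCys as underrepresented.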

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.
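As a rough illustration of the unit conversion, the sketch below turns a per mille value into a fraction and into an approximate number of observed dipeptides. The helper names are hypothetical, and the total is an assumption: the page does not state the exact number of dipeptide windows, so the amino acid count reported at the bottom of the page (12638) is used as an approximation.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def approx_count(value_per_mille, total_dipeptides):
    """Approximate how many dipeptides a per mille value corresponds to."""
    return round(per_mille_to_fraction(value_per_mille) * total_dipeptides)


# Assumption: the number of dipeptide windows is close to the number of
# amino acids in the proteome (12638 according to the page footer).
TOTAL = 12638

ala_ala = approx_count(2.137, TOTAL)   # AlaAla, 2.137 per mille
cys_ala = approx_count(0.079, TOTAL)   # CysAla, the smallest non-zero value
```

Under this assumption, AlaAla corresponds to about 27 occurrences, and the smallest non-zero value in the table (0.079‰) to a single occurrence.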

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.137 ± 0.362
AlaCys: 0.396 ± 0.178
AlaAsp: 3.798 ± 0.538
AlaGlu: 5.144 ± 0.674
AlaPhe: 2.77 ± 0.458
AlaGly: 2.374 ± 0.423
AlaHis: 0.87 ± 0.279
AlaIle: 5.223 ± 1.076
AlaLys: 5.777 ± 0.861
AlaLeu: 5.539 ± 0.873
AlaMet: 1.187 ± 0.299
AlaAsn: 4.194 ± 0.588
AlaPro: 1.029 ± 0.288
AlaGln: 2.453 ± 0.407
AlaArg: 2.295 ± 0.421
AlaSer: 3.086 ± 0.539
AlaThr: 4.827 ± 0.592
AlaVal: 3.482 ± 0.701
AlaTrp: 0.791 ± 0.216
AlaTyr: 2.137 ± 0.439
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.079 ± 0.098
CysCys: 0.0 ± 0.0
CysAsp: 0.396 ± 0.177
CysGlu: 0.475 ± 0.309
CysPhe: 0.0 ± 0.0
CysGly: 0.317 ± 0.172
CysHis: 0.237 ± 0.12
CysIle: 0.396 ± 0.161
CysLys: 0.554 ± 0.212
CysLeu: 0.396 ± 0.187
CysMet: 0.079 ± 0.071
CysAsn: 0.554 ± 0.207
CysPro: 0.079 ± 0.098
CysGln: 0.237 ± 0.155
CysArg: 0.396 ± 0.184
CysSer: 0.396 ± 0.166
CysThr: 0.475 ± 0.162
CysVal: 0.079 ± 0.072
CysTrp: 0.079 ± 0.069
CysTyr: 0.158 ± 0.107
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.798 ± 0.572
AspCys: 0.158 ± 0.121
AspAsp: 5.539 ± 0.82
AspGlu: 5.698 ± 0.947
AspPhe: 2.928 ± 0.487
AspGly: 4.352 ± 0.558
AspHis: 0.791 ± 0.267
AspIle: 4.748 ± 0.625
AspLys: 5.856 ± 0.741
AspLeu: 5.539 ± 0.968
AspMet: 1.741 ± 0.379
AspAsn: 3.957 ± 0.489
AspPro: 1.108 ± 0.228
AspGln: 0.95 ± 0.342
AspArg: 1.662 ± 0.305
AspSer: 4.431 ± 0.545
AspThr: 3.086 ± 0.388
AspVal: 4.511 ± 0.524
AspTrp: 0.475 ± 0.182
AspTyr: 3.482 ± 0.581
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.352 ± 0.636
GluCys: 1.029 ± 0.312
GluAsp: 3.086 ± 0.571
GluGlu: 6.568 ± 1.049
GluPhe: 3.798 ± 0.505
GluGly: 2.374 ± 0.403
GluHis: 0.87 ± 0.243
GluIle: 5.302 ± 0.739
GluLys: 5.381 ± 0.663
GluLeu: 6.726 ± 0.809
GluMet: 2.928 ± 0.464
GluAsn: 4.669 ± 0.654
GluPro: 1.424 ± 0.397
GluGln: 3.561 ± 0.494
GluArg: 4.985 ± 0.767
GluSer: 3.878 ± 0.811
GluThr: 2.849 ± 0.435
GluVal: 5.144 ± 0.655
GluTrp: 0.87 ± 0.241
GluTyr: 3.719 ± 0.607
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.295 ± 0.459
PheCys: 0.237 ± 0.145
PheAsp: 2.216 ± 0.533
PheGlu: 3.798 ± 0.611
PhePhe: 1.345 ± 0.286
PheGly: 2.77 ± 0.73
PheHis: 0.475 ± 0.167
PheIle: 3.244 ± 0.429
PheLys: 4.748 ± 0.563
PheLeu: 3.878 ± 0.641
PheMet: 0.95 ± 0.238
PheAsn: 3.165 ± 0.491
PhePro: 0.87 ± 0.2
PheGln: 0.554 ± 0.23
PheArg: 1.187 ± 0.294
PheSer: 2.453 ± 0.508
PheThr: 2.532 ± 0.428
PheVal: 3.482 ± 0.544
PheTrp: 0.237 ± 0.145
PheTyr: 1.583 ± 0.351
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.691 ± 0.809
GlyCys: 0.317 ± 0.135
GlyAsp: 2.77 ± 0.563
GlyGlu: 3.719 ± 0.572
GlyPhe: 2.216 ± 0.441
GlyGly: 1.662 ± 0.376
GlyHis: 1.029 ± 0.217
GlyIle: 3.482 ± 0.586
GlyLys: 5.144 ± 0.717
GlyLeu: 3.957 ± 0.71
GlyMet: 1.029 ± 0.242
GlyAsn: 2.374 ± 0.403
GlyPro: 0.237 ± 0.167
GlyGln: 2.453 ± 0.461
GlyArg: 2.374 ± 0.41
GlySer: 2.532 ± 0.356
GlyThr: 3.482 ± 0.447
GlyVal: 4.748 ± 0.662
GlyTrp: 0.396 ± 0.15
GlyTyr: 3.64 ± 0.477
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.237 ± 0.13
HisCys: 0.237 ± 0.129
HisAsp: 1.187 ± 0.313
HisGlu: 0.396 ± 0.174
HisPhe: 0.712 ± 0.245
HisGly: 1.345 ± 0.34
HisHis: 0.079 ± 0.076
HisIle: 1.424 ± 0.375
HisLys: 1.583 ± 0.477
HisLeu: 1.583 ± 0.396
HisMet: 0.317 ± 0.171
HisAsn: 1.108 ± 0.269
HisPro: 0.554 ± 0.203
HisGln: 0.95 ± 0.282
HisArg: 0.237 ± 0.195
HisSer: 1.108 ± 0.262
HisThr: 0.95 ± 0.276
HisVal: 0.95 ± 0.279
HisTrp: 0.0 ± 0.0
HisTyr: 1.187 ± 0.366
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.064 ± 0.621
IleCys: 0.475 ± 0.212
IleAsp: 6.41 ± 0.61
IleGlu: 5.46 ± 0.835
IlePhe: 2.928 ± 0.54
IleGly: 2.928 ± 0.707
IleHis: 0.554 ± 0.236
IleIle: 4.352 ± 0.823
IleLys: 7.992 ± 0.709
IleLeu: 4.273 ± 0.667
IleMet: 1.266 ± 0.325
IleAsn: 5.302 ± 0.643
IlePro: 3.244 ± 0.41
IleGln: 3.007 ± 0.522
IleArg: 2.691 ± 0.422
IleSer: 4.985 ± 0.669
IleThr: 4.352 ± 0.756
IleVal: 4.273 ± 0.582
IleTrp: 0.87 ± 0.251
IleTyr: 3.324 ± 0.531
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.251 ± 0.679
LysCys: 0.158 ± 0.115
LysAsp: 6.251 ± 0.801
LysGlu: 6.41 ± 0.879
LysPhe: 3.482 ± 0.553
LysGly: 4.906 ± 0.661
LysHis: 1.424 ± 0.345
LysIle: 7.28 ± 0.631
LysLys: 7.518 ± 0.923
LysLeu: 7.201 ± 0.779
LysMet: 2.928 ± 0.506
LysAsn: 4.511 ± 0.641
LysPro: 2.611 ± 0.821
LysGln: 5.381 ± 0.649
LysArg: 4.036 ± 0.707
LysSer: 5.381 ± 0.815
LysThr: 5.935 ± 0.768
LysVal: 6.014 ± 0.715
LysTrp: 1.108 ± 0.261
LysTyr: 3.798 ± 0.664
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.036 ± 0.545
LeuCys: 0.079 ± 0.072
LeuAsp: 5.856 ± 0.659
LeuGlu: 6.489 ± 0.734
LeuPhe: 3.878 ± 0.476
LeuGly: 3.64 ± 0.655
LeuHis: 1.266 ± 0.286
LeuIle: 5.856 ± 0.64
LeuLys: 6.647 ± 0.745
LeuLeu: 6.331 ± 0.681
LeuMet: 2.057 ± 0.511
LeuAsn: 6.805 ± 0.937
LeuPro: 2.057 ± 0.351
LeuGln: 2.849 ± 0.496
LeuArg: 3.798 ± 0.626
LeuSer: 5.856 ± 0.828
LeuThr: 6.093 ± 0.787
LeuVal: 3.719 ± 0.639
LeuTrp: 1.187 ± 0.419
LeuTyr: 2.137 ± 0.407
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.899 ± 0.429
MetCys: 0.079 ± 0.079
MetAsp: 1.741 ± 0.365
MetGlu: 1.82 ± 0.431
MetPhe: 1.029 ± 0.261
MetGly: 0.791 ± 0.24
MetHis: 0.554 ± 0.213
MetIle: 2.137 ± 0.357
MetLys: 2.374 ± 0.384
MetLeu: 1.741 ± 0.364
MetMet: 0.554 ± 0.194
MetAsn: 1.662 ± 0.397
MetPro: 0.633 ± 0.215
MetGln: 1.029 ± 0.325
MetArg: 0.87 ± 0.276
MetSer: 1.662 ± 0.367
MetThr: 1.583 ± 0.319
MetVal: 0.95 ± 0.267
MetTrp: 0.237 ± 0.155
MetTyr: 0.95 ± 0.288
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.985 ± 0.659
AsnCys: 0.554 ± 0.252
AsnAsp: 4.194 ± 0.9
AsnGlu: 4.036 ± 0.541
AsnPhe: 1.978 ± 0.343
AsnGly: 5.144 ± 0.635
AsnHis: 1.029 ± 0.263
AsnIle: 3.324 ± 0.457
AsnLys: 6.251 ± 0.66
AsnLeu: 4.59 ± 0.705
AsnMet: 1.662 ± 0.312
AsnAsn: 5.381 ± 0.876
AsnPro: 2.137 ± 0.377
AsnGln: 1.978 ± 0.459
AsnArg: 2.137 ± 0.435
AsnSer: 4.511 ± 0.642
AsnThr: 4.194 ± 0.569
AsnVal: 4.115 ± 0.585
AsnTrp: 0.791 ± 0.217
AsnTyr: 2.928 ± 0.598
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.583 ± 0.343
ProCys: 0.237 ± 0.141
ProAsp: 1.187 ± 0.309
ProGlu: 1.583 ± 0.435
ProPhe: 0.87 ± 0.291
ProGly: 0.712 ± 0.262
ProHis: 0.554 ± 0.173
ProIle: 2.611 ± 0.681
ProLys: 3.561 ± 0.681
ProLeu: 1.345 ± 0.304
ProMet: 0.554 ± 0.187
ProAsn: 1.82 ± 0.319
ProPro: 0.712 ± 0.211
ProGln: 1.187 ± 0.283
ProArg: 0.791 ± 0.217
ProSer: 2.057 ± 0.402
ProThr: 1.345 ± 0.304
ProVal: 1.82 ± 0.389
ProTrp: 0.0 ± 0.0
ProTyr: 0.87 ± 0.269
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.086 ± 0.445
GlnCys: 0.317 ± 0.131
GlnAsp: 1.899 ± 0.423
GlnGlu: 1.978 ± 0.505
GlnPhe: 1.583 ± 0.33
GlnGly: 1.662 ± 0.294
GlnHis: 0.396 ± 0.231
GlnIle: 2.137 ± 0.436
GlnLys: 4.115 ± 0.49
GlnLeu: 3.324 ± 0.547
GlnMet: 0.475 ± 0.22
GlnAsn: 3.165 ± 0.407
GlnPro: 0.87 ± 0.236
GlnGln: 1.899 ± 0.458
GlnArg: 1.899 ± 0.431
GlnSer: 2.453 ± 0.37
GlnThr: 2.532 ± 0.402
GlnVal: 2.77 ± 0.553
GlnTrp: 0.237 ± 0.132
GlnTyr: 2.137 ± 0.399
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.295 ± 0.471
ArgCys: 0.237 ± 0.126
ArgAsp: 2.374 ± 0.359
ArgGlu: 2.216 ± 0.468
ArgPhe: 2.216 ± 0.362
ArgGly: 1.662 ± 0.305
ArgHis: 0.95 ± 0.367
ArgIle: 4.115 ± 0.538
ArgLys: 3.798 ± 0.73
ArgLeu: 4.194 ± 0.547
ArgMet: 1.662 ± 0.312
ArgAsn: 1.82 ± 0.364
ArgPro: 0.712 ± 0.264
ArgGln: 1.899 ± 0.358
ArgArg: 1.741 ± 0.314
ArgSer: 1.899 ± 0.387
ArgThr: 2.057 ± 0.477
ArgVal: 2.532 ± 0.444
ArgTrp: 0.396 ± 0.18
ArgTyr: 2.295 ± 0.386
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.482 ± 0.586
SerCys: 0.079 ± 0.083
SerAsp: 3.561 ± 0.429
SerGlu: 4.906 ± 0.668
SerPhe: 1.978 ± 0.533
SerGly: 4.036 ± 0.465
SerHis: 1.662 ± 0.444
SerIle: 5.064 ± 0.753
SerLys: 5.698 ± 0.646
SerLeu: 6.093 ± 0.661
SerMet: 1.424 ± 0.302
SerAsn: 4.59 ± 0.752
SerPro: 1.424 ± 0.276
SerGln: 1.82 ± 0.333
SerArg: 3.403 ± 0.464
SerSer: 3.64 ± 0.726
SerThr: 4.194 ± 0.568
SerVal: 3.957 ± 0.578
SerTrp: 0.237 ± 0.161
SerTyr: 2.77 ± 0.49
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.957 ± 0.517
ThrCys: 0.079 ± 0.089
ThrAsp: 3.798 ± 0.661
ThrGlu: 3.798 ± 0.554
ThrPhe: 2.532 ± 0.507
ThrGly: 3.165 ± 0.52
ThrHis: 1.662 ± 0.308
ThrIle: 5.064 ± 0.683
ThrLys: 5.618 ± 0.675
ThrLeu: 5.777 ± 0.705
ThrMet: 0.791 ± 0.236
ThrAsn: 2.453 ± 0.396
ThrPro: 2.374 ± 0.473
ThrGln: 2.295 ± 0.357
ThrArg: 2.453 ± 0.442
ThrSer: 3.957 ± 0.644
ThrThr: 4.748 ± 0.656
ThrVal: 4.431 ± 0.752
ThrTrp: 0.633 ± 0.229
ThrTyr: 2.77 ± 0.503
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.115 ± 0.537
ValCys: 0.237 ± 0.148
ValAsp: 4.511 ± 0.495
ValGlu: 5.223 ± 0.654
ValPhe: 3.086 ± 0.547
ValGly: 3.324 ± 0.585
ValHis: 0.633 ± 0.2
ValIle: 4.985 ± 0.669
ValLys: 5.223 ± 0.532
ValLeu: 4.431 ± 0.614
ValMet: 1.108 ± 0.33
ValAsn: 4.352 ± 0.571
ValPro: 1.82 ± 0.444
ValGln: 2.137 ± 0.345
ValArg: 1.82 ± 0.326
ValSer: 5.856 ± 0.614
ValThr: 3.64 ± 0.631
ValVal: 3.719 ± 0.618
ValTrp: 0.712 ± 0.256
ValTyr: 3.007 ± 0.488
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.87 ± 0.309
TrpCys: 0.158 ± 0.115
TrpAsp: 0.554 ± 0.174
TrpGlu: 0.237 ± 0.118
TrpPhe: 0.554 ± 0.195
TrpGly: 0.396 ± 0.161
TrpHis: 0.237 ± 0.124
TrpIle: 0.475 ± 0.211
TrpLys: 0.158 ± 0.099
TrpLeu: 0.87 ± 0.231
TrpMet: 0.237 ± 0.148
TrpAsn: 1.266 ± 0.629
TrpPro: 0.158 ± 0.115
TrpGln: 0.475 ± 0.245
TrpArg: 0.396 ± 0.185
TrpSer: 1.029 ± 0.295
TrpThr: 1.029 ± 0.305
TrpVal: 0.554 ± 0.224
TrpTrp: 0.158 ± 0.131
TrpTyr: 0.475 ± 0.173
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.295 ± 0.453
TyrCys: 0.317 ± 0.247
TyrAsp: 3.482 ± 0.583
TyrGlu: 3.561 ± 0.596
TyrPhe: 2.137 ± 0.442
TyrGly: 2.691 ± 0.435
TyrHis: 0.87 ± 0.308
TyrIle: 2.532 ± 0.548
TyrLys: 4.59 ± 0.665
TyrLeu: 2.849 ± 0.547
TyrMet: 1.266 ± 0.303
TyrAsn: 2.77 ± 0.426
TyrPro: 1.424 ± 0.397
TyrGln: 1.82 ± 0.415
TyrArg: 1.899 ± 0.426
TyrSer: 2.849 ± 0.469
TyrThr: 2.453 ± 0.515
TyrVal: 2.691 ± 0.452
TyrTrp: 0.87 ± 0.278
TyrTyr: 1.82 ± 0.397
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 74 proteins (12638 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
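One way such an error estimate could be obtained is sketched below. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those 100 values is reported as the error. The function names, the use of the standard deviation, the one-letter codes, and the toy sequences are all hypothetical.

```python
import random
import statistics


def count_pair(seq, dipeptide):
    """Count overlapping occurrences of a two-letter dipeptide in one sequence."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == dipeptide)


def frequency_per_mille(sequences, dipeptide):
    """Dipeptide frequency in per mille over all windows in the given proteins."""
    total = sum(max(len(s) - 1, 0) for s in sequences)
    hits = sum(count_pair(s, dipeptide) for s in sequences)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(frequency_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy input, not real proteome data.
toy = ["MKKLLA", "MKAKK", "ALLKM", "KKKKA"]
kk_error = bootstrap_error(toy, "KK")
```

Resampling at the protein level, rather than at the level of individual dipeptides, keeps each protein's internal composition intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.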

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski