Amino acid dipepetide frequency for Staphylococcus virus 80alpha

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.56AlaAla: 1.56 ± 0.484
0.355AlaCys: 0.355 ± 0.128
2.836AlaAsp: 2.836 ± 0.43
3.687AlaGlu: 3.687 ± 0.518
2.482AlaPhe: 2.482 ± 0.438
3.262AlaGly: 3.262 ± 0.572
1.064AlaHis: 1.064 ± 0.318
4.822AlaIle: 4.822 ± 1.03
4.964AlaLys: 4.964 ± 0.597
4.113AlaLeu: 4.113 ± 0.635
1.631AlaMet: 1.631 ± 0.496
3.617AlaAsn: 3.617 ± 0.52
1.56AlaPro: 1.56 ± 0.316
1.915AlaGln: 1.915 ± 0.372
2.766AlaArg: 2.766 ± 0.521
3.829AlaSer: 3.829 ± 0.581
3.829AlaThr: 3.829 ± 0.545
3.333AlaVal: 3.333 ± 0.568
0.922AlaTrp: 0.922 ± 0.297
2.34AlaTyr: 2.34 ± 0.396
0.0AlaXaa: 0.0 ± 0.0
Cys
0.071CysAla: 0.071 ± 0.08
0.0CysCys: 0.0 ± 0.0
0.142CysAsp: 0.142 ± 0.151
0.425CysGlu: 0.425 ± 0.18
0.496CysPhe: 0.496 ± 0.212
0.355CysGly: 0.355 ± 0.14
0.0CysHis: 0.0 ± 0.0
0.425CysIle: 0.425 ± 0.174
0.567CysLys: 0.567 ± 0.184
0.638CysLeu: 0.638 ± 0.263
0.071CysMet: 0.071 ± 0.082
0.709CysAsn: 0.709 ± 0.275
0.213CysPro: 0.213 ± 0.118
0.213CysGln: 0.213 ± 0.114
0.355CysArg: 0.355 ± 0.154
0.496CysSer: 0.496 ± 0.222
0.355CysThr: 0.355 ± 0.147
0.213CysVal: 0.213 ± 0.126
0.071CysTrp: 0.071 ± 0.061
0.355CysTyr: 0.355 ± 0.172
0.0CysXaa: 0.0 ± 0.0
Asp
3.475AspAla: 3.475 ± 0.615
0.567AspCys: 0.567 ± 0.199
4.538AspAsp: 4.538 ± 0.889
5.389AspGlu: 5.389 ± 0.973
3.475AspPhe: 3.475 ± 0.606
3.333AspGly: 3.333 ± 0.573
0.355AspHis: 0.355 ± 0.186
5.318AspIle: 5.318 ± 0.715
6.028AspLys: 6.028 ± 0.701
4.893AspLeu: 4.893 ± 0.741
2.127AspMet: 2.127 ± 0.485
4.326AspAsn: 4.326 ± 0.575
1.276AspPro: 1.276 ± 0.274
0.851AspGln: 0.851 ± 0.264
2.34AspArg: 2.34 ± 0.386
3.687AspSer: 3.687 ± 0.571
3.404AspThr: 3.404 ± 0.579
3.829AspVal: 3.829 ± 0.584
0.709AspTrp: 0.709 ± 0.193
2.482AspTyr: 2.482 ± 0.513
0.0AspXaa: 0.0 ± 0.0
Glu
5.247GluAla: 5.247 ± 0.741
0.709GluCys: 0.709 ± 0.294
3.333GluAsp: 3.333 ± 0.633
5.602GluGlu: 5.602 ± 0.847
3.546GluPhe: 3.546 ± 0.66
3.049GluGly: 3.049 ± 0.452
1.418GluHis: 1.418 ± 0.339
5.957GluIle: 5.957 ± 0.897
5.744GluLys: 5.744 ± 0.813
6.595GluLeu: 6.595 ± 0.67
2.056GluMet: 2.056 ± 0.413
4.68GluAsn: 4.68 ± 0.594
1.986GluPro: 1.986 ± 0.292
3.758GluGln: 3.758 ± 0.573
3.971GluArg: 3.971 ± 0.645
3.758GluSer: 3.758 ± 0.646
3.191GluThr: 3.191 ± 0.481
4.964GluVal: 4.964 ± 0.709
0.709GluTrp: 0.709 ± 0.214
5.247GluTyr: 5.247 ± 0.71
0.0GluXaa: 0.0 ± 0.0
Phe
1.773PheAla: 1.773 ± 0.307
0.355PheCys: 0.355 ± 0.14
3.191PheAsp: 3.191 ± 0.481
3.687PheGlu: 3.687 ± 0.443
1.418PhePhe: 1.418 ± 0.307
2.624PheGly: 2.624 ± 0.456
0.496PheHis: 0.496 ± 0.222
3.262PheIle: 3.262 ± 0.374
4.042PheLys: 4.042 ± 0.463
2.695PheLeu: 2.695 ± 0.371
0.993PheMet: 0.993 ± 0.212
3.333PheAsn: 3.333 ± 0.476
0.851PhePro: 0.851 ± 0.291
1.135PheGln: 1.135 ± 0.404
1.489PheArg: 1.489 ± 0.361
2.907PheSer: 2.907 ± 0.455
3.12PheThr: 3.12 ± 0.38
2.695PheVal: 2.695 ± 0.53
0.142PheTrp: 0.142 ± 0.098
2.127PheTyr: 2.127 ± 0.324
0.0PheXaa: 0.0 ± 0.0
Gly
2.695GlyAla: 2.695 ± 0.494
0.284GlyCys: 0.284 ± 0.148
3.333GlyAsp: 3.333 ± 0.499
3.333GlyGlu: 3.333 ± 0.496
2.056GlyPhe: 2.056 ± 0.389
2.907GlyGly: 2.907 ± 0.526
1.489GlyHis: 1.489 ± 0.413
4.538GlyIle: 4.538 ± 0.583
4.751GlyLys: 4.751 ± 0.602
5.106GlyLeu: 5.106 ± 0.589
1.56GlyMet: 1.56 ± 0.336
3.475GlyAsn: 3.475 ± 0.537
0.78GlyPro: 0.78 ± 0.295
1.773GlyGln: 1.773 ± 0.369
2.411GlyArg: 2.411 ± 0.369
2.553GlySer: 2.553 ± 0.428
3.333GlyThr: 3.333 ± 0.562
4.042GlyVal: 4.042 ± 0.513
0.922GlyTrp: 0.922 ± 0.312
2.836GlyTyr: 2.836 ± 0.432
0.0GlyXaa: 0.0 ± 0.0
His
1.206HisAla: 1.206 ± 0.363
0.142HisCys: 0.142 ± 0.098
1.064HisAsp: 1.064 ± 0.268
1.418HisGlu: 1.418 ± 0.341
0.78HisPhe: 0.78 ± 0.231
1.489HisGly: 1.489 ± 0.301
0.425HisHis: 0.425 ± 0.159
1.135HisIle: 1.135 ± 0.311
1.064HisLys: 1.064 ± 0.324
1.135HisLeu: 1.135 ± 0.284
0.355HisMet: 0.355 ± 0.152
1.064HisAsn: 1.064 ± 0.279
0.567HisPro: 0.567 ± 0.187
0.638HisGln: 0.638 ± 0.207
0.284HisArg: 0.284 ± 0.126
1.206HisSer: 1.206 ± 0.281
0.851HisThr: 0.851 ± 0.251
1.347HisVal: 1.347 ± 0.313
0.0HisTrp: 0.0 ± 0.0
0.922HisTyr: 0.922 ± 0.346
0.0HisXaa: 0.0 ± 0.0
Ile
5.035IleAla: 5.035 ± 0.683
0.213IleCys: 0.213 ± 0.099
5.673IleAsp: 5.673 ± 0.588
6.311IleGlu: 6.311 ± 0.717
2.553IlePhe: 2.553 ± 0.5
3.546IleGly: 3.546 ± 0.522
1.418IleHis: 1.418 ± 0.273
4.326IleIle: 4.326 ± 0.587
8.439IleLys: 8.439 ± 0.732
4.822IleLeu: 4.822 ± 0.695
1.915IleMet: 1.915 ± 0.392
4.538IleAsn: 4.538 ± 0.545
2.269IlePro: 2.269 ± 0.309
2.553IleGln: 2.553 ± 0.463
2.907IleArg: 2.907 ± 0.477
4.822IleSer: 4.822 ± 0.646
5.106IleThr: 5.106 ± 0.759
3.758IleVal: 3.758 ± 0.579
1.276IleTrp: 1.276 ± 0.604
3.687IleTyr: 3.687 ± 0.513
0.0IleXaa: 0.0 ± 0.0
Lys
5.744LysAla: 5.744 ± 0.593
0.213LysCys: 0.213 ± 0.144
5.815LysAsp: 5.815 ± 0.667
7.658LysGlu: 7.658 ± 0.868
3.049LysPhe: 3.049 ± 0.504
5.035LysGly: 5.035 ± 0.691
1.56LysHis: 1.56 ± 0.369
6.666LysIle: 6.666 ± 0.739
8.864LysLys: 8.864 ± 0.888
7.304LysLeu: 7.304 ± 0.746
1.844LysMet: 1.844 ± 0.338
6.595LysAsn: 6.595 ± 0.741
2.056LysPro: 2.056 ± 0.428
4.751LysGln: 4.751 ± 0.542
3.829LysArg: 3.829 ± 0.603
4.964LysSer: 4.964 ± 0.527
5.815LysThr: 5.815 ± 0.74
6.311LysVal: 6.311 ± 0.587
0.567LysTrp: 0.567 ± 0.189
4.467LysTyr: 4.467 ± 0.726
0.0LysXaa: 0.0 ± 0.0
Leu
3.333LeuAla: 3.333 ± 0.456
0.496LeuCys: 0.496 ± 0.216
4.255LeuAsp: 4.255 ± 0.631
6.098LeuGlu: 6.098 ± 0.694
3.262LeuPhe: 3.262 ± 0.525
3.829LeuGly: 3.829 ± 0.494
0.993LeuHis: 0.993 ± 0.227
4.467LeuIle: 4.467 ± 0.494
7.658LeuLys: 7.658 ± 0.725
6.098LeuLeu: 6.098 ± 0.686
1.773LeuMet: 1.773 ± 0.33
5.673LeuAsn: 5.673 ± 0.54
2.269LeuPro: 2.269 ± 0.376
3.262LeuGln: 3.262 ± 0.541
3.758LeuArg: 3.758 ± 0.718
5.035LeuSer: 5.035 ± 0.584
4.467LeuThr: 4.467 ± 0.466
4.964LeuVal: 4.964 ± 0.773
0.638LeuTrp: 0.638 ± 0.261
3.9LeuTyr: 3.9 ± 0.56
0.0LeuXaa: 0.0 ± 0.0
Met
1.489MetAla: 1.489 ± 0.313
0.142MetCys: 0.142 ± 0.091
1.206MetAsp: 1.206 ± 0.277
2.056MetGlu: 2.056 ± 0.462
0.78MetPhe: 0.78 ± 0.198
1.064MetGly: 1.064 ± 0.267
0.425MetHis: 0.425 ± 0.202
1.56MetIle: 1.56 ± 0.272
1.844MetLys: 1.844 ± 0.395
2.907MetLeu: 2.907 ± 0.449
0.922MetMet: 0.922 ± 0.254
1.844MetAsn: 1.844 ± 0.426
1.135MetPro: 1.135 ± 0.297
1.418MetGln: 1.418 ± 0.385
1.206MetArg: 1.206 ± 0.342
1.418MetSer: 1.418 ± 0.314
2.695MetThr: 2.695 ± 0.504
0.922MetVal: 0.922 ± 0.208
0.355MetTrp: 0.355 ± 0.176
1.206MetTyr: 1.206 ± 0.298
0.0MetXaa: 0.0 ± 0.0
Asn
4.68AsnAla: 4.68 ± 0.838
0.638AsnCys: 0.638 ± 0.266
5.46AsnAsp: 5.46 ± 0.647
5.247AsnGlu: 5.247 ± 0.659
2.695AsnPhe: 2.695 ± 0.567
4.538AsnGly: 4.538 ± 0.662
1.135AsnHis: 1.135 ± 0.248
4.538AsnIle: 4.538 ± 0.534
7.02AsnLys: 7.02 ± 0.828
4.68AsnLeu: 4.68 ± 0.564
1.773AsnMet: 1.773 ± 0.329
4.538AsnAsn: 4.538 ± 0.846
2.482AsnPro: 2.482 ± 0.425
2.198AsnGln: 2.198 ± 0.33
2.624AsnArg: 2.624 ± 0.327
3.546AsnSer: 3.546 ± 0.534
4.042AsnThr: 4.042 ± 0.488
4.68AsnVal: 4.68 ± 0.667
0.709AsnTrp: 0.709 ± 0.19
2.907AsnTyr: 2.907 ± 0.584
0.0AsnXaa: 0.0 ± 0.0
Pro
1.276ProAla: 1.276 ± 0.273
0.213ProCys: 0.213 ± 0.105
1.56ProAsp: 1.56 ± 0.319
1.986ProGlu: 1.986 ± 0.364
1.206ProPhe: 1.206 ± 0.325
1.489ProGly: 1.489 ± 0.404
0.425ProHis: 0.425 ± 0.173
2.198ProIle: 2.198 ± 0.413
2.695ProLys: 2.695 ± 0.418
1.135ProLeu: 1.135 ± 0.284
0.993ProMet: 0.993 ± 0.229
2.553ProAsn: 2.553 ± 0.363
0.567ProPro: 0.567 ± 0.213
1.276ProGln: 1.276 ± 0.313
0.993ProArg: 0.993 ± 0.291
1.347ProSer: 1.347 ± 0.282
1.915ProThr: 1.915 ± 0.39
1.986ProVal: 1.986 ± 0.375
0.142ProTrp: 0.142 ± 0.102
1.206ProTyr: 1.206 ± 0.395
0.0ProXaa: 0.0 ± 0.0
Gln
2.269GlnAla: 2.269 ± 0.418
0.284GlnCys: 0.284 ± 0.158
1.986GlnAsp: 1.986 ± 0.375
2.269GlnGlu: 2.269 ± 0.491
1.986GlnPhe: 1.986 ± 0.393
2.553GlnGly: 2.553 ± 0.297
0.567GlnHis: 0.567 ± 0.165
2.34GlnIle: 2.34 ± 0.313
3.262GlnLys: 3.262 ± 0.476
2.553GlnLeu: 2.553 ± 0.515
1.489GlnMet: 1.489 ± 0.338
1.986GlnAsn: 1.986 ± 0.277
1.347GlnPro: 1.347 ± 0.289
1.56GlnGln: 1.56 ± 0.394
1.915GlnArg: 1.915 ± 0.371
2.198GlnSer: 2.198 ± 0.393
2.34GlnThr: 2.34 ± 0.359
2.056GlnVal: 2.056 ± 0.392
0.425GlnTrp: 0.425 ± 0.183
1.135GlnTyr: 1.135 ± 0.296
0.0GlnXaa: 0.0 ± 0.0
Arg
1.56ArgAla: 1.56 ± 0.293
0.496ArgCys: 0.496 ± 0.158
2.056ArgAsp: 2.056 ± 0.45
2.766ArgGlu: 2.766 ± 0.472
1.986ArgPhe: 1.986 ± 0.431
2.127ArgGly: 2.127 ± 0.454
1.064ArgHis: 1.064 ± 0.282
3.687ArgIle: 3.687 ± 0.567
4.397ArgLys: 4.397 ± 0.655
3.971ArgLeu: 3.971 ± 0.466
0.78ArgMet: 0.78 ± 0.239
2.978ArgAsn: 2.978 ± 0.556
0.851ArgPro: 0.851 ± 0.236
1.489ArgGln: 1.489 ± 0.308
1.631ArgArg: 1.631 ± 0.397
2.269ArgSer: 2.269 ± 0.342
2.198ArgThr: 2.198 ± 0.411
2.482ArgVal: 2.482 ± 0.486
0.567ArgTrp: 0.567 ± 0.192
2.482ArgTyr: 2.482 ± 0.563
0.0ArgXaa: 0.0 ± 0.0
Ser
3.971SerAla: 3.971 ± 0.69
0.071SerCys: 0.071 ± 0.072
4.467SerAsp: 4.467 ± 0.619
2.907SerGlu: 2.907 ± 0.393
2.836SerPhe: 2.836 ± 0.499
3.758SerGly: 3.758 ± 0.674
1.064SerHis: 1.064 ± 0.33
5.247SerIle: 5.247 ± 0.563
4.893SerLys: 4.893 ± 0.527
4.255SerLeu: 4.255 ± 0.498
1.489SerMet: 1.489 ± 0.356
3.9SerAsn: 3.9 ± 0.485
1.418SerPro: 1.418 ± 0.368
2.127SerGln: 2.127 ± 0.373
2.766SerArg: 2.766 ± 0.319
3.404SerSer: 3.404 ± 0.479
3.9SerThr: 3.9 ± 0.424
3.191SerVal: 3.191 ± 0.511
0.284SerTrp: 0.284 ± 0.13
2.836SerTyr: 2.836 ± 0.549
0.0SerXaa: 0.0 ± 0.0
Thr
3.333ThrAla: 3.333 ± 0.644
0.071ThrCys: 0.071 ± 0.067
3.546ThrAsp: 3.546 ± 0.565
4.68ThrGlu: 4.68 ± 0.772
3.049ThrPhe: 3.049 ± 0.439
3.758ThrGly: 3.758 ± 0.567
1.064ThrHis: 1.064 ± 0.279
5.106ThrIle: 5.106 ± 1.133
5.389ThrLys: 5.389 ± 0.56
4.538ThrLeu: 4.538 ± 0.453
1.064ThrMet: 1.064 ± 0.337
4.397ThrAsn: 4.397 ± 0.67
2.056ThrPro: 2.056 ± 0.372
2.482ThrGln: 2.482 ± 0.475
2.198ThrArg: 2.198 ± 0.371
4.042ThrSer: 4.042 ± 0.621
3.9ThrThr: 3.9 ± 0.657
4.964ThrVal: 4.964 ± 0.621
0.496ThrTrp: 0.496 ± 0.203
1.915ThrTyr: 1.915 ± 0.351
0.0ThrXaa: 0.0 ± 0.0
Val
3.475ValAla: 3.475 ± 0.952
0.355ValCys: 0.355 ± 0.149
4.68ValAsp: 4.68 ± 0.561
5.318ValGlu: 5.318 ± 0.68
2.269ValPhe: 2.269 ± 0.515
2.553ValGly: 2.553 ± 0.418
0.993ValHis: 0.993 ± 0.247
5.389ValIle: 5.389 ± 0.628
5.886ValLys: 5.886 ± 0.576
5.035ValLeu: 5.035 ± 0.57
2.269ValMet: 2.269 ± 0.399
4.964ValAsn: 4.964 ± 0.567
2.127ValPro: 2.127 ± 0.494
1.206ValGln: 1.206 ± 0.259
1.986ValArg: 1.986 ± 0.399
3.546ValSer: 3.546 ± 0.587
3.475ValThr: 3.475 ± 0.451
3.404ValVal: 3.404 ± 0.485
0.993ValTrp: 0.993 ± 0.344
2.34ValTyr: 2.34 ± 0.472
0.0ValXaa: 0.0 ± 0.0
Trp
0.851TrpAla: 0.851 ± 0.301
0.071TrpCys: 0.071 ± 0.061
0.355TrpAsp: 0.355 ± 0.17
0.567TrpGlu: 0.567 ± 0.195
0.284TrpPhe: 0.284 ± 0.137
0.425TrpGly: 0.425 ± 0.264
0.142TrpHis: 0.142 ± 0.099
0.709TrpIle: 0.709 ± 0.203
0.709TrpLys: 0.709 ± 0.223
0.638TrpLeu: 0.638 ± 0.218
0.213TrpMet: 0.213 ± 0.121
1.702TrpAsn: 1.702 ± 0.966
0.142TrpPro: 0.142 ± 0.099
0.567TrpGln: 0.567 ± 0.207
0.213TrpArg: 0.213 ± 0.128
0.851TrpSer: 0.851 ± 0.302
1.135TrpThr: 1.135 ± 0.295
0.709TrpVal: 0.709 ± 0.236
0.0TrpTrp: 0.0 ± 0.0
0.567TrpTyr: 0.567 ± 0.209
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.844TyrAla: 1.844 ± 0.294
0.496TyrCys: 0.496 ± 0.224
2.907TyrAsp: 2.907 ± 0.686
4.113TyrGlu: 4.113 ± 0.67
2.127TyrPhe: 2.127 ± 0.412
2.482TyrGly: 2.482 ± 0.62
0.993TyrHis: 0.993 ± 0.316
3.475TyrIle: 3.475 ± 0.471
4.964TyrLys: 4.964 ± 0.65
3.12TyrLeu: 3.12 ± 0.559
1.206TyrMet: 1.206 ± 0.342
3.262TyrAsn: 3.262 ± 0.56
1.206TyrPro: 1.206 ± 0.322
1.347TyrGln: 1.347 ± 0.285
2.198TyrArg: 2.198 ± 0.428
2.907TyrSer: 2.907 ± 0.48
2.978TyrThr: 2.978 ± 0.38
2.411TyrVal: 2.411 ± 0.391
0.922TyrTrp: 0.922 ± 0.257
1.773TyrTyr: 1.773 ± 0.288
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (14103 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski