Amino acid dipeptide frequency for Staphylococcus virus phiNM2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
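As an illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is a minimal example, not the database's actual pipeline: the function name `dipeptide_permille`, the one-letter input sequences, the mapping of any non-standard letter to `X`, and the normalization by the total number of counted pairs are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences.

    Assumption: frequencies are normalized by the total number of
    dipeptides counted; the database may use a different denominator.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Two toy sequences (hypothetical, not taken from the phiNM2 proteome).
freqs = dipeptide_permille(["MKKLLE", "MKXA"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

Counting within each protein separately avoids creating artificial pairs at the boundary between two unrelated sequences.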

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
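The unit conversion and the comparison with the uniform expectation can be sketched as follows. The four sample values are copied from the table on this page; the function name `classify` and the simple greater-than/less-than rule are assumptions for illustration, since the page does not state whether the colouring also takes the error estimate into account.

```python
# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
N_AMINO_ACIDS = 20
EXPECTED_PERMILLE = 1000.0 / N_AMINO_ACIDS ** 2


def permille_to_fraction(permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return permille * 1e-3


def classify(permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if permille > EXPECTED_PERMILLE:
        return "over"    # shown in red on the original page
    if permille < EXPECTED_PERMILLE:
        return "under"   # shown in blue on the original page
    return "expected"


# Sample values taken from the table on this page.
sample = {"LysLys": 8.49, "GlyPhe": 2.51, "ValGly": 2.362, "CysCys": 0.0}
for pair, value in sample.items():
    print(pair, permille_to_fraction(value), classify(value))
```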

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first amino acid. Within each group the second amino acid runs in the order: Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.

Ala
AlaAla: 1.329 ± 0.452
AlaCys: 0.369 ± 0.132
AlaAsp: 2.879 ± 0.44
AlaGlu: 4.134 ± 0.547
AlaPhe: 3.027 ± 0.441
AlaGly: 3.101 ± 0.44
AlaHis: 1.107 ± 0.273
AlaIle: 4.798 ± 0.998
AlaLys: 4.356 ± 0.752
AlaLeu: 4.208 ± 0.646
AlaMet: 1.698 ± 0.427
AlaAsn: 3.47 ± 0.495
AlaPro: 2.067 ± 0.383
AlaGln: 2.436 ± 0.454
AlaArg: 2.879 ± 0.487
AlaSer: 3.839 ± 0.604
AlaThr: 4.356 ± 0.637
AlaVal: 3.47 ± 0.541
AlaTrp: 0.812 ± 0.261
AlaTyr: 2.879 ± 0.45
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.148 ± 0.113
CysCys: 0.0 ± 0.0
CysAsp: 0.221 ± 0.157
CysGlu: 0.517 ± 0.216
CysPhe: 0.295 ± 0.134
CysGly: 0.221 ± 0.143
CysHis: 0.074 ± 0.074
CysIle: 0.517 ± 0.173
CysLys: 0.443 ± 0.176
CysLeu: 0.295 ± 0.176
CysMet: 0.074 ± 0.093
CysAsn: 0.443 ± 0.246
CysPro: 0.369 ± 0.186
CysGln: 0.221 ± 0.126
CysArg: 0.369 ± 0.181
CysSer: 0.517 ± 0.201
CysThr: 0.221 ± 0.12
CysVal: 0.221 ± 0.135
CysTrp: 0.074 ± 0.068
CysTyr: 0.221 ± 0.135
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.248 ± 0.496
AspCys: 0.369 ± 0.145
AspAsp: 4.282 ± 0.761
AspGlu: 5.98 ± 0.893
AspPhe: 3.913 ± 0.609
AspGly: 4.06 ± 0.452
AspHis: 0.369 ± 0.247
AspIle: 5.684 ± 0.752
AspLys: 5.315 ± 0.713
AspLeu: 4.946 ± 0.616
AspMet: 2.067 ± 0.36
AspAsn: 4.134 ± 0.537
AspPro: 1.107 ± 0.29
AspGln: 1.107 ± 0.288
AspArg: 2.362 ± 0.384
AspSer: 3.839 ± 0.546
AspThr: 3.986 ± 0.558
AspVal: 3.986 ± 0.551
AspTrp: 0.591 ± 0.171
AspTyr: 2.51 ± 0.538
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.537 ± 0.815
GluCys: 0.517 ± 0.183
GluAsp: 4.134 ± 0.671
GluGlu: 6.201 ± 0.983
GluPhe: 3.543 ± 0.618
GluGly: 3.396 ± 0.473
GluHis: 1.034 ± 0.297
GluIle: 6.349 ± 0.836
GluLys: 5.537 ± 0.914
GluLeu: 7.087 ± 0.775
GluMet: 2.658 ± 0.531
GluAsn: 4.798 ± 0.648
GluPro: 2.067 ± 0.363
GluGln: 4.208 ± 0.58
GluArg: 3.47 ± 0.676
GluSer: 4.134 ± 0.586
GluThr: 3.396 ± 0.52
GluVal: 4.725 ± 0.631
GluTrp: 0.591 ± 0.185
GluTyr: 5.463 ± 0.795
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.141 ± 0.32
PheCys: 0.443 ± 0.182
PheAsp: 3.027 ± 0.444
PheGlu: 3.986 ± 0.63
PhePhe: 1.403 ± 0.307
PheGly: 2.879 ± 0.438
PheHis: 0.738 ± 0.249
PheIle: 2.879 ± 0.44
PheLys: 3.986 ± 0.526
PheLeu: 2.805 ± 0.459
PheMet: 1.034 ± 0.231
PheAsn: 3.322 ± 0.413
PhePro: 0.812 ± 0.293
PheGln: 1.55 ± 0.391
PheArg: 1.476 ± 0.319
PheSer: 2.879 ± 0.438
PheThr: 2.953 ± 0.442
PheVal: 2.879 ± 0.541
PheTrp: 0.148 ± 0.121
PheTyr: 1.919 ± 0.359
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 3.174 ± 0.39
GlyCys: 0.295 ± 0.148
GlyAsp: 3.839 ± 0.442
GlyGlu: 3.322 ± 0.57
GlyPhe: 2.51 ± 0.449
GlyGly: 2.805 ± 0.511
GlyHis: 1.476 ± 0.475
GlyIle: 4.503 ± 0.544
GlyLys: 5.389 ± 0.594
GlyLeu: 5.389 ± 0.623
GlyMet: 1.403 ± 0.37
GlyAsn: 2.953 ± 0.442
GlyPro: 0.738 ± 0.292
GlyGln: 1.403 ± 0.354
GlyArg: 1.919 ± 0.384
GlySer: 2.51 ± 0.48
GlyThr: 3.617 ± 0.542
GlyVal: 4.503 ± 0.628
GlyTrp: 1.107 ± 0.476
GlyTyr: 2.731 ± 0.496
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.255 ± 0.388
HisCys: 0.074 ± 0.075
HisAsp: 1.034 ± 0.25
HisGlu: 1.255 ± 0.336
HisPhe: 0.738 ± 0.212
HisGly: 1.034 ± 0.303
HisHis: 0.517 ± 0.185
HisIle: 1.034 ± 0.331
HisLys: 0.664 ± 0.22
HisLeu: 1.255 ± 0.265
HisMet: 0.295 ± 0.156
HisAsn: 1.181 ± 0.3
HisPro: 0.591 ± 0.15
HisGln: 0.517 ± 0.189
HisArg: 0.369 ± 0.175
HisSer: 1.181 ± 0.25
HisThr: 0.886 ± 0.293
HisVal: 1.403 ± 0.336
HisTrp: 0.074 ± 0.066
HisTyr: 0.96 ± 0.391
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.168 ± 0.658
IleCys: 0.148 ± 0.109
IleAsp: 6.275 ± 0.642
IleGlu: 6.644 ± 0.79
IlePhe: 2.436 ± 0.381
IleGly: 3.839 ± 0.518
IleHis: 1.329 ± 0.324
IleIle: 4.429 ± 0.632
IleLys: 7.825 ± 0.778
IleLeu: 4.356 ± 0.509
IleMet: 1.919 ± 0.371
IleAsn: 4.282 ± 0.458
IlePro: 2.362 ± 0.357
IleGln: 2.362 ± 0.476
IleArg: 2.805 ± 0.488
IleSer: 3.986 ± 0.56
IleThr: 4.503 ± 0.793
IleVal: 4.208 ± 0.52
IleTrp: 1.329 ± 0.676
IleTyr: 3.765 ± 0.614
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.537 ± 0.599
LysCys: 0.443 ± 0.209
LysAsp: 6.423 ± 0.826
LysGlu: 8.342 ± 1.077
LysPhe: 3.174 ± 0.587
LysGly: 5.094 ± 0.742
LysHis: 1.772 ± 0.411
LysIle: 5.98 ± 0.634
LysLys: 8.49 ± 0.894
LysLeu: 7.678 ± 0.88
LysMet: 1.55 ± 0.342
LysAsn: 5.684 ± 0.72
LysPro: 2.51 ± 0.488
LysGln: 4.798 ± 0.626
LysArg: 4.06 ± 0.578
LysSer: 4.798 ± 0.576
LysThr: 5.168 ± 0.652
LysVal: 6.423 ± 0.623
LysTrp: 0.664 ± 0.211
LysTyr: 4.208 ± 0.687
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 3.174 ± 0.439
LeuCys: 0.443 ± 0.244
LeuAsp: 4.134 ± 0.536
LeuGlu: 5.315 ± 0.654
LeuPhe: 3.248 ± 0.42
LeuGly: 3.913 ± 0.495
LeuHis: 0.886 ± 0.291
LeuIle: 4.651 ± 0.496
LeuLys: 8.49 ± 0.714
LeuLeu: 5.906 ± 0.591
LeuMet: 1.55 ± 0.323
LeuAsn: 5.906 ± 0.558
LeuPro: 2.436 ± 0.429
LeuGln: 3.543 ± 0.506
LeuArg: 3.765 ± 0.651
LeuSer: 4.725 ± 0.623
LeuThr: 5.168 ± 0.629
LeuVal: 4.725 ± 0.774
LeuTrp: 0.812 ± 0.249
LeuTyr: 3.248 ± 0.538
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.403 ± 0.305
MetCys: 0.221 ± 0.136
MetAsp: 1.255 ± 0.251
MetGlu: 1.772 ± 0.374
MetPhe: 0.96 ± 0.251
MetGly: 1.034 ± 0.258
MetHis: 0.369 ± 0.214
MetIle: 1.107 ± 0.257
MetLys: 1.846 ± 0.377
MetLeu: 2.436 ± 0.353
MetMet: 0.738 ± 0.231
MetAsn: 1.624 ± 0.37
MetPro: 1.107 ± 0.351
MetGln: 1.624 ± 0.37
MetArg: 0.96 ± 0.307
MetSer: 1.55 ± 0.325
MetThr: 2.436 ± 0.559
MetVal: 0.96 ± 0.284
MetTrp: 0.443 ± 0.196
MetTyr: 1.034 ± 0.258
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 5.02 ± 0.753
AsnCys: 0.295 ± 0.164
AsnAsp: 4.577 ± 0.599
AsnGlu: 5.168 ± 0.717
AsnPhe: 2.953 ± 0.535
AsnGly: 5.094 ± 0.618
AsnHis: 1.107 ± 0.278
AsnIle: 3.839 ± 0.445
AsnLys: 6.201 ± 0.597
AsnLeu: 4.651 ± 0.64
AsnMet: 1.624 ± 0.317
AsnAsn: 4.06 ± 0.727
AsnPro: 2.658 ± 0.467
AsnGln: 2.51 ± 0.407
AsnArg: 2.731 ± 0.317
AsnSer: 2.658 ± 0.501
AsnThr: 4.282 ± 0.61
AsnVal: 3.691 ± 0.631
AsnTrp: 0.886 ± 0.232
AsnTyr: 2.805 ± 0.472
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.403 ± 0.281
ProCys: 0.295 ± 0.123
ProAsp: 1.476 ± 0.259
ProGlu: 1.698 ± 0.296
ProPhe: 1.329 ± 0.326
ProGly: 1.698 ± 0.433
ProHis: 0.369 ± 0.172
ProIle: 1.993 ± 0.35
ProLys: 3.248 ± 0.525
ProLeu: 1.476 ± 0.254
ProMet: 1.034 ± 0.318
ProAsn: 2.436 ± 0.42
ProPro: 0.738 ± 0.245
ProGln: 1.476 ± 0.322
ProArg: 1.107 ± 0.289
ProSer: 1.255 ± 0.315
ProThr: 2.067 ± 0.389
ProVal: 1.993 ± 0.461
ProTrp: 0.148 ± 0.117
ProTyr: 1.255 ± 0.336
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.658 ± 0.532
GlnCys: 0.295 ± 0.172
GlnAsp: 2.436 ± 0.442
GlnGlu: 2.805 ± 0.544
GlnPhe: 2.141 ± 0.468
GlnGly: 2.215 ± 0.306
GlnHis: 0.664 ± 0.187
GlnIle: 2.805 ± 0.36
GlnLys: 3.174 ± 0.426
GlnLeu: 3.322 ± 0.536
GlnMet: 1.107 ± 0.353
GlnAsn: 2.436 ± 0.399
GlnPro: 1.698 ± 0.438
GlnGln: 1.698 ± 0.511
GlnArg: 1.624 ± 0.414
GlnSer: 2.288 ± 0.328
GlnThr: 2.067 ± 0.405
GlnVal: 1.919 ± 0.373
GlnTrp: 0.369 ± 0.158
GlnTyr: 1.107 ± 0.262
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.772 ± 0.365
ArgCys: 0.369 ± 0.172
ArgAsp: 2.288 ± 0.578
ArgGlu: 2.731 ± 0.411
ArgPhe: 1.993 ± 0.452
ArgGly: 2.215 ± 0.409
ArgHis: 0.886 ± 0.261
ArgIle: 3.248 ± 0.455
ArgLys: 4.503 ± 0.715
ArgLeu: 4.06 ± 0.549
ArgMet: 0.443 ± 0.173
ArgAsn: 3.396 ± 0.519
ArgPro: 0.886 ± 0.239
ArgGln: 1.624 ± 0.304
ArgArg: 1.55 ± 0.405
ArgSer: 1.993 ± 0.372
ArgThr: 1.772 ± 0.338
ArgVal: 2.436 ± 0.429
ArgTrp: 0.369 ± 0.167
ArgTyr: 1.919 ± 0.477
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.839 ± 0.65
SerCys: 0.221 ± 0.168
SerAsp: 3.986 ± 0.677
SerGlu: 3.617 ± 0.506
SerPhe: 2.215 ± 0.431
SerGly: 3.47 ± 0.66
SerHis: 1.034 ± 0.322
SerIle: 5.98 ± 0.743
SerLys: 5.611 ± 0.675
SerLeu: 3.47 ± 0.47
SerMet: 1.624 ± 0.365
SerAsn: 4.282 ± 0.538
SerPro: 1.329 ± 0.369
SerGln: 1.476 ± 0.352
SerArg: 2.362 ± 0.317
SerSer: 3.322 ± 0.479
SerThr: 3.543 ± 0.372
SerVal: 3.027 ± 0.436
SerTrp: 0.221 ± 0.123
SerTyr: 1.993 ± 0.333
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.134 ± 0.614
ThrCys: 0.074 ± 0.058
ThrAsp: 3.617 ± 0.541
ThrGlu: 4.282 ± 0.526
ThrPhe: 2.51 ± 0.464
ThrGly: 3.765 ± 0.539
ThrHis: 0.886 ± 0.269
ThrIle: 5.389 ± 1.192
ThrLys: 5.832 ± 0.661
ThrLeu: 4.651 ± 0.581
ThrMet: 0.886 ± 0.282
ThrAsn: 3.765 ± 0.606
ThrPro: 1.772 ± 0.351
ThrGln: 2.805 ± 0.585
ThrArg: 2.51 ± 0.358
ThrSer: 4.429 ± 0.659
ThrThr: 3.396 ± 0.796
ThrVal: 4.356 ± 0.666
ThrTrp: 0.738 ± 0.233
ThrTyr: 1.846 ± 0.412
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.47 ± 0.936
ValCys: 0.369 ± 0.179
ValAsp: 4.798 ± 0.706
ValGlu: 6.127 ± 0.71
ValPhe: 2.362 ± 0.486
ValGly: 2.362 ± 0.517
ValHis: 0.738 ± 0.246
ValIle: 5.094 ± 0.601
ValLys: 6.053 ± 0.593
ValLeu: 4.725 ± 0.702
ValMet: 1.846 ± 0.344
ValAsn: 4.577 ± 0.587
ValPro: 1.846 ± 0.474
ValGln: 1.403 ± 0.282
ValArg: 1.846 ± 0.398
ValSer: 3.47 ± 0.554
ValThr: 3.839 ± 0.534
ValVal: 4.282 ± 0.56
ValTrp: 1.107 ± 0.351
ValTyr: 2.362 ± 0.454
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.886 ± 0.351
TrpCys: 0.074 ± 0.068
TrpAsp: 0.369 ± 0.163
TrpGlu: 0.738 ± 0.247
TrpPhe: 0.295 ± 0.159
TrpGly: 0.738 ± 0.397
TrpHis: 0.221 ± 0.117
TrpIle: 0.591 ± 0.212
TrpLys: 0.96 ± 0.265
TrpLeu: 0.591 ± 0.198
TrpMet: 0.148 ± 0.113
TrpAsn: 1.698 ± 0.906
TrpPro: 0.148 ± 0.118
TrpGln: 0.591 ± 0.19
TrpArg: 0.221 ± 0.12
TrpSer: 0.812 ± 0.266
TrpThr: 0.96 ± 0.298
TrpVal: 0.738 ± 0.268
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.591 ± 0.233
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.919 ± 0.341
TyrCys: 0.148 ± 0.113
TyrAsp: 2.805 ± 0.539
TyrGlu: 4.208 ± 0.688
TyrPhe: 2.215 ± 0.435
TyrGly: 2.658 ± 0.531
TyrHis: 0.738 ± 0.271
TyrIle: 3.101 ± 0.554
TyrLys: 4.872 ± 0.671
TyrLeu: 2.731 ± 0.481
TyrMet: 1.107 ± 0.311
TyrAsn: 2.362 ± 0.322
TyrPro: 1.255 ± 0.358
TyrGln: 1.476 ± 0.293
TyrArg: 2.067 ± 0.513
TyrSer: 2.51 ± 0.483
TyrThr: 3.027 ± 0.411
TyrVal: 2.731 ± 0.439
TyrTrp: 0.886 ± 0.245
TyrTyr: 2.067 ± 0.445
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 66 proteins (13547 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
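A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed on each resample, and the spread of the resulting estimates is reported. This is a minimal sketch under stated assumptions, not the database's actual code: the helper names `pair_permille` and `bootstrap_error`, the use of the standard deviation as the reported error, the fixed random seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples.

    Assumption: the error is the sample standard deviation of the
    resampled estimates; the page does not say which statistic is used.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteome (hypothetical sequences).
proteins = ["MKKLLE", "MKKA", "AAAA", "MKLE"]
print(pair_permille(proteins, "KK"), "+/-", bootstrap_error(proteins, "KK"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.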

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski