Amino acid dipeptide frequency for Staphylococcus virus 42e

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
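The calculation can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the function names are hypothetical, any residue outside the 20 standard one-letter codes is mapped to Xaa, dipeptides are counted within each protein only, and the counts are normalised by the total number of dipeptides (how the database itself normalises is not stated on this page).

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
AA3 = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def three_letter(residue):
    """Map a one-letter code to its three-letter code (assumption: anything else is Xaa)."""
    return AA3.get(residue.upper(), "Xaa")


def dipeptide_counts(proteins):
    """Count ordered pairs of adjacent residues, never pairing across two proteins."""
    counts = Counter()
    for seq in proteins:
        codes = [three_letter(r) for r in seq]
        for first, second in zip(codes, codes[1:]):
            counts[first + second] += 1
    return counts


def dipeptide_permille(proteins):
    """Return all 441 dipeptide frequencies in per mille.

    Assumption: the denominator is the total number of dipeptides counted.
    """
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    labels = list(AA3.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(labels, repeat=2)
    }
```

For example, dipeptide_permille(["MKKA", "AKX"]) counts five dipeptides (MetLys, LysLys, LysAla, AlaLys, LysXaa), so each of them gets 200‰ and every other entry is 0.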

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
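The arithmetic behind the expected value and the unit conversion can be written out as a small Python sketch. The function names are hypothetical, and using the uniform expectation (2.5‰ for 400 dipeptides) as the threshold between over- and underrepresented is an assumption based on the description above, not a confirmed detail of how the table is coloured.

```python
def uniform_expectation_permille(include_xaa=False):
    """Expected per-mille frequency if every dipeptide were equally likely."""
    n = 21 if include_xaa else 20  # 21 when Xaa is counted as an extra residue type
    return 1000.0 / (n * n)


def permille_to_fraction(value_permille):
    """Convert a per-mille value from the table to a plain fraction."""
    return value_permille / 1000.0


def mark(value_permille, expected_permille=2.5):
    """Label a table value relative to the expectation (assumed threshold)."""
    if value_permille > expected_permille:
        return "over"   # shown in red on the page
    if value_permille < expected_permille:
        return "under"  # shown in blue on the page
    return "expected"
```

With these definitions, uniform_expectation_permille() is 2.5, the LysGlu value of 9.941‰ is labelled "over", and the CysCys value of 0.068‰ is labelled "under".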

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.792 ± 0.729
AlaCys: 0.409 ± 0.188
AlaAsp: 2.928 ± 0.462
AlaGlu: 3.949 ± 0.447
AlaPhe: 1.566 ± 0.33
AlaGly: 2.996 ± 0.572
AlaHis: 1.021 ± 0.29
AlaIle: 4.902 ± 0.636
AlaLys: 5.992 ± 0.972
AlaLeu: 5.379 ± 0.929
AlaMet: 1.226 ± 0.28
AlaAsn: 3.949 ± 0.808
AlaPro: 1.77 ± 0.357
AlaGln: 1.702 ± 0.348
AlaArg: 2.655 ± 0.401
AlaSer: 4.766 ± 0.722
AlaThr: 3.541 ± 0.455
AlaVal: 3.404 ± 0.486
AlaTrp: 0.817 ± 0.315
AlaTyr: 1.838 ± 0.319
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.272 ± 0.15
CysCys: 0.068 ± 0.061
CysAsp: 0.136 ± 0.102
CysGlu: 0.477 ± 0.21
CysPhe: 0.409 ± 0.201
CysGly: 0.34 ± 0.182
CysHis: 0.136 ± 0.087
CysIle: 0.545 ± 0.213
CysLys: 0.545 ± 0.195
CysLeu: 0.477 ± 0.218
CysMet: 0.136 ± 0.099
CysAsn: 0.068 ± 0.064
CysPro: 0.0 ± 0.0
CysGln: 0.068 ± 0.071
CysArg: 0.34 ± 0.193
CysSer: 0.136 ± 0.089
CysThr: 0.477 ± 0.194
CysVal: 0.204 ± 0.111
CysTrp: 0.0 ± 0.0
CysTyr: 0.34 ± 0.164
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.996 ± 0.526
AspCys: 0.136 ± 0.111
AspAsp: 3.609 ± 0.661
AspGlu: 5.175 ± 0.834
AspPhe: 3.404 ± 0.516
AspGly: 4.494 ± 0.641
AspHis: 0.885 ± 0.249
AspIle: 5.107 ± 0.626
AspLys: 6.741 ± 0.717
AspLeu: 5.038 ± 0.419
AspMet: 1.702 ± 0.33
AspAsn: 3.472 ± 0.515
AspPro: 1.021 ± 0.281
AspGln: 1.089 ± 0.241
AspArg: 1.975 ± 0.423
AspSer: 3.745 ± 0.548
AspThr: 3.609 ± 0.483
AspVal: 4.358 ± 0.51
AspTrp: 0.681 ± 0.222
AspTyr: 3.677 ± 0.635
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.766 ± 0.538
GluCys: 0.681 ± 0.222
GluAsp: 4.494 ± 0.635
GluGlu: 5.311 ± 0.919
GluPhe: 2.996 ± 0.453
GluGly: 3.064 ± 0.428
GluHis: 1.498 ± 0.356
GluIle: 5.787 ± 0.928
GluLys: 7.762 ± 0.832
GluLeu: 6.741 ± 0.787
GluMet: 3.2 ± 0.492
GluAsn: 5.515 ± 0.667
GluPro: 1.362 ± 0.35
GluGln: 3.2 ± 0.474
GluArg: 3.336 ± 0.603
GluSer: 3.745 ± 0.572
GluThr: 4.494 ± 0.698
GluVal: 3.2 ± 0.433
GluTrp: 1.089 ± 0.241
GluTyr: 2.996 ± 0.577
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.111 ± 0.388
PheCys: 0.34 ± 0.182
PheAsp: 3.609 ± 0.544
PheGlu: 2.792 ± 0.526
PhePhe: 1.362 ± 0.337
PheGly: 3.336 ± 0.518
PheHis: 0.272 ± 0.133
PheIle: 2.86 ± 0.526
PheLys: 5.243 ± 0.665
PheLeu: 2.315 ± 0.367
PheMet: 0.953 ± 0.231
PheAsn: 3.132 ± 0.503
PhePro: 0.749 ± 0.29
PheGln: 1.089 ± 0.313
PheArg: 1.021 ± 0.232
PheSer: 1.702 ± 0.268
PheThr: 2.043 ± 0.425
PheVal: 2.111 ± 0.388
PheTrp: 0.34 ± 0.165
PheTyr: 1.566 ± 0.356
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.472 ± 0.862
GlyCys: 0.272 ± 0.133
GlyAsp: 3.949 ± 0.486
GlyGlu: 3.541 ± 0.435
GlyPhe: 2.519 ± 0.325
GlyGly: 4.63 ± 1.085
GlyHis: 1.021 ± 0.261
GlyIle: 3.745 ± 0.553
GlyLys: 5.924 ± 0.658
GlyLeu: 5.515 ± 0.808
GlyMet: 1.362 ± 0.379
GlyAsn: 3.064 ± 0.507
GlyPro: 0.885 ± 0.27
GlyGln: 1.838 ± 0.362
GlyArg: 1.975 ± 0.443
GlySer: 3.677 ± 0.686
GlyThr: 3.677 ± 0.526
GlyVal: 4.562 ± 0.641
GlyTrp: 1.021 ± 0.294
GlyTyr: 2.996 ± 0.498
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.157 ± 0.303
HisCys: 0.068 ± 0.062
HisAsp: 0.613 ± 0.197
HisGlu: 1.226 ± 0.266
HisPhe: 0.953 ± 0.376
HisGly: 1.362 ± 0.337
HisHis: 0.409 ± 0.179
HisIle: 1.498 ± 0.411
HisLys: 1.498 ± 0.311
HisLeu: 1.702 ± 0.337
HisMet: 0.477 ± 0.176
HisAsn: 1.021 ± 0.26
HisPro: 0.749 ± 0.207
HisGln: 0.953 ± 0.231
HisArg: 0.817 ± 0.216
HisSer: 1.021 ± 0.228
HisThr: 0.885 ± 0.213
HisVal: 0.613 ± 0.187
HisTrp: 0.34 ± 0.156
HisTyr: 1.089 ± 0.278
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.153 ± 0.561
IleCys: 0.681 ± 0.274
IleAsp: 5.992 ± 0.809
IleGlu: 5.311 ± 0.537
IlePhe: 2.792 ± 0.509
IleGly: 4.017 ± 0.559
IleHis: 1.294 ± 0.258
IleIle: 4.358 ± 0.605
IleLys: 7.013 ± 0.862
IleLeu: 4.221 ± 0.669
IleMet: 2.519 ± 0.473
IleAsn: 4.766 ± 0.479
IlePro: 1.906 ± 0.314
IleGln: 2.247 ± 0.432
IleArg: 3.404 ± 0.442
IleSer: 5.175 ± 0.682
IleThr: 5.107 ± 0.621
IleVal: 3.949 ± 0.681
IleTrp: 0.34 ± 0.15
IleTyr: 2.655 ± 0.456
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.149 ± 1.076
LysCys: 0.34 ± 0.156
LysAsp: 4.97 ± 0.519
LysGlu: 9.941 ± 1.012
LysPhe: 2.383 ± 0.527
LysGly: 5.175 ± 0.817
LysHis: 1.906 ± 0.399
LysIle: 6.128 ± 0.769
LysLys: 7.762 ± 1.059
LysLeu: 8.511 ± 0.836
LysMet: 2.792 ± 0.433
LysAsn: 5.992 ± 0.561
LysPro: 2.247 ± 0.395
LysGln: 6.128 ± 0.66
LysArg: 4.085 ± 0.732
LysSer: 6.604 ± 1.12
LysThr: 5.175 ± 0.617
LysVal: 5.651 ± 0.77
LysTrp: 1.498 ± 0.445
LysTyr: 3.949 ± 0.54
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.63 ± 0.729
LeuCys: 0.409 ± 0.18
LeuAsp: 5.447 ± 0.856
LeuGlu: 6.264 ± 0.669
LeuPhe: 3.132 ± 0.43
LeuGly: 4.153 ± 0.872
LeuHis: 1.43 ± 0.401
LeuIle: 5.243 ± 0.621
LeuLys: 8.851 ± 1.03
LeuLeu: 6.536 ± 0.68
LeuMet: 1.838 ± 0.338
LeuAsn: 5.856 ± 0.736
LeuPro: 2.723 ± 0.555
LeuGln: 3.268 ± 0.616
LeuArg: 2.996 ± 0.445
LeuSer: 5.175 ± 0.696
LeuThr: 5.311 ± 0.585
LeuVal: 3.745 ± 0.482
LeuTrp: 0.409 ± 0.185
LeuTyr: 3.268 ± 0.687
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.43 ± 0.289
MetCys: 0.068 ± 0.071
MetAsp: 1.294 ± 0.401
MetGlu: 1.498 ± 0.258
MetPhe: 1.021 ± 0.247
MetGly: 1.566 ± 0.45
MetHis: 0.613 ± 0.205
MetIle: 1.43 ± 0.238
MetLys: 2.86 ± 0.533
MetLeu: 2.451 ± 0.552
MetMet: 0.681 ± 0.213
MetAsn: 2.451 ± 0.44
MetPro: 0.885 ± 0.224
MetGln: 1.838 ± 0.388
MetArg: 1.157 ± 0.296
MetSer: 1.975 ± 0.381
MetThr: 1.906 ± 0.37
MetVal: 1.43 ± 0.254
MetTrp: 0.34 ± 0.131
MetTyr: 0.681 ± 0.201
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.949 ± 0.456
AsnCys: 0.204 ± 0.157
AsnAsp: 3.677 ± 0.548
AsnGlu: 4.902 ± 0.643
AsnPhe: 1.294 ± 0.236
AsnGly: 5.107 ± 0.591
AsnHis: 1.43 ± 0.322
AsnIle: 4.698 ± 0.577
AsnLys: 6.809 ± 0.712
AsnLeu: 4.97 ± 0.566
AsnMet: 1.566 ± 0.296
AsnAsn: 5.038 ± 1.002
AsnPro: 2.247 ± 0.365
AsnGln: 2.383 ± 0.375
AsnArg: 2.86 ± 0.486
AsnSer: 3.881 ± 0.61
AsnThr: 3.677 ± 0.486
AsnVal: 2.996 ± 0.396
AsnTrp: 1.157 ± 0.357
AsnTyr: 3.064 ± 0.489
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.157 ± 0.22
ProCys: 0.136 ± 0.099
ProAsp: 1.566 ± 0.32
ProGlu: 1.906 ± 0.416
ProPhe: 1.566 ± 0.396
ProGly: 1.43 ± 0.359
ProHis: 0.272 ± 0.141
ProIle: 1.77 ± 0.366
ProLys: 2.519 ± 0.507
ProLeu: 2.315 ± 0.508
ProMet: 0.681 ± 0.249
ProAsn: 1.566 ± 0.316
ProPro: 0.545 ± 0.23
ProGln: 1.157 ± 0.241
ProArg: 1.089 ± 0.284
ProSer: 2.043 ± 0.418
ProThr: 1.838 ± 0.342
ProVal: 1.226 ± 0.317
ProTrp: 0.204 ± 0.118
ProTyr: 1.157 ± 0.355
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.86 ± 0.362
GlnCys: 0.204 ± 0.131
GlnAsp: 2.315 ± 0.367
GlnGlu: 2.723 ± 0.302
GlnPhe: 1.566 ± 0.356
GlnGly: 1.838 ± 0.353
GlnHis: 0.953 ± 0.258
GlnIle: 2.723 ± 0.521
GlnLys: 3.404 ± 0.466
GlnLeu: 3.2 ± 0.457
GlnMet: 1.021 ± 0.269
GlnAsn: 2.519 ± 0.548
GlnPro: 1.226 ± 0.319
GlnGln: 2.043 ± 0.577
GlnArg: 2.179 ± 0.396
GlnSer: 2.111 ± 0.387
GlnThr: 1.975 ± 0.435
GlnVal: 2.587 ± 0.436
GlnTrp: 0.409 ± 0.245
GlnTyr: 1.838 ± 0.426
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.838 ± 0.497
ArgCys: 0.136 ± 0.11
ArgAsp: 2.451 ± 0.358
ArgGlu: 3.064 ± 0.524
ArgPhe: 2.111 ± 0.389
ArgGly: 1.838 ± 0.365
ArgHis: 1.089 ± 0.276
ArgIle: 3.132 ± 0.476
ArgLys: 4.29 ± 0.596
ArgLeu: 2.928 ± 0.596
ArgMet: 1.021 ± 0.215
ArgAsn: 2.655 ± 0.509
ArgPro: 1.089 ± 0.345
ArgGln: 1.362 ± 0.311
ArgArg: 1.498 ± 0.321
ArgSer: 1.975 ± 0.409
ArgThr: 2.928 ± 0.559
ArgVal: 2.111 ± 0.338
ArgTrp: 0.409 ± 0.242
ArgTyr: 2.179 ± 0.428
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.745 ± 0.79
SerCys: 0.272 ± 0.12
SerAsp: 4.97 ± 0.52
SerGlu: 4.29 ± 0.49
SerPhe: 2.519 ± 0.46
SerGly: 4.358 ± 0.662
SerHis: 0.681 ± 0.213
SerIle: 3.881 ± 0.525
SerLys: 6.264 ± 1.045
SerLeu: 5.107 ± 0.448
SerMet: 2.179 ± 0.409
SerAsn: 4.902 ± 0.603
SerPro: 1.566 ± 0.391
SerGln: 2.315 ± 0.316
SerArg: 1.838 ± 0.357
SerSer: 2.928 ± 0.794
SerThr: 3.132 ± 0.513
SerVal: 3.472 ± 0.618
SerTrp: 0.817 ± 0.265
SerTyr: 2.587 ± 0.333
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.064 ± 0.524
ThrCys: 0.068 ± 0.084
ThrAsp: 3.881 ± 0.627
ThrGlu: 4.221 ± 0.555
ThrPhe: 2.519 ± 0.402
ThrGly: 4.494 ± 0.627
ThrHis: 1.634 ± 0.306
ThrIle: 5.175 ± 0.716
ThrLys: 5.243 ± 0.647
ThrLeu: 4.221 ± 0.502
ThrMet: 1.021 ± 0.257
ThrAsn: 3.404 ± 0.591
ThrPro: 2.655 ± 0.379
ThrGln: 2.179 ± 0.381
ThrArg: 1.975 ± 0.404
ThrSer: 3.404 ± 0.564
ThrThr: 2.996 ± 0.401
ThrVal: 4.358 ± 0.578
ThrTrp: 0.749 ± 0.261
ThrTyr: 1.77 ± 0.366
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.268 ± 0.412
ValCys: 0.272 ± 0.168
ValAsp: 3.813 ± 0.595
ValGlu: 4.97 ± 0.63
ValPhe: 2.383 ± 0.392
ValGly: 3.268 ± 0.594
ValHis: 0.681 ± 0.17
ValIle: 4.358 ± 0.581
ValLys: 4.902 ± 0.621
ValLeu: 4.902 ± 0.613
ValMet: 1.634 ± 0.313
ValAsn: 3.677 ± 0.48
ValPro: 1.43 ± 0.335
ValGln: 1.975 ± 0.379
ValArg: 2.179 ± 0.372
ValSer: 4.085 ± 0.569
ValThr: 2.996 ± 0.605
ValVal: 3.268 ± 0.522
ValTrp: 0.409 ± 0.188
ValTyr: 2.315 ± 0.416
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.613 ± 0.241
TrpCys: 0.068 ± 0.066
TrpAsp: 0.272 ± 0.172
TrpGlu: 0.613 ± 0.265
TrpPhe: 0.953 ± 0.308
TrpGly: 0.136 ± 0.102
TrpHis: 0.136 ± 0.107
TrpIle: 1.226 ± 0.308
TrpLys: 0.817 ± 0.308
TrpLeu: 0.885 ± 0.258
TrpMet: 0.545 ± 0.191
TrpAsn: 1.021 ± 0.252
TrpPro: 0.068 ± 0.073
TrpGln: 0.613 ± 0.175
TrpArg: 0.409 ± 0.151
TrpSer: 1.021 ± 0.304
TrpThr: 0.681 ± 0.199
TrpVal: 0.817 ± 0.212
TrpTrp: 0.136 ± 0.104
TrpTyr: 0.749 ± 0.262
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.179 ± 0.336
TyrCys: 0.34 ± 0.171
TyrAsp: 2.928 ± 0.536
TyrGlu: 3.064 ± 0.509
TyrPhe: 1.498 ± 0.402
TyrGly: 2.043 ± 0.432
TyrHis: 1.089 ± 0.304
TyrIle: 3.336 ± 0.552
TyrLys: 3.949 ± 0.54
TyrLeu: 3.268 ± 0.552
TyrMet: 0.953 ± 0.248
TyrAsn: 1.838 ± 0.396
TyrPro: 1.021 ± 0.259
TyrGln: 2.247 ± 0.376
TyrArg: 2.315 ± 0.506
TyrSer: 2.655 ± 0.387
TyrThr: 2.587 ± 0.48
TyrVal: 2.723 ± 0.427
TyrTrp: 0.613 ± 0.232
TyrTyr: 1.566 ± 0.401
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (14688 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
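Bootstrapping at the protein level means resampling whole proteins with replacement, rather than individual residues. A minimal Python sketch of the idea follows. It makes several assumptions that the page does not confirm: the function names are hypothetical, dipeptides are given as two one-letter codes (for example "KE" for LysGlu), frequencies are normalised by the number of dipeptides in the resample, and the reported error is taken to be the standard deviation across 100 resamples.

```python
import random
from statistics import stdev


def frequency_permille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide (two one-letter codes) in a set of proteins."""
    hits = 0
    total = 0
    for seq in proteins:
        pairs = max(len(seq) - 1, 0)  # adjacent pairs within this protein
        total += pairs
        hits += sum(1 for i in range(pairs) if seq[i:i + 2] == dipeptide)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: the error is the sample standard deviation of the estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_permille(resample, dipeptide))
    return stdev(estimates)
```

A call such as bootstrap_error(sequences, "KE") would then give an error estimate for LysGlu, where sequences is a list of the proteome's protein sequences. If every protein in the list is identical, all resamples give the same frequency and the estimated error is 0.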

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski