Amino acid dipeptide frequency for Streptococcus virus Sfi11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
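
The counting described above can be sketched in Python. This is a minimal illustration, not the database's actual pipeline: the function names, the one-letter alphabet, the mapping of unknown residues to `X` (standing in for Xaa), and the example sequences are all assumptions made for this sketch. Dipeptides are counted as overlapping pairs within each protein and compared against the uniform expectation of 1/400.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"           # X stands in for Xaa (assumption of this sketch)
UNIFORM_EXPECTED_PERMILLE = 1000.0 / 400   # 0.25% = 2.5 per mille

def dipeptide_permille(sequences):
    """Return the frequency of every dipeptide in per mille (441 keys)."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs, never spanning two different proteins.
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}

def classify(value_permille, expected=UNIFORM_EXPECTED_PERMILLE):
    """Label a dipeptide relative to the uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "equal"

# Hypothetical toy input, not taken from the Sfi11 proteome.
freqs = dipeptide_permille(["AAK", "AK"])
print(round(freqs["AK"], 3), classify(4.893), classify(0.245))
```

With the values from the table, AlaAla (4.893‰) would be labelled overrepresented and AlaCys (0.245‰) underrepresented under this uniform baseline.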

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
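
As a worked example of the unit conversion, the following sketch turns a per mille value from the table into a fraction and compares it with the uniform expectation of 1/400. The helper name `permille_to_fraction` is made up for this illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

ala_ala = permille_to_fraction(4.893)   # AlaAla value from the table
uniform = 1 / 400                       # 0.0025, i.e. 0.25% or 2.5 per mille
enrichment = ala_ala / uniform          # roughly 1.96-fold above uniform

print(f"{ala_ala:.6f} {enrichment:.2f}")
```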

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.893AlaAla: 4.893 ± 1.569
0.245AlaCys: 0.245 ± 0.128
5.219AlaAsp: 5.219 ± 0.902
3.914AlaGlu: 3.914 ± 0.583
2.773AlaPhe: 2.773 ± 0.856
5.627AlaGly: 5.627 ± 1.222
0.734AlaHis: 0.734 ± 0.268
6.034AlaIle: 6.034 ± 1.541
5.056AlaLys: 5.056 ± 0.686
5.953AlaLeu: 5.953 ± 0.795
2.283AlaMet: 2.283 ± 0.967
3.099AlaAsn: 3.099 ± 0.623
2.365AlaPro: 2.365 ± 0.51
2.936AlaGln: 2.936 ± 0.89
3.67AlaArg: 3.67 ± 0.524
5.708AlaSer: 5.708 ± 1.307
4.648AlaThr: 4.648 ± 0.781
4.24AlaVal: 4.24 ± 0.78
0.815AlaTrp: 0.815 ± 0.285
3.099AlaTyr: 3.099 ± 0.566
0.0AlaXaa: 0.0 ± 0.0
Cys
0.082CysAla: 0.082 ± 0.076
0.082CysCys: 0.082 ± 0.09
0.408CysAsp: 0.408 ± 0.243
0.408CysGlu: 0.408 ± 0.16
0.0CysPhe: 0.0 ± 0.0
0.326CysGly: 0.326 ± 0.177
0.163CysHis: 0.163 ± 0.122
0.326CysIle: 0.326 ± 0.152
0.489CysLys: 0.489 ± 0.198
0.571CysLeu: 0.571 ± 0.229
0.0CysMet: 0.0 ± 0.0
0.408CysAsn: 0.408 ± 0.194
0.082CysPro: 0.082 ± 0.091
0.082CysGln: 0.082 ± 0.091
0.408CysArg: 0.408 ± 0.194
0.326CysSer: 0.326 ± 0.168
0.245CysThr: 0.245 ± 0.182
0.163CysVal: 0.163 ± 0.108
0.163CysTrp: 0.163 ± 0.125
0.571CysTyr: 0.571 ± 0.315
0.0CysXaa: 0.0 ± 0.0
Asp
2.283AspAla: 2.283 ± 0.387
0.571AspCys: 0.571 ± 0.224
4.567AspAsp: 4.567 ± 0.562
3.996AspGlu: 3.996 ± 0.758
3.506AspPhe: 3.506 ± 0.544
6.116AspGly: 6.116 ± 1.116
0.489AspHis: 0.489 ± 0.195
4.077AspIle: 4.077 ± 0.589
4.648AspLys: 4.648 ± 0.811
3.914AspLeu: 3.914 ± 0.595
1.549AspMet: 1.549 ± 0.327
5.382AspAsn: 5.382 ± 0.997
1.142AspPro: 1.142 ± 0.326
1.549AspGln: 1.549 ± 0.345
2.609AspArg: 2.609 ± 0.4
4.159AspSer: 4.159 ± 0.566
3.506AspThr: 3.506 ± 0.447
3.343AspVal: 3.343 ± 0.604
0.979AspTrp: 0.979 ± 0.278
3.343AspTyr: 3.343 ± 0.601
0.0AspXaa: 0.0 ± 0.0
Glu
4.648GluAla: 4.648 ± 0.778
0.163GluCys: 0.163 ± 0.119
3.343GluAsp: 3.343 ± 0.578
3.833GluGlu: 3.833 ± 0.783
3.099GluPhe: 3.099 ± 0.579
3.588GluGly: 3.588 ± 0.468
1.223GluHis: 1.223 ± 0.353
5.137GluIle: 5.137 ± 0.803
4.893GluLys: 4.893 ± 1.253
5.79GluLeu: 5.79 ± 1.112
2.283GluMet: 2.283 ± 0.43
4.485GluAsn: 4.485 ± 0.778
1.957GluPro: 1.957 ± 0.568
2.283GluGln: 2.283 ± 0.395
3.506GluArg: 3.506 ± 0.79
2.528GluSer: 2.528 ± 0.503
2.854GluThr: 2.854 ± 0.529
5.219GluVal: 5.219 ± 0.913
1.386GluTrp: 1.386 ± 0.37
3.343GluTyr: 3.343 ± 0.818
0.0GluXaa: 0.0 ± 0.0
Phe
2.283PheAla: 2.283 ± 0.389
0.245PheCys: 0.245 ± 0.143
2.936PheAsp: 2.936 ± 0.439
3.425PheGlu: 3.425 ± 0.794
1.468PhePhe: 1.468 ± 0.373
3.343PheGly: 3.343 ± 0.563
0.408PheHis: 0.408 ± 0.16
2.12PheIle: 2.12 ± 0.392
4.893PheLys: 4.893 ± 0.562
2.365PheLeu: 2.365 ± 0.539
0.897PheMet: 0.897 ± 0.249
2.936PheAsn: 2.936 ± 0.481
0.489PhePro: 0.489 ± 0.262
1.549PheGln: 1.549 ± 0.335
1.549PheArg: 1.549 ± 0.375
2.936PheSer: 2.936 ± 0.561
2.528PheThr: 2.528 ± 0.479
2.039PheVal: 2.039 ± 0.397
0.652PheTrp: 0.652 ± 0.223
1.631PheTyr: 1.631 ± 0.414
0.0PheXaa: 0.0 ± 0.0
Gly
5.464GlyAla: 5.464 ± 0.738
0.163GlyCys: 0.163 ± 0.12
3.099GlyAsp: 3.099 ± 0.425
3.18GlyGlu: 3.18 ± 0.453
2.446GlyPhe: 2.446 ± 0.456
4.159GlyGly: 4.159 ± 0.703
0.897GlyHis: 0.897 ± 0.305
6.524GlyIle: 6.524 ± 1.961
6.116GlyLys: 6.116 ± 0.758
5.464GlyLeu: 5.464 ± 0.838
1.712GlyMet: 1.712 ± 0.615
4.322GlyAsn: 4.322 ± 0.82
1.223GlyPro: 1.223 ± 0.41
2.609GlyGln: 2.609 ± 0.54
3.914GlyArg: 3.914 ± 0.648
4.811GlySer: 4.811 ± 0.765
5.056GlyThr: 5.056 ± 0.679
4.403GlyVal: 4.403 ± 0.625
1.06GlyTrp: 1.06 ± 0.281
2.854GlyTyr: 2.854 ± 0.435
0.0GlyXaa: 0.0 ± 0.0
His
0.734HisAla: 0.734 ± 0.231
0.0HisCys: 0.0 ± 0.0
0.897HisAsp: 0.897 ± 0.247
0.571HisGlu: 0.571 ± 0.225
0.734HisPhe: 0.734 ± 0.233
0.979HisGly: 0.979 ± 0.301
0.408HisHis: 0.408 ± 0.173
0.571HisIle: 0.571 ± 0.209
0.897HisLys: 0.897 ± 0.28
1.142HisLeu: 1.142 ± 0.336
0.163HisMet: 0.163 ± 0.118
0.571HisAsn: 0.571 ± 0.243
0.408HisPro: 0.408 ± 0.184
0.326HisGln: 0.326 ± 0.185
0.489HisArg: 0.489 ± 0.193
0.734HisSer: 0.734 ± 0.215
0.815HisThr: 0.815 ± 0.215
1.142HisVal: 1.142 ± 0.287
0.245HisTrp: 0.245 ± 0.154
0.734HisTyr: 0.734 ± 0.253
0.0HisXaa: 0.0 ± 0.0
Ile
5.382IleAla: 5.382 ± 1.115
0.408IleCys: 0.408 ± 0.184
5.79IleAsp: 5.79 ± 0.648
4.974IleGlu: 4.974 ± 0.722
1.712IlePhe: 1.712 ± 0.37
4.974IleGly: 4.974 ± 0.886
0.815IleHis: 0.815 ± 0.246
4.485IleIle: 4.485 ± 0.707
4.893IleLys: 4.893 ± 0.568
4.159IleLeu: 4.159 ± 0.576
1.223IleMet: 1.223 ± 0.325
4.159IleAsn: 4.159 ± 0.548
2.446IlePro: 2.446 ± 0.517
2.609IleGln: 2.609 ± 0.375
2.609IleArg: 2.609 ± 0.462
6.687IleSer: 6.687 ± 1.477
5.056IleThr: 5.056 ± 1.01
3.67IleVal: 3.67 ± 0.682
0.815IleTrp: 0.815 ± 0.266
3.017IleTyr: 3.017 ± 0.625
0.0IleXaa: 0.0 ± 0.0
Lys
6.524LysAla: 6.524 ± 0.767
0.245LysCys: 0.245 ± 0.127
4.648LysAsp: 4.648 ± 0.83
6.931LysGlu: 6.931 ± 1.225
2.528LysPhe: 2.528 ± 0.684
4.974LysGly: 4.974 ± 0.58
0.734LysHis: 0.734 ± 0.215
4.24LysIle: 4.24 ± 0.69
5.545LysLys: 5.545 ± 1.001
5.871LysLeu: 5.871 ± 0.894
1.712LysMet: 1.712 ± 0.421
4.322LysAsn: 4.322 ± 0.765
3.18LysPro: 3.18 ± 0.636
2.446LysGln: 2.446 ± 0.529
4.24LysArg: 4.24 ± 0.661
4.159LysSer: 4.159 ± 0.574
5.137LysThr: 5.137 ± 0.632
4.893LysVal: 4.893 ± 0.783
0.897LysTrp: 0.897 ± 0.173
3.262LysTyr: 3.262 ± 0.57
0.0LysXaa: 0.0 ± 0.0
Leu
6.279LeuAla: 6.279 ± 0.681
0.571LeuCys: 0.571 ± 0.233
5.056LeuAsp: 5.056 ± 0.635
5.953LeuGlu: 5.953 ± 0.991
3.017LeuPhe: 3.017 ± 0.413
5.137LeuGly: 5.137 ± 0.843
0.815LeuHis: 0.815 ± 0.287
3.588LeuIle: 3.588 ± 0.6
5.79LeuLys: 5.79 ± 0.786
4.73LeuLeu: 4.73 ± 0.706
1.957LeuMet: 1.957 ± 0.276
5.382LeuAsn: 5.382 ± 0.595
2.283LeuPro: 2.283 ± 0.461
2.691LeuGln: 2.691 ± 0.384
2.528LeuArg: 2.528 ± 0.577
5.953LeuSer: 5.953 ± 0.837
6.605LeuThr: 6.605 ± 1.287
4.159LeuVal: 4.159 ± 0.571
0.571LeuTrp: 0.571 ± 0.298
3.099LeuTyr: 3.099 ± 0.547
0.0LeuXaa: 0.0 ± 0.0
Met
2.854MetAla: 2.854 ± 0.578
0.082MetCys: 0.082 ± 0.091
0.734MetAsp: 0.734 ± 0.238
1.386MetGlu: 1.386 ± 0.373
0.815MetPhe: 0.815 ± 0.264
1.223MetGly: 1.223 ± 0.528
0.652MetHis: 0.652 ± 0.24
1.305MetIle: 1.305 ± 0.377
2.283MetLys: 2.283 ± 0.438
1.712MetLeu: 1.712 ± 0.333
0.979MetMet: 0.979 ± 0.489
1.142MetAsn: 1.142 ± 0.362
0.652MetPro: 0.652 ± 0.229
1.468MetGln: 1.468 ± 0.594
1.06MetArg: 1.06 ± 0.227
1.549MetSer: 1.549 ± 0.432
1.712MetThr: 1.712 ± 0.388
1.468MetVal: 1.468 ± 0.374
0.0MetTrp: 0.0 ± 0.0
0.571MetTyr: 0.571 ± 0.24
0.0MetXaa: 0.0 ± 0.0
Asn
4.893AsnAla: 4.893 ± 0.608
0.408AsnCys: 0.408 ± 0.163
3.099AsnAsp: 3.099 ± 0.64
4.403AsnGlu: 4.403 ± 0.814
2.283AsnPhe: 2.283 ± 0.534
5.708AsnGly: 5.708 ± 1.157
1.386AsnHis: 1.386 ± 0.426
3.506AsnIle: 3.506 ± 0.44
3.67AsnLys: 3.67 ± 0.594
4.485AsnLeu: 4.485 ± 0.541
1.142AsnMet: 1.142 ± 0.281
3.588AsnAsn: 3.588 ± 0.631
2.365AsnPro: 2.365 ± 0.385
1.876AsnGln: 1.876 ± 0.367
2.283AsnArg: 2.283 ± 0.49
3.425AsnSer: 3.425 ± 0.482
3.996AsnThr: 3.996 ± 0.591
4.322AsnVal: 4.322 ± 0.519
1.305AsnTrp: 1.305 ± 0.312
2.365AsnTyr: 2.365 ± 0.476
0.0AsnXaa: 0.0 ± 0.0
Pro
1.549ProAla: 1.549 ± 0.3
0.082ProCys: 0.082 ± 0.096
1.876ProAsp: 1.876 ± 0.389
1.957ProGlu: 1.957 ± 0.459
1.305ProPhe: 1.305 ± 0.266
1.305ProGly: 1.305 ± 0.241
0.163ProHis: 0.163 ± 0.104
2.12ProIle: 2.12 ± 0.484
2.528ProLys: 2.528 ± 0.437
1.712ProLeu: 1.712 ± 0.454
0.326ProMet: 0.326 ± 0.186
2.365ProAsn: 2.365 ± 0.498
1.305ProPro: 1.305 ± 0.352
1.468ProGln: 1.468 ± 0.405
1.06ProArg: 1.06 ± 0.312
2.365ProSer: 2.365 ± 0.359
1.386ProThr: 1.386 ± 0.407
2.773ProVal: 2.773 ± 0.488
0.326ProTrp: 0.326 ± 0.142
1.223ProTyr: 1.223 ± 0.397
0.0ProXaa: 0.0 ± 0.0
Gln
3.425GlnAla: 3.425 ± 0.794
0.408GlnCys: 0.408 ± 0.185
1.876GlnAsp: 1.876 ± 0.388
2.446GlnGlu: 2.446 ± 0.563
1.957GlnPhe: 1.957 ± 0.308
2.365GlnGly: 2.365 ± 0.589
0.571GlnHis: 0.571 ± 0.209
2.691GlnIle: 2.691 ± 0.532
2.283GlnLys: 2.283 ± 0.465
3.506GlnLeu: 3.506 ± 0.351
0.815GlnMet: 0.815 ± 0.249
2.039GlnAsn: 2.039 ± 0.326
0.897GlnPro: 0.897 ± 0.259
1.142GlnGln: 1.142 ± 0.288
1.06GlnArg: 1.06 ± 0.283
2.854GlnSer: 2.854 ± 0.728
2.854GlnThr: 2.854 ± 0.456
2.283GlnVal: 2.283 ± 0.364
0.408GlnTrp: 0.408 ± 0.153
1.468GlnTyr: 1.468 ± 0.398
0.0GlnXaa: 0.0 ± 0.0
Arg
2.773ArgAla: 2.773 ± 0.473
0.326ArgCys: 0.326 ± 0.156
2.039ArgAsp: 2.039 ± 0.348
2.936ArgGlu: 2.936 ± 0.519
1.957ArgPhe: 1.957 ± 0.323
2.365ArgGly: 2.365 ± 0.364
0.571ArgHis: 0.571 ± 0.269
3.18ArgIle: 3.18 ± 0.594
3.996ArgLys: 3.996 ± 0.887
3.751ArgLeu: 3.751 ± 0.474
1.468ArgMet: 1.468 ± 0.386
2.365ArgAsn: 2.365 ± 0.527
0.897ArgPro: 0.897 ± 0.274
1.549ArgGln: 1.549 ± 0.296
1.712ArgArg: 1.712 ± 0.461
2.202ArgSer: 2.202 ± 0.374
2.365ArgThr: 2.365 ± 0.431
2.854ArgVal: 2.854 ± 0.465
0.571ArgTrp: 0.571 ± 0.21
2.202ArgTyr: 2.202 ± 0.364
0.0ArgXaa: 0.0 ± 0.0
Ser
6.279SerAla: 6.279 ± 2.518
0.408SerCys: 0.408 ± 0.176
4.485SerAsp: 4.485 ± 0.501
2.854SerGlu: 2.854 ± 0.439
3.18SerPhe: 3.18 ± 0.66
5.464SerGly: 5.464 ± 0.921
0.489SerHis: 0.489 ± 0.221
5.79SerIle: 5.79 ± 0.624
4.567SerLys: 4.567 ± 0.663
5.219SerLeu: 5.219 ± 0.706
1.305SerMet: 1.305 ± 0.359
3.425SerAsn: 3.425 ± 0.549
2.039SerPro: 2.039 ± 0.319
3.343SerGln: 3.343 ± 0.84
2.202SerArg: 2.202 ± 0.412
4.893SerSer: 4.893 ± 1.553
4.974SerThr: 4.974 ± 0.76
5.382SerVal: 5.382 ± 0.921
0.652SerTrp: 0.652 ± 0.251
2.039SerTyr: 2.039 ± 0.445
0.0SerXaa: 0.0 ± 0.0
Thr
4.974ThrAla: 4.974 ± 1.327
0.408ThrCys: 0.408 ± 0.281
3.18ThrAsp: 3.18 ± 0.614
3.425ThrGlu: 3.425 ± 0.565
3.588ThrPhe: 3.588 ± 0.572
3.833ThrGly: 3.833 ± 0.511
0.979ThrHis: 0.979 ± 0.27
5.871ThrIle: 5.871 ± 1.205
5.464ThrLys: 5.464 ± 0.698
6.116ThrLeu: 6.116 ± 0.729
1.142ThrMet: 1.142 ± 0.445
3.506ThrAsn: 3.506 ± 0.644
2.365ThrPro: 2.365 ± 0.436
2.691ThrGln: 2.691 ± 0.543
1.876ThrArg: 1.876 ± 0.426
4.567ThrSer: 4.567 ± 1.07
4.73ThrThr: 4.73 ± 0.732
4.893ThrVal: 4.893 ± 0.718
0.652ThrTrp: 0.652 ± 0.317
2.773ThrTyr: 2.773 ± 0.621
0.0ThrXaa: 0.0 ± 0.0
Val
3.425ValAla: 3.425 ± 0.772
0.082ValCys: 0.082 ± 0.091
4.567ValAsp: 4.567 ± 0.804
4.974ValGlu: 4.974 ± 1.073
2.12ValPhe: 2.12 ± 0.415
4.403ValGly: 4.403 ± 0.758
0.489ValHis: 0.489 ± 0.182
4.322ValIle: 4.322 ± 0.495
5.219ValLys: 5.219 ± 0.548
5.464ValLeu: 5.464 ± 0.727
1.305ValMet: 1.305 ± 0.394
4.322ValAsn: 4.322 ± 0.846
1.549ValPro: 1.549 ± 0.356
2.528ValGln: 2.528 ± 0.547
2.12ValArg: 2.12 ± 0.442
5.219ValSer: 5.219 ± 0.615
4.974ValThr: 4.974 ± 0.669
4.567ValVal: 4.567 ± 0.601
1.142ValTrp: 1.142 ± 0.267
1.957ValTyr: 1.957 ± 0.497
0.0ValXaa: 0.0 ± 0.0
Trp
0.897TrpAla: 0.897 ± 0.278
0.082TrpCys: 0.082 ± 0.079
0.815TrpAsp: 0.815 ± 0.277
0.897TrpGlu: 0.897 ± 0.26
0.652TrpPhe: 0.652 ± 0.252
0.897TrpGly: 0.897 ± 0.277
0.163TrpHis: 0.163 ± 0.11
0.897TrpIle: 0.897 ± 0.277
0.652TrpLys: 0.652 ± 0.2
0.734TrpLeu: 0.734 ± 0.33
0.163TrpMet: 0.163 ± 0.12
0.734TrpAsn: 0.734 ± 0.329
0.163TrpPro: 0.163 ± 0.099
0.652TrpGln: 0.652 ± 0.228
0.897TrpArg: 0.897 ± 0.322
1.549TrpSer: 1.549 ± 0.525
0.897TrpThr: 0.897 ± 0.261
0.979TrpVal: 0.979 ± 0.284
0.245TrpTrp: 0.245 ± 0.165
0.408TrpTyr: 0.408 ± 0.165
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.751TyrAla: 3.751 ± 0.579
0.326TyrCys: 0.326 ± 0.145
3.099TyrAsp: 3.099 ± 0.731
3.099TyrGlu: 3.099 ± 0.652
1.549TyrPhe: 1.549 ± 0.301
2.691TyrGly: 2.691 ± 0.49
0.408TyrHis: 0.408 ± 0.25
3.099TyrIle: 3.099 ± 0.556
2.609TyrLys: 2.609 ± 0.45
3.506TyrLeu: 3.506 ± 0.824
1.142TyrMet: 1.142 ± 0.266
1.957TyrAsn: 1.957 ± 0.497
1.386TyrPro: 1.386 ± 0.399
1.549TyrGln: 1.549 ± 0.308
2.283TyrArg: 2.283 ± 0.651
2.365TyrSer: 2.365 ± 0.457
2.691TyrThr: 2.691 ± 0.493
1.957TyrVal: 1.957 ± 0.381
0.489TyrTrp: 0.489 ± 0.192
1.957TyrTyr: 1.957 ± 0.67
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (12264 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
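
A protein-level bootstrap of this kind could look roughly like the sketch below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates. The source does not spell out these details, so treat them as assumptions. The function names and the toy sequences are hypothetical.

```python
import random
import statistics

def single_dipeptide_permille(sequences, dipeptide):
    """Frequency of one dipeptide in per mille, counting overlapping pairs."""
    total = 0
    hits = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(sequences, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)   # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(single_dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)

# Hypothetical toy proteome, not the 51 Sfi11 proteins.
proteins = ["AAKAA", "GKAAG", "MKKLA", "AAAA"]
print(bootstrap_error(proteins, "AA"))
```

Resampling at the protein level rather than at the residue level keeps each protein intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome.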

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski