Amino acid dipeptide frequency for Streptococcus virus 2972

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally frequent, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
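As a minimal sketch of the calculation described above (not the Proteome-pI implementation), the snippet below counts overlapping dipeptides in a list of protein sequences given in one-letter code and compares each frequency with the uniform expectation of 1/400. The function names, the toy sequences and the choice to normalise by the total number of dipeptides are assumptions made for illustration.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter code


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all proteins.

    Dipeptides are counted with a sliding window of width 2 inside each
    protein; pairs never span two proteins. Any non-standard letter is
    mapped to 'X' (Xaa). Normalising by the total number of dipeptides
    is an assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


def classify(freq_per_mille, expected_per_mille=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected_per_mille:
        return "over"   # marked in red in the table
    if freq_per_mille < expected_per_mille:
        return "under"  # marked in blue in the table
    return "expected"


if __name__ == "__main__":
    toy = ["MKAAL", "AAKX"]  # hypothetical sequences, not from this proteome
    freqs = dipeptide_frequencies(toy)
    print(freqs["AA"], classify(freqs["AA"]))
```

Applied to the table values, classify(6.387) labels AlaAla as overrepresented and classify(0.37) labels AlaCys as underrepresented.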

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
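The unit conversion can be illustrated with a short sketch; the helper name is hypothetical and the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a table value given in per mille to a plain fraction."""
    return value_per_mille * 1e-3


# AlaAla is listed as 6.387 per mille in the table.
ala_ala = per_mille_to_fraction(6.387)
print(f"fraction: {ala_ala:.6f}, percent: {ala_ala * 100:.4f}%")

# Uniform expectation for 400 dipeptides, expressed in per mille.
expected_per_mille = 1000 / 400
print(f"expected: {expected_per_mille} per mille")
```

On this scale the uniform expectation of 0.25% corresponds to 2.5‰, which is the value the table entries should be compared against.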

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.387 ± 1.906
AlaCys: 0.37 ± 0.252
AlaAsp: 5.183 ± 1.219
AlaGlu: 4.258 ± 0.606
AlaPhe: 2.684 ± 1.08
AlaGly: 5.831 ± 1.213
AlaHis: 0.926 ± 0.282
AlaIle: 6.849 ± 1.718
AlaLys: 5.091 ± 0.881
AlaLeu: 6.572 ± 1.548
AlaMet: 2.684 ± 1.033
AlaAsn: 4.443 ± 0.78
AlaPro: 2.592 ± 0.363
AlaGln: 3.054 ± 1.172
AlaArg: 2.777 ± 0.6
AlaSer: 6.664 ± 1.518
AlaThr: 5.368 ± 1.147
AlaVal: 4.443 ± 1.126
AlaTrp: 0.833 ± 0.284
AlaTyr: 2.592 ± 0.624
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.185 ± 0.131
CysCys: 0.0 ± 0.0
CysAsp: 0.555 ± 0.222
CysGlu: 0.463 ± 0.244
CysPhe: 0.093 ± 0.097
CysGly: 0.37 ± 0.238
CysHis: 0.185 ± 0.12
CysIle: 0.185 ± 0.104
CysLys: 0.37 ± 0.199
CysLeu: 0.185 ± 0.148
CysMet: 0.093 ± 0.096
CysAsn: 0.463 ± 0.236
CysPro: 0.093 ± 0.077
CysGln: 0.093 ± 0.077
CysArg: 0.278 ± 0.17
CysSer: 0.463 ± 0.215
CysThr: 0.278 ± 0.141
CysVal: 0.37 ± 0.164
CysTrp: 0.093 ± 0.101
CysTyr: 0.278 ± 0.134
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.24 ± 0.411
AspCys: 0.278 ± 0.163
AspAsp: 3.98 ± 0.767
AspGlu: 3.702 ± 0.895
AspPhe: 3.332 ± 0.615
AspGly: 6.849 ± 1.7
AspHis: 0.185 ± 0.127
AspIle: 3.332 ± 0.552
AspLys: 4.535 ± 0.837
AspLeu: 4.165 ± 0.732
AspMet: 1.481 ± 0.435
AspAsn: 3.517 ± 0.612
AspPro: 0.926 ± 0.342
AspGln: 1.666 ± 0.357
AspArg: 2.221 ± 0.372
AspSer: 4.813 ± 0.803
AspThr: 4.258 ± 0.702
AspVal: 4.258 ± 0.756
AspTrp: 0.833 ± 0.328
AspTyr: 3.887 ± 0.728
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.813 ± 0.996
GluCys: 0.185 ± 0.13
GluAsp: 2.407 ± 0.473
GluGlu: 3.054 ± 0.657
GluPhe: 2.684 ± 0.558
GluGly: 4.073 ± 0.53
GluHis: 1.111 ± 0.353
GluIle: 4.628 ± 0.68
GluLys: 4.443 ± 0.956
GluLeu: 6.387 ± 1.2
GluMet: 2.499 ± 0.673
GluAsn: 3.887 ± 0.575
GluPro: 1.759 ± 0.613
GluGln: 2.869 ± 0.577
GluArg: 3.61 ± 0.714
GluSer: 2.407 ± 0.777
GluThr: 3.332 ± 0.707
GluVal: 5.276 ± 0.832
GluTrp: 0.926 ± 0.276
GluTyr: 2.777 ± 0.782
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.407 ± 0.505
PheCys: 0.278 ± 0.17
PheAsp: 2.684 ± 0.567
PheGlu: 3.61 ± 0.594
PhePhe: 1.296 ± 0.403
PheGly: 3.517 ± 0.702
PheHis: 0.463 ± 0.188
PheIle: 2.684 ± 0.401
PheLys: 4.165 ± 0.624
PheLeu: 1.666 ± 0.588
PheMet: 0.555 ± 0.218
PheAsn: 2.962 ± 0.572
PhePro: 0.648 ± 0.245
PheGln: 1.481 ± 0.347
PheArg: 1.296 ± 0.314
PheSer: 3.054 ± 0.609
PheThr: 3.054 ± 0.499
PheVal: 2.036 ± 0.682
PheTrp: 0.648 ± 0.271
PheTyr: 1.203 ± 0.345
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.368 ± 1.094
GlyCys: 0.463 ± 0.233
GlyAsp: 3.702 ± 0.439
GlyGlu: 3.425 ± 0.506
GlyPhe: 3.24 ± 0.657
GlyGly: 3.332 ± 0.525
GlyHis: 0.833 ± 0.274
GlyIle: 6.664 ± 1.849
GlyLys: 7.034 ± 0.953
GlyLeu: 6.387 ± 0.791
GlyMet: 2.129 ± 0.735
GlyAsn: 3.61 ± 0.633
GlyPro: 0.833 ± 0.438
GlyGln: 3.24 ± 0.514
GlyArg: 3.147 ± 0.531
GlySer: 4.998 ± 0.7
GlyThr: 5.646 ± 0.664
GlyVal: 4.535 ± 0.682
GlyTrp: 1.111 ± 0.335
GlyTyr: 2.962 ± 0.561
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.926 ± 0.287
HisCys: 0.093 ± 0.076
HisAsp: 0.926 ± 0.232
HisGlu: 0.555 ± 0.195
HisPhe: 0.463 ± 0.205
HisGly: 0.74 ± 0.293
HisHis: 0.463 ± 0.197
HisIle: 1.018 ± 0.326
HisLys: 0.926 ± 0.327
HisLeu: 0.833 ± 0.266
HisMet: 0.278 ± 0.179
HisAsn: 0.37 ± 0.184
HisPro: 0.37 ± 0.176
HisGln: 0.185 ± 0.139
HisArg: 0.555 ± 0.233
HisSer: 0.74 ± 0.334
HisThr: 0.648 ± 0.221
HisVal: 1.018 ± 0.316
HisTrp: 0.185 ± 0.136
HisTyr: 0.463 ± 0.186
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.109 ± 1.17
IleCys: 0.555 ± 0.27
IleAsp: 5.091 ± 0.637
IleGlu: 3.61 ± 0.558
IlePhe: 1.759 ± 0.346
IleGly: 5.553 ± 1.042
IleHis: 1.111 ± 0.272
IleIle: 3.425 ± 0.831
IleLys: 4.72 ± 0.549
IleLeu: 3.61 ± 0.578
IleMet: 2.036 ± 0.428
IleAsn: 3.425 ± 0.59
IlePro: 2.962 ± 0.674
IleGln: 2.499 ± 0.436
IleArg: 2.962 ± 0.68
IleSer: 6.479 ± 1.594
IleThr: 4.258 ± 0.625
IleVal: 4.72 ± 0.819
IleTrp: 0.37 ± 0.171
IleTyr: 3.147 ± 0.819
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.942 ± 0.952
LysCys: 0.278 ± 0.154
LysAsp: 4.073 ± 0.725
LysGlu: 7.034 ± 1.067
LysPhe: 2.036 ± 0.442
LysGly: 5.091 ± 0.572
LysHis: 0.833 ± 0.292
LysIle: 4.258 ± 0.631
LysLys: 5.646 ± 1.258
LysLeu: 5.831 ± 0.849
LysMet: 1.481 ± 0.381
LysAsn: 3.98 ± 0.619
LysPro: 2.962 ± 0.516
LysGln: 2.129 ± 0.623
LysArg: 3.98 ± 0.825
LysSer: 4.72 ± 0.548
LysThr: 6.109 ± 0.874
LysVal: 3.887 ± 0.598
LysTrp: 1.111 ± 0.248
LysTyr: 3.147 ± 0.842
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.479 ± 0.87
LeuCys: 0.0 ± 0.0
LeuAsp: 4.535 ± 0.783
LeuGlu: 5.553 ± 0.862
LeuPhe: 2.407 ± 0.423
LeuGly: 5.368 ± 1.078
LeuHis: 0.463 ± 0.249
LeuIle: 4.073 ± 0.561
LeuLys: 5.276 ± 0.952
LeuLeu: 4.998 ± 0.733
LeuMet: 1.388 ± 0.346
LeuAsn: 5.368 ± 0.708
LeuPro: 2.684 ± 0.51
LeuGln: 2.592 ± 0.489
LeuArg: 2.962 ± 0.586
LeuSer: 6.109 ± 0.64
LeuThr: 5.924 ± 0.825
LeuVal: 4.906 ± 0.679
LeuTrp: 0.555 ± 0.328
LeuTyr: 2.777 ± 0.495
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.777 ± 0.853
MetCys: 0.093 ± 0.077
MetAsp: 0.926 ± 0.234
MetGlu: 1.111 ± 0.374
MetPhe: 1.203 ± 0.284
MetGly: 1.203 ± 0.391
MetHis: 0.185 ± 0.12
MetIle: 1.296 ± 0.386
MetLys: 2.036 ± 0.509
MetLeu: 1.203 ± 0.293
MetMet: 1.111 ± 0.492
MetAsn: 1.388 ± 0.368
MetPro: 0.926 ± 0.317
MetGln: 1.666 ± 0.473
MetArg: 1.111 ± 0.361
MetSer: 1.944 ± 0.465
MetThr: 1.296 ± 0.291
MetVal: 2.314 ± 0.534
MetTrp: 0.093 ± 0.086
MetTyr: 1.018 ± 0.387
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.258 ± 0.558
AsnCys: 0.37 ± 0.168
AsnAsp: 3.887 ± 0.857
AsnGlu: 4.258 ± 0.849
AsnPhe: 2.314 ± 0.574
AsnGly: 5.646 ± 0.898
AsnHis: 1.203 ± 0.391
AsnIle: 3.425 ± 0.451
AsnLys: 3.702 ± 0.536
AsnLeu: 3.24 ± 0.543
AsnMet: 1.018 ± 0.271
AsnAsn: 2.962 ± 0.618
AsnPro: 2.777 ± 0.756
AsnGln: 1.388 ± 0.314
AsnArg: 2.314 ± 0.58
AsnSer: 3.24 ± 0.559
AsnThr: 3.98 ± 0.763
AsnVal: 2.684 ± 0.417
AsnTrp: 1.481 ± 0.344
AsnTyr: 1.851 ± 0.553
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.481 ± 0.406
ProCys: 0.0 ± 0.0
ProAsp: 1.944 ± 0.48
ProGlu: 1.944 ± 0.479
ProPhe: 1.018 ± 0.306
ProGly: 1.759 ± 0.483
ProHis: 0.185 ± 0.109
ProIle: 1.481 ± 0.435
ProLys: 2.962 ± 0.508
ProLeu: 2.129 ± 0.518
ProMet: 0.185 ± 0.11
ProAsn: 2.221 ± 0.587
ProPro: 1.111 ± 0.325
ProGln: 1.111 ± 0.354
ProArg: 1.759 ± 0.458
ProSer: 2.777 ± 0.604
ProThr: 2.036 ± 0.705
ProVal: 1.944 ± 0.517
ProTrp: 0.37 ± 0.177
ProTyr: 1.111 ± 0.331
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.887 ± 1.232
GlnCys: 0.185 ± 0.103
GlnAsp: 1.759 ± 0.402
GlnGlu: 2.499 ± 0.649
GlnPhe: 2.129 ± 0.438
GlnGly: 2.592 ± 0.78
GlnHis: 0.185 ± 0.135
GlnIle: 2.314 ± 0.582
GlnLys: 2.592 ± 0.464
GlnLeu: 3.795 ± 0.457
GlnMet: 1.203 ± 0.391
GlnAsn: 1.388 ± 0.27
GlnPro: 0.833 ± 0.296
GlnGln: 1.018 ± 0.289
GlnArg: 1.111 ± 0.278
GlnSer: 2.684 ± 0.557
GlnThr: 2.684 ± 0.403
GlnVal: 2.499 ± 0.376
GlnTrp: 0.555 ± 0.267
GlnTyr: 1.481 ± 0.49
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.425 ± 0.456
ArgCys: 0.74 ± 0.269
ArgAsp: 2.499 ± 0.449
ArgGlu: 2.777 ± 0.71
ArgPhe: 1.481 ± 0.415
ArgGly: 3.24 ± 0.434
ArgHis: 0.463 ± 0.213
ArgIle: 3.054 ± 0.685
ArgLys: 3.332 ± 0.689
ArgLeu: 3.332 ± 0.601
ArgMet: 1.388 ± 0.367
ArgAsn: 1.851 ± 0.422
ArgPro: 1.018 ± 0.219
ArgGln: 1.296 ± 0.332
ArgArg: 2.036 ± 0.594
ArgSer: 2.407 ± 0.357
ArgThr: 2.036 ± 0.492
ArgVal: 2.314 ± 0.616
ArgTrp: 0.74 ± 0.303
ArgTyr: 2.221 ± 0.491
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.034 ± 2.802
SerCys: 0.37 ± 0.174
SerAsp: 4.813 ± 0.918
SerGlu: 3.795 ± 0.634
SerPhe: 3.702 ± 0.676
SerGly: 5.276 ± 0.547
SerHis: 0.74 ± 0.267
SerIle: 5.183 ± 0.96
SerLys: 5.183 ± 0.813
SerLeu: 5.183 ± 0.809
SerMet: 1.851 ± 0.29
SerAsn: 3.61 ± 0.603
SerPro: 1.666 ± 0.423
SerGln: 3.425 ± 0.866
SerArg: 1.666 ± 0.336
SerSer: 4.258 ± 0.949
SerThr: 4.906 ± 0.792
SerVal: 6.016 ± 0.851
SerTrp: 1.018 ± 0.27
SerTyr: 1.759 ± 0.488
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.553 ± 1.692
ThrCys: 0.185 ± 0.107
ThrAsp: 3.795 ± 0.649
ThrGlu: 3.147 ± 0.607
ThrPhe: 4.165 ± 0.647
ThrGly: 4.813 ± 0.75
ThrHis: 1.203 ± 0.373
ThrIle: 6.664 ± 1.064
ThrLys: 5.276 ± 0.812
ThrLeu: 5.368 ± 0.682
ThrMet: 1.388 ± 0.606
ThrAsn: 3.147 ± 0.61
ThrPro: 1.944 ± 0.471
ThrGln: 3.054 ± 0.666
ThrArg: 1.851 ± 0.309
ThrSer: 4.258 ± 0.856
ThrThr: 4.073 ± 0.578
ThrVal: 5.739 ± 0.632
ThrTrp: 0.555 ± 0.389
ThrTyr: 2.592 ± 0.833
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.813 ± 0.885
ValCys: 0.185 ± 0.107
ValAsp: 5.368 ± 0.753
ValGlu: 4.998 ± 0.9
ValPhe: 2.129 ± 0.403
ValGly: 4.35 ± 0.664
ValHis: 0.463 ± 0.172
ValIle: 4.72 ± 0.589
ValLys: 5.461 ± 0.707
ValLeu: 4.813 ± 0.553
ValMet: 1.018 ± 0.348
ValAsn: 4.258 ± 0.763
ValPro: 2.129 ± 0.364
ValGln: 2.129 ± 0.693
ValArg: 2.407 ± 0.633
ValSer: 5.924 ± 0.677
ValThr: 4.258 ± 0.77
ValVal: 4.35 ± 0.837
ValTrp: 1.018 ± 0.25
ValTyr: 1.666 ± 0.412
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.555 ± 0.166
TrpCys: 0.093 ± 0.098
TrpAsp: 0.74 ± 0.34
TrpGlu: 1.203 ± 0.326
TrpPhe: 0.463 ± 0.202
TrpGly: 0.833 ± 0.309
TrpHis: 0.093 ± 0.098
TrpIle: 0.648 ± 0.201
TrpLys: 0.74 ± 0.216
TrpLeu: 0.926 ± 0.288
TrpMet: 0.278 ± 0.134
TrpAsn: 0.926 ± 0.404
TrpPro: 0.185 ± 0.108
TrpGln: 0.648 ± 0.251
TrpArg: 0.648 ± 0.201
TrpSer: 1.203 ± 0.499
TrpThr: 1.481 ± 0.529
TrpVal: 0.833 ± 0.239
TrpTrp: 0.278 ± 0.196
TrpTyr: 0.278 ± 0.169
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.962 ± 0.443
TyrCys: 0.37 ± 0.156
TyrAsp: 2.592 ± 0.797
TyrGlu: 2.129 ± 0.552
TyrPhe: 1.111 ± 0.35
TyrGly: 2.221 ± 0.525
TyrHis: 0.463 ± 0.18
TyrIle: 2.499 ± 0.602
TyrLys: 2.036 ± 0.407
TyrLeu: 3.795 ± 0.596
TyrMet: 0.74 ± 0.256
TyrAsn: 2.221 ± 0.529
TyrPro: 1.111 ± 0.352
TyrGln: 1.851 ± 0.406
TyrArg: 3.054 ± 0.82
TyrSer: 2.314 ± 0.521
TyrThr: 3.147 ± 0.993
TyrVal: 2.314 ± 0.377
TyrTrp: 0.278 ± 0.165
TyrTyr: 1.851 ± 0.521
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 44 proteins (10805 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
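The page does not describe the procedure beyond the note above, so the following is only one plausible sketch of a protein-level bootstrap: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names, the fixed seed and the use of the standard deviation as the error measure are assumptions.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each round draws len(proteins) whole proteins with replacement.
    Using the standard deviation as the error is an assumption.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy = ["MKAAL", "AAKX", "MAAAG"]  # hypothetical one-letter sequences
    print(dipeptide_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins. With only 44 proteins in this proteome, that variation can be large for rare dipeptides.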

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski