Amino acid dipepetide frequency for Garba virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.185AlaAla: 2.185 ± 2.153
0.819AlaCys: 0.819 ± 0.379
3.004AlaAsp: 3.004 ± 0.706
2.185AlaGlu: 2.185 ± 0.614
1.638AlaPhe: 1.638 ± 0.57
1.365AlaGly: 1.365 ± 0.567
0.546AlaHis: 0.546 ± 0.332
1.912AlaIle: 1.912 ± 0.548
2.185AlaLys: 2.185 ± 0.991
4.369AlaLeu: 4.369 ± 2.047
0.273AlaMet: 0.273 ± 0.406
1.092AlaAsn: 1.092 ± 0.735
0.273AlaPro: 0.273 ± 0.406
1.638AlaGln: 1.638 ± 0.798
1.638AlaArg: 1.638 ± 0.683
2.731AlaSer: 2.731 ± 0.935
1.912AlaThr: 1.912 ± 0.82
1.912AlaVal: 1.912 ± 1.049
0.546AlaTrp: 0.546 ± 0.332
1.092AlaTyr: 1.092 ± 0.556
0.0AlaXaa: 0.0 ± 0.0
Cys
0.273CysAla: 0.273 ± 0.406
0.273CysCys: 0.273 ± 0.166
1.092CysAsp: 1.092 ± 0.478
0.546CysGlu: 0.546 ± 0.616
1.092CysPhe: 1.092 ± 0.664
1.912CysGly: 1.912 ± 0.855
0.273CysHis: 0.273 ± 0.673
1.365CysIle: 1.365 ± 0.526
2.185CysLys: 2.185 ± 0.437
2.185CysLeu: 2.185 ± 0.614
0.0CysMet: 0.0 ± 0.0
0.819CysAsn: 0.819 ± 0.498
0.819CysPro: 0.819 ± 0.614
1.912CysGln: 1.912 ± 1.229
0.546CysArg: 0.546 ± 0.332
1.638CysSer: 1.638 ± 0.675
0.819CysThr: 0.819 ± 0.762
1.638CysVal: 1.638 ± 0.608
0.546CysTrp: 0.546 ± 0.332
0.819CysTyr: 0.819 ± 0.614
0.0CysXaa: 0.0 ± 0.0
Asp
1.638AspAla: 1.638 ± 0.583
1.092AspCys: 1.092 ± 0.283
4.642AspAsp: 4.642 ± 0.962
3.004AspGlu: 3.004 ± 1.159
4.369AspPhe: 4.369 ± 0.961
2.731AspGly: 2.731 ± 2.219
1.365AspHis: 1.365 ± 0.267
4.369AspIle: 4.369 ± 0.749
4.369AspLys: 4.369 ± 1.103
6.827AspLeu: 6.827 ± 1.192
1.092AspMet: 1.092 ± 0.832
2.458AspAsn: 2.458 ± 0.927
3.823AspPro: 3.823 ± 1.368
3.55AspGln: 3.55 ± 0.551
1.365AspArg: 1.365 ± 0.693
5.188AspSer: 5.188 ± 2.985
3.55AspThr: 3.55 ± 1.436
2.185AspVal: 2.185 ± 1.09
1.365AspTrp: 1.365 ± 1.201
2.185AspTyr: 2.185 ± 0.975
0.0AspXaa: 0.0 ± 0.0
Glu
1.638GluAla: 1.638 ± 0.882
2.185GluCys: 2.185 ± 1.959
3.004GluAsp: 3.004 ± 0.52
3.55GluGlu: 3.55 ± 0.671
4.642GluPhe: 4.642 ± 1.006
4.096GluGly: 4.096 ± 1.062
1.638GluHis: 1.638 ± 0.694
7.1GluIle: 7.1 ± 1.148
5.188GluLys: 5.188 ± 1.407
4.369GluLeu: 4.369 ± 1.091
2.185GluMet: 2.185 ± 0.801
3.823GluAsn: 3.823 ± 1.208
1.638GluPro: 1.638 ± 1.371
0.546GluGln: 0.546 ± 0.332
2.185GluArg: 2.185 ± 0.768
3.823GluSer: 3.823 ± 0.744
4.096GluThr: 4.096 ± 2.447
4.915GluVal: 4.915 ± 1.34
0.546GluTrp: 0.546 ± 0.631
3.004GluTyr: 3.004 ± 1.175
0.0GluXaa: 0.0 ± 0.0
Phe
2.731PheAla: 2.731 ± 0.404
1.092PheCys: 1.092 ± 0.491
3.277PheAsp: 3.277 ± 0.338
3.823PheGlu: 3.823 ± 1.091
2.458PhePhe: 2.458 ± 0.787
2.185PheGly: 2.185 ± 1.205
1.365PheHis: 1.365 ± 0.405
3.004PheIle: 3.004 ± 0.486
3.55PheLys: 3.55 ± 0.918
5.735PheLeu: 5.735 ± 1.581
1.638PheMet: 1.638 ± 0.785
2.458PheAsn: 2.458 ± 0.729
2.731PhePro: 2.731 ± 0.647
0.546PheGln: 0.546 ± 0.278
2.458PheArg: 2.458 ± 0.826
2.731PheSer: 2.731 ± 0.876
2.458PheThr: 2.458 ± 0.642
4.915PheVal: 4.915 ± 1.126
0.546PheTrp: 0.546 ± 0.416
0.819PheTyr: 0.819 ± 0.738
0.0PheXaa: 0.0 ± 0.0
Gly
2.458GlyAla: 2.458 ± 1.055
0.273GlyCys: 0.273 ± 0.383
3.55GlyAsp: 3.55 ± 0.942
2.185GlyGlu: 2.185 ± 1.555
2.458GlyPhe: 2.458 ± 0.793
5.735GlyGly: 5.735 ± 1.318
1.092GlyHis: 1.092 ± 0.606
3.823GlyIle: 3.823 ± 1.697
4.642GlyLys: 4.642 ± 1.066
6.554GlyLeu: 6.554 ± 0.983
2.458GlyMet: 2.458 ± 1.043
2.731GlyAsn: 2.731 ± 0.64
1.638GlyPro: 1.638 ± 0.658
1.912GlyGln: 1.912 ± 0.406
1.365GlyArg: 1.365 ± 0.567
4.369GlySer: 4.369 ± 0.979
3.004GlyThr: 3.004 ± 0.482
4.642GlyVal: 4.642 ± 1.452
1.092GlyTrp: 1.092 ± 0.384
2.458GlyTyr: 2.458 ± 0.866
0.0GlyXaa: 0.0 ± 0.0
His
0.546HisAla: 0.546 ± 0.512
0.273HisCys: 0.273 ± 0.166
1.092HisAsp: 1.092 ± 0.556
1.092HisGlu: 1.092 ± 0.43
2.458HisPhe: 2.458 ± 0.943
0.273HisGly: 0.273 ± 0.673
0.819HisHis: 0.819 ± 0.692
1.092HisIle: 1.092 ± 0.556
0.819HisLys: 0.819 ± 0.291
2.458HisLeu: 2.458 ± 1.166
0.546HisMet: 0.546 ± 0.4
1.365HisAsn: 1.365 ± 0.515
1.365HisPro: 1.365 ± 0.812
0.819HisGln: 0.819 ± 0.498
2.458HisArg: 2.458 ± 0.677
1.365HisSer: 1.365 ± 0.7
0.819HisThr: 0.819 ± 0.291
1.638HisVal: 1.638 ± 0.44
0.273HisTrp: 0.273 ± 0.354
0.819HisTyr: 0.819 ± 0.525
0.0HisXaa: 0.0 ± 0.0
Ile
1.638IleAla: 1.638 ± 1.132
1.092IleCys: 1.092 ± 0.556
4.369IleAsp: 4.369 ± 0.871
5.188IleGlu: 5.188 ± 0.922
2.731IlePhe: 2.731 ± 0.378
2.731IleGly: 2.731 ± 0.573
2.185IleHis: 2.185 ± 0.828
6.281IleIle: 6.281 ± 1.235
7.919IleLys: 7.919 ± 1.537
4.096IleLeu: 4.096 ± 2.156
0.819IleMet: 0.819 ± 0.291
5.461IleAsn: 5.461 ± 1.555
3.277IlePro: 3.277 ± 1.702
3.55IleGln: 3.55 ± 1.383
5.188IleArg: 5.188 ± 1.326
6.281IleSer: 6.281 ± 1.584
3.823IleThr: 3.823 ± 0.882
3.277IleVal: 3.277 ± 0.884
0.819IleTrp: 0.819 ± 0.498
2.185IleTyr: 2.185 ± 0.685
0.0IleXaa: 0.0 ± 0.0
Lys
1.912LysAla: 1.912 ± 1.035
1.092LysCys: 1.092 ± 0.664
3.55LysAsp: 3.55 ± 0.825
4.915LysGlu: 4.915 ± 1.529
3.823LysPhe: 3.823 ± 0.959
3.55LysGly: 3.55 ± 0.492
0.819LysHis: 0.819 ± 0.614
6.827LysIle: 6.827 ± 0.769
3.823LysLys: 3.823 ± 1.209
7.646LysLeu: 7.646 ± 1.099
2.185LysMet: 2.185 ± 0.626
2.731LysAsn: 2.731 ± 0.633
3.823LysPro: 3.823 ± 1.009
2.731LysGln: 2.731 ± 1.319
3.55LysArg: 3.55 ± 1.107
5.735LysSer: 5.735 ± 0.921
4.642LysThr: 4.642 ± 1.925
4.096LysVal: 4.096 ± 0.869
2.185LysTrp: 2.185 ± 0.828
2.185LysTyr: 2.185 ± 0.778
0.0LysXaa: 0.0 ± 0.0
Leu
3.55LeuAla: 3.55 ± 0.919
2.731LeuCys: 2.731 ± 1.12
6.008LeuAsp: 6.008 ± 0.963
6.281LeuGlu: 6.281 ± 0.996
5.461LeuPhe: 5.461 ± 1.941
7.373LeuGly: 7.373 ± 0.877
0.819LeuHis: 0.819 ± 0.498
7.373LeuIle: 7.373 ± 2.899
6.554LeuLys: 6.554 ± 1.248
11.196LeuLeu: 11.196 ± 2.139
1.365LeuMet: 1.365 ± 0.83
6.008LeuAsn: 6.008 ± 1.804
2.185LeuPro: 2.185 ± 0.576
3.277LeuGln: 3.277 ± 0.579
6.554LeuArg: 6.554 ± 1.374
10.104LeuSer: 10.104 ± 2.2
4.915LeuThr: 4.915 ± 1.178
2.185LeuVal: 2.185 ± 1.042
0.0LeuTrp: 0.0 ± 0.0
4.096LeuTyr: 4.096 ± 1.359
0.0LeuXaa: 0.0 ± 0.0
Met
1.365MetAla: 1.365 ± 0.515
0.0MetCys: 0.0 ± 0.0
1.092MetAsp: 1.092 ± 0.406
1.092MetGlu: 1.092 ± 0.631
1.638MetPhe: 1.638 ± 0.658
1.365MetGly: 1.365 ± 0.562
0.0MetHis: 0.0 ± 0.0
2.458MetIle: 2.458 ± 0.971
1.912MetLys: 1.912 ± 0.706
1.638MetLeu: 1.638 ± 0.548
0.273MetMet: 0.273 ± 0.406
2.185MetAsn: 2.185 ± 0.76
0.819MetPro: 0.819 ± 0.704
0.273MetGln: 0.273 ± 0.166
1.638MetArg: 1.638 ± 0.698
2.185MetSer: 2.185 ± 0.927
1.912MetThr: 1.912 ± 0.976
0.819MetVal: 0.819 ± 0.658
0.273MetTrp: 0.273 ± 0.166
1.638MetTyr: 1.638 ± 0.317
0.0MetXaa: 0.0 ± 0.0
Asn
1.912AsnAla: 1.912 ± 0.721
2.185AsnCys: 2.185 ± 0.828
3.55AsnAsp: 3.55 ± 1.145
2.458AsnGlu: 2.458 ± 0.843
2.185AsnPhe: 2.185 ± 0.67
2.185AsnGly: 2.185 ± 1.513
4.369AsnHis: 4.369 ± 0.723
4.642AsnIle: 4.642 ± 1.703
3.55AsnLys: 3.55 ± 1.053
8.465AsnLeu: 8.465 ± 1.079
1.365AsnMet: 1.365 ± 0.83
3.277AsnAsn: 3.277 ± 0.859
2.185AsnPro: 2.185 ± 0.466
1.638AsnGln: 1.638 ± 0.704
1.912AsnArg: 1.912 ± 0.566
5.461AsnSer: 5.461 ± 0.82
2.458AsnThr: 2.458 ± 1.135
3.277AsnVal: 3.277 ± 1.602
1.092AsnTrp: 1.092 ± 0.602
3.004AsnTyr: 3.004 ± 0.91
0.0AsnXaa: 0.0 ± 0.0
Pro
0.819ProAla: 0.819 ± 1.003
1.092ProCys: 1.092 ± 0.556
2.185ProAsp: 2.185 ± 0.768
2.185ProGlu: 2.185 ± 0.533
0.546ProPhe: 0.546 ± 0.332
1.092ProGly: 1.092 ± 1.094
0.819ProHis: 0.819 ± 0.498
2.458ProIle: 2.458 ± 0.678
2.731ProLys: 2.731 ± 0.83
3.823ProLeu: 3.823 ± 1.623
0.0ProMet: 0.0 ± 0.573
1.912ProAsn: 1.912 ± 0.855
1.638ProPro: 1.638 ± 0.44
1.092ProGln: 1.092 ± 0.631
1.365ProArg: 1.365 ± 0.557
4.642ProSer: 4.642 ± 1.143
3.55ProThr: 3.55 ± 0.606
2.458ProVal: 2.458 ± 1.407
1.092ProTrp: 1.092 ± 0.406
1.638ProTyr: 1.638 ± 0.57
0.0ProXaa: 0.0 ± 0.0
Gln
0.546GlnAla: 0.546 ± 0.332
0.273GlnCys: 0.273 ± 0.166
1.638GlnAsp: 1.638 ± 0.858
3.277GlnGlu: 3.277 ± 2.485
1.912GlnPhe: 1.912 ± 0.542
2.458GlnGly: 2.458 ± 1.169
0.546GlnHis: 0.546 ± 0.278
0.819GlnIle: 0.819 ± 0.365
1.638GlnLys: 1.638 ± 0.608
2.731GlnLeu: 2.731 ± 0.947
0.819GlnMet: 0.819 ± 0.762
4.369GlnAsn: 4.369 ± 0.662
0.819GlnPro: 0.819 ± 0.291
1.092GlnGln: 1.092 ± 0.62
2.185GlnArg: 2.185 ± 0.336
1.365GlnSer: 1.365 ± 0.394
2.185GlnThr: 2.185 ± 0.783
2.458GlnVal: 2.458 ± 0.387
0.273GlnTrp: 0.273 ± 0.354
0.819GlnTyr: 0.819 ± 0.291
0.0GlnXaa: 0.0 ± 0.0
Arg
2.185ArgAla: 2.185 ± 0.879
0.546ArgCys: 0.546 ± 0.278
3.55ArgAsp: 3.55 ± 0.577
4.369ArgGlu: 4.369 ± 1.163
2.185ArgPhe: 2.185 ± 0.457
3.004ArgGly: 3.004 ± 1.289
0.819ArgHis: 0.819 ± 0.291
1.638ArgIle: 1.638 ± 0.838
3.004ArgLys: 3.004 ± 0.577
4.915ArgLeu: 4.915 ± 1.073
0.819ArgMet: 0.819 ± 0.467
3.004ArgAsn: 3.004 ± 0.793
1.638ArgPro: 1.638 ± 0.741
0.819ArgGln: 0.819 ± 0.498
0.273ArgArg: 0.273 ± 0.166
4.642ArgSer: 4.642 ± 1.417
2.458ArgThr: 2.458 ± 0.677
3.55ArgVal: 3.55 ± 0.785
1.365ArgTrp: 1.365 ± 0.675
1.365ArgTyr: 1.365 ± 0.487
0.0ArgXaa: 0.0 ± 0.0
Ser
3.55SerAla: 3.55 ± 0.766
2.185SerCys: 2.185 ± 0.975
4.369SerAsp: 4.369 ± 2.669
4.369SerGlu: 4.369 ± 0.766
2.458SerPhe: 2.458 ± 0.768
3.55SerGly: 3.55 ± 0.919
1.912SerHis: 1.912 ± 0.697
5.188SerIle: 5.188 ± 0.858
6.827SerLys: 6.827 ± 0.741
9.011SerLeu: 9.011 ± 1.237
2.185SerMet: 2.185 ± 0.614
4.642SerAsn: 4.642 ± 1.373
2.458SerPro: 2.458 ± 0.667
3.004SerGln: 3.004 ± 0.891
4.369SerArg: 4.369 ± 1.326
5.735SerSer: 5.735 ± 1.396
5.188SerThr: 5.188 ± 1.543
4.096SerVal: 4.096 ± 0.921
1.638SerTrp: 1.638 ± 0.794
1.638SerTyr: 1.638 ± 0.344
0.0SerXaa: 0.0 ± 0.0
Thr
1.638ThrAla: 1.638 ± 0.57
1.638ThrCys: 1.638 ± 0.704
5.461ThrAsp: 5.461 ± 1.34
4.915ThrGlu: 4.915 ± 1.562
1.365ThrPhe: 1.365 ± 0.885
4.096ThrGly: 4.096 ± 0.964
0.546ThrHis: 0.546 ± 0.332
4.642ThrIle: 4.642 ± 0.546
4.369ThrLys: 4.369 ± 1.242
4.915ThrLeu: 4.915 ± 1.216
1.912ThrMet: 1.912 ± 0.685
2.731ThrAsn: 2.731 ± 1.046
1.092ThrPro: 1.092 ± 0.283
0.819ThrGln: 0.819 ± 0.471
2.458ThrArg: 2.458 ± 0.893
4.915ThrSer: 4.915 ± 1.002
4.096ThrThr: 4.096 ± 1.404
2.731ThrVal: 2.731 ± 1.301
1.638ThrTrp: 1.638 ± 0.583
2.731ThrTyr: 2.731 ± 0.947
0.0ThrXaa: 0.0 ± 0.0
Val
1.092ValAla: 1.092 ± 1.176
1.092ValCys: 1.092 ± 0.384
3.55ValAsp: 3.55 ± 0.888
3.55ValGlu: 3.55 ± 2.18
3.277ValPhe: 3.277 ± 1.147
3.55ValGly: 3.55 ± 1.328
1.912ValHis: 1.912 ± 1.265
4.369ValIle: 4.369 ± 1.461
3.277ValLys: 3.277 ± 1.505
3.277ValLeu: 3.277 ± 0.557
1.638ValMet: 1.638 ± 0.841
4.642ValAsn: 4.642 ± 1.384
3.55ValPro: 3.55 ± 0.766
1.092ValGln: 1.092 ± 0.735
2.185ValArg: 2.185 ± 1.243
3.277ValSer: 3.277 ± 1.083
4.915ValThr: 4.915 ± 1.554
2.731ValVal: 2.731 ± 0.693
0.546ValTrp: 0.546 ± 0.348
2.458ValTyr: 2.458 ± 0.449
0.0ValXaa: 0.0 ± 0.0
Trp
0.546TrpAla: 0.546 ± 0.348
0.273TrpCys: 0.273 ± 0.354
0.546TrpAsp: 0.546 ± 0.326
1.912TrpGlu: 1.912 ± 0.816
1.365TrpPhe: 1.365 ± 0.515
1.365TrpGly: 1.365 ± 0.589
0.0TrpHis: 0.0 ± 0.0
1.365TrpIle: 1.365 ± 1.166
0.819TrpLys: 0.819 ± 0.498
0.546TrpLeu: 0.546 ± 0.332
1.092TrpMet: 1.092 ± 1.025
2.458TrpAsn: 2.458 ± 0.767
0.273TrpPro: 0.273 ± 0.166
0.0TrpGln: 0.0 ± 0.0
0.546TrpArg: 0.546 ± 0.512
0.546TrpSer: 0.546 ± 0.348
1.092TrpThr: 1.092 ± 0.664
1.092TrpVal: 1.092 ± 1.211
0.0TrpTrp: 0.0 ± 0.0
0.273TrpTyr: 0.273 ± 0.451
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.092TyrAla: 1.092 ± 0.384
0.546TyrCys: 0.546 ± 0.812
1.638TyrAsp: 1.638 ± 0.583
3.277TyrGlu: 3.277 ± 1.012
2.458TyrPhe: 2.458 ± 0.522
3.823TyrGly: 3.823 ± 0.922
0.546TyrHis: 0.546 ± 0.278
2.185TyrIle: 2.185 ± 0.614
2.458TyrLys: 2.458 ± 0.768
3.277TyrLeu: 3.277 ± 0.871
1.638TyrMet: 1.638 ± 1.043
3.004TyrAsn: 3.004 ± 1.412
1.365TyrPro: 1.365 ± 0.562
1.912TyrGln: 1.912 ± 0.905
1.912TyrArg: 1.912 ± 1.155
1.638TyrSer: 1.638 ± 0.706
0.819TyrThr: 0.819 ± 0.498
1.365TyrVal: 1.365 ± 0.661
0.273TyrTrp: 0.273 ± 0.166
0.819TyrTyr: 0.819 ± 0.444
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (3663 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski