Amino acid dipeptide frequency for Orungo virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions (for example, 6.927‰ = 0.006927).
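As an illustration of how such per mille values can be obtained, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein (never across protein boundaries) and that every non-standard letter is mapped to X (Xaa). The names `dipeptide_permille`, `classify` and `UNIFORM_PERMILLE` are hypothetical and are not taken from Proteome-pI; the exact procedure used by the database may differ.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"            # X stands in for Xaa (assumption)
UNIFORM_PERMILLE = 1000.0 / 400      # 0.25% = 2.5 per mille, the uniform expectation


def dipeptide_permille(proteins):
    """Return the frequency of all 441 dipeptides in per mille."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs: a protein of length n contributes n - 1 dipeptides.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(value_permille, expected=UNIFORM_PERMILLE):
    """Label a value relative to the uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    # Toy sequences, not real Orungo virus proteins.
    freqs = dipeptide_permille(["MKLLA", "MLLK"])
    for pair in ("LL", "MK", "AA"):
        print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

Under these assumptions, 11 proteins with 6209 amino acids in total yield 6209 - 11 = 6198 dipeptides, so a single occurrence corresponds to 1000 / 6198 ≈ 0.161‰, which appears consistent with the smallest non-zero values in the table.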

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 6.927 ± 1.648 | 0.322 ± 0.23 | 2.738 ± 0.574 | 3.705 ± 0.643 | 2.255 ± 0.497 | 3.383 ± 1.264 | 1.45 ± 0.469 | 4.349 ± 0.932 | 4.188 ± 1.106 | 9.182 ± 1.887 | 2.255 ± 0.697 | 2.899 ± 0.657 | 3.544 ± 0.789 | 2.416 ± 0.79 | 4.51 ± 0.734 | 3.705 ± 0.681 | 3.705 ± 0.882 | 4.994 ± 0.707 | 0.805 ± 0.367 | 2.899 ± 0.662 | 0.0 ± 0.0 |
| Cys | 0.805 ± 0.236 | 0.161 ± 0.138 | 0.161 ± 0.169 | 0.322 ± 0.179 | 0.161 ± 0.156 | 1.289 ± 0.379 | 0.161 ± 0.142 | 0.805 ± 0.371 | 0.322 ± 0.195 | 0.483 ± 0.264 | 0.161 ± 0.138 | 0.644 ± 0.42 | 0.644 ± 0.327 | 0.322 ± 0.195 | 0.322 ± 0.199 | 0.644 ± 0.326 | 0.644 ± 0.248 | 0.644 ± 0.41 | 0.0 ± 0.0 | 0.322 ± 0.201 | 0.0 ± 0.0 |
| Asp | 2.899 ± 0.567 | 0.805 ± 0.312 | 3.222 ± 0.555 | 4.832 ± 0.657 | 1.933 ± 0.872 | 4.832 ± 0.616 | 0.966 ± 0.422 | 3.222 ± 0.589 | 2.577 ± 0.935 | 5.316 ± 1.133 | 2.094 ± 0.388 | 0.966 ± 0.272 | 3.383 ± 0.761 | 2.416 ± 0.61 | 4.832 ± 0.702 | 3.866 ± 0.78 | 3.705 ± 0.831 | 5.638 ± 0.733 | 0.483 ± 0.324 | 2.577 ± 0.461 | 0.0 ± 0.0 |
| Glu | 3.705 ± 0.691 | 0.322 ± 0.185 | 4.51 ± 0.855 | 4.994 ± 0.916 | 3.544 ± 0.771 | 3.222 ± 0.958 | 1.289 ± 0.718 | 6.765 ± 0.822 | 3.866 ± 0.983 | 5.155 ± 0.741 | 3.222 ± 0.848 | 2.255 ± 0.524 | 2.577 ± 0.759 | 1.772 ± 0.69 | 4.51 ± 0.958 | 3.705 ± 0.742 | 2.738 ± 0.737 | 5.155 ± 1.0 | 1.772 ± 0.591 | 1.772 ± 0.441 | 0.0 ± 0.0 |
| Phe | 2.738 ± 0.583 | 0.483 ± 0.372 | 3.705 ± 0.827 | 2.255 ± 0.635 | 2.416 ± 0.629 | 3.866 ± 0.904 | 1.289 ± 0.287 | 2.738 ± 0.591 | 1.611 ± 0.708 | 3.222 ± 0.817 | 1.128 ± 0.417 | 1.933 ± 0.452 | 0.644 ± 0.188 | 1.128 ± 0.474 | 3.061 ± 0.566 | 2.416 ± 1.038 | 3.705 ± 1.12 | 2.255 ± 0.331 | 0.483 ± 0.241 | 1.45 ± 0.322 | 0.0 ± 0.0 |
| Gly | 3.061 ± 0.939 | 0.483 ± 0.303 | 5.799 ± 0.971 | 4.027 ± 0.679 | 2.738 ± 0.584 | 5.155 ± 1.448 | 1.128 ± 0.483 | 2.255 ± 0.635 | 3.222 ± 0.576 | 4.994 ± 1.251 | 2.577 ± 0.54 | 1.45 ± 0.424 | 2.416 ± 0.731 | 1.933 ± 0.825 | 6.121 ± 0.816 | 3.222 ± 0.55 | 2.899 ± 0.85 | 6.443 ± 1.236 | 0.644 ± 0.355 | 2.094 ± 0.731 | 0.0 ± 0.0 |
| His | 2.255 ± 0.8 | 0.322 ± 0.199 | 0.805 ± 0.236 | 1.289 ± 0.492 | 1.128 ± 0.455 | 1.772 ± 0.622 | 0.483 ± 0.266 | 0.966 ± 0.577 | 0.161 ± 0.215 | 2.899 ± 0.702 | 0.644 ± 0.291 | 0.805 ± 0.406 | 1.128 ± 0.435 | 1.128 ± 0.464 | 1.772 ± 0.677 | 0.805 ± 0.329 | 0.644 ± 0.317 | 1.772 ± 0.442 | 0.161 ± 0.215 | 0.805 ± 0.369 | 0.0 ± 0.0 |
| Ile | 4.832 ± 0.811 | 0.966 ± 0.351 | 4.832 ± 1.057 | 3.866 ± 1.199 | 2.738 ± 0.873 | 4.188 ± 0.436 | 2.094 ± 0.819 | 2.094 ± 0.529 | 4.51 ± 1.274 | 5.155 ± 0.747 | 1.289 ± 0.373 | 3.222 ± 0.783 | 2.416 ± 0.597 | 3.383 ± 0.832 | 4.51 ± 0.836 | 3.705 ± 0.849 | 4.349 ± 0.707 | 3.061 ± 0.533 | 0.805 ± 0.391 | 3.061 ± 0.565 | 0.0 ± 0.0 |
| Lys | 4.027 ± 1.005 | 0.483 ± 0.213 | 1.772 ± 0.324 | 3.866 ± 1.561 | 1.611 ± 0.539 | 2.899 ± 0.419 | 1.128 ± 0.443 | 5.155 ± 1.189 | 2.899 ± 0.308 | 4.994 ± 0.96 | 1.933 ± 0.535 | 1.772 ± 0.47 | 1.772 ± 0.342 | 1.772 ± 0.767 | 5.638 ± 0.657 | 3.061 ± 0.519 | 2.094 ± 0.582 | 2.416 ± 0.772 | 0.644 ± 0.221 | 1.45 ± 0.407 | 0.0 ± 0.0 |
| Leu | 6.604 ± 1.321 | 0.805 ± 0.313 | 5.799 ± 1.048 | 4.994 ± 0.651 | 2.738 ± 0.548 | 5.316 ± 0.637 | 2.416 ± 0.741 | 6.443 ± 0.904 | 4.994 ± 0.731 | 6.282 ± 0.838 | 3.705 ± 0.921 | 4.51 ± 0.755 | 4.349 ± 0.753 | 3.705 ± 1.151 | 8.698 ± 1.51 | 6.282 ± 0.934 | 4.188 ± 0.505 | 4.832 ± 0.685 | 1.45 ± 0.692 | 2.094 ± 0.553 | 0.0 ± 0.0 |
| Met | 2.738 ± 0.544 | 0.322 ± 0.182 | 1.45 ± 0.583 | 3.222 ± 0.444 | 1.933 ± 0.61 | 1.45 ± 0.559 | 1.128 ± 0.492 | 2.094 ± 0.623 | 1.45 ± 0.545 | 4.027 ± 0.653 | 1.611 ± 0.371 | 1.289 ± 0.477 | 0.966 ± 0.332 | 1.933 ± 0.539 | 2.738 ± 1.034 | 2.094 ± 0.654 | 2.416 ± 0.524 | 2.255 ± 0.521 | 0.322 ± 0.199 | 1.128 ± 0.357 | 0.0 ± 0.0 |
| Asn | 3.222 ± 0.986 | 0.161 ± 0.169 | 1.128 ± 0.284 | 2.416 ± 0.863 | 1.772 ± 0.495 | 2.255 ± 0.312 | 0.805 ± 0.256 | 3.383 ± 0.581 | 0.644 ± 0.32 | 4.027 ± 1.074 | 1.611 ± 0.506 | 0.966 ± 0.376 | 2.255 ± 0.455 | 0.644 ± 0.28 | 2.255 ± 0.507 | 3.222 ± 1.011 | 2.255 ± 0.638 | 3.061 ± 0.849 | 0.161 ± 0.169 | 1.289 ± 0.386 | 0.0 ± 0.0 |
| Pro | 3.061 ± 0.576 | 0.0 ± 0.0 | 2.577 ± 0.643 | 2.738 ± 0.467 | 1.611 ± 0.581 | 1.772 ± 0.708 | 0.483 ± 0.192 | 2.899 ± 0.468 | 1.772 ± 0.559 | 2.738 ± 0.478 | 0.966 ± 0.407 | 1.45 ± 0.438 | 1.289 ± 0.431 | 2.416 ± 0.565 | 2.094 ± 0.724 | 2.416 ± 0.595 | 3.383 ± 0.875 | 3.383 ± 0.76 | 0.805 ± 0.456 | 2.094 ± 0.543 | 0.0 ± 0.0 |
| Gln | 1.933 ± 0.533 | 0.161 ± 0.169 | 1.933 ± 0.471 | 2.094 ± 0.594 | 1.772 ± 0.372 | 2.416 ± 0.666 | 1.289 ± 0.598 | 2.577 ± 0.494 | 1.128 ± 0.404 | 4.188 ± 0.721 | 1.128 ± 0.454 | 1.45 ± 0.34 | 1.289 ± 0.5 | 0.483 ± 0.384 | 2.577 ± 0.754 | 2.255 ± 0.412 | 2.899 ± 0.527 | 3.383 ± 0.398 | 0.644 ± 0.3 | 0.805 ± 0.272 | 0.0 ± 0.0 |
| Arg | 5.316 ± 1.019 | 0.805 ± 0.333 | 5.799 ± 0.724 | 5.638 ± 1.111 | 4.832 ± 0.952 | 4.994 ± 0.396 | 0.483 ± 0.233 | 5.155 ± 0.941 | 4.027 ± 0.746 | 5.96 ± 1.11 | 3.705 ± 0.746 | 2.255 ± 0.607 | 1.933 ± 0.614 | 2.899 ± 0.649 | 5.477 ± 0.792 | 4.188 ± 0.414 | 3.866 ± 0.698 | 4.832 ± 0.605 | 1.128 ± 0.464 | 2.416 ± 0.631 | 0.0 ± 0.0 |
| Ser | 4.349 ± 0.587 | 0.0 ± 0.0 | 4.832 ± 0.583 | 4.188 ± 1.115 | 2.738 ± 0.946 | 3.544 ± 0.531 | 1.128 ± 0.406 | 3.705 ± 0.988 | 3.544 ± 0.727 | 4.994 ± 0.985 | 1.772 ± 0.219 | 1.933 ± 0.412 | 2.416 ± 0.442 | 2.416 ± 0.564 | 3.222 ± 0.672 | 4.027 ± 1.164 | 3.222 ± 0.945 | 4.671 ± 1.252 | 0.805 ± 0.282 | 2.416 ± 0.79 | 0.0 ± 0.0 |
| Thr | 2.899 ± 0.624 | 0.483 ± 0.469 | 2.577 ± 0.696 | 4.188 ± 0.669 | 2.094 ± 0.419 | 4.188 ± 0.439 | 1.45 ± 0.572 | 4.188 ± 0.957 | 2.416 ± 0.644 | 5.477 ± 0.936 | 2.416 ± 0.484 | 2.255 ± 0.795 | 2.738 ± 0.577 | 2.255 ± 0.787 | 3.383 ± 0.598 | 3.383 ± 0.48 | 3.383 ± 0.629 | 4.027 ± 0.402 | 0.644 ± 0.322 | 1.45 ± 0.644 | 0.0 ± 0.0 |
| Val | 5.638 ± 0.702 | 1.45 ± 0.431 | 2.899 ± 0.591 | 4.349 ± 0.617 | 1.772 ± 0.429 | 3.705 ± 0.408 | 1.128 ± 0.628 | 3.705 ± 0.787 | 4.994 ± 0.783 | 6.765 ± 0.929 | 2.255 ± 0.579 | 3.544 ± 0.621 | 3.544 ± 1.027 | 2.738 ± 0.609 | 6.282 ± 0.908 | 4.51 ± 0.825 | 3.061 ± 0.987 | 5.155 ± 1.098 | 0.805 ± 0.269 | 2.738 ± 0.592 | 0.0 ± 0.0 |
| Trp | 0.805 ± 0.338 | 0.161 ± 0.142 | 0.966 ± 0.394 | 1.128 ± 0.422 | 0.644 ± 0.212 | 0.644 ± 0.221 | 0.805 ± 0.445 | 0.966 ± 0.49 | 0.805 ± 0.355 | 1.45 ± 0.497 | 0.322 ± 0.182 | 0.805 ± 0.402 | 0.0 ± 0.0 | 0.161 ± 0.145 | 0.805 ± 0.445 | 0.322 ± 0.29 | 0.805 ± 0.346 | 0.966 ± 0.318 | 0.322 ± 0.236 | 0.161 ± 0.162 | 0.0 ± 0.0 |
| Tyr | 2.416 ± 0.335 | 0.322 ± 0.212 | 3.061 ± 0.568 | 2.738 ± 0.457 | 2.094 ± 0.559 | 1.772 ± 0.585 | 0.644 ± 0.29 | 1.611 ± 0.44 | 2.255 ± 0.595 | 2.899 ± 0.933 | 1.611 ± 0.436 | 1.289 ± 0.527 | 0.644 ± 0.205 | 0.483 ± 0.192 | 2.899 ± 0.752 | 2.094 ± 0.66 | 1.933 ± 0.588 | 2.094 ± 0.707 | 0.161 ± 0.156 | 0.644 ± 0.287 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 11 proteins (6209 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
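The note above can be illustrated with a small Python sketch of a protein-level bootstrap. It assumes that whole proteins are resampled with replacement, that the dipeptide frequency is recomputed for each replicate, and that the reported error is the sample standard deviation across replicates. These are assumptions, since the page does not spell out the procedure, and the function names `permille` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(permille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not real Orungo virus proteins.
    toy = ["MKLLA", "MLLK", "AALLK", "KKMLA"]
    print(round(permille(toy, "LL"), 3), "±", round(bootstrap_error(toy, "LL"), 3))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the proteome.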

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski