Amino acid dipeptide frequency for Phlebovirus GGP-2011a

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we treat Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
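The calculation described above can be illustrated with a minimal sketch. It assumes that dipeptides are counted as overlapping adjacent pairs within each protein and that any residue outside the 20 standard ones is mapped to Xaa (written `X`); the exact counting rules used by the database are not stated here, and the function names and toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard/unknown)
UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2  # 1/400 expressed in per mille

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping adjacent pairs are counted inside each protein,
    and the counts are pooled over all proteins before normalising.
    """
    counts = Counter()
    for seq in proteins:
        # Replace anything that is not a standard residue with X
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

def representation(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"

# Toy sequences for illustration only, not data from this proteome
freqs = dipeptide_permille(["MKLLSS", "ALSSB"])
print(freqs["SS"], representation(freqs["SS"]))
print(freqs["SS"] * 1e-3)  # per mille converted to a plain fraction
```

With such short toy input almost every observed dipeptide lands far above 2.5‰; on a real proteome the same comparison separates values such as SerSer (10.109‰, over) from AlaGln (0.253‰, under).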

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰) ± error, grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.549 ± 3.418
AlaCys: 2.022 ± 0.447
AlaAsp: 2.274 ± 0.576
AlaGlu: 3.791 ± 1.267
AlaPhe: 1.516 ± 2.163
AlaGly: 2.274 ± 1.274
AlaHis: 1.769 ± 0.797
AlaIle: 4.802 ± 1.358
AlaLys: 1.769 ± 0.677
AlaLeu: 4.296 ± 0.824
AlaMet: 2.527 ± 1.046
AlaAsn: 2.78 ± 1.015
AlaPro: 2.022 ± 0.771
AlaGln: 0.253 ± 0.164
AlaArg: 2.274 ± 0.986
AlaSer: 5.307 ± 1.624
AlaThr: 3.285 ± 0.881
AlaVal: 4.549 ± 1.813
AlaTrp: 0.505 ± 0.161
AlaTyr: 2.527 ± 1.112
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.264 ± 0.349
CysCys: 0.253 ± 0.164
CysAsp: 0.253 ± 0.164
CysGlu: 1.011 ± 0.323
CysPhe: 1.264 ± 0.793
CysGly: 1.516 ± 0.51
CysHis: 1.516 ± 0.484
CysIle: 0.505 ± 0.329
CysLys: 2.78 ± 0.997
CysLeu: 3.285 ± 1.8
CysMet: 0.758 ± 0.22
CysAsn: 1.264 ± 0.847
CysPro: 1.011 ± 0.356
CysGln: 0.758 ± 0.821
CysArg: 1.769 ± 1.064
CysSer: 2.78 ± 1.273
CysThr: 0.758 ± 0.22
CysVal: 1.516 ± 0.657
CysTrp: 0.505 ± 0.481
CysTyr: 1.264 ± 0.525
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.033 ± 1.964
AspCys: 1.516 ± 0.843
AspAsp: 4.802 ± 1.336
AspGlu: 4.549 ± 1.118
AspPhe: 2.022 ± 0.56
AspGly: 2.274 ± 0.628
AspHis: 1.011 ± 0.353
AspIle: 4.296 ± 0.956
AspLys: 3.033 ± 0.943
AspLeu: 4.549 ± 2.37
AspMet: 1.516 ± 0.829
AspAsn: 3.538 ± 0.519
AspPro: 2.78 ± 0.656
AspGln: 1.264 ± 0.586
AspArg: 2.78 ± 1.402
AspSer: 5.56 ± 1.179
AspThr: 2.527 ± 0.82
AspVal: 2.274 ± 0.552
AspTrp: 0.505 ± 0.426
AspTyr: 1.516 ± 0.691
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.307 ± 0.413
GluCys: 2.527 ± 0.439
GluAsp: 4.802 ± 1.06
GluGlu: 6.065 ± 1.484
GluPhe: 4.296 ± 1.368
GluGly: 4.802 ± 1.3
GluHis: 1.264 ± 0.41
GluIle: 5.812 ± 1.41
GluLys: 4.296 ± 0.125
GluLeu: 6.065 ± 1.484
GluMet: 1.264 ± 0.525
GluAsn: 1.264 ± 0.525
GluPro: 1.264 ± 0.888
GluGln: 2.022 ± 0.898
GluArg: 4.296 ± 1.025
GluSer: 3.538 ± 0.757
GluThr: 2.78 ± 1.592
GluVal: 4.802 ± 0.852
GluTrp: 1.011 ± 0.694
GluTyr: 2.022 ± 0.725
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.78 ± 1.878
PheCys: 1.011 ± 0.937
PheAsp: 2.274 ± 1.122
PheGlu: 2.274 ± 0.855
PhePhe: 2.78 ± 0.656
PheGly: 1.011 ± 0.353
PheHis: 0.758 ± 0.22
PheIle: 2.022 ± 0.501
PheLys: 3.538 ± 0.642
PheLeu: 4.802 ± 1.22
PheMet: 2.274 ± 0.792
PheAsn: 2.022 ± 1.19
PhePro: 2.527 ± 1.511
PheGln: 1.264 ± 0.821
PheArg: 2.527 ± 0.43
PheSer: 3.285 ± 0.72
PheThr: 2.022 ± 0.504
PheVal: 3.285 ± 0.968
PheTrp: 1.011 ± 0.353
PheTyr: 0.505 ± 0.426
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.043 ± 0.62
GlyCys: 1.011 ± 0.609
GlyAsp: 2.274 ± 0.838
GlyGlu: 2.527 ± 1.392
GlyPhe: 4.549 ± 0.988
GlyGly: 4.549 ± 1.069
GlyHis: 1.769 ± 0.497
GlyIle: 3.538 ± 0.599
GlyLys: 3.033 ± 0.87
GlyLeu: 4.549 ± 0.84
GlyMet: 2.022 ± 0.66
GlyAsn: 2.022 ± 2.048
GlyPro: 2.022 ± 0.735
GlyGln: 2.022 ± 1.218
GlyArg: 2.78 ± 0.806
GlySer: 6.318 ± 1.799
GlyThr: 2.274 ± 0.672
GlyVal: 3.285 ± 1.207
GlyTrp: 0.505 ± 0.764
GlyTyr: 1.516 ± 0.75
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.253 ± 0.435
HisCys: 0.505 ± 0.161
HisAsp: 1.516 ± 0.439
HisGlu: 1.516 ± 0.512
HisPhe: 1.264 ± 0.589
HisGly: 2.022 ± 0.706
HisHis: 0.505 ± 0.329
HisIle: 1.516 ± 0.662
HisLys: 1.264 ± 0.349
HisLeu: 2.274 ± 0.454
HisMet: 0.505 ± 0.426
HisAsn: 1.769 ± 0.712
HisPro: 0.758 ± 0.718
HisGln: 0.758 ± 0.22
HisArg: 2.274 ± 0.454
HisSer: 2.78 ± 0.551
HisThr: 1.011 ± 0.609
HisVal: 2.274 ± 0.576
HisTrp: 0.505 ± 0.329
HisTyr: 0.758 ± 0.22
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.802 ± 1.502
IleCys: 1.011 ± 0.323
IleAsp: 3.791 ± 0.798
IleGlu: 4.043 ± 0.895
IlePhe: 2.274 ± 1.278
IleGly: 3.285 ± 0.661
IleHis: 2.274 ± 0.628
IleIle: 4.549 ± 0.655
IleLys: 4.802 ± 1.336
IleLeu: 5.054 ± 1.306
IleMet: 1.769 ± 0.599
IleAsn: 3.033 ± 1.829
IlePro: 4.043 ± 0.895
IleGln: 1.516 ± 0.697
IleArg: 5.56 ± 1.172
IleSer: 5.812 ± 1.113
IleThr: 3.538 ± 1.207
IleVal: 3.033 ± 1.152
IleTrp: 0.505 ± 0.329
IleTyr: 2.527 ± 0.698
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.285 ± 0.211
LysCys: 1.516 ± 1.443
LysAsp: 3.538 ± 0.172
LysGlu: 3.033 ± 0.621
LysPhe: 1.769 ± 0.641
LysGly: 3.285 ± 0.984
LysHis: 1.516 ± 0.662
LysIle: 5.56 ± 1.291
LysLys: 4.802 ± 0.964
LysLeu: 4.549 ± 0.922
LysMet: 3.285 ± 0.819
LysAsn: 2.527 ± 0.54
LysPro: 3.033 ± 0.493
LysGln: 1.011 ± 0.657
LysArg: 2.78 ± 1.36
LysSer: 5.054 ± 0.498
LysThr: 3.538 ± 0.799
LysVal: 4.043 ± 0.874
LysTrp: 0.758 ± 0.493
LysTyr: 2.527 ± 1.009
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.791 ± 1.739
LeuCys: 2.022 ± 1.314
LeuAsp: 4.043 ± 0.214
LeuGlu: 5.307 ± 1.428
LeuPhe: 4.043 ± 0.931
LeuGly: 5.812 ± 0.953
LeuHis: 2.022 ± 1.124
LeuIle: 4.802 ± 1.574
LeuLys: 5.56 ± 1.475
LeuLeu: 7.076 ± 1.822
LeuMet: 2.527 ± 1.054
LeuAsn: 3.033 ± 1.479
LeuPro: 3.791 ± 1.276
LeuGln: 3.791 ± 1.13
LeuArg: 5.812 ± 1.19
LeuSer: 9.098 ± 1.611
LeuThr: 4.296 ± 0.59
LeuVal: 5.56 ± 1.651
LeuTrp: 0.758 ± 0.721
LeuTyr: 1.769 ± 0.641
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.516 ± 0.439
MetCys: 0.253 ± 0.24
MetAsp: 1.264 ± 0.505
MetGlu: 1.769 ± 1.141
MetPhe: 0.758 ± 0.22
MetGly: 1.769 ± 0.823
MetHis: 0.758 ± 0.886
MetIle: 2.274 ± 0.652
MetLys: 1.264 ± 0.821
MetLeu: 2.527 ± 0.885
MetMet: 2.78 ± 0.649
MetAsn: 1.011 ± 0.571
MetPro: 1.264 ± 0.41
MetGln: 1.516 ± 0.484
MetArg: 2.78 ± 1.426
MetSer: 3.791 ± 0.925
MetThr: 3.033 ± 1.205
MetVal: 1.264 ± 0.589
MetTrp: 0.0 ± 0.0
MetTyr: 0.758 ± 0.22
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.011 ± 0.937
AsnCys: 1.011 ± 0.353
AsnAsp: 1.264 ± 0.525
AsnGlu: 3.285 ± 1.083
AsnPhe: 2.78 ± 0.508
AsnGly: 1.769 ± 0.722
AsnHis: 1.011 ± 0.353
AsnIle: 2.022 ± 0.646
AsnLys: 2.527 ± 0.391
AsnLeu: 6.065 ± 0.723
AsnMet: 1.011 ± 0.318
AsnAsn: 1.264 ± 0.807
AsnPro: 2.527 ± 1.173
AsnGln: 1.516 ± 0.484
AsnArg: 2.022 ± 1.187
AsnSer: 4.043 ± 1.686
AsnThr: 1.769 ± 0.983
AsnVal: 2.274 ± 0.552
AsnTrp: 0.505 ± 0.732
AsnTyr: 1.769 ± 0.996
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.022 ± 0.418
ProCys: 0.758 ± 0.671
ProAsp: 2.274 ± 0.576
ProGlu: 5.307 ± 1.903
ProPhe: 1.516 ± 0.905
ProGly: 3.033 ± 1.335
ProHis: 1.264 ± 0.378
ProIle: 1.516 ± 0.768
ProLys: 2.527 ± 0.807
ProLeu: 3.285 ± 0.673
ProMet: 1.264 ± 0.589
ProAsn: 2.022 ± 0.418
ProPro: 0.758 ± 0.375
ProGln: 1.011 ± 0.323
ProArg: 1.769 ± 1.126
ProSer: 3.538 ± 1.094
ProThr: 2.527 ± 0.439
ProVal: 3.791 ± 1.952
ProTrp: 1.011 ± 0.571
ProTyr: 1.769 ± 0.564
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.011 ± 0.937
GlnCys: 1.516 ± 0.353
GlnAsp: 1.516 ± 0.439
GlnGlu: 2.527 ± 0.43
GlnPhe: 1.011 ± 0.353
GlnGly: 2.274 ± 1.316
GlnHis: 1.516 ± 0.986
GlnIle: 2.527 ± 0.698
GlnLys: 2.022 ± 0.898
GlnLeu: 1.516 ± 0.484
GlnMet: 0.758 ± 0.375
GlnAsn: 1.011 ± 0.609
GlnPro: 2.527 ± 1.112
GlnGln: 0.253 ± 0.164
GlnArg: 1.011 ± 0.61
GlnSer: 1.769 ± 1.041
GlnThr: 1.516 ± 0.439
GlnVal: 1.264 ± 0.693
GlnTrp: 0.253 ± 0.24
GlnTyr: 0.505 ± 0.161
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.285 ± 1.568
ArgCys: 1.264 ± 0.525
ArgAsp: 4.549 ± 0.908
ArgGlu: 6.065 ± 1.231
ArgPhe: 2.022 ± 0.784
ArgGly: 4.043 ± 0.875
ArgHis: 1.264 ± 0.817
ArgIle: 4.296 ± 1.51
ArgLys: 2.527 ± 0.43
ArgLeu: 3.538 ± 1.063
ArgMet: 2.527 ± 0.885
ArgAsn: 2.274 ± 0.694
ArgPro: 2.527 ± 1.05
ArgGln: 2.022 ± 0.589
ArgArg: 2.022 ± 1.292
ArgSer: 5.56 ± 2.38
ArgThr: 1.516 ± 0.512
ArgVal: 3.791 ± 1.228
ArgTrp: 1.011 ± 0.571
ArgTyr: 1.516 ± 0.51
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.56 ± 1.334
SerCys: 4.043 ± 2.11
SerAsp: 5.56 ± 2.534
SerGlu: 5.812 ± 0.681
SerPhe: 4.296 ± 0.421
SerGly: 4.549 ± 1.314
SerHis: 2.022 ± 0.646
SerIle: 5.56 ± 0.945
SerLys: 7.076 ± 1.526
SerLeu: 7.834 ± 0.913
SerMet: 1.011 ± 0.571
SerAsn: 4.802 ± 1.251
SerPro: 4.043 ± 0.895
SerGln: 2.78 ± 0.915
SerArg: 4.043 ± 0.931
SerSer: 10.109 ± 2.516
SerThr: 4.549 ± 1.359
SerVal: 5.56 ± 1.511
SerTrp: 2.022 ± 0.646
SerTyr: 2.022 ± 0.646
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.033 ± 1.581
ThrCys: 1.264 ± 0.586
ThrAsp: 3.033 ± 1.073
ThrGlu: 2.78 ± 0.656
ThrPhe: 1.264 ± 0.847
ThrGly: 4.043 ± 0.947
ThrHis: 1.264 ± 0.349
ThrIle: 4.043 ± 0.62
ThrLys: 3.033 ± 1.019
ThrLeu: 4.802 ± 1.026
ThrMet: 1.769 ± 0.479
ThrAsn: 1.264 ± 0.505
ThrPro: 1.264 ± 0.525
ThrGln: 1.516 ± 0.353
ThrArg: 3.033 ± 0.77
ThrSer: 5.054 ± 1.431
ThrThr: 3.033 ± 0.875
ThrVal: 3.538 ± 1.138
ThrTrp: 0.758 ± 0.858
ThrTyr: 1.516 ± 0.768
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.033 ± 0.657
ValCys: 1.516 ± 1.443
ValAsp: 3.033 ± 1.058
ValGlu: 4.802 ± 0.82
ValPhe: 2.78 ± 0.551
ValGly: 2.022 ± 0.589
ValHis: 2.274 ± 0.531
ValIle: 5.56 ± 2.173
ValLys: 2.527 ± 0.807
ValLeu: 3.791 ± 2.13
ValMet: 1.516 ± 0.353
ValAsn: 2.78 ± 1.369
ValPro: 2.022 ± 0.94
ValGln: 2.527 ± 1.009
ValArg: 5.307 ± 0.658
ValSer: 7.076 ± 0.974
ValThr: 4.296 ± 0.338
ValVal: 4.549 ± 1.602
ValTrp: 0.758 ± 0.476
ValTyr: 1.769 ± 0.497
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.253 ± 0.164
TrpCys: 0.0 ± 0.0
TrpAsp: 1.264 ± 0.765
TrpGlu: 0.758 ± 0.22
TrpPhe: 0.758 ± 0.22
TrpGly: 0.758 ± 0.374
TrpHis: 0.0 ± 0.0
TrpIle: 0.505 ± 0.161
TrpLys: 0.758 ± 0.718
TrpLeu: 1.264 ± 0.41
TrpMet: 0.505 ± 0.161
TrpAsn: 1.011 ± 0.463
TrpPro: 0.758 ± 0.374
TrpGln: 0.0 ± 0.0
TrpArg: 0.505 ± 0.161
TrpSer: 0.505 ± 0.161
TrpThr: 1.769 ± 0.537
TrpVal: 1.769 ± 0.599
TrpTrp: 0.253 ± 0.164
TrpTyr: 0.253 ± 0.164
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.516 ± 0.662
TyrCys: 1.011 ± 0.609
TyrAsp: 2.022 ± 0.927
TyrGlu: 2.274 ± 1.172
TyrPhe: 1.011 ± 0.353
TyrGly: 1.516 ± 0.662
TyrHis: 0.253 ± 0.164
TyrIle: 1.516 ± 0.512
TyrLys: 2.274 ± 0.652
TyrLeu: 3.791 ± 0.516
TyrMet: 0.505 ± 0.329
TyrAsn: 0.758 ± 0.493
TyrPro: 2.022 ± 0.589
TyrGln: 0.505 ± 0.764
TyrArg: 2.274 ± 1.224
TyrSer: 2.274 ± 0.838
TyrThr: 1.264 ± 0.378
TyrVal: 1.516 ± 0.439
TyrTrp: 0.505 ± 0.455
TyrTyr: 0.253 ± 0.164
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3958 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
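As a rough illustration of what protein-level bootstrapping can look like, the sketch below resamples whole proteins with replacement 100 times, recomputes the dipeptide frequency on each resample, and reports the spread. Treating the reported error as the sample standard deviation of the replicates is an assumption, as are the function names and toy sequences; the database's actual procedure is not described on this page.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins):
    """Frequencies (per mille) of overlapping adjacent pairs, pooled over proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of one dipeptide's frequency under protein-level resampling.

    Assumption: the error is the sample standard deviation across replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(dipeptide, 0.0))
    return statistics.stdev(estimates)

# Toy sequences for illustration only, not data from this proteome
toy_proteome = ["MKLLSSAV", "SSLLKM", "AVAVSSK", "MLSK"]
print(bootstrap_error(toy_proteome, "SS"))
```

A dipeptide that never occurs in any protein gets an error of exactly zero under this scheme, which is consistent with the 0.0 ± 0.0 entries in the table.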

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski