Amino acid dipepetide frequency for Rift valley fever virus (strain ZH-548 M12) (RVFV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.367AlaAla: 7.367 ± 5.191
1.719AlaCys: 1.719 ± 0.747
2.947AlaAsp: 2.947 ± 0.859
3.929AlaGlu: 3.929 ± 1.507
3.684AlaPhe: 3.684 ± 0.389
3.438AlaGly: 3.438 ± 1.37
3.193AlaHis: 3.193 ± 1.729
4.666AlaIle: 4.666 ± 0.829
2.701AlaLys: 2.701 ± 1.41
8.104AlaLeu: 8.104 ± 3.1
2.456AlaMet: 2.456 ± 1.764
1.965AlaAsn: 1.965 ± 0.754
3.438AlaPro: 3.438 ± 0.699
1.228AlaGln: 1.228 ± 1.125
3.193AlaArg: 3.193 ± 1.026
4.666AlaSer: 4.666 ± 1.321
3.193AlaThr: 3.193 ± 1.123
3.684AlaVal: 3.684 ± 2.086
0.737AlaTrp: 0.737 ± 0.442
1.965AlaTyr: 1.965 ± 1.668
0.0AlaXaa: 0.0 ± 0.0
Cys
0.737CysAla: 0.737 ± 0.387
0.737CysCys: 0.737 ± 0.506
1.228CysAsp: 1.228 ± 0.829
2.947CysGlu: 2.947 ± 0.701
1.228CysPhe: 1.228 ± 0.829
0.982CysGly: 0.982 ± 0.375
1.719CysHis: 1.719 ± 1.036
1.228CysIle: 1.228 ± 0.564
1.473CysLys: 1.473 ± 0.512
3.438CysLeu: 3.438 ± 1.937
0.982CysMet: 0.982 ± 0.377
1.473CysAsn: 1.473 ± 0.562
1.473CysPro: 1.473 ± 0.774
1.228CysGln: 1.228 ± 0.498
1.228CysArg: 1.228 ± 0.564
3.193CysSer: 3.193 ± 0.988
1.965CysThr: 1.965 ± 0.75
1.228CysVal: 1.228 ± 0.967
0.491CysTrp: 0.491 ± 0.305
0.491CysTyr: 0.491 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
4.42AspAla: 4.42 ± 1.678
1.228AspCys: 1.228 ± 0.421
2.947AspAsp: 2.947 ± 1.259
2.456AspGlu: 2.456 ± 0.634
2.701AspPhe: 2.701 ± 1.311
4.912AspGly: 4.912 ± 1.137
1.473AspHis: 1.473 ± 0.915
2.456AspIle: 2.456 ± 0.885
2.947AspLys: 2.947 ± 0.72
4.912AspLeu: 4.912 ± 2.591
1.965AspMet: 1.965 ± 0.992
1.965AspAsn: 1.965 ± 0.456
2.701AspPro: 2.701 ± 1.4
1.228AspGln: 1.228 ± 0.945
1.719AspArg: 1.719 ± 0.554
3.684AspSer: 3.684 ± 0.325
0.982AspThr: 0.982 ± 0.377
3.193AspVal: 3.193 ± 1.424
0.982AspTrp: 0.982 ± 0.598
1.228AspTyr: 1.228 ± 0.646
0.0AspXaa: 0.0 ± 0.0
Glu
3.438GluAla: 3.438 ± 0.713
0.982GluCys: 0.982 ± 0.606
5.894GluAsp: 5.894 ± 0.99
6.631GluGlu: 6.631 ± 1.648
3.193GluPhe: 3.193 ± 0.897
3.929GluGly: 3.929 ± 0.861
0.737GluHis: 0.737 ± 0.256
4.912GluIle: 4.912 ± 1.343
2.456GluLys: 2.456 ± 0.922
7.859GluLeu: 7.859 ± 1.83
2.456GluMet: 2.456 ± 0.589
1.965GluAsn: 1.965 ± 0.671
1.965GluPro: 1.965 ± 0.503
0.982GluGln: 0.982 ± 0.485
3.684GluArg: 3.684 ± 0.837
4.666GluSer: 4.666 ± 1.324
1.965GluThr: 1.965 ± 0.754
4.175GluVal: 4.175 ± 1.462
0.491GluTrp: 0.491 ± 0.595
1.228GluTyr: 1.228 ± 0.649
0.0GluXaa: 0.0 ± 0.0
Phe
4.42PheAla: 4.42 ± 2.615
0.982PheCys: 0.982 ± 0.606
3.193PheAsp: 3.193 ± 1.052
1.719PheGlu: 1.719 ± 0.747
1.965PhePhe: 1.965 ± 0.529
2.21PheGly: 2.21 ± 0.528
0.982PheHis: 0.982 ± 0.61
1.719PheIle: 1.719 ± 0.413
2.21PheLys: 2.21 ± 0.889
3.929PheLeu: 3.929 ± 0.571
1.228PheMet: 1.228 ± 0.515
2.456PheAsn: 2.456 ± 1.03
2.701PhePro: 2.701 ± 1.432
1.228PheGln: 1.228 ± 0.442
1.473PheArg: 1.473 ± 0.757
5.157PheSer: 5.157 ± 0.808
2.21PheThr: 2.21 ± 0.401
5.157PheVal: 5.157 ± 1.248
0.246PheTrp: 0.246 ± 0.152
0.737PheTyr: 0.737 ± 0.506
0.0PheXaa: 0.0 ± 0.0
Gly
4.42GlyAla: 4.42 ± 0.816
1.719GlyCys: 1.719 ± 0.6
2.947GlyAsp: 2.947 ± 0.751
2.21GlyGlu: 2.21 ± 0.952
4.175GlyPhe: 4.175 ± 1.18
3.684GlyGly: 3.684 ± 1.012
1.965GlyHis: 1.965 ± 0.529
5.648GlyIle: 5.648 ± 2.055
4.912GlyLys: 4.912 ± 1.003
4.175GlyLeu: 4.175 ± 1.612
1.473GlyMet: 1.473 ± 0.486
1.719GlyAsn: 1.719 ± 1.127
2.947GlyPro: 2.947 ± 1.048
2.21GlyGln: 2.21 ± 0.931
2.456GlyArg: 2.456 ± 1.451
7.367GlySer: 7.367 ± 2.811
2.456GlyThr: 2.456 ± 1.054
4.666GlyVal: 4.666 ± 1.62
0.737GlyTrp: 0.737 ± 0.716
1.228GlyTyr: 1.228 ± 0.421
0.0GlyXaa: 0.0 ± 0.0
His
0.982HisAla: 0.982 ± 0.375
0.491HisCys: 0.491 ± 0.454
1.965HisAsp: 1.965 ± 0.671
1.473HisGlu: 1.473 ± 0.615
1.228HisPhe: 1.228 ± 0.515
3.438HisGly: 3.438 ± 0.784
0.982HisHis: 0.982 ± 0.515
1.228HisIle: 1.228 ± 0.515
1.965HisLys: 1.965 ± 0.739
2.21HisLeu: 2.21 ± 0.767
0.737HisMet: 0.737 ± 0.846
0.491HisAsn: 0.491 ± 0.454
0.491HisPro: 0.491 ± 0.595
0.737HisGln: 0.737 ± 0.457
1.473HisArg: 1.473 ± 1.434
0.982HisSer: 0.982 ± 0.584
1.228HisThr: 1.228 ± 0.421
1.473HisVal: 1.473 ± 0.512
0.0HisTrp: 0.0 ± 0.0
0.737HisTyr: 0.737 ± 0.457
0.0HisXaa: 0.0 ± 0.0
Ile
5.157IleAla: 5.157 ± 1.406
1.965IleCys: 1.965 ± 0.754
2.947IleAsp: 2.947 ± 1.141
3.438IleGlu: 3.438 ± 1.069
1.473IlePhe: 1.473 ± 0.915
3.684IleGly: 3.684 ± 0.644
0.246IleHis: 0.246 ± 0.152
4.175IleIle: 4.175 ± 1.036
3.438IleLys: 3.438 ± 1.366
5.648IleLeu: 5.648 ± 1.587
0.246IleMet: 0.246 ± 0.152
2.456IleAsn: 2.456 ± 0.922
3.193IlePro: 3.193 ± 1.738
2.701IleGln: 2.701 ± 1.001
4.912IleArg: 4.912 ± 0.864
5.894IleSer: 5.894 ± 1.809
3.929IleThr: 3.929 ± 0.986
4.666IleVal: 4.666 ± 0.724
0.246IleTrp: 0.246 ± 0.152
1.473IleTyr: 1.473 ± 0.511
0.0IleXaa: 0.0 ± 0.0
Lys
3.684LysAla: 3.684 ± 1.209
2.21LysCys: 2.21 ± 1.732
2.456LysAsp: 2.456 ± 0.924
3.193LysGlu: 3.193 ± 1.711
2.456LysPhe: 2.456 ± 1.03
5.648LysGly: 5.648 ± 1.936
0.982LysHis: 0.982 ± 0.61
3.684LysIle: 3.684 ± 0.644
5.648LysLys: 5.648 ± 0.914
4.42LysLeu: 4.42 ± 1.091
2.701LysMet: 2.701 ± 0.92
1.473LysAsn: 1.473 ± 0.659
2.947LysPro: 2.947 ± 0.879
2.456LysGln: 2.456 ± 1.0
2.947LysArg: 2.947 ± 0.801
2.701LysSer: 2.701 ± 1.173
5.157LysThr: 5.157 ± 1.598
3.438LysVal: 3.438 ± 1.614
1.473LysTrp: 1.473 ± 0.915
1.473LysTyr: 1.473 ± 0.432
0.0LysXaa: 0.0 ± 0.0
Leu
6.139LeuAla: 6.139 ± 1.811
1.719LeuCys: 1.719 ± 0.612
3.438LeuAsp: 3.438 ± 0.923
5.157LeuGlu: 5.157 ± 1.844
3.193LeuPhe: 3.193 ± 1.132
3.438LeuGly: 3.438 ± 0.908
2.21LeuHis: 2.21 ± 0.632
7.859LeuIle: 7.859 ± 0.734
7.122LeuLys: 7.122 ± 1.979
7.122LeuLeu: 7.122 ± 0.769
4.175LeuMet: 4.175 ± 1.66
3.929LeuAsn: 3.929 ± 0.929
4.42LeuPro: 4.42 ± 1.313
3.438LeuGln: 3.438 ± 0.897
6.385LeuArg: 6.385 ± 1.617
11.051LeuSer: 11.051 ± 1.001
3.684LeuThr: 3.684 ± 0.958
5.648LeuVal: 5.648 ± 1.129
0.246LeuTrp: 0.246 ± 0.152
2.701LeuTyr: 2.701 ± 0.851
0.0LeuXaa: 0.0 ± 0.0
Met
2.21MetAla: 2.21 ± 1.015
0.246MetCys: 0.246 ± 0.152
2.456MetAsp: 2.456 ± 1.04
2.456MetGlu: 2.456 ± 1.058
1.228MetPhe: 1.228 ± 0.515
2.456MetGly: 2.456 ± 1.283
1.473MetHis: 1.473 ± 0.724
2.21MetIle: 2.21 ± 0.718
1.719MetLys: 1.719 ± 0.513
2.21MetLeu: 2.21 ± 0.632
2.21MetMet: 2.21 ± 1.78
0.737MetAsn: 0.737 ± 0.57
0.737MetPro: 0.737 ± 0.903
0.982MetGln: 0.982 ± 0.61
1.473MetArg: 1.473 ± 0.512
1.473MetSer: 1.473 ± 0.486
1.965MetThr: 1.965 ± 0.926
2.21MetVal: 2.21 ± 0.615
0.491MetTrp: 0.491 ± 0.454
0.982MetTyr: 0.982 ± 0.515
0.0MetXaa: 0.0 ± 0.0
Asn
1.719AsnAla: 1.719 ± 0.509
1.228AsnCys: 1.228 ± 0.515
2.947AsnAsp: 2.947 ± 0.877
1.228AsnGlu: 1.228 ± 0.461
2.947AsnPhe: 2.947 ± 0.672
1.228AsnGly: 1.228 ± 0.498
0.491AsnHis: 0.491 ± 0.305
1.965AsnIle: 1.965 ± 0.676
1.965AsnLys: 1.965 ± 0.969
4.666AsnLeu: 4.666 ± 0.993
0.246AsnMet: 0.246 ± 0.152
2.21AsnAsn: 2.21 ± 0.763
2.947AsnPro: 2.947 ± 1.208
1.228AsnGln: 1.228 ± 0.515
1.965AsnArg: 1.965 ± 0.75
1.965AsnSer: 1.965 ± 0.533
0.491AsnThr: 0.491 ± 0.187
1.473AsnVal: 1.473 ± 0.896
1.228AsnTrp: 1.228 ± 0.442
1.719AsnTyr: 1.719 ± 0.413
0.0AsnXaa: 0.0 ± 0.0
Pro
1.965ProAla: 1.965 ± 1.238
1.228ProCys: 1.228 ± 0.657
1.228ProAsp: 1.228 ± 0.442
5.648ProGlu: 5.648 ± 1.665
2.456ProPhe: 2.456 ± 0.63
2.456ProGly: 2.456 ± 0.326
1.473ProHis: 1.473 ± 0.916
1.965ProIle: 1.965 ± 0.503
2.456ProLys: 2.456 ± 0.843
3.438ProLeu: 3.438 ± 0.947
0.982ProMet: 0.982 ± 0.485
1.965ProAsn: 1.965 ± 0.941
4.175ProPro: 4.175 ± 1.8
0.737ProGln: 0.737 ± 0.387
3.193ProArg: 3.193 ± 0.784
4.175ProSer: 4.175 ± 2.432
0.982ProThr: 0.982 ± 0.61
2.456ProVal: 2.456 ± 1.449
0.737ProTrp: 0.737 ± 0.457
1.228ProTyr: 1.228 ± 0.987
0.0ProXaa: 0.0 ± 0.0
Gln
3.438GlnAla: 3.438 ± 1.412
1.965GlnCys: 1.965 ± 0.675
0.982GlnAsp: 0.982 ± 0.375
1.965GlnGlu: 1.965 ± 1.019
0.491GlnPhe: 0.491 ± 1.312
2.701GlnGly: 2.701 ± 1.026
0.737GlnHis: 0.737 ± 0.457
2.947GlnIle: 2.947 ± 0.645
1.473GlnLys: 1.473 ± 0.915
1.719GlnLeu: 1.719 ± 0.6
0.982GlnMet: 0.982 ± 0.377
0.491GlnAsn: 0.491 ± 0.305
0.982GlnPro: 0.982 ± 0.584
1.228GlnGln: 1.228 ± 0.762
1.228GlnArg: 1.228 ± 0.442
3.193GlnSer: 3.193 ± 1.425
1.473GlnThr: 1.473 ± 0.562
1.473GlnVal: 1.473 ± 0.511
0.491GlnTrp: 0.491 ± 0.595
0.982GlnTyr: 0.982 ± 0.598
0.0GlnXaa: 0.0 ± 0.0
Arg
2.701ArgAla: 2.701 ± 0.851
2.456ArgCys: 2.456 ± 0.765
3.684ArgAsp: 3.684 ± 0.911
5.894ArgGlu: 5.894 ± 1.421
1.719ArgPhe: 1.719 ± 0.901
3.684ArgGly: 3.684 ± 1.105
0.491ArgHis: 0.491 ± 0.648
2.947ArgIle: 2.947 ± 1.279
2.456ArgLys: 2.456 ± 0.663
3.684ArgLeu: 3.684 ± 1.698
2.21ArgMet: 2.21 ± 0.907
1.965ArgAsn: 1.965 ± 0.941
1.719ArgPro: 1.719 ± 0.454
2.21ArgGln: 2.21 ± 0.889
3.684ArgArg: 3.684 ± 3.194
3.684ArgSer: 3.684 ± 1.6
3.438ArgThr: 3.438 ± 1.45
5.403ArgVal: 5.403 ± 2.95
0.491ArgTrp: 0.491 ± 0.187
1.228ArgTyr: 1.228 ± 0.515
0.0ArgXaa: 0.0 ± 0.0
Ser
5.894SerAla: 5.894 ± 1.229
2.947SerCys: 2.947 ± 1.739
3.438SerAsp: 3.438 ± 1.85
4.666SerGlu: 4.666 ± 0.656
4.912SerPhe: 4.912 ± 0.742
5.403SerGly: 5.403 ± 1.34
1.473SerHis: 1.473 ± 0.581
3.438SerIle: 3.438 ± 0.765
6.385SerLys: 6.385 ± 1.811
8.841SerLeu: 8.841 ± 0.836
1.965SerMet: 1.965 ± 0.962
2.21SerAsn: 2.21 ± 1.378
4.175SerPro: 4.175 ± 1.343
2.21SerGln: 2.21 ± 0.837
3.438SerArg: 3.438 ± 1.39
9.578SerSer: 9.578 ± 1.853
4.666SerThr: 4.666 ± 1.904
6.631SerVal: 6.631 ± 1.665
1.228SerTrp: 1.228 ± 0.657
2.701SerTyr: 2.701 ± 0.967
0.0SerXaa: 0.0 ± 0.0
Thr
2.21ThrAla: 2.21 ± 0.852
3.193ThrCys: 3.193 ± 0.925
1.473ThrAsp: 1.473 ± 0.659
2.701ThrGlu: 2.701 ± 0.662
1.473ThrPhe: 1.473 ± 0.486
3.684ThrGly: 3.684 ± 1.531
0.491ThrHis: 0.491 ± 0.187
3.684ThrIle: 3.684 ± 1.377
3.193ThrLys: 3.193 ± 1.104
5.894ThrLeu: 5.894 ± 1.23
1.965ThrMet: 1.965 ± 1.119
2.21ThrAsn: 2.21 ± 0.536
0.491ThrPro: 0.491 ± 0.305
1.228ThrGln: 1.228 ± 0.461
3.193ThrArg: 3.193 ± 0.838
3.929ThrSer: 3.929 ± 0.929
2.456ThrThr: 2.456 ± 0.547
2.456ThrVal: 2.456 ± 1.06
0.0ThrTrp: 0.0 ± 0.0
1.473ThrTyr: 1.473 ± 0.562
0.0ThrXaa: 0.0 ± 0.0
Val
4.666ValAla: 4.666 ± 0.695
1.965ValCys: 1.965 ± 1.212
2.947ValAsp: 2.947 ± 1.777
3.929ValGlu: 3.929 ± 1.317
3.193ValPhe: 3.193 ± 2.107
4.42ValGly: 4.42 ± 1.979
2.456ValHis: 2.456 ± 0.81
2.456ValIle: 2.456 ± 1.44
3.438ValLys: 3.438 ± 0.784
6.139ValLeu: 6.139 ± 0.902
1.719ValMet: 1.719 ± 0.554
2.21ValAsn: 2.21 ± 0.852
1.965ValPro: 1.965 ± 0.529
2.456ValGln: 2.456 ± 0.51
4.666ValArg: 4.666 ± 0.985
6.631ValSer: 6.631 ± 1.991
3.193ValThr: 3.193 ± 1.026
5.648ValVal: 5.648 ± 1.001
0.491ValTrp: 0.491 ± 0.187
2.947ValTyr: 2.947 ± 1.559
0.0ValXaa: 0.0 ± 0.0
Trp
0.982TrpAla: 0.982 ± 0.61
0.246TrpCys: 0.246 ± 0.152
0.737TrpAsp: 0.737 ± 0.442
0.491TrpGlu: 0.491 ± 0.595
0.737TrpPhe: 0.737 ± 0.387
0.982TrpGly: 0.982 ± 0.375
0.0TrpHis: 0.0 ± 0.0
1.228TrpIle: 1.228 ± 0.447
0.246TrpLys: 0.246 ± 0.227
0.982TrpLeu: 0.982 ± 0.485
0.491TrpMet: 0.491 ± 0.187
0.737TrpAsn: 0.737 ± 0.256
0.491TrpPro: 0.491 ± 0.493
0.0TrpGln: 0.0 ± 0.0
0.737TrpArg: 0.737 ± 0.256
0.246TrpSer: 0.246 ± 0.152
0.737TrpThr: 0.737 ± 0.551
0.982TrpVal: 0.982 ± 0.485
0.246TrpTrp: 0.246 ± 0.152
0.246TrpTyr: 0.246 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.965TyrAla: 1.965 ± 0.95
0.246TyrCys: 0.246 ± 0.227
0.246TyrAsp: 0.246 ± 0.152
1.473TyrGlu: 1.473 ± 0.571
1.473TyrPhe: 1.473 ± 1.011
0.737TyrGly: 0.737 ± 0.57
0.737TyrHis: 0.737 ± 0.457
1.473TyrIle: 1.473 ± 0.659
2.947TyrLys: 2.947 ± 0.855
3.684TyrLeu: 3.684 ± 0.849
0.491TyrMet: 0.491 ± 0.305
1.228TyrAsn: 1.228 ± 0.649
0.982TyrPro: 0.982 ± 0.598
1.228TyrGln: 1.228 ± 1.194
2.701TyrArg: 2.701 ± 0.731
1.965TyrSer: 1.965 ± 0.503
1.228TyrThr: 1.228 ± 0.762
1.473TyrVal: 1.473 ± 0.511
0.491TyrTrp: 0.491 ± 0.187
0.246TyrTyr: 0.246 ± 0.152
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4073 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski