Amino acid dipeptide frequency for Wabat virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
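As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides in a set of sequences and compares each frequency with the uniform expectation of 2.5‰. The names `dipeptide_per_mille` and `classify`, the one-letter residue codes, and the toy sequences are assumptions made for this example; the counting code and the colouring threshold actually used by the database are not described on this page.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation over the 400 standard dipeptides: 1/400 = 2.5 per mille.
EXPECTED = 1000.0 / len(STANDARD) ** 2


def classify(freq):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > EXPECTED:
        return "over"
    if freq < EXPECTED:
        return "under"
    return "expected"


toy = ["MKKLA", "AKKB"]  # made-up sequences, not Wabat virus proteins
freqs = dipeptide_per_mille(toy)
```

With the table values, `classify(3.744)` (AlaAla) returns `"over"` and `classify(0.267)` (AlaCys) returns `"under"`.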

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
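The unit conversion can be sketched in a few lines of Python. The helper names below are hypothetical and exist only for this example; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 3.744  # AlaAla entry from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # roughly 0.003744
print(per_mille_to_percent(ala_ala))   # roughly 0.3744 (percent)
```

On the same scale, the uniform expectation of 0.25% corresponds to 2.5‰.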

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.744 ± 0.835
AlaCys: 0.267 ± 0.143
AlaAsp: 3.744 ± 2.184
AlaGlu: 4.547 ± 1.891
AlaPhe: 3.477 ± 1.498
AlaGly: 5.082 ± 4.193
AlaHis: 1.872 ± 0.593
AlaIle: 4.012 ± 0.598
AlaLys: 3.209 ± 0.794
AlaLeu: 6.151 ± 1.069
AlaMet: 1.872 ± 0.31
AlaAsn: 2.14 ± 0.548
AlaPro: 2.407 ± 0.538
AlaGln: 2.407 ± 0.939
AlaArg: 2.407 ± 1.233
AlaSer: 4.012 ± 1.256
AlaThr: 6.151 ± 0.962
AlaVal: 4.012 ± 0.808
AlaTrp: 0.802 ± 0.785
AlaTyr: 2.675 ± 0.493
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.07 ± 0.57
CysCys: 0.802 ± 0.428
CysAsp: 0.802 ± 0.428
CysGlu: 0.267 ± 0.143
CysPhe: 1.07 ± 0.393
CysGly: 0.535 ± 0.368
CysHis: 0.267 ± 0.434
CysIle: 1.07 ± 0.57
CysLys: 1.337 ± 0.471
CysLeu: 1.605 ± 0.855
CysMet: 1.07 ± 1.224
CysAsn: 1.872 ± 0.729
CysPro: 1.605 ± 0.855
CysGln: 0.802 ± 0.428
CysArg: 1.337 ± 0.471
CysSer: 0.802 ± 0.428
CysThr: 0.535 ± 0.868
CysVal: 1.337 ± 0.706
CysTrp: 0.0 ± 0.0
CysTyr: 1.07 ± 0.349
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.547 ± 2.103
AspCys: 2.407 ± 0.51
AspAsp: 2.942 ± 0.932
AspGlu: 4.012 ± 1.318
AspPhe: 4.012 ± 0.699
AspGly: 2.407 ± 0.494
AspHis: 2.675 ± 0.941
AspIle: 3.209 ± 0.638
AspLys: 3.209 ± 0.638
AspLeu: 3.744 ± 1.176
AspMet: 1.605 ± 0.667
AspAsn: 2.407 ± 0.538
AspPro: 3.477 ± 0.406
AspGln: 2.675 ± 1.032
AspArg: 2.407 ± 1.233
AspSer: 3.744 ± 1.321
AspThr: 2.407 ± 0.818
AspVal: 5.616 ± 0.81
AspTrp: 0.267 ± 0.143
AspTyr: 2.942 ± 0.623
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.349 ± 1.224
GluCys: 1.07 ± 0.57
GluAsp: 4.279 ± 1.641
GluGlu: 6.151 ± 2.278
GluPhe: 2.14 ± 0.82
GluGly: 3.744 ± 0.397
GluHis: 1.605 ± 0.704
GluIle: 4.012 ± 0.993
GluLys: 3.477 ± 0.737
GluLeu: 6.151 ± 1.235
GluMet: 2.675 ± 0.508
GluAsn: 1.872 ± 0.735
GluPro: 3.477 ± 1.102
GluGln: 3.477 ± 0.891
GluArg: 2.407 ± 0.853
GluSer: 5.349 ± 1.442
GluThr: 3.744 ± 0.918
GluVal: 3.209 ± 0.548
GluTrp: 1.07 ± 0.39
GluTyr: 1.605 ± 0.274
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.605 ± 0.576
PheCys: 0.535 ± 0.368
PheAsp: 2.942 ± 0.943
PheGlu: 4.279 ± 1.111
PhePhe: 1.872 ± 0.729
PheGly: 1.337 ± 0.713
PheHis: 1.07 ± 0.739
PheIle: 3.477 ± 0.568
PheLys: 3.477 ± 1.267
PheLeu: 4.547 ± 1.129
PheMet: 1.605 ± 0.519
PheAsn: 2.14 ± 0.816
PhePro: 1.337 ± 0.308
PheGln: 0.535 ± 0.368
PheArg: 2.407 ± 0.532
PheSer: 1.872 ± 0.664
PheThr: 3.477 ± 1.349
PheVal: 4.012 ± 1.315
PheTrp: 1.07 ± 0.393
PheTyr: 1.605 ± 1.105
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.814 ± 1.845
GlyCys: 0.535 ± 0.285
GlyAsp: 4.012 ± 0.699
GlyGlu: 2.14 ± 0.74
GlyPhe: 1.872 ± 1.029
GlyGly: 2.407 ± 0.532
GlyHis: 2.407 ± 0.939
GlyIle: 2.942 ± 0.766
GlyLys: 5.349 ± 1.674
GlyLeu: 3.744 ± 3.093
GlyMet: 2.14 ± 1.819
GlyAsn: 1.605 ± 0.519
GlyPro: 2.675 ± 1.516
GlyGln: 1.605 ± 1.632
GlyArg: 1.605 ± 0.696
GlySer: 3.209 ± 1.408
GlyThr: 4.012 ± 2.057
GlyVal: 2.14 ± 0.761
GlyTrp: 1.07 ± 0.739
GlyTyr: 4.012 ± 1.256
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.675 ± 0.635
HisCys: 1.07 ± 0.737
HisAsp: 1.337 ± 0.76
HisGlu: 0.535 ± 0.285
HisPhe: 1.872 ± 0.664
HisGly: 1.872 ± 0.636
HisHis: 1.605 ± 1.202
HisIle: 2.407 ± 0.494
HisLys: 1.337 ± 0.713
HisLeu: 2.942 ± 1.221
HisMet: 0.802 ± 0.505
HisAsn: 0.0 ± 0.0
HisPro: 2.407 ± 0.51
HisGln: 1.605 ± 0.855
HisArg: 2.407 ± 0.892
HisSer: 1.605 ± 0.668
HisThr: 1.605 ± 0.576
HisVal: 1.337 ± 0.757
HisTrp: 0.267 ± 0.143
HisTyr: 0.802 ± 0.982
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.744 ± 1.173
IleCys: 1.337 ± 0.713
IleAsp: 2.675 ± 0.564
IleGlu: 4.814 ± 1.26
IlePhe: 1.337 ± 0.713
IleGly: 3.209 ± 1.038
IleHis: 1.337 ± 0.686
IleIle: 1.337 ± 0.419
IleLys: 4.012 ± 1.395
IleLeu: 3.744 ± 1.321
IleMet: 2.14 ± 0.398
IleAsn: 3.477 ± 0.789
IlePro: 3.209 ± 0.707
IleGln: 2.675 ± 0.867
IleArg: 3.477 ± 0.807
IleSer: 5.082 ± 0.812
IleThr: 2.942 ± 1.008
IleVal: 2.14 ± 0.698
IleTrp: 1.337 ± 0.706
IleTyr: 2.675 ± 0.86
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.477 ± 1.274
LysCys: 1.07 ± 0.57
LysAsp: 3.477 ± 1.0
LysGlu: 5.082 ± 1.876
LysPhe: 3.209 ± 1.358
LysGly: 3.209 ± 1.736
LysHis: 1.07 ± 0.57
LysIle: 5.082 ± 1.214
LysLys: 5.884 ± 2.702
LysLeu: 5.349 ± 1.349
LysMet: 1.872 ± 0.914
LysAsn: 2.942 ± 0.559
LysPro: 3.209 ± 0.794
LysGln: 2.407 ± 1.055
LysArg: 3.744 ± 0.673
LysSer: 2.675 ± 1.026
LysThr: 3.744 ± 1.995
LysVal: 2.14 ± 0.398
LysTrp: 1.337 ± 0.713
LysTyr: 2.675 ± 1.026
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.884 ± 1.981
LeuCys: 1.337 ± 0.471
LeuAsp: 6.419 ± 1.377
LeuGlu: 4.279 ± 1.092
LeuPhe: 3.209 ± 0.909
LeuGly: 4.012 ± 1.584
LeuHis: 1.605 ± 0.855
LeuIle: 3.477 ± 0.737
LeuLys: 4.814 ± 1.065
LeuLeu: 7.489 ± 2.041
LeuMet: 1.605 ± 0.576
LeuAsn: 2.942 ± 0.766
LeuPro: 4.547 ± 1.214
LeuGln: 1.872 ± 0.31
LeuArg: 3.477 ± 2.379
LeuSer: 4.279 ± 1.914
LeuThr: 6.419 ± 1.065
LeuVal: 5.349 ± 1.297
LeuTrp: 1.605 ± 0.274
LeuTyr: 2.407 ± 0.679
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.942 ± 1.804
MetCys: 0.267 ± 0.143
MetAsp: 2.407 ± 0.989
MetGlu: 1.872 ± 1.07
MetPhe: 1.605 ± 0.667
MetGly: 1.872 ± 1.048
MetHis: 0.267 ± 0.143
MetIle: 2.14 ± 0.781
MetLys: 2.407 ± 1.283
MetLeu: 2.407 ± 0.692
MetMet: 0.0 ± 0.0
MetAsn: 1.872 ± 0.896
MetPro: 1.605 ± 0.519
MetGln: 1.337 ± 1.742
MetArg: 0.802 ± 0.505
MetSer: 2.675 ± 1.36
MetThr: 1.605 ± 0.696
MetVal: 1.337 ± 0.758
MetTrp: 0.267 ± 0.434
MetTyr: 0.535 ± 0.368
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.14 ± 1.517
AsnCys: 1.605 ± 0.667
AsnAsp: 3.477 ± 1.518
AsnGlu: 4.012 ± 1.256
AsnPhe: 3.209 ± 0.548
AsnGly: 2.407 ± 0.951
AsnHis: 1.07 ± 0.349
AsnIle: 3.209 ± 0.638
AsnLys: 2.407 ± 0.892
AsnLeu: 3.209 ± 0.866
AsnMet: 0.535 ± 0.285
AsnAsn: 2.407 ± 0.818
AsnPro: 2.942 ± 0.937
AsnGln: 0.535 ± 0.835
AsnArg: 3.744 ± 1.249
AsnSer: 2.675 ± 1.47
AsnThr: 1.872 ± 1.685
AsnVal: 2.942 ± 0.989
AsnTrp: 0.802 ± 0.982
AsnTyr: 1.07 ± 0.838
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.407 ± 0.755
ProCys: 0.267 ± 0.143
ProAsp: 4.012 ± 1.315
ProGlu: 4.547 ± 1.795
ProPhe: 0.802 ± 0.33
ProGly: 3.209 ± 1.299
ProHis: 2.407 ± 0.679
ProIle: 2.675 ± 0.717
ProLys: 2.407 ± 0.51
ProLeu: 3.744 ± 1.176
ProMet: 1.337 ± 0.757
ProAsn: 1.605 ± 1.637
ProPro: 5.616 ± 0.789
ProGln: 1.605 ± 1.511
ProArg: 1.872 ± 1.503
ProSer: 3.477 ± 1.791
ProThr: 3.209 ± 0.575
ProVal: 3.744 ± 1.576
ProTrp: 1.07 ± 0.39
ProTyr: 1.872 ± 1.824
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.477 ± 1.54
GlnCys: 0.0 ± 0.0
GlnAsp: 1.337 ± 0.471
GlnGlu: 1.872 ± 0.593
GlnPhe: 0.802 ± 0.785
GlnGly: 1.07 ± 1.228
GlnHis: 0.802 ± 0.428
GlnIle: 2.407 ± 0.692
GlnLys: 2.14 ± 0.82
GlnLeu: 2.675 ± 1.413
GlnMet: 1.337 ± 0.406
GlnAsn: 2.675 ± 1.114
GlnPro: 2.14 ± 2.789
GlnGln: 2.407 ± 1.531
GlnArg: 2.942 ± 0.634
GlnSer: 2.14 ± 0.917
GlnThr: 1.605 ± 0.274
GlnVal: 2.407 ± 0.925
GlnTrp: 0.267 ± 0.434
GlnTyr: 1.07 ± 0.867
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.477 ± 1.689
ArgCys: 0.535 ± 0.368
ArgAsp: 4.547 ± 2.59
ArgGlu: 2.942 ± 1.543
ArgPhe: 2.675 ± 1.114
ArgGly: 1.605 ± 0.704
ArgHis: 2.14 ± 1.14
ArgIle: 1.872 ± 0.636
ArgLys: 1.872 ± 0.735
ArgLeu: 4.279 ± 1.395
ArgMet: 2.14 ± 0.771
ArgAsn: 3.744 ± 1.204
ArgPro: 1.07 ± 0.872
ArgGln: 1.337 ± 1.313
ArgArg: 3.744 ± 1.321
ArgSer: 4.814 ± 3.009
ArgThr: 3.209 ± 1.458
ArgVal: 3.477 ± 1.496
ArgTrp: 0.535 ± 0.368
ArgTyr: 1.605 ± 0.66
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.012 ± 1.256
SerCys: 1.872 ± 2.016
SerAsp: 4.279 ± 1.161
SerGlu: 4.547 ± 1.034
SerPhe: 3.744 ± 1.173
SerGly: 2.942 ± 1.704
SerHis: 1.337 ± 0.419
SerIle: 5.884 ± 0.825
SerLys: 6.419 ± 1.327
SerLeu: 4.012 ± 1.296
SerMet: 2.407 ± 0.581
SerAsn: 3.209 ± 1.609
SerPro: 2.942 ± 1.501
SerGln: 2.407 ± 0.755
SerArg: 2.14 ± 0.698
SerSer: 5.884 ± 3.112
SerThr: 4.012 ± 0.688
SerVal: 3.209 ± 1.046
SerTrp: 0.802 ± 0.352
SerTyr: 2.675 ± 1.781
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.675 ± 0.508
ThrCys: 1.07 ± 0.57
ThrAsp: 3.477 ± 0.568
ThrGlu: 2.675 ± 1.085
ThrPhe: 4.547 ± 0.833
ThrGly: 4.279 ± 2.203
ThrHis: 1.605 ± 0.519
ThrIle: 3.744 ± 1.576
ThrLys: 3.477 ± 0.9
ThrLeu: 4.279 ± 0.55
ThrMet: 1.605 ± 0.757
ThrAsn: 3.209 ± 2.515
ThrPro: 2.942 ± 0.634
ThrGln: 1.872 ± 1.508
ThrArg: 3.744 ± 0.83
ThrSer: 4.279 ± 1.633
ThrThr: 5.082 ± 0.517
ThrVal: 4.814 ± 1.26
ThrTrp: 0.802 ± 0.33
ThrTyr: 1.872 ± 0.834
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.744 ± 1.388
ValCys: 2.407 ± 1.283
ValAsp: 2.675 ± 0.86
ValGlu: 4.279 ± 1.349
ValPhe: 1.605 ± 0.855
ValGly: 6.419 ± 1.109
ValHis: 2.942 ± 0.559
ValIle: 1.337 ± 0.419
ValLys: 4.547 ± 1.123
ValLeu: 4.012 ± 0.808
ValMet: 1.872 ± 0.593
ValAsn: 2.675 ± 0.941
ValPro: 2.14 ± 0.787
ValGln: 2.14 ± 0.761
ValArg: 2.942 ± 1.283
ValSer: 6.151 ± 1.667
ValThr: 3.477 ± 0.769
ValVal: 4.814 ± 1.131
ValTrp: 1.07 ± 1.429
ValTyr: 1.605 ± 0.667
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.802 ± 0.505
TrpCys: 0.0 ± 0.0
TrpAsp: 0.535 ± 0.368
TrpGlu: 0.802 ± 0.793
TrpPhe: 1.605 ± 0.667
TrpGly: 0.802 ± 0.785
TrpHis: 1.337 ± 0.308
TrpIle: 0.535 ± 0.368
TrpLys: 0.802 ± 0.33
TrpLeu: 1.07 ± 0.39
TrpMet: 0.267 ± 0.143
TrpAsn: 1.605 ± 0.576
TrpPro: 0.267 ± 0.143
TrpGln: 0.535 ± 0.285
TrpArg: 0.802 ± 0.505
TrpSer: 1.605 ± 1.202
TrpThr: 0.535 ± 0.63
TrpVal: 1.07 ± 0.57
TrpTrp: 0.535 ± 0.369
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.407 ± 0.951
TyrCys: 0.802 ± 0.352
TyrAsp: 1.337 ± 0.308
TyrGlu: 3.209 ± 1.453
TyrPhe: 0.535 ± 0.285
TyrGly: 1.872 ± 0.998
TyrHis: 1.07 ± 0.393
TyrIle: 1.872 ± 1.351
TyrLys: 1.07 ± 0.57
TyrLeu: 1.872 ± 0.894
TyrMet: 1.07 ± 1.264
TyrAsn: 1.872 ± 0.664
TyrPro: 1.872 ± 0.697
TyrGln: 1.337 ± 0.771
TyrArg: 3.209 ± 0.575
TyrSer: 2.675 ± 1.059
TyrThr: 2.407 ± 2.355
TyrVal: 3.744 ± 0.784
TyrTrp: 0.535 ± 0.285
TyrTyr: 1.337 ± 0.757
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 4 proteins (3740 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
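One way to read this note is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the replicates is reported as the error. This is only an illustration under stated assumptions. The function names, the fixed seed, and the use of the population standard deviation as the error measure are choices made for this example; the page does not say which statistic the database reports.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    replicates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_frequency(sample, pair))
    # Assumption: the error is the standard deviation of the replicates.
    return statistics.pstdev(replicates)
```

Because whole proteins are resampled, a dipeptide that never occurs in any protein has zero frequency in every replicate, which matches the `0.0 ± 0.0` entries in the table.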

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski