Amino acid dipeptide frequency for Long Island tick rhabdovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we count Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
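As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping dipeptides and reports them in per mille. The function name `dipeptide_permille`, the one-letter alphabet, and the toy sequences are assumptions made for this example. The page does not say whether pairs spanning two proteins are counted, so the sketch assumes they are not.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes

def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; assumed never to span two proteins.
        for a, b in zip(seq, seq[1:]):
            # Anything outside the 20 standard residues is pooled as X (Xaa).
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation: 1/400 of all dipeptides, i.e. 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2

freqs = dipeptide_permille(["MKLLA", "LLR"])  # toy sequences, not real data
```

Values above `UNIFORM_PERMILLE` would correspond to the overrepresented dipeptides, and values below it to the underrepresented ones.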

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
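The unit conversion can be sketched in a few lines of Python. The helper names `permille_to_fraction` and `fold_vs_uniform` are hypothetical. The comparison baseline is the uniform expectation of 1/400 (2.5‰) stated earlier, and the input is the LeuArg value from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def fold_vs_uniform(value_permille, n_residue_types=20):
    """Ratio of an observed value to the uniform expectation (1000 / n^2 per mille)."""
    expected_permille = 1000.0 / n_residue_types ** 2
    return value_permille / expected_permille

leu_arg = 10.036  # LeuArg value taken from the table, in per mille
leu_arg_fraction = permille_to_fraction(leu_arg)
leu_arg_fold = fold_vs_uniform(leu_arg)
```

With these inputs, LeuArg comes out at roughly 1% of all dipeptides, about four times the uniform expectation.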

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.461 ± 2.588
AlaCys: 1.394 ± 0.557
AlaAsp: 3.067 ± 0.618
AlaGlu: 3.624 ± 1.301
AlaPhe: 1.394 ± 1.075
AlaGly: 4.182 ± 1.746
AlaHis: 2.23 ± 0.299
AlaIle: 2.509 ± 1.017
AlaLys: 0.836 ± 0.406
AlaLeu: 6.691 ± 2.06
AlaMet: 1.673 ± 0.84
AlaAsn: 2.788 ± 0.444
AlaPro: 1.951 ± 1.225
AlaGln: 2.509 ± 0.528
AlaArg: 5.297 ± 0.567
AlaSer: 4.182 ± 1.547
AlaThr: 5.018 ± 0.391
AlaVal: 3.345 ± 0.784
AlaTrp: 1.115 ± 0.679
AlaTyr: 2.23 ± 1.027
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.836 ± 0.609
CysCys: 0.558 ± 0.322
CysAsp: 1.673 ± 0.292
CysGlu: 0.558 ± 0.322
CysPhe: 0.836 ± 0.344
CysGly: 1.951 ± 0.836
CysHis: 0.558 ± 0.322
CysIle: 1.115 ± 0.294
CysLys: 0.836 ± 0.716
CysLeu: 1.394 ± 0.244
CysMet: 0.558 ± 0.339
CysAsn: 0.558 ± 0.445
CysPro: 1.115 ± 0.679
CysGln: 1.115 ± 0.507
CysArg: 0.279 ± 0.393
CysSer: 1.394 ± 0.557
CysThr: 1.394 ± 0.536
CysVal: 1.115 ± 0.458
CysTrp: 0.558 ± 0.322
CysTyr: 0.558 ± 0.322
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.509 ± 1.523
AspCys: 0.558 ± 0.322
AspAsp: 2.509 ± 1.878
AspGlu: 5.297 ± 2.217
AspPhe: 1.951 ± 0.459
AspGly: 3.345 ± 1.264
AspHis: 1.394 ± 0.547
AspIle: 2.23 ± 0.879
AspLys: 3.345 ± 1.438
AspLeu: 5.576 ± 1.731
AspMet: 0.558 ± 0.702
AspAsn: 2.509 ± 0.797
AspPro: 5.854 ± 1.353
AspGln: 1.673 ± 0.877
AspArg: 1.115 ± 0.558
AspSer: 3.345 ± 0.996
AspThr: 3.345 ± 0.961
AspVal: 3.345 ± 0.449
AspTrp: 1.115 ± 0.558
AspTyr: 1.394 ± 0.805
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.067 ± 0.974
GluCys: 1.951 ± 1.012
GluAsp: 3.624 ± 2.146
GluGlu: 4.739 ± 1.014
GluPhe: 1.115 ± 0.644
GluGly: 5.297 ± 1.57
GluHis: 2.509 ± 1.017
GluIle: 2.509 ± 0.891
GluLys: 3.624 ± 1.683
GluLeu: 8.364 ± 1.517
GluMet: 1.673 ± 0.628
GluAsn: 1.951 ± 0.403
GluPro: 1.951 ± 0.965
GluGln: 1.115 ± 0.644
GluArg: 2.788 ± 0.993
GluSer: 4.461 ± 0.598
GluThr: 4.182 ± 1.494
GluVal: 1.951 ± 0.763
GluTrp: 1.115 ± 1.087
GluTyr: 2.509 ± 0.987
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.509 ± 0.891
PheCys: 0.836 ± 0.344
PheAsp: 0.836 ± 0.563
PheGlu: 1.394 ± 0.805
PhePhe: 1.394 ± 0.689
PheGly: 1.951 ± 0.658
PheHis: 0.558 ± 0.339
PheIle: 0.836 ± 0.917
PheLys: 2.788 ± 1.448
PheLeu: 3.903 ± 1.046
PheMet: 0.558 ± 0.363
PheAsn: 1.115 ± 0.679
PhePro: 2.788 ± 0.607
PheGln: 1.394 ± 0.557
PheArg: 3.067 ± 0.599
PheSer: 2.509 ± 0.807
PheThr: 2.23 ± 0.494
PheVal: 2.509 ± 0.516
PheTrp: 0.558 ± 0.339
PheTyr: 1.115 ± 0.294
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.903 ± 0.162
GlyCys: 1.673 ± 0.46
GlyAsp: 2.788 ± 0.852
GlyGlu: 1.951 ± 0.849
GlyPhe: 2.509 ± 1.279
GlyGly: 2.788 ± 1.425
GlyHis: 2.509 ± 0.555
GlyIle: 3.067 ± 1.13
GlyLys: 2.23 ± 0.299
GlyLeu: 8.085 ± 1.532
GlyMet: 1.115 ± 0.644
GlyAsn: 1.951 ± 0.57
GlyPro: 2.788 ± 0.466
GlyGln: 3.067 ± 0.618
GlyArg: 4.182 ± 1.332
GlySer: 4.739 ± 0.638
GlyThr: 5.018 ± 1.016
GlyVal: 4.739 ± 1.278
GlyTrp: 1.115 ± 0.679
GlyTyr: 1.394 ± 0.536
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.673 ± 0.782
HisCys: 0.279 ± 0.393
HisAsp: 1.394 ± 0.462
HisGlu: 1.673 ± 0.711
HisPhe: 0.836 ± 0.406
HisGly: 1.673 ± 0.46
HisHis: 1.115 ± 0.516
HisIle: 2.23 ± 0.737
HisLys: 1.673 ± 0.46
HisLeu: 1.115 ± 0.44
HisMet: 0.279 ± 0.393
HisAsn: 0.279 ± 0.393
HisPro: 3.345 ± 2.089
HisGln: 1.673 ± 0.482
HisArg: 2.23 ± 0.604
HisSer: 1.673 ± 0.411
HisThr: 1.951 ± 1.03
HisVal: 1.673 ± 0.665
HisTrp: 0.558 ± 0.322
HisTyr: 2.509 ± 1.326
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.23 ± 0.588
IleCys: 0.558 ± 0.322
IleAsp: 1.951 ± 0.811
IleGlu: 2.788 ± 0.652
IlePhe: 1.951 ± 0.786
IleGly: 1.673 ± 0.956
IleHis: 1.115 ± 0.679
IleIle: 1.673 ± 0.692
IleLys: 5.297 ± 1.511
IleLeu: 2.23 ± 0.54
IleMet: 1.115 ± 0.44
IleAsn: 2.788 ± 0.984
IlePro: 3.903 ± 1.317
IleGln: 0.279 ± 0.161
IleArg: 3.345 ± 0.728
IleSer: 4.182 ± 0.645
IleThr: 5.576 ± 0.644
IleVal: 2.509 ± 1.314
IleTrp: 0.836 ± 0.508
IleTyr: 2.509 ± 1.044
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.182 ± 1.051
LysCys: 0.836 ± 0.413
LysAsp: 3.345 ± 1.138
LysGlu: 2.509 ± 0.796
LysPhe: 2.23 ± 0.791
LysGly: 1.951 ± 0.512
LysHis: 0.558 ± 0.339
LysIle: 2.23 ± 0.973
LysLys: 3.624 ± 0.9
LysLeu: 5.018 ± 1.024
LysMet: 0.836 ± 0.563
LysAsn: 2.509 ± 1.032
LysPro: 1.951 ± 1.047
LysGln: 0.558 ± 0.726
LysArg: 2.788 ± 0.685
LysSer: 2.788 ± 0.466
LysThr: 5.576 ± 1.255
LysVal: 4.461 ± 0.566
LysTrp: 1.673 ± 0.782
LysTyr: 1.115 ± 0.44
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.248 ± 0.866
LeuCys: 1.394 ± 0.557
LeuAsp: 3.903 ± 0.698
LeuGlu: 4.461 ± 0.712
LeuPhe: 3.345 ± 0.888
LeuGly: 5.854 ± 1.417
LeuHis: 1.951 ± 0.811
LeuIle: 6.97 ± 0.456
LeuLys: 5.576 ± 0.863
LeuLeu: 8.364 ± 1.55
LeuMet: 5.018 ± 1.674
LeuAsn: 3.345 ± 1.229
LeuPro: 6.133 ± 0.862
LeuGln: 2.788 ± 0.901
LeuArg: 10.036 ± 2.461
LeuSer: 6.133 ± 1.769
LeuThr: 4.182 ± 2.734
LeuVal: 8.364 ± 1.759
LeuTrp: 1.951 ± 0.71
LeuTyr: 4.182 ± 1.072
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.345 ± 0.72
MetCys: 0.279 ± 0.161
MetAsp: 1.673 ± 0.46
MetGlu: 1.394 ± 0.668
MetPhe: 0.836 ± 0.344
MetGly: 2.23 ± 0.595
MetHis: 1.115 ± 1.104
MetIle: 1.115 ± 0.792
MetLys: 0.279 ± 0.468
MetLeu: 2.509 ± 0.807
MetMet: 0.836 ± 0.483
MetAsn: 0.558 ± 0.339
MetPro: 0.558 ± 0.339
MetGln: 0.558 ± 0.429
MetArg: 1.673 ± 0.595
MetSer: 1.115 ± 0.396
MetThr: 1.394 ± 0.492
MetVal: 0.279 ± 0.445
MetTrp: 0.836 ± 0.508
MetTyr: 0.836 ± 0.483
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.673 ± 1.299
AsnCys: 0.558 ± 0.543
AsnAsp: 2.509 ± 0.99
AsnGlu: 1.951 ± 0.403
AsnPhe: 1.115 ± 0.679
AsnGly: 1.951 ± 1.863
AsnHis: 1.115 ± 0.396
AsnIle: 1.673 ± 0.558
AsnLys: 2.23 ± 0.879
AsnLeu: 3.903 ± 1.33
AsnMet: 0.558 ± 0.363
AsnAsn: 1.394 ± 0.557
AsnPro: 1.951 ± 0.822
AsnGln: 1.673 ± 0.795
AsnArg: 1.115 ± 0.44
AsnSer: 3.067 ± 0.97
AsnThr: 1.673 ± 0.966
AsnVal: 1.951 ± 0.505
AsnTrp: 0.558 ± 0.445
AsnTyr: 1.115 ± 0.644
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.509 ± 1.637
ProCys: 1.115 ± 0.656
ProAsp: 4.461 ± 0.553
ProGlu: 5.854 ± 1.335
ProPhe: 1.394 ± 0.557
ProGly: 3.903 ± 1.206
ProHis: 1.951 ± 0.697
ProIle: 2.788 ± 1.071
ProLys: 2.23 ± 0.915
ProLeu: 6.133 ± 1.032
ProMet: 1.115 ± 1.104
ProAsn: 0.558 ± 0.363
ProPro: 5.297 ± 3.735
ProGln: 1.951 ± 1.478
ProArg: 3.067 ± 0.956
ProSer: 5.297 ± 0.979
ProThr: 4.739 ± 0.722
ProVal: 3.903 ± 0.451
ProTrp: 1.115 ± 0.679
ProTyr: 2.509 ± 2.086
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.067 ± 0.806
GlnCys: 0.279 ± 0.445
GlnAsp: 1.394 ± 0.498
GlnGlu: 3.067 ± 0.894
GlnPhe: 0.558 ± 0.322
GlnGly: 1.951 ± 0.933
GlnHis: 0.558 ± 0.322
GlnIle: 1.394 ± 0.602
GlnLys: 1.673 ± 0.692
GlnLeu: 3.345 ± 0.498
GlnMet: 1.115 ± 0.766
GlnAsn: 0.836 ± 0.483
GlnPro: 1.394 ± 1.241
GlnGln: 1.394 ± 1.117
GlnArg: 2.23 ± 1.216
GlnSer: 1.673 ± 0.623
GlnThr: 1.951 ± 0.697
GlnVal: 2.23 ± 0.54
GlnTrp: 0.836 ± 0.413
GlnTyr: 0.279 ± 0.445
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.461 ± 1.511
ArgCys: 1.115 ± 0.396
ArgAsp: 3.624 ± 1.451
ArgGlu: 3.903 ± 1.099
ArgPhe: 2.788 ± 0.843
ArgGly: 2.509 ± 0.883
ArgHis: 2.509 ± 0.635
ArgIle: 2.788 ± 1.378
ArgLys: 2.788 ± 1.08
ArgLeu: 7.806 ± 1.329
ArgMet: 1.673 ± 0.482
ArgAsn: 1.394 ± 0.509
ArgPro: 3.067 ± 0.792
ArgGln: 3.067 ± 0.759
ArgArg: 5.854 ± 0.348
ArgSer: 3.067 ± 1.167
ArgThr: 2.788 ± 0.894
ArgVal: 6.691 ± 1.676
ArgTrp: 1.394 ± 0.805
ArgTyr: 1.673 ± 0.711
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.067 ± 0.714
SerCys: 0.558 ± 0.339
SerAsp: 3.345 ± 1.187
SerGlu: 4.461 ± 0.587
SerPhe: 2.23 ± 1.106
SerGly: 5.854 ± 1.746
SerHis: 1.115 ± 0.881
SerIle: 3.903 ± 1.395
SerLys: 3.067 ± 1.326
SerLeu: 9.2 ± 1.552
SerMet: 0.558 ± 0.339
SerAsn: 1.951 ± 0.763
SerPro: 4.461 ± 1.867
SerGln: 0.836 ± 0.571
SerArg: 3.903 ± 1.186
SerSer: 5.018 ± 2.067
SerThr: 5.854 ± 1.447
SerVal: 8.085 ± 1.353
SerTrp: 1.394 ± 0.492
SerTyr: 1.951 ± 0.724
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.624 ± 1.812
ThrCys: 1.115 ± 0.679
ThrAsp: 4.739 ± 0.65
ThrGlu: 3.624 ± 1.309
ThrPhe: 4.182 ± 0.829
ThrGly: 3.345 ± 1.303
ThrHis: 2.509 ± 0.718
ThrIle: 3.903 ± 0.911
ThrLys: 3.067 ± 0.77
ThrLeu: 6.691 ± 0.629
ThrMet: 1.951 ± 1.014
ThrAsn: 0.836 ± 0.406
ThrPro: 5.576 ± 2.206
ThrGln: 1.673 ± 0.795
ThrArg: 3.624 ± 0.875
ThrSer: 5.576 ± 1.288
ThrThr: 3.903 ± 1.42
ThrVal: 3.903 ± 1.255
ThrTrp: 1.394 ± 0.471
ThrTyr: 2.23 ± 0.828
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.903 ± 0.808
ValCys: 2.788 ± 0.932
ValAsp: 2.788 ± 1.008
ValGlu: 3.903 ± 0.988
ValPhe: 2.23 ± 1.288
ValGly: 4.461 ± 1.13
ValHis: 1.951 ± 0.849
ValIle: 4.461 ± 1.517
ValLys: 2.788 ± 0.582
ValLeu: 5.576 ± 1.37
ValMet: 0.836 ± 0.344
ValAsn: 3.067 ± 0.714
ValPro: 5.297 ± 1.263
ValGln: 1.951 ± 0.724
ValArg: 3.903 ± 0.592
ValSer: 5.018 ± 1.514
ValThr: 4.461 ± 1.049
ValVal: 5.854 ± 2.586
ValTrp: 1.951 ± 0.658
ValTyr: 3.067 ± 0.781
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.115 ± 0.644
TrpCys: 0.0 ± 0.0
TrpAsp: 1.951 ± 0.321
TrpGlu: 1.673 ± 0.692
TrpPhe: 0.836 ± 0.406
TrpGly: 1.394 ± 0.679
TrpHis: 1.115 ± 0.44
TrpIle: 0.279 ± 0.468
TrpLys: 0.279 ± 0.445
TrpLeu: 1.951 ± 1.06
TrpMet: 0.558 ± 0.853
TrpAsn: 1.394 ± 0.668
TrpPro: 0.558 ± 0.445
TrpGln: 0.279 ± 0.468
TrpArg: 1.115 ± 0.679
TrpSer: 2.509 ± 1.075
TrpThr: 0.279 ± 0.468
TrpVal: 1.673 ± 0.292
TrpTrp: 0.279 ± 0.161
TrpTyr: 1.394 ± 0.471
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.115 ± 0.679
TyrCys: 1.115 ± 0.44
TyrAsp: 1.394 ± 0.898
TyrGlu: 1.673 ± 0.292
TyrPhe: 1.115 ± 0.396
TyrGly: 3.345 ± 0.913
TyrHis: 1.394 ± 0.991
TyrIle: 0.836 ± 0.483
TyrLys: 2.509 ± 0.762
TyrLeu: 3.345 ± 0.672
TyrMet: 0.558 ± 0.322
TyrAsn: 1.951 ± 0.403
TyrPro: 1.951 ± 1.089
TyrGln: 1.673 ± 0.716
TyrArg: 3.345 ± 1.772
TyrSer: 3.067 ± 0.337
TyrThr: 1.951 ± 1.422
TyrVal: 1.673 ± 0.648
TyrTrp: 0.558 ± 0.339
TyrTyr: 1.394 ± 0.727
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3588 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level
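A protein-level bootstrap of this kind can be sketched in Python as follows. This is not the database's actual code. The names `pair_permille` and `bootstrap_error`, the fixed seed, the use of the standard deviation across replicates as the reported error, and the toy sequences are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)

toy_proteins = ["MKLLAR", "LLRGT", "MSLRLL", "AAKLV", "GLRSS"]  # toy data only
err = bootstrap_error(toy_proteins, "LR")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is what "at the protein level" is taken to mean here.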

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski