Amino acid dipeptide frequency for Hirame rhabdovirus (strain Korea/CA 9703/1997) (HIRRV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each one would be present at a frequency of 0.25%. This is not the case in nature, so for better visibility, dipeptides that are more frequent than expected are marked in red and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions; the uniform expectation of 0.25% corresponds to 2.5‰.
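The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual pipeline: the function name `dipeptide_permille`, the toy sequences, and the choice to count overlapping pairs within each protein and divide by the total number of pairs are all assumptions made here.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption: pairs are counted with overlap and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Toy input (hypothetical sequences, not the HIRRV proteome).
toy = ["MKLLA", "MKUA"]
freqs = dipeptide_permille(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
uniform_expectation = 1000.0 / 400
over_represented = sorted(d for d, f in freqs.items() if f > uniform_expectation)
```

With real data, a cell would be coloured red or blue depending on whether its value lies above or below `uniform_expectation`; on the toy input only a handful of dipeptides are non-zero, so all of them end up in `over_represented`.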

For more information, see the sequence space article on Wikipedia.

Rows give the first residue, columns the second residue; each cell is the frequency ± error in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 6.733 ± 1.074 | 1.756 ± 0.954 | 3.22 ± 0.7 | 3.806 ± 1.434 | 1.756 ± 0.563 | 3.513 ± 1.044 | 0.878 ± 0.684 | 4.684 ± 0.602 | 3.806 ± 0.948 | 9.66 ± 2.086 | 1.171 ± 0.35 | 1.756 ± 0.835 | 3.806 ± 0.575 | 3.806 ± 0.662 | 3.806 ± 1.321 | 3.513 ± 0.483 | 5.562 ± 0.526 | 3.806 ± 1.528 | 1.756 ± 0.655 | 1.756 ± 0.671 | 0.0 ± 0.0
Cys | 1.464 ± 1.117 | 0.293 ± 0.384 | 1.464 ± 0.491 | 0.293 ± 0.169 | 0.293 ± 0.169 | 0.585 ± 0.318 | 0.0 ± 0.0 | 0.293 ± 0.416 | 0.293 ± 0.384 | 0.878 ± 0.335 | 1.171 ± 1.199 | 0.293 ± 0.169 | 1.756 ± 0.585 | 1.171 ± 0.727 | 0.293 ± 0.169 | 2.342 ± 0.703 | 0.878 ± 0.508 | 0.293 ± 0.169 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 2.342 ± 0.795 | 0.585 ± 0.318 | 3.22 ± 1.524 | 6.148 ± 1.309 | 1.756 ± 0.671 | 4.684 ± 1.85 | 0.293 ± 0.169 | 4.098 ± 0.919 | 4.098 ± 0.714 | 9.368 ± 1.388 | 2.927 ± 1.466 | 2.635 ± 0.426 | 2.927 ± 0.963 | 1.756 ± 0.458 | 3.806 ± 0.662 | 4.391 ± 1.112 | 3.22 ± 1.943 | 2.342 ± 0.975 | 0.878 ± 0.508 | 1.171 ± 0.503 | 0.0 ± 0.0
Glu | 4.684 ± 2.113 | 1.171 ± 0.425 | 5.269 ± 2.258 | 4.684 ± 2.122 | 2.635 ± 0.803 | 5.562 ± 1.767 | 1.171 ± 0.941 | 5.562 ± 0.794 | 6.148 ± 1.564 | 4.684 ± 2.359 | 1.464 ± 0.491 | 2.927 ± 0.982 | 0.585 ± 0.471 | 2.635 ± 1.587 | 2.927 ± 1.062 | 4.098 ± 2.006 | 4.098 ± 1.569 | 2.635 ± 0.433 | 1.171 ± 0.425 | 1.464 ± 1.241 | 0.0 ± 0.0
Phe | 2.049 ± 0.537 | 0.585 ± 0.318 | 2.927 ± 0.649 | 0.878 ± 0.467 | 2.342 ± 0.85 | 3.806 ± 1.645 | 0.585 ± 0.318 | 2.342 ± 0.682 | 1.756 ± 1.016 | 3.513 ± 1.316 | 0.878 ± 0.593 | 1.171 ± 0.468 | 2.049 ± 0.78 | 1.464 ± 0.943 | 2.342 ± 0.688 | 2.635 ± 1.184 | 2.049 ± 0.537 | 1.464 ± 0.628 | 0.293 ± 0.384 | 1.171 ± 0.425 | 0.0 ± 0.0
Gly | 3.22 ± 1.466 | 0.585 ± 0.491 | 4.684 ± 1.507 | 5.269 ± 1.462 | 2.927 ± 1.171 | 4.977 ± 1.845 | 1.171 ± 1.063 | 4.098 ± 1.044 | 4.098 ± 1.548 | 8.782 ± 1.599 | 1.756 ± 0.686 | 2.342 ± 0.551 | 1.464 ± 0.591 | 1.464 ± 0.628 | 2.049 ± 0.838 | 4.098 ± 1.522 | 2.927 ± 1.452 | 4.391 ± 0.886 | 0.585 ± 0.339 | 2.049 ± 0.674 | 0.0 ± 0.0
His | 1.756 ± 0.7 | 0.293 ± 0.169 | 0.585 ± 0.318 | 0.878 ± 0.474 | 0.585 ± 0.339 | 2.049 ± 1.087 | 0.878 ± 0.684 | 1.756 ± 0.7 | 1.171 ± 0.425 | 2.635 ± 0.607 | 0.293 ± 0.169 | 0.293 ± 0.384 | 1.756 ± 1.6 | 1.171 ± 0.425 | 1.756 ± 0.458 | 0.585 ± 0.339 | 0.878 ± 0.335 | 1.464 ± 0.491 | 0.585 ± 0.318 | 0.585 ± 0.339 | 0.0 ± 0.0
Ile | 3.513 ± 0.45 | 0.585 ± 0.339 | 4.098 ± 1.329 | 4.977 ± 1.536 | 1.171 ± 0.636 | 1.756 ± 0.732 | 1.756 ± 0.7 | 3.806 ± 1.174 | 4.098 ± 0.859 | 5.562 ± 0.66 | 0.878 ± 0.427 | 2.635 ± 1.363 | 3.22 ± 1.577 | 0.878 ± 0.508 | 4.977 ± 1.355 | 4.098 ± 1.259 | 6.733 ± 1.25 | 2.635 ± 0.908 | 0.585 ± 0.339 | 2.049 ± 0.928 | 0.0 ± 0.0
Lys | 3.513 ± 0.944 | 1.464 ± 0.799 | 2.049 ± 0.993 | 4.098 ± 1.513 | 2.635 ± 0.962 | 5.269 ± 0.869 | 0.878 ± 0.508 | 3.806 ± 1.428 | 4.684 ± 1.937 | 7.611 ± 1.982 | 1.171 ± 0.648 | 1.464 ± 1.033 | 2.342 ± 1.67 | 1.756 ± 0.948 | 4.391 ± 1.228 | 4.098 ± 0.66 | 4.977 ± 1.408 | 3.806 ± 1.743 | 0.293 ± 0.169 | 1.756 ± 1.834 | 0.0 ± 0.0
Leu | 8.197 ± 1.504 | 2.342 ± 1.101 | 8.197 ± 2.189 | 7.026 ± 1.577 | 6.148 ± 2.673 | 6.148 ± 0.926 | 3.22 ± 0.899 | 4.391 ± 0.707 | 4.684 ± 1.48 | 12.295 ± 2.883 | 3.22 ± 1.48 | 3.806 ± 0.807 | 4.391 ± 1.772 | 3.513 ± 1.166 | 6.148 ± 1.262 | 10.246 ± 2.749 | 7.904 ± 1.252 | 5.855 ± 1.101 | 0.293 ± 0.169 | 2.635 ± 1.348 | 0.0 ± 0.0
Met | 1.756 ± 0.837 | 0.293 ± 0.525 | 3.22 ± 0.865 | 1.171 ± 0.503 | 0.293 ± 0.169 | 0.878 ± 0.508 | 0.293 ± 0.384 | 0.878 ± 0.335 | 1.171 ± 0.35 | 1.171 ± 0.575 | 0.878 ± 0.427 | 1.171 ± 0.734 | 0.878 ± 0.335 | 0.878 ± 0.508 | 1.756 ± 1.326 | 2.927 ± 1.719 | 3.22 ± 0.954 | 1.171 ± 1.434 | 0.293 ± 0.169 | 0.585 ± 0.768 | 0.0 ± 0.0
Asn | 1.171 ± 1.23 | 0.0 ± 0.0 | 2.635 ± 1.256 | 2.635 ± 1.432 | 1.171 ± 0.503 | 0.878 ± 0.427 | 0.293 ± 0.169 | 2.049 ± 0.884 | 1.756 ± 0.732 | 4.684 ± 1.959 | 1.464 ± 0.566 | 1.171 ± 0.734 | 1.464 ± 0.347 | 0.293 ± 0.416 | 3.806 ± 1.324 | 3.513 ± 0.947 | 0.293 ± 0.169 | 1.171 ± 0.761 | 0.585 ± 0.318 | 0.585 ± 0.491 | 0.0 ± 0.0
Pro | 3.806 ± 1.545 | 0.293 ± 0.384 | 1.464 ± 0.347 | 2.342 ± 0.804 | 0.878 ± 0.427 | 1.756 ± 0.854 | 2.049 ± 0.682 | 2.049 ± 0.577 | 1.464 ± 1.054 | 6.148 ± 1.665 | 0.585 ± 0.318 | 0.878 ± 0.577 | 2.049 ± 1.643 | 1.464 ± 1.045 | 5.269 ± 1.606 | 2.927 ± 1.491 | 3.22 ± 0.746 | 3.513 ± 0.867 | 0.878 ± 0.684 | 2.635 ± 0.973 | 0.0 ± 0.0
Gln | 3.806 ± 1.706 | 0.293 ± 0.169 | 1.171 ± 0.678 | 3.806 ± 1.624 | 1.464 ± 0.635 | 2.342 ± 0.585 | 1.171 ± 0.678 | 1.464 ± 0.719 | 2.049 ± 0.918 | 1.756 ± 0.458 | 0.293 ± 0.158 | 0.293 ± 0.169 | 1.171 ± 0.53 | 1.756 ± 0.687 | 2.049 ± 0.838 | 2.635 ± 0.607 | 3.513 ± 1.123 | 0.878 ± 0.631 | 0.293 ± 0.384 | 1.756 ± 1.209 | 0.0 ± 0.0
Arg | 5.269 ± 1.032 | 0.585 ± 0.768 | 3.22 ± 1.506 | 5.855 ± 1.816 | 0.878 ± 0.577 | 5.562 ± 1.136 | 0.878 ± 1.216 | 3.22 ± 1.32 | 4.098 ± 0.804 | 8.197 ± 2.814 | 1.756 ± 1.189 | 0.293 ± 0.169 | 2.635 ± 0.975 | 2.342 ± 0.695 | 2.927 ± 0.763 | 3.806 ± 1.163 | 3.806 ± 1.153 | 5.269 ± 1.279 | 0.585 ± 0.491 | 1.464 ± 0.738 | 0.0 ± 0.0
Ser | 5.855 ± 0.671 | 0.293 ± 0.169 | 5.269 ± 1.171 | 4.098 ± 1.048 | 2.635 ± 1.773 | 3.806 ± 1.194 | 1.464 ± 0.719 | 4.391 ± 1.077 | 3.806 ± 1.185 | 8.197 ± 1.985 | 1.756 ± 0.723 | 1.756 ± 0.599 | 4.977 ± 0.669 | 4.391 ± 1.072 | 5.562 ± 0.713 | 4.977 ± 1.777 | 3.22 ± 0.656 | 3.22 ± 1.622 | 0.585 ± 0.339 | 0.878 ± 0.427 | 0.0 ± 0.0
Thr | 4.977 ± 0.877 | 1.171 ± 0.781 | 3.806 ± 0.848 | 2.927 ± 1.492 | 2.049 ± 0.682 | 2.049 ± 0.763 | 2.342 ± 1.015 | 5.562 ± 1.577 | 5.269 ± 1.21 | 6.733 ± 1.651 | 2.342 ± 0.979 | 3.513 ± 1.032 | 3.513 ± 1.273 | 1.464 ± 0.554 | 3.513 ± 1.683 | 4.391 ± 0.661 | 3.806 ± 1.904 | 4.684 ± 1.633 | 1.464 ± 0.554 | 1.464 ± 0.928 | 0.0 ± 0.0
Val | 3.806 ± 1.472 | 0.585 ± 0.471 | 4.391 ± 1.726 | 3.22 ± 0.441 | 3.22 ± 1.133 | 3.22 ± 1.396 | 0.878 ± 0.684 | 3.22 ± 0.87 | 5.269 ± 2.402 | 5.562 ± 0.848 | 0.585 ± 0.339 | 1.171 ± 0.704 | 2.927 ± 0.763 | 0.585 ± 0.318 | 3.22 ± 1.18 | 3.513 ± 1.34 | 2.927 ± 0.623 | 4.098 ± 1.991 | 1.171 ± 0.621 | 1.756 ± 0.954 | 0.0 ± 0.0
Trp | 1.171 ± 0.425 | 0.293 ± 0.169 | 0.0 ± 0.0 | 1.171 ± 0.678 | 0.0 ± 0.0 | 1.464 ± 0.996 | 0.0 ± 0.0 | 0.878 ± 0.335 | 1.756 ± 0.7 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.878 ± 0.593 | 0.585 ± 0.318 | 0.293 ± 0.567 | 0.878 ± 0.508 | 1.171 ± 0.924 | 0.585 ± 0.339 | 0.878 ± 0.508 | 0.0 ± 0.0 | 0.585 ± 0.318 | 0.0 ± 0.0
Tyr | 1.464 ± 0.53 | 0.585 ± 0.547 | 1.756 ± 2.09 | 0.293 ± 0.525 | 1.464 ± 0.347 | 2.635 ± 1.015 | 1.756 ± 0.671 | 1.464 ± 0.847 | 0.585 ± 0.756 | 3.22 ± 1.33 | 0.0 ± 0.362 | 1.171 ± 0.636 | 1.171 ± 0.636 | 0.878 ± 0.467 | 1.171 ± 1.113 | 1.171 ± 0.503 | 3.22 ± 1.157 | 1.756 ± 0.585 | 0.293 ± 0.169 | 2.342 ± 1.09 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 6 proteins (3417 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
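A protein-level bootstrap of this kind might look like the sketch below. The page only states that the error comes from 100 bootstrap rounds at the protein level; resampling whole proteins with replacement, recomputing the pooled frequency each round, and reporting the standard deviation of the replicates are assumptions made here, as are the names `pair_counts` and `bootstrap_sd`.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_sd(proteins, dipeptide, rounds=100, seed=0):
    """Spread (in per mille) of one dipeptide's frequency across bootstrap replicates.

    Assumption: each replicate draws len(proteins) whole proteins with
    replacement and pools their dipeptide counts.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences, not the HIRRV proteome).
toy = ["MKLLA", "MKLA", "AAAA"]
sd_mk = bootstrap_sd(toy, "MK")
```

Because whole proteins are resampled, a dipeptide that occurs in only one of a few proteins gets a large error relative to its value, which would be consistent with several entries in the table where the error exceeds the frequency itself.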

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski