Amino acid dipepetide frequency for Hubei dimarhabdovirus virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.106AlaAla: 3.106 ± 0.778
0.0AlaCys: 0.0 ± 0.0
2.823AlaAsp: 2.823 ± 0.76
1.976AlaGlu: 1.976 ± 1.035
1.129AlaPhe: 1.129 ± 0.465
1.694AlaGly: 1.694 ± 0.434
0.847AlaHis: 0.847 ± 0.549
3.67AlaIle: 3.67 ± 0.904
1.976AlaLys: 1.976 ± 0.99
7.058AlaLeu: 7.058 ± 1.632
0.565AlaMet: 0.565 ± 0.333
1.412AlaAsn: 1.412 ± 0.582
1.129AlaPro: 1.129 ± 1.043
2.823AlaGln: 2.823 ± 0.344
1.694AlaArg: 1.694 ± 0.99
3.67AlaSer: 3.67 ± 1.681
3.388AlaThr: 3.388 ± 0.455
3.106AlaVal: 3.106 ± 1.108
0.282AlaTrp: 0.282 ± 0.154
1.694AlaTyr: 1.694 ± 0.748
0.0AlaXaa: 0.0 ± 0.0
Cys
2.259CysAla: 2.259 ± 1.351
0.282CysCys: 0.282 ± 0.154
0.565CysAsp: 0.565 ± 0.307
1.412CysGlu: 1.412 ± 0.458
0.847CysPhe: 0.847 ± 0.461
0.565CysGly: 0.565 ± 0.719
0.0CysHis: 0.0 ± 0.0
0.847CysIle: 0.847 ± 0.289
1.129CysLys: 1.129 ± 0.709
0.847CysLeu: 0.847 ± 0.461
0.0CysMet: 0.0 ± 0.0
1.129CysAsn: 1.129 ± 0.341
0.847CysPro: 0.847 ± 0.384
0.565CysGln: 0.565 ± 0.312
0.0CysArg: 0.0 ± 0.0
1.976CysSer: 1.976 ± 0.572
1.412CysThr: 1.412 ± 0.452
1.129CysVal: 1.129 ± 0.614
0.847CysTrp: 0.847 ± 0.461
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.694AspAla: 1.694 ± 0.458
1.129AspCys: 1.129 ± 0.666
3.106AspAsp: 3.106 ± 1.591
3.953AspGlu: 3.953 ± 0.767
2.259AspPhe: 2.259 ± 0.614
3.388AspGly: 3.388 ± 0.761
1.129AspHis: 1.129 ± 0.519
3.106AspIle: 3.106 ± 0.736
4.8AspLys: 4.8 ± 1.226
6.776AspLeu: 6.776 ± 1.596
1.976AspMet: 1.976 ± 1.017
1.976AspAsn: 1.976 ± 0.481
4.235AspPro: 4.235 ± 0.992
1.976AspGln: 1.976 ± 0.816
1.976AspArg: 1.976 ± 0.613
1.694AspSer: 1.694 ± 0.737
2.541AspThr: 2.541 ± 0.595
3.67AspVal: 3.67 ± 1.397
0.847AspTrp: 0.847 ± 0.551
3.67AspTyr: 3.67 ± 1.129
0.0AspXaa: 0.0 ± 0.0
Glu
2.259GluAla: 2.259 ± 0.77
0.282GluCys: 0.282 ± 0.399
2.823GluAsp: 2.823 ± 0.982
2.259GluGlu: 2.259 ± 0.583
2.259GluPhe: 2.259 ± 0.718
2.823GluGly: 2.823 ± 0.93
0.565GluHis: 0.565 ± 0.307
4.8GluIle: 4.8 ± 0.839
3.388GluLys: 3.388 ± 0.319
6.211GluLeu: 6.211 ± 1.527
1.412GluMet: 1.412 ± 0.862
2.541GluAsn: 2.541 ± 0.776
0.847GluPro: 0.847 ± 0.384
3.388GluGln: 3.388 ± 0.779
1.976GluArg: 1.976 ± 1.022
3.67GluSer: 3.67 ± 1.013
5.082GluThr: 5.082 ± 1.449
3.106GluVal: 3.106 ± 1.272
0.282GluTrp: 0.282 ± 0.399
2.541GluTyr: 2.541 ± 0.489
0.0GluXaa: 0.0 ± 0.0
Phe
1.412PheAla: 1.412 ± 0.288
0.847PheCys: 0.847 ± 0.461
1.976PheAsp: 1.976 ± 0.588
1.412PheGlu: 1.412 ± 0.595
2.541PhePhe: 2.541 ± 1.323
3.106PheGly: 3.106 ± 0.978
1.412PheHis: 1.412 ± 0.288
3.106PheIle: 3.106 ± 0.757
2.823PheLys: 2.823 ± 0.486
5.082PheLeu: 5.082 ± 1.107
0.847PheMet: 0.847 ± 0.404
2.823PheAsn: 2.823 ± 0.604
3.67PhePro: 3.67 ± 0.99
0.847PheGln: 0.847 ± 0.461
3.106PheArg: 3.106 ± 0.735
3.106PheSer: 3.106 ± 0.783
2.823PheThr: 2.823 ± 2.064
3.106PheVal: 3.106 ± 1.362
0.282PheTrp: 0.282 ± 0.154
1.412PheTyr: 1.412 ± 0.857
0.0PheXaa: 0.0 ± 0.0
Gly
2.259GlyAla: 2.259 ± 0.575
1.129GlyCys: 1.129 ± 0.666
3.67GlyAsp: 3.67 ± 0.882
3.106GlyGlu: 3.106 ± 0.736
2.823GlyPhe: 2.823 ± 0.583
2.823GlyGly: 2.823 ± 0.887
1.129GlyHis: 1.129 ± 0.625
4.517GlyIle: 4.517 ± 0.823
1.694GlyLys: 1.694 ± 0.714
7.058GlyLeu: 7.058 ± 1.235
1.694GlyMet: 1.694 ± 0.714
3.67GlyAsn: 3.67 ± 0.759
2.259GlyPro: 2.259 ± 1.03
1.412GlyGln: 1.412 ± 0.768
1.129GlyArg: 1.129 ± 0.507
5.647GlySer: 5.647 ± 0.645
2.259GlyThr: 2.259 ± 0.929
3.953GlyVal: 3.953 ± 0.941
0.847GlyTrp: 0.847 ± 0.461
2.259GlyTyr: 2.259 ± 0.683
0.0GlyXaa: 0.0 ± 0.0
His
0.565HisAla: 0.565 ± 0.491
0.282HisCys: 0.282 ± 0.154
0.565HisAsp: 0.565 ± 0.333
1.129HisGlu: 1.129 ± 0.303
0.565HisPhe: 0.565 ± 0.307
0.847HisGly: 0.847 ± 0.699
1.129HisHis: 1.129 ± 0.86
1.412HisIle: 1.412 ± 0.452
1.694HisLys: 1.694 ± 0.578
3.67HisLeu: 3.67 ± 0.725
0.282HisMet: 0.282 ± 0.359
0.847HisAsn: 0.847 ± 0.424
1.976HisPro: 1.976 ± 0.853
1.976HisGln: 1.976 ± 0.83
2.823HisArg: 2.823 ± 0.874
1.694HisSer: 1.694 ± 0.684
1.412HisThr: 1.412 ± 0.985
1.694HisVal: 1.694 ± 1.093
0.565HisTrp: 0.565 ± 0.307
1.412HisTyr: 1.412 ± 0.582
0.0HisXaa: 0.0 ± 0.0
Ile
3.106IleAla: 3.106 ± 0.821
0.847IleCys: 0.847 ± 0.549
3.67IleAsp: 3.67 ± 0.334
3.388IleGlu: 3.388 ± 0.904
3.388IlePhe: 3.388 ± 0.631
4.517IleGly: 4.517 ± 1.388
1.976IleHis: 1.976 ± 1.212
5.647IleIle: 5.647 ± 2.222
5.929IleLys: 5.929 ± 2.11
5.929IleLeu: 5.929 ± 1.519
0.847IleMet: 0.847 ± 0.62
4.235IleAsn: 4.235 ± 1.534
5.082IlePro: 5.082 ± 0.956
2.823IleGln: 2.823 ± 1.253
6.494IleArg: 6.494 ± 1.525
5.647IleSer: 5.647 ± 1.782
3.388IleThr: 3.388 ± 0.755
1.694IleVal: 1.694 ± 0.848
0.847IleTrp: 0.847 ± 0.289
1.976IleTyr: 1.976 ± 0.608
0.0IleXaa: 0.0 ± 0.0
Lys
2.823LysAla: 2.823 ± 1.274
1.694LysCys: 1.694 ± 0.937
3.953LysAsp: 3.953 ± 0.907
4.8LysGlu: 4.8 ± 1.096
2.823LysPhe: 2.823 ± 0.93
3.67LysGly: 3.67 ± 0.397
0.847LysHis: 0.847 ± 0.289
3.106LysIle: 3.106 ± 1.338
2.541LysLys: 2.541 ± 0.767
6.776LysLeu: 6.776 ± 0.888
1.129LysMet: 1.129 ± 0.456
3.67LysAsn: 3.67 ± 1.752
4.235LysPro: 4.235 ± 2.165
1.412LysGln: 1.412 ± 0.444
2.259LysArg: 2.259 ± 0.945
3.388LysSer: 3.388 ± 0.697
5.929LysThr: 5.929 ± 1.851
3.388LysVal: 3.388 ± 1.024
1.129LysTrp: 1.129 ± 0.511
1.694LysTyr: 1.694 ± 0.578
0.0LysXaa: 0.0 ± 0.0
Leu
5.647LeuAla: 5.647 ± 0.638
1.129LeuCys: 1.129 ± 0.922
3.953LeuAsp: 3.953 ± 1.387
4.8LeuGlu: 4.8 ± 0.207
4.517LeuPhe: 4.517 ± 1.057
5.364LeuGly: 5.364 ± 0.548
2.541LeuHis: 2.541 ± 0.539
9.317LeuIle: 9.317 ± 1.661
8.187LeuLys: 8.187 ± 1.01
10.164LeuLeu: 10.164 ± 1.057
3.106LeuMet: 3.106 ± 0.923
3.388LeuAsn: 3.388 ± 1.709
3.953LeuPro: 3.953 ± 2.091
1.694LeuGln: 1.694 ± 0.974
7.058LeuArg: 7.058 ± 2.246
7.34LeuSer: 7.34 ± 1.339
11.293LeuThr: 11.293 ± 1.544
5.647LeuVal: 5.647 ± 1.255
0.847LeuTrp: 0.847 ± 0.384
4.517LeuTyr: 4.517 ± 0.708
0.0LeuXaa: 0.0 ± 0.0
Met
1.129MetAla: 1.129 ± 0.465
0.282MetCys: 0.282 ± 0.154
1.976MetAsp: 1.976 ± 0.552
0.847MetGlu: 0.847 ± 0.424
1.412MetPhe: 1.412 ± 0.582
0.565MetGly: 0.565 ± 0.333
0.282MetHis: 0.282 ± 0.399
1.412MetIle: 1.412 ± 0.582
1.976MetLys: 1.976 ± 0.851
1.412MetLeu: 1.412 ± 0.536
0.565MetMet: 0.565 ± 0.307
1.412MetAsn: 1.412 ± 0.582
0.847MetPro: 0.847 ± 0.676
0.282MetGln: 0.282 ± 0.546
0.565MetArg: 0.565 ± 0.307
1.976MetSer: 1.976 ± 0.615
1.694MetThr: 1.694 ± 0.698
1.129MetVal: 1.129 ± 0.519
0.565MetTrp: 0.565 ± 0.333
1.129MetTyr: 1.129 ± 0.465
0.0MetXaa: 0.0 ± 0.0
Asn
2.259AsnAla: 2.259 ± 0.59
0.847AsnCys: 0.847 ± 0.461
2.259AsnAsp: 2.259 ± 0.929
1.129AsnGlu: 1.129 ± 0.465
2.259AsnPhe: 2.259 ± 0.583
2.259AsnGly: 2.259 ± 1.595
1.694AsnHis: 1.694 ± 0.999
3.953AsnIle: 3.953 ± 0.875
2.823AsnLys: 2.823 ± 1.2
6.776AsnLeu: 6.776 ± 1.477
0.565AsnMet: 0.565 ± 0.403
2.541AsnAsn: 2.541 ± 1.043
4.235AsnPro: 4.235 ± 1.409
2.823AsnGln: 2.823 ± 0.62
1.976AsnArg: 1.976 ± 0.905
5.082AsnSer: 5.082 ± 1.009
3.67AsnThr: 3.67 ± 1.574
1.129AsnVal: 1.129 ± 0.341
1.976AsnTrp: 1.976 ± 0.507
2.259AsnTyr: 2.259 ± 0.864
0.0AsnXaa: 0.0 ± 0.0
Pro
3.388ProAla: 3.388 ± 1.385
0.847ProCys: 0.847 ± 0.374
2.823ProAsp: 2.823 ± 1.027
1.976ProGlu: 1.976 ± 0.899
2.259ProPhe: 2.259 ± 1.199
3.67ProGly: 3.67 ± 0.652
1.129ProHis: 1.129 ± 0.625
3.106ProIle: 3.106 ± 1.733
3.106ProLys: 3.106 ± 1.511
5.082ProLeu: 5.082 ± 1.413
0.847ProMet: 0.847 ± 0.374
1.694ProAsn: 1.694 ± 0.348
2.259ProPro: 2.259 ± 1.537
1.129ProGln: 1.129 ± 1.016
2.823ProArg: 2.823 ± 0.361
5.929ProSer: 5.929 ± 1.059
2.541ProThr: 2.541 ± 0.916
1.976ProVal: 1.976 ± 1.092
0.565ProTrp: 0.565 ± 0.501
2.541ProTyr: 2.541 ± 1.108
0.0ProXaa: 0.0 ± 0.0
Gln
0.282GlnAla: 0.282 ± 0.154
0.282GlnCys: 0.282 ± 0.154
1.694GlnAsp: 1.694 ± 0.875
1.976GlnGlu: 1.976 ± 0.682
1.412GlnPhe: 1.412 ± 0.871
1.976GlnGly: 1.976 ± 1.075
0.565GlnHis: 0.565 ± 0.312
1.976GlnIle: 1.976 ± 0.369
1.129GlnLys: 1.129 ± 0.507
3.953GlnLeu: 3.953 ± 1.381
0.565GlnMet: 0.565 ± 0.307
2.259GlnAsn: 2.259 ± 0.718
0.565GlnPro: 0.565 ± 0.491
0.565GlnGln: 0.565 ± 0.333
2.541GlnArg: 2.541 ± 0.537
3.388GlnSer: 3.388 ± 2.107
2.541GlnThr: 2.541 ± 1.643
2.823GlnVal: 2.823 ± 0.514
0.282GlnTrp: 0.282 ± 0.154
1.129GlnTyr: 1.129 ± 0.983
0.0GlnXaa: 0.0 ± 0.0
Arg
2.541ArgAla: 2.541 ± 0.804
1.412ArgCys: 1.412 ± 0.532
2.823ArgAsp: 2.823 ± 0.93
2.541ArgGlu: 2.541 ± 1.08
2.823ArgPhe: 2.823 ± 0.604
3.106ArgGly: 3.106 ± 0.923
1.412ArgHis: 1.412 ± 0.532
3.953ArgIle: 3.953 ± 1.04
1.694ArgLys: 1.694 ± 0.646
3.388ArgLeu: 3.388 ± 0.834
1.412ArgMet: 1.412 ± 0.458
3.67ArgAsn: 3.67 ± 0.562
2.259ArgPro: 2.259 ± 0.491
1.412ArgGln: 1.412 ± 0.495
1.412ArgArg: 1.412 ± 0.532
4.235ArgSer: 4.235 ± 0.969
4.235ArgThr: 4.235 ± 0.641
3.953ArgVal: 3.953 ± 0.305
0.565ArgTrp: 0.565 ± 0.333
2.259ArgTyr: 2.259 ± 1.72
0.0ArgXaa: 0.0 ± 0.0
Ser
4.8SerAla: 4.8 ± 1.073
2.259SerCys: 2.259 ± 1.88
6.494SerAsp: 6.494 ± 2.455
4.517SerGlu: 4.517 ± 1.087
2.541SerPhe: 2.541 ± 0.954
4.517SerGly: 4.517 ± 1.11
2.259SerHis: 2.259 ± 0.996
5.082SerIle: 5.082 ± 0.956
4.517SerLys: 4.517 ± 1.167
8.752SerLeu: 8.752 ± 1.733
0.847SerMet: 0.847 ± 0.482
4.8SerAsn: 4.8 ± 1.354
3.388SerPro: 3.388 ± 0.676
2.259SerGln: 2.259 ± 0.58
4.235SerArg: 4.235 ± 0.655
5.364SerSer: 5.364 ± 1.444
4.235SerThr: 4.235 ± 0.656
3.953SerVal: 3.953 ± 0.967
1.694SerTrp: 1.694 ± 0.922
3.388SerTyr: 3.388 ± 1.186
0.0SerXaa: 0.0 ± 0.0
Thr
1.129ThrAla: 1.129 ± 0.456
0.847ThrCys: 0.847 ± 0.461
4.517ThrAsp: 4.517 ± 0.726
7.058ThrGlu: 7.058 ± 1.413
2.541ThrPhe: 2.541 ± 1.372
3.67ThrGly: 3.67 ± 1.607
3.953ThrHis: 3.953 ± 0.91
4.8ThrIle: 4.8 ± 1.64
3.953ThrLys: 3.953 ± 1.462
5.082ThrLeu: 5.082 ± 1.34
1.694ThrMet: 1.694 ± 0.714
5.364ThrAsn: 5.364 ± 1.074
3.106ThrPro: 3.106 ± 2.107
2.259ThrGln: 2.259 ± 0.986
4.235ThrArg: 4.235 ± 1.212
5.647ThrSer: 5.647 ± 1.212
6.211ThrThr: 6.211 ± 1.528
2.823ThrVal: 2.823 ± 0.514
0.847ThrTrp: 0.847 ± 0.289
1.694ThrTyr: 1.694 ± 0.434
0.0ThrXaa: 0.0 ± 0.0
Val
1.976ValAla: 1.976 ± 0.793
1.129ValCys: 1.129 ± 0.614
3.67ValAsp: 3.67 ± 0.92
1.412ValGlu: 1.412 ± 0.874
3.67ValPhe: 3.67 ± 0.882
2.541ValGly: 2.541 ± 0.405
2.541ValHis: 2.541 ± 0.949
3.953ValIle: 3.953 ± 0.83
4.235ValLys: 4.235 ± 1.283
5.082ValLeu: 5.082 ± 1.161
1.412ValMet: 1.412 ± 0.768
2.259ValAsn: 2.259 ± 0.683
1.976ValPro: 1.976 ± 0.455
1.412ValGln: 1.412 ± 0.62
1.976ValArg: 1.976 ± 0.575
4.517ValSer: 4.517 ± 0.837
4.517ValThr: 4.517 ± 1.155
2.259ValVal: 2.259 ± 1.438
0.565ValTrp: 0.565 ± 0.333
1.976ValTyr: 1.976 ± 0.706
0.0ValXaa: 0.0 ± 0.0
Trp
0.565TrpAla: 0.565 ± 0.501
0.0TrpCys: 0.0 ± 0.0
0.282TrpAsp: 0.282 ± 0.154
1.129TrpGlu: 1.129 ± 0.456
1.412TrpPhe: 1.412 ± 0.288
1.412TrpGly: 1.412 ± 0.595
0.282TrpHis: 0.282 ± 0.154
1.412TrpIle: 1.412 ± 0.444
1.129TrpLys: 1.129 ± 0.465
1.129TrpLeu: 1.129 ± 0.816
0.282TrpMet: 0.282 ± 0.154
0.847TrpAsn: 0.847 ± 0.461
0.847TrpPro: 0.847 ± 0.461
0.0TrpGln: 0.0 ± 0.0
0.847TrpArg: 0.847 ± 0.289
1.694TrpSer: 1.694 ± 0.848
1.129TrpThr: 1.129 ± 0.625
0.847TrpVal: 0.847 ± 0.775
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.847TyrAla: 0.847 ± 0.461
1.129TyrCys: 1.129 ± 0.511
2.259TyrAsp: 2.259 ± 0.334
1.694TyrGlu: 1.694 ± 0.714
2.541TyrPhe: 2.541 ± 0.489
2.823TyrGly: 2.823 ± 1.186
0.847TyrHis: 0.847 ± 0.482
2.259TyrIle: 2.259 ± 1.706
2.541TyrLys: 2.541 ± 1.382
4.235TyrLeu: 4.235 ± 0.955
1.129TyrMet: 1.129 ± 0.303
1.976TyrAsn: 1.976 ± 0.619
1.976TyrPro: 1.976 ± 0.507
0.847TyrGln: 0.847 ± 0.593
1.976TyrArg: 1.976 ± 0.455
4.235TyrSer: 4.235 ± 1.587
1.129TyrThr: 1.129 ± 0.658
1.694TyrVal: 1.694 ± 0.569
1.412TyrTrp: 1.412 ± 0.936
2.259TyrTyr: 2.259 ± 0.996
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3543 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski