Amino acid dipeptide frequency for Wuhan flea virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies.
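
As a minimal sketch of how such frequencies could be computed (not necessarily how Proteome-pI does it), the Python snippet below counts overlapping adjacent residue pairs in a set of protein sequences and divides by the total number of pairs. The function name `dipeptide_frequencies`, the one-letter toy sequences, and the choice not to count pairs across protein boundaries are assumptions made for illustration.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return {dipeptide: fraction} over all overlapping adjacent pairs.

    Hypothetical helper for illustration; pairs never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        # zip(seq, seq[1:]) yields every overlapping pair of neighbours
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}

# Toy input in one-letter code (made-up sequences, not the real proteome)
freqs = dipeptide_frequencies(["MALLA", "GLLK"])
print(freqs["LL"])
```

The returned values are plain fractions; multiplying them by 1000 gives per mille values.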

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
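
The uniform expectation and the red/blue marking can be illustrated with a few lines of Python. This is only a sketch: the function name `classify` is hypothetical, and the simple comparison against 1/400 is an assumption about how over- and underrepresentation are decided.

```python
# Uniform expectation: 20 x 20 ordered pairs of standard amino acids
N_STANDARD = 20
expected_permille = 1000 / (N_STANDARD ** 2)  # 2.5 per mille, i.e. 0.25%

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"    # would be marked red
    if observed_permille < expected:
        return "under"   # would be marked blue
    return "expected"

# Values taken from the table on this page
print(classify(10.551))  # LeuLeu
print(classify(0.34))    # CysCys
```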

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.744 ± 1.191
AlaCys: 1.021 ± 0.743
AlaAsp: 2.383 ± 0.419
AlaGlu: 1.702 ± 0.331
AlaPhe: 2.383 ± 0.699
AlaGly: 7.488 ± 1.273
AlaHis: 2.723 ± 0.729
AlaIle: 5.786 ± 0.972
AlaLys: 2.723 ± 0.704
AlaLeu: 8.509 ± 1.588
AlaMet: 1.021 ± 0.774
AlaAsn: 1.702 ± 0.543
AlaPro: 2.042 ± 0.912
AlaGln: 2.383 ± 0.95
AlaArg: 4.084 ± 1.703
AlaSer: 2.383 ± 0.56
AlaThr: 6.807 ± 0.584
AlaVal: 1.702 ± 0.743
AlaTrp: 1.702 ± 0.965
AlaTyr: 1.702 ± 0.675
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.042 ± 0.698
CysCys: 0.34 ± 0.248
CysAsp: 1.361 ± 0.963
CysGlu: 0.681 ± 0.261
CysPhe: 0.34 ± 0.363
CysGly: 1.021 ± 0.34
CysHis: 0.0 ± 0.0
CysIle: 1.361 ± 0.519
CysLys: 1.021 ± 0.459
CysLeu: 1.702 ± 1.146
CysMet: 0.0 ± 0.0
CysAsn: 0.34 ± 0.248
CysPro: 0.34 ± 0.363
CysGln: 0.0 ± 0.0
CysArg: 0.34 ± 0.261
CysSer: 0.681 ± 0.299
CysThr: 0.681 ± 0.352
CysVal: 1.021 ± 0.513
CysTrp: 0.0 ± 0.0
CysTyr: 0.681 ± 0.462
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.063 ± 0.61
AspCys: 1.021 ± 0.787
AspAsp: 2.383 ± 0.703
AspGlu: 2.723 ± 1.101
AspPhe: 2.042 ± 0.733
AspGly: 3.404 ± 1.668
AspHis: 0.0 ± 0.0
AspIle: 2.383 ± 1.112
AspLys: 1.021 ± 0.437
AspLeu: 2.723 ± 1.104
AspMet: 1.361 ± 0.597
AspAsn: 0.34 ± 0.261
AspPro: 2.723 ± 0.688
AspGln: 1.361 ± 0.7
AspArg: 2.723 ± 0.657
AspSer: 2.383 ± 0.771
AspThr: 5.786 ± 1.024
AspVal: 3.063 ± 1.24
AspTrp: 1.702 ± 0.359
AspTyr: 1.702 ± 0.607
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.723 ± 0.991
GluCys: 1.702 ± 0.634
GluAsp: 1.702 ± 0.331
GluGlu: 4.765 ± 0.51
GluPhe: 2.042 ± 0.794
GluGly: 4.084 ± 1.314
GluHis: 2.723 ± 0.987
GluIle: 3.404 ± 0.919
GluLys: 5.106 ± 0.758
GluLeu: 8.169 ± 1.575
GluMet: 1.702 ± 0.659
GluAsn: 3.404 ± 1.001
GluPro: 1.702 ± 0.359
GluGln: 2.383 ± 0.911
GluArg: 4.084 ± 0.77
GluSer: 3.744 ± 0.313
GluThr: 4.084 ± 0.358
GluVal: 4.425 ± 0.76
GluTrp: 2.042 ± 0.827
GluTyr: 2.383 ± 0.771
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.681 ± 0.261
PheCys: 0.681 ± 0.299
PheAsp: 2.042 ± 0.851
PheGlu: 2.383 ± 1.178
PhePhe: 0.0 ± 0.0
PheGly: 3.404 ± 1.41
PheHis: 0.681 ± 0.523
PheIle: 2.723 ± 0.742
PheLys: 1.702 ± 0.924
PheLeu: 4.084 ± 1.234
PheMet: 1.702 ± 0.485
PheAsn: 3.063 ± 1.083
PhePro: 1.702 ± 0.34
PheGln: 0.681 ± 0.462
PheArg: 1.361 ± 0.746
PheSer: 1.021 ± 0.34
PheThr: 2.383 ± 0.663
PheVal: 2.383 ± 0.53
PheTrp: 0.34 ± 0.314
PheTyr: 2.042 ± 0.97
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.063 ± 0.828
GlyCys: 1.702 ± 0.523
GlyAsp: 3.404 ± 0.667
GlyGlu: 5.106 ± 0.912
GlyPhe: 5.446 ± 1.525
GlyGly: 2.383 ± 1.093
GlyHis: 1.702 ± 0.694
GlyIle: 6.807 ± 1.598
GlyLys: 5.786 ± 1.237
GlyLeu: 9.53 ± 2.692
GlyMet: 2.042 ± 0.325
GlyAsn: 3.404 ± 0.791
GlyPro: 3.404 ± 0.582
GlyGln: 3.063 ± 0.669
GlyArg: 4.765 ± 1.032
GlySer: 4.084 ± 1.251
GlyThr: 3.404 ± 1.27
GlyVal: 7.828 ± 1.7
GlyTrp: 2.042 ± 0.998
GlyTyr: 2.042 ± 0.574
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.361 ± 0.493
HisCys: 0.681 ± 0.51
HisAsp: 1.361 ± 0.993
HisGlu: 1.361 ± 0.772
HisPhe: 0.0 ± 0.0
HisGly: 1.021 ± 0.787
HisHis: 0.34 ± 0.248
HisIle: 0.34 ± 0.248
HisLys: 1.702 ± 0.494
HisLeu: 3.063 ± 1.256
HisMet: 1.021 ± 0.47
HisAsn: 0.34 ± 0.261
HisPro: 1.361 ± 0.442
HisGln: 0.681 ± 0.342
HisArg: 2.042 ± 0.366
HisSer: 1.021 ± 0.315
HisThr: 0.34 ± 0.314
HisVal: 2.723 ± 0.767
HisTrp: 0.681 ± 0.39
HisTyr: 1.361 ± 0.313
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.786 ± 1.175
IleCys: 0.681 ± 0.569
IleAsp: 2.042 ± 0.618
IleGlu: 2.383 ± 0.813
IlePhe: 1.702 ± 0.359
IleGly: 5.446 ± 1.326
IleHis: 1.702 ± 0.699
IleIle: 4.765 ± 1.695
IleLys: 2.723 ± 0.753
IleLeu: 8.509 ± 0.514
IleMet: 1.361 ± 0.579
IleAsn: 2.723 ± 1.121
IlePro: 2.042 ± 0.756
IleGln: 2.042 ± 0.629
IleArg: 4.765 ± 0.74
IleSer: 4.084 ± 1.18
IleThr: 4.765 ± 1.191
IleVal: 5.786 ± 1.035
IleTrp: 1.702 ± 0.968
IleTyr: 2.042 ± 0.554
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.425 ± 0.36
LysCys: 0.681 ± 0.39
LysAsp: 3.063 ± 0.904
LysGlu: 6.127 ± 1.129
LysPhe: 2.042 ± 1.119
LysGly: 4.084 ± 1.23
LysHis: 0.34 ± 0.248
LysIle: 3.744 ± 1.691
LysLys: 2.042 ± 0.442
LysLeu: 5.446 ± 1.175
LysMet: 2.042 ± 0.646
LysAsn: 2.383 ± 0.813
LysPro: 2.042 ± 0.903
LysGln: 1.021 ± 0.708
LysArg: 4.084 ± 0.788
LysSer: 2.723 ± 0.597
LysThr: 0.681 ± 0.523
LysVal: 3.404 ± 0.811
LysTrp: 2.042 ± 0.764
LysTyr: 2.042 ± 0.933
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.509 ± 2.111
LeuCys: 1.361 ± 0.442
LeuAsp: 4.765 ± 0.896
LeuGlu: 4.425 ± 1.52
LeuPhe: 3.744 ± 0.91
LeuGly: 8.169 ± 1.12
LeuHis: 3.063 ± 0.805
LeuIle: 9.19 ± 1.682
LeuLys: 7.488 ± 1.037
LeuLeu: 10.551 ± 1.87
LeuMet: 2.383 ± 0.475
LeuAsn: 2.042 ± 0.764
LeuPro: 1.702 ± 0.582
LeuGln: 1.021 ± 0.747
LeuArg: 6.807 ± 0.943
LeuSer: 8.85 ± 1.8
LeuThr: 6.467 ± 1.792
LeuVal: 6.127 ± 1.707
LeuTrp: 1.361 ± 0.327
LeuTyr: 1.702 ± 1.109
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.361 ± 0.518
MetCys: 0.34 ± 0.34
MetAsp: 2.042 ± 0.492
MetGlu: 2.042 ± 0.682
MetPhe: 1.361 ± 0.767
MetGly: 3.744 ± 0.86
MetHis: 0.34 ± 0.248
MetIle: 2.383 ± 1.112
MetLys: 2.723 ± 1.048
MetLeu: 2.723 ± 0.433
MetMet: 0.0 ± 0.0
MetAsn: 1.021 ± 0.561
MetPro: 1.021 ± 0.315
MetGln: 0.34 ± 0.248
MetArg: 2.383 ± 0.619
MetSer: 1.702 ± 0.799
MetThr: 3.404 ± 0.985
MetVal: 1.361 ± 0.518
MetTrp: 0.0 ± 0.0
MetTyr: 0.34 ± 0.363
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.042 ± 0.874
AsnCys: 0.681 ± 0.342
AsnAsp: 1.361 ± 0.4
AsnGlu: 3.063 ± 0.513
AsnPhe: 0.34 ± 0.34
AsnGly: 2.723 ± 1.293
AsnHis: 0.681 ± 0.497
AsnIle: 3.744 ± 1.443
AsnLys: 0.34 ± 0.248
AsnLeu: 3.404 ± 1.037
AsnMet: 1.361 ± 0.78
AsnAsn: 1.702 ± 0.45
AsnPro: 1.021 ± 0.451
AsnGln: 0.0 ± 0.0
AsnArg: 3.404 ± 0.99
AsnSer: 1.702 ± 0.322
AsnThr: 2.042 ± 0.554
AsnVal: 2.042 ± 0.666
AsnTrp: 1.702 ± 0.388
AsnTyr: 2.723 ± 0.49
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.042 ± 0.529
ProCys: 0.0 ± 0.0
ProAsp: 0.34 ± 0.314
ProGlu: 3.063 ± 1.248
ProPhe: 1.021 ± 0.522
ProGly: 4.765 ± 0.964
ProHis: 1.702 ± 0.932
ProIle: 2.042 ± 0.544
ProLys: 1.702 ± 0.896
ProLeu: 2.042 ± 0.614
ProMet: 1.361 ± 0.59
ProAsn: 1.361 ± 0.653
ProPro: 2.383 ± 0.714
ProGln: 0.681 ± 0.523
ProArg: 1.702 ± 0.725
ProSer: 2.383 ± 0.701
ProThr: 4.084 ± 0.926
ProVal: 4.765 ± 1.258
ProTrp: 1.361 ± 0.859
ProTyr: 1.021 ± 0.519
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.383 ± 0.48
GlnCys: 0.0 ± 0.0
GlnAsp: 0.681 ± 0.261
GlnGlu: 3.063 ± 1.078
GlnPhe: 0.34 ± 0.261
GlnGly: 3.063 ± 0.543
GlnHis: 0.34 ± 0.314
GlnIle: 1.361 ± 0.444
GlnLys: 0.681 ± 0.352
GlnLeu: 3.404 ± 0.927
GlnMet: 0.681 ± 0.352
GlnAsn: 0.681 ± 0.497
GlnPro: 2.042 ± 0.88
GlnGln: 2.723 ± 1.095
GlnArg: 1.702 ± 0.467
GlnSer: 2.383 ± 0.727
GlnThr: 1.021 ± 0.478
GlnVal: 1.021 ± 0.519
GlnTrp: 0.34 ± 0.314
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.084 ± 0.77
ArgCys: 0.34 ± 0.261
ArgAsp: 3.063 ± 0.387
ArgGlu: 6.467 ± 1.456
ArgPhe: 1.021 ± 0.451
ArgGly: 5.786 ± 1.332
ArgHis: 1.021 ± 0.609
ArgIle: 4.425 ± 1.17
ArgLys: 3.063 ± 1.316
ArgLeu: 3.404 ± 0.676
ArgMet: 2.383 ± 0.412
ArgAsn: 2.723 ± 0.768
ArgPro: 2.383 ± 0.661
ArgGln: 1.021 ± 0.561
ArgArg: 3.744 ± 1.126
ArgSer: 4.765 ± 1.127
ArgThr: 3.404 ± 1.541
ArgVal: 6.467 ± 1.269
ArgTrp: 1.021 ± 0.437
ArgTyr: 2.042 ± 0.642
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.404 ± 0.551
SerCys: 0.681 ± 0.559
SerAsp: 1.702 ± 0.34
SerGlu: 4.084 ± 1.271
SerPhe: 3.404 ± 0.975
SerGly: 3.744 ± 1.016
SerHis: 1.361 ± 0.661
SerIle: 2.383 ± 1.172
SerLys: 3.063 ± 0.738
SerLeu: 5.786 ± 1.584
SerMet: 2.723 ± 0.734
SerAsn: 2.723 ± 0.704
SerPro: 2.042 ± 0.912
SerGln: 1.702 ± 1.034
SerArg: 3.063 ± 0.618
SerSer: 3.404 ± 1.054
SerThr: 4.765 ± 1.515
SerVal: 3.404 ± 0.986
SerTrp: 1.702 ± 0.501
SerTyr: 1.361 ± 0.276
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.425 ± 1.444
ThrCys: 0.0 ± 0.0
ThrAsp: 3.063 ± 0.97
ThrGlu: 3.744 ± 0.843
ThrPhe: 1.702 ± 0.467
ThrGly: 5.106 ± 0.94
ThrHis: 2.042 ± 0.574
ThrIle: 3.404 ± 1.151
ThrLys: 4.084 ± 0.974
ThrLeu: 6.807 ± 1.394
ThrMet: 2.042 ± 0.698
ThrAsn: 2.723 ± 0.817
ThrPro: 3.063 ± 0.528
ThrGln: 2.723 ± 1.084
ThrArg: 4.765 ± 1.244
ThrSer: 3.063 ± 1.509
ThrThr: 4.765 ± 1.666
ThrVal: 3.744 ± 0.652
ThrTrp: 1.702 ± 0.694
ThrTyr: 5.106 ± 0.804
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.765 ± 1.111
ValCys: 1.021 ± 0.605
ValAsp: 3.744 ± 1.108
ValGlu: 4.084 ± 0.665
ValPhe: 3.404 ± 0.871
ValGly: 6.807 ± 2.416
ValHis: 1.021 ± 0.451
ValIle: 3.404 ± 1.371
ValLys: 5.106 ± 1.022
ValLeu: 3.744 ± 1.207
ValMet: 4.084 ± 1.196
ValAsn: 1.361 ± 0.442
ValPro: 4.084 ± 0.863
ValGln: 1.021 ± 0.315
ValArg: 3.063 ± 1.001
ValSer: 4.084 ± 0.745
ValThr: 6.127 ± 1.367
ValVal: 5.106 ± 2.148
ValTrp: 1.021 ± 0.415
ValTyr: 2.723 ± 0.706
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.021 ± 0.47
TrpCys: 0.681 ± 0.387
TrpAsp: 0.681 ± 0.342
TrpGlu: 2.383 ± 0.536
TrpPhe: 1.021 ± 0.522
TrpGly: 1.361 ± 0.536
TrpHis: 0.0 ± 0.0
TrpIle: 1.361 ± 1.001
TrpLys: 1.361 ± 0.997
TrpLeu: 2.723 ± 0.863
TrpMet: 0.681 ± 0.497
TrpAsn: 1.361 ± 0.276
TrpPro: 0.34 ± 0.261
TrpGln: 1.361 ± 0.646
TrpArg: 1.021 ± 0.54
TrpSer: 0.681 ± 0.506
TrpThr: 1.702 ± 0.399
TrpVal: 1.361 ± 0.327
TrpTrp: 1.021 ± 0.713
TrpTyr: 1.702 ± 1.393
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.063 ± 1.103
TyrCys: 0.34 ± 0.34
TyrAsp: 2.723 ± 0.364
TyrGlu: 2.723 ± 0.49
TyrPhe: 1.702 ± 0.359
TyrGly: 3.404 ± 1.054
TyrHis: 1.021 ± 0.546
TyrIle: 2.042 ± 0.932
TyrLys: 1.021 ± 0.437
TyrLeu: 3.404 ± 1.167
TyrMet: 0.34 ± 0.482
TyrAsn: 0.681 ± 0.497
TyrPro: 2.383 ± 1.384
TyrGln: 1.702 ± 0.909
TyrArg: 2.723 ± 0.458
TyrSer: 1.361 ± 0.47
TyrThr: 1.361 ± 0.599
TyrVal: 2.042 ± 0.581
TyrTrp: 0.34 ± 0.248
TyrTyr: 2.042 ± 0.618
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2939 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
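
One way such an error estimate could be obtained is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation over 100 replicates is reported. The function names, the fixed random seed, the use of the sample standard deviation, and the toy sequences are all assumptions for illustration; the exact procedure used by Proteome-pI is not described here.

```python
import random
import statistics
from collections import Counter

def pair_frequency(sequences, pair):
    """Per mille frequency of one dipeptide among all adjacent pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the set size
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome in one-letter code (made-up sequences)
proteins = ["MALLAG", "GLLKLL", "MKTAYI", "LLSDE"]
print(bootstrap_error(proteins, "LL"))
```

With only a handful of proteins, as in the 6-protein proteome above, the resampled sets differ strongly from one another, which is one reason the reported errors can be large relative to the values.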

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski