Amino acid dipepetide frequency for Wuhan Millipede virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.021AlaAla: 6.021 ± 0.446
1.505AlaCys: 1.505 ± 0.718
3.011AlaAsp: 3.011 ± 1.167
3.011AlaGlu: 3.011 ± 0.361
3.512AlaPhe: 3.512 ± 1.048
4.516AlaGly: 4.516 ± 1.458
1.004AlaHis: 1.004 ± 0.505
5.519AlaIle: 5.519 ± 1.521
6.021AlaLys: 6.021 ± 1.644
3.011AlaLeu: 3.011 ± 1.01
2.509AlaMet: 2.509 ± 0.319
4.516AlaAsn: 4.516 ± 1.241
3.011AlaPro: 3.011 ± 1.809
4.516AlaGln: 4.516 ± 0.552
5.018AlaArg: 5.018 ± 1.553
2.509AlaSer: 2.509 ± 0.788
3.512AlaThr: 3.512 ± 0.432
6.021AlaVal: 6.021 ± 1.46
0.502AlaTrp: 0.502 ± 0.402
3.011AlaTyr: 3.011 ± 1.858
0.0AlaXaa: 0.0 ± 0.0
Cys
0.502CysAla: 0.502 ± 0.432
0.502CysCys: 0.502 ± 0.432
1.505CysAsp: 1.505 ± 0.746
0.0CysGlu: 0.0 ± 0.0
1.505CysPhe: 1.505 ± 0.758
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.502CysIle: 0.502 ± 0.402
0.0CysLys: 0.0 ± 0.0
1.505CysLeu: 1.505 ± 0.805
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.004CysPro: 1.004 ± 0.505
0.0CysGln: 0.0 ± 0.0
1.505CysArg: 1.505 ± 0.859
1.004CysSer: 1.004 ± 0.863
1.004CysThr: 1.004 ± 0.468
1.004CysVal: 1.004 ± 0.468
0.502CysTrp: 0.502 ± 0.419
1.004CysTyr: 1.004 ± 0.512
0.0CysXaa: 0.0 ± 0.0
Asp
4.516AspAla: 4.516 ± 1.253
0.502AspCys: 0.502 ± 0.459
4.014AspAsp: 4.014 ± 1.321
3.011AspGlu: 3.011 ± 1.032
6.523AspPhe: 6.523 ± 0.541
2.007AspGly: 2.007 ± 0.605
3.512AspHis: 3.512 ± 0.929
1.004AspIle: 1.004 ± 0.505
4.014AspLys: 4.014 ± 1.211
6.021AspLeu: 6.021 ± 1.816
1.505AspMet: 1.505 ± 0.885
1.004AspAsn: 1.004 ± 0.496
3.011AspPro: 3.011 ± 0.872
2.007AspGln: 2.007 ± 0.768
3.512AspArg: 3.512 ± 1.539
5.519AspSer: 5.519 ± 1.552
4.014AspThr: 4.014 ± 1.518
5.018AspVal: 5.018 ± 0.467
2.007AspTrp: 2.007 ± 0.715
1.505AspTyr: 1.505 ± 0.795
0.0AspXaa: 0.0 ± 0.0
Glu
7.025GluAla: 7.025 ± 1.364
0.0GluCys: 0.0 ± 0.0
5.018GluAsp: 5.018 ± 0.723
8.028GluGlu: 8.028 ± 0.772
3.011GluPhe: 3.011 ± 1.036
1.505GluGly: 1.505 ± 0.758
2.007GluHis: 2.007 ± 0.65
3.512GluIle: 3.512 ± 0.967
2.509GluLys: 2.509 ± 1.258
7.025GluLeu: 7.025 ± 1.852
1.505GluMet: 1.505 ± 0.381
2.007GluAsn: 2.007 ± 1.114
4.014GluPro: 4.014 ± 1.14
1.505GluGln: 1.505 ± 0.798
3.011GluArg: 3.011 ± 1.596
3.512GluSer: 3.512 ± 1.002
3.512GluThr: 3.512 ± 1.539
1.505GluVal: 1.505 ± 0.758
1.004GluTrp: 1.004 ± 0.837
2.007GluTyr: 2.007 ± 0.754
0.0GluXaa: 0.0 ± 0.0
Phe
2.007PheAla: 2.007 ± 0.746
0.502PheCys: 0.502 ± 0.419
3.512PheAsp: 3.512 ± 1.064
2.007PheGlu: 2.007 ± 0.935
2.509PhePhe: 2.509 ± 0.788
4.014PheGly: 4.014 ± 0.823
1.004PheHis: 1.004 ± 0.512
2.007PheIle: 2.007 ± 0.715
4.014PheLys: 4.014 ± 1.322
2.509PheLeu: 2.509 ± 0.319
0.0PheMet: 0.0 ± 0.0
1.505PheAsn: 1.505 ± 0.886
5.519PhePro: 5.519 ± 1.611
0.502PheGln: 0.502 ± 0.459
3.512PheArg: 3.512 ± 0.348
5.018PheSer: 5.018 ± 0.893
3.512PheThr: 3.512 ± 0.704
1.004PheVal: 1.004 ± 0.512
0.502PheTrp: 0.502 ± 0.432
1.004PheTyr: 1.004 ± 0.804
0.0PheXaa: 0.0 ± 0.0
Gly
5.018GlyAla: 5.018 ± 1.18
1.004GlyCys: 1.004 ± 0.512
2.509GlyAsp: 2.509 ± 0.801
3.011GlyGlu: 3.011 ± 0.614
2.007GlyPhe: 2.007 ± 1.205
1.505GlyGly: 1.505 ± 0.381
0.0GlyHis: 0.0 ± 0.0
3.011GlyIle: 3.011 ± 1.039
3.512GlyLys: 3.512 ± 0.579
3.512GlyLeu: 3.512 ± 0.827
2.007GlyMet: 2.007 ± 0.65
1.505GlyAsn: 1.505 ± 0.905
2.509GlyPro: 2.509 ± 1.439
3.011GlyGln: 3.011 ± 1.073
2.509GlyArg: 2.509 ± 0.788
1.004GlySer: 1.004 ± 0.468
2.007GlyThr: 2.007 ± 0.754
4.516GlyVal: 4.516 ± 1.428
0.502GlyTrp: 0.502 ± 0.419
2.007GlyTyr: 2.007 ± 0.935
0.0GlyXaa: 0.0 ± 0.0
His
2.007HisAla: 2.007 ± 1.303
1.004HisCys: 1.004 ± 0.496
1.004HisAsp: 1.004 ± 0.512
0.0HisGlu: 0.0 ± 0.0
2.509HisPhe: 2.509 ± 1.112
1.505HisGly: 1.505 ± 0.489
0.502HisHis: 0.502 ± 0.432
0.502HisIle: 0.502 ± 0.419
0.502HisLys: 0.502 ± 0.432
6.523HisLeu: 6.523 ± 1.101
0.502HisMet: 0.502 ± 0.469
0.502HisAsn: 0.502 ± 0.432
2.007HisPro: 2.007 ± 0.746
0.0HisGln: 0.0 ± 0.0
2.007HisArg: 2.007 ± 1.119
2.007HisSer: 2.007 ± 0.715
2.509HisThr: 2.509 ± 0.981
1.004HisVal: 1.004 ± 0.468
0.0HisTrp: 0.0 ± 0.0
1.004HisTyr: 1.004 ± 0.512
0.0HisXaa: 0.0 ± 0.0
Ile
4.516IleAla: 4.516 ± 1.731
1.004IleCys: 1.004 ± 0.468
5.519IleAsp: 5.519 ± 1.332
2.509IleGlu: 2.509 ± 0.981
2.007IlePhe: 2.007 ± 0.768
2.509IleGly: 2.509 ± 1.088
1.004IleHis: 1.004 ± 0.557
4.516IleIle: 4.516 ± 1.342
2.509IleLys: 2.509 ± 1.728
4.014IleLeu: 4.014 ± 1.985
1.004IleMet: 1.004 ± 0.837
2.509IleAsn: 2.509 ± 0.943
4.516IlePro: 4.516 ± 0.896
1.505IleGln: 1.505 ± 0.859
6.021IleArg: 6.021 ± 2.022
2.509IleSer: 2.509 ± 1.728
5.519IleThr: 5.519 ± 1.913
3.512IleVal: 3.512 ± 1.715
1.004IleTrp: 1.004 ± 0.804
2.509IleTyr: 2.509 ± 0.799
0.0IleXaa: 0.0 ± 0.0
Lys
4.516LysAla: 4.516 ± 1.116
0.0LysCys: 0.0 ± 0.0
2.007LysAsp: 2.007 ± 1.17
4.014LysGlu: 4.014 ± 1.343
2.007LysPhe: 2.007 ± 0.094
1.004LysGly: 1.004 ± 0.505
1.505LysHis: 1.505 ± 0.746
6.523LysIle: 6.523 ± 2.652
3.512LysLys: 3.512 ± 1.584
3.011LysLeu: 3.011 ± 0.937
0.502LysMet: 0.502 ± 0.419
3.512LysAsn: 3.512 ± 0.733
6.021LysPro: 6.021 ± 1.285
1.505LysGln: 1.505 ± 1.295
5.519LysArg: 5.519 ± 2.352
3.512LysSer: 3.512 ± 0.704
3.512LysThr: 3.512 ± 1.411
2.007LysVal: 2.007 ± 1.314
0.0LysTrp: 0.0 ± 0.0
1.505LysTyr: 1.505 ± 1.206
0.0LysXaa: 0.0 ± 0.0
Leu
8.028LeuAla: 8.028 ± 2.291
1.004LeuCys: 1.004 ± 0.468
5.519LeuAsp: 5.519 ± 1.496
5.519LeuGlu: 5.519 ± 1.931
3.512LeuPhe: 3.512 ± 1.396
3.512LeuGly: 3.512 ± 0.704
1.004LeuHis: 1.004 ± 0.837
3.512LeuIle: 3.512 ± 0.885
5.519LeuLys: 5.519 ± 0.999
7.025LeuLeu: 7.025 ± 2.162
1.004LeuMet: 1.004 ± 0.429
4.014LeuAsn: 4.014 ± 1.168
7.025LeuPro: 7.025 ± 2.047
1.505LeuGln: 1.505 ± 0.863
6.523LeuArg: 6.523 ± 1.653
5.018LeuSer: 5.018 ± 0.308
5.519LeuThr: 5.519 ± 0.868
4.516LeuVal: 4.516 ± 0.896
0.502LeuTrp: 0.502 ± 0.432
2.007LeuTyr: 2.007 ± 0.682
0.0LeuXaa: 0.0 ± 0.0
Met
2.007MetAla: 2.007 ± 0.682
0.0MetCys: 0.0 ± 0.0
2.007MetAsp: 2.007 ± 1.17
2.007MetGlu: 2.007 ± 0.605
1.004MetPhe: 1.004 ± 0.496
1.004MetGly: 1.004 ± 0.837
0.0MetHis: 0.0 ± 0.0
3.011MetIle: 3.011 ± 0.842
0.502MetLys: 0.502 ± 0.459
2.007MetLeu: 2.007 ± 0.858
0.0MetMet: 0.0 ± 0.0
1.505MetAsn: 1.505 ± 0.798
0.0MetPro: 0.0 ± 0.0
0.502MetGln: 0.502 ± 0.402
2.007MetArg: 2.007 ± 1.119
1.004MetSer: 1.004 ± 0.496
0.502MetThr: 0.502 ± 0.402
0.502MetVal: 0.502 ± 0.459
0.502MetTrp: 0.502 ± 0.419
0.502MetTyr: 0.502 ± 0.402
0.0MetXaa: 0.0 ± 0.0
Asn
1.004AsnAla: 1.004 ± 0.429
0.0AsnCys: 0.0 ± 0.0
1.505AsnAsp: 1.505 ± 0.805
1.505AsnGlu: 1.505 ± 0.905
2.007AsnPhe: 2.007 ± 0.682
3.011AsnGly: 3.011 ± 0.966
0.502AsnHis: 0.502 ± 0.419
1.505AsnIle: 1.505 ± 0.421
1.004AsnLys: 1.004 ± 0.468
2.509AsnLeu: 2.509 ± 1.061
1.004AsnMet: 1.004 ± 0.429
1.505AsnAsn: 1.505 ± 0.436
4.014AsnPro: 4.014 ± 1.195
4.516AsnGln: 4.516 ± 1.586
2.509AsnArg: 2.509 ± 0.886
4.014AsnSer: 4.014 ± 0.601
3.512AsnThr: 3.512 ± 0.926
1.505AsnVal: 1.505 ± 0.436
0.502AsnTrp: 0.502 ± 0.419
1.505AsnTyr: 1.505 ± 0.718
0.0AsnXaa: 0.0 ± 0.0
Pro
5.018ProAla: 5.018 ± 1.571
1.004ProCys: 1.004 ± 0.863
5.519ProAsp: 5.519 ± 0.858
6.523ProGlu: 6.523 ± 1.923
1.004ProPhe: 1.004 ± 0.557
4.014ProGly: 4.014 ± 1.202
2.509ProHis: 2.509 ± 1.62
5.519ProIle: 5.519 ± 1.521
2.007ProLys: 2.007 ± 0.754
6.021ProLeu: 6.021 ± 0.516
1.505ProMet: 1.505 ± 0.718
4.014ProAsn: 4.014 ± 1.253
3.512ProPro: 3.512 ± 1.452
0.502ProGln: 0.502 ± 0.432
1.505ProArg: 1.505 ± 0.863
6.021ProSer: 6.021 ± 0.733
6.021ProThr: 6.021 ± 1.644
4.014ProVal: 4.014 ± 0.523
0.502ProTrp: 0.502 ± 0.459
2.007ProTyr: 2.007 ± 1.155
0.0ProXaa: 0.0 ± 0.0
Gln
1.505GlnAla: 1.505 ± 0.885
0.502GlnCys: 0.502 ± 0.419
1.505GlnAsp: 1.505 ± 0.421
1.505GlnGlu: 1.505 ± 0.905
1.505GlnPhe: 1.505 ± 0.885
3.512GlnGly: 3.512 ± 1.318
2.007GlnHis: 2.007 ± 0.615
1.004GlnIle: 1.004 ± 0.468
1.004GlnLys: 1.004 ± 0.468
4.516GlnLeu: 4.516 ± 1.766
0.502GlnMet: 0.502 ± 0.402
0.502GlnAsn: 0.502 ± 0.402
5.519GlnPro: 5.519 ± 1.186
0.502GlnGln: 0.502 ± 0.432
2.007GlnArg: 2.007 ± 0.778
0.502GlnSer: 0.502 ± 0.459
3.011GlnThr: 3.011 ± 1.154
1.004GlnVal: 1.004 ± 0.863
1.004GlnTrp: 1.004 ± 0.557
2.007GlnTyr: 2.007 ± 1.131
0.0GlnXaa: 0.0 ± 0.0
Arg
4.516ArgAla: 4.516 ± 1.601
1.004ArgCys: 1.004 ± 0.512
6.021ArgAsp: 6.021 ± 1.46
7.025ArgGlu: 7.025 ± 2.048
2.007ArgPhe: 2.007 ± 1.082
1.004ArgGly: 1.004 ± 0.557
2.509ArgHis: 2.509 ± 0.668
4.516ArgIle: 4.516 ± 0.243
6.523ArgLys: 6.523 ± 1.331
4.516ArgLeu: 4.516 ± 1.588
1.505ArgMet: 1.505 ± 0.472
2.007ArgAsn: 2.007 ± 0.935
2.007ArgPro: 2.007 ± 0.094
4.014ArgGln: 4.014 ± 1.986
6.021ArgArg: 6.021 ± 1.734
3.512ArgSer: 3.512 ± 1.453
5.018ArgThr: 5.018 ± 1.285
2.007ArgVal: 2.007 ± 0.992
0.502ArgTrp: 0.502 ± 0.402
2.007ArgTyr: 2.007 ± 1.024
0.0ArgXaa: 0.0 ± 0.0
Ser
3.512SerAla: 3.512 ± 0.929
1.004SerCys: 1.004 ± 0.468
6.021SerAsp: 6.021 ± 1.212
2.007SerGlu: 2.007 ± 0.746
1.505SerPhe: 1.505 ± 0.886
4.014SerGly: 4.014 ± 0.732
3.011SerHis: 3.011 ± 0.361
5.519SerIle: 5.519 ± 1.0
4.014SerLys: 4.014 ± 1.42
4.014SerLeu: 4.014 ± 1.09
2.509SerMet: 2.509 ± 0.913
0.502SerAsn: 0.502 ± 0.419
5.519SerPro: 5.519 ± 0.694
2.509SerGln: 2.509 ± 1.439
3.011SerArg: 3.011 ± 1.858
6.021SerSer: 6.021 ± 1.37
4.516SerThr: 4.516 ± 1.618
4.516SerVal: 4.516 ± 0.945
0.0SerTrp: 0.0 ± 0.0
2.007SerTyr: 2.007 ± 0.605
0.0SerXaa: 0.0 ± 0.0
Thr
4.516ThrAla: 4.516 ± 1.191
0.502ThrCys: 0.502 ± 0.432
3.011ThrAsp: 3.011 ± 1.19
3.512ThrGlu: 3.512 ± 1.571
3.512ThrPhe: 3.512 ± 1.47
4.516ThrGly: 4.516 ± 0.568
1.004ThrHis: 1.004 ± 0.429
5.519ThrIle: 5.519 ± 1.931
5.018ThrLys: 5.018 ± 1.079
3.011ThrLeu: 3.011 ± 1.201
1.004ThrMet: 1.004 ± 0.505
4.014ThrAsn: 4.014 ± 1.67
5.519ThrPro: 5.519 ± 0.675
1.004ThrGln: 1.004 ± 0.512
4.516ThrArg: 4.516 ± 0.552
6.021ThrSer: 6.021 ± 1.644
4.014ThrThr: 4.014 ± 0.523
4.516ThrVal: 4.516 ± 0.473
0.502ThrTrp: 0.502 ± 0.402
3.011ThrTyr: 3.011 ± 1.133
0.0ThrXaa: 0.0 ± 0.0
Val
4.014ValAla: 4.014 ± 1.325
0.502ValCys: 0.502 ± 0.432
2.007ValAsp: 2.007 ± 0.65
3.512ValGlu: 3.512 ± 1.121
2.007ValPhe: 2.007 ± 1.155
1.505ValGly: 1.505 ± 0.718
2.007ValHis: 2.007 ± 0.754
2.007ValIle: 2.007 ± 0.094
2.007ValLys: 2.007 ± 1.835
5.519ValLeu: 5.519 ± 0.858
0.502ValMet: 0.502 ± 0.402
2.509ValAsn: 2.509 ± 0.749
1.505ValPro: 1.505 ± 0.436
3.011ValGln: 3.011 ± 0.966
5.519ValArg: 5.519 ± 0.858
4.516ValSer: 4.516 ± 1.35
4.516ValThr: 4.516 ± 1.373
3.011ValVal: 3.011 ± 0.54
0.502ValTrp: 0.502 ± 0.432
3.011ValTyr: 3.011 ± 1.208
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.502TrpCys: 0.502 ± 0.432
0.0TrpAsp: 0.0 ± 0.0
1.004TrpGlu: 1.004 ± 0.837
0.0TrpPhe: 0.0 ± 0.0
0.502TrpGly: 0.502 ± 0.402
1.004TrpHis: 1.004 ± 0.557
0.502TrpIle: 0.502 ± 0.419
0.0TrpLys: 0.0 ± 0.0
1.505TrpLeu: 1.505 ± 0.381
0.502TrpMet: 0.502 ± 0.432
0.0TrpAsn: 0.0 ± 0.0
1.004TrpPro: 1.004 ± 0.429
0.502TrpGln: 0.502 ± 0.419
0.502TrpArg: 0.502 ± 0.402
1.004TrpSer: 1.004 ± 0.429
0.502TrpThr: 0.502 ± 0.432
0.502TrpVal: 0.502 ± 0.432
0.502TrpTrp: 0.502 ± 0.419
1.004TrpTyr: 1.004 ± 0.429
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.007TyrAla: 2.007 ± 1.155
1.004TyrCys: 1.004 ± 0.468
3.011TyrAsp: 3.011 ± 1.334
3.512TyrGlu: 3.512 ± 0.627
2.509TyrPhe: 2.509 ± 0.319
1.505TyrGly: 1.505 ± 0.758
2.007TyrHis: 2.007 ± 0.65
1.004TyrIle: 1.004 ± 0.918
1.505TyrLys: 1.505 ± 0.718
4.014TyrLeu: 4.014 ± 0.523
0.502TyrMet: 0.502 ± 0.459
1.505TyrAsn: 1.505 ± 0.381
1.505TyrPro: 1.505 ± 1.206
2.007TyrGln: 2.007 ± 0.682
1.505TyrArg: 1.505 ± 0.718
1.505TyrSer: 1.505 ± 0.746
2.007TyrThr: 2.007 ± 0.615
2.007TyrVal: 2.007 ± 1.131
0.0TyrTrp: 0.0 ± 0.0
3.512TyrTyr: 3.512 ± 1.002
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1994 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski