Amino acid dipepetide frequency for Wuhan Fly Virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.318AlaAla: 4.318 ± 1.414
1.524AlaCys: 1.524 ± 0.636
2.54AlaAsp: 2.54 ± 1.208
3.048AlaGlu: 3.048 ± 0.934
1.016AlaPhe: 1.016 ± 0.399
2.032AlaGly: 2.032 ± 0.825
0.762AlaHis: 0.762 ± 0.658
2.54AlaIle: 2.54 ± 0.937
3.556AlaLys: 3.556 ± 0.983
5.334AlaLeu: 5.334 ± 1.028
0.762AlaMet: 0.762 ± 0.327
2.286AlaAsn: 2.286 ± 0.501
1.27AlaPro: 1.27 ± 0.594
1.778AlaGln: 1.778 ± 0.528
3.302AlaArg: 3.302 ± 0.568
3.81AlaSer: 3.81 ± 1.084
2.286AlaThr: 2.286 ± 0.839
2.286AlaVal: 2.286 ± 0.503
1.016AlaTrp: 1.016 ± 0.602
1.778AlaTyr: 1.778 ± 0.562
0.0AlaXaa: 0.0 ± 0.0
Cys
0.762CysAla: 0.762 ± 0.327
0.0CysCys: 0.0 ± 0.0
1.778CysAsp: 1.778 ± 0.922
2.032CysGlu: 2.032 ± 0.689
0.762CysPhe: 0.762 ± 0.332
0.508CysGly: 0.508 ± 0.301
0.254CysHis: 0.254 ± 0.347
0.762CysIle: 0.762 ± 0.559
0.0CysLys: 0.0 ± 0.0
1.524CysLeu: 1.524 ± 0.903
0.0CysMet: 0.0 ± 0.0
0.762CysAsn: 0.762 ± 0.332
0.508CysPro: 0.508 ± 0.297
0.762CysGln: 0.762 ± 0.404
1.016CysArg: 1.016 ± 1.158
1.27CysSer: 1.27 ± 0.422
0.762CysThr: 0.762 ± 0.451
1.524CysVal: 1.524 ± 0.66
0.254CysTrp: 0.254 ± 0.15
0.254CysTyr: 0.254 ± 0.347
0.0CysXaa: 0.0 ± 0.0
Asp
1.524AspAla: 1.524 ± 0.355
0.508AspCys: 0.508 ± 0.562
3.048AspAsp: 3.048 ± 0.914
3.302AspGlu: 3.302 ± 1.177
2.286AspPhe: 2.286 ± 0.62
2.54AspGly: 2.54 ± 0.631
1.27AspHis: 1.27 ± 0.513
3.302AspIle: 3.302 ± 0.696
2.032AspLys: 2.032 ± 0.64
6.604AspLeu: 6.604 ± 2.183
3.048AspMet: 3.048 ± 1.073
2.286AspAsn: 2.286 ± 1.261
4.318AspPro: 4.318 ± 0.429
2.794AspGln: 2.794 ± 0.452
1.778AspArg: 1.778 ± 0.778
3.81AspSer: 3.81 ± 0.903
2.794AspThr: 2.794 ± 1.436
3.81AspVal: 3.81 ± 1.736
1.524AspTrp: 1.524 ± 0.403
3.81AspTyr: 3.81 ± 1.613
0.0AspXaa: 0.0 ± 0.0
Glu
2.794GluAla: 2.794 ± 1.222
1.016GluCys: 1.016 ± 0.585
4.064GluAsp: 4.064 ± 1.587
6.096GluGlu: 6.096 ± 1.744
2.286GluPhe: 2.286 ± 0.661
4.064GluGly: 4.064 ± 0.612
2.794GluHis: 2.794 ± 1.059
4.826GluIle: 4.826 ± 1.733
5.842GluLys: 5.842 ± 0.564
5.588GluLeu: 5.588 ± 0.719
1.524GluMet: 1.524 ± 0.683
3.81GluAsn: 3.81 ± 0.55
1.016GluPro: 1.016 ± 0.427
0.762GluGln: 0.762 ± 0.528
2.794GluArg: 2.794 ± 0.687
5.08GluSer: 5.08 ± 0.447
4.064GluThr: 4.064 ± 0.958
3.302GluVal: 3.302 ± 0.761
0.254GluTrp: 0.254 ± 0.347
1.778GluTyr: 1.778 ± 1.219
0.0GluXaa: 0.0 ± 0.0
Phe
1.778PheAla: 1.778 ± 0.587
0.254PheCys: 0.254 ± 0.15
2.032PheAsp: 2.032 ± 0.439
2.286PheGlu: 2.286 ± 1.246
2.286PhePhe: 2.286 ± 0.888
2.286PheGly: 2.286 ± 0.503
1.778PheHis: 1.778 ± 0.784
2.032PheIle: 2.032 ± 0.667
2.794PheLys: 2.794 ± 0.988
5.588PheLeu: 5.588 ± 1.653
1.778PheMet: 1.778 ± 0.655
1.27PheAsn: 1.27 ± 0.441
2.794PhePro: 2.794 ± 0.862
1.524PheGln: 1.524 ± 0.649
3.302PheArg: 3.302 ± 0.792
4.572PheSer: 4.572 ± 1.121
2.286PheThr: 2.286 ± 0.755
2.794PheVal: 2.794 ± 1.224
0.254PheTrp: 0.254 ± 0.347
1.778PheTyr: 1.778 ± 1.054
0.0PheXaa: 0.0 ± 0.0
Gly
2.54GlyAla: 2.54 ± 0.62
0.0GlyCys: 0.0 ± 0.0
2.54GlyAsp: 2.54 ± 0.902
2.794GlyGlu: 2.794 ± 0.84
2.54GlyPhe: 2.54 ± 0.661
3.302GlyGly: 3.302 ± 0.989
0.508GlyHis: 0.508 ± 0.297
1.016GlyIle: 1.016 ± 0.427
3.81GlyLys: 3.81 ± 1.176
6.858GlyLeu: 6.858 ± 1.339
1.016GlyMet: 1.016 ± 1.12
2.032GlyAsn: 2.032 ± 0.846
1.524GlyPro: 1.524 ± 0.714
2.794GlyGln: 2.794 ± 0.475
3.302GlyArg: 3.302 ± 0.83
4.318GlySer: 4.318 ± 1.4
2.032GlyThr: 2.032 ± 0.766
5.588GlyVal: 5.588 ± 1.007
0.762GlyTrp: 0.762 ± 0.332
1.27GlyTyr: 1.27 ± 0.31
0.0GlyXaa: 0.0 ± 0.0
His
1.016HisAla: 1.016 ± 0.384
0.0HisCys: 0.0 ± 0.0
0.762HisAsp: 0.762 ± 0.451
1.016HisGlu: 1.016 ± 0.625
0.762HisPhe: 0.762 ± 0.451
1.524HisGly: 1.524 ± 0.714
0.762HisHis: 0.762 ± 0.783
1.016HisIle: 1.016 ± 0.349
2.286HisLys: 2.286 ± 0.826
4.064HisLeu: 4.064 ± 1.474
0.508HisMet: 0.508 ± 0.301
1.524HisAsn: 1.524 ± 0.517
1.778HisPro: 1.778 ± 0.562
1.27HisGln: 1.27 ± 0.752
2.032HisArg: 2.032 ± 0.595
0.508HisSer: 0.508 ± 0.301
0.508HisThr: 0.508 ± 0.694
1.27HisVal: 1.27 ± 0.31
0.508HisTrp: 0.508 ± 0.301
2.286HisTyr: 2.286 ± 0.683
0.0HisXaa: 0.0 ± 0.0
Ile
3.048IleAla: 3.048 ± 0.927
1.27IleCys: 1.27 ± 0.353
3.302IleAsp: 3.302 ± 0.486
4.318IleGlu: 4.318 ± 1.358
3.302IlePhe: 3.302 ± 1.259
4.318IleGly: 4.318 ± 0.891
1.016IleHis: 1.016 ± 1.137
3.048IleIle: 3.048 ± 1.103
4.572IleLys: 4.572 ± 1.901
6.858IleLeu: 6.858 ± 1.225
0.762IleMet: 0.762 ± 0.318
3.302IleAsn: 3.302 ± 0.561
4.826IlePro: 4.826 ± 0.773
1.27IleGln: 1.27 ± 0.931
4.826IleArg: 4.826 ± 0.851
4.318IleSer: 4.318 ± 0.697
4.064IleThr: 4.064 ± 1.61
3.556IleVal: 3.556 ± 1.272
0.762IleTrp: 0.762 ± 0.559
2.032IleTyr: 2.032 ± 0.564
0.0IleXaa: 0.0 ± 0.0
Lys
0.762LysAla: 0.762 ± 0.318
2.032LysCys: 2.032 ± 0.551
5.08LysAsp: 5.08 ± 0.918
2.794LysGlu: 2.794 ± 0.92
2.286LysPhe: 2.286 ± 0.58
2.794LysGly: 2.794 ± 1.127
1.27LysHis: 1.27 ± 0.521
6.858LysIle: 6.858 ± 1.699
7.366LysLys: 7.366 ± 0.585
6.096LysLeu: 6.096 ± 1.285
3.556LysMet: 3.556 ± 0.869
3.302LysAsn: 3.302 ± 0.822
1.524LysPro: 1.524 ± 1.427
2.286LysGln: 2.286 ± 1.097
3.556LysArg: 3.556 ± 0.661
5.08LysSer: 5.08 ± 1.054
5.08LysThr: 5.08 ± 1.37
4.318LysVal: 4.318 ± 0.825
1.27LysTrp: 1.27 ± 0.551
1.524LysTyr: 1.524 ± 0.636
0.0LysXaa: 0.0 ± 0.0
Leu
5.842LeuAla: 5.842 ± 1.277
1.016LeuCys: 1.016 ± 0.971
7.112LeuAsp: 7.112 ± 1.286
4.064LeuGlu: 4.064 ± 1.451
3.556LeuPhe: 3.556 ± 1.568
4.318LeuGly: 4.318 ± 1.888
3.556LeuHis: 3.556 ± 0.938
9.144LeuIle: 9.144 ± 1.362
8.128LeuLys: 8.128 ± 1.466
7.62LeuLeu: 7.62 ± 1.485
3.302LeuMet: 3.302 ± 0.388
5.588LeuAsn: 5.588 ± 1.565
2.032LeuPro: 2.032 ± 0.437
2.286LeuGln: 2.286 ± 1.114
6.604LeuArg: 6.604 ± 1.781
6.096LeuSer: 6.096 ± 1.511
8.382LeuThr: 8.382 ± 2.495
3.048LeuVal: 3.048 ± 0.969
1.778LeuTrp: 1.778 ± 0.523
4.064LeuTyr: 4.064 ± 1.049
0.0LeuXaa: 0.0 ± 0.0
Met
0.762MetAla: 0.762 ± 0.318
0.508MetCys: 0.508 ± 0.398
1.778MetAsp: 1.778 ± 1.057
2.54MetGlu: 2.54 ± 1.407
2.794MetPhe: 2.794 ± 0.687
2.032MetGly: 2.032 ± 0.717
0.508MetHis: 0.508 ± 0.301
2.032MetIle: 2.032 ± 0.908
3.048MetLys: 3.048 ± 1.454
0.762MetLeu: 0.762 ± 0.39
0.762MetMet: 0.762 ± 0.39
1.27MetAsn: 1.27 ± 0.594
0.508MetPro: 0.508 ± 0.605
0.508MetGln: 0.508 ± 0.297
1.27MetArg: 1.27 ± 0.412
3.302MetSer: 3.302 ± 1.105
1.27MetThr: 1.27 ± 0.341
2.54MetVal: 2.54 ± 1.26
0.254MetTrp: 0.254 ± 0.347
1.524MetTyr: 1.524 ± 0.42
0.0MetXaa: 0.0 ± 0.0
Asn
4.572AsnAla: 4.572 ± 0.857
0.508AsnCys: 0.508 ± 0.301
2.032AsnAsp: 2.032 ± 1.073
2.794AsnGlu: 2.794 ± 0.622
2.032AsnPhe: 2.032 ± 0.821
2.54AsnGly: 2.54 ± 0.843
1.016AsnHis: 1.016 ± 0.602
3.048AsnIle: 3.048 ± 0.572
2.286AsnLys: 2.286 ± 0.801
6.604AsnLeu: 6.604 ± 0.785
1.778AsnMet: 1.778 ± 0.522
2.54AsnAsn: 2.54 ± 0.859
4.318AsnPro: 4.318 ± 0.659
2.794AsnGln: 2.794 ± 0.475
1.778AsnArg: 1.778 ± 1.034
3.048AsnSer: 3.048 ± 0.387
3.048AsnThr: 3.048 ± 0.623
1.778AsnVal: 1.778 ± 0.383
0.508AsnTrp: 0.508 ± 0.584
1.778AsnTyr: 1.778 ± 0.535
0.0AsnXaa: 0.0 ± 0.0
Pro
2.54ProAla: 2.54 ± 1.02
0.508ProCys: 0.508 ± 0.276
3.81ProAsp: 3.81 ± 0.667
4.826ProGlu: 4.826 ± 0.854
1.778ProPhe: 1.778 ± 0.965
1.524ProGly: 1.524 ± 0.517
1.524ProHis: 1.524 ± 0.658
1.778ProIle: 1.778 ± 0.828
1.778ProLys: 1.778 ± 0.887
4.318ProLeu: 4.318 ± 0.398
1.016ProMet: 1.016 ± 0.349
1.524ProAsn: 1.524 ± 1.001
3.048ProPro: 3.048 ± 0.961
0.508ProGln: 0.508 ± 0.501
2.286ProArg: 2.286 ± 0.599
5.334ProSer: 5.334 ± 1.483
2.794ProThr: 2.794 ± 0.923
3.302ProVal: 3.302 ± 0.572
0.254ProTrp: 0.254 ± 0.15
2.032ProTyr: 2.032 ± 0.442
0.0ProXaa: 0.0 ± 0.0
Gln
0.508GlnAla: 0.508 ± 0.604
0.254GlnCys: 0.254 ± 0.15
2.032GlnAsp: 2.032 ± 0.672
0.762GlnGlu: 0.762 ± 0.937
1.778GlnPhe: 1.778 ± 0.528
1.778GlnGly: 1.778 ± 0.745
0.254GlnHis: 0.254 ± 0.15
2.794GlnIle: 2.794 ± 0.773
2.286GlnLys: 2.286 ± 0.645
1.778GlnLeu: 1.778 ± 0.778
1.27GlnMet: 1.27 ± 0.828
2.54GlnAsn: 2.54 ± 0.636
1.27GlnPro: 1.27 ± 0.605
0.254GlnGln: 0.254 ± 0.302
1.016GlnArg: 1.016 ± 0.726
2.54GlnSer: 2.54 ± 0.536
2.794GlnThr: 2.794 ± 1.388
2.794GlnVal: 2.794 ± 0.583
0.508GlnTrp: 0.508 ± 0.301
0.762GlnTyr: 0.762 ± 0.451
0.0GlnXaa: 0.0 ± 0.0
Arg
2.54ArgAla: 2.54 ± 0.853
1.524ArgCys: 1.524 ± 0.642
2.794ArgAsp: 2.794 ± 0.572
3.81ArgGlu: 3.81 ± 0.953
3.302ArgPhe: 3.302 ± 0.616
3.048ArgGly: 3.048 ± 1.049
2.286ArgHis: 2.286 ± 0.795
2.54ArgIle: 2.54 ± 0.715
3.556ArgLys: 3.556 ± 1.576
3.556ArgLeu: 3.556 ± 0.612
1.778ArgMet: 1.778 ± 0.85
2.54ArgAsn: 2.54 ± 0.844
2.286ArgPro: 2.286 ± 0.712
1.524ArgGln: 1.524 ± 0.591
1.524ArgArg: 1.524 ± 0.643
4.318ArgSer: 4.318 ± 1.29
3.048ArgThr: 3.048 ± 0.909
3.556ArgVal: 3.556 ± 1.648
1.016ArgTrp: 1.016 ± 0.274
2.032ArgTyr: 2.032 ± 0.595
0.0ArgXaa: 0.0 ± 0.0
Ser
4.318SerAla: 4.318 ± 0.429
0.508SerCys: 0.508 ± 0.301
4.064SerAsp: 4.064 ± 0.865
6.604SerGlu: 6.604 ± 2.108
3.81SerPhe: 3.81 ± 0.953
3.556SerGly: 3.556 ± 0.727
1.524SerHis: 1.524 ± 0.571
6.858SerIle: 6.858 ± 1.312
5.334SerLys: 5.334 ± 1.017
7.874SerLeu: 7.874 ± 0.741
2.032SerMet: 2.032 ± 0.431
3.302SerAsn: 3.302 ± 0.489
3.81SerPro: 3.81 ± 1.375
1.524SerGln: 1.524 ± 0.857
3.556SerArg: 3.556 ± 0.87
5.08SerSer: 5.08 ± 0.71
5.842SerThr: 5.842 ± 0.999
4.064SerVal: 4.064 ± 2.278
1.524SerTrp: 1.524 ± 0.384
4.064SerTyr: 4.064 ± 1.03
0.0SerXaa: 0.0 ± 0.0
Thr
1.778ThrAla: 1.778 ± 0.517
1.27ThrCys: 1.27 ± 0.513
2.54ThrAsp: 2.54 ± 0.976
4.064ThrGlu: 4.064 ± 0.747
2.54ThrPhe: 2.54 ± 0.906
3.048ThrGly: 3.048 ± 0.683
2.032ThrHis: 2.032 ± 0.564
3.81ThrIle: 3.81 ± 2.065
2.54ThrLys: 2.54 ± 0.615
3.302ThrLeu: 3.302 ± 0.917
2.54ThrMet: 2.54 ± 0.489
5.08ThrAsn: 5.08 ± 1.011
3.81ThrPro: 3.81 ± 1.898
2.032ThrGln: 2.032 ± 0.695
2.54ThrArg: 2.54 ± 0.636
5.08ThrSer: 5.08 ± 0.56
3.556ThrThr: 3.556 ± 1.148
3.81ThrVal: 3.81 ± 0.938
1.524ThrTrp: 1.524 ± 0.52
3.556ThrTyr: 3.556 ± 1.936
0.0ThrXaa: 0.0 ± 0.0
Val
2.794ValAla: 2.794 ± 0.884
1.524ValCys: 1.524 ± 0.521
2.032ValAsp: 2.032 ± 0.337
4.572ValGlu: 4.572 ± 0.745
2.54ValPhe: 2.54 ± 0.859
2.794ValGly: 2.794 ± 1.052
0.762ValHis: 0.762 ± 0.491
4.826ValIle: 4.826 ± 1.101
4.572ValLys: 4.572 ± 1.43
7.62ValLeu: 7.62 ± 0.733
1.27ValMet: 1.27 ± 0.66
4.064ValAsn: 4.064 ± 0.717
2.794ValPro: 2.794 ± 1.059
1.016ValGln: 1.016 ± 0.752
2.794ValArg: 2.794 ± 0.757
5.588ValSer: 5.588 ± 1.962
2.794ValThr: 2.794 ± 0.453
4.318ValVal: 4.318 ± 1.392
0.762ValTrp: 0.762 ± 0.461
1.524ValTyr: 1.524 ± 1.053
0.0ValXaa: 0.0 ± 0.0
Trp
1.016TrpAla: 1.016 ± 0.274
0.254TrpCys: 0.254 ± 0.15
0.508TrpAsp: 0.508 ± 0.301
0.508TrpGlu: 0.508 ± 0.398
1.524TrpPhe: 1.524 ± 0.631
0.762TrpGly: 0.762 ± 0.451
0.254TrpHis: 0.254 ± 0.15
0.762TrpIle: 0.762 ± 0.404
0.762TrpLys: 0.762 ± 0.451
1.778TrpLeu: 1.778 ± 0.745
0.508TrpMet: 0.508 ± 0.276
0.762TrpAsn: 0.762 ± 0.318
0.508TrpPro: 0.508 ± 0.301
0.254TrpGln: 0.254 ± 0.15
0.508TrpArg: 0.508 ± 0.301
2.286TrpSer: 2.286 ± 0.864
0.762TrpThr: 0.762 ± 0.76
1.524TrpVal: 1.524 ± 0.464
0.254TrpTrp: 0.254 ± 0.347
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.032TyrAla: 2.032 ± 0.919
1.016TyrCys: 1.016 ± 0.349
2.54TyrAsp: 2.54 ± 1.417
1.524TyrGlu: 1.524 ± 0.567
2.286TyrPhe: 2.286 ± 0.672
1.778TyrGly: 1.778 ± 0.608
1.27TyrHis: 1.27 ± 0.521
1.778TyrIle: 1.778 ± 0.586
2.286TyrLys: 2.286 ± 0.794
3.81TyrLeu: 3.81 ± 1.085
0.508TyrMet: 0.508 ± 0.398
1.016TyrAsn: 1.016 ± 0.726
2.54TyrPro: 2.54 ± 0.766
2.032TyrGln: 2.032 ± 0.721
2.794TyrArg: 2.794 ± 1.815
3.81TyrSer: 3.81 ± 1.114
2.54TyrThr: 2.54 ± 0.96
1.778TyrVal: 1.778 ± 0.651
0.508TyrTrp: 0.508 ± 0.297
0.762TyrTyr: 0.762 ± 0.495
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3938 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski