Amino acid dipepetide frequency for Hubei myriapoda virus 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.208AlaAla: 2.208 ± 1.619
0.662AlaCys: 0.662 ± 0.207
1.545AlaAsp: 1.545 ± 0.713
2.87AlaGlu: 2.87 ± 0.943
1.545AlaPhe: 1.545 ± 0.713
2.87AlaGly: 2.87 ± 2.591
1.325AlaHis: 1.325 ± 0.997
2.87AlaIle: 2.87 ± 0.924
1.545AlaLys: 1.545 ± 0.843
2.87AlaLeu: 2.87 ± 1.08
0.883AlaMet: 0.883 ± 0.942
2.428AlaAsn: 2.428 ± 1.282
0.883AlaPro: 0.883 ± 1.003
1.325AlaGln: 1.325 ± 0.522
1.766AlaArg: 1.766 ± 1.723
2.649AlaSer: 2.649 ± 1.233
2.428AlaThr: 2.428 ± 1.076
1.766AlaVal: 1.766 ± 1.195
0.442AlaTrp: 0.442 ± 0.241
1.104AlaTyr: 1.104 ± 0.466
0.0AlaXaa: 0.0 ± 0.0
Cys
1.545CysAla: 1.545 ± 0.739
0.442CysCys: 0.442 ± 0.735
0.662CysAsp: 0.662 ± 0.361
1.325CysGlu: 1.325 ± 0.348
1.325CysPhe: 1.325 ± 1.273
0.662CysGly: 0.662 ± 0.361
1.325CysHis: 1.325 ± 0.348
1.104CysIle: 1.104 ± 0.257
2.428CysLys: 2.428 ± 0.083
1.325CysLeu: 1.325 ± 0.348
1.325CysMet: 1.325 ± 0.722
1.104CysAsn: 1.104 ± 0.602
0.442CysPro: 0.442 ± 0.431
0.883CysGln: 0.883 ± 0.421
1.104CysArg: 1.104 ± 0.257
1.987CysSer: 1.987 ± 1.084
2.428CysThr: 2.428 ± 0.393
0.442CysVal: 0.442 ± 0.273
0.221CysTrp: 0.221 ± 0.12
0.662CysTyr: 0.662 ± 0.636
0.0CysXaa: 0.0 ± 0.0
Asp
4.857AspAla: 4.857 ± 2.564
1.325AspCys: 1.325 ± 0.413
7.506AspAsp: 7.506 ± 2.017
2.649AspGlu: 2.649 ± 1.445
2.208AspPhe: 2.208 ± 0.795
2.208AspGly: 2.208 ± 0.504
1.987AspHis: 1.987 ± 0.444
4.857AspIle: 4.857 ± 0.987
6.623AspLys: 6.623 ± 1.575
5.519AspLeu: 5.519 ± 0.972
2.208AspMet: 2.208 ± 0.448
3.091AspAsn: 3.091 ± 0.907
1.766AspPro: 1.766 ± 0.662
3.311AspGln: 3.311 ± 0.562
2.428AspArg: 2.428 ± 0.083
4.636AspSer: 4.636 ± 0.529
3.532AspThr: 3.532 ± 0.965
5.74AspVal: 5.74 ± 0.77
0.221AspTrp: 0.221 ± 0.368
2.208AspTyr: 2.208 ± 0.045
0.0AspXaa: 0.0 ± 0.0
Glu
1.104GluAla: 1.104 ± 0.299
1.987GluCys: 1.987 ± 0.62
5.077GluAsp: 5.077 ± 1.97
5.519GluGlu: 5.519 ± 1.773
2.87GluPhe: 2.87 ± 0.639
2.87GluGly: 2.87 ± 0.943
2.208GluHis: 2.208 ± 0.045
4.415GluIle: 4.415 ± 0.424
5.077GluLys: 5.077 ± 0.726
5.519GluLeu: 5.519 ± 0.386
3.091GluMet: 3.091 ± 0.173
2.649GluAsn: 2.649 ± 0.202
1.766GluPro: 1.766 ± 0.282
2.87GluGln: 2.87 ± 0.433
1.545GluArg: 1.545 ± 0.402
4.415GluSer: 4.415 ± 0.473
1.987GluThr: 1.987 ± 0.347
3.753GluVal: 3.753 ± 1.243
0.662GluTrp: 0.662 ± 0.907
3.091GluTyr: 3.091 ± 0.797
0.0GluXaa: 0.0 ± 0.0
Phe
0.662PheAla: 0.662 ± 0.883
1.545PheCys: 1.545 ± 0.453
2.208PheAsp: 2.208 ± 0.938
2.87PheGlu: 2.87 ± 0.943
1.766PhePhe: 1.766 ± 0.671
1.545PheGly: 1.545 ± 0.453
1.104PheHis: 1.104 ± 0.602
2.428PheIle: 2.428 ± 0.767
5.077PheLys: 5.077 ± 0.754
5.077PheLeu: 5.077 ± 0.588
1.545PheMet: 1.545 ± 0.402
3.091PheAsn: 3.091 ± 1.245
1.766PhePro: 1.766 ± 0.564
2.87PheGln: 2.87 ± 0.639
1.766PheArg: 1.766 ± 0.842
3.311PheSer: 3.311 ± 0.779
2.428PheThr: 2.428 ± 0.6
1.545PheVal: 1.545 ± 0.852
0.883PheTrp: 0.883 ± 0.2
1.545PheTyr: 1.545 ± 0.453
0.0PheXaa: 0.0 ± 0.0
Gly
1.104GlyAla: 1.104 ± 0.299
1.325GlyCys: 1.325 ± 0.722
2.208GlyAsp: 2.208 ± 0.448
1.325GlyGlu: 1.325 ± 0.722
2.428GlyPhe: 2.428 ± 1.059
2.428GlyGly: 2.428 ± 0.522
0.883GlyHis: 0.883 ± 0.2
2.87GlyIle: 2.87 ± 1.127
3.091GlyLys: 3.091 ± 0.907
3.311GlyLeu: 3.311 ± 2.492
1.987GlyMet: 1.987 ± 0.632
2.87GlyAsn: 2.87 ± 0.543
0.883GlyPro: 0.883 ± 0.421
2.87GlyGln: 2.87 ± 1.031
1.545GlyArg: 1.545 ± 0.217
3.311GlySer: 3.311 ± 0.562
1.545GlyThr: 1.545 ± 0.889
1.987GlyVal: 1.987 ± 0.688
0.442GlyTrp: 0.442 ± 0.241
1.104GlyTyr: 1.104 ± 0.602
0.0GlyXaa: 0.0 ± 0.0
His
1.766HisAla: 1.766 ± 0.734
1.104HisCys: 1.104 ± 0.469
1.766HisAsp: 1.766 ± 0.263
1.545HisGlu: 1.545 ± 0.821
2.428HisPhe: 2.428 ± 0.582
1.104HisGly: 1.104 ± 0.257
1.766HisHis: 1.766 ± 0.564
2.87HisIle: 2.87 ± 0.793
1.766HisLys: 1.766 ± 0.963
2.649HisLeu: 2.649 ± 0.826
0.883HisMet: 0.883 ± 0.482
1.325HisAsn: 1.325 ± 0.413
1.325HisPro: 1.325 ± 0.348
2.428HisGln: 2.428 ± 0.867
1.545HisArg: 1.545 ± 0.619
2.87HisSer: 2.87 ± 0.799
1.766HisThr: 1.766 ± 0.282
1.545HisVal: 1.545 ± 0.217
0.0HisTrp: 0.0 ± 0.0
1.325HisTyr: 1.325 ± 0.348
0.0HisXaa: 0.0 ± 0.0
Ile
1.104IleAla: 1.104 ± 0.469
0.883IleCys: 0.883 ± 0.2
4.636IleAsp: 4.636 ± 0.869
4.636IleGlu: 4.636 ± 0.556
3.753IlePhe: 3.753 ± 1.659
1.987IleGly: 1.987 ± 0.679
1.987IleHis: 1.987 ± 0.679
5.74IleIle: 5.74 ± 1.279
5.298IleLys: 5.298 ± 2.014
7.064IleLeu: 7.064 ± 1.666
2.208IleMet: 2.208 ± 0.514
4.194IleAsn: 4.194 ± 1.882
3.753IlePro: 3.753 ± 1.718
4.194IleGln: 4.194 ± 1.865
3.311IleArg: 3.311 ± 1.455
7.506IleSer: 7.506 ± 0.621
4.415IleThr: 4.415 ± 1.325
5.519IleVal: 5.519 ± 0.73
1.104IleTrp: 1.104 ± 0.856
2.208IleTyr: 2.208 ± 0.514
0.0IleXaa: 0.0 ± 0.0
Lys
2.428LysAla: 2.428 ± 0.637
1.545LysCys: 1.545 ± 0.388
3.753LysAsp: 3.753 ± 1.625
4.636LysGlu: 4.636 ± 0.421
3.974LysPhe: 3.974 ± 0.694
2.649LysGly: 2.649 ± 1.445
2.649LysHis: 2.649 ± 0.599
7.064LysIle: 7.064 ± 1.761
7.947LysLys: 7.947 ± 2.746
6.843LysLeu: 6.843 ± 2.502
3.974LysMet: 3.974 ± 1.325
2.87LysAsn: 2.87 ± 1.149
2.428LysPro: 2.428 ± 0.6
1.545LysGln: 1.545 ± 0.852
2.208LysArg: 2.208 ± 0.514
9.272LysSer: 9.272 ± 2.213
4.415LysThr: 4.415 ± 1.007
3.532LysVal: 3.532 ± 0.855
1.987LysTrp: 1.987 ± 0.444
2.649LysTyr: 2.649 ± 0.673
0.0LysXaa: 0.0 ± 0.0
Leu
2.87LeuAla: 2.87 ± 1.08
2.208LeuCys: 2.208 ± 0.795
4.636LeuAsp: 4.636 ± 0.058
4.194LeuGlu: 4.194 ± 1.021
3.532LeuPhe: 3.532 ± 0.304
1.766LeuGly: 1.766 ± 0.786
2.649LeuHis: 2.649 ± 0.673
5.298LeuIle: 5.298 ± 0.155
5.519LeuLys: 5.519 ± 2.179
5.96LeuLeu: 5.96 ± 1.14
3.974LeuMet: 3.974 ± 0.769
7.506LeuAsn: 7.506 ± 0.744
4.636LeuPro: 4.636 ± 1.642
3.311LeuGln: 3.311 ± 0.771
3.311LeuArg: 3.311 ± 0.877
7.064LeuSer: 7.064 ± 1.995
7.064LeuThr: 7.064 ± 1.666
3.311LeuVal: 3.311 ± 0.343
1.766LeuTrp: 1.766 ± 0.4
3.311LeuTyr: 3.311 ± 0.683
0.0LeuXaa: 0.0 ± 0.0
Met
0.662MetAla: 0.662 ± 0.409
1.325MetCys: 1.325 ± 0.348
1.766MetAsp: 1.766 ± 0.4
2.87MetGlu: 2.87 ± 0.322
2.428MetPhe: 2.428 ± 0.582
1.545MetGly: 1.545 ± 0.453
0.883MetHis: 0.883 ± 0.2
2.208MetIle: 2.208 ± 0.514
4.857MetLys: 4.857 ± 1.164
2.428MetLeu: 2.428 ± 0.867
2.649MetMet: 2.649 ± 0.599
3.091MetAsn: 3.091 ± 1.038
1.325MetPro: 1.325 ± 0.817
1.325MetGln: 1.325 ± 0.348
1.325MetArg: 1.325 ± 0.778
3.974MetSer: 3.974 ± 0.923
0.883MetThr: 0.883 ± 1.218
2.649MetVal: 2.649 ± 0.464
0.442MetTrp: 0.442 ± 0.966
0.221MetTyr: 0.221 ± 0.368
0.0MetXaa: 0.0 ± 0.0
Asn
1.766AsnAla: 1.766 ± 1.723
1.545AsnCys: 1.545 ± 0.713
4.415AsnAsp: 4.415 ± 1.164
4.857AsnGlu: 4.857 ± 1.657
1.766AsnPhe: 1.766 ± 1.092
1.987AsnGly: 1.987 ± 0.62
1.766AsnHis: 1.766 ± 0.671
4.636AsnIle: 4.636 ± 1.731
4.415AsnLys: 4.415 ± 1.126
5.519AsnLeu: 5.519 ± 1.356
2.208AsnMet: 2.208 ± 1.178
3.091AsnAsn: 3.091 ± 0.173
3.091AsnPro: 3.091 ± 0.173
2.428AsnGln: 2.428 ± 1.059
1.987AsnArg: 1.987 ± 0.162
4.636AsnSer: 4.636 ± 1.205
2.428AsnThr: 2.428 ± 0.083
3.753AsnVal: 3.753 ± 1.278
0.442AsnTrp: 0.442 ± 0.241
2.208AsnTyr: 2.208 ± 0.589
0.0AsnXaa: 0.0 ± 0.0
Pro
1.325ProAla: 1.325 ± 0.232
1.104ProCys: 1.104 ± 0.469
3.532ProAsp: 3.532 ± 1.136
4.857ProGlu: 4.857 ± 1.405
1.766ProPhe: 1.766 ± 0.263
2.428ProGly: 2.428 ± 0.522
0.442ProHis: 0.442 ± 0.241
2.649ProIle: 2.649 ± 0.697
2.87ProLys: 2.87 ± 1.6
1.766ProLeu: 1.766 ± 0.564
0.883ProMet: 0.883 ± 0.763
2.428ProAsn: 2.428 ± 0.083
0.662ProPro: 0.662 ± 0.361
1.545ProGln: 1.545 ± 0.821
1.766ProArg: 1.766 ± 0.734
1.766ProSer: 1.766 ± 0.671
0.442ProThr: 0.442 ± 0.431
1.766ProVal: 1.766 ± 0.4
0.0ProTrp: 0.0 ± 0.0
1.104ProTyr: 1.104 ± 0.257
0.0ProXaa: 0.0 ± 0.0
Gln
1.766GlnAla: 1.766 ± 1.195
0.883GlnCys: 0.883 ± 0.482
2.87GlnAsp: 2.87 ± 0.322
2.87GlnGlu: 2.87 ± 0.433
1.766GlnPhe: 1.766 ± 0.564
1.987GlnGly: 1.987 ± 0.632
0.883GlnHis: 0.883 ± 0.482
5.519GlnIle: 5.519 ± 1.028
5.519GlnLys: 5.519 ± 2.209
3.974GlnLeu: 3.974 ± 0.261
2.649GlnMet: 2.649 ± 0.604
3.091GlnAsn: 3.091 ± 0.698
1.104GlnPro: 1.104 ± 0.469
0.883GlnGln: 0.883 ± 0.482
1.545GlnArg: 1.545 ± 1.813
2.428GlnSer: 2.428 ± 0.6
0.883GlnThr: 0.883 ± 0.2
2.649GlnVal: 2.649 ± 0.289
0.883GlnTrp: 0.883 ± 0.482
1.325GlnTyr: 1.325 ± 0.534
0.0GlnXaa: 0.0 ± 0.0
Arg
1.104ArgAla: 1.104 ± 0.831
0.883ArgCys: 0.883 ± 0.421
3.091ArgAsp: 3.091 ± 1.425
1.987ArgGlu: 1.987 ± 0.347
3.311ArgPhe: 3.311 ± 0.86
1.325ArgGly: 1.325 ± 1.292
2.649ArgHis: 2.649 ± 0.673
2.428ArgIle: 2.428 ± 0.393
2.428ArgLys: 2.428 ± 0.912
3.091ArgLeu: 3.091 ± 1.263
1.766ArgMet: 1.766 ± 0.263
1.766ArgAsn: 1.766 ± 0.263
0.662ArgPro: 0.662 ± 0.361
1.766ArgGln: 1.766 ± 1.237
1.987ArgArg: 1.987 ± 0.632
3.753ArgSer: 3.753 ± 1.197
3.532ArgThr: 3.532 ± 1.042
3.753ArgVal: 3.753 ± 1.289
0.442ArgTrp: 0.442 ± 0.431
0.662ArgTyr: 0.662 ± 0.361
0.0ArgXaa: 0.0 ± 0.0
Ser
3.753SerAla: 3.753 ± 0.443
1.545SerCys: 1.545 ± 0.388
7.947SerAsp: 7.947 ± 0.742
6.623SerGlu: 6.623 ± 1.394
3.311SerPhe: 3.311 ± 0.216
2.649SerGly: 2.649 ± 1.03
3.091SerHis: 3.091 ± 1.91
6.843SerIle: 6.843 ± 0.976
5.74SerLys: 5.74 ± 0.812
7.947SerLeu: 7.947 ± 2.665
2.649SerMet: 2.649 ± 0.599
3.753SerAsn: 3.753 ± 0.803
2.428SerPro: 2.428 ± 0.876
4.194SerGln: 4.194 ± 0.635
3.753SerArg: 3.753 ± 0.803
7.285SerSer: 7.285 ± 1.632
2.208SerThr: 2.208 ± 0.795
3.974SerVal: 3.974 ± 0.371
0.221SerTrp: 0.221 ± 0.12
3.311SerTyr: 3.311 ± 0.216
0.0SerXaa: 0.0 ± 0.0
Thr
2.208ThrAla: 2.208 ± 2.464
0.662ThrCys: 0.662 ± 0.409
3.091ThrAsp: 3.091 ± 0.173
1.987ThrGlu: 1.987 ± 0.347
2.208ThrPhe: 2.208 ± 0.599
2.208ThrGly: 2.208 ± 1.722
1.545ThrHis: 1.545 ± 0.453
3.974ThrIle: 3.974 ± 1.106
1.766ThrLys: 1.766 ± 0.564
4.636ThrLeu: 4.636 ± 0.556
1.545ThrMet: 1.545 ± 0.889
3.753ThrAsn: 3.753 ± 1.659
2.428ThrPro: 2.428 ± 0.912
2.208ThrGln: 2.208 ± 0.514
3.532ThrArg: 3.532 ± 1.93
4.194ThrSer: 4.194 ± 1.882
2.208ThrThr: 2.208 ± 0.589
2.428ThrVal: 2.428 ± 0.876
1.104ThrTrp: 1.104 ± 0.257
1.325ThrTyr: 1.325 ± 0.413
0.0ThrXaa: 0.0 ± 0.0
Val
1.987ValAla: 1.987 ± 0.162
0.662ValCys: 0.662 ± 0.361
4.415ValAsp: 4.415 ± 0.927
2.649ValGlu: 2.649 ± 0.202
0.883ValPhe: 0.883 ± 0.546
2.428ValGly: 2.428 ± 0.767
2.649ValHis: 2.649 ± 1.207
4.415ValIle: 4.415 ± 0.999
3.311ValLys: 3.311 ± 1.018
3.974ValLeu: 3.974 ± 0.923
0.883ValMet: 0.883 ± 0.942
3.974ValAsn: 3.974 ± 1.456
1.987ValPro: 1.987 ± 0.618
3.311ValGln: 3.311 ± 0.562
4.194ValArg: 4.194 ± 1.249
4.415ValSer: 4.415 ± 0.576
2.87ValThr: 2.87 ± 1.555
2.208ValVal: 2.208 ± 0.977
0.662ValTrp: 0.662 ± 0.361
2.649ValTyr: 2.649 ± 0.599
0.0ValXaa: 0.0 ± 0.0
Trp
0.442TrpAla: 0.442 ± 0.966
0.0TrpCys: 0.0 ± 0.0
0.883TrpAsp: 0.883 ± 0.421
0.662TrpGlu: 0.662 ± 0.361
0.221TrpPhe: 0.221 ± 0.12
0.662TrpGly: 0.662 ± 0.409
0.662TrpHis: 0.662 ± 0.361
0.883TrpIle: 0.883 ± 0.546
1.104TrpLys: 1.104 ± 0.469
1.104TrpLeu: 1.104 ± 0.602
0.442TrpMet: 0.442 ± 0.241
1.104TrpAsn: 1.104 ± 0.831
0.221TrpPro: 0.221 ± 0.12
0.662TrpGln: 0.662 ± 0.361
0.221TrpArg: 0.221 ± 0.12
1.545TrpSer: 1.545 ± 0.402
0.662TrpThr: 0.662 ± 0.883
0.662TrpVal: 0.662 ± 0.361
0.0TrpTrp: 0.0 ± 0.0
0.883TrpTyr: 0.883 ± 0.393
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.325TyrAla: 1.325 ± 0.817
1.104TyrCys: 1.104 ± 0.257
2.87TyrAsp: 2.87 ± 0.793
1.545TyrGlu: 1.545 ± 0.453
1.545TyrPhe: 1.545 ± 0.453
2.208TyrGly: 2.208 ± 0.795
1.325TyrHis: 1.325 ± 0.534
2.428TyrIle: 2.428 ± 0.637
1.987TyrLys: 1.987 ± 1.01
3.311TyrLeu: 3.311 ± 0.562
0.883TyrMet: 0.883 ± 0.482
1.987TyrAsn: 1.987 ± 1.405
1.545TyrPro: 1.545 ± 0.453
1.545TyrGln: 1.545 ± 0.453
1.545TyrArg: 1.545 ± 0.388
2.208TyrSer: 2.208 ± 0.514
0.883TyrThr: 0.883 ± 0.393
1.545TyrVal: 1.545 ± 0.453
1.104TyrTrp: 1.104 ± 0.856
0.442TyrTyr: 0.442 ± 0.273
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (4531 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski