Amino acid dipeptide frequency for Hubei tombus-like virus 21

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
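The calculation described above can be sketched in Python. This is a minimal illustration, not the code used to produce this page: it assumes that dipeptides are counted as overlapping pairs within each protein and pooled over the proteome, and that the expected value is the uniform 1/400. The names `dipeptide_permille`, `classify`, and `EXPECTED_PERMILLE` are hypothetical, and the sequences in the usage lines are toy strings, not the virus proteins.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"          # X (Xaa) stands for any non-standard or unknown residue
EXPECTED_PERMILLE = 1000.0 / 400   # uniform expectation: 0.25 % = 2.5 per mille


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: overlapping pairs are counted within each protein and the
    counts are pooled over all proteins before normalising.
    """
    counts = Counter()
    for seq in proteins:
        # Map every residue outside the 20 standard ones to X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Toy usage with made-up sequences.
freqs = dipeptide_permille(["MKLAAG", "MAALK"])
labels = {pair: classify(value) for pair, value in freqs.items()}
```

With this convention, a value from the table such as 6.574‰ would be labelled "over" and 0.822‰ "under".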

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency ± error in ‰.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 6.574 ± 4.598 | 0.0 ± 0.0 | 0.822 ± 0.699 | 1.643 ± 0.666 | 0.0 ± 0.0 | 4.93 ± 1.331 | 3.287 ± 1.494 | 4.108 ± 1.479 | 9.86 ± 3.585 | 2.465 ± 1.218 | 4.93 ± 2.056 | 5.752 ± 1.94 | 5.752 ± 2.534 | 2.465 ± 1.235 | 2.465 ± 1.197 | 3.287 ± 1.494 | 5.752 ± 2.876 | 4.108 ± 0.826 | 1.643 ± 0.703 | 4.93 ± 0.878 | 0.0 ± 0.0
Cys | 1.643 ± 0.628 | 0.822 ± 0.582 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.822 ± 0.582 | 1.643 ± 1.401 | 0.822 ± 0.701 | 0.822 ± 0.582 | 1.643 ± 0.628 | 0.822 ± 0.701 | 0.822 ± 0.701 | 0.822 ± 0.701 | 0.822 ± 0.701 | 0.822 ± 0.582 | 1.643 ± 0.628 | 2.465 ± 0.988 | 0.0 ± 0.0 | 1.643 ± 0.703 | 0.0 ± 0.0 | 0.822 ± 0.582 | 0.0 ± 0.0
Asp | 2.465 ± 1.235 | 0.822 ± 0.699 | 0.822 ± 0.582 | 0.822 ± 0.582 | 1.643 ± 0.628 | 1.643 ± 0.666 | 1.643 ± 0.666 | 1.643 ± 0.703 | 0.0 ± 0.0 | 5.752 ± 2.466 | 0.822 ± 0.582 | 1.643 ± 0.628 | 3.287 ± 1.332 | 1.643 ± 0.703 | 0.0 ± 0.0 | 6.574 ± 1.983 | 0.822 ± 0.699 | 2.465 ± 0.123 | 0.822 ± 0.582 | 0.0 ± 0.0 | 0.0 ± 0.0
Glu | 1.643 ± 0.703 | 0.822 ± 0.701 | 3.287 ± 1.407 | 4.108 ± 0.654 | 0.822 ± 0.582 | 4.93 ± 1.272 | 1.643 ± 0.628 | 0.822 ± 0.699 | 3.287 ± 2.328 | 3.287 ± 1.332 | 0.822 ± 0.699 | 0.822 ± 0.582 | 1.643 ± 1.164 | 2.465 ± 0.988 | 3.287 ± 1.494 | 1.643 ± 1.398 | 0.0 ± 0.0 | 2.465 ± 1.197 | 0.0 ± 0.0 | 3.287 ± 1.494 | 0.0 ± 0.0
Phe | 0.822 ± 0.699 | 0.822 ± 0.701 | 0.822 ± 0.582 | 1.643 ± 0.666 | 0.0 ± 0.0 | 1.643 ± 1.401 | 0.0 ± 0.0 | 3.287 ± 1.545 | 1.643 ± 0.703 | 0.0 ± 0.0 | 1.643 ± 1.164 | 2.465 ± 0.123 | 1.643 ± 0.703 | 2.465 ± 1.037 | 2.465 ± 0.123 | 3.287 ± 1.894 | 3.287 ± 0.461 | 2.465 ± 0.123 | 0.0 ± 0.0 | 0.822 ± 0.582 | 0.0 ± 0.0
Gly | 3.287 ± 0.783 | 0.822 ± 0.582 | 4.108 ± 0.654 | 3.287 ± 0.754 | 2.465 ± 1.235 | 4.108 ± 1.858 | 2.465 ± 1.197 | 0.822 ± 0.699 | 6.574 ± 2.522 | 3.287 ± 1.332 | 0.822 ± 0.701 | 4.93 ± 1.331 | 1.643 ± 0.703 | 4.93 ± 2.149 | 1.643 ± 0.666 | 1.643 ± 1.164 | 4.93 ± 2.11 | 5.752 ± 2.207 | 1.643 ± 1.164 | 1.643 ± 1.398 | 0.0 ± 0.0
His | 2.465 ± 1.037 | 0.822 ± 0.701 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.643 ± 0.703 | 1.643 ± 1.164 | 0.822 ± 0.701 | 1.643 ± 1.164 | 0.822 ± 0.701 | 2.465 ± 0.988 | 0.0 ± 0.0 | 2.465 ± 2.102 | 2.465 ± 1.235 | 2.465 ± 0.123 | 3.287 ± 1.407 | 3.287 ± 1.545 | 3.287 ± 1.332 | 1.643 ± 0.666 | 0.822 ± 0.582 | 1.643 ± 1.164 | 0.0 ± 0.0
Ile | 4.108 ± 0.575 | 0.0 ± 0.0 | 0.822 ± 0.582 | 4.108 ± 2.091 | 4.108 ± 1.043 | 0.0 ± 0.0 | 1.643 ± 1.164 | 1.643 ± 1.164 | 2.465 ± 1.746 | 2.465 ± 1.218 | 1.643 ± 1.164 | 4.93 ± 2.075 | 2.465 ± 1.218 | 1.643 ± 0.703 | 3.287 ± 0.783 | 2.465 ± 2.097 | 0.822 ± 0.582 | 5.752 ± 0.876 | 0.822 ± 0.699 | 1.643 ± 1.398 | 0.0 ± 0.0
Lys | 6.574 ± 2.603 | 1.643 ± 0.628 | 2.465 ± 1.746 | 2.465 ± 1.197 | 0.822 ± 0.582 | 4.93 ± 2.604 | 2.465 ± 1.215 | 3.287 ± 1.256 | 4.93 ± 2.393 | 7.395 ± 0.37 | 2.465 ± 0.988 | 1.643 ± 0.628 | 4.93 ± 1.884 | 2.465 ± 2.102 | 4.93 ± 0.247 | 4.93 ± 2.149 | 4.93 ± 2.435 | 2.465 ± 0.988 | 4.108 ± 1.043 | 2.465 ± 0.123 | 0.0 ± 0.0
Leu | 11.504 ± 4.69 | 0.822 ± 0.701 | 2.465 ± 0.123 | 4.93 ± 0.959 | 0.0 ± 0.0 | 6.574 ± 1.455 | 2.465 ± 1.235 | 2.465 ± 0.988 | 5.752 ± 1.946 | 3.287 ± 0.783 | 1.643 ± 0.628 | 1.643 ± 0.628 | 4.93 ± 0.247 | 1.643 ± 0.628 | 8.217 ± 3.053 | 5.752 ± 1.959 | 4.108 ± 0.654 | 4.93 ± 0.247 | 0.0 ± 0.0 | 2.465 ± 0.123 | 0.0 ± 0.0
Met | 4.108 ± 1.043 | 0.0 ± 0.0 | 0.822 ± 0.582 | 0.822 ± 0.582 | 0.822 ± 0.699 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.465 ± 0.988 | 4.108 ± 1.043 | 1.643 ± 1.164 | 1.643 ± 0.628 | 1.643 ± 0.703 | 1.643 ± 0.628 | 2.465 ± 1.197 | 4.108 ± 2.042 | 0.822 ± 0.582 | 2.465 ± 1.037 | 0.822 ± 0.699 | 0.822 ± 0.699 | 0.0 ± 0.0
Asn | 1.643 ± 0.628 | 0.822 ± 0.701 | 2.465 ± 1.235 | 2.465 ± 0.988 | 0.0 ± 0.0 | 4.93 ± 0.878 | 0.0 ± 0.0 | 2.465 ± 1.235 | 4.108 ± 1.45 | 2.465 ± 2.102 | 0.0 ± 0.0 | 1.643 ± 0.703 | 2.465 ± 1.197 | 2.465 ± 1.197 | 0.822 ± 0.701 | 4.108 ± 1.043 | 7.395 ± 2.261 | 4.93 ± 1.331 | 1.643 ± 0.666 | 2.465 ± 1.746 | 0.0 ± 0.0
Pro | 8.217 ± 2.657 | 0.822 ± 0.582 | 2.465 ± 1.037 | 1.643 ± 1.398 | 3.287 ± 2.328 | 4.93 ± 2.435 | 0.822 ± 0.701 | 4.108 ± 0.654 | 3.287 ± 1.858 | 3.287 ± 1.256 | 2.465 ± 0.123 | 0.822 ± 0.701 | 3.287 ± 0.754 | 3.287 ± 1.858 | 7.395 ± 2.202 | 3.287 ± 0.783 | 4.108 ± 1.861 | 3.287 ± 0.783 | 0.822 ± 0.699 | 0.822 ± 0.699 | 0.0 ± 0.0
Gln | 3.287 ± 0.461 | 1.643 ± 0.628 | 0.0 ± 0.0 | 2.465 ± 1.197 | 0.822 ± 0.582 | 0.0 ± 0.0 | 0.822 ± 0.582 | 2.465 ± 1.235 | 2.465 ± 1.197 | 6.574 ± 0.546 | 1.643 ± 0.619 | 0.822 ± 0.582 | 5.752 ± 3.062 | 3.287 ± 1.858 | 4.108 ± 1.861 | 4.93 ± 0.247 | 1.643 ± 0.666 | 3.287 ± 1.407 | 0.0 ± 0.0 | 0.822 ± 0.582 | 0.0 ± 0.0
Arg | 4.93 ± 2.149 | 1.643 ± 0.628 | 1.643 ± 0.628 | 0.822 ± 0.582 | 1.643 ± 1.401 | 3.287 ± 0.461 | 4.108 ± 2.042 | 1.643 ± 1.398 | 4.93 ± 2.149 | 10.682 ± 4.364 | 3.287 ± 2.328 | 5.752 ± 0.343 | 3.287 ± 1.256 | 4.108 ± 0.826 | 1.643 ± 0.703 | 5.752 ± 0.876 | 0.822 ± 0.699 | 3.287 ± 0.783 | 0.0 ± 0.0 | 2.465 ± 1.037 | 0.0 ± 0.0
Ser | 3.287 ± 1.332 | 2.465 ± 0.988 | 4.108 ± 1.043 | 1.643 ± 1.398 | 3.287 ± 1.853 | 3.287 ± 0.461 | 1.643 ± 0.666 | 4.93 ± 1.625 | 8.217 ± 0.902 | 5.752 ± 0.343 | 1.643 ± 0.666 | 4.93 ± 0.959 | 5.752 ± 3.062 | 2.465 ± 1.235 | 4.108 ± 2.042 | 4.108 ± 0.575 | 4.93 ± 1.999 | 5.752 ± 2.503 | 1.643 ± 0.628 | 1.643 ± 1.398 | 0.0 ± 0.0
Thr | 1.643 ± 0.628 | 2.465 ± 2.102 | 2.465 ± 0.123 | 1.643 ± 0.666 | 2.465 ± 0.988 | 3.287 ± 1.894 | 2.465 ± 0.988 | 3.287 ± 0.783 | 3.287 ± 1.858 | 5.752 ± 1.314 | 1.643 ± 1.212 | 2.465 ± 2.097 | 2.465 ± 1.215 | 2.465 ± 1.218 | 4.108 ± 1.858 | 6.574 ± 2.633 | 4.108 ± 1.644 | 6.574 ± 3.787 | 0.822 ± 0.699 | 0.0 ± 0.0 | 0.0 ± 0.0
Val | 1.643 ± 1.398 | 0.0 ± 0.0 | 3.287 ± 0.461 | 4.93 ± 1.884 | 4.108 ± 1.858 | 7.395 ± 2.505 | 3.287 ± 1.894 | 4.93 ± 0.878 | 4.93 ± 0.878 | 4.108 ± 1.479 | 2.465 ± 1.235 | 3.287 ± 0.461 | 4.93 ± 0.959 | 0.822 ± 0.582 | 5.752 ± 1.529 | 5.752 ± 0.343 | 3.287 ± 1.853 | 4.93 ± 1.331 | 1.643 ± 0.703 | 0.0 ± 0.0 | 0.0 ± 0.0
Trp | 0.822 ± 0.582 | 0.0 ± 0.0 | 0.822 ± 0.699 | 0.822 ± 0.582 | 0.822 ± 0.582 | 1.643 ± 0.703 | 0.822 ± 0.699 | 0.0 ± 0.0 | 0.822 ± 0.699 | 0.822 ± 0.701 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.643 ± 0.666 | 1.643 ± 0.666 | 0.822 ± 0.699 | 0.822 ± 0.699 | 1.643 ± 1.164 | 2.465 ± 1.197 | 0.0 ± 0.0 | 0.822 ± 0.582 | 0.0 ± 0.0
Tyr | 4.93 ± 1.272 | 1.643 ± 1.164 | 1.643 ± 1.164 | 0.822 ± 0.699 | 1.643 ± 1.398 | 0.0 ± 0.0 | 2.465 ± 0.123 | 2.465 ± 0.988 | 0.822 ± 0.699 | 0.822 ± 0.701 | 0.0 ± 0.0 | 0.822 ± 0.701 | 1.643 ± 0.666 | 1.643 ± 1.164 | 3.287 ± 2.328 | 0.822 ± 0.582 | 3.287 ± 1.332 | 0.822 ± 0.699 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 3 proteins (1218 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
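A protein-level bootstrap of this kind could look like the sketch below. The page states only that 100 bootstrap rounds were run at the protein level. Resampling whole proteins with replacement, recomputing the pooled frequency each round, and reporting the standard deviation of the estimates are all assumptions made for illustration. The function names `dipeptide_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: each round draws len(proteins) whole proteins with
    replacement and recomputes the pooled frequency from that sample.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = [rng.choice(per_protein) for _ in per_protein]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.pstdev(estimates)


# Toy usage with made-up sequences, not the virus proteins.
mean_permille, error_permille = bootstrap_error(["MKLAAG", "MAALK", "AAKLM"], "AA")
```

With only three proteins, as in this proteome, each resample contains few distinct combinations, which may explain why some errors in the table are large relative to the values themselves.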

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski