Amino acid dipepetide frequency for Hubei toti-like virus 21

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.866AlaAla: 6.866 ± 0.637
1.962AlaCys: 1.962 ± 1.088
2.452AlaAsp: 2.452 ± 0.567
1.471AlaGlu: 1.471 ± 0.023
1.471AlaPhe: 1.471 ± 0.77
7.847AlaGly: 7.847 ± 0.388
2.943AlaHis: 2.943 ± 0.046
4.414AlaIle: 4.414 ± 2.448
3.923AlaLys: 3.923 ± 0.995
6.866AlaLeu: 6.866 ± 0.156
2.943AlaMet: 2.943 ± 1.539
4.904AlaAsn: 4.904 ± 1.927
3.433AlaPro: 3.433 ± 1.111
2.452AlaGln: 2.452 ± 0.567
5.885AlaArg: 5.885 ± 0.093
9.318AlaSer: 9.318 ± 1.996
7.847AlaThr: 7.847 ± 1.991
4.414AlaVal: 4.414 ± 0.862
0.49AlaTrp: 0.49 ± 0.272
1.962AlaTyr: 1.962 ± 0.498
0.0AlaXaa: 0.0 ± 0.0
Cys
0.981CysAla: 0.981 ± 0.544
0.0CysCys: 0.0 ± 0.0
0.981CysAsp: 0.981 ± 0.544
0.981CysGlu: 0.981 ± 0.249
1.471CysPhe: 1.471 ± 0.023
0.981CysGly: 0.981 ± 0.249
0.49CysHis: 0.49 ± 0.272
0.981CysIle: 0.981 ± 0.544
0.49CysLys: 0.49 ± 0.521
1.471CysLeu: 1.471 ± 0.816
0.0CysMet: 0.0 ± 0.0
0.981CysAsn: 0.981 ± 0.249
0.49CysPro: 0.49 ± 0.272
0.981CysGln: 0.981 ± 0.249
1.471CysArg: 1.471 ± 0.816
0.0CysSer: 0.0 ± 0.0
0.49CysThr: 0.49 ± 0.521
0.49CysVal: 0.49 ± 0.272
0.49CysTrp: 0.49 ± 0.272
0.49CysTyr: 0.49 ± 0.521
0.0CysXaa: 0.0 ± 0.0
Asp
6.376AspAla: 6.376 ± 1.157
0.0AspCys: 0.0 ± 0.0
3.433AspAsp: 3.433 ± 1.111
3.923AspGlu: 3.923 ± 0.59
0.49AspPhe: 0.49 ± 0.272
5.395AspGly: 5.395 ± 0.179
0.49AspHis: 0.49 ± 0.272
1.471AspIle: 1.471 ± 0.023
4.904AspLys: 4.904 ± 1.134
1.962AspLeu: 1.962 ± 0.295
0.981AspMet: 0.981 ± 1.042
1.962AspAsn: 1.962 ± 1.088
0.981AspPro: 0.981 ± 0.544
2.452AspGln: 2.452 ± 0.567
2.452AspArg: 2.452 ± 0.226
2.943AspSer: 2.943 ± 0.839
3.923AspThr: 3.923 ± 1.383
4.414AspVal: 4.414 ± 0.069
1.471AspTrp: 1.471 ± 0.023
2.452AspTyr: 2.452 ± 0.226
0.0AspXaa: 0.0 ± 0.0
Glu
6.376GluAla: 6.376 ± 0.365
0.49GluCys: 0.49 ± 0.521
3.433GluAsp: 3.433 ± 1.111
3.433GluGlu: 3.433 ± 2.06
0.981GluPhe: 0.981 ± 1.042
2.943GluGly: 2.943 ± 0.747
0.49GluHis: 0.49 ± 0.272
4.904GluIle: 4.904 ± 0.341
2.452GluLys: 2.452 ± 0.567
3.433GluLeu: 3.433 ± 0.318
1.471GluMet: 1.471 ± 1.563
2.452GluAsn: 2.452 ± 0.567
1.471GluPro: 1.471 ± 0.023
3.433GluGln: 3.433 ± 0.475
2.452GluArg: 2.452 ± 0.226
1.962GluSer: 1.962 ± 0.295
4.904GluThr: 4.904 ± 0.451
1.962GluVal: 1.962 ± 0.295
2.452GluTrp: 2.452 ± 1.019
3.923GluTyr: 3.923 ± 0.203
0.0GluXaa: 0.0 ± 0.0
Phe
2.943PheAla: 2.943 ± 1.539
0.0PheCys: 0.0 ± 0.0
0.981PheAsp: 0.981 ± 0.249
0.49PheGlu: 0.49 ± 0.272
0.981PhePhe: 0.981 ± 0.249
1.471PheGly: 1.471 ± 0.77
1.962PheHis: 1.962 ± 0.498
0.49PheIle: 0.49 ± 0.272
0.49PheLys: 0.49 ± 0.521
1.962PheLeu: 1.962 ± 0.498
0.0PheMet: 0.0 ± 0.0
1.471PheAsn: 1.471 ± 0.023
0.981PhePro: 0.981 ± 0.544
0.0PheGln: 0.0 ± 0.0
1.962PheArg: 1.962 ± 2.083
2.452PheSer: 2.452 ± 0.567
1.962PheThr: 1.962 ± 0.498
1.471PheVal: 1.471 ± 0.816
0.0PheTrp: 0.0 ± 0.0
1.471PheTyr: 1.471 ± 0.023
0.0PheXaa: 0.0 ± 0.0
Gly
5.395GlyAla: 5.395 ± 0.179
0.49GlyCys: 0.49 ± 0.272
2.452GlyAsp: 2.452 ± 0.567
0.0GlyGlu: 0.0 ± 0.0
3.923GlyPhe: 3.923 ± 0.203
5.885GlyGly: 5.885 ± 0.7
1.471GlyHis: 1.471 ± 0.023
6.376GlyIle: 6.376 ± 0.428
2.943GlyLys: 2.943 ± 0.046
10.79GlyLeu: 10.79 ± 2.02
1.471GlyMet: 1.471 ± 0.023
2.452GlyAsn: 2.452 ± 0.567
1.962GlyPro: 1.962 ± 1.291
1.471GlyGln: 1.471 ± 0.77
2.452GlyArg: 2.452 ± 1.019
5.395GlySer: 5.395 ± 0.613
3.433GlyThr: 3.433 ± 0.318
5.395GlyVal: 5.395 ± 0.179
3.923GlyTrp: 3.923 ± 0.203
3.433GlyTyr: 3.433 ± 0.475
0.0GlyXaa: 0.0 ± 0.0
His
1.962HisAla: 1.962 ± 1.088
1.471HisCys: 1.471 ± 0.023
0.0HisAsp: 0.0 ± 0.0
0.981HisGlu: 0.981 ± 0.544
1.471HisPhe: 1.471 ± 0.77
0.981HisGly: 0.981 ± 0.544
1.471HisHis: 1.471 ± 0.77
1.962HisIle: 1.962 ± 1.291
1.471HisLys: 1.471 ± 0.023
3.433HisLeu: 3.433 ± 1.111
0.0HisMet: 0.0 ± 0.0
0.981HisAsn: 0.981 ± 0.544
0.49HisPro: 0.49 ± 0.521
1.471HisGln: 1.471 ± 0.77
1.471HisArg: 1.471 ± 0.023
0.981HisSer: 0.981 ± 0.249
1.471HisThr: 1.471 ± 0.023
0.981HisVal: 0.981 ± 0.249
0.981HisTrp: 0.981 ± 0.249
0.49HisTyr: 0.49 ± 0.521
0.0HisXaa: 0.0 ± 0.0
Ile
4.414IleAla: 4.414 ± 1.516
0.981IleCys: 0.981 ± 0.249
4.414IleAsp: 4.414 ± 0.069
3.433IleGlu: 3.433 ± 1.904
0.981IlePhe: 0.981 ± 0.249
3.923IleGly: 3.923 ± 0.59
2.452IleHis: 2.452 ± 0.567
0.981IleIle: 0.981 ± 0.249
3.923IleLys: 3.923 ± 0.59
3.923IleLeu: 3.923 ± 0.203
2.452IleMet: 2.452 ± 1.019
3.433IleAsn: 3.433 ± 0.318
5.395IlePro: 5.395 ± 1.406
0.981IleGln: 0.981 ± 0.249
4.414IleArg: 4.414 ± 3.102
2.452IleSer: 2.452 ± 0.567
4.414IleThr: 4.414 ± 1.516
3.923IleVal: 3.923 ± 0.203
0.49IleTrp: 0.49 ± 0.272
1.471IleTyr: 1.471 ± 0.77
0.0IleXaa: 0.0 ± 0.0
Lys
3.923LysAla: 3.923 ± 0.59
0.49LysCys: 0.49 ± 0.272
1.962LysAsp: 1.962 ± 0.498
2.943LysGlu: 2.943 ± 0.747
0.981LysPhe: 0.981 ± 0.544
3.923LysGly: 3.923 ± 0.203
1.962LysHis: 1.962 ± 0.295
4.414LysIle: 4.414 ± 1.516
3.923LysLys: 3.923 ± 0.203
4.414LysLeu: 4.414 ± 0.723
1.962LysMet: 1.962 ± 0.295
1.471LysAsn: 1.471 ± 0.77
1.471LysPro: 1.471 ± 0.77
0.981LysGln: 0.981 ± 1.042
1.471LysArg: 1.471 ± 0.816
4.414LysSer: 4.414 ± 0.723
2.452LysThr: 2.452 ± 0.226
4.414LysVal: 4.414 ± 0.862
0.49LysTrp: 0.49 ± 0.272
2.943LysTyr: 2.943 ± 0.747
0.0LysXaa: 0.0 ± 0.0
Leu
6.376LeuAla: 6.376 ± 1.157
1.962LeuCys: 1.962 ± 0.498
6.376LeuAsp: 6.376 ± 1.157
4.904LeuGlu: 4.904 ± 1.244
0.49LeuPhe: 0.49 ± 0.521
8.828LeuGly: 8.828 ± 1.447
0.981LeuHis: 0.981 ± 0.249
4.414LeuIle: 4.414 ± 0.069
5.885LeuLys: 5.885 ± 0.093
5.885LeuLeu: 5.885 ± 0.093
2.943LeuMet: 2.943 ± 0.747
4.414LeuAsn: 4.414 ± 1.655
4.904LeuPro: 4.904 ± 1.134
2.943LeuGln: 2.943 ± 1.632
7.847LeuArg: 7.847 ± 1.18
11.28LeuSer: 11.28 ± 2.465
1.962LeuThr: 1.962 ± 0.295
2.452LeuVal: 2.452 ± 0.567
1.962LeuTrp: 1.962 ± 1.291
2.943LeuTyr: 2.943 ± 0.046
0.0LeuXaa: 0.0 ± 0.0
Met
2.943MetAla: 2.943 ± 1.632
0.0MetCys: 0.0 ± 0.0
1.962MetAsp: 1.962 ± 1.291
1.471MetGlu: 1.471 ± 0.023
0.49MetPhe: 0.49 ± 0.521
1.962MetGly: 1.962 ± 0.295
0.49MetHis: 0.49 ± 0.521
2.452MetIle: 2.452 ± 0.567
1.471MetLys: 1.471 ± 0.77
1.962MetLeu: 1.962 ± 0.498
0.49MetMet: 0.49 ± 0.521
0.49MetAsn: 0.49 ± 0.272
1.471MetPro: 1.471 ± 1.563
0.0MetGln: 0.0 ± 0.0
2.452MetArg: 2.452 ± 0.226
1.471MetSer: 1.471 ± 0.77
1.962MetThr: 1.962 ± 1.291
1.471MetVal: 1.471 ± 0.77
0.49MetTrp: 0.49 ± 0.521
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.452AsnAla: 2.452 ± 0.226
0.981AsnCys: 0.981 ± 0.544
1.962AsnAsp: 1.962 ± 1.088
3.923AsnGlu: 3.923 ± 1.383
1.962AsnPhe: 1.962 ± 0.498
1.471AsnGly: 1.471 ± 0.023
0.49AsnHis: 0.49 ± 0.272
3.923AsnIle: 3.923 ± 0.203
0.981AsnLys: 0.981 ± 0.544
5.885AsnLeu: 5.885 ± 0.885
0.981AsnMet: 0.981 ± 0.439
2.943AsnAsn: 2.943 ± 1.632
1.962AsnPro: 1.962 ± 0.295
1.471AsnGln: 1.471 ± 0.77
3.433AsnArg: 3.433 ± 0.318
2.943AsnSer: 2.943 ± 1.632
3.433AsnThr: 3.433 ± 1.111
1.962AsnVal: 1.962 ± 0.295
0.49AsnTrp: 0.49 ± 0.272
2.452AsnTyr: 2.452 ± 1.36
0.0AsnXaa: 0.0 ± 0.0
Pro
3.433ProAla: 3.433 ± 0.318
0.0ProCys: 0.0 ± 0.0
2.452ProAsp: 2.452 ± 0.567
2.943ProGlu: 2.943 ± 0.046
0.49ProPhe: 0.49 ± 0.272
1.471ProGly: 1.471 ± 0.023
1.962ProHis: 1.962 ± 0.498
2.943ProIle: 2.943 ± 1.539
2.943ProLys: 2.943 ± 0.839
2.943ProLeu: 2.943 ± 0.046
0.981ProMet: 0.981 ± 1.042
0.981ProAsn: 0.981 ± 0.249
2.452ProPro: 2.452 ± 0.567
1.962ProGln: 1.962 ± 0.295
2.452ProArg: 2.452 ± 0.226
4.414ProSer: 4.414 ± 0.069
4.904ProThr: 4.904 ± 1.927
2.452ProVal: 2.452 ± 0.567
0.49ProTrp: 0.49 ± 0.521
0.981ProTyr: 0.981 ± 1.042
0.0ProXaa: 0.0 ± 0.0
Gln
1.962GlnAla: 1.962 ± 0.295
0.981GlnCys: 0.981 ± 0.544
1.962GlnAsp: 1.962 ± 1.088
1.471GlnGlu: 1.471 ± 1.563
0.0GlnPhe: 0.0 ± 0.0
1.962GlnGly: 1.962 ± 0.295
0.49GlnHis: 0.49 ± 0.272
1.962GlnIle: 1.962 ± 1.291
0.981GlnLys: 0.981 ± 0.249
2.452GlnLeu: 2.452 ± 0.226
0.0GlnMet: 0.0 ± 0.0
0.981GlnAsn: 0.981 ± 0.544
1.471GlnPro: 1.471 ± 0.023
0.981GlnGln: 0.981 ± 0.249
1.471GlnArg: 1.471 ± 0.023
1.962GlnSer: 1.962 ± 0.498
3.433GlnThr: 3.433 ± 1.111
2.943GlnVal: 2.943 ± 0.046
1.962GlnTrp: 1.962 ± 0.498
1.471GlnTyr: 1.471 ± 0.023
0.0GlnXaa: 0.0 ± 0.0
Arg
4.414ArgAla: 4.414 ± 0.069
0.0ArgCys: 0.0 ± 0.0
1.471ArgAsp: 1.471 ± 0.77
4.414ArgGlu: 4.414 ± 0.069
2.943ArgPhe: 2.943 ± 0.839
5.395ArgGly: 5.395 ± 1.406
0.0ArgHis: 0.0 ± 0.0
3.923ArgIle: 3.923 ± 0.203
3.923ArgLys: 3.923 ± 0.203
4.414ArgLeu: 4.414 ± 2.309
1.962ArgMet: 1.962 ± 0.295
2.943ArgAsn: 2.943 ± 0.839
2.943ArgPro: 2.943 ± 1.539
1.962ArgGln: 1.962 ± 0.295
3.923ArgArg: 3.923 ± 0.995
4.414ArgSer: 4.414 ± 0.723
3.923ArgThr: 3.923 ± 0.995
3.433ArgVal: 3.433 ± 0.475
1.962ArgTrp: 1.962 ± 1.291
2.452ArgTyr: 2.452 ± 0.226
0.0ArgXaa: 0.0 ± 0.0
Ser
7.357SerAla: 7.357 ± 0.909
0.0SerCys: 0.0 ± 0.0
3.923SerAsp: 3.923 ± 0.203
5.395SerGlu: 5.395 ± 0.972
1.962SerPhe: 1.962 ± 0.498
4.414SerGly: 4.414 ± 0.069
1.471SerHis: 1.471 ± 0.023
4.414SerIle: 4.414 ± 0.069
2.943SerLys: 2.943 ± 1.539
8.828SerLeu: 8.828 ± 1.724
2.452SerMet: 2.452 ± 0.567
4.904SerAsn: 4.904 ± 0.341
0.981SerPro: 0.981 ± 0.249
3.433SerGln: 3.433 ± 0.318
4.414SerArg: 4.414 ± 0.862
5.885SerSer: 5.885 ± 2.471
4.904SerThr: 4.904 ± 0.341
4.904SerVal: 4.904 ± 2.72
1.962SerTrp: 1.962 ± 1.291
2.943SerTyr: 2.943 ± 0.046
0.0SerXaa: 0.0 ± 0.0
Thr
6.376ThrAla: 6.376 ± 0.365
2.452ThrCys: 2.452 ± 0.226
3.923ThrAsp: 3.923 ± 2.176
5.885ThrGlu: 5.885 ± 1.678
1.471ThrPhe: 1.471 ± 1.563
5.395ThrGly: 5.395 ± 1.765
0.981ThrHis: 0.981 ± 0.249
2.452ThrIle: 2.452 ± 0.567
2.452ThrLys: 2.452 ± 0.226
9.318ThrLeu: 9.318 ± 1.204
1.471ThrMet: 1.471 ± 1.041
2.452ThrAsn: 2.452 ± 1.36
2.943ThrPro: 2.943 ± 0.046
0.0ThrGln: 0.0 ± 0.0
4.414ThrArg: 4.414 ± 0.723
5.395ThrSer: 5.395 ± 0.613
6.866ThrThr: 6.866 ± 1.429
2.943ThrVal: 2.943 ± 0.747
1.471ThrTrp: 1.471 ± 0.77
1.471ThrTyr: 1.471 ± 0.77
0.0ThrXaa: 0.0 ± 0.0
Val
4.904ValAla: 4.904 ± 0.451
1.962ValCys: 1.962 ± 1.088
5.395ValAsp: 5.395 ± 0.613
3.433ValGlu: 3.433 ± 2.06
0.981ValPhe: 0.981 ± 0.544
1.962ValGly: 1.962 ± 1.088
1.962ValHis: 1.962 ± 0.295
2.452ValIle: 2.452 ± 0.567
1.962ValLys: 1.962 ± 1.291
3.433ValLeu: 3.433 ± 0.318
0.981ValMet: 0.981 ± 0.544
2.452ValAsn: 2.452 ± 1.36
3.923ValPro: 3.923 ± 1.383
0.981ValGln: 0.981 ± 0.249
2.452ValArg: 2.452 ± 1.811
4.904ValSer: 4.904 ± 1.134
4.904ValThr: 4.904 ± 1.134
4.904ValVal: 4.904 ± 1.927
1.962ValTrp: 1.962 ± 0.295
2.452ValTyr: 2.452 ± 0.567
0.0ValXaa: 0.0 ± 0.0
Trp
2.452TrpAla: 2.452 ± 1.36
0.49TrpCys: 0.49 ± 0.521
1.471TrpAsp: 1.471 ± 0.77
0.981TrpGlu: 0.981 ± 1.042
0.0TrpPhe: 0.0 ± 0.0
2.452TrpGly: 2.452 ± 0.567
0.981TrpHis: 0.981 ± 1.042
0.49TrpIle: 0.49 ± 0.521
1.471TrpLys: 1.471 ± 0.77
0.981TrpLeu: 0.981 ± 1.042
0.49TrpMet: 0.49 ± 0.521
2.452TrpAsn: 2.452 ± 1.019
3.923TrpPro: 3.923 ± 0.995
0.981TrpGln: 0.981 ± 0.544
1.471TrpArg: 1.471 ± 0.023
0.981TrpSer: 0.981 ± 0.544
1.962TrpThr: 1.962 ± 1.291
0.981TrpVal: 0.981 ± 0.249
0.0TrpTrp: 0.0 ± 0.0
0.49TrpTyr: 0.49 ± 0.272
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.962TyrAla: 1.962 ± 0.498
0.0TyrCys: 0.0 ± 0.0
1.471TyrAsp: 1.471 ± 0.816
2.943TyrGlu: 2.943 ± 0.747
0.0TyrPhe: 0.0 ± 0.0
2.452TyrGly: 2.452 ± 0.226
0.981TyrHis: 0.981 ± 0.249
3.433TyrIle: 3.433 ± 1.267
1.471TyrLys: 1.471 ± 0.77
5.395TyrLeu: 5.395 ± 2.558
0.981TyrMet: 0.981 ± 0.544
0.981TyrAsn: 0.981 ± 0.249
0.0TyrPro: 0.0 ± 0.0
1.962TyrGln: 1.962 ± 0.295
2.452TyrArg: 2.452 ± 0.567
3.923TyrSer: 3.923 ± 0.203
1.471TyrThr: 1.471 ± 0.816
2.452TyrVal: 2.452 ± 0.567
1.962TyrTrp: 1.962 ± 0.498
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2040 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski