Amino acid dipeptide frequency for Hubei toti-like virus 17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
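The calculation described above can be sketched in Python. This is a minimal illustration, not the code used to build this page: the function names, the one-letter alphabet constant, and the choice to count overlapping pairs within each protein and normalise by the total number of pairs are all assumptions, and the database may normalise differently.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; "X" stands for Xaa (assumption).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = AMINO_ACIDS + "X"

# Uniform expectation over the 400 standard dipeptides: 2.5 per mille (0.25%).
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X.
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Label a dipeptide relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_per_mille(["AAAG", "GA"])
print(freqs["AA"], classify(freqs["AA"]))
```

The sketch compares against the flat 1/400 expectation mentioned in the text; it does not correct for the unequal abundance of the individual amino acids.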

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.277 ± 3.161
AlaCys: 3.186 ± 1.985
AlaAsp: 5.311 ± 0.385
AlaGlu: 5.311 ± 0.754
AlaPhe: 4.249 ± 0.8
AlaGly: 10.09 ± 2.063
AlaHis: 1.593 ± 0.585
AlaIle: 4.249 ± 1.514
AlaLys: 1.593 ± 0.585
AlaLeu: 14.87 ± 1.829
AlaMet: 2.124 ± 0.529
AlaAsn: 3.717 ± 1.739
AlaPro: 6.373 ± 0.919
AlaGln: 2.124 ± 1.345
AlaArg: 6.904 ± 2.02
AlaSer: 7.966 ± 1.672
AlaThr: 6.904 ± 3.127
AlaVal: 7.966 ± 0.164
AlaTrp: 2.655 ± 0.295
AlaTyr: 4.249 ± 1.514
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.593 ± 0.456
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.062 ± 0.751
CysPhe: 0.531 ± 0.397
CysGly: 2.124 ± 2.221
CysHis: 0.0 ± 0.0
CysIle: 1.593 ± 1.44
CysLys: 0.0 ± 0.0
CysLeu: 2.655 ± 0.869
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 1.593 ± 1.44
CysGln: 0.531 ± 0.397
CysArg: 0.531 ± 0.397
CysSer: 1.062 ± 1.54
CysThr: 0.531 ± 0.336
CysVal: 0.531 ± 0.77
CysTrp: 0.0 ± 0.0
CysTyr: 0.531 ± 0.336
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.842 ± 1.342
AspCys: 0.0 ± 0.0
AspAsp: 4.249 ± 0.163
AspGlu: 3.717 ± 1.045
AspPhe: 3.717 ± 0.208
AspGly: 2.124 ± 1.287
AspHis: 1.062 ± 0.264
AspIle: 1.062 ± 0.708
AspLys: 0.531 ± 0.336
AspLeu: 3.717 ± 1.045
AspMet: 0.0 ± 0.0
AspAsn: 0.531 ± 0.77
AspPro: 4.249 ± 1.387
AspGln: 1.593 ± 1.191
AspArg: 4.78 ± 1.13
AspSer: 1.593 ± 0.585
AspThr: 3.186 ± 0.913
AspVal: 4.249 ± 1.929
AspTrp: 3.186 ± 1.17
AspTyr: 0.531 ± 0.397
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.435 ± 0.901
GluCys: 1.062 ± 0.751
GluAsp: 2.124 ± 1.204
GluGlu: 4.249 ± 1.191
GluPhe: 2.124 ± 0.964
GluGly: 6.904 ± 2.848
GluHis: 0.531 ± 0.397
GluIle: 1.062 ± 0.673
GluLys: 0.0 ± 0.0
GluLeu: 2.655 ± 1.538
GluMet: 1.593 ± 0.456
GluAsn: 0.531 ± 0.77
GluPro: 1.593 ± 1.191
GluGln: 2.655 ± 0.295
GluArg: 4.78 ± 1.155
GluSer: 1.593 ± 0.797
GluThr: 0.0 ± 0.0
GluVal: 4.78 ± 0.486
GluTrp: 2.655 ± 1.354
GluTyr: 1.062 ± 0.264
GluXaa: 0.531 ± 0.77
Phe
PheAla: 3.717 ± 0.905
PheCys: 0.0 ± 0.0
PheAsp: 0.531 ± 0.397
PheGlu: 1.062 ± 0.264
PhePhe: 0.531 ± 0.397
PheGly: 5.842 ± 0.81
PheHis: 0.531 ± 0.336
PheIle: 1.593 ± 0.585
PheLys: 1.062 ± 0.264
PheLeu: 3.186 ± 0.793
PheMet: 1.062 ± 0.436
PheAsn: 1.062 ± 0.264
PhePro: 2.124 ± 0.45
PheGln: 1.593 ± 0.585
PheArg: 3.186 ± 1.747
PheSer: 2.655 ± 1.176
PheThr: 2.124 ± 0.757
PheVal: 3.186 ± 0.793
PheTrp: 1.593 ± 0.585
PheTyr: 0.531 ± 0.397
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.621 ± 1.64
GlyCys: 1.062 ± 0.751
GlyAsp: 8.497 ± 2.515
GlyGlu: 3.717 ± 1.871
GlyPhe: 2.655 ± 1.354
GlyGly: 15.932 ± 2.185
GlyHis: 2.124 ± 1.287
GlyIle: 5.311 ± 0.82
GlyLys: 3.186 ± 1.213
GlyLeu: 5.842 ± 1.562
GlyMet: 2.655 ± 0.605
GlyAsn: 2.655 ± 2.135
GlyPro: 3.717 ± 0.479
GlyGln: 4.249 ± 1.994
GlyArg: 5.311 ± 1.101
GlySer: 5.311 ± 1.322
GlyThr: 6.904 ± 0.136
GlyVal: 9.559 ± 1.82
GlyTrp: 4.249 ± 0.901
GlyTyr: 3.717 ± 0.479
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.062 ± 0.794
HisCys: 0.0 ± 0.0
HisAsp: 1.062 ± 0.708
HisGlu: 1.593 ± 0.52
HisPhe: 1.062 ± 0.264
HisGly: 1.593 ± 0.585
HisHis: 0.0 ± 0.0
HisIle: 1.593 ± 1.44
HisLys: 0.0 ± 0.0
HisLeu: 0.531 ± 0.77
HisMet: 0.0 ± 0.0
HisAsn: 0.531 ± 0.336
HisPro: 0.531 ± 0.336
HisGln: 0.531 ± 0.397
HisArg: 0.0 ± 0.0
HisSer: 0.531 ± 0.77
HisThr: 1.593 ± 0.456
HisVal: 1.062 ± 0.794
HisTrp: 0.531 ± 0.77
HisTyr: 0.531 ± 0.397
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.78 ± 1.917
IleCys: 0.531 ± 0.77
IleAsp: 2.124 ± 0.757
IleGlu: 0.531 ± 0.336
IlePhe: 2.124 ± 1.415
IleGly: 4.249 ± 1.387
IleHis: 0.0 ± 0.0
IleIle: 0.531 ± 0.336
IleLys: 1.593 ± 0.585
IleLeu: 3.186 ± 0.913
IleMet: 0.0 ± 0.0
IleAsn: 2.124 ± 0.964
IlePro: 2.655 ± 0.666
IleGln: 1.593 ± 1.191
IleArg: 4.249 ± 0.598
IleSer: 3.186 ± 1.985
IleThr: 1.062 ± 0.264
IleVal: 2.124 ± 0.757
IleTrp: 0.0 ± 0.0
IleTyr: 1.062 ± 0.673
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.249 ± 1.191
LysCys: 0.0 ± 0.0
LysAsp: 0.531 ± 0.77
LysGlu: 1.062 ± 0.794
LysPhe: 1.062 ± 0.794
LysGly: 1.593 ± 0.585
LysHis: 0.531 ± 0.77
LysIle: 0.0 ± 0.0
LysLys: 1.062 ± 0.794
LysLeu: 1.062 ± 0.264
LysMet: 0.531 ± 0.397
LysAsn: 0.0 ± 0.0
LysPro: 0.531 ± 0.397
LysGln: 0.0 ± 0.0
LysArg: 2.124 ± 0.529
LysSer: 1.062 ± 0.673
LysThr: 1.062 ± 0.794
LysVal: 3.717 ± 2.141
LysTrp: 0.0 ± 0.0
LysTyr: 1.062 ± 0.751
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.683 ± 1.068
LeuCys: 2.655 ± 1.247
LeuAsp: 3.717 ± 2.141
LeuGlu: 3.717 ± 0.479
LeuPhe: 3.186 ± 0.793
LeuGly: 7.435 ± 1.851
LeuHis: 0.0 ± 0.0
LeuIle: 2.655 ± 1.247
LeuLys: 2.655 ± 0.816
LeuLeu: 2.655 ± 1.079
LeuMet: 1.593 ± 0.456
LeuAsn: 2.124 ± 0.757
LeuPro: 7.966 ± 1.047
LeuGln: 2.124 ± 0.529
LeuArg: 7.435 ± 0.902
LeuSer: 7.435 ± 1.075
LeuThr: 3.186 ± 1.407
LeuVal: 5.842 ± 1.403
LeuTrp: 1.062 ± 0.264
LeuTyr: 3.186 ± 0.212
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.531 ± 0.336
MetCys: 0.0 ± 0.0
MetAsp: 1.062 ± 0.264
MetGlu: 0.531 ± 0.336
MetPhe: 1.062 ± 0.264
MetGly: 3.186 ± 1.407
MetHis: 0.531 ± 0.336
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 3.717 ± 0.905
MetMet: 1.062 ± 0.673
MetAsn: 0.531 ± 0.77
MetPro: 0.531 ± 0.336
MetGln: 0.0 ± 0.0
MetArg: 2.124 ± 1.345
MetSer: 1.062 ± 0.673
MetThr: 1.593 ± 0.456
MetVal: 0.0 ± 0.0
MetTrp: 1.062 ± 0.794
MetTyr: 0.531 ± 0.397
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.186 ± 1.593
AsnCys: 0.531 ± 0.77
AsnAsp: 0.0 ± 0.0
AsnGlu: 0.0 ± 0.0
AsnPhe: 0.0 ± 0.0
AsnGly: 1.593 ± 0.456
AsnHis: 1.062 ± 0.794
AsnIle: 0.531 ± 0.336
AsnLys: 0.0 ± 0.0
AsnLeu: 1.593 ± 0.456
AsnMet: 0.0 ± 0.0
AsnAsn: 0.531 ± 0.336
AsnPro: 4.78 ± 1.008
AsnGln: 1.062 ± 0.264
AsnArg: 2.655 ± 1.247
AsnSer: 2.124 ± 0.757
AsnThr: 0.0 ± 0.0
AsnVal: 1.593 ± 0.797
AsnTrp: 1.593 ± 0.585
AsnTyr: 1.062 ± 0.673
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 12.215 ± 1.85
ProCys: 0.531 ± 0.336
ProAsp: 3.186 ± 1.407
ProGlu: 3.717 ± 0.208
ProPhe: 1.593 ± 1.009
ProGly: 8.497 ± 0.928
ProHis: 1.062 ± 0.264
ProIle: 3.186 ± 1.17
ProLys: 0.531 ± 0.397
ProLeu: 4.78 ± 1.155
ProMet: 0.531 ± 0.336
ProAsn: 1.593 ± 0.456
ProPro: 4.249 ± 1.058
ProGln: 2.124 ± 1.501
ProArg: 5.311 ± 0.652
ProSer: 4.249 ± 0.598
ProThr: 1.062 ± 0.264
ProVal: 2.655 ± 1.682
ProTrp: 1.062 ± 0.264
ProTyr: 2.124 ± 0.757
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.717 ± 0.913
GlnCys: 0.0 ± 0.0
GlnAsp: 0.531 ± 0.397
GlnGlu: 2.124 ± 0.529
GlnPhe: 1.062 ± 0.794
GlnGly: 4.249 ± 1.395
GlnHis: 1.593 ± 1.44
GlnIle: 1.593 ± 0.456
GlnLys: 0.531 ± 0.397
GlnLeu: 2.124 ± 0.45
GlnMet: 0.531 ± 0.336
GlnAsn: 1.062 ± 0.673
GlnPro: 2.655 ± 0.816
GlnGln: 0.531 ± 0.397
GlnArg: 2.124 ± 0.45
GlnSer: 1.593 ± 0.52
GlnThr: 0.0 ± 0.0
GlnVal: 3.186 ± 1.17
GlnTrp: 2.124 ± 0.997
GlnTyr: 0.531 ± 0.336
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.09 ± 0.289
ArgCys: 3.186 ± 1.937
ArgAsp: 5.311 ± 1.39
ArgGlu: 4.78 ± 0.993
ArgPhe: 3.717 ± 0.905
ArgGly: 7.435 ± 1.103
ArgHis: 0.531 ± 0.77
ArgIle: 3.717 ± 0.905
ArgLys: 1.593 ± 1.468
ArgLeu: 5.842 ± 1.62
ArgMet: 2.124 ± 0.529
ArgAsn: 2.655 ± 1.247
ArgPro: 2.124 ± 1.415
ArgGln: 3.186 ± 0.864
ArgArg: 6.904 ± 3.913
ArgSer: 4.249 ± 0.901
ArgThr: 4.78 ± 1.369
ArgVal: 4.78 ± 1.796
ArgTrp: 4.78 ± 1.755
ArgTyr: 2.655 ± 0.816
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.904 ± 2.11
SerCys: 0.531 ± 0.336
SerAsp: 3.186 ± 1.039
SerGlu: 4.249 ± 0.802
SerPhe: 1.593 ± 1.191
SerGly: 6.373 ± 2.089
SerHis: 0.531 ± 0.397
SerIle: 1.062 ± 0.708
SerLys: 2.124 ± 1.204
SerLeu: 7.435 ± 0.901
SerMet: 2.124 ± 0.864
SerAsn: 0.0 ± 0.0
SerPro: 2.124 ± 0.757
SerGln: 2.655 ± 1.257
SerArg: 7.435 ± 1.825
SerSer: 6.904 ± 1.34
SerThr: 3.186 ± 0.212
SerVal: 3.186 ± 0.864
SerTrp: 2.655 ± 1.176
SerTyr: 0.531 ± 0.397
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.78 ± 0.878
ThrCys: 0.531 ± 0.336
ThrAsp: 1.593 ± 0.585
ThrGlu: 0.0 ± 0.0
ThrPhe: 1.062 ± 0.673
ThrGly: 5.842 ± 1.562
ThrHis: 0.531 ± 0.397
ThrIle: 1.062 ± 0.264
ThrLys: 1.593 ± 0.52
ThrLeu: 4.78 ± 1.833
ThrMet: 0.531 ± 0.336
ThrAsn: 1.593 ± 0.456
ThrPro: 5.842 ± 1.959
ThrGln: 1.062 ± 0.264
ThrArg: 4.78 ± 1.008
ThrSer: 2.124 ± 0.757
ThrThr: 2.124 ± 1.345
ThrVal: 5.311 ± 1.332
ThrTrp: 2.124 ± 0.964
ThrTyr: 0.0 ± 0.0
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.249 ± 1.111
ValCys: 0.531 ± 0.397
ValAsp: 3.717 ± 1.232
ValGlu: 5.842 ± 2.452
ValPhe: 2.655 ± 1.682
ValGly: 7.435 ± 2.2
ValHis: 1.062 ± 0.708
ValIle: 3.717 ± 1.806
ValLys: 1.062 ± 0.794
ValLeu: 4.78 ± 2.317
ValMet: 1.593 ± 1.009
ValAsn: 1.593 ± 0.585
ValPro: 6.904 ± 1.668
ValGln: 2.655 ± 1.538
ValArg: 8.497 ± 1.872
ValSer: 6.373 ± 1.566
ValThr: 2.655 ± 1.354
ValVal: 6.373 ± 0.959
ValTrp: 1.062 ± 0.264
ValTyr: 1.062 ± 0.673
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.593 ± 0.585
TrpCys: 0.0 ± 0.0
TrpAsp: 2.124 ± 0.45
TrpGlu: 2.124 ± 0.529
TrpPhe: 3.186 ± 0.535
TrpGly: 2.655 ± 1.079
TrpHis: 0.0 ± 0.0
TrpIle: 1.593 ± 0.585
TrpLys: 1.593 ± 0.585
TrpLeu: 5.311 ± 0.385
TrpMet: 0.531 ± 0.336
TrpAsn: 0.531 ± 0.336
TrpPro: 1.593 ± 0.456
TrpGln: 0.0 ± 0.0
TrpArg: 3.717 ± 1.582
TrpSer: 1.062 ± 0.794
TrpThr: 2.655 ± 0.869
TrpVal: 2.124 ± 1.204
TrpTrp: 1.062 ± 0.673
TrpTyr: 1.062 ± 0.708
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.186 ± 0.212
TyrCys: 0.531 ± 0.397
TyrAsp: 1.062 ± 0.264
TyrGlu: 0.531 ± 0.336
TyrPhe: 0.531 ± 0.397
TyrGly: 2.124 ± 0.757
TyrHis: 1.062 ± 0.264
TyrIle: 1.593 ± 0.456
TyrLys: 0.531 ± 0.397
TyrLeu: 1.593 ± 0.456
TyrMet: 0.0 ± 0.0
TyrAsn: 0.531 ± 0.336
TyrPro: 2.124 ± 0.529
TyrGln: 1.593 ± 0.456
TyrArg: 1.593 ± 1.44
TyrSer: 2.124 ± 0.997
TyrThr: 2.655 ± 0.666
TyrVal: 1.593 ± 0.585
TyrTrp: 1.062 ± 0.673
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.531 ± 0.77
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1884 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
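Bootstrapping at the protein level can be sketched as follows. This is an illustration only: it assumes that whole proteins are resampled with replacement, that "x100" means 100 replicates, and that the reported ± value is the standard deviation across replicates. None of these details is confirmed by the page, and the function names are hypothetical.

```python
import random
import statistics


def frequency_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) over overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(frequency_per_mille(sample, dipeptide))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["AAAA", "GGGG", "AGAG"]
print(frequency_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only three proteins, as here, each replicate is drawn from a very small pool, so errors estimated this way are coarse.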

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski