Amino acid dipepetide frequency for Hubei toti-like virus 20

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.29AlaAla: 7.29 ± 0.908
3.431AlaCys: 3.431 ± 1.825
2.573AlaAsp: 2.573 ± 1.369
4.288AlaGlu: 4.288 ± 0.796
2.144AlaPhe: 2.144 ± 0.344
6.003AlaGly: 6.003 ± 0.966
3.002AlaHis: 3.002 ± 0.112
3.859AlaIle: 3.859 ± 0.568
4.288AlaLys: 4.288 ± 0.689
6.432AlaLeu: 6.432 ± 0.291
2.573AlaMet: 2.573 ± 1.601
4.288AlaAsn: 4.288 ± 0.796
4.717AlaPro: 4.717 ± 2.509
2.573AlaGln: 2.573 ± 0.859
5.146AlaArg: 5.146 ± 0.51
6.003AlaSer: 6.003 ± 0.224
6.861AlaThr: 6.861 ± 0.68
6.003AlaVal: 6.003 ± 0.224
1.715AlaTrp: 1.715 ± 0.572
3.002AlaTyr: 3.002 ± 0.631
0.0AlaXaa: 0.0 ± 0.0
Cys
1.715CysAla: 1.715 ± 0.572
0.0CysCys: 0.0 ± 0.0
0.429CysAsp: 0.429 ± 0.228
0.858CysGlu: 0.858 ± 0.286
0.429CysPhe: 0.429 ± 0.228
1.286CysGly: 1.286 ± 0.058
1.286CysHis: 1.286 ± 0.684
0.429CysIle: 0.429 ± 0.228
0.429CysLys: 0.429 ± 0.228
1.715CysLeu: 1.715 ± 0.912
0.429CysMet: 0.429 ± 0.514
0.858CysAsn: 0.858 ± 0.286
1.286CysPro: 1.286 ± 0.058
0.0CysGln: 0.0 ± 0.0
2.144CysArg: 2.144 ± 0.344
0.429CysSer: 0.429 ± 0.228
0.429CysThr: 0.429 ± 0.228
1.715CysVal: 1.715 ± 0.912
0.429CysTrp: 0.429 ± 0.228
1.286CysTyr: 1.286 ± 0.801
0.0CysXaa: 0.0 ± 0.0
Asp
7.29AspAla: 7.29 ± 0.166
0.858AspCys: 0.858 ± 0.286
3.431AspAsp: 3.431 ± 0.34
4.288AspGlu: 4.288 ± 1.539
0.429AspPhe: 0.429 ± 0.228
5.146AspGly: 5.146 ± 0.232
0.429AspHis: 0.429 ± 0.228
3.002AspIle: 3.002 ± 0.112
2.573AspLys: 2.573 ± 1.369
4.288AspLeu: 4.288 ± 1.431
0.858AspMet: 0.858 ± 0.286
0.858AspAsn: 0.858 ± 0.286
2.573AspPro: 2.573 ± 0.116
3.859AspGln: 3.859 ± 1.311
3.002AspArg: 3.002 ± 0.854
1.715AspSer: 1.715 ± 0.17
3.431AspThr: 3.431 ± 1.083
3.859AspVal: 3.859 ± 0.568
1.715AspTrp: 1.715 ± 0.17
2.573AspTyr: 2.573 ± 0.626
0.0AspXaa: 0.0 ± 0.0
Glu
5.575GluAla: 5.575 ± 0.738
0.858GluCys: 0.858 ± 0.286
2.573GluAsp: 2.573 ± 0.626
4.717GluGlu: 4.717 ± 1.945
1.286GluPhe: 1.286 ± 0.058
5.146GluGly: 5.146 ± 0.51
0.429GluHis: 0.429 ± 0.228
3.002GluIle: 3.002 ± 0.854
2.573GluLys: 2.573 ± 0.626
3.859GluLeu: 3.859 ± 0.174
0.858GluMet: 0.858 ± 1.029
2.144GluAsn: 2.144 ± 1.141
3.002GluPro: 3.002 ± 0.112
2.144GluGln: 2.144 ± 0.398
1.286GluArg: 1.286 ± 0.058
1.715GluSer: 1.715 ± 0.912
4.717GluThr: 4.717 ± 0.461
2.144GluVal: 2.144 ± 0.398
2.144GluTrp: 2.144 ± 1.829
3.002GluTyr: 3.002 ± 2.116
0.0GluXaa: 0.0 ± 0.0
Phe
1.715PheAla: 1.715 ± 1.315
0.858PheCys: 0.858 ± 0.286
0.858PheAsp: 0.858 ± 0.286
0.429PheGlu: 0.429 ± 0.228
1.286PhePhe: 1.286 ± 0.801
2.573PheGly: 2.573 ± 1.601
0.858PheHis: 0.858 ± 0.286
0.858PheIle: 0.858 ± 0.456
0.858PheLys: 0.858 ± 1.029
1.715PheLeu: 1.715 ± 0.17
0.0PheMet: 0.0 ± 0.0
1.286PheAsn: 1.286 ± 0.684
0.858PhePro: 0.858 ± 0.456
0.429PheGln: 0.429 ± 0.514
3.431PheArg: 3.431 ± 1.887
2.144PheSer: 2.144 ± 0.398
2.144PheThr: 2.144 ± 0.398
1.286PheVal: 1.286 ± 0.684
0.0PheTrp: 0.0 ± 0.0
1.286PheTyr: 1.286 ± 0.684
0.0PheXaa: 0.0 ± 0.0
Gly
4.288GlyAla: 4.288 ± 1.431
0.858GlyCys: 0.858 ± 0.456
3.002GlyAsp: 3.002 ± 0.112
2.144GlyGlu: 2.144 ± 0.344
3.002GlyPhe: 3.002 ± 0.631
5.575GlyGly: 5.575 ± 0.004
1.286GlyHis: 1.286 ± 0.058
8.576GlyIle: 8.576 ± 0.85
2.573GlyLys: 2.573 ± 0.859
6.432GlyLeu: 6.432 ± 0.452
2.144GlyMet: 2.144 ± 0.344
3.002GlyAsn: 3.002 ± 1.597
3.002GlyPro: 3.002 ± 0.112
2.144GlyGln: 2.144 ± 0.344
2.144GlyArg: 2.144 ± 0.398
4.288GlySer: 4.288 ± 0.054
4.717GlyThr: 4.717 ± 1.024
3.859GlyVal: 3.859 ± 0.917
2.144GlyTrp: 2.144 ± 0.398
3.859GlyTyr: 3.859 ± 0.174
0.0GlyXaa: 0.0 ± 0.0
His
1.286HisAla: 1.286 ± 0.058
1.286HisCys: 1.286 ± 0.058
0.429HisAsp: 0.429 ± 0.228
0.429HisGlu: 0.429 ± 0.228
0.429HisPhe: 0.429 ± 0.228
1.286HisGly: 1.286 ± 0.684
1.286HisHis: 1.286 ± 0.058
2.573HisIle: 2.573 ± 0.859
0.429HisLys: 0.429 ± 0.228
1.715HisLeu: 1.715 ± 0.572
0.0HisMet: 0.0 ± 0.0
0.858HisAsn: 0.858 ± 0.456
2.144HisPro: 2.144 ± 0.398
1.286HisGln: 1.286 ± 0.058
2.573HisArg: 2.573 ± 0.116
2.144HisSer: 2.144 ± 1.141
1.715HisThr: 1.715 ± 0.912
0.858HisVal: 0.858 ± 0.286
0.858HisTrp: 0.858 ± 0.286
0.858HisTyr: 0.858 ± 0.286
0.0HisXaa: 0.0 ± 0.0
Ile
3.002IleAla: 3.002 ± 0.854
1.286IleCys: 1.286 ± 0.801
4.288IleAsp: 4.288 ± 0.054
2.144IleGlu: 2.144 ± 1.141
0.429IlePhe: 0.429 ± 0.514
4.288IleGly: 4.288 ± 1.539
2.144IleHis: 2.144 ± 1.141
1.286IleIle: 1.286 ± 0.058
3.859IleLys: 3.859 ± 0.917
4.717IleLeu: 4.717 ± 1.767
1.715IleMet: 1.715 ± 1.315
1.715IleAsn: 1.715 ± 0.17
1.715IlePro: 1.715 ± 0.912
0.429IleGln: 0.429 ± 0.514
2.573IleArg: 2.573 ± 2.344
2.573IleSer: 2.573 ± 0.116
4.288IleThr: 4.288 ± 0.054
4.288IleVal: 4.288 ± 0.689
1.286IleTrp: 1.286 ± 0.058
1.715IleTyr: 1.715 ± 1.315
0.0IleXaa: 0.0 ± 0.0
Lys
4.717LysAla: 4.717 ± 1.024
0.858LysCys: 0.858 ± 0.286
2.144LysAsp: 2.144 ± 0.344
3.002LysGlu: 3.002 ± 0.854
1.286LysPhe: 1.286 ± 0.684
3.002LysGly: 3.002 ± 0.112
1.715LysHis: 1.715 ± 0.17
4.288LysIle: 4.288 ± 0.689
1.715LysLys: 1.715 ± 1.315
2.573LysLeu: 2.573 ± 1.601
0.429LysMet: 0.429 ± 0.514
1.286LysAsn: 1.286 ± 0.058
2.573LysPro: 2.573 ± 0.859
1.286LysGln: 1.286 ± 0.058
3.002LysArg: 3.002 ± 0.854
3.859LysSer: 3.859 ± 0.917
2.573LysThr: 2.573 ± 0.116
3.431LysVal: 3.431 ± 1.887
0.429LysTrp: 0.429 ± 0.228
3.002LysTyr: 3.002 ± 1.373
0.0LysXaa: 0.0 ± 0.0
Leu
7.719LeuAla: 7.719 ± 0.394
0.429LeuCys: 0.429 ± 0.514
6.432LeuAsp: 6.432 ± 1.937
5.146LeuGlu: 5.146 ± 0.975
1.286LeuPhe: 1.286 ± 0.801
6.861LeuGly: 6.861 ± 2.29
2.144LeuHis: 2.144 ± 0.398
1.286LeuIle: 1.286 ± 0.801
6.003LeuLys: 6.003 ± 0.224
8.148LeuLeu: 8.148 ± 1.605
1.715LeuMet: 1.715 ± 0.572
5.146LeuAsn: 5.146 ± 1.253
6.003LeuPro: 6.003 ± 0.224
3.002LeuGln: 3.002 ± 0.854
9.863LeuArg: 9.863 ± 0.05
7.719LeuSer: 7.719 ± 1.091
5.575LeuThr: 5.575 ± 1.489
2.573LeuVal: 2.573 ± 0.116
1.286LeuTrp: 1.286 ± 0.801
0.858LeuTyr: 0.858 ± 1.029
0.0LeuXaa: 0.0 ± 0.0
Met
2.144MetAla: 2.144 ± 1.141
0.429MetCys: 0.429 ± 0.228
1.715MetAsp: 1.715 ± 1.315
1.715MetGlu: 1.715 ± 0.572
0.858MetPhe: 0.858 ± 0.286
1.715MetGly: 1.715 ± 0.17
0.429MetHis: 0.429 ± 0.514
0.858MetIle: 0.858 ± 0.456
2.144MetLys: 2.144 ± 1.829
2.144MetLeu: 2.144 ± 1.087
0.0MetMet: 0.0 ± 0.0
0.858MetAsn: 0.858 ± 0.456
1.715MetPro: 1.715 ± 0.572
0.0MetGln: 0.0 ± 0.0
1.715MetArg: 1.715 ± 0.572
0.429MetSer: 0.429 ± 0.228
2.573MetThr: 2.573 ± 1.601
0.858MetVal: 0.858 ± 0.286
0.429MetTrp: 0.429 ± 0.514
0.858MetTyr: 0.858 ± 0.286
0.0MetXaa: 0.0 ± 0.0
Asn
2.573AsnAla: 2.573 ± 1.369
0.0AsnCys: 0.0 ± 0.0
4.288AsnAsp: 4.288 ± 0.796
0.429AsnGlu: 0.429 ± 0.228
2.144AsnPhe: 2.144 ± 1.087
2.144AsnGly: 2.144 ± 0.344
0.858AsnHis: 0.858 ± 0.456
1.286AsnIle: 1.286 ± 0.058
1.286AsnLys: 1.286 ± 0.058
5.146AsnLeu: 5.146 ± 0.51
1.286AsnMet: 1.286 ± 0.702
3.859AsnAsn: 3.859 ± 2.053
1.286AsnPro: 1.286 ± 0.684
1.715AsnGln: 1.715 ± 0.572
3.002AsnArg: 3.002 ± 1.597
1.715AsnSer: 1.715 ± 0.17
3.431AsnThr: 3.431 ± 1.083
3.431AsnVal: 3.431 ± 0.34
0.858AsnTrp: 0.858 ± 0.456
2.144AsnTyr: 2.144 ± 0.398
0.0AsnXaa: 0.0 ± 0.0
Pro
5.575ProAla: 5.575 ± 0.738
0.858ProCys: 0.858 ± 0.456
3.002ProAsp: 3.002 ± 0.854
3.002ProGlu: 3.002 ± 0.631
0.429ProPhe: 0.429 ± 0.228
1.715ProGly: 1.715 ± 0.912
2.573ProHis: 2.573 ± 0.859
2.573ProIle: 2.573 ± 1.601
3.002ProLys: 3.002 ± 0.854
3.431ProLeu: 3.431 ± 0.34
1.286ProMet: 1.286 ± 0.801
0.858ProAsn: 0.858 ± 0.456
3.002ProPro: 3.002 ± 0.854
2.144ProGln: 2.144 ± 0.398
3.859ProArg: 3.859 ± 0.174
5.146ProSer: 5.146 ± 0.232
5.575ProThr: 5.575 ± 1.481
3.002ProVal: 3.002 ± 0.112
0.429ProTrp: 0.429 ± 0.514
0.858ProTyr: 0.858 ± 1.029
0.0ProXaa: 0.0 ± 0.0
Gln
2.144GlnAla: 2.144 ± 1.141
0.0GlnCys: 0.0 ± 0.0
1.286GlnAsp: 1.286 ± 0.058
1.286GlnGlu: 1.286 ± 0.801
0.858GlnPhe: 0.858 ± 0.456
2.573GlnGly: 2.573 ± 0.626
0.858GlnHis: 0.858 ± 0.456
3.002GlnIle: 3.002 ± 0.631
1.286GlnLys: 1.286 ± 0.058
4.717GlnLeu: 4.717 ± 0.282
0.429GlnMet: 0.429 ± 0.228
1.286GlnAsn: 1.286 ± 0.801
2.573GlnPro: 2.573 ± 0.859
0.0GlnGln: 0.0 ± 0.0
2.573GlnArg: 2.573 ± 0.626
2.573GlnSer: 2.573 ± 0.116
3.002GlnThr: 3.002 ± 0.854
2.144GlnVal: 2.144 ± 0.344
1.286GlnTrp: 1.286 ± 0.058
0.858GlnTyr: 0.858 ± 0.286
0.0GlnXaa: 0.0 ± 0.0
Arg
4.717ArgAla: 4.717 ± 0.461
0.429ArgCys: 0.429 ± 0.514
3.859ArgAsp: 3.859 ± 0.917
3.431ArgGlu: 3.431 ± 0.34
2.144ArgPhe: 2.144 ± 0.398
3.859ArgGly: 3.859 ± 0.568
0.429ArgHis: 0.429 ± 0.228
3.002ArgIle: 3.002 ± 0.112
3.859ArgLys: 3.859 ± 0.917
6.003ArgLeu: 6.003 ± 2.746
1.715ArgMet: 1.715 ± 0.17
3.431ArgAsn: 3.431 ± 1.145
2.144ArgPro: 2.144 ± 0.344
2.144ArgGln: 2.144 ± 0.398
4.717ArgArg: 4.717 ± 1.024
5.575ArgSer: 5.575 ± 0.738
4.717ArgThr: 4.717 ± 1.024
5.575ArgVal: 5.575 ± 0.004
1.715ArgTrp: 1.715 ± 1.315
1.715ArgTyr: 1.715 ± 0.17
0.0ArgXaa: 0.0 ± 0.0
Ser
7.719SerAla: 7.719 ± 0.394
0.0SerCys: 0.0 ± 0.0
4.288SerAsp: 4.288 ± 0.796
5.575SerGlu: 5.575 ± 0.738
1.286SerPhe: 1.286 ± 0.801
3.002SerGly: 3.002 ± 1.373
0.858SerHis: 0.858 ± 0.456
2.573SerIle: 2.573 ± 0.859
3.859SerLys: 3.859 ± 1.659
6.861SerLeu: 6.861 ± 2.165
2.144SerMet: 2.144 ± 1.141
3.431SerAsn: 3.431 ± 1.083
2.144SerPro: 2.144 ± 0.398
5.146SerGln: 5.146 ± 0.51
4.288SerArg: 4.288 ± 0.689
6.432SerSer: 6.432 ± 3.422
3.859SerThr: 3.859 ± 0.174
3.859SerVal: 3.859 ± 2.053
1.286SerTrp: 1.286 ± 0.801
3.002SerTyr: 3.002 ± 0.112
0.0SerXaa: 0.0 ± 0.0
Thr
7.29ThrAla: 7.29 ± 0.577
2.573ThrCys: 2.573 ± 1.369
5.146ThrAsp: 5.146 ± 2.737
2.573ThrGlu: 2.573 ± 0.116
2.573ThrPhe: 2.573 ± 1.601
5.146ThrGly: 5.146 ± 0.232
0.0ThrHis: 0.0 ± 0.0
3.002ThrIle: 3.002 ± 0.854
2.573ThrLys: 2.573 ± 0.626
8.148ThrLeu: 8.148 ± 1.364
2.144ThrMet: 2.144 ± 0.174
3.431ThrAsn: 3.431 ± 0.34
3.431ThrPro: 3.431 ± 0.402
2.144ThrGln: 2.144 ± 0.344
4.288ThrArg: 4.288 ± 1.431
6.861ThrSer: 6.861 ± 2.907
7.29ThrThr: 7.29 ± 2.393
5.146ThrVal: 5.146 ± 1.253
1.715ThrTrp: 1.715 ± 1.315
0.858ThrTyr: 0.858 ± 0.456
0.0ThrXaa: 0.0 ± 0.0
Val
4.717ValAla: 4.717 ± 0.282
1.715ValCys: 1.715 ± 0.912
3.859ValAsp: 3.859 ± 0.174
5.146ValGlu: 5.146 ± 0.975
1.286ValPhe: 1.286 ± 0.684
3.431ValGly: 3.431 ± 0.402
2.573ValHis: 2.573 ± 0.626
2.144ValIle: 2.144 ± 0.398
1.715ValLys: 1.715 ± 0.572
3.431ValLeu: 3.431 ± 1.145
1.715ValMet: 1.715 ± 0.912
1.715ValAsn: 1.715 ± 0.912
5.146ValPro: 5.146 ± 0.232
0.858ValGln: 0.858 ± 0.456
2.573ValArg: 2.573 ± 1.601
6.861ValSer: 6.861 ± 0.062
5.575ValThr: 5.575 ± 0.738
4.288ValVal: 4.288 ± 0.796
0.858ValTrp: 0.858 ± 0.286
0.858ValTyr: 0.858 ± 0.456
0.0ValXaa: 0.0 ± 0.0
Trp
2.144TrpAla: 2.144 ± 1.141
0.858TrpCys: 0.858 ± 1.029
0.858TrpAsp: 0.858 ± 1.029
0.858TrpGlu: 0.858 ± 0.286
0.0TrpPhe: 0.0 ± 0.0
2.573TrpGly: 2.573 ± 0.626
0.858TrpHis: 0.858 ± 1.029
0.429TrpIle: 0.429 ± 0.514
1.286TrpLys: 1.286 ± 0.801
0.858TrpLeu: 0.858 ± 1.029
0.429TrpMet: 0.429 ± 0.514
1.715TrpAsn: 1.715 ± 0.572
2.573TrpPro: 2.573 ± 1.601
1.715TrpGln: 1.715 ± 0.572
1.286TrpArg: 1.286 ± 0.058
1.286TrpSer: 1.286 ± 0.058
0.858TrpThr: 0.858 ± 0.286
0.858TrpVal: 0.858 ± 0.286
0.0TrpTrp: 0.0 ± 0.0
0.429TrpTyr: 0.429 ± 0.228
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.573TyrAla: 2.573 ± 0.626
0.0TyrCys: 0.0 ± 0.0
1.715TyrAsp: 1.715 ± 0.17
1.715TyrGlu: 1.715 ± 0.572
1.286TyrPhe: 1.286 ± 0.058
1.715TyrGly: 1.715 ± 0.17
0.429TyrHis: 0.429 ± 0.514
1.715TyrIle: 1.715 ± 0.17
0.858TyrLys: 0.858 ± 0.456
6.861TyrLeu: 6.861 ± 3.032
1.286TyrMet: 1.286 ± 0.801
1.286TyrAsn: 1.286 ± 0.801
0.429TyrPro: 0.429 ± 0.228
1.715TyrGln: 1.715 ± 0.17
1.715TyrArg: 1.715 ± 0.17
2.144TyrSer: 2.144 ± 1.087
3.002TyrThr: 3.002 ± 0.112
1.286TyrVal: 1.286 ± 0.801
1.286TyrTrp: 1.286 ± 0.801
0.429TyrTyr: 0.429 ± 0.228
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2333 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski