Amino acid dipeptide frequency for Hubei tombus-like virus 31

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
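As a minimal sketch of how such frequencies can be computed, the Python snippet below counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical. The snippet assumes that pairs are counted within each protein and that any non-standard letter is mapped to X (Xaa); the page does not state how the database handles either point.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Assumption: overlapping windows of length 2, counted within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}

# Toy input (not the real proteome): 4 pairs from the first sequence, 2 from the second.
freqs = dipeptide_frequencies(["MKVLA", "AAB"])
```

With this toy input each of the six observed pairs occurs once, so each gets one sixth of the total.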

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
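The comparison against the uniform expectation can be sketched as follows. The function names are hypothetical, and the snippet assumes the colouring is a plain comparison with 1/400 (or 1/441 when Xaa is included); the page does not spell out the exact rule.

```python
def expected_permille(n_symbols=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000 / (n_symbols ** 2)

def classify(observed_permille, n_symbols=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(n_symbols)
    if observed_permille > expected:
        return "over"    # shown in red in the table
    if observed_permille < expected:
        return "under"   # shown in blue in the table
    return "expected"

print(expected_permille())   # 2.5
print(classify(6.014))       # AlaAla from the table: over
print(classify(0.859))       # AlaCys from the table: under
```

Passing `n_symbols=21` gives the slightly lower expectation that applies when Xaa is treated as a 21st amino acid.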

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
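For readers who prefer fractions or percentages, the unit conversion is a one-liner each way. The helper names below are hypothetical and serve only as a small illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10

# AlaAla from the table: 6.014 per mille is about 0.006014, or about 0.6014%.
ala_ala_fraction = permille_to_fraction(6.014)
ala_ala_percent = permille_to_percent(6.014)
```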

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.014AlaAla: 6.014 ± 3.62
0.859AlaCys: 0.859 ± 0.627
0.859AlaAsp: 0.859 ± 0.731
3.436AlaGlu: 3.436 ± 2.51
0.859AlaPhe: 0.859 ± 0.731
6.873AlaGly: 6.873 ± 3.601
3.436AlaHis: 3.436 ± 0.721
0.859AlaIle: 0.859 ± 0.804
1.718AlaLys: 1.718 ± 1.255
6.873AlaLeu: 6.873 ± 1.061
1.718AlaMet: 1.718 ± 1.461
2.577AlaAsn: 2.577 ± 1.243
5.155AlaPro: 5.155 ± 1.714
1.718AlaGln: 1.718 ± 0.538
3.436AlaArg: 3.436 ± 1.782
6.873AlaSer: 6.873 ± 1.226
6.873AlaThr: 6.873 ± 2.693
8.591AlaVal: 8.591 ± 4.03
0.0AlaTrp: 0.0 ± 0.0
2.577AlaTyr: 2.577 ± 1.32
0.0AlaXaa: 0.0 ± 0.0
Cys
0.859CysAla: 0.859 ± 0.731
0.0CysCys: 0.0 ± 0.0
0.859CysAsp: 0.859 ± 0.627
0.0CysGlu: 0.0 ± 0.0
0.859CysPhe: 0.859 ± 0.627
0.859CysGly: 0.859 ± 0.731
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.859CysLys: 0.859 ± 0.731
1.718CysLeu: 1.718 ± 0.838
0.859CysMet: 0.859 ± 0.747
1.718CysAsn: 1.718 ± 0.538
0.859CysPro: 0.859 ± 0.627
0.859CysGln: 0.859 ± 0.627
0.859CysArg: 0.859 ± 0.627
0.0CysSer: 0.0 ± 0.0
1.718CysThr: 1.718 ± 1.461
0.859CysVal: 0.859 ± 0.731
0.0CysTrp: 0.0 ± 0.0
0.859CysTyr: 0.859 ± 0.627
0.0CysXaa: 0.0 ± 0.0
Asp
0.0AspAla: 0.0 ± 0.0
0.0AspCys: 0.0 ± 0.0
2.577AspAsp: 2.577 ± 1.518
1.718AspGlu: 1.718 ± 0.813
0.859AspPhe: 0.859 ± 0.731
4.296AspGly: 4.296 ± 2.365
3.436AspHis: 3.436 ± 2.51
4.296AspIle: 4.296 ± 1.425
1.718AspLys: 1.718 ± 0.813
1.718AspLeu: 1.718 ± 0.538
2.577AspMet: 2.577 ± 0.278
1.718AspAsn: 1.718 ± 1.461
0.859AspPro: 0.859 ± 0.731
0.0AspGln: 0.0 ± 0.0
2.577AspArg: 2.577 ± 1.243
4.296AspSer: 4.296 ± 1.013
2.577AspThr: 2.577 ± 0.278
3.436AspVal: 3.436 ± 0.53
0.0AspTrp: 0.0 ± 0.0
2.577AspTyr: 2.577 ± 0.278
0.0AspXaa: 0.0 ± 0.0
Glu
3.436GluAla: 3.436 ± 1.676
0.859GluCys: 0.859 ± 0.731
2.577GluAsp: 2.577 ± 1.518
0.859GluGlu: 0.859 ± 0.627
2.577GluPhe: 2.577 ± 1.119
3.436GluGly: 3.436 ± 1.782
0.0GluHis: 0.0 ± 0.0
4.296GluIle: 4.296 ± 1.128
4.296GluLys: 4.296 ± 2.069
6.014GluLeu: 6.014 ± 1.838
0.0GluMet: 0.0 ± 0.0
4.296GluAsn: 4.296 ± 1.128
1.718GluPro: 1.718 ± 1.609
3.436GluGln: 3.436 ± 1.782
3.436GluArg: 3.436 ± 1.676
3.436GluSer: 3.436 ± 2.19
1.718GluThr: 1.718 ± 0.838
5.155GluVal: 5.155 ± 1.747
0.0GluTrp: 0.0 ± 0.0
4.296GluTyr: 4.296 ± 0.293
0.0GluXaa: 0.0 ± 0.0
Phe
0.859PheAla: 0.859 ± 0.731
0.0PheCys: 0.0 ± 0.0
0.859PheAsp: 0.859 ± 0.627
0.859PheGlu: 0.859 ± 0.804
0.859PhePhe: 0.859 ± 0.731
2.577PheGly: 2.577 ± 1.119
1.718PheHis: 1.718 ± 0.538
1.718PheIle: 1.718 ± 0.538
3.436PheLys: 3.436 ± 0.721
1.718PheLeu: 1.718 ± 0.538
3.436PheMet: 3.436 ± 1.47
0.859PheAsn: 0.859 ± 0.731
2.577PhePro: 2.577 ± 2.192
0.859PheGln: 0.859 ± 0.731
4.296PheArg: 4.296 ± 1.88
2.577PheSer: 2.577 ± 1.119
4.296PheThr: 4.296 ± 1.425
2.577PheVal: 2.577 ± 0.278
0.859PheTrp: 0.859 ± 0.627
0.859PheTyr: 0.859 ± 0.731
0.0PheXaa: 0.0 ± 0.0
Gly
6.014GlyAla: 6.014 ± 2.922
1.718GlyCys: 1.718 ± 0.538
4.296GlyAsp: 4.296 ± 1.597
4.296GlyGlu: 4.296 ± 1.597
2.577GlyPhe: 2.577 ± 0.912
1.718GlyGly: 1.718 ± 1.461
0.0GlyHis: 0.0 ± 0.0
6.014GlyIle: 6.014 ± 1.702
1.718GlyLys: 1.718 ± 1.609
5.155GlyLeu: 5.155 ± 2.147
2.577GlyMet: 2.577 ± 1.691
4.296GlyAsn: 4.296 ± 1.597
4.296GlyPro: 4.296 ± 2.069
2.577GlyGln: 2.577 ± 1.119
2.577GlyArg: 2.577 ± 1.443
8.591GlySer: 8.591 ± 2.126
6.014GlyThr: 6.014 ± 1.351
3.436GlyVal: 3.436 ± 1.625
0.859GlyTrp: 0.859 ± 0.804
1.718GlyTyr: 1.718 ± 1.461
0.0GlyXaa: 0.0 ± 0.0
His
3.436HisAla: 3.436 ± 0.53
0.859HisCys: 0.859 ± 0.627
0.0HisAsp: 0.0 ± 0.0
2.577HisGlu: 2.577 ± 1.243
0.859HisPhe: 0.859 ± 0.731
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
1.718HisIle: 1.718 ± 1.255
1.718HisLys: 1.718 ± 0.538
0.859HisLeu: 0.859 ± 0.627
0.0HisMet: 0.0 ± 0.0
0.859HisAsn: 0.859 ± 0.804
0.859HisPro: 0.859 ± 0.627
0.0HisGln: 0.0 ± 0.0
0.859HisArg: 0.859 ± 0.627
1.718HisSer: 1.718 ± 0.538
1.718HisThr: 1.718 ± 1.255
0.0HisVal: 0.0 ± 0.0
0.859HisTrp: 0.859 ± 0.627
0.859HisTyr: 0.859 ± 0.731
0.0HisXaa: 0.0 ± 0.0
Ile
0.859IleAla: 0.859 ± 0.731
1.718IleCys: 1.718 ± 0.838
1.718IleAsp: 1.718 ± 0.538
6.014IleGlu: 6.014 ± 2.839
2.577IlePhe: 2.577 ± 1.32
4.296IleGly: 4.296 ± 2.526
0.859IleHis: 0.859 ± 0.627
0.0IleIle: 0.0 ± 0.0
1.718IleLys: 1.718 ± 0.838
6.014IleLeu: 6.014 ± 1.766
0.859IleMet: 0.859 ± 0.731
4.296IleAsn: 4.296 ± 1.36
2.577IlePro: 2.577 ± 1.882
1.718IleGln: 1.718 ± 0.813
4.296IleArg: 4.296 ± 1.128
5.155IleSer: 5.155 ± 0.979
1.718IleThr: 1.718 ± 1.461
5.155IleVal: 5.155 ± 1.295
0.859IleTrp: 0.859 ± 0.627
0.859IleTyr: 0.859 ± 0.731
0.0IleXaa: 0.0 ± 0.0
Lys
1.718LysAla: 1.718 ± 1.461
0.859LysCys: 0.859 ± 0.627
2.577LysAsp: 2.577 ± 1.243
4.296LysGlu: 4.296 ± 1.013
3.436LysPhe: 3.436 ± 1.625
2.577LysGly: 2.577 ± 0.278
0.859LysHis: 0.859 ± 0.627
2.577LysIle: 2.577 ± 1.243
2.577LysLys: 2.577 ± 2.192
4.296LysLeu: 4.296 ± 0.293
0.859LysMet: 0.859 ± 0.627
0.0LysAsn: 0.0 ± 0.0
1.718LysPro: 1.718 ± 0.538
1.718LysGln: 1.718 ± 0.813
2.577LysArg: 2.577 ± 1.882
3.436LysSer: 3.436 ± 1.972
2.577LysThr: 2.577 ± 0.912
5.155LysVal: 5.155 ± 1.824
0.0LysTrp: 0.0 ± 0.0
1.718LysTyr: 1.718 ± 0.538
0.0LysXaa: 0.0 ± 0.0
Leu
7.732LeuAla: 7.732 ± 1.402
0.859LeuCys: 0.859 ± 0.731
1.718LeuAsp: 1.718 ± 0.838
6.014LeuGlu: 6.014 ± 2.839
0.0LeuPhe: 0.0 ± 0.0
7.732LeuGly: 7.732 ± 2.137
1.718LeuHis: 1.718 ± 0.838
3.436LeuIle: 3.436 ± 1.076
4.296LeuLys: 4.296 ± 2.365
2.577LeuLeu: 2.577 ± 1.443
0.0LeuMet: 0.0 ± 0.0
1.718LeuAsn: 1.718 ± 0.813
2.577LeuPro: 2.577 ± 1.243
3.436LeuGln: 3.436 ± 0.721
8.591LeuArg: 8.591 ± 2.256
6.873LeuSer: 6.873 ± 2.116
7.732LeuThr: 7.732 ± 2.656
5.155LeuVal: 5.155 ± 0.979
2.577LeuTrp: 2.577 ± 1.443
1.718LeuTyr: 1.718 ± 0.838
0.0LeuXaa: 0.0 ± 0.0
Met
2.577MetAla: 2.577 ± 0.912
0.859MetCys: 0.859 ± 0.731
0.0MetAsp: 0.0 ± 0.0
0.859MetGlu: 0.859 ± 0.627
0.859MetPhe: 0.859 ± 0.804
3.436MetGly: 3.436 ± 1.812
1.718MetHis: 1.718 ± 0.538
0.859MetIle: 0.859 ± 0.731
1.718MetLys: 1.718 ± 1.255
0.859MetLeu: 0.859 ± 0.731
1.718MetMet: 1.718 ± 1.461
2.577MetAsn: 2.577 ± 1.243
0.0MetPro: 0.0 ± 0.0
0.0MetGln: 0.0 ± 0.0
1.718MetArg: 1.718 ± 1.255
1.718MetSer: 1.718 ± 1.255
1.718MetThr: 1.718 ± 0.538
0.859MetVal: 0.859 ± 0.627
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
1.718AsnAla: 1.718 ± 0.538
0.859AsnCys: 0.859 ± 0.731
1.718AsnAsp: 1.718 ± 1.255
3.436AsnGlu: 3.436 ± 1.47
1.718AsnPhe: 1.718 ± 0.813
2.577AsnGly: 2.577 ± 1.119
0.0AsnHis: 0.0 ± 0.0
0.859AsnIle: 0.859 ± 0.804
2.577AsnLys: 2.577 ± 1.119
2.577AsnLeu: 2.577 ± 0.278
0.859AsnMet: 0.859 ± 0.627
3.436AsnAsn: 3.436 ± 1.076
3.436AsnPro: 3.436 ± 0.721
1.718AsnGln: 1.718 ± 0.538
3.436AsnArg: 3.436 ± 1.076
4.296AsnSer: 4.296 ± 2.526
4.296AsnThr: 4.296 ± 1.88
5.155AsnVal: 5.155 ± 0.979
1.718AsnTrp: 1.718 ± 0.838
1.718AsnTyr: 1.718 ± 1.255
0.0AsnXaa: 0.0 ± 0.0
Pro
5.155ProAla: 5.155 ± 0.556
0.0ProCys: 0.0 ± 0.0
0.859ProAsp: 0.859 ± 0.731
0.859ProGlu: 0.859 ± 0.804
2.577ProPhe: 2.577 ± 1.119
5.155ProGly: 5.155 ± 0.556
0.0ProHis: 0.0 ± 0.0
3.436ProIle: 3.436 ± 1.47
0.0ProLys: 0.0 ± 0.0
3.436ProLeu: 3.436 ± 0.53
0.859ProMet: 0.859 ± 0.731
3.436ProAsn: 3.436 ± 1.076
1.718ProPro: 1.718 ± 0.538
3.436ProGln: 3.436 ± 1.077
2.577ProArg: 2.577 ± 0.912
3.436ProSer: 3.436 ± 1.47
2.577ProThr: 2.577 ± 1.518
5.155ProVal: 5.155 ± 2.486
0.859ProTrp: 0.859 ± 0.731
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
0.859GlnAla: 0.859 ± 0.804
2.577GlnCys: 2.577 ± 1.243
2.577GlnAsp: 2.577 ± 1.243
0.0GlnGlu: 0.0 ± 0.0
1.718GlnPhe: 1.718 ± 0.538
1.718GlnGly: 1.718 ± 0.813
0.0GlnHis: 0.0 ± 0.0
0.859GlnIle: 0.859 ± 0.731
0.859GlnLys: 0.859 ± 0.731
4.296GlnLeu: 4.296 ± 1.013
0.0GlnMet: 0.0 ± 0.0
2.577GlnAsn: 2.577 ± 1.119
0.859GlnPro: 0.859 ± 0.627
0.859GlnGln: 0.859 ± 0.804
2.577GlnArg: 2.577 ± 1.243
0.859GlnSer: 0.859 ± 0.804
6.873GlnThr: 6.873 ± 2.43
1.718GlnVal: 1.718 ± 0.538
0.0GlnTrp: 0.0 ± 0.0
0.859GlnTyr: 0.859 ± 0.731
0.0GlnXaa: 0.0 ± 0.0
Arg
5.155ArgAla: 5.155 ± 2.486
0.859ArgCys: 0.859 ± 0.627
1.718ArgAsp: 1.718 ± 0.838
0.859ArgGlu: 0.859 ± 0.804
2.577ArgPhe: 2.577 ± 0.912
5.155ArgGly: 5.155 ± 0.556
2.577ArgHis: 2.577 ± 0.912
6.014ArgIle: 6.014 ± 2.839
4.296ArgLys: 4.296 ± 1.128
9.45ArgLeu: 9.45 ± 3.537
2.577ArgMet: 2.577 ± 1.882
1.718ArgAsn: 1.718 ± 0.813
4.296ArgPro: 4.296 ± 2.365
4.296ArgGln: 4.296 ± 1.128
6.014ArgArg: 6.014 ± 2.839
6.873ArgSer: 6.873 ± 3.63
0.859ArgThr: 0.859 ± 0.804
0.0ArgVal: 0.0 ± 0.0
1.718ArgTrp: 1.718 ± 0.838
2.577ArgTyr: 2.577 ± 1.119
0.0ArgXaa: 0.0 ± 0.0
Ser
7.732SerAla: 7.732 ± 2.957
0.859SerCys: 0.859 ± 0.731
5.155SerAsp: 5.155 ± 1.295
2.577SerGlu: 2.577 ± 0.278
0.859SerPhe: 0.859 ± 0.804
4.296SerGly: 4.296 ± 1.425
0.859SerHis: 0.859 ± 0.627
2.577SerIle: 2.577 ± 1.119
4.296SerLys: 4.296 ± 0.293
2.577SerLeu: 2.577 ± 1.32
0.0SerMet: 0.0 ± 0.0
2.577SerAsn: 2.577 ± 0.912
5.155SerPro: 5.155 ± 0.556
4.296SerGln: 4.296 ± 1.04
5.155SerArg: 5.155 ± 0.556
9.45SerSer: 9.45 ± 2.363
9.45SerThr: 9.45 ± 3.819
11.168SerVal: 11.168 ± 3.445
2.577SerTrp: 2.577 ± 1.518
2.577SerTyr: 2.577 ± 0.912
0.0SerXaa: 0.0 ± 0.0
Thr
7.732ThrAla: 7.732 ± 0.904
0.859ThrCys: 0.859 ± 0.731
2.577ThrAsp: 2.577 ± 0.278
6.014ThrGlu: 6.014 ± 2.467
5.155ThrPhe: 5.155 ± 2.238
2.577ThrGly: 2.577 ± 1.119
1.718ThrHis: 1.718 ± 0.838
6.873ThrIle: 6.873 ± 1.873
4.296ThrLys: 4.296 ± 2.666
4.296ThrLeu: 4.296 ± 2.967
0.0ThrMet: 0.0 ± 0.0
6.873ThrAsn: 6.873 ± 1.442
1.718ThrPro: 1.718 ± 0.538
0.0ThrGln: 0.0 ± 0.0
4.296ThrArg: 4.296 ± 1.88
5.155ThrSer: 5.155 ± 3.248
6.014ThrThr: 6.014 ± 0.813
7.732ThrVal: 7.732 ± 3.159
0.0ThrTrp: 0.0 ± 0.0
2.577ThrTyr: 2.577 ± 1.119
0.0ThrXaa: 0.0 ± 0.0
Val
7.732ValAla: 7.732 ± 0.834
0.0ValCys: 0.0 ± 0.0
3.436ValAsp: 3.436 ± 1.972
6.873ValGlu: 6.873 ± 2.312
6.873ValPhe: 6.873 ± 1.084
6.873ValGly: 6.873 ± 2.43
0.0ValHis: 0.0 ± 0.0
3.436ValIle: 3.436 ± 2.281
2.577ValLys: 2.577 ± 0.278
5.155ValLeu: 5.155 ± 1.295
3.436ValMet: 3.436 ± 1.623
1.718ValAsn: 1.718 ± 0.538
2.577ValPro: 2.577 ± 1.32
0.859ValGln: 0.859 ± 0.731
6.873ValArg: 6.873 ± 2.116
5.155ValSer: 5.155 ± 0.663
4.296ValThr: 4.296 ± 1.425
7.732ValVal: 7.732 ± 1.994
0.859ValTrp: 0.859 ± 0.627
5.155ValTyr: 5.155 ± 0.556
0.0ValXaa: 0.0 ± 0.0
Trp
0.859TrpAla: 0.859 ± 0.804
0.859TrpCys: 0.859 ± 0.627
0.859TrpAsp: 0.859 ± 0.804
2.577TrpGlu: 2.577 ± 1.243
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.859TrpIle: 0.859 ± 0.627
0.0TrpLys: 0.0 ± 0.0
3.436TrpLeu: 3.436 ± 1.077
0.859TrpMet: 0.859 ± 0.627
0.859TrpAsn: 0.859 ± 0.804
0.0TrpPro: 0.0 ± 0.0
0.859TrpGln: 0.859 ± 0.627
0.859TrpArg: 0.859 ± 0.804
0.0TrpSer: 0.0 ± 0.0
0.0TrpThr: 0.0 ± 0.0
1.718TrpVal: 1.718 ± 0.538
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.859TyrAla: 0.859 ± 0.731
0.0TyrCys: 0.0 ± 0.0
4.296TyrAsp: 4.296 ± 2.526
1.718TyrGlu: 1.718 ± 0.538
0.0TyrPhe: 0.0 ± 0.0
4.296TyrGly: 4.296 ± 2.025
0.859TyrHis: 0.859 ± 0.627
2.577TyrIle: 2.577 ± 2.192
0.859TyrLys: 0.859 ± 0.627
3.436TyrLeu: 3.436 ± 0.721
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
2.577TyrPro: 2.577 ± 0.278
0.0TyrGln: 0.0 ± 0.0
2.577TyrArg: 2.577 ± 0.278
4.296TyrSer: 4.296 ± 0.293
3.436TyrThr: 3.436 ± 1.812
0.859TyrVal: 0.859 ± 0.731
0.859TyrTrp: 0.859 ± 0.627
3.436TyrTyr: 3.436 ± 1.676
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1165 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level
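A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. The function names, the fixed seed, and the use of the standard deviation as the error measure are assumptions made for illustration; the page only states that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread (population std. dev.) of a dipeptide's per mille frequency
    over n_boot resamples of whole proteins, drawn with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not individual residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)

# Toy sequences (not the real proteome).
toy = ["AAAA", "CCCC", "AACC"]
err_aa = bootstrap_error(toy, "AA")
err_absent = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs has the same estimate in every resample, which is consistent with the `0.0 ± 0.0` entries in the table.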

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski