Amino acid dipeptide frequency for Wenzhou tombus-like virus 15

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
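The counting described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function name `dipeptide_per_mille`, the use of overlapping residue pairs counted within each protein, and the mapping of every non-standard letter to `X` (Xaa) are assumptions made for the example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs; a pair never spans two different proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences, not taken from the proteome on this page.
freqs = dipeptide_per_mille(["MKTAYIAKQR", "MAAAK"])
uniform = 1000.0 / 400  # 2.5 per mille, i.e. 0.25% for each of 400 dipeptides
```

Dipeptides that never occur are simply absent from the returned dictionary, which corresponds to the 0.0 entries in the table.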

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
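As a small worked example, a table value can be converted to a plain fraction and compared with the uniform expectation of 2.5‰ (0.25%). The helper names below are hypothetical, and the page does not state the exact rule behind its red and blue marking, so the simple greater-than and less-than comparison is an assumption.

```python
UNIFORM_PER_MILLE = 1000.0 / 400  # 2.5 per mille = 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=UNIFORM_PER_MILLE):
    """Classify a per mille value against the expected value (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values taken from the table: AlaAla = 5.731, AlaCys = 0.716 (per mille).
ala_ala = per_mille_to_fraction(5.731)   # about 0.005731
ala_ala_class = representation(5.731)    # "over"
ala_cys_class = representation(0.716)    # "under"
```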

For more information, see the sequence space article on Wikipedia.

Columns (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.731 ± 4.39
AlaCys: 0.716 ± 0.365
AlaAsp: 2.865 ± 0.002
AlaGlu: 3.582 ± 1.825
AlaPhe: 2.149 ± 1.829
AlaGly: 6.447 ± 4.025
AlaHis: 0.716 ± 0.365
AlaIle: 4.298 ± 0.734
AlaLys: 5.014 ± 1.093
AlaLeu: 5.731 ± 1.466
AlaMet: 1.433 ± 2.194
AlaAsn: 2.865 ± 1.464
AlaPro: 3.582 ± 1.099
AlaGln: 0.0 ± 0.0
AlaArg: 2.149 ± 0.367
AlaSer: 3.582 ± 1.099
AlaThr: 7.88 ± 3.295
AlaVal: 2.149 ± 1.095
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.433 ± 0.732
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.149 ± 0.367
CysCys: 0.0 ± 0.0
CysAsp: 1.433 ± 0.73
CysGlu: 1.433 ± 0.73
CysPhe: 2.149 ± 1.095
CysGly: 0.716 ± 0.365
CysHis: 0.716 ± 0.365
CysIle: 0.716 ± 1.097
CysLys: 2.149 ± 0.367
CysLeu: 0.716 ± 0.365
CysMet: 0.0 ± 0.0
CysAsn: 1.433 ± 0.732
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.716 ± 0.365
CysSer: 0.716 ± 0.365
CysThr: 0.0 ± 0.0
CysVal: 0.716 ± 0.365
CysTrp: 0.0 ± 0.0
CysTyr: 0.716 ± 0.365
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.582 ± 1.099
AspCys: 0.0 ± 0.0
AspAsp: 2.865 ± 1.46
AspGlu: 1.433 ± 0.73
AspPhe: 1.433 ± 0.73
AspGly: 3.582 ± 0.363
AspHis: 0.716 ± 0.365
AspIle: 4.298 ± 0.728
AspLys: 6.447 ± 3.285
AspLeu: 3.582 ± 1.825
AspMet: 2.149 ± 1.3
AspAsn: 3.582 ± 1.825
AspPro: 3.582 ± 1.099
AspGln: 2.149 ± 0.367
AspArg: 2.149 ± 1.095
AspSer: 2.865 ± 0.002
AspThr: 2.865 ± 1.46
AspVal: 3.582 ± 1.099
AspTrp: 2.149 ± 1.095
AspTyr: 3.582 ± 0.363
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.865 ± 1.46
GluCys: 0.716 ± 0.365
GluAsp: 2.865 ± 1.46
GluGlu: 3.582 ± 0.363
GluPhe: 2.865 ± 0.002
GluGly: 3.582 ± 0.363
GluHis: 2.865 ± 1.46
GluIle: 5.014 ± 0.369
GluLys: 4.298 ± 2.19
GluLeu: 1.433 ± 0.73
GluMet: 0.0 ± 0.0
GluAsn: 5.014 ± 2.555
GluPro: 2.865 ± 0.002
GluGln: 1.433 ± 0.73
GluArg: 0.716 ± 0.365
GluSer: 2.149 ± 0.367
GluThr: 0.716 ± 0.365
GluVal: 2.865 ± 1.46
GluTrp: 0.0 ± 0.0
GluTyr: 5.731 ± 0.004
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.716 ± 0.365
PheCys: 2.149 ± 1.095
PheAsp: 1.433 ± 0.732
PheGlu: 2.149 ± 1.095
PhePhe: 0.716 ± 0.365
PheGly: 4.298 ± 2.196
PheHis: 0.0 ± 0.0
PheIle: 5.014 ± 1.093
PheLys: 4.298 ± 0.728
PheLeu: 2.865 ± 0.002
PheMet: 0.716 ± 0.365
PheAsn: 2.865 ± 0.002
PhePro: 2.149 ± 0.367
PheGln: 1.433 ± 0.732
PheArg: 1.433 ± 0.732
PheSer: 2.149 ± 0.367
PheThr: 2.149 ± 1.829
PheVal: 1.433 ± 0.732
PheTrp: 0.0 ± 0.0
PheTyr: 0.716 ± 0.365
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.582 ± 1.099
GlyCys: 1.433 ± 0.73
GlyAsp: 6.447 ± 1.823
GlyGlu: 1.433 ± 0.732
GlyPhe: 2.865 ± 0.002
GlyGly: 5.014 ± 4.755
GlyHis: 0.0 ± 0.0
GlyIle: 5.014 ± 1.831
GlyLys: 4.298 ± 0.728
GlyLeu: 3.582 ± 0.363
GlyMet: 0.0 ± 0.0
GlyAsn: 2.865 ± 1.464
GlyPro: 2.149 ± 1.829
GlyGln: 2.149 ± 0.367
GlyArg: 2.865 ± 1.46
GlySer: 5.731 ± 1.466
GlyThr: 7.88 ± 3.295
GlyVal: 2.149 ± 0.367
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.149 ± 1.829
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.716 ± 0.365
HisCys: 0.0 ± 0.0
HisAsp: 0.716 ± 0.365
HisGlu: 2.149 ± 0.367
HisPhe: 2.149 ± 1.829
HisGly: 0.716 ± 0.365
HisHis: 0.716 ± 0.365
HisIle: 0.0 ± 0.0
HisLys: 1.433 ± 0.73
HisLeu: 2.865 ± 1.46
HisMet: 0.0 ± 0.0
HisAsn: 1.433 ± 0.73
HisPro: 0.716 ± 0.365
HisGln: 0.716 ± 0.365
HisArg: 1.433 ± 0.73
HisSer: 1.433 ± 0.73
HisThr: 1.433 ± 0.73
HisVal: 0.716 ± 0.365
HisTrp: 0.0 ± 0.0
HisTyr: 1.433 ± 0.73
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.731 ± 1.466
IleCys: 0.716 ± 1.097
IleAsp: 6.447 ± 1.823
IleGlu: 4.298 ± 0.728
IlePhe: 0.716 ± 0.365
IleGly: 1.433 ± 0.732
IleHis: 0.716 ± 1.097
IleIle: 2.865 ± 1.46
IleLys: 5.731 ± 1.466
IleLeu: 4.298 ± 0.734
IleMet: 1.433 ± 0.73
IleAsn: 4.298 ± 0.734
IlePro: 7.163 ± 0.726
IleGln: 4.298 ± 2.196
IleArg: 4.298 ± 0.728
IleSer: 3.582 ± 1.099
IleThr: 6.447 ± 4.025
IleVal: 2.149 ± 1.095
IleTrp: 1.433 ± 0.73
IleTyr: 5.014 ± 0.369
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.582 ± 0.363
LysCys: 1.433 ± 0.73
LysAsp: 4.298 ± 2.19
LysGlu: 2.149 ± 1.095
LysPhe: 3.582 ± 1.825
LysGly: 3.582 ± 1.825
LysHis: 0.716 ± 0.365
LysIle: 4.298 ± 0.728
LysLys: 2.149 ± 1.095
LysLeu: 8.596 ± 2.918
LysMet: 0.0 ± 0.0
LysAsn: 6.447 ± 3.285
LysPro: 5.014 ± 1.093
LysGln: 4.298 ± 0.728
LysArg: 2.865 ± 0.002
LysSer: 5.731 ± 0.004
LysThr: 2.865 ± 1.46
LysVal: 2.865 ± 2.926
LysTrp: 0.0 ± 0.0
LysTyr: 5.731 ± 2.92
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.298 ± 3.658
LeuCys: 1.433 ± 0.73
LeuAsp: 4.298 ± 2.19
LeuGlu: 2.865 ± 1.46
LeuPhe: 3.582 ± 1.099
LeuGly: 2.865 ± 0.002
LeuHis: 2.149 ± 1.095
LeuIle: 6.447 ± 1.101
LeuLys: 5.731 ± 2.92
LeuLeu: 7.163 ± 2.188
LeuMet: 2.865 ± 1.46
LeuAsn: 5.014 ± 1.093
LeuPro: 2.149 ± 1.095
LeuGln: 1.433 ± 0.732
LeuArg: 2.865 ± 0.002
LeuSer: 3.582 ± 0.363
LeuThr: 2.865 ± 1.464
LeuVal: 4.298 ± 2.196
LeuTrp: 0.0 ± 0.0
LeuTyr: 4.298 ± 0.728
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.0 ± 0.0
MetCys: 0.0 ± 0.0
MetAsp: 1.433 ± 0.73
MetGlu: 0.0 ± 0.0
MetPhe: 0.716 ± 0.365
MetGly: 2.149 ± 1.095
MetHis: 0.0 ± 0.0
MetIle: 2.149 ± 1.095
MetLys: 0.0 ± 0.0
MetLeu: 0.0 ± 0.0
MetMet: 0.0 ± 0.0
MetAsn: 0.716 ± 0.365
MetPro: 0.0 ± 0.0
MetGln: 1.433 ± 0.732
MetArg: 2.865 ± 1.46
MetSer: 1.433 ± 0.732
MetThr: 0.716 ± 0.365
MetVal: 0.716 ± 1.097
MetTrp: 0.0 ± 0.0
MetTyr: 0.716 ± 0.365
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.731 ± 1.458
AsnCys: 0.716 ± 0.365
AsnAsp: 2.149 ± 1.095
AsnGlu: 4.298 ± 2.19
AsnPhe: 2.149 ± 0.367
AsnGly: 1.433 ± 0.73
AsnHis: 2.149 ± 1.095
AsnIle: 5.731 ± 1.458
AsnLys: 2.865 ± 1.46
AsnLeu: 4.298 ± 0.734
AsnMet: 0.716 ± 0.603
AsnAsn: 5.731 ± 2.92
AsnPro: 2.149 ± 0.367
AsnGln: 2.149 ± 1.829
AsnArg: 3.582 ± 1.825
AsnSer: 1.433 ± 0.732
AsnThr: 5.014 ± 1.093
AsnVal: 5.014 ± 0.369
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.433 ± 0.732
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.865 ± 1.464
ProCys: 0.716 ± 0.365
ProAsp: 0.716 ± 0.365
ProGlu: 4.298 ± 0.728
ProPhe: 0.716 ± 0.365
ProGly: 5.014 ± 4.755
ProHis: 1.433 ± 0.73
ProIle: 5.014 ± 0.369
ProLys: 2.149 ± 1.095
ProLeu: 2.149 ± 0.367
ProMet: 0.716 ± 0.365
ProAsn: 2.149 ± 0.367
ProPro: 2.865 ± 1.464
ProGln: 2.865 ± 0.002
ProArg: 2.149 ± 0.367
ProSer: 2.149 ± 1.095
ProThr: 3.582 ± 1.825
ProVal: 5.014 ± 0.369
ProTrp: 1.433 ± 2.194
ProTyr: 0.716 ± 0.365
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.149 ± 1.829
GlnCys: 0.0 ± 0.0
GlnAsp: 2.149 ± 1.829
GlnGlu: 2.149 ± 1.095
GlnPhe: 2.865 ± 1.46
GlnGly: 2.865 ± 1.464
GlnHis: 2.149 ± 1.095
GlnIle: 1.433 ± 0.732
GlnLys: 0.716 ± 1.097
GlnLeu: 2.149 ± 1.095
GlnMet: 0.0 ± 0.0
GlnAsn: 2.149 ± 1.095
GlnPro: 2.865 ± 1.46
GlnGln: 2.865 ± 1.46
GlnArg: 2.865 ± 0.002
GlnSer: 2.149 ± 1.095
GlnThr: 7.163 ± 0.736
GlnVal: 2.865 ± 1.464
GlnTrp: 0.0 ± 0.0
GlnTyr: 4.298 ± 2.196
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.865 ± 1.46
ArgCys: 2.149 ± 0.367
ArgAsp: 1.433 ± 0.73
ArgGlu: 3.582 ± 0.363
ArgPhe: 4.298 ± 0.734
ArgGly: 2.865 ± 1.46
ArgHis: 2.865 ± 0.002
ArgIle: 4.298 ± 0.728
ArgLys: 0.716 ± 0.365
ArgLeu: 2.149 ± 1.095
ArgMet: 1.433 ± 0.73
ArgAsn: 0.716 ± 0.365
ArgPro: 1.433 ± 0.732
ArgGln: 0.716 ± 0.365
ArgArg: 1.433 ± 0.73
ArgSer: 5.731 ± 1.458
ArgThr: 3.582 ± 1.825
ArgVal: 2.149 ± 0.367
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.582 ± 0.363
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.298 ± 3.658
SerCys: 1.433 ± 0.732
SerAsp: 2.865 ± 1.464
SerGlu: 3.582 ± 1.825
SerPhe: 0.716 ± 1.097
SerGly: 5.731 ± 1.458
SerHis: 2.149 ± 0.367
SerIle: 4.298 ± 2.196
SerLys: 6.447 ± 0.361
SerLeu: 7.163 ± 0.736
SerMet: 0.716 ± 0.365
SerAsn: 2.149 ± 0.367
SerPro: 2.149 ± 0.367
SerGln: 2.865 ± 1.46
SerArg: 2.865 ± 1.464
SerSer: 5.014 ± 0.369
SerThr: 6.447 ± 0.361
SerVal: 2.149 ± 0.367
SerTrp: 0.0 ± 0.0
SerTyr: 4.298 ± 2.196
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.298 ± 0.734
ThrCys: 0.0 ± 0.0
ThrAsp: 4.298 ± 2.19
ThrGlu: 0.716 ± 0.365
ThrPhe: 0.716 ± 0.365
ThrGly: 5.014 ± 3.293
ThrHis: 0.0 ± 0.0
ThrIle: 5.014 ± 3.293
ThrLys: 7.163 ± 3.65
ThrLeu: 3.582 ± 2.561
ThrMet: 0.0 ± 0.0
ThrAsn: 2.865 ± 1.46
ThrPro: 5.731 ± 1.466
ThrGln: 5.014 ± 0.369
ThrArg: 5.014 ± 2.555
ThrSer: 9.312 ± 5.489
ThrThr: 9.312 ± 4.027
ThrVal: 6.447 ± 1.101
ThrTrp: 0.716 ± 1.097
ThrTyr: 5.014 ± 1.093
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.865 ± 1.464
ValCys: 0.716 ± 1.097
ValAsp: 4.298 ± 2.19
ValGlu: 4.298 ± 0.728
ValPhe: 1.433 ± 0.732
ValGly: 1.433 ± 2.194
ValHis: 0.0 ± 0.0
ValIle: 2.865 ± 2.926
ValLys: 3.582 ± 0.363
ValLeu: 3.582 ± 1.099
ValMet: 1.433 ± 0.73
ValAsn: 4.298 ± 0.734
ValPro: 2.149 ± 0.367
ValGln: 5.014 ± 0.369
ValArg: 2.865 ± 0.002
ValSer: 3.582 ± 2.561
ValThr: 5.731 ± 1.466
ValVal: 2.865 ± 1.464
ValTrp: 0.0 ± 0.0
ValTyr: 0.716 ± 0.365
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.716 ± 0.365
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.433 ± 0.73
TrpLys: 0.716 ± 0.365
TrpLeu: 0.716 ± 1.097
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.716 ± 1.097
TrpArg: 0.716 ± 0.365
TrpSer: 0.716 ± 0.365
TrpThr: 0.716 ± 1.097
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.298 ± 2.196
TyrCys: 2.149 ± 1.095
TyrAsp: 3.582 ± 2.561
TyrGlu: 3.582 ± 1.099
TyrPhe: 2.865 ± 1.464
TyrGly: 2.865 ± 1.46
TyrHis: 0.716 ± 0.365
TyrIle: 2.865 ± 0.002
TyrLys: 4.298 ± 2.19
TyrLeu: 3.582 ± 1.825
TyrMet: 0.716 ± 0.365
TyrAsn: 2.149 ± 0.367
TyrPro: 0.0 ± 0.0
TyrGln: 4.298 ± 2.19
TyrArg: 2.149 ± 1.095
TyrSer: 4.298 ± 0.734
TyrThr: 2.865 ± 0.002
TyrVal: 3.582 ± 2.561
TyrTrp: 0.716 ± 0.365
TyrTyr: 2.865 ± 1.464
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1397 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
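A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure actually used by the database: whole proteins are resampled with replacement 100 times, the per mille frequency of one dipeptide is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the choice of the population standard deviation are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences, not taken from the proteome on this page.
error = bootstrap_error(["MKTAYIAKQR", "MAAAK"], "AA")
```

With only two proteins, as in this proteome, each resample can contain just one of a handful of protein combinations, so the estimated errors take a limited set of values.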

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski