Amino acid dipeptide frequency for Wenling tombus-like virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25%. As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions. On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.
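The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build this page: the function names, the toy sequences, and the choice to count overlapping pairs within each protein separately are all hypothetical, and the exact threshold the page uses for red and blue marking is not stated, so the sketch simply compares against the uniform expectation.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids (one-letter codes)
ALPHABET = STANDARD + "X"           # X (Xaa) stands for any non-standard or unknown residue


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Map every non-standard letter to X before counting.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins (an assumption).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def representation(value_permille):
    """Label a frequency relative to the uniform expectation."""
    if value_permille > EXPECTED_PERMILLE:
        return "over"
    if value_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


toy = ["MAAGL", "AAB"]   # hypothetical sequences; B is treated as Xaa
freqs = dipeptide_permille(toy)
print(round(freqs["AA"], 3), representation(freqs["AA"]))
```

Applied to the table values, representation(3.906) gives "over" and representation(0.558) gives "under", which mirrors the red and blue marking described above.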

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error, in ‰ (rows: first amino acid; columns: second amino acid)

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.906 ± 0.196 | 2.79 ± 0.933 | 5.58 ± 1.219 | 2.232 ± 0.817 | 0.558 ± 0.365 | 3.906 ± 0.644 | 2.79 ± 0.459 | 2.79 ± 0.488 | 5.022 ± 1.29 | 7.254 ± 1.368 | 1.674 ± 0.576 | 3.348 ± 0.44 | 4.464 ± 2.141 | 2.232 ± 1.547 | 5.022 ± 1.147 | 5.022 ± 1.927 | 5.58 ± 1.217 | 5.58 ± 1.657 | 1.116 ± 0.729 | 2.79 ± 0.936 | 0.0 ± 0.0
Cys | 0.558 ± 0.365 | 0.558 ± 0.514 | 2.232 ± 0.817 | 0.558 ± 0.365 | 1.116 ± 0.408 | 1.116 ± 0.645 | 0.558 ± 0.514 | 1.116 ± 0.416 | 1.116 ± 0.408 | 2.79 ± 2.571 | 0.558 ± 0.514 | 0.0 ± 0.0 | 2.232 ± 0.817 | 0.558 ± 0.528 | 2.232 ± 0.877 | 2.232 ± 1.459 | 2.79 ± 0.449 | 1.116 ± 1.028 | 0.0 ± 0.0 | 0.558 ± 0.365 | 0.0 ± 0.0
Asp | 6.138 ± 1.485 | 1.674 ± 0.579 | 3.348 ± 0.44 | 2.79 ± 1.212 | 3.348 ± 0.419 | 3.348 ± 1.779 | 1.116 ± 0.729 | 3.348 ± 0.971 | 2.232 ± 0.831 | 5.58 ± 2.581 | 1.674 ± 1.061 | 1.674 ± 0.576 | 7.812 ± 0.392 | 0.558 ± 0.514 | 3.906 ± 1.915 | 5.58 ± 0.919 | 5.022 ± 1.2 | 5.022 ± 0.534 | 1.116 ± 0.416 | 0.0 ± 0.0 | 0.0 ± 0.0
Glu | 3.348 ± 0.419 | 1.674 ± 0.579 | 0.558 ± 0.514 | 0.0 ± 0.0 | 0.558 ± 0.528 | 3.906 ± 0.636 | 0.558 ± 0.365 | 1.674 ± 0.28 | 2.232 ± 0.085 | 1.674 ± 0.878 | 0.558 ± 0.365 | 1.116 ± 0.416 | 0.558 ± 0.514 | 5.58 ± 0.445 | 2.232 ± 0.741 | 1.116 ± 0.408 | 2.79 ± 1.855 | 2.232 ± 0.87 | 2.232 ± 0.817 | 1.674 ± 0.579 | 0.0 ± 0.0
Phe | 2.79 ± 0.459 | 0.558 ± 0.514 | 2.232 ± 1.289 | 1.116 ± 0.408 | 1.116 ± 0.416 | 0.0 ± 0.0 | 2.79 ± 0.459 | 0.558 ± 0.365 | 1.116 ± 0.645 | 3.906 ± 1.631 | 0.0 ± 0.0 | 2.232 ± 1.388 | 1.674 ± 0.579 | 2.232 ± 0.877 | 2.79 ± 0.488 | 3.906 ± 0.636 | 2.232 ± 0.817 | 1.674 ± 0.854 | 0.558 ± 0.528 | 0.0 ± 0.0 | 0.0 ± 0.0
Gly | 4.464 ± 1.499 | 0.558 ± 0.528 | 10.045 ± 1.319 | 2.79 ± 1.204 | 2.79 ± 0.459 | 6.138 ± 2.069 | 0.558 ± 0.365 | 2.79 ± 1.674 | 5.58 ± 1.635 | 7.254 ± 1.014 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.79 ± 0.925 | 1.674 ± 0.28 | 3.348 ± 0.419 | 9.487 ± 3.215 | 5.022 ± 1.2 | 5.58 ± 1.229 | 0.0 ± 0.0 | 3.348 ± 0.44 | 0.0 ± 0.0
His | 1.674 ± 1.094 | 0.0 ± 0.0 | 1.116 ± 0.729 | 1.116 ± 0.408 | 0.0 ± 0.0 | 2.79 ± 1.212 | 0.0 ± 0.0 | 2.232 ± 0.831 | 0.558 ± 0.365 | 2.232 ± 0.741 | 0.0 ± 0.0 | 0.558 ± 0.365 | 1.674 ± 0.854 | 0.0 ± 0.0 | 2.79 ± 1.212 | 0.558 ± 0.514 | 3.348 ± 1.708 | 1.116 ± 0.416 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Ile | 3.906 ± 1.569 | 2.232 ± 0.877 | 2.232 ± 0.87 | 1.116 ± 0.645 | 1.116 ± 1.028 | 3.348 ± 1.551 | 0.558 ± 0.365 | 2.79 ± 1.212 | 2.232 ± 0.085 | 3.906 ± 2.553 | 1.674 ± 1.543 | 2.232 ± 1.508 | 1.116 ± 0.645 | 1.116 ± 0.416 | 2.79 ± 0.488 | 5.022 ± 1.286 | 2.79 ± 2.642 | 0.558 ± 0.365 | 1.116 ± 1.028 | 1.116 ± 0.416 | 0.0 ± 0.0
Lys | 1.674 ± 0.579 | 0.0 ± 0.0 | 3.348 ± 0.44 | 2.79 ± 0.936 | 1.674 ± 1.04 | 5.022 ± 0.46 | 0.558 ± 0.365 | 1.116 ± 0.729 | 2.79 ± 0.936 | 4.464 ± 0.949 | 0.0 ± 0.0 | 2.232 ± 0.741 | 2.79 ± 0.449 | 1.116 ± 0.729 | 3.906 ± 0.196 | 3.906 ± 1.179 | 1.674 ± 0.576 | 4.464 ± 1.205 | 1.674 ± 0.854 | 0.558 ± 0.365 | 0.0 ± 0.0
Leu | 8.929 ± 1.889 | 2.79 ± 0.933 | 5.022 ± 1.808 | 3.348 ± 1.708 | 0.558 ± 0.514 | 4.464 ± 1.498 | 2.232 ± 0.817 | 4.464 ± 0.95 | 3.348 ± 0.419 | 8.929 ± 1.099 | 2.79 ± 0.449 | 3.906 ± 0.862 | 7.812 ± 4.443 | 5.022 ± 1.2 | 4.464 ± 1.543 | 7.254 ± 1.2 | 6.138 ± 0.854 | 5.022 ± 1.567 | 0.558 ± 0.528 | 3.906 ± 0.862 | 0.0 ± 0.0
Met | 2.232 ± 0.817 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.674 ± 0.28 | 0.558 ± 0.365 | 1.674 ± 0.28 | 0.0 ± 0.0 | 1.116 ± 0.416 | 0.0 ± 0.0 | 2.232 ± 0.831 | 0.0 ± 0.0 | 1.674 ± 1.061 | 1.116 ± 0.408 | 0.558 ± 0.365 | 1.116 ± 0.408 | 1.116 ± 0.416 | 3.348 ± 1.3 | 1.116 ± 0.416 | 1.674 ± 0.576 | 0.558 ± 0.514 | 0.0 ± 0.0
Asn | 1.116 ± 0.416 | 0.558 ± 0.365 | 0.0 ± 0.0 | 0.558 ± 0.365 | 1.116 ± 0.416 | 5.58 ± 0.976 | 0.558 ± 0.365 | 1.116 ± 0.408 | 1.674 ± 0.878 | 2.79 ± 1.204 | 0.558 ± 0.514 | 1.116 ± 0.416 | 1.116 ± 0.645 | 1.116 ± 0.645 | 2.79 ± 0.449 | 2.79 ± 0.936 | 3.348 ± 0.56 | 1.674 ± 0.878 | 0.558 ± 0.514 | 1.116 ± 1.057 | 0.0 ± 0.0
Pro | 3.906 ± 1.905 | 1.116 ± 1.028 | 5.58 ± 1.219 | 1.674 ± 0.854 | 3.348 ± 0.419 | 2.79 ± 0.925 | 1.674 ± 0.854 | 3.348 ± 0.419 | 2.232 ± 0.831 | 3.906 ± 1.485 | 0.558 ± 0.528 | 3.348 ± 0.44 | 4.464 ± 1.543 | 2.232 ± 0.831 | 4.464 ± 1.307 | 5.58 ± 0.976 | 4.464 ± 0.851 | 3.348 ± 1.551 | 0.558 ± 0.365 | 1.116 ± 0.645 | 0.0 ± 0.0
Gln | 2.232 ± 0.831 | 0.0 ± 0.0 | 3.906 ± 0.958 | 0.558 ± 0.528 | 1.116 ± 0.408 | 3.348 ± 0.44 | 1.116 ± 0.416 | 0.0 ± 0.0 | 1.674 ± 0.878 | 5.022 ± 1.147 | 2.232 ± 0.831 | 0.0 ± 0.0 | 1.674 ± 1.094 | 2.232 ± 1.547 | 1.674 ± 1.061 | 3.348 ± 1.247 | 4.464 ± 1.205 | 3.906 ± 0.823 | 2.232 ± 0.085 | 1.674 ± 1.094 | 0.0 ± 0.0
Arg | 5.58 ± 2.557 | 0.558 ± 0.514 | 3.348 ± 0.814 | 2.79 ± 1.215 | 5.58 ± 1.657 | 5.58 ± 1.635 | 1.674 ± 0.854 | 0.558 ± 0.528 | 3.906 ± 0.862 | 4.464 ± 0.822 | 0.558 ± 0.365 | 3.348 ± 1.56 | 1.674 ± 0.878 | 3.906 ± 0.862 | 5.58 ± 1.214 | 5.58 ± 1.219 | 1.674 ± 0.576 | 4.464 ± 0.949 | 0.0 ± 0.0 | 2.79 ± 0.936 | 0.0 ± 0.0
Ser | 8.371 ± 2.127 | 2.79 ± 1.215 | 2.232 ± 1.289 | 0.558 ± 0.365 | 2.79 ± 0.449 | 7.812 ± 0.527 | 0.558 ± 0.365 | 2.232 ± 1.349 | 3.348 ± 0.971 | 6.138 ± 1.996 | 3.348 ± 1.233 | 1.674 ± 0.576 | 6.696 ± 0.837 | 6.138 ± 0.878 | 5.58 ± 0.476 | 5.58 ± 2.993 | 2.79 ± 0.449 | 6.138 ± 0.113 | 1.674 ± 1.094 | 4.464 ± 0.822 | 0.0 ± 0.0
Thr | 3.906 ± 0.636 | 2.232 ± 0.716 | 5.58 ± 3.223 | 1.116 ± 0.729 | 1.116 ± 1.057 | 5.022 ± 1.993 | 1.674 ± 1.094 | 5.022 ± 2.069 | 1.116 ± 0.408 | 7.812 ± 0.944 | 2.232 ± 1.508 | 0.558 ± 0.365 | 4.464 ± 1.481 | 3.906 ± 0.196 | 3.906 ± 1.541 | 4.464 ± 1.633 | 6.696 ± 1.12 | 7.254 ± 2.859 | 1.674 ± 0.28 | 2.79 ± 0.488 | 0.0 ± 0.0
Val | 6.696 ± 0.255 | 2.79 ± 0.449 | 7.254 ± 1.262 | 5.022 ± 1.927 | 1.674 ± 0.854 | 6.138 ± 1.99 | 2.232 ± 0.831 | 4.464 ± 1.923 | 3.348 ± 1.225 | 4.464 ± 1.498 | 1.116 ± 0.729 | 0.558 ± 0.528 | 4.464 ± 0.678 | 0.558 ± 0.365 | 1.674 ± 0.878 | 7.254 ± 3.196 | 5.022 ± 1.993 | 2.79 ± 1.268 | 0.0 ± 0.0 | 2.232 ± 0.741 | 0.0 ± 0.0
Trp | 1.674 ± 1.04 | 0.558 ± 0.365 | 1.674 ± 0.854 | 2.232 ± 0.831 | 1.116 ± 0.729 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.558 ± 0.365 | 0.558 ± 0.365 | 2.232 ± 0.877 | 0.558 ± 0.365 | 1.116 ± 0.645 | 0.0 ± 0.0 | 0.558 ± 0.365 | 0.558 ± 0.514 | 0.558 ± 0.514 | 1.116 ± 0.416 | 2.232 ± 0.716 | 0.558 ± 0.514 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 0.558 ± 0.365 | 0.558 ± 0.514 | 0.0 ± 0.0 | 2.232 ± 0.831 | 2.232 ± 0.085 | 2.232 ± 0.831 | 0.558 ± 0.365 | 1.674 ± 0.579 | 1.674 ± 0.579 | 3.906 ± 0.196 | 1.116 ± 0.208 | 0.558 ± 0.365 | 1.116 ± 0.645 | 1.116 ± 0.416 | 2.79 ± 0.936 | 0.558 ± 0.528 | 2.232 ± 0.085 | 4.464 ± 1.662 | 0.558 ± 0.514 | 1.674 ± 0.576 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 3 proteins (1793 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
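A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the procedure actually used for this page: it assumes that whole proteins are resampled with replacement, that the statistic is recomputed for each replicate, and that the reported error is the standard deviation across replicates. The function names, the seed parameter, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over all adjacent pairs in the proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the dipeptide frequency across bootstrap replicates."""
    rng = random.Random(seed)   # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


toy = ["AAAA", "CCCC", "ACAC"]   # hypothetical sequences
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only 3 proteins, as in this proteome, each replicate draws from very few distinct units, which is one plausible reason why many of the errors in the table are large relative to the frequencies themselves.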

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski