Amino acid dipeptide frequency for Wenzhou tombus-like virus 11

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
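The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_permille`, the use of overlapping pairs counted within each protein, and the 2.5‰ uniform threshold for the red/blue marking are choices made for this example and are not confirmed to match the exact procedure used for the table.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"           # X (Xaa) collects non-standard/unknown residues
UNIFORM_PERMILLE = 1000.0 / 400     # 2.5 per mille, i.e. 0.25 %

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Assumption: overlapping pairs are counted inside each protein,
    never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

def mark(value_permille, expected=UNIFORM_PERMILLE):
    """Hypothetical colouring rule: above expectation red, below blue."""
    if value_permille > expected:
        return "red"
    if value_permille < expected:
        return "blue"
    return "none"

if __name__ == "__main__":
    # Toy sequences, not the proteome described on this page.
    freqs = dipeptide_permille(["AAC", "ACB"])
    print(freqs["AC"], mark(freqs["AC"]))
    print(freqs["AC"] * 1e-3)  # per mille converted to a fraction
```

Multiplying a per mille value by 10^-3, as in the last line, gives the plain fraction of all counted dipeptides.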

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.868AlaAla: 7.868 ± 1.489
1.431AlaCys: 1.431 ± 1.001
2.146AlaAsp: 2.146 ± 0.801
3.577AlaGlu: 3.577 ± 1.06
2.146AlaPhe: 2.146 ± 0.104
7.868AlaGly: 7.868 ± 2.715
0.715AlaHis: 0.715 ± 0.537
4.292AlaIle: 4.292 ± 2.011
5.007AlaLys: 5.007 ± 0.583
6.438AlaLeu: 6.438 ± 2.182
0.0AlaMet: 0.0 ± 0.0
2.861AlaAsn: 2.861 ± 1.381
6.438AlaPro: 6.438 ± 2.767
3.577AlaGln: 3.577 ± 0.928
5.722AlaArg: 5.722 ± 0.861
5.007AlaSer: 5.007 ± 2.844
7.868AlaThr: 7.868 ± 2.191
1.431AlaVal: 1.431 ± 0.981
2.146AlaTrp: 2.146 ± 1.006
0.715AlaTyr: 0.715 ± 0.491
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.715CysAsp: 0.715 ± 0.491
1.431CysGlu: 1.431 ± 1.074
0.715CysPhe: 0.715 ± 0.537
0.715CysGly: 0.715 ± 0.501
1.431CysHis: 1.431 ± 0.581
0.715CysIle: 0.715 ± 0.537
0.0CysLys: 0.0 ± 0.0
1.431CysLeu: 1.431 ± 0.543
0.0CysMet: 0.0 ± 0.0
0.715CysAsn: 0.715 ± 0.537
0.0CysPro: 0.0 ± 0.0
0.715CysGln: 0.715 ± 0.537
2.146CysArg: 2.146 ± 0.801
0.715CysSer: 0.715 ± 0.491
0.715CysThr: 0.715 ± 0.501
2.146CysVal: 2.146 ± 1.611
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
2.861AspAla: 2.861 ± 1.372
1.431AspCys: 1.431 ± 1.074
3.577AspAsp: 3.577 ± 1.083
0.715AspGlu: 0.715 ± 0.501
2.861AspPhe: 2.861 ± 0.551
3.577AspGly: 3.577 ± 1.083
1.431AspHis: 1.431 ± 0.581
2.861AspIle: 2.861 ± 1.162
4.292AspLys: 4.292 ± 0.208
4.292AspLeu: 4.292 ± 1.428
1.431AspMet: 1.431 ± 0.581
0.0AspAsn: 0.0 ± 0.0
2.861AspPro: 2.861 ± 0.431
2.861AspGln: 2.861 ± 1.372
2.146AspArg: 2.146 ± 0.104
4.292AspSer: 4.292 ± 0.626
2.146AspThr: 2.146 ± 0.104
2.146AspVal: 2.146 ± 0.104
0.715AspTrp: 0.715 ± 0.491
0.715AspTyr: 0.715 ± 0.537
0.0AspXaa: 0.0 ± 0.0
Glu
2.861GluAla: 2.861 ± 0.431
1.431GluCys: 1.431 ± 0.398
2.146GluAsp: 2.146 ± 1.006
5.722GluGlu: 5.722 ± 1.088
2.861GluPhe: 2.861 ± 1.086
1.431GluGly: 1.431 ± 0.543
2.861GluHis: 2.861 ± 1.504
0.715GluIle: 0.715 ± 0.537
2.861GluLys: 2.861 ± 0.795
7.153GluLeu: 7.153 ± 1.273
2.146GluMet: 2.146 ± 0.727
2.146GluAsn: 2.146 ± 0.104
2.146GluPro: 2.146 ± 0.104
1.431GluGln: 1.431 ± 0.398
2.146GluArg: 2.146 ± 0.932
5.007GluSer: 5.007 ± 0.583
0.0GluThr: 0.0 ± 0.0
1.431GluVal: 1.431 ± 0.398
0.715GluTrp: 0.715 ± 0.537
1.431GluTyr: 1.431 ± 0.581
0.0GluXaa: 0.0 ± 0.0
Phe
4.292PheAla: 4.292 ± 0.208
0.715PheCys: 0.715 ± 0.537
2.861PheAsp: 2.861 ± 1.305
0.715PheGlu: 0.715 ± 0.501
1.431PhePhe: 1.431 ± 0.981
2.861PheGly: 2.861 ± 1.305
0.715PheHis: 0.715 ± 0.501
1.431PheIle: 1.431 ± 0.543
3.577PheLys: 3.577 ± 1.837
2.146PheLeu: 2.146 ± 1.006
0.715PheMet: 0.715 ± 0.501
1.431PheAsn: 1.431 ± 0.398
2.146PhePro: 2.146 ± 0.922
0.715PheGln: 0.715 ± 0.491
1.431PheArg: 1.431 ± 0.543
2.146PheSer: 2.146 ± 0.932
2.861PheThr: 2.861 ± 1.086
2.861PheVal: 2.861 ± 0.587
0.0PheTrp: 0.0 ± 0.0
2.146PheTyr: 2.146 ± 0.932
0.0PheXaa: 0.0 ± 0.0
Gly
7.868GlyAla: 7.868 ± 2.715
0.0GlyCys: 0.0 ± 0.0
5.007GlyAsp: 5.007 ± 0.583
3.577GlyGlu: 3.577 ± 0.311
0.0GlyPhe: 0.0 ± 0.0
3.577GlyGly: 3.577 ± 0.668
0.715GlyHis: 0.715 ± 0.501
2.146GlyIle: 2.146 ± 0.104
3.577GlyLys: 3.577 ± 0.668
7.153GlyLeu: 7.153 ± 1.336
0.715GlyMet: 0.715 ± 0.537
3.577GlyAsn: 3.577 ± 0.568
5.722GlyPro: 5.722 ± 1.807
4.292GlyGln: 4.292 ± 1.193
9.299GlyArg: 9.299 ± 1.265
10.014GlySer: 10.014 ± 2.18
7.153GlyThr: 7.153 ± 1.587
2.861GlyVal: 2.861 ± 0.431
0.715GlyTrp: 0.715 ± 0.501
0.715GlyTyr: 0.715 ± 0.537
0.0GlyXaa: 0.0 ± 0.0
His
1.431HisAla: 1.431 ± 0.398
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.0HisGlu: 0.0 ± 0.0
0.715HisPhe: 0.715 ± 0.491
1.431HisGly: 1.431 ± 0.398
0.0HisHis: 0.0 ± 0.0
1.431HisIle: 1.431 ± 1.074
0.0HisLys: 0.0 ± 0.0
4.292HisLeu: 4.292 ± 0.626
0.0HisMet: 0.0 ± 0.0
0.715HisAsn: 0.715 ± 0.537
2.146HisPro: 2.146 ± 0.104
1.431HisGln: 1.431 ± 0.398
2.861HisArg: 2.861 ± 0.795
2.146HisSer: 2.146 ± 0.801
0.0HisThr: 0.0 ± 0.0
0.715HisVal: 0.715 ± 0.537
0.715HisTrp: 0.715 ± 0.537
0.715HisTyr: 0.715 ± 0.491
0.0HisXaa: 0.0 ± 0.0
Ile
2.146IleAla: 2.146 ± 1.006
0.0IleCys: 0.0 ± 0.0
4.292IleAsp: 4.292 ± 1.06
5.007IleGlu: 5.007 ± 3.759
1.431IlePhe: 1.431 ± 0.581
2.146IleGly: 2.146 ± 1.006
0.715IleHis: 0.715 ± 0.501
0.0IleIle: 0.0 ± 0.0
3.577IleLys: 3.577 ± 1.083
3.577IleLeu: 3.577 ± 1.411
0.0IleMet: 0.0 ± 0.0
3.577IleAsn: 3.577 ± 0.311
0.715IlePro: 0.715 ± 0.537
2.146IleGln: 2.146 ± 0.727
0.715IleArg: 0.715 ± 0.537
2.861IleSer: 2.861 ± 1.372
0.715IleThr: 0.715 ± 0.537
2.146IleVal: 2.146 ± 0.906
0.0IleTrp: 0.0 ± 0.0
0.0IleTyr: 0.0 ± 0.0
0.0IleXaa: 0.0 ± 0.0
Lys
5.722LysAla: 5.722 ± 1.459
0.0LysCys: 0.0 ± 0.0
5.007LysAsp: 5.007 ± 0.583
2.146LysGlu: 2.146 ± 0.727
3.577LysPhe: 3.577 ± 1.474
5.722LysGly: 5.722 ± 1.59
0.715LysHis: 0.715 ± 0.537
0.715LysIle: 0.715 ± 0.537
5.007LysLys: 5.007 ± 0.376
5.007LysLeu: 5.007 ± 1.475
4.292LysMet: 4.292 ± 1.67
2.146LysAsn: 2.146 ± 1.611
4.292LysPro: 4.292 ± 0.208
6.438LysGln: 6.438 ± 1.042
1.431LysArg: 1.431 ± 0.581
1.431LysSer: 1.431 ± 0.543
2.146LysThr: 2.146 ± 0.104
0.715LysVal: 0.715 ± 0.501
1.431LysTrp: 1.431 ± 0.581
2.146LysTyr: 2.146 ± 0.104
0.0LysXaa: 0.0 ± 0.0
Leu
9.299LeuAla: 9.299 ± 1.898
0.715LeuCys: 0.715 ± 0.491
3.577LeuAsp: 3.577 ± 1.06
7.868LeuGlu: 7.868 ± 0.853
1.431LeuPhe: 1.431 ± 0.581
5.007LeuGly: 5.007 ± 1.542
0.715LeuHis: 0.715 ± 0.537
2.861LeuIle: 2.861 ± 0.431
3.577LeuLys: 3.577 ± 1.146
5.722LeuLeu: 5.722 ± 1.774
5.007LeuMet: 5.007 ± 0.516
1.431LeuAsn: 1.431 ± 1.001
5.722LeuPro: 5.722 ± 0.24
4.292LeuGln: 4.292 ± 1.743
2.861LeuArg: 2.861 ± 1.504
7.153LeuSer: 7.153 ± 0.786
3.577LeuThr: 3.577 ± 0.668
7.153LeuVal: 7.153 ± 1.51
0.0LeuTrp: 0.0 ± 0.0
3.577LeuTyr: 3.577 ± 0.568
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.715MetCys: 0.715 ± 0.537
0.0MetAsp: 0.0 ± 0.0
0.715MetGlu: 0.715 ± 0.537
3.577MetPhe: 3.577 ± 0.668
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.715MetIle: 0.715 ± 0.537
3.577MetLys: 3.577 ± 1.146
0.715MetLeu: 0.715 ± 0.537
0.715MetMet: 0.715 ± 0.537
0.715MetAsn: 0.715 ± 0.491
0.0MetPro: 0.0 ± 0.0
0.715MetGln: 0.715 ± 0.491
2.861MetArg: 2.861 ± 0.587
4.292MetSer: 4.292 ± 0.765
2.861MetThr: 2.861 ± 1.372
3.577MetVal: 3.577 ± 2.022
0.0MetTrp: 0.0 ± 0.0
1.431MetTyr: 1.431 ± 1.001
0.0MetXaa: 0.0 ± 0.0
Asn
3.577AsnAla: 3.577 ± 1.862
0.0AsnCys: 0.0 ± 0.0
0.715AsnAsp: 0.715 ± 0.501
1.431AsnGlu: 1.431 ± 0.581
0.0AsnPhe: 0.0 ± 0.0
2.146AsnGly: 2.146 ± 1.006
0.715AsnHis: 0.715 ± 0.501
0.715AsnIle: 0.715 ± 0.501
3.577AsnLys: 3.577 ± 0.311
2.146AsnLeu: 2.146 ± 0.104
0.715AsnMet: 0.715 ± 0.537
0.0AsnAsn: 0.0 ± 0.0
5.722AsnPro: 5.722 ± 1.807
2.861AsnGln: 2.861 ± 1.184
1.431AsnArg: 1.431 ± 1.074
4.292AsnSer: 4.292 ± 0.896
2.861AsnThr: 2.861 ± 1.162
2.861AsnVal: 2.861 ± 1.162
0.715AsnTrp: 0.715 ± 0.491
1.431AsnTyr: 1.431 ± 1.074
0.0AsnXaa: 0.0 ± 0.0
Pro
5.007ProAla: 5.007 ± 1.928
0.0ProCys: 0.0 ± 0.0
0.715ProAsp: 0.715 ± 0.501
5.722ProGlu: 5.722 ± 0.861
2.146ProPhe: 2.146 ± 0.922
10.014ProGly: 10.014 ± 1.419
1.431ProHis: 1.431 ± 0.543
3.577ProIle: 3.577 ± 2.022
2.146ProLys: 2.146 ± 0.906
3.577ProLeu: 3.577 ± 0.668
1.431ProMet: 1.431 ± 0.581
3.577ProAsn: 3.577 ± 2.504
1.431ProPro: 1.431 ± 1.001
2.861ProGln: 2.861 ± 1.086
7.153ProArg: 7.153 ± 3.43
5.007ProSer: 5.007 ± 1.335
4.292ProThr: 4.292 ± 3.004
3.577ProVal: 3.577 ± 0.568
1.431ProTrp: 1.431 ± 1.074
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
5.007GlnAla: 5.007 ± 1.954
2.146GlnCys: 2.146 ± 1.611
2.146GlnAsp: 2.146 ± 0.906
1.431GlnGlu: 1.431 ± 0.543
0.715GlnPhe: 0.715 ± 0.501
3.577GlnGly: 3.577 ± 1.078
2.146GlnHis: 2.146 ± 1.502
1.431GlnIle: 1.431 ± 0.398
0.715GlnLys: 0.715 ± 0.491
5.722GlnLeu: 5.722 ± 1.815
0.715GlnMet: 0.715 ± 0.876
0.715GlnAsn: 0.715 ± 0.491
5.007GlnPro: 5.007 ± 1.09
2.146GlnGln: 2.146 ± 0.922
7.153GlnArg: 7.153 ± 0.345
1.431GlnSer: 1.431 ± 0.543
3.577GlnThr: 3.577 ± 0.928
1.431GlnVal: 1.431 ± 0.581
0.715GlnTrp: 0.715 ± 0.501
2.146GlnTyr: 2.146 ± 1.611
0.0GlnXaa: 0.0 ± 0.0
Arg
7.868ArgAla: 7.868 ± 2.356
2.146ArgCys: 2.146 ± 0.104
1.431ArgAsp: 1.431 ± 0.398
0.715ArgGlu: 0.715 ± 0.537
4.292ArgPhe: 4.292 ± 0.208
9.299ArgGly: 9.299 ± 3.356
1.431ArgHis: 1.431 ± 1.074
2.861ArgIle: 2.861 ± 1.381
3.577ArgLys: 3.577 ± 0.928
6.438ArgLeu: 6.438 ± 1.828
4.292ArgMet: 4.292 ± 1.06
5.007ArgAsn: 5.007 ± 1.542
4.292ArgPro: 4.292 ± 1.455
4.292ArgGln: 4.292 ± 1.193
7.153ArgArg: 7.153 ± 0.623
3.577ArgSer: 3.577 ± 1.821
5.007ArgThr: 5.007 ± 1.928
3.577ArgVal: 3.577 ± 0.928
0.0ArgTrp: 0.0 ± 0.0
3.577ArgTyr: 3.577 ± 1.828
0.0ArgXaa: 0.0 ± 0.0
Ser
1.431SerAla: 1.431 ± 0.581
1.431SerCys: 1.431 ± 1.074
4.292SerAsp: 4.292 ± 1.035
2.861SerGlu: 2.861 ± 1.162
0.715SerPhe: 0.715 ± 0.501
9.299SerGly: 9.299 ± 2.218
0.715SerHis: 0.715 ± 0.501
2.146SerIle: 2.146 ± 0.932
4.292SerLys: 4.292 ± 0.208
6.438SerLeu: 6.438 ± 0.526
0.0SerMet: 0.0 ± 0.0
2.861SerAsn: 2.861 ± 1.184
7.153SerPro: 7.153 ± 2.716
2.861SerGln: 2.861 ± 1.086
8.584SerArg: 8.584 ± 2.856
5.007SerSer: 5.007 ± 1.928
9.299SerThr: 9.299 ± 3.545
2.861SerVal: 2.861 ± 0.795
1.431SerTrp: 1.431 ± 0.398
2.146SerTyr: 2.146 ± 0.932
0.0SerXaa: 0.0 ± 0.0
Thr
3.577ThrAla: 3.577 ± 1.862
0.715ThrCys: 0.715 ± 0.491
4.292ThrAsp: 4.292 ± 1.568
1.431ThrGlu: 1.431 ± 0.581
3.577ThrPhe: 3.577 ± 1.078
5.722ThrGly: 5.722 ± 1.572
2.146ThrHis: 2.146 ± 1.611
2.146ThrIle: 2.146 ± 0.104
5.007ThrLys: 5.007 ± 1.248
2.861ThrLeu: 2.861 ± 0.431
1.431ThrMet: 1.431 ± 0.581
2.146ThrAsn: 2.146 ± 1.472
8.584ThrPro: 8.584 ± 2.856
2.861ThrGln: 2.861 ± 1.086
5.722ThrArg: 5.722 ± 1.59
7.153ThrSer: 7.153 ± 2.857
3.577ThrThr: 3.577 ± 0.668
5.007ThrVal: 5.007 ± 2.295
0.0ThrTrp: 0.0 ± 0.0
1.431ThrTyr: 1.431 ± 1.074
0.0ThrXaa: 0.0 ± 0.0
Val
3.577ValAla: 3.577 ± 1.06
0.715ValCys: 0.715 ± 0.537
2.861ValAsp: 2.861 ± 1.352
2.146ValGlu: 2.146 ± 0.906
4.292ValPhe: 4.292 ± 0.626
3.577ValGly: 3.577 ± 1.666
1.431ValHis: 1.431 ± 0.398
2.146ValIle: 2.146 ± 1.006
4.292ValLys: 4.292 ± 1.06
4.292ValLeu: 4.292 ± 1.743
2.146ValMet: 2.146 ± 0.932
2.146ValAsn: 2.146 ± 0.932
0.0ValPro: 0.0 ± 0.0
2.146ValGln: 2.146 ± 1.006
7.153ValArg: 7.153 ± 0.623
2.861ValSer: 2.861 ± 1.086
4.292ValThr: 4.292 ± 0.971
5.722ValVal: 5.722 ± 1.572
0.0ValTrp: 0.0 ± 0.0
0.0ValTyr: 0.0 ± 0.0
0.0ValXaa: 0.0 ± 0.0
Trp
0.715TrpAla: 0.715 ± 0.491
0.0TrpCys: 0.0 ± 0.0
0.715TrpAsp: 0.715 ± 0.491
0.0TrpGlu: 0.0 ± 0.0
0.715TrpPhe: 0.715 ± 0.537
0.715TrpGly: 0.715 ± 0.537
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.715TrpLys: 0.715 ± 0.501
1.431TrpLeu: 1.431 ± 1.074
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.431TrpGln: 1.431 ± 0.543
0.715TrpArg: 0.715 ± 0.501
0.715TrpSer: 0.715 ± 0.537
1.431TrpThr: 1.431 ± 0.581
1.431TrpVal: 1.431 ± 1.074
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.146TyrAla: 2.146 ± 0.801
0.715TyrCys: 0.715 ± 0.491
0.715TyrAsp: 0.715 ± 0.537
1.431TyrGlu: 1.431 ± 0.543
0.0TyrPhe: 0.0 ± 0.0
0.0TyrGly: 0.0 ± 0.0
0.715TyrHis: 0.715 ± 0.537
2.861TyrIle: 2.861 ± 1.162
0.715TyrLys: 0.715 ± 0.491
0.715TyrLeu: 0.715 ± 0.537
0.715TyrMet: 0.715 ± 0.537
2.146TyrAsn: 2.146 ± 0.801
0.715TyrPro: 0.715 ± 0.537
0.715TyrGln: 0.715 ± 0.501
2.146TyrArg: 2.146 ± 0.104
0.715TyrSer: 0.715 ± 0.491
5.007TyrThr: 5.007 ± 2.155
2.146TyrVal: 2.146 ± 0.932
0.0TyrTrp: 0.0 ± 0.0
1.431TyrTyr: 1.431 ± 0.398
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1399 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level
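A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual code: the function names `pair_permille` and `bootstrap_error`, the fixed random seed, and the use of the standard deviation of the replicate estimates as the reported error are all assumptions made for this example.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (two-letter string) in per mille."""
    hits = sum(
        1 for seq in proteins for i in range(len(seq) - 1) if seq[i:i + 2] == pair
    )
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the population standard deviation of the
    per-replicate frequencies.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)

if __name__ == "__main__":
    # Toy sequences, not the three proteins of this proteome.
    toy = ["AAAA", "CCCC", "ACAC"]
    print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Because whole proteins are resampled rather than individual residues, a proteome with only a few proteins yields few distinct replicates, which is worth keeping in mind when reading the ± values.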

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski