Amino acid dipeptide frequency for Tortoise microvirus 50

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
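
As an illustration of the counting described above, here is a minimal Python sketch. It assumes one-letter sequences, overlapping pairs that never span two proteins, and normalization by the total number of pairs; the function name `dipeptide_frequencies` and the toy sequences are hypothetical, and the database itself may count or normalize differently.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids (one-letter code); any other symbol is mapped to "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2 (assumption: pairs do not span proteins).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Uniform expectation in per mille: 1/400 without Xaa, 1/441 with Xaa.
EXPECTED_20 = 1000.0 / 400
EXPECTED_21 = 1000.0 / 441

toy = ["MKKAAS", "AASSL"]  # made-up sequences, not taken from the proteome
freqs = dipeptide_frequencies(toy)
print(freqs["AA"], EXPECTED_20, EXPECTED_21)
```

With only a handful of proteins, most of the 441 pairs are never observed, which is why many table entries are 0.0.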

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
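
The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical, and using exactly 1/400 as the threshold is an assumption; the rule the page uses for its red and blue marking may differ.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation (assumed threshold)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


ala_ala = 13.176  # AlaAla entry from the table, in per mille
print(per_mille_to_fraction(ala_ala))
print(classify(ala_ala), classify(0.693))
```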

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) with estimated error, grouped by first residue. Within each group the second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 13.176 ± 5.614
AlaCys: 2.08 ± 1.992
AlaAsp: 4.161 ± 1.465
AlaGlu: 8.322 ± 4.15
AlaPhe: 1.387 ± 0.807
AlaGly: 4.161 ± 1.69
AlaHis: 0.693 ± 0.694
AlaIle: 2.08 ± 0.8
AlaLys: 2.774 ± 1.66
AlaLeu: 3.467 ± 1.361
AlaMet: 0.693 ± 0.694
AlaAsn: 6.241 ± 2.735
AlaPro: 1.387 ± 0.846
AlaGln: 4.854 ± 2.163
AlaArg: 7.628 ± 1.23
AlaSer: 4.854 ± 2.636
AlaThr: 4.161 ± 1.938
AlaVal: 3.467 ± 1.513
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.467 ± 0.878
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.693 ± 0.694
CysAsp: 1.387 ± 1.437
CysGlu: 0.0 ± 0.0
CysPhe: 0.693 ± 0.694
CysGly: 1.387 ± 1.389
CysHis: 0.693 ± 0.694
CysIle: 1.387 ± 1.097
CysLys: 0.0 ± 0.0
CysLeu: 0.693 ± 0.423
CysMet: 0.0 ± 0.0
CysAsn: 1.387 ± 1.097
CysPro: 0.0 ± 0.0
CysGln: 0.693 ± 0.759
CysArg: 0.693 ± 0.694
CysSer: 0.693 ± 1.01
CysThr: 1.387 ± 1.352
CysVal: 1.387 ± 0.849
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 5.548 ± 2.334
AspCys: 0.693 ± 0.759
AspAsp: 1.387 ± 0.846
AspGlu: 3.467 ± 0.891
AspPhe: 4.854 ± 1.805
AspGly: 0.693 ± 0.423
AspHis: 1.387 ± 0.63
AspIle: 2.774 ± 1.292
AspLys: 4.161 ± 2.625
AspLeu: 5.548 ± 1.951
AspMet: 1.387 ± 0.868
AspAsn: 4.161 ± 1.953
AspPro: 2.08 ± 1.315
AspGln: 1.387 ± 1.097
AspArg: 0.693 ± 0.423
AspSer: 9.015 ± 2.759
AspThr: 2.774 ± 0.835
AspVal: 2.774 ± 1.356
AspTrp: 1.387 ± 0.63
AspTyr: 4.161 ± 2.129
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.854 ± 2.142
GluCys: 0.693 ± 0.718
GluAsp: 1.387 ± 0.563
GluGlu: 2.08 ± 1.529
GluPhe: 4.161 ± 2.51
GluGly: 1.387 ± 0.772
GluHis: 4.161 ± 2.441
GluIle: 4.854 ± 1.24
GluLys: 4.854 ± 2.832
GluLeu: 2.774 ± 2.013
GluMet: 0.693 ± 0.688
GluAsn: 6.241 ± 2.014
GluPro: 1.387 ± 0.846
GluGln: 1.387 ± 0.846
GluArg: 2.08 ± 0.818
GluSer: 4.854 ± 2.685
GluThr: 0.693 ± 0.759
GluVal: 2.774 ± 0.664
GluTrp: 0.693 ± 0.694
GluTyr: 4.161 ± 0.859
GluXaa: 0.0 ± 0.0

Phe
PheAla: 4.854 ± 3.016
PheCys: 0.0 ± 0.0
PheAsp: 4.854 ± 1.744
PheGlu: 2.08 ± 1.453
PhePhe: 4.854 ± 1.678
PheGly: 4.854 ± 1.702
PheHis: 0.0 ± 0.0
PheIle: 6.241 ± 2.143
PheLys: 4.854 ± 2.266
PheLeu: 6.241 ± 2.277
PheMet: 2.08 ± 1.276
PheAsn: 2.08 ± 1.041
PhePro: 2.08 ± 1.133
PheGln: 0.693 ± 0.694
PheArg: 4.161 ± 1.277
PheSer: 9.709 ± 3.652
PheThr: 2.774 ± 1.126
PheVal: 0.693 ± 0.423
PheTrp: 0.0 ± 0.0
PheTyr: 0.693 ± 0.423
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 1.387 ± 0.563
GlyCys: 0.0 ± 0.0
GlyAsp: 4.854 ± 0.828
GlyGlu: 4.854 ± 1.601
GlyPhe: 3.467 ± 1.023
GlyGly: 4.854 ± 1.228
GlyHis: 0.0 ± 0.0
GlyIle: 1.387 ± 1.389
GlyLys: 2.08 ± 1.256
GlyLeu: 6.935 ± 2.868
GlyMet: 0.693 ± 0.718
GlyAsn: 0.693 ± 0.423
GlyPro: 0.693 ± 0.423
GlyGln: 0.0 ± 0.0
GlyArg: 2.774 ± 0.596
GlySer: 4.854 ± 0.779
GlyThr: 2.08 ± 1.269
GlyVal: 6.935 ± 2.465
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.774 ± 1.692
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.387 ± 1.097
HisCys: 0.0 ± 0.0
HisAsp: 2.08 ± 0.973
HisGlu: 0.693 ± 0.694
HisPhe: 2.774 ± 1.369
HisGly: 2.08 ± 1.269
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.693 ± 0.694
HisLeu: 3.467 ± 2.614
HisMet: 0.0 ± 0.0
HisAsn: 0.693 ± 0.594
HisPro: 0.0 ± 0.0
HisGln: 2.774 ± 1.285
HisArg: 0.0 ± 0.0
HisSer: 2.774 ± 1.31
HisThr: 0.0 ± 0.0
HisVal: 0.693 ± 0.423
HisTrp: 0.693 ± 0.694
HisTyr: 2.08 ± 1.457
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.467 ± 1.022
IleCys: 0.0 ± 0.0
IleAsp: 4.854 ± 0.828
IleGlu: 2.08 ± 1.416
IlePhe: 3.467 ± 1.291
IleGly: 3.467 ± 1.108
IleHis: 0.0 ± 0.0
IleIle: 0.693 ± 0.423
IleLys: 2.08 ± 0.981
IleLeu: 6.241 ± 2.487
IleMet: 2.08 ± 0.827
IleAsn: 2.774 ± 0.835
IlePro: 3.467 ± 1.017
IleGln: 2.774 ± 1.454
IleArg: 2.08 ± 1.256
IleSer: 2.08 ± 1.077
IleThr: 4.161 ± 1.89
IleVal: 0.0 ± 0.0
IleTrp: 2.08 ± 0.719
IleTyr: 3.467 ± 1.343
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.161 ± 2.388
LysCys: 0.693 ± 0.694
LysAsp: 0.693 ± 0.718
LysGlu: 4.161 ± 1.108
LysPhe: 4.161 ± 2.097
LysGly: 2.774 ± 1.211
LysHis: 2.08 ± 1.343
LysIle: 6.241 ± 2.978
LysLys: 5.548 ± 3.932
LysLeu: 4.161 ± 1.747
LysMet: 0.0 ± 0.0
LysAsn: 3.467 ± 2.077
LysPro: 2.774 ± 0.654
LysGln: 2.774 ± 1.714
LysArg: 2.08 ± 0.973
LysSer: 6.241 ± 2.257
LysThr: 2.774 ± 1.353
LysVal: 1.387 ± 1.097
LysTrp: 0.0 ± 0.0
LysTyr: 1.387 ± 1.389
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 9.015 ± 2.186
LeuCys: 0.693 ± 1.01
LeuAsp: 6.241 ± 3.177
LeuGlu: 4.161 ± 3.308
LeuPhe: 2.08 ± 0.873
LeuGly: 2.774 ± 1.14
LeuHis: 1.387 ± 1.097
LeuIle: 4.854 ± 1.302
LeuLys: 6.241 ± 3.031
LeuLeu: 4.854 ± 2.032
LeuMet: 0.693 ± 1.3
LeuAsn: 6.935 ± 1.634
LeuPro: 6.935 ± 2.295
LeuGln: 2.08 ± 0.8
LeuArg: 3.467 ± 0.863
LeuSer: 6.241 ± 1.3
LeuThr: 5.548 ± 2.034
LeuVal: 5.548 ± 0.856
LeuTrp: 2.08 ± 0.818
LeuTyr: 3.467 ± 1.202
LeuXaa: 0.0 ± 0.0

Met
MetAla: 0.0 ± 0.0
MetCys: 1.387 ± 1.389
MetAsp: 0.693 ± 0.423
MetGlu: 1.387 ± 0.915
MetPhe: 0.693 ± 0.759
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.693 ± 0.423
MetLys: 2.774 ± 1.353
MetLeu: 1.387 ± 0.563
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.693 ± 0.423
MetGln: 0.693 ± 0.694
MetArg: 0.0 ± 0.0
MetSer: 2.08 ± 1.491
MetThr: 0.0 ± 0.0
MetVal: 0.693 ± 0.423
MetTrp: 1.387 ± 0.563
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.467 ± 1.487
AsnCys: 0.693 ± 0.694
AsnAsp: 4.854 ± 1.744
AsnGlu: 4.161 ± 1.912
AsnPhe: 0.0 ± 0.0
AsnGly: 1.387 ± 0.63
AsnHis: 0.693 ± 0.423
AsnIle: 2.08 ± 0.8
AsnLys: 2.08 ± 0.544
AsnLeu: 6.241 ± 1.924
AsnMet: 0.693 ± 0.423
AsnAsn: 2.08 ± 0.544
AsnPro: 2.774 ± 1.009
AsnGln: 0.693 ± 0.594
AsnArg: 3.467 ± 1.157
AsnSer: 4.854 ± 0.986
AsnThr: 3.467 ± 0.501
AsnVal: 4.854 ± 1.441
AsnTrp: 0.693 ± 0.423
AsnTyr: 1.387 ± 0.563
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.774 ± 1.009
ProCys: 0.693 ± 0.694
ProAsp: 2.774 ± 1.26
ProGlu: 2.08 ± 0.719
ProPhe: 3.467 ± 1.132
ProGly: 2.774 ± 1.14
ProHis: 2.08 ± 0.818
ProIle: 3.467 ± 1.317
ProLys: 1.387 ± 2.02
ProLeu: 3.467 ± 2.614
ProMet: 0.693 ± 0.423
ProAsn: 2.08 ± 0.873
ProPro: 1.387 ± 1.013
ProGln: 2.08 ± 1.269
ProArg: 0.693 ± 0.423
ProSer: 9.015 ± 2.293
ProThr: 1.387 ± 0.846
ProVal: 4.161 ± 1.876
ProTrp: 1.387 ± 0.846
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.161 ± 1.69
GlnCys: 0.0 ± 0.0
GlnAsp: 2.08 ± 0.818
GlnGlu: 2.08 ± 1.077
GlnPhe: 1.387 ± 0.678
GlnGly: 2.774 ± 1.149
GlnHis: 0.0 ± 0.0
GlnIle: 2.08 ± 1.077
GlnLys: 4.854 ± 1.561
GlnLeu: 1.387 ± 1.389
GlnMet: 0.0 ± 0.0
GlnAsn: 0.693 ± 0.594
GlnPro: 2.774 ± 1.411
GlnGln: 4.161 ± 1.562
GlnArg: 6.241 ± 1.44
GlnSer: 0.693 ± 1.01
GlnThr: 1.387 ± 0.846
GlnVal: 1.387 ± 0.772
GlnTrp: 1.387 ± 1.188
GlnTyr: 0.693 ± 0.423
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.387 ± 0.846
ArgCys: 0.693 ± 0.694
ArgAsp: 2.08 ± 0.818
ArgGlu: 2.774 ± 1.994
ArgPhe: 4.854 ± 1.401
ArgGly: 1.387 ± 0.846
ArgHis: 2.08 ± 0.894
ArgIle: 2.08 ± 1.189
ArgLys: 2.08 ± 1.343
ArgLeu: 5.548 ± 2.07
ArgMet: 1.387 ± 0.63
ArgAsn: 0.693 ± 0.423
ArgPro: 4.161 ± 2.513
ArgGln: 3.467 ± 1.157
ArgArg: 2.774 ± 0.596
ArgSer: 6.241 ± 1.736
ArgThr: 0.0 ± 0.0
ArgVal: 2.774 ± 1.126
ArgTrp: 0.693 ± 0.594
ArgTyr: 4.161 ± 1.102
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 10.402 ± 4.517
SerCys: 2.774 ± 0.927
SerAsp: 6.241 ± 2.101
SerGlu: 4.161 ± 1.936
SerPhe: 9.015 ± 4.173
SerGly: 4.161 ± 1.907
SerHis: 4.161 ± 1.232
SerIle: 3.467 ± 1.37
SerLys: 2.774 ± 1.192
SerLeu: 10.402 ± 2.722
SerMet: 0.693 ± 0.423
SerAsn: 1.387 ± 0.63
SerPro: 3.467 ± 1.291
SerGln: 3.467 ± 0.501
SerArg: 4.854 ± 1.24
SerSer: 13.87 ± 3.137
SerThr: 7.628 ± 1.704
SerVal: 5.548 ± 2.426
SerTrp: 0.0 ± 0.0
SerTyr: 1.387 ± 0.807
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.774 ± 1.126
ThrCys: 1.387 ± 1.352
ThrAsp: 3.467 ± 1.376
ThrGlu: 2.08 ± 1.269
ThrPhe: 4.854 ± 1.541
ThrGly: 4.854 ± 1.323
ThrHis: 0.693 ± 0.759
ThrIle: 3.467 ± 1.5
ThrLys: 4.161 ± 2.18
ThrLeu: 2.774 ± 0.596
ThrMet: 0.693 ± 0.694
ThrAsn: 0.693 ± 0.594
ThrPro: 4.161 ± 1.243
ThrGln: 1.387 ± 0.938
ThrArg: 1.387 ± 0.846
ThrSer: 2.08 ± 0.8
ThrThr: 3.467 ± 1.37
ThrVal: 2.774 ± 1.149
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.693 ± 0.694
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.467 ± 1.398
ValCys: 0.0 ± 0.0
ValAsp: 1.387 ± 0.678
ValGlu: 1.387 ± 1.188
ValPhe: 4.161 ± 2.063
ValGly: 2.08 ± 1.077
ValHis: 1.387 ± 0.846
ValIle: 0.693 ± 0.423
ValLys: 2.08 ± 1.294
ValLeu: 6.935 ± 1.614
ValMet: 1.387 ± 0.914
ValAsn: 3.467 ± 0.975
ValPro: 4.854 ± 1.602
ValGln: 0.693 ± 0.423
ValArg: 3.467 ± 1.785
ValSer: 6.241 ± 1.813
ValThr: 0.693 ± 0.694
ValVal: 4.161 ± 0.877
ValTrp: 0.0 ± 0.0
ValTyr: 4.161 ± 0.973
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.693 ± 0.694
TrpCys: 0.0 ± 0.0
TrpAsp: 2.08 ± 1.269
TrpGlu: 0.693 ± 0.594
TrpPhe: 1.387 ± 0.63
TrpGly: 0.0 ± 0.0
TrpHis: 0.693 ± 0.423
TrpIle: 0.693 ± 0.594
TrpLys: 0.0 ± 0.0
TrpLeu: 0.693 ± 0.594
TrpMet: 0.0 ± 0.0
TrpAsn: 2.08 ± 0.781
TrpPro: 2.08 ± 0.818
TrpGln: 2.08 ± 1.256
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 0.693 ± 0.423
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.08 ± 0.894
TyrCys: 0.0 ± 0.0
TyrAsp: 2.774 ± 1.501
TyrGlu: 3.467 ± 0.973
TyrPhe: 2.08 ± 1.269
TyrGly: 3.467 ± 1.145
TyrHis: 0.693 ± 0.694
TyrIle: 2.08 ± 1.256
TyrLys: 2.08 ± 0.8
TyrLeu: 2.774 ± 1.149
TyrMet: 0.0 ± 0.0
TyrAsn: 2.774 ± 1.113
TyrPro: 0.693 ± 0.423
TyrGln: 2.08 ± 0.818
TyrArg: 2.774 ± 0.837
TyrSer: 3.467 ± 1.023
TyrThr: 2.774 ± 0.596
TyrVal: 0.693 ± 0.694
TyrTrp: 1.387 ± 0.846
TyrTyr: 2.08 ± 1.256
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6 proteins (1443 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
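
A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. This is only an illustration under assumptions; the function names, the use of the sample standard deviation as the error, the fixed seed, and the toy sequences are hypothetical and not taken from the Proteome-pI pipeline.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over overlapping pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


toy = ["MKKAAS", "AASSL", "MSSLK"]  # made-up sequences, not from the proteome
print(dipeptide_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only 6 proteins, each resample can differ considerably from the original set, which is consistent with errors in the table that are often comparable in size to the frequencies themselves.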

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski