Amino acid dipeptide frequency for Tortoise microvirus 104

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
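As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping residue pairs within each protein and reports them in per mille. It is an illustration, not the Proteome-pI implementation: the function name, the use of one-letter codes (the table uses three-letter codes), the mapping of every non-standard letter to X (Xaa), and the choice not to count pairs spanning two proteins are all assumptions.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Hypothetical helper: the counting rules here are assumptions,
    not the method documented by the database.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, never spanning two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not the real proteome: "B" is non-standard and is mapped to X.
toy = ["MALSA", "ALB"]
freqs = dipeptide_per_mille(toy)
```

With the toy input, six dipeptides are counted in total and AL occurs twice, so `freqs["AL"]` is one third of 1000‰.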

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
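The expected value and the over/under comparison can be sketched as follows. The function names are hypothetical, and treating the uniform expectation as the exact threshold for the red and blue marking is an assumption; the page does not state the precise rule it uses.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation: 1 / alphabet_size**2, expressed in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, alphabet_size=20):
    """Compare an observed frequency with the uniform expectation.

    Assumption: a plain greater-than / less-than comparison.
    """
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"

print(expected_per_mille(20))  # 2.5 (400 possible dipeptides)
print(expected_per_mille(21))  # about 2.27 (441 possible dipeptides with Xaa)
print(classify(11.053))        # LeuSer value from the table -> over
print(classify(0.582))         # AlaMet value from the table -> under
```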

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
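For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are illustrative only.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10**-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

# AlaAla is listed as 4.072 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(4.072)
ala_ala_percent = per_mille_to_percent(4.072)
```

Here `ala_ala_fraction` is approximately 0.004072 and `ala_ala_percent` is approximately 0.4072.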

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.072 ± 1.435
AlaCys: 1.163 ± 1.149
AlaAsp: 4.072 ± 1.256
AlaGlu: 2.327 ± 0.841
AlaPhe: 2.909 ± 1.058
AlaGly: 2.327 ± 1.47
AlaHis: 1.745 ± 0.743
AlaIle: 3.49 ± 2.041
AlaLys: 3.49 ± 1.498
AlaLeu: 6.399 ± 1.163
AlaMet: 0.582 ± 0.807
AlaAsn: 2.327 ± 0.725
AlaPro: 1.163 ± 0.825
AlaGln: 5.236 ± 1.511
AlaArg: 2.909 ± 1.353
AlaSer: 5.817 ± 1.156
AlaThr: 6.399 ± 1.422
AlaVal: 2.909 ± 1.075
AlaTrp: 0.0 ± 0.0
AlaTyr: 5.817 ± 2.032
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.582 ± 0.575
CysCys: 0.582 ± 0.654
CysAsp: 0.582 ± 0.413
CysGlu: 1.163 ± 0.912
CysPhe: 1.163 ± 0.86
CysGly: 0.582 ± 0.575
CysHis: 0.582 ± 0.575
CysIle: 0.582 ± 0.413
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.582 ± 0.81
CysAsn: 0.582 ± 0.413
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 2.909 ± 1.177
CysSer: 0.582 ± 0.623
CysThr: 1.163 ± 0.554
CysVal: 0.582 ± 0.623
CysTrp: 0.0 ± 0.0
CysTyr: 0.582 ± 0.575
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.909 ± 1.266
AspCys: 0.582 ± 0.413
AspAsp: 4.654 ± 1.413
AspGlu: 2.327 ± 1.341
AspPhe: 2.909 ± 1.387
AspGly: 4.072 ± 1.725
AspHis: 0.582 ± 0.588
AspIle: 4.654 ± 1.562
AspLys: 5.236 ± 2.008
AspLeu: 6.981 ± 1.536
AspMet: 3.49 ± 2.018
AspAsn: 4.072 ± 1.862
AspPro: 1.163 ± 1.175
AspGln: 1.163 ± 1.089
AspArg: 0.0 ± 0.0
AspSer: 3.49 ± 0.94
AspThr: 4.072 ± 1.097
AspVal: 2.327 ± 1.213
AspTrp: 0.582 ± 0.544
AspTyr: 6.399 ± 1.429
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.909 ± 1.482
GluCys: 0.0 ± 0.0
GluAsp: 1.745 ± 1.083
GluGlu: 1.163 ± 1.149
GluPhe: 1.745 ± 0.85
GluGly: 0.0 ± 0.0
GluHis: 1.163 ± 0.992
GluIle: 1.163 ± 1.089
GluLys: 5.817 ± 1.676
GluLeu: 5.817 ± 1.838
GluMet: 1.163 ± 1.129
GluAsn: 4.654 ± 1.23
GluPro: 1.745 ± 1.051
GluGln: 2.909 ± 2.317
GluArg: 1.745 ± 0.948
GluSer: 3.49 ± 1.103
GluThr: 5.236 ± 2.366
GluVal: 2.909 ± 1.669
GluTrp: 0.0 ± 0.0
GluTyr: 2.909 ± 1.383
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.49 ± 1.522
PheCys: 0.0 ± 0.0
PheAsp: 4.654 ± 0.999
PheGlu: 0.582 ± 0.81
PhePhe: 2.327 ± 0.927
PheGly: 1.163 ± 0.825
PheHis: 1.163 ± 0.645
PheIle: 4.654 ± 1.585
PheLys: 2.327 ± 1.136
PheLeu: 1.163 ± 0.687
PheMet: 1.163 ± 0.825
PheAsn: 3.49 ± 1.218
PhePro: 0.0 ± 0.0
PheGln: 1.745 ± 0.952
PheArg: 2.327 ± 1.375
PheSer: 1.745 ± 0.726
PheThr: 1.745 ± 0.926
PheVal: 1.163 ± 0.833
PheTrp: 0.582 ± 0.623
PheTyr: 2.327 ± 1.65
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.745 ± 0.513
GlyCys: 0.582 ± 0.575
GlyAsp: 3.49 ± 1.056
GlyGlu: 1.745 ± 0.513
GlyPhe: 0.582 ± 0.413
GlyGly: 3.49 ± 0.894
GlyHis: 1.163 ± 0.825
GlyIle: 2.909 ± 0.918
GlyLys: 2.909 ± 1.19
GlyLeu: 6.981 ± 1.506
GlyMet: 0.582 ± 0.77
GlyAsn: 2.909 ± 1.066
GlyPro: 0.0 ± 0.0
GlyGln: 4.072 ± 2.214
GlyArg: 3.49 ± 1.903
GlySer: 4.072 ± 1.028
GlyThr: 2.909 ± 1.215
GlyVal: 1.745 ± 0.945
GlyTrp: 0.582 ± 0.623
GlyTyr: 3.49 ± 1.704
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.582 ± 0.575
HisAsp: 1.745 ± 0.932
HisGlu: 0.582 ± 0.81
HisPhe: 1.163 ± 0.864
HisGly: 1.745 ± 0.923
HisHis: 0.0 ± 0.0
HisIle: 0.582 ± 0.654
HisLys: 1.163 ± 1.084
HisLeu: 2.909 ± 0.876
HisMet: 0.0 ± 0.0
HisAsn: 2.327 ± 1.019
HisPro: 1.745 ± 0.932
HisGln: 0.582 ± 0.654
HisArg: 0.582 ± 0.684
HisSer: 1.163 ± 0.554
HisThr: 0.582 ± 0.623
HisVal: 1.163 ± 0.83
HisTrp: 0.582 ± 0.413
HisTyr: 1.745 ± 1.309
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.236 ± 1.287
IleCys: 0.582 ± 0.623
IleAsp: 6.981 ± 1.824
IleGlu: 2.909 ± 2.546
IlePhe: 1.163 ± 0.743
IleGly: 4.072 ± 0.956
IleHis: 0.0 ± 0.0
IleIle: 3.49 ± 1.504
IleLys: 2.327 ± 1.341
IleLeu: 2.909 ± 1.244
IleMet: 2.327 ± 1.378
IleAsn: 1.745 ± 1.055
IlePro: 3.49 ± 0.753
IleGln: 2.327 ± 1.325
IleArg: 5.817 ± 1.018
IleSer: 8.144 ± 1.074
IleThr: 4.654 ± 1.546
IleVal: 1.745 ± 0.934
IleTrp: 0.582 ± 0.413
IleTyr: 3.49 ± 1.55
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.072 ± 2.216
LysCys: 0.0 ± 0.0
LysAsp: 5.236 ± 1.798
LysGlu: 2.327 ± 1.382
LysPhe: 0.582 ± 0.575
LysGly: 4.654 ± 2.253
LysHis: 1.745 ± 0.785
LysIle: 2.909 ± 1.624
LysLys: 6.981 ± 3.201
LysLeu: 7.563 ± 2.36
LysMet: 4.654 ± 1.508
LysAsn: 4.654 ± 2.223
LysPro: 1.163 ± 0.554
LysGln: 3.49 ± 1.097
LysArg: 2.327 ± 1.149
LysSer: 6.981 ± 2.424
LysThr: 2.327 ± 0.901
LysVal: 1.745 ± 0.962
LysTrp: 1.163 ± 0.715
LysTyr: 2.909 ± 1.552
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.49 ± 1.496
LeuCys: 2.327 ± 0.813
LeuAsp: 4.654 ± 1.111
LeuGlu: 5.817 ± 2.293
LeuPhe: 2.909 ± 1.444
LeuGly: 6.399 ± 1.635
LeuHis: 0.582 ± 0.654
LeuIle: 3.49 ± 1.096
LeuLys: 5.817 ± 1.965
LeuLeu: 5.817 ± 1.734
LeuMet: 1.163 ± 0.554
LeuAsn: 2.909 ± 0.964
LeuPro: 8.726 ± 2.605
LeuGln: 5.236 ± 1.107
LeuArg: 3.49 ± 1.59
LeuSer: 11.053 ± 2.893
LeuThr: 4.654 ± 1.292
LeuVal: 4.654 ± 1.592
LeuTrp: 0.582 ± 0.413
LeuTyr: 4.072 ± 1.905
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.909 ± 0.937
MetCys: 0.0 ± 0.0
MetAsp: 1.163 ± 0.837
MetGlu: 0.582 ± 0.588
MetPhe: 1.745 ± 0.77
MetGly: 0.582 ± 0.623
MetHis: 1.163 ± 0.992
MetIle: 1.745 ± 1.167
MetLys: 0.582 ± 0.413
MetLeu: 2.327 ± 1.033
MetMet: 1.745 ± 0.79
MetAsn: 1.745 ± 1.238
MetPro: 1.745 ± 1.238
MetGln: 1.745 ± 0.79
MetArg: 1.745 ± 1.244
MetSer: 2.327 ± 0.596
MetThr: 1.163 ± 1.157
MetVal: 0.582 ± 0.654
MetTrp: 0.0 ± 0.0
MetTyr: 1.163 ± 0.755
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.909 ± 1.251
AsnCys: 1.163 ± 0.917
AsnAsp: 2.327 ± 1.284
AsnGlu: 4.072 ± 1.15
AsnPhe: 3.49 ± 1.4
AsnGly: 2.909 ± 1.025
AsnHis: 0.582 ± 0.77
AsnIle: 2.909 ± 1.666
AsnLys: 4.072 ± 1.226
AsnLeu: 2.909 ± 1.34
AsnMet: 0.582 ± 0.619
AsnAsn: 2.909 ± 1.145
AsnPro: 1.163 ± 0.554
AsnGln: 1.163 ± 0.492
AsnArg: 2.909 ± 1.514
AsnSer: 2.909 ± 1.295
AsnThr: 4.654 ± 1.222
AsnVal: 2.327 ± 1.295
AsnTrp: 1.163 ± 0.715
AsnTyr: 4.654 ± 2.145
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.654 ± 1.192
ProCys: 1.163 ± 0.83
ProAsp: 2.909 ± 1.177
ProGlu: 2.327 ± 1.529
ProPhe: 2.327 ± 1.133
ProGly: 0.582 ± 0.413
ProHis: 1.163 ± 0.796
ProIle: 4.072 ± 1.145
ProLys: 0.582 ± 0.623
ProLeu: 4.072 ± 1.028
ProMet: 0.582 ± 0.413
ProAsn: 1.163 ± 0.825
ProPro: 0.582 ± 0.413
ProGln: 2.909 ± 1.418
ProArg: 1.163 ± 0.554
ProSer: 2.909 ± 1.101
ProThr: 3.49 ± 2.476
ProVal: 2.909 ± 0.996
ProTrp: 0.0 ± 0.0
ProTyr: 2.909 ± 0.918
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.236 ± 2.279
GlnCys: 0.0 ± 0.0
GlnAsp: 1.163 ± 0.948
GlnGlu: 5.236 ± 2.837
GlnPhe: 1.163 ± 0.492
GlnGly: 2.327 ± 1.18
GlnHis: 1.163 ± 0.842
GlnIle: 2.909 ± 1.998
GlnLys: 4.072 ± 2.121
GlnLeu: 7.563 ± 3.261
GlnMet: 0.582 ± 0.544
GlnAsn: 4.072 ± 1.618
GlnPro: 2.327 ± 0.901
GlnGln: 3.49 ± 0.8
GlnArg: 4.654 ± 1.801
GlnSer: 5.236 ± 1.847
GlnThr: 5.236 ± 1.676
GlnVal: 4.072 ± 1.607
GlnTrp: 0.582 ± 0.654
GlnTyr: 1.745 ± 1.284
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.909 ± 1.056
ArgCys: 0.0 ± 0.0
ArgAsp: 2.909 ± 1.06
ArgGlu: 3.49 ± 1.474
ArgPhe: 0.582 ± 0.654
ArgGly: 1.745 ± 1.086
ArgHis: 1.745 ± 1.25
ArgIle: 4.072 ± 1.526
ArgLys: 5.236 ± 1.787
ArgLeu: 2.909 ± 1.674
ArgMet: 2.327 ± 1.173
ArgAsn: 2.327 ± 1.248
ArgPro: 1.163 ± 0.554
ArgGln: 4.654 ± 0.796
ArgArg: 2.327 ± 1.328
ArgSer: 1.745 ± 0.77
ArgThr: 2.327 ± 1.143
ArgVal: 1.163 ± 0.825
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.49 ± 1.333
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.399 ± 2.86
SerCys: 0.582 ± 0.684
SerAsp: 2.909 ± 1.012
SerGlu: 4.072 ± 1.405
SerPhe: 3.49 ± 2.094
SerGly: 3.49 ± 0.942
SerHis: 1.163 ± 0.645
SerIle: 7.563 ± 2.364
SerLys: 5.236 ± 1.16
SerLeu: 6.399 ± 2.42
SerMet: 0.582 ± 0.413
SerAsn: 4.072 ± 1.521
SerPro: 4.072 ± 1.529
SerGln: 6.981 ± 1.983
SerArg: 2.327 ± 0.989
SerSer: 5.236 ± 2.64
SerThr: 7.563 ± 2.057
SerVal: 4.072 ± 1.028
SerTrp: 0.0 ± 0.0
SerTyr: 1.163 ± 0.825
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.817 ± 1.471
ThrCys: 0.582 ± 0.413
ThrAsp: 2.909 ± 1.456
ThrGlu: 3.49 ± 1.239
ThrPhe: 4.654 ± 1.531
ThrGly: 4.072 ± 1.271
ThrHis: 1.163 ± 0.842
ThrIle: 4.072 ± 1.712
ThrLys: 4.654 ± 1.458
ThrLeu: 5.817 ± 1.377
ThrMet: 1.745 ± 1.086
ThrAsn: 1.745 ± 1.007
ThrPro: 4.072 ± 1.125
ThrGln: 5.236 ± 2.279
ThrArg: 2.327 ± 1.322
ThrSer: 3.49 ± 1.263
ThrThr: 2.909 ± 1.056
ThrVal: 3.49 ± 0.872
ThrTrp: 2.327 ± 0.782
ThrTyr: 2.909 ± 1.626
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.745 ± 0.658
ValCys: 0.582 ± 0.575
ValAsp: 2.327 ± 0.943
ValGlu: 1.745 ± 1.051
ValPhe: 1.163 ± 1.308
ValGly: 1.745 ± 0.79
ValHis: 1.745 ± 0.77
ValIle: 2.327 ± 1.033
ValLys: 1.745 ± 0.945
ValLeu: 4.654 ± 1.331
ValMet: 0.582 ± 0.413
ValAsn: 2.327 ± 1.601
ValPro: 3.49 ± 1.609
ValGln: 4.072 ± 2.042
ValArg: 1.745 ± 1.086
ValSer: 3.49 ± 1.588
ValThr: 3.49 ± 1.107
ValVal: 2.909 ± 1.503
ValTrp: 0.0 ± 0.0
ValTyr: 1.745 ± 0.658
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.582 ± 0.544
TrpCys: 0.582 ± 0.413
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.163 ± 0.715
TrpPhe: 0.582 ± 0.544
TrpGly: 0.0 ± 0.0
TrpHis: 0.582 ± 0.623
TrpIle: 0.582 ± 0.413
TrpLys: 0.582 ± 0.654
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.582 ± 0.544
TrpPro: 0.0 ± 0.0
TrpGln: 1.163 ± 1.149
TrpArg: 1.163 ± 0.755
TrpSer: 0.0 ± 0.0
TrpThr: 0.582 ± 0.413
TrpVal: 0.582 ± 0.623
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.072 ± 1.556
TyrCys: 1.745 ± 1.276
TyrAsp: 5.236 ± 1.608
TyrGlu: 1.163 ± 0.948
TyrPhe: 1.163 ± 0.825
TyrGly: 3.49 ± 0.94
TyrHis: 1.745 ± 1.724
TyrIle: 5.817 ± 2.102
TyrLys: 4.654 ± 1.318
TyrLeu: 4.654 ± 1.963
TyrMet: 1.745 ± 0.53
TyrAsn: 1.163 ± 0.797
TyrPro: 4.654 ± 1.264
TyrGln: 4.654 ± 1.315
TyrArg: 1.163 ± 0.492
TyrSer: 3.49 ± 1.663
TyrThr: 2.327 ± 0.983
TyrVal: 0.582 ± 0.81
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.909 ± 1.171
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (1720 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
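A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread across replicates is reported. This is an assumption-laden illustration, not the database's code; the function name, the fixed seed, and the use of the sample standard deviation as the reported error are all choices made here.

```python
import random
from collections import Counter
from statistics import stdev

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Hypothetical helper: resamples whole proteins with replacement
    n_boot times and returns the standard deviation of the estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # One replicate: draw as many proteins as in the original set.
        sample = [rng.choice(proteins) for _ in proteins]
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[pair] / total if total else 0.0)
    return stdev(estimates)

# Toy sequences, not the real proteome.
toy = ["MALSA", "ALGK", "SSAL"]
error_al = bootstrap_error(toy, "AL")
error_ww = bootstrap_error(toy, "WW")  # WW never occurs in the toy set
```

A dipeptide that never occurs has a frequency of zero in every replicate, so its estimated error is also zero, which matches the 0.0 ± 0.0 entries in the table.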

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski