Amino acid dipeptide frequency for Tortoise microvirus 75

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
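As a minimal sketch of how such frequencies could be computed, the Python snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter sequence encoding, and the toy sequences are assumptions made for illustration; the exact counting and normalisation used by the database are not stated on this page.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; pairs never span two proteins here.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up sequences in one-letter code, not taken from the proteome.
toy_proteins = ["MAAG", "AGA"]
freqs = dipeptide_per_mille(toy_proteins)
```

With these toy inputs there are five pairs in total (MA, AA, AG, AG, GA), so the frequencies sum to 1000‰.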

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
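The comparison against the uniform expectation can be sketched as follows. This assumes the baseline is exactly 1/400 (2.5‰), as described above; the threshold the site actually uses for colouring is not stated here, and the helper names `classify` and `per_mille_to_fraction` are hypothetical. The three example values are taken from the table below.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille for 20 x 20 dipeptides


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a frequency relative to the assumed uniform baseline."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Values copied from the table (AlaAla, CysCys, AspAsp), in per mille.
examples = {"AlaAla": 14.317, "CysCys": 0.0, "AspAsp": 2.753}
labels = {name: classify(value) for name, value in examples.items()}
fractions = {name: per_mille_to_fraction(value) for name, value in examples.items()}
```

Under this assumption AlaAla and AspAsp come out as overrepresented and CysCys as underrepresented.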

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.317 ± 2.317
AlaCys: 0.0 ± 0.0
AlaAsp: 4.405 ± 1.31
AlaGlu: 6.608 ± 1.843
AlaPhe: 2.753 ± 1.277
AlaGly: 6.608 ± 2.345
AlaHis: 3.855 ± 1.476
AlaIle: 6.057 ± 1.163
AlaLys: 6.608 ± 0.629
AlaLeu: 11.013 ± 2.162
AlaMet: 2.753 ± 0.515
AlaAsn: 6.608 ± 2.259
AlaPro: 6.608 ± 1.614
AlaGln: 1.652 ± 1.012
AlaArg: 7.709 ± 1.054
AlaSer: 5.507 ± 2.321
AlaThr: 6.057 ± 1.421
AlaVal: 7.709 ± 1.814
AlaTrp: 1.101 ± 0.547
AlaTyr: 2.753 ± 1.074
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.551 ± 0.547
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.551 ± 0.454
CysLys: 0.551 ± 0.592
CysLeu: 0.0 ± 0.0
CysMet: 0.551 ± 0.563
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.101 ± 0.907
CysSer: 0.0 ± 0.0
CysThr: 0.551 ± 0.409
CysVal: 0.551 ± 0.556
CysTrp: 0.551 ± 0.454
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.057 ± 2.427
AspCys: 0.0 ± 0.0
AspAsp: 2.753 ± 0.943
AspGlu: 1.652 ± 0.58
AspPhe: 1.652 ± 0.863
AspGly: 5.507 ± 1.117
AspHis: 1.101 ± 0.46
AspIle: 1.101 ± 0.688
AspLys: 0.551 ± 0.724
AspLeu: 2.753 ± 0.583
AspMet: 0.551 ± 0.578
AspAsn: 1.652 ± 1.192
AspPro: 4.405 ± 1.644
AspGln: 4.405 ± 1.696
AspArg: 1.101 ± 0.907
AspSer: 3.304 ± 1.16
AspThr: 2.753 ± 0.801
AspVal: 2.753 ± 1.559
AspTrp: 3.855 ± 0.92
AspTyr: 1.652 ± 1.169
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.26 ± 2.677
GluCys: 0.0 ± 0.0
GluAsp: 2.753 ± 1.016
GluGlu: 6.608 ± 3.119
GluPhe: 0.551 ± 0.454
GluGly: 5.507 ± 1.667
GluHis: 0.551 ± 0.547
GluIle: 2.203 ± 0.508
GluLys: 2.753 ± 1.601
GluLeu: 3.304 ± 1.044
GluMet: 1.652 ± 0.628
GluAsn: 2.203 ± 0.94
GluPro: 3.855 ± 1.656
GluGln: 4.405 ± 1.428
GluArg: 6.057 ± 0.852
GluSer: 2.753 ± 0.886
GluThr: 3.855 ± 0.564
GluVal: 7.159 ± 2.298
GluTrp: 2.203 ± 1.27
GluTyr: 1.652 ± 1.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.304 ± 0.826
PheCys: 0.0 ± 0.0
PheAsp: 1.101 ± 0.599
PheGlu: 1.652 ± 0.817
PhePhe: 0.551 ± 0.578
PheGly: 3.304 ± 0.624
PheHis: 0.551 ± 0.454
PheIle: 1.652 ± 0.743
PheLys: 2.203 ± 1.077
PheLeu: 1.101 ± 0.907
PheMet: 1.101 ± 0.539
PheAsn: 3.304 ± 1.607
PhePro: 1.652 ± 1.119
PheGln: 1.101 ± 0.819
PheArg: 1.652 ± 0.58
PheSer: 1.101 ± 0.742
PheThr: 2.753 ± 1.31
PheVal: 3.304 ± 1.064
PheTrp: 0.551 ± 0.454
PheTyr: 2.753 ± 0.78
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 12.665 ± 4.406
GlyCys: 0.551 ± 0.547
GlyAsp: 6.057 ± 2.08
GlyGlu: 5.507 ± 1.505
GlyPhe: 3.855 ± 1.889
GlyGly: 10.463 ± 3.216
GlyHis: 1.652 ± 0.524
GlyIle: 4.956 ± 1.209
GlyLys: 4.405 ± 1.677
GlyLeu: 6.057 ± 0.851
GlyMet: 0.551 ± 0.556
GlyAsn: 1.652 ± 1.048
GlyPro: 3.304 ± 1.632
GlyGln: 1.652 ± 0.58
GlyArg: 3.855 ± 2.048
GlySer: 4.956 ± 2.671
GlyThr: 3.304 ± 0.972
GlyVal: 6.608 ± 1.561
GlyTrp: 1.101 ± 0.641
GlyTyr: 2.753 ± 0.827
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.101 ± 0.819
HisCys: 0.551 ± 0.592
HisAsp: 0.551 ± 0.578
HisGlu: 3.304 ± 1.322
HisPhe: 0.551 ± 0.454
HisGly: 1.652 ± 0.817
HisHis: 0.0 ± 0.0
HisIle: 1.652 ± 0.817
HisLys: 0.551 ± 0.409
HisLeu: 1.101 ± 0.907
HisMet: 0.551 ± 0.547
HisAsn: 1.101 ± 0.669
HisPro: 0.551 ± 0.547
HisGln: 0.0 ± 0.0
HisArg: 2.203 ± 1.195
HisSer: 1.652 ± 1.169
HisThr: 2.203 ± 1.06
HisVal: 2.753 ± 0.981
HisTrp: 1.652 ± 0.715
HisTyr: 0.551 ± 0.592
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.507 ± 1.686
IleCys: 0.0 ± 0.0
IleAsp: 1.652 ± 0.774
IleGlu: 0.551 ± 0.409
IlePhe: 1.101 ± 0.819
IleGly: 4.405 ± 1.498
IleHis: 1.101 ± 0.688
IleIle: 1.101 ± 0.606
IleLys: 2.203 ± 0.823
IleLeu: 0.551 ± 0.578
IleMet: 1.101 ± 0.901
IleAsn: 0.551 ± 0.409
IlePro: 2.203 ± 0.967
IleGln: 3.304 ± 0.757
IleArg: 2.203 ± 0.508
IleSer: 1.101 ± 0.46
IleThr: 0.551 ± 0.409
IleVal: 2.753 ± 1.054
IleTrp: 1.101 ± 0.641
IleTyr: 0.551 ± 0.454
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.159 ± 1.7
LysCys: 0.0 ± 0.0
LysAsp: 3.855 ± 1.269
LysGlu: 2.753 ± 1.079
LysPhe: 1.652 ± 0.715
LysGly: 3.855 ± 2.767
LysHis: 1.101 ± 0.78
LysIle: 0.0 ± 0.0
LysLys: 2.753 ± 1.424
LysLeu: 3.304 ± 1.438
LysMet: 3.304 ± 1.332
LysAsn: 2.753 ± 1.585
LysPro: 3.855 ± 1.59
LysGln: 3.855 ± 1.396
LysArg: 1.101 ± 0.46
LysSer: 3.304 ± 0.785
LysThr: 0.551 ± 0.724
LysVal: 2.203 ± 0.955
LysTrp: 0.0 ± 0.0
LysTyr: 1.101 ± 0.571
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.956 ± 1.333
LeuCys: 0.0 ± 0.0
LeuAsp: 3.855 ± 0.855
LeuGlu: 3.855 ± 0.974
LeuPhe: 2.203 ± 1.108
LeuGly: 8.26 ± 1.91
LeuHis: 3.855 ± 0.874
LeuIle: 1.101 ± 0.767
LeuLys: 5.507 ± 1.511
LeuLeu: 6.057 ± 1.629
LeuMet: 1.652 ± 0.926
LeuAsn: 2.203 ± 1.238
LeuPro: 4.405 ± 1.419
LeuGln: 2.203 ± 1.376
LeuArg: 3.855 ± 1.237
LeuSer: 3.304 ± 0.655
LeuThr: 3.855 ± 1.573
LeuVal: 3.304 ± 0.655
LeuTrp: 0.551 ± 0.409
LeuTyr: 1.101 ± 0.599
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.405 ± 0.847
MetCys: 0.551 ± 0.454
MetAsp: 1.652 ± 0.996
MetGlu: 0.551 ± 0.454
MetPhe: 0.551 ± 0.592
MetGly: 1.652 ± 1.187
MetHis: 0.551 ± 0.409
MetIle: 1.101 ± 0.641
MetLys: 1.101 ± 0.599
MetLeu: 3.855 ± 1.635
MetMet: 0.551 ± 0.724
MetAsn: 0.551 ± 0.409
MetPro: 1.652 ± 1.228
MetGln: 1.101 ± 0.907
MetArg: 2.203 ± 0.748
MetSer: 2.753 ± 1.166
MetThr: 0.551 ± 0.547
MetVal: 2.753 ± 1.133
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.652 ± 1.048
AsnCys: 0.0 ± 0.0
AsnAsp: 0.0 ± 0.0
AsnGlu: 0.551 ± 0.578
AsnPhe: 0.551 ± 0.578
AsnGly: 3.304 ± 1.052
AsnHis: 0.551 ± 0.592
AsnIle: 3.304 ± 0.622
AsnLys: 2.753 ± 0.913
AsnLeu: 3.304 ± 0.5
AsnMet: 1.101 ± 0.828
AsnAsn: 0.551 ± 0.454
AsnPro: 3.304 ± 0.87
AsnGln: 1.652 ± 1.302
AsnArg: 4.956 ± 1.5
AsnSer: 1.652 ± 0.563
AsnThr: 1.652 ± 1.228
AsnVal: 3.304 ± 1.178
AsnTrp: 1.101 ± 0.46
AsnTyr: 2.753 ± 1.086
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.608 ± 2.363
ProCys: 0.551 ± 0.556
ProAsp: 1.101 ± 0.606
ProGlu: 4.956 ± 1.035
ProPhe: 1.101 ± 0.819
ProGly: 4.956 ± 1.599
ProHis: 1.652 ± 0.827
ProIle: 1.101 ± 0.68
ProLys: 3.304 ± 1.925
ProLeu: 2.753 ± 1.088
ProMet: 2.753 ± 1.086
ProAsn: 3.855 ± 1.171
ProPro: 4.405 ± 1.149
ProGln: 2.203 ± 0.929
ProArg: 2.203 ± 0.625
ProSer: 2.753 ± 0.677
ProThr: 2.203 ± 1.108
ProVal: 6.608 ± 1.907
ProTrp: 2.203 ± 1.288
ProTyr: 0.551 ± 0.547
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.956 ± 1.443
GlnCys: 0.551 ± 0.454
GlnAsp: 1.652 ± 1.012
GlnGlu: 7.159 ± 3.296
GlnPhe: 2.203 ± 1.41
GlnGly: 1.652 ± 1.734
GlnHis: 0.551 ± 0.454
GlnIle: 1.101 ± 0.78
GlnLys: 2.203 ± 0.823
GlnLeu: 3.855 ± 1.966
GlnMet: 1.101 ± 0.767
GlnAsn: 2.203 ± 1.085
GlnPro: 1.652 ± 1.127
GlnGln: 3.855 ± 0.883
GlnArg: 2.203 ± 1.112
GlnSer: 2.753 ± 1.088
GlnThr: 1.101 ± 0.547
GlnVal: 3.855 ± 1.295
GlnTrp: 2.203 ± 0.967
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.159 ± 1.197
ArgCys: 0.0 ± 0.0
ArgAsp: 4.405 ± 2.107
ArgGlu: 4.956 ± 1.408
ArgPhe: 4.405 ± 1.674
ArgGly: 2.203 ± 1.168
ArgHis: 0.551 ± 0.556
ArgIle: 1.652 ± 0.926
ArgLys: 2.203 ± 1.815
ArgLeu: 4.405 ± 1.486
ArgMet: 4.956 ± 1.777
ArgAsn: 3.304 ± 0.778
ArgPro: 2.753 ± 1.848
ArgGln: 6.057 ± 1.684
ArgArg: 4.956 ± 1.17
ArgSer: 6.608 ± 2.556
ArgThr: 1.652 ± 0.713
ArgVal: 4.956 ± 1.298
ArgTrp: 1.652 ± 0.743
ArgTyr: 2.203 ± 1.376
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.855 ± 1.005
SerCys: 1.101 ± 0.46
SerAsp: 3.304 ± 1.623
SerGlu: 4.956 ± 0.844
SerPhe: 1.652 ± 0.788
SerGly: 4.405 ± 3.048
SerHis: 1.652 ± 0.827
SerIle: 2.203 ± 0.819
SerLys: 5.507 ± 2.482
SerLeu: 4.405 ± 1.901
SerMet: 0.0 ± 0.0
SerAsn: 1.652 ± 0.817
SerPro: 2.753 ± 1.19
SerGln: 3.304 ± 1.244
SerArg: 4.405 ± 1.382
SerSer: 4.405 ± 1.999
SerThr: 2.753 ± 1.081
SerVal: 3.855 ± 0.741
SerTrp: 1.101 ± 0.669
SerTyr: 2.203 ± 1.108
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.057 ± 1.255
ThrCys: 0.0 ± 0.0
ThrAsp: 2.753 ± 1.66
ThrGlu: 1.652 ± 1.012
ThrPhe: 1.652 ± 0.996
ThrGly: 5.507 ± 2.15
ThrHis: 0.551 ± 0.578
ThrIle: 0.0 ± 0.0
ThrLys: 0.551 ± 0.409
ThrLeu: 2.753 ± 0.571
ThrMet: 0.551 ± 0.409
ThrAsn: 1.101 ± 0.547
ThrPro: 1.652 ± 0.996
ThrGln: 0.551 ± 0.409
ThrArg: 5.507 ± 1.75
ThrSer: 2.203 ± 1.112
ThrThr: 2.753 ± 1.559
ThrVal: 3.304 ± 1.418
ThrTrp: 0.0 ± 0.0
ThrTyr: 4.405 ± 1.942
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.709 ± 0.665
ValCys: 0.551 ± 0.454
ValAsp: 3.855 ± 1.842
ValGlu: 5.507 ± 1.296
ValPhe: 4.956 ± 1.659
ValGly: 7.159 ± 1.191
ValHis: 2.753 ± 1.068
ValIle: 2.203 ± 1.048
ValLys: 1.101 ± 0.819
ValLeu: 2.753 ± 1.116
ValMet: 1.652 ± 0.817
ValAsn: 0.551 ± 0.409
ValPro: 6.608 ± 2.501
ValGln: 3.304 ± 2.83
ValArg: 9.361 ± 0.823
ValSer: 5.507 ± 1.466
ValThr: 3.304 ± 1.894
ValVal: 6.057 ± 2.758
ValTrp: 0.551 ± 0.556
ValTyr: 2.203 ± 1.201
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.203 ± 1.112
TrpCys: 0.551 ± 0.592
TrpAsp: 1.652 ± 1.012
TrpGlu: 2.203 ± 0.999
TrpPhe: 1.652 ± 1.228
TrpGly: 1.101 ± 0.641
TrpHis: 0.551 ± 0.454
TrpIle: 0.0 ± 0.0
TrpLys: 0.551 ± 0.547
TrpLeu: 1.101 ± 0.907
TrpMet: 0.551 ± 0.454
TrpAsn: 1.101 ± 0.742
TrpPro: 1.652 ± 0.926
TrpGln: 1.101 ± 0.571
TrpArg: 2.753 ± 1.079
TrpSer: 1.101 ± 0.907
TrpThr: 0.0 ± 0.0
TrpVal: 1.101 ± 0.669
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.551 ± 0.556
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.203 ± 1.815
TyrCys: 0.0 ± 0.0
TyrAsp: 2.203 ± 0.967
TyrGlu: 3.304 ± 1.461
TyrPhe: 1.101 ± 0.46
TyrGly: 4.405 ± 1.186
TyrHis: 1.101 ± 0.767
TyrIle: 0.551 ± 0.454
TyrLys: 1.652 ± 0.58
TyrLeu: 1.652 ± 1.361
TyrMet: 0.551 ± 0.556
TyrAsn: 1.101 ± 0.547
TyrPro: 0.551 ± 0.547
TyrGln: 1.101 ± 1.094
TyrArg: 1.652 ± 0.874
TyrSer: 2.753 ± 1.391
TyrThr: 1.101 ± 0.599
TyrVal: 2.753 ± 1.04
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (1817 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
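One way such a protein-level bootstrap might look is sketched below: whole proteins are resampled with replacement 100 times, the frequency is recomputed on each resample, and the spread of the estimates is reported. This is an illustrative assumption rather than the database's actual procedure; the function names, the use of the population standard deviation, the fixed seed, and the toy sequences are all hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_frequency(sequences, pair):
    """Per mille frequency of one dipeptide over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Spread of the frequency when whole proteins are resampled."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Made-up sequences in one-letter code, not taken from the proteome.
toy_proteins = ["MAAGAA", "MKAAL", "MGGAK", "MAAAV"]
error = bootstrap_error(toy_proteins, "AA")
```

Resampling at the protein level keeps each protein intact, so the error reflects how much the estimate depends on which proteins happen to be in the set. With only 7 proteins, as here, such an estimate is necessarily coarse.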

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski