Amino acid dipeptide frequency for Tortoise microvirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
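The calculation described above can be sketched in Python. This is a minimal illustration and not the code used by Proteome-pI: the function names `dipeptide_permille` and `classify`, the toy sequences, and the choice to pool counts over all proteins without pairing residues across protein boundaries are all assumptions made for the example. The uniform expectation of 2.5‰ follows from the 400 standard dipeptides mentioned in the text.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X (Xaa) stands for anything else

# Uniform expectation from the text: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences.

    Assumption: pairs are overlapping windows of length 2 and never span
    two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        for first, second in zip(seq, seq[1:]):
            # Map non-standard or unknown residues to X.
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (sequences too short or empty)")
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "as expected"


if __name__ == "__main__":
    toy = ["MKKLA", "AAKZ"]  # made-up sequences, not from this proteome
    freqs = dipeptide_permille(toy)
    print(round(freqs["KK"], 3), classify(freqs["KK"]))  # 142.857 over
```

The returned dictionary always has 441 entries (including the Xaa combinations), and its values sum to 1000‰. Multiplying any value by 10⁻³ gives the corresponding fraction.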

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 4.777 ± 2.01 | 1.062 ± 0.779 | 3.715 ± 1.032 | 6.369 ± 1.857 | 3.185 ± 1.307 | 5.308 ± 2.781 | 1.062 ± 0.788 | 2.654 ± 0.995 | 4.777 ± 1.698 | 5.308 ± 1.215 | 3.185 ± 0.864 | 4.246 ± 1.238 | 5.839 ± 1.868 | 5.839 ± 2.306 | 2.654 ± 0.873 | 5.839 ± 2.201 | 3.185 ± 0.924 | 4.777 ± 1.556 | 0.0 ± 0.0 | 3.185 ± 0.744 | 0.0 ± 0.0 |
| Cys | 1.062 ± 0.692 | 0.0 ± 0.0 | 0.531 ± 0.382 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.531 ± 0.546 | 1.062 ± 0.779 | 0.0 ± 0.0 | 1.062 ± 0.779 | 2.123 ± 1.836 | 0.0 ± 0.0 | 1.062 ± 0.697 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.592 ± 0.839 | 1.062 ± 0.7 | 2.123 ± 1.286 | 1.062 ± 0.854 | 0.0 ± 0.0 | 1.062 ± 1.389 | 0.0 ± 0.0 |
| Asp | 6.9 ± 2.361 | 1.592 ± 1.006 | 5.839 ± 2.383 | 1.062 ± 0.764 | 6.9 ± 2.191 | 2.654 ± 1.075 | 0.531 ± 0.382 | 3.715 ± 1.199 | 3.185 ± 1.087 | 6.369 ± 1.516 | 0.531 ± 0.382 | 1.062 ± 0.692 | 1.062 ± 0.828 | 1.062 ± 0.941 | 3.185 ± 1.039 | 4.777 ± 1.933 | 4.777 ± 1.679 | 4.246 ± 1.751 | 0.531 ± 0.546 | 5.839 ± 1.676 | 0.0 ± 0.0 |
| Glu | 2.123 ± 1.212 | 0.531 ± 0.674 | 0.531 ± 0.47 | 1.592 ± 0.737 | 2.654 ± 0.846 | 1.062 ± 1.093 | 0.531 ± 0.695 | 2.123 ± 1.048 | 3.185 ± 0.907 | 4.246 ± 1.913 | 1.592 ± 0.743 | 2.123 ± 1.372 | 1.062 ± 0.967 | 2.654 ± 0.95 | 4.777 ± 2.497 | 4.777 ± 1.436 | 1.592 ± 0.757 | 3.715 ± 1.308 | 0.531 ± 0.546 | 2.123 ± 0.712 | 0.0 ± 0.0 |
| Phe | 3.185 ± 1.187 | 1.062 ± 1.093 | 2.654 ± 1.499 | 3.715 ± 1.34 | 2.654 ± 1.402 | 2.654 ± 1.404 | 0.0 ± 0.0 | 3.715 ± 1.385 | 2.123 ± 1.642 | 3.715 ± 1.576 | 3.715 ± 1.123 | 4.246 ± 0.616 | 2.123 ± 1.061 | 2.654 ± 1.687 | 2.654 ± 1.205 | 2.654 ± 0.967 | 4.246 ± 1.242 | 1.062 ± 0.396 | 0.531 ± 0.47 | 2.123 ± 0.932 | 0.0 ± 0.0 |
| Gly | 3.185 ± 0.958 | 1.592 ± 0.892 | 4.777 ± 1.234 | 1.592 ± 0.955 | 2.123 ± 0.997 | 2.654 ± 1.039 | 1.062 ± 0.396 | 2.123 ± 0.606 | 1.062 ± 0.675 | 8.493 ± 1.622 | 1.592 ± 0.604 | 1.592 ± 0.878 | 0.531 ± 0.546 | 1.592 ± 0.781 | 2.654 ± 0.82 | 8.493 ± 2.399 | 3.185 ± 2.293 | 6.369 ± 1.212 | 0.531 ± 0.546 | 3.715 ± 0.767 | 0.0 ± 0.0 |
| His | 1.592 ± 0.836 | 0.531 ± 0.649 | 0.531 ± 0.674 | 0.531 ± 0.493 | 1.062 ± 0.941 | 2.123 ± 0.666 | 0.0 ± 0.0 | 1.062 ± 0.78 | 0.531 ± 0.546 | 1.592 ± 0.69 | 0.0 ± 0.0 | 1.592 ± 1.069 | 1.592 ± 0.923 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.062 ± 0.62 | 2.123 ± 0.768 | 1.062 ± 0.954 | 0.0 ± 0.0 | 1.062 ± 0.596 | 0.0 ± 0.0 |
| Ile | 6.369 ± 1.953 | 0.531 ± 0.546 | 3.715 ± 1.911 | 2.654 ± 1.457 | 1.062 ± 0.675 | 3.715 ± 1.878 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.531 ± 0.587 | 3.715 ± 1.497 | 1.062 ± 0.396 | 1.592 ± 0.62 | 5.839 ± 2.262 | 1.592 ± 0.946 | 3.185 ± 0.978 | 2.654 ± 1.301 | 1.062 ± 0.596 | 1.062 ± 0.596 | 0.531 ± 0.382 | 1.062 ± 0.601 | 0.0 ± 0.0 |
| Lys | 2.654 ± 1.338 | 0.0 ± 0.0 | 1.592 ± 1.172 | 2.654 ± 0.79 | 1.062 ± 0.832 | 2.123 ± 1.128 | 0.531 ± 0.47 | 2.123 ± 0.786 | 0.531 ± 0.493 | 3.185 ± 1.088 | 2.123 ± 0.782 | 2.654 ± 0.778 | 1.062 ± 0.674 | 1.062 ± 0.812 | 2.123 ± 1.642 | 3.715 ± 1.454 | 3.185 ± 1.375 | 0.531 ± 0.47 | 0.0 ± 0.0 | 3.185 ± 3.278 | 0.0 ± 0.0 |
| Leu | 5.839 ± 1.652 | 2.654 ± 1.333 | 5.308 ± 2.24 | 3.185 ± 1.422 | 4.777 ± 1.848 | 10.616 ± 2.199 | 2.123 ± 0.786 | 3.185 ± 1.449 | 2.654 ± 1.351 | 5.839 ± 3.191 | 1.062 ± 1.348 | 5.839 ± 1.417 | 8.493 ± 2.675 | 3.715 ± 0.781 | 2.654 ± 0.846 | 9.554 ± 1.692 | 5.839 ± 1.31 | 4.246 ± 1.763 | 1.062 ± 0.764 | 3.715 ± 1.501 | 0.0 ± 0.0 |
| Met | 4.246 ± 1.7 | 0.0 ± 0.0 | 2.654 ± 1.121 | 0.531 ± 0.47 | 1.062 ± 0.832 | 1.062 ± 0.764 | 0.0 ± 0.0 | 1.592 ± 0.923 | 1.592 ± 1.198 | 1.592 ± 0.69 | 0.531 ± 0.382 | 0.0 ± 0.0 | 2.123 ± 0.763 | 1.592 ± 0.781 | 1.062 ± 0.396 | 3.185 ± 1.888 | 2.123 ± 0.994 | 0.531 ± 0.47 | 0.0 ± 0.0 | 3.185 ± 1.33 | 0.0 ± 0.0 |
| Asn | 4.777 ± 1.42 | 0.0 ± 0.0 | 2.654 ± 1.035 | 4.246 ± 1.159 | 3.715 ± 0.899 | 2.654 ± 0.647 | 0.0 ± 0.0 | 1.062 ± 0.711 | 2.123 ± 1.191 | 3.185 ± 0.924 | 1.592 ± 0.955 | 2.123 ± 0.999 | 3.185 ± 1.174 | 1.062 ± 0.396 | 2.123 ± 1.018 | 2.654 ± 0.908 | 2.654 ± 0.778 | 4.246 ± 1.934 | 1.062 ± 0.812 | 2.123 ± 0.666 | 0.0 ± 0.0 |
| Pro | 4.246 ± 0.884 | 0.531 ± 0.695 | 3.715 ± 0.687 | 3.715 ± 1.814 | 1.592 ± 1.077 | 1.062 ± 0.764 | 1.062 ± 0.832 | 3.715 ± 1.91 | 1.062 ± 1.093 | 3.715 ± 0.81 | 1.592 ± 0.461 | 2.654 ± 1.336 | 2.123 ± 0.997 | 1.592 ± 0.62 | 3.185 ± 1.346 | 11.146 ± 1.742 | 4.246 ± 1.496 | 5.839 ± 3.855 | 0.0 ± 0.0 | 3.185 ± 1.292 | 0.0 ± 0.0 |
| Gln | 2.654 ± 1.938 | 0.0 ± 0.0 | 3.715 ± 0.986 | 1.062 ± 0.62 | 4.246 ± 2.042 | 1.592 ± 0.479 | 1.592 ± 1.056 | 1.592 ± 0.65 | 1.062 ± 0.941 | 2.654 ± 1.938 | 0.531 ± 0.546 | 3.185 ± 1.011 | 0.531 ± 0.382 | 5.839 ± 4.677 | 3.715 ± 0.926 | 4.246 ± 1.583 | 2.123 ± 0.912 | 2.123 ± 0.912 | 0.531 ± 0.47 | 1.062 ± 0.711 | 0.0 ± 0.0 |
| Arg | 4.777 ± 2.012 | 0.0 ± 0.0 | 2.654 ± 0.639 | 2.654 ± 0.973 | 0.531 ± 0.47 | 3.715 ± 1.436 | 0.531 ± 0.47 | 2.123 ± 0.72 | 1.592 ± 0.924 | 8.493 ± 2.684 | 1.592 ± 0.693 | 1.592 ± 0.736 | 4.777 ± 2.671 | 3.715 ± 2.038 | 3.185 ± 0.966 | 3.715 ± 1.152 | 1.592 ± 0.955 | 3.715 ± 1.308 | 0.0 ± 0.0 | 4.246 ± 1.322 | 0.0 ± 0.0 |
| Ser | 5.308 ± 1.478 | 0.0 ± 0.0 | 5.839 ± 1.827 | 3.185 ± 1.543 | 4.777 ± 1.417 | 5.839 ± 1.608 | 3.715 ± 0.73 | 4.246 ± 1.981 | 2.654 ± 1.143 | 9.554 ± 1.415 | 1.592 ± 0.923 | 3.185 ± 0.816 | 7.431 ± 1.992 | 3.715 ± 1.275 | 3.715 ± 1.341 | 7.962 ± 1.828 | 3.715 ± 1.813 | 9.023 ± 2.042 | 0.0 ± 0.0 | 2.654 ± 0.868 | 0.0 ± 0.0 |
| Thr | 6.369 ± 1.139 | 0.531 ± 0.649 | 4.777 ± 1.071 | 2.123 ± 1.286 | 5.839 ± 2.031 | 2.654 ± 1.035 | 1.592 ± 1.077 | 1.062 ± 0.676 | 2.123 ± 0.816 | 6.369 ± 2.514 | 1.592 ± 0.892 | 0.531 ± 0.47 | 3.715 ± 1.521 | 2.123 ± 0.691 | 3.715 ± 1.382 | 2.654 ± 1.211 | 5.308 ± 2.434 | 3.185 ± 0.812 | 0.0 ± 0.0 | 3.185 ± 1.135 | 0.0 ± 0.0 |
| Val | 2.654 ± 0.746 | 2.123 ± 1.749 | 6.9 ± 1.386 | 1.592 ± 1.254 | 1.062 ± 0.613 | 4.246 ± 1.455 | 0.531 ± 0.674 | 3.185 ± 1.335 | 2.123 ± 1.636 | 7.962 ± 1.24 | 2.654 ± 1.967 | 3.715 ± 1.194 | 5.839 ± 1.64 | 1.062 ± 0.941 | 4.777 ± 2.238 | 5.308 ± 1.514 | 2.654 ± 1.248 | 2.654 ± 0.99 | 0.0 ± 0.0 | 1.062 ± 0.88 | 0.0 ± 0.0 |
| Trp | 0.531 ± 0.47 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.531 ± 0.546 | 0.0 ± 0.0 | 1.062 ± 0.396 | 0.0 ± 0.0 | 0.531 ± 0.47 | 1.062 ± 0.675 | 1.062 ± 1.093 | 1.062 ± 0.596 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
| Tyr | 3.715 ± 1.413 | 1.592 ± 0.839 | 3.715 ± 2.125 | 1.062 ± 0.396 | 3.185 ± 1.161 | 2.654 ± 1.053 | 2.123 ± 1.355 | 2.654 ± 1.509 | 2.123 ± 1.106 | 3.715 ± 1.317 | 1.592 ± 0.621 | 3.185 ± 0.986 | 2.654 ± 1.169 | 2.654 ± 0.639 | 3.185 ± 1.493 | 1.592 ± 0.839 | 2.654 ± 0.99 | 3.185 ± 1.505 | 0.531 ± 0.546 | 2.123 ± 0.691 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 9 proteins (1885 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
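A protein-level bootstrap of this kind can be sketched as follows. The source only states that bootstrapping with 100 resamples was done at the protein level; taking the sample standard deviation of the resampled frequencies as the error, the function names `dipeptide_freq_permille` and `bootstrap_error`, the fixed seed, and the toy sequences are assumptions made for illustration.

```python
import random
import statistics


def dipeptide_freq_permille(sequences, dipeptide):
    """Frequency (per mille) of one dipeptide, pooled over the given sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins.

    Each resample draws len(sequences) proteins with replacement, so the
    protein (not the residue) is the unit of resampling.
    Assumption: the error is the sample standard deviation of the estimates.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_freq_permille(resample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKKLA", "AAKLL", "GSGSGS"]  # made-up sequences
    print(bootstrap_error(toy_proteome, "KL"))
    print(bootstrap_error(toy_proteome, "WW"))  # absent dipeptide: 0.0
```

A dipeptide that never occurs has a frequency of zero in every resample, so its estimated error is also zero, which is consistent with the 0.0 ± 0.0 entries in the table.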

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski