Amino acid dipeptide frequency for Tortoise microvirus 22

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
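The counting described above can be sketched in Python. This is a minimal illustration, not the Proteome-pI implementation: the function names, the use of one-letter codes, and the choice to count overlapping pairs within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

# Uniform expectation: 1 of 400 dipeptides = 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues is mapped to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"
```

Under this convention a value such as 4.905‰ would be labelled "over" and 0.701‰ "under", which mirrors the red and blue marking described above.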

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
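As a small worked illustration of the unit conversion, the helpers below (hypothetical names, not part of Proteome-pI) turn a per mille value into a fraction or a percentage.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value / 10.0
```

For example, a table value of 4.905‰ corresponds to a fraction of about 0.0049, or about 0.49%, while the uniform expectation of 0.25% equals 2.5‰.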

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency ± error (‰), listed by first residue (rows), then second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.905 ± 3.995
AlaCys: 0.701 ± 0.534
AlaAsp: 4.905 ± 2.301
AlaGlu: 4.205 ± 2.543
AlaPhe: 2.102 ± 1.212
AlaGly: 9.11 ± 4.221
AlaHis: 0.701 ± 0.653
AlaIle: 1.402 ± 0.57
AlaLys: 1.402 ± 0.782
AlaLeu: 7.008 ± 1.228
AlaMet: 0.0 ± 0.0
AlaAsn: 4.205 ± 1.783
AlaPro: 3.504 ± 1.28
AlaGln: 3.504 ± 2.357
AlaArg: 4.905 ± 2.282
AlaSer: 7.708 ± 3.067
AlaThr: 2.102 ± 1.103
AlaVal: 4.205 ± 0.959
AlaTrp: 0.701 ± 0.534
AlaTyr: 3.504 ± 1.288
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.701 ± 0.661
CysCys: 0.0 ± 0.0
CysAsp: 1.402 ± 1.608
CysGlu: 0.0 ± 0.0
CysPhe: 0.701 ± 0.661
CysGly: 0.701 ± 0.661
CysHis: 0.0 ± 0.0
CysIle: 0.701 ± 0.661
CysLys: 0.701 ± 0.661
CysLeu: 1.402 ± 0.663
CysMet: 0.0 ± 0.0
CysAsn: 0.701 ± 0.661
CysPro: 4.205 ± 2.013
CysGln: 0.0 ± 0.0
CysArg: 0.701 ± 0.661
CysSer: 0.701 ± 0.534
CysThr: 0.0 ± 0.0
CysVal: 0.701 ± 0.661
CysTrp: 0.0 ± 0.0
CysTyr: 0.701 ± 0.661
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.205 ± 1.023
AspCys: 1.402 ± 1.322
AspAsp: 4.905 ± 1.906
AspGlu: 1.402 ± 1.322
AspPhe: 2.803 ± 1.476
AspGly: 2.803 ± 1.352
AspHis: 0.0 ± 0.0
AspIle: 3.504 ± 2.154
AspLys: 1.402 ± 0.782
AspLeu: 8.409 ± 2.647
AspMet: 1.402 ± 0.663
AspAsn: 4.205 ± 1.791
AspPro: 4.905 ± 1.74
AspGln: 3.504 ± 1.347
AspArg: 2.803 ± 1.476
AspSer: 8.409 ± 5.188
AspThr: 4.905 ± 2.893
AspVal: 2.102 ± 1.287
AspTrp: 1.402 ± 0.57
AspTyr: 6.307 ± 1.104
AspXaa: 0.0 ± 0.0

Glu
GluAla: 1.402 ± 1.305
GluCys: 1.402 ± 0.663
GluAsp: 4.905 ± 1.56
GluGlu: 2.803 ± 1.368
GluPhe: 1.402 ± 1.322
GluGly: 0.701 ± 0.653
GluHis: 1.402 ± 0.992
GluIle: 2.803 ± 2.001
GluLys: 0.701 ± 0.534
GluLeu: 2.102 ± 0.479
GluMet: 1.402 ± 1.078
GluAsn: 2.102 ± 1.103
GluPro: 0.701 ± 0.534
GluGln: 2.803 ± 2.211
GluArg: 2.803 ± 2.012
GluSer: 6.307 ± 4.362
GluThr: 2.803 ± 2.032
GluVal: 3.504 ± 1.856
GluTrp: 0.0 ± 0.0
GluTyr: 1.402 ± 0.663
GluXaa: 0.0 ± 0.0

Phe
PheAla: 4.205 ± 2.023
PheCys: 0.701 ± 0.534
PheAsp: 4.205 ± 0.959
PheGlu: 2.102 ± 1.287
PhePhe: 1.402 ± 1.146
PheGly: 1.402 ± 0.663
PheHis: 2.803 ± 2.643
PheIle: 0.701 ± 0.534
PheLys: 2.803 ± 1.476
PheLeu: 3.504 ± 1.131
PheMet: 0.0 ± 0.0
PheAsn: 2.102 ± 1.006
PhePro: 1.402 ± 0.663
PheGln: 1.402 ± 1.068
PheArg: 4.905 ± 3.04
PheSer: 6.307 ± 2.149
PheThr: 0.701 ± 0.661
PheVal: 4.205 ± 1.543
PheTrp: 1.402 ± 0.663
PheTyr: 4.905 ± 1.116
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.205 ± 2.5
GlyCys: 0.0 ± 0.0
GlyAsp: 2.803 ± 1.669
GlyGlu: 0.701 ± 1.028
GlyPhe: 2.803 ± 2.135
GlyGly: 2.803 ± 1.327
GlyHis: 0.701 ± 0.661
GlyIle: 4.205 ± 1.625
GlyLys: 2.803 ± 1.004
GlyLeu: 4.205 ± 1.711
GlyMet: 0.0 ± 0.0
GlyAsn: 1.402 ± 1.305
GlyPro: 1.402 ± 1.068
GlyGln: 2.102 ± 0.891
GlyArg: 4.905 ± 1.425
GlySer: 9.811 ± 1.922
GlyThr: 2.102 ± 1.103
GlyVal: 7.708 ± 1.8
GlyTrp: 1.402 ± 0.663
GlyTyr: 1.402 ± 1.322
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.402 ± 1.305
HisCys: 0.701 ± 0.534
HisAsp: 0.701 ± 0.534
HisGlu: 0.0 ± 0.0
HisPhe: 1.402 ± 1.322
HisGly: 2.102 ± 1.006
HisHis: 2.102 ± 1.601
HisIle: 1.402 ± 1.068
HisLys: 2.102 ± 0.908
HisLeu: 1.402 ± 0.663
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 2.102 ± 1.575
HisGln: 1.402 ± 0.57
HisArg: 1.402 ± 1.322
HisSer: 2.803 ± 1.349
HisThr: 0.0 ± 0.0
HisVal: 1.402 ± 0.57
HisTrp: 0.701 ± 0.661
HisTyr: 0.701 ± 0.534
HisXaa: 0.0 ± 0.0

Ile
IleAla: 3.504 ± 1.084
IleCys: 0.701 ± 0.534
IleAsp: 5.606 ± 1.508
IleGlu: 0.701 ± 0.661
IlePhe: 2.102 ± 1.212
IleGly: 3.504 ± 1.538
IleHis: 2.102 ± 0.479
IleIle: 2.102 ± 1.28
IleLys: 1.402 ± 1.068
IleLeu: 3.504 ± 1.406
IleMet: 2.102 ± 1.28
IleAsn: 1.402 ± 1.068
IlePro: 2.803 ± 1.064
IleGln: 0.701 ± 0.534
IleArg: 2.102 ± 0.908
IleSer: 4.905 ± 1.33
IleThr: 2.102 ± 1.489
IleVal: 2.102 ± 1.575
IleTrp: 0.701 ± 0.661
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.102 ± 1.28
LysCys: 0.701 ± 0.661
LysAsp: 1.402 ± 0.992
LysGlu: 2.102 ± 1.103
LysPhe: 0.701 ± 0.534
LysGly: 0.0 ± 0.0
LysHis: 0.701 ± 0.534
LysIle: 1.402 ± 0.663
LysLys: 0.0 ± 0.0
LysLeu: 2.803 ± 1.349
LysMet: 0.701 ± 0.618
LysAsn: 2.102 ± 1.293
LysPro: 2.803 ± 1.019
LysGln: 1.402 ± 0.57
LysArg: 4.905 ± 1.425
LysSer: 1.402 ± 0.782
LysThr: 1.402 ± 1.068
LysVal: 3.504 ± 0.821
LysTrp: 0.0 ± 0.0
LysTyr: 3.504 ± 2.479
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 9.11 ± 2.986
LeuCys: 2.102 ± 1.475
LeuAsp: 5.606 ± 1.626
LeuGlu: 1.402 ± 0.782
LeuPhe: 3.504 ± 2.21
LeuGly: 7.708 ± 1.028
LeuHis: 2.102 ± 1.006
LeuIle: 5.606 ± 1.235
LeuLys: 0.701 ± 0.653
LeuLeu: 7.008 ± 2.47
LeuMet: 3.504 ± 1.884
LeuAsn: 6.307 ± 0.911
LeuPro: 4.905 ± 2.21
LeuGln: 3.504 ± 0.821
LeuArg: 8.409 ± 2.138
LeuSer: 9.811 ± 1.851
LeuThr: 3.504 ± 1.631
LeuVal: 4.205 ± 1.582
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.402 ± 1.087
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.402 ± 1.305
MetCys: 0.0 ± 0.0
MetAsp: 2.102 ± 1.212
MetGlu: 0.701 ± 1.052
MetPhe: 0.701 ± 0.534
MetGly: 0.701 ± 0.534
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 1.402 ± 0.663
MetLeu: 2.803 ± 0.783
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 2.102 ± 1.006
MetGln: 0.0 ± 0.0
MetArg: 0.701 ± 1.028
MetSer: 1.402 ± 0.782
MetThr: 1.402 ± 0.992
MetVal: 0.701 ± 0.534
MetTrp: 0.0 ± 0.0
MetTyr: 0.701 ± 0.534
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.504 ± 1.288
AsnCys: 0.701 ± 0.661
AsnAsp: 3.504 ± 0.821
AsnGlu: 2.803 ± 0.933
AsnPhe: 3.504 ± 1.398
AsnGly: 7.008 ± 1.238
AsnHis: 1.402 ± 0.57
AsnIle: 0.701 ± 0.653
AsnLys: 2.803 ± 1.468
AsnLeu: 4.205 ± 0.943
AsnMet: 0.701 ± 0.534
AsnAsn: 1.402 ± 0.57
AsnPro: 2.803 ± 1.836
AsnGln: 0.0 ± 0.0
AsnArg: 2.102 ± 1.103
AsnSer: 2.102 ± 0.891
AsnThr: 0.701 ± 0.534
AsnVal: 2.102 ± 1.958
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.102 ± 0.891
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.102 ± 1.601
ProCys: 0.0 ± 0.0
ProAsp: 6.307 ± 2.149
ProGlu: 2.102 ± 1.028
ProPhe: 2.803 ± 1.823
ProGly: 2.803 ± 0.646
ProHis: 1.402 ± 1.047
ProIle: 1.402 ± 2.057
ProLys: 2.803 ± 1.327
ProLeu: 5.606 ± 2.62
ProMet: 0.0 ± 0.0
ProAsn: 2.803 ± 1.468
ProPro: 2.803 ± 1.836
ProGln: 2.102 ± 1.293
ProArg: 3.504 ± 1.213
ProSer: 2.102 ± 1.601
ProThr: 1.402 ± 1.322
ProVal: 4.905 ± 2.205
ProTrp: 0.0 ± 0.0
ProTyr: 2.803 ± 0.646
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.905 ± 4.048
GlnCys: 0.0 ± 0.0
GlnAsp: 2.102 ± 0.891
GlnGlu: 2.102 ± 1.006
GlnPhe: 2.803 ± 1.141
GlnGly: 0.701 ± 0.534
GlnHis: 0.0 ± 0.0
GlnIle: 3.504 ± 1.538
GlnLys: 1.402 ± 0.57
GlnLeu: 2.102 ± 1.958
GlnMet: 0.701 ± 1.028
GlnAsn: 2.803 ± 1.141
GlnPro: 0.701 ± 0.534
GlnGln: 0.701 ± 0.534
GlnArg: 2.102 ± 0.891
GlnSer: 4.905 ± 3.114
GlnThr: 3.504 ± 1.398
GlnVal: 2.102 ± 0.479
GlnTrp: 0.0 ± 0.0
GlnTyr: 3.504 ± 1.084
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.606 ± 2.192
ArgCys: 0.0 ± 0.0
ArgAsp: 2.102 ± 1.983
ArgGlu: 3.504 ± 2.039
ArgPhe: 5.606 ± 1.9
ArgGly: 2.803 ± 1.327
ArgHis: 2.803 ± 1.004
ArgIle: 2.102 ± 1.212
ArgLys: 2.803 ± 1.899
ArgLeu: 7.008 ± 2.563
ArgMet: 0.0 ± 0.0
ArgAsn: 1.402 ± 1.047
ArgPro: 1.402 ± 1.322
ArgGln: 2.803 ± 1.721
ArgArg: 2.803 ± 1.327
ArgSer: 6.307 ± 1.945
ArgThr: 2.803 ± 1.064
ArgVal: 4.205 ± 1.783
ArgTrp: 0.701 ± 0.534
ArgTyr: 8.409 ± 3.06
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.307 ± 2.943
SerCys: 0.701 ± 0.661
SerAsp: 8.409 ± 2.726
SerGlu: 8.409 ± 4.98
SerPhe: 6.307 ± 2.402
SerGly: 4.905 ± 1.941
SerHis: 1.402 ± 1.146
SerIle: 6.307 ± 0.988
SerLys: 2.803 ± 0.933
SerLeu: 13.315 ± 0.673
SerMet: 2.102 ± 1.824
SerAsn: 2.102 ± 1.28
SerPro: 4.205 ± 0.79
SerGln: 5.606 ± 3.873
SerArg: 2.803 ± 1.064
SerSer: 18.22 ± 8.696
SerThr: 4.205 ± 1.035
SerVal: 9.811 ± 3.574
SerTrp: 0.701 ± 0.661
SerTyr: 2.803 ± 0.855
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.205 ± 1.783
ThrCys: 0.0 ± 0.0
ThrAsp: 1.402 ± 2.057
ThrGlu: 1.402 ± 0.663
ThrPhe: 2.803 ± 2.085
ThrGly: 2.102 ± 0.479
ThrHis: 0.0 ± 0.0
ThrIle: 0.701 ± 0.653
ThrLys: 2.803 ± 1.004
ThrLeu: 4.905 ± 1.41
ThrMet: 0.0 ± 0.0
ThrAsn: 0.0 ± 0.0
ThrPro: 0.701 ± 0.534
ThrGln: 2.102 ± 0.891
ThrArg: 4.905 ± 3.114
ThrSer: 3.504 ± 2.045
ThrThr: 2.102 ± 0.891
ThrVal: 4.905 ± 2.326
ThrTrp: 0.701 ± 0.534
ThrTyr: 1.402 ± 1.068
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.803 ± 1.543
ValCys: 2.102 ± 1.011
ValAsp: 4.905 ± 2.998
ValGlu: 3.504 ± 1.398
ValPhe: 2.803 ± 1.14
ValGly: 2.803 ± 1.354
ValHis: 3.504 ± 2.045
ValIle: 2.102 ± 1.011
ValLys: 2.102 ± 0.891
ValLeu: 5.606 ± 0.6
ValMet: 2.102 ± 1.006
ValAsn: 4.205 ± 1.783
ValPro: 5.606 ± 3.098
ValGln: 5.606 ± 2.282
ValArg: 2.102 ± 0.479
ValSer: 7.708 ± 2.067
ValThr: 2.803 ± 1.899
ValVal: 2.102 ± 1.287
ValTrp: 1.402 ± 0.663
ValTyr: 2.102 ± 0.479
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.701 ± 0.534
TrpCys: 0.701 ± 0.661
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.402 ± 1.305
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.402 ± 1.068
TrpLys: 0.0 ± 0.0
TrpLeu: 0.701 ± 0.534
TrpMet: 0.0 ± 0.0
TrpAsn: 0.701 ± 0.653
TrpPro: 0.0 ± 0.0
TrpGln: 0.701 ± 0.534
TrpArg: 1.402 ± 1.322
TrpSer: 2.102 ± 1.212
TrpThr: 0.701 ± 0.534
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.701 ± 0.661
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.504 ± 1.35
TyrCys: 1.402 ± 1.322
TyrAsp: 3.504 ± 1.758
TyrGlu: 3.504 ± 1.084
TyrPhe: 3.504 ± 1.465
TyrGly: 2.803 ± 1.004
TyrHis: 0.701 ± 0.661
TyrIle: 2.102 ± 1.212
TyrLys: 1.402 ± 1.146
TyrLeu: 2.803 ± 1.14
TyrMet: 1.402 ± 0.641
TyrAsn: 4.205 ± 1.711
TyrPro: 0.701 ± 0.661
TyrGln: 0.701 ± 0.534
TyrArg: 4.905 ± 2.93
TyrSer: 4.905 ± 1.33
TyrThr: 1.402 ± 0.663
TyrVal: 3.504 ± 1.465
TyrTrp: 1.402 ± 1.305
TyrTyr: 4.205 ± 2.48
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 5 proteins (1428 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
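A protein-level bootstrap of this kind could look roughly like the sketch below. The exact procedure used by Proteome-pI is not described here, so everything in the code is an assumption: the function names, resampling whole proteins with replacement, counting pairs within each protein only, and reporting the standard deviation of the resampled frequencies as the error.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over adjacent pairs within each protein."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair  # True counts as 1, False as 0
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)
```

With only 5 proteins in the proteome, each resample can differ substantially from the original set, which is one plausible reason why some errors in the table are nearly as large as the values themselves. A dipeptide that never occurs gets an error of 0.0 in every resample, consistent with the 0.0 ± 0.0 entries.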

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski