Amino acid dipepetide frequency for Tortoise microvirus 111

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.071AlaAla: 8.071 ± 2.717
1.614AlaCys: 1.614 ± 1.596
4.036AlaAsp: 4.036 ± 1.108
5.65AlaGlu: 5.65 ± 4.01
2.421AlaPhe: 2.421 ± 1.122
5.65AlaGly: 5.65 ± 2.482
1.614AlaHis: 1.614 ± 0.638
2.421AlaIle: 2.421 ± 1.079
3.228AlaLys: 3.228 ± 3.192
8.878AlaLeu: 8.878 ± 2.938
0.807AlaMet: 0.807 ± 0.901
4.036AlaAsn: 4.036 ± 1.947
4.843AlaPro: 4.843 ± 2.243
2.421AlaGln: 2.421 ± 1.103
4.843AlaArg: 4.843 ± 1.987
5.65AlaSer: 5.65 ± 2.305
1.614AlaThr: 1.614 ± 1.059
8.071AlaVal: 8.071 ± 3.101
2.421AlaTrp: 2.421 ± 1.544
4.036AlaTyr: 4.036 ± 1.712
0.0AlaXaa: 0.0 ± 0.0
Cys
2.421CysAla: 2.421 ± 1.726
0.807CysCys: 0.807 ± 0.798
0.807CysAsp: 0.807 ± 1.006
0.0CysGlu: 0.0 ± 0.0
0.807CysPhe: 0.807 ± 0.798
1.614CysGly: 1.614 ± 1.596
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.807CysLys: 0.807 ± 0.798
2.421CysLeu: 2.421 ± 1.35
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.614CysPro: 1.614 ± 1.596
0.0CysGln: 0.0 ± 0.0
0.807CysArg: 0.807 ± 1.136
0.0CysSer: 0.0 ± 0.0
0.807CysThr: 0.807 ± 0.798
1.614CysVal: 1.614 ± 1.059
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.228AspAla: 3.228 ± 1.015
0.0AspCys: 0.0 ± 0.0
2.421AspAsp: 2.421 ± 1.35
4.843AspGlu: 4.843 ± 1.838
0.807AspPhe: 0.807 ± 0.515
3.228AspGly: 3.228 ± 1.696
0.807AspHis: 0.807 ± 0.515
2.421AspIle: 2.421 ± 1.845
3.228AspLys: 3.228 ± 2.395
9.685AspLeu: 9.685 ± 1.123
1.614AspMet: 1.614 ± 0.869
4.036AspAsn: 4.036 ± 1.598
2.421AspPro: 2.421 ± 1.217
1.614AspGln: 1.614 ± 1.03
0.807AspArg: 0.807 ± 0.515
2.421AspSer: 2.421 ± 0.827
4.036AspThr: 4.036 ± 0.925
3.228AspVal: 3.228 ± 1.202
0.807AspTrp: 0.807 ± 0.515
5.65AspTyr: 5.65 ± 2.009
0.0AspXaa: 0.0 ± 0.0
Glu
2.421GluAla: 2.421 ± 1.27
0.807GluCys: 0.807 ± 0.798
0.807GluAsp: 0.807 ± 0.515
2.421GluGlu: 2.421 ± 2.003
0.807GluPhe: 0.807 ± 0.515
4.036GluGly: 4.036 ± 2.673
1.614GluHis: 1.614 ± 1.03
2.421GluIle: 2.421 ± 1.467
3.228GluLys: 3.228 ± 2.83
4.843GluLeu: 4.843 ± 2.748
0.807GluMet: 0.807 ± 0.891
1.614GluAsn: 1.614 ± 0.908
3.228GluPro: 3.228 ± 2.111
4.036GluGln: 4.036 ± 1.507
5.65GluArg: 5.65 ± 1.405
1.614GluSer: 1.614 ± 1.353
4.036GluThr: 4.036 ± 2.7
0.807GluVal: 0.807 ± 0.798
1.614GluTrp: 1.614 ± 0.638
4.843GluTyr: 4.843 ± 1.791
0.0GluXaa: 0.0 ± 0.0
Phe
3.228PheAla: 3.228 ± 1.275
0.0PheCys: 0.0 ± 0.0
3.228PheAsp: 3.228 ± 1.125
1.614PheGlu: 1.614 ± 2.011
1.614PhePhe: 1.614 ± 0.638
4.843PheGly: 4.843 ± 1.058
0.0PheHis: 0.0 ± 0.0
0.807PheIle: 0.807 ± 0.515
0.0PheLys: 0.0 ± 0.0
4.843PheLeu: 4.843 ± 0.972
0.0PheMet: 0.0 ± 0.0
0.807PheAsn: 0.807 ± 0.798
0.0PhePro: 0.0 ± 0.0
0.807PheGln: 0.807 ± 0.515
2.421PheArg: 2.421 ± 1.544
0.807PheSer: 0.807 ± 0.885
3.228PheThr: 3.228 ± 1.24
5.65PheVal: 5.65 ± 1.478
0.807PheTrp: 0.807 ± 0.515
0.807PheTyr: 0.807 ± 0.515
0.0PheXaa: 0.0 ± 0.0
Gly
4.036GlyAla: 4.036 ± 1.182
0.807GlyCys: 0.807 ± 1.006
6.457GlyAsp: 6.457 ± 1.362
5.65GlyGlu: 5.65 ± 1.847
1.614GlyPhe: 1.614 ± 0.638
4.843GlyGly: 4.843 ± 2.243
1.614GlyHis: 1.614 ± 1.353
3.228GlyIle: 3.228 ± 1.24
0.807GlyLys: 0.807 ± 1.006
3.228GlyLeu: 3.228 ± 0.831
1.614GlyMet: 1.614 ± 1.03
2.421GlyAsn: 2.421 ± 0.841
0.0GlyPro: 0.0 ± 0.0
4.036GlyGln: 4.036 ± 0.603
1.614GlyArg: 1.614 ± 1.77
9.685GlySer: 9.685 ± 4.486
6.457GlyThr: 6.457 ± 2.574
5.65GlyVal: 5.65 ± 2.131
0.807GlyTrp: 0.807 ± 0.798
4.036GlyTyr: 4.036 ± 1.031
0.0GlyXaa: 0.0 ± 0.0
His
1.614HisAla: 1.614 ± 1.665
0.0HisCys: 0.0 ± 0.0
0.807HisAsp: 0.807 ± 0.515
0.807HisGlu: 0.807 ± 0.798
1.614HisPhe: 1.614 ± 1.03
2.421HisGly: 2.421 ± 0.841
0.807HisHis: 0.807 ± 0.515
1.614HisIle: 1.614 ± 1.353
0.807HisLys: 0.807 ± 0.515
0.0HisLeu: 0.0 ± 0.0
0.807HisMet: 0.807 ± 0.798
1.614HisAsn: 1.614 ± 0.638
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.807HisArg: 0.807 ± 0.798
0.0HisSer: 0.0 ± 0.0
2.421HisThr: 2.421 ± 1.35
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.807HisTyr: 0.807 ± 0.798
0.0HisXaa: 0.0 ± 0.0
Ile
4.843IleAla: 4.843 ± 2.723
1.614IleCys: 1.614 ± 1.665
4.036IleAsp: 4.036 ± 1.194
1.614IleGlu: 1.614 ± 0.638
3.228IlePhe: 3.228 ± 1.696
4.843IleGly: 4.843 ± 1.515
0.0IleHis: 0.0 ± 0.0
2.421IleIle: 2.421 ± 0.827
1.614IleLys: 1.614 ± 1.099
1.614IleLeu: 1.614 ± 1.03
0.807IleMet: 0.807 ± 0.798
3.228IleAsn: 3.228 ± 2.462
1.614IlePro: 1.614 ± 2.011
1.614IleGln: 1.614 ± 1.146
2.421IleArg: 2.421 ± 1.35
4.036IleSer: 4.036 ± 1.815
1.614IleThr: 1.614 ± 1.302
0.807IleVal: 0.807 ± 1.006
0.0IleTrp: 0.0 ± 0.0
3.228IleTyr: 3.228 ± 1.24
0.0IleXaa: 0.0 ± 0.0
Lys
1.614LysAla: 1.614 ± 1.596
0.0LysCys: 0.0 ± 0.0
1.614LysAsp: 1.614 ± 0.638
1.614LysGlu: 1.614 ± 1.353
0.0LysPhe: 0.0 ± 0.0
1.614LysGly: 1.614 ± 0.908
0.0LysHis: 0.0 ± 0.0
3.228LysIle: 3.228 ± 2.291
3.228LysLys: 3.228 ± 2.291
4.843LysLeu: 4.843 ± 3.702
2.421LysMet: 2.421 ± 1.276
0.0LysAsn: 0.0 ± 0.0
2.421LysPro: 2.421 ± 1.544
1.614LysGln: 1.614 ± 0.908
2.421LysArg: 2.421 ± 2.394
4.036LysSer: 4.036 ± 0.603
1.614LysThr: 1.614 ± 0.908
2.421LysVal: 2.421 ± 1.35
0.0LysTrp: 0.0 ± 0.0
3.228LysTyr: 3.228 ± 1.308
0.0LysXaa: 0.0 ± 0.0
Leu
4.843LeuAla: 4.843 ± 0.857
0.0LeuCys: 0.0 ± 0.0
5.65LeuAsp: 5.65 ± 2.009
8.071LeuGlu: 8.071 ± 4.031
2.421LeuPhe: 2.421 ± 0.827
5.65LeuGly: 5.65 ± 1.397
1.614LeuHis: 1.614 ± 0.638
4.036LeuIle: 4.036 ± 1.359
1.614LeuLys: 1.614 ± 0.638
2.421LeuLeu: 2.421 ± 1.35
0.807LeuMet: 0.807 ± 0.798
4.036LeuAsn: 4.036 ± 1.359
4.036LeuPro: 4.036 ± 1.702
10.492LeuGln: 10.492 ± 3.76
5.65LeuArg: 5.65 ± 1.806
6.457LeuSer: 6.457 ± 2.25
6.457LeuThr: 6.457 ± 1.733
3.228LeuVal: 3.228 ± 1.275
0.0LeuTrp: 0.0 ± 0.0
3.228LeuTyr: 3.228 ± 1.595
0.0LeuXaa: 0.0 ± 0.0
Met
0.807MetAla: 0.807 ± 0.798
0.0MetCys: 0.0 ± 0.0
1.614MetAsp: 1.614 ± 1.099
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
2.421MetGly: 2.421 ± 1.122
0.807MetHis: 0.807 ± 1.136
0.807MetIle: 0.807 ± 0.798
1.614MetLys: 1.614 ± 1.146
2.421MetLeu: 2.421 ± 0.827
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.807MetPro: 0.807 ± 0.515
2.421MetGln: 2.421 ± 1.217
1.614MetArg: 1.614 ± 1.03
0.807MetSer: 0.807 ± 0.798
0.807MetThr: 0.807 ± 0.515
0.807MetVal: 0.807 ± 0.798
0.0MetTrp: 0.0 ± 0.0
1.614MetTyr: 1.614 ± 1.059
0.0MetXaa: 0.0 ± 0.0
Asn
5.65AsnAla: 5.65 ± 2.142
0.0AsnCys: 0.0 ± 0.0
0.807AsnAsp: 0.807 ± 0.515
4.036AsnGlu: 4.036 ± 2.729
0.0AsnPhe: 0.0 ± 0.0
1.614AsnGly: 1.614 ± 1.03
0.0AsnHis: 0.0 ± 0.0
1.614AsnIle: 1.614 ± 1.03
1.614AsnLys: 1.614 ± 1.03
4.843AsnLeu: 4.843 ± 1.328
1.614AsnMet: 1.614 ± 1.105
4.843AsnAsn: 4.843 ± 4.825
4.036AsnPro: 4.036 ± 0.603
0.0AsnGln: 0.0 ± 0.0
2.421AsnArg: 2.421 ± 1.122
1.614AsnSer: 1.614 ± 1.596
2.421AsnThr: 2.421 ± 0.827
5.65AsnVal: 5.65 ± 2.228
0.0AsnTrp: 0.0 ± 0.0
1.614AsnTyr: 1.614 ± 0.638
0.0AsnXaa: 0.0 ± 0.0
Pro
3.228ProAla: 3.228 ± 1.513
0.0ProCys: 0.0 ± 0.0
2.421ProAsp: 2.421 ± 1.699
1.614ProGlu: 1.614 ± 0.908
3.228ProPhe: 3.228 ± 1.275
3.228ProGly: 3.228 ± 0.831
0.807ProHis: 0.807 ± 0.798
2.421ProIle: 2.421 ± 1.544
0.0ProLys: 0.0 ± 0.0
1.614ProLeu: 1.614 ± 1.03
1.614ProMet: 1.614 ± 0.638
0.807ProAsn: 0.807 ± 0.515
2.421ProPro: 2.421 ± 1.079
3.228ProGln: 3.228 ± 1.364
1.614ProArg: 1.614 ± 0.638
5.65ProSer: 5.65 ± 1.806
2.421ProThr: 2.421 ± 0.794
8.071ProVal: 8.071 ± 2.683
0.807ProTrp: 0.807 ± 1.136
3.228ProTyr: 3.228 ± 1.428
0.0ProXaa: 0.0 ± 0.0
Gln
1.614GlnAla: 1.614 ± 1.596
0.807GlnCys: 0.807 ± 0.798
1.614GlnAsp: 1.614 ± 0.638
5.65GlnGlu: 5.65 ± 2.969
1.614GlnPhe: 1.614 ± 1.059
3.228GlnGly: 3.228 ± 1.54
0.0GlnHis: 0.0 ± 0.0
2.421GlnIle: 2.421 ± 3.017
4.036GlnLys: 4.036 ± 1.361
4.843GlnLeu: 4.843 ± 1.603
0.0GlnMet: 0.0 ± 0.0
2.421GlnAsn: 2.421 ± 1.103
3.228GlnPro: 3.228 ± 2.059
3.228GlnGln: 3.228 ± 1.24
5.65GlnArg: 5.65 ± 2.266
4.036GlnSer: 4.036 ± 1.507
1.614GlnThr: 1.614 ± 0.908
2.421GlnVal: 2.421 ± 1.122
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
4.036ArgAla: 4.036 ± 1.498
1.614ArgCys: 1.614 ± 1.596
4.036ArgAsp: 4.036 ± 1.814
0.807ArgGlu: 0.807 ± 0.798
4.843ArgPhe: 4.843 ± 1.862
3.228ArgGly: 3.228 ± 1.739
0.807ArgHis: 0.807 ± 0.798
3.228ArgIle: 3.228 ± 0.691
3.228ArgLys: 3.228 ± 2.124
5.65ArgLeu: 5.65 ± 1.722
0.0ArgMet: 0.0 ± 0.0
2.421ArgAsn: 2.421 ± 1.103
4.036ArgPro: 4.036 ± 2.016
2.421ArgGln: 2.421 ± 1.987
6.457ArgArg: 6.457 ± 3.047
3.228ArgSer: 3.228 ± 1.24
3.228ArgThr: 3.228 ± 1.24
4.036ArgVal: 4.036 ± 1.401
0.807ArgTrp: 0.807 ± 0.798
4.843ArgTyr: 4.843 ± 1.2
0.0ArgXaa: 0.0 ± 0.0
Ser
12.107SerAla: 12.107 ± 5.283
1.614SerCys: 1.614 ± 0.638
1.614SerAsp: 1.614 ± 1.059
1.614SerGlu: 1.614 ± 1.059
2.421SerPhe: 2.421 ± 1.08
4.036SerGly: 4.036 ± 1.94
1.614SerHis: 1.614 ± 1.146
5.65SerIle: 5.65 ± 1.847
3.228SerLys: 3.228 ± 1.24
4.843SerLeu: 4.843 ± 1.2
1.614SerMet: 1.614 ± 1.099
0.807SerAsn: 0.807 ± 0.515
4.843SerPro: 4.843 ± 0.972
3.228SerGln: 3.228 ± 1.428
8.071SerArg: 8.071 ± 1.207
8.071SerSer: 8.071 ± 3.926
5.65SerThr: 5.65 ± 1.268
6.457SerVal: 6.457 ± 4.118
0.807SerTrp: 0.807 ± 0.885
3.228SerTyr: 3.228 ± 0.831
0.0SerXaa: 0.0 ± 0.0
Thr
5.65ThrAla: 5.65 ± 1.543
0.0ThrCys: 0.0 ± 0.0
7.264ThrAsp: 7.264 ± 1.404
1.614ThrGlu: 1.614 ± 1.146
2.421ThrPhe: 2.421 ± 1.544
3.228ThrGly: 3.228 ± 1.202
0.807ThrHis: 0.807 ± 0.798
1.614ThrIle: 1.614 ± 1.665
3.228ThrLys: 3.228 ± 0.899
4.036ThrLeu: 4.036 ± 1.031
0.0ThrMet: 0.0 ± 0.0
1.614ThrAsn: 1.614 ± 1.03
2.421ThrPro: 2.421 ± 1.335
0.807ThrGln: 0.807 ± 1.006
2.421ThrArg: 2.421 ± 1.35
8.071ThrSer: 8.071 ± 3.727
4.843ThrThr: 4.843 ± 1.719
6.457ThrVal: 6.457 ± 2.224
0.807ThrTrp: 0.807 ± 0.515
3.228ThrTyr: 3.228 ± 1.24
0.0ThrXaa: 0.0 ± 0.0
Val
6.457ValAla: 6.457 ± 2.172
1.614ValCys: 1.614 ± 0.638
5.65ValAsp: 5.65 ± 1.165
3.228ValGlu: 3.228 ± 2.117
0.807ValPhe: 0.807 ± 0.798
5.65ValGly: 5.65 ± 1.617
0.807ValHis: 0.807 ± 0.515
3.228ValIle: 3.228 ± 0.831
1.614ValLys: 1.614 ± 1.596
4.036ValLeu: 4.036 ± 1.031
2.421ValMet: 2.421 ± 1.217
6.457ValAsn: 6.457 ± 2.479
4.036ValPro: 4.036 ± 2.574
2.421ValGln: 2.421 ± 1.103
2.421ValArg: 2.421 ± 1.079
7.264ValSer: 7.264 ± 2.056
4.843ValThr: 4.843 ± 1.324
4.843ValVal: 4.843 ± 2.867
0.807ValTrp: 0.807 ± 0.515
5.65ValTyr: 5.65 ± 1.165
0.0ValXaa: 0.0 ± 0.0
Trp
1.614TrpAla: 1.614 ± 1.03
0.807TrpCys: 0.807 ± 1.136
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.807TrpPhe: 0.807 ± 0.515
0.0TrpGly: 0.0 ± 0.0
0.807TrpHis: 0.807 ± 0.515
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.807TrpLeu: 0.807 ± 0.885
0.807TrpMet: 0.807 ± 0.515
0.0TrpAsn: 0.0 ± 0.0
0.807TrpPro: 0.807 ± 0.515
0.0TrpGln: 0.0 ± 0.0
1.614TrpArg: 1.614 ± 0.638
2.421TrpSer: 2.421 ± 1.35
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.807TrpTyr: 0.807 ± 0.515
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.843TyrAla: 4.843 ± 1.937
2.421TyrCys: 2.421 ± 1.909
2.421TyrAsp: 2.421 ± 0.794
0.807TyrGlu: 0.807 ± 1.006
4.036TyrPhe: 4.036 ± 1.182
3.228TyrGly: 3.228 ± 2.197
2.421TyrHis: 2.421 ± 1.35
1.614TyrIle: 1.614 ± 1.146
1.614TyrLys: 1.614 ± 0.638
4.843TyrLeu: 4.843 ± 1.328
0.807TyrMet: 0.807 ± 0.515
3.228TyrAsn: 3.228 ± 2.059
1.614TyrPro: 1.614 ± 1.03
4.036TyrGln: 4.036 ± 1.498
3.228TyrArg: 3.228 ± 1.923
5.65TyrSer: 5.65 ± 2.8
2.421TyrThr: 2.421 ± 0.827
4.036TyrVal: 4.036 ± 0.925
0.807TyrTrp: 0.807 ± 0.515
4.036TyrTyr: 4.036 ± 1.702
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1240 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski