Amino acid dipeptide frequency for Tortoise microvirus 30

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
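As a rough illustration of the idea, dipeptide frequencies can be obtained by sliding a window of two residues along each protein and counting the pairs. The sketch below is a minimal example under stated assumptions: the function name `dipeptide_frequencies` and the toy sequences are hypothetical, one-letter codes are used for brevity, and normalising by the total number of counted pairs is an assumption, not necessarily the exact procedure used by the database.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of counted pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy sequences, not taken from the Tortoise microvirus 30 proteome.
freqs = dipeptide_frequencies(["MKKLA", "AKKE"])
```

Here the two toy sequences contain seven pairs in total, two of which are KK.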

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
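The arithmetic behind the expected value can be written out as a short sketch. The `classify` helper is hypothetical, and comparing each value directly against the uniform expectation is an assumption; the page does not state the exact rule used for the red and blue marking.

```python
N_STANDARD = 20  # standard amino acids
N_WITH_XAA = 21  # with Xaa as an additional, 21st symbol

# Uniform expectation: every ordered pair is equally likely.
expected_fraction = 1 / N_STANDARD ** 2        # 1/400 = 0.0025 = 0.25 %
expected_per_mille = 1000 * expected_fraction  # 2.5 per mille
expected_per_mille_xaa = 1000 / N_WITH_XAA ** 2  # 1000/441, about 2.27 per mille


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a value as over- or underrepresented (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Two values from the table: LysGlu (12.889) and AlaCys (0.758).
labels = {"LysGlu": classify(12.889), "AlaCys": classify(0.758)}
```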

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
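For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are hypothetical and serve only to make the unit explicit.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value * 0.1


# AlaAla is listed as 3.791 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(3.791)
```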

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.791 ± 2.721
AlaCys: 0.758 ± 0.697
AlaAsp: 4.549 ± 1.562
AlaGlu: 5.307 ± 2.447
AlaPhe: 1.516 ± 0.84
AlaGly: 5.307 ± 1.836
AlaHis: 0.758 ± 0.817
AlaIle: 3.033 ± 1.597
AlaLys: 5.307 ± 2.103
AlaLeu: 5.307 ± 3.735
AlaMet: 0.758 ± 0.534
AlaAsn: 4.549 ± 2.239
AlaPro: 1.516 ± 1.067
AlaGln: 3.791 ± 3.115
AlaArg: 4.549 ± 2.056
AlaSer: 1.516 ± 0.746
AlaThr: 4.549 ± 1.368
AlaVal: 4.549 ± 1.291
AlaTrp: 1.516 ± 1.067
AlaTyr: 2.274 ± 1.039
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.758 ± 0.534
CysCys: 0.758 ± 0.697
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 2.274 ± 1.261
CysHis: 0.758 ± 0.534
CysIle: 0.758 ± 0.697
CysLys: 0.758 ± 0.697
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.516 ± 1.393
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 1.516 ± 1.393
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.791 ± 1.909
AspCys: 0.0 ± 0.0
AspAsp: 3.033 ± 0.759
AspGlu: 3.033 ± 0.709
AspPhe: 1.516 ± 0.746
AspGly: 1.516 ± 1.244
AspHis: 0.758 ± 0.534
AspIle: 7.582 ± 2.525
AspLys: 3.791 ± 2.328
AspLeu: 6.065 ± 1.115
AspMet: 2.274 ± 1.011
AspAsn: 3.791 ± 3.063
AspPro: 3.033 ± 2.199
AspGln: 3.033 ± 1.679
AspArg: 3.033 ± 1.43
AspSer: 2.274 ± 2.1
AspThr: 1.516 ± 1.067
AspVal: 1.516 ± 1.067
AspTrp: 0.0 ± 0.0
AspTyr: 3.033 ± 1.369
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.582 ± 4.403
GluCys: 0.0 ± 0.0
GluAsp: 0.758 ± 0.949
GluGlu: 7.582 ± 6.366
GluPhe: 3.791 ± 1.102
GluGly: 1.516 ± 0.673
GluHis: 2.274 ± 0.781
GluIle: 6.065 ± 0.951
GluLys: 3.033 ± 1.065
GluLeu: 6.065 ± 3.191
GluMet: 2.274 ± 1.488
GluAsn: 5.307 ± 1.084
GluPro: 2.274 ± 1.601
GluGln: 4.549 ± 2.436
GluArg: 2.274 ± 1.008
GluSer: 6.065 ± 2.786
GluThr: 3.033 ± 1.36
GluVal: 1.516 ± 0.84
GluTrp: 1.516 ± 1.244
GluTyr: 3.033 ± 0.709
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.274 ± 1.455
PheCys: 1.516 ± 1.393
PheAsp: 4.549 ± 1.901
PheGlu: 3.033 ± 1.559
PhePhe: 0.758 ± 0.534
PheGly: 1.516 ± 0.888
PheHis: 0.0 ± 0.0
PheIle: 3.791 ± 1.612
PheLys: 2.274 ± 1.218
PheLeu: 2.274 ± 1.111
PheMet: 0.758 ± 0.994
PheAsn: 6.065 ± 1.641
PhePro: 3.033 ± 1.448
PheGln: 0.0 ± 0.0
PheArg: 0.758 ± 0.534
PheSer: 0.0 ± 0.0
PheThr: 3.033 ± 1.833
PheVal: 0.0 ± 0.0
PheTrp: 0.758 ± 0.697
PheTyr: 1.516 ± 0.673
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.549 ± 2.887
GlyCys: 0.0 ± 0.0
GlyAsp: 3.033 ± 1.422
GlyGlu: 3.791 ± 1.928
GlyPhe: 2.274 ± 0.995
GlyGly: 2.274 ± 1.455
GlyHis: 0.758 ± 0.534
GlyIle: 5.307 ± 1.567
GlyLys: 2.274 ± 1.261
GlyLeu: 5.307 ± 1.396
GlyMet: 0.758 ± 0.817
GlyAsn: 2.274 ± 0.6
GlyPro: 1.516 ± 1.067
GlyGln: 4.549 ± 1.921
GlyArg: 0.758 ± 0.534
GlySer: 3.033 ± 0.759
GlyThr: 4.549 ± 1.901
GlyVal: 2.274 ± 1.601
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.274 ± 1.218
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.033 ± 1.776
HisCys: 0.0 ± 0.0
HisAsp: 1.516 ± 0.746
HisGlu: 1.516 ± 0.673
HisPhe: 1.516 ± 0.746
HisGly: 0.758 ± 0.534
HisHis: 0.0 ± 0.0
HisIle: 0.758 ± 0.534
HisLys: 1.516 ± 0.888
HisLeu: 3.791 ± 1.102
HisMet: 0.758 ± 0.817
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 1.516 ± 1.096
HisArg: 0.758 ± 0.534
HisSer: 0.758 ± 0.697
HisThr: 3.033 ± 0.768
HisVal: 0.0 ± 0.0
HisTrp: 0.0 ± 0.0
HisTyr: 1.516 ± 1.393
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.549 ± 1.163
IleCys: 0.758 ± 0.534
IleAsp: 4.549 ± 1.201
IleGlu: 3.033 ± 2.314
IlePhe: 8.34 ± 3.206
IleGly: 5.307 ± 1.583
IleHis: 0.758 ± 0.697
IleIle: 8.34 ± 2.938
IleLys: 7.582 ± 1.984
IleLeu: 4.549 ± 0.925
IleMet: 4.549 ± 1.126
IleAsn: 3.791 ± 1.928
IlePro: 3.033 ± 1.422
IleGln: 4.549 ± 1.88
IleArg: 6.065 ± 2.698
IleSer: 4.549 ± 1.163
IleThr: 4.549 ± 1.422
IleVal: 4.549 ± 1.427
IleTrp: 1.516 ± 1.067
IleTyr: 3.033 ± 1.411
IleXaa: 0.0 ± 0.0
Lys
LysAla: 0.758 ± 0.697
LysCys: 1.516 ± 1.393
LysAsp: 4.549 ± 0.968
LysGlu: 12.889 ± 5.749
LysPhe: 3.033 ± 2.786
LysGly: 3.033 ± 1.065
LysHis: 0.758 ± 0.534
LysIle: 4.549 ± 2.236
LysLys: 7.582 ± 4.218
LysLeu: 5.307 ± 1.071
LysMet: 0.758 ± 0.534
LysAsn: 6.823 ± 4.246
LysPro: 0.0 ± 0.0
LysGln: 4.549 ± 2.704
LysArg: 3.791 ± 1.44
LysSer: 3.791 ± 0.979
LysThr: 6.823 ± 2.713
LysVal: 3.791 ± 1.763
LysTrp: 2.274 ± 1.008
LysTyr: 3.033 ± 1.345
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.549 ± 1.201
LeuCys: 0.0 ± 0.0
LeuAsp: 2.274 ± 1.471
LeuGlu: 3.033 ± 2.399
LeuPhe: 0.758 ± 1.16
LeuGly: 6.065 ± 2.048
LeuHis: 1.516 ± 0.746
LeuIle: 7.582 ± 2.149
LeuLys: 8.34 ± 2.547
LeuLeu: 4.549 ± 1.962
LeuMet: 5.307 ± 2.462
LeuAsn: 4.549 ± 1.425
LeuPro: 3.033 ± 2.134
LeuGln: 6.823 ± 2.144
LeuArg: 6.823 ± 1.081
LeuSer: 6.065 ± 1.479
LeuThr: 4.549 ± 1.163
LeuVal: 0.758 ± 0.697
LeuTrp: 1.516 ± 0.84
LeuTyr: 1.516 ± 1.067
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.791 ± 1.692
MetCys: 0.0 ± 0.0
MetAsp: 0.758 ± 0.534
MetGlu: 3.033 ± 1.266
MetPhe: 0.758 ± 1.16
MetGly: 0.758 ± 0.534
MetHis: 0.0 ± 0.0
MetIle: 1.516 ± 0.888
MetLys: 1.516 ± 1.32
MetLeu: 3.033 ± 2.003
MetMet: 0.0 ± 0.0
MetAsn: 3.033 ± 0.768
MetPro: 0.758 ± 0.534
MetGln: 0.758 ± 0.817
MetArg: 2.274 ± 1.008
MetSer: 0.758 ± 0.534
MetThr: 6.065 ± 2.895
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.274 ± 1.008
AsnCys: 0.0 ± 0.0
AsnAsp: 3.791 ± 2.025
AsnGlu: 6.065 ± 2.258
AsnPhe: 2.274 ± 1.497
AsnGly: 0.758 ± 0.534
AsnHis: 0.0 ± 0.0
AsnIle: 12.889 ± 6.717
AsnLys: 7.582 ± 2.907
AsnLeu: 9.856 ± 2.076
AsnMet: 3.033 ± 1.5
AsnAsn: 2.274 ± 0.995
AsnPro: 0.758 ± 0.697
AsnGln: 3.033 ± 1.065
AsnArg: 3.033 ± 0.759
AsnSer: 2.274 ± 0.995
AsnThr: 3.791 ± 1.612
AsnVal: 1.516 ± 0.746
AsnTrp: 0.0 ± 0.0
AsnTyr: 4.549 ± 1.186
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.274 ± 1.455
ProCys: 0.758 ± 0.697
ProAsp: 0.0 ± 0.0
ProGlu: 3.033 ± 1.313
ProPhe: 2.274 ± 1.261
ProGly: 2.274 ± 1.601
ProHis: 1.516 ± 0.888
ProIle: 3.791 ± 2.668
ProLys: 3.033 ± 1.045
ProLeu: 3.033 ± 1.422
ProMet: 0.0 ± 0.0
ProAsn: 2.274 ± 0.995
ProPro: 0.758 ± 0.534
ProGln: 1.516 ± 1.067
ProArg: 3.033 ± 1.045
ProSer: 2.274 ± 1.601
ProThr: 3.033 ± 0.768
ProVal: 1.516 ± 1.067
ProTrp: 0.758 ± 0.534
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.549 ± 2.942
GlnCys: 0.758 ± 0.697
GlnAsp: 3.033 ± 1.36
GlnGlu: 3.791 ± 1.61
GlnPhe: 0.0 ± 0.0
GlnGly: 3.033 ± 1.679
GlnHis: 2.274 ± 2.09
GlnIle: 2.274 ± 0.6
GlnLys: 4.549 ± 0.847
GlnLeu: 2.274 ± 0.995
GlnMet: 2.274 ± 1.471
GlnAsn: 9.098 ± 1.746
GlnPro: 3.033 ± 1.422
GlnGln: 3.033 ± 2.26
GlnArg: 3.791 ± 1.162
GlnSer: 3.033 ± 0.768
GlnThr: 3.791 ± 1.198
GlnVal: 0.0 ± 0.0
GlnTrp: 1.516 ± 1.203
GlnTyr: 0.758 ± 0.949
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.033 ± 1.43
ArgCys: 0.0 ± 0.0
ArgAsp: 4.549 ± 1.425
ArgGlu: 3.033 ± 1.266
ArgPhe: 0.0 ± 0.0
ArgGly: 3.791 ± 2.185
ArgHis: 2.274 ± 1.558
ArgIle: 6.823 ± 0.761
ArgLys: 4.549 ± 1.849
ArgLeu: 3.791 ± 1.942
ArgMet: 1.516 ± 1.32
ArgAsn: 3.033 ± 1.045
ArgPro: 3.033 ± 1.345
ArgGln: 3.033 ± 1.57
ArgArg: 7.582 ± 3.172
ArgSer: 1.516 ± 1.067
ArgThr: 0.758 ± 0.534
ArgVal: 2.274 ± 1.008
ArgTrp: 0.758 ± 0.534
ArgTyr: 3.033 ± 1.266
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.033 ± 1.43
SerCys: 0.758 ± 0.534
SerAsp: 3.791 ± 1.23
SerGlu: 2.274 ± 1.008
SerPhe: 2.274 ± 0.6
SerGly: 4.549 ± 0.968
SerHis: 2.274 ± 1.218
SerIle: 3.033 ± 1.345
SerLys: 6.065 ± 2.457
SerLeu: 3.791 ± 1.44
SerMet: 0.758 ± 0.817
SerAsn: 3.791 ± 1.697
SerPro: 3.791 ± 0.911
SerGln: 2.274 ± 1.601
SerArg: 0.758 ± 0.697
SerSer: 1.516 ± 1.067
SerThr: 3.033 ± 2.134
SerVal: 0.758 ± 0.534
SerTrp: 0.758 ± 0.949
SerTyr: 0.0 ± 0.0
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.033 ± 1.313
ThrCys: 0.758 ± 0.697
ThrAsp: 5.307 ± 2.874
ThrGlu: 2.274 ± 1.471
ThrPhe: 2.274 ± 1.601
ThrGly: 3.033 ± 0.768
ThrHis: 3.791 ± 1.909
ThrIle: 3.791 ± 1.124
ThrLys: 4.549 ± 1.761
ThrLeu: 4.549 ± 1.186
ThrMet: 2.274 ± 1.471
ThrAsn: 3.033 ± 1.493
ThrPro: 2.274 ± 1.601
ThrGln: 3.791 ± 1.067
ThrArg: 3.033 ± 1.065
ThrSer: 4.549 ± 1.163
ThrThr: 3.033 ± 1.266
ThrVal: 0.758 ± 0.697
ThrTrp: 2.274 ± 1.371
ThrTyr: 3.791 ± 1.102
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.549 ± 1.498
ValCys: 0.758 ± 0.534
ValAsp: 0.758 ± 0.534
ValGlu: 0.758 ± 0.949
ValPhe: 1.516 ± 1.067
ValGly: 0.758 ± 0.697
ValHis: 0.758 ± 0.817
ValIle: 1.516 ± 1.067
ValLys: 0.758 ± 0.697
ValLeu: 4.549 ± 1.989
ValMet: 0.0 ± 0.0
ValAsn: 1.516 ± 1.067
ValPro: 1.516 ± 0.888
ValGln: 3.791 ± 1.603
ValArg: 1.516 ± 1.067
ValSer: 0.758 ± 0.697
ValThr: 0.758 ± 0.949
ValVal: 1.516 ± 1.067
ValTrp: 0.0 ± 0.0
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.758 ± 0.697
TrpCys: 0.0 ± 0.0
TrpAsp: 1.516 ± 1.067
TrpGlu: 2.274 ± 1.88
TrpPhe: 0.0 ± 0.0
TrpGly: 1.516 ± 0.673
TrpHis: 0.758 ± 0.534
TrpIle: 0.0 ± 0.0
TrpLys: 2.274 ± 0.995
TrpLeu: 0.758 ± 0.534
TrpMet: 0.0 ± 0.0
TrpAsn: 0.758 ± 0.534
TrpPro: 2.274 ± 0.995
TrpGln: 0.758 ± 0.817
TrpArg: 0.758 ± 0.949
TrpSer: 1.516 ± 1.203
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.758 ± 0.817
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.274 ± 1.218
TyrCys: 0.0 ± 0.0
TyrAsp: 2.274 ± 1.111
TyrGlu: 0.758 ± 0.949
TyrPhe: 3.791 ± 0.487
TyrGly: 1.516 ± 0.888
TyrHis: 0.758 ± 0.697
TyrIle: 4.549 ± 1.849
TyrLys: 2.274 ± 1.111
TyrLeu: 0.758 ± 0.697
TyrMet: 0.0 ± 0.0
TyrAsn: 2.274 ± 1.008
TyrPro: 1.516 ± 0.746
TyrGln: 1.516 ± 1.067
TyrArg: 3.791 ± 1.909
TyrSer: 1.516 ± 0.673
TyrThr: 2.274 ± 1.204
TyrVal: 0.758 ± 0.697
TyrTrp: 0.758 ± 0.534
TyrTyr: 0.758 ± 0.697
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1320 amino acids)
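To work with the listed values programmatically, each table entry can be split into the first residue, the second residue, the frequency and its error. The following is a minimal sketch: the `parse_entry` helper and the regular expression are hypothetical, written against the entry format shown on this page (optionally preceded by a repeated copy of the value), and they say nothing about the layout of the downloadable CSV file.

```python
import re

# One entry: optional repeated value, two three-letter codes, value, error.
ENTRY = re.compile(
    r"^(?:\d+(?:\.\d+)?)?"             # optional duplicated leading value
    r"([A-Z][a-z]{2})([A-Z][a-z]{2})"  # first and second residue
    r":\s*(\d+(?:\.\d+)?)"             # frequency in per mille
    r"\s*±\s*(\d+(?:\.\d+)?)$"         # bootstrap error in per mille
)


def parse_entry(line):
    """Return (first, second, value, error) or None for non-entry lines."""
    match = ENTRY.match(line.strip())
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


parsed = parse_entry("LysGlu: 12.889 ± 5.749")
```

Row labels such as "Ala" do not match the pattern and yield None, so they can be skipped when iterating over the lines.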

Note: The error has been estimated by bootstrapping (×100) at the protein level.
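Bootstrapping at the protein level means that whole proteins, not individual residues, are resampled with replacement, and the statistic is recomputed for every resampled set. The sketch below illustrates this under stated assumptions: the function names, the toy sequences and the fixed seed are hypothetical, and using the sample standard deviation of the 100 estimates as the error is an assumption, since the page does not specify which spread measure is reported.

```python
import random
from statistics import stdev


def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille over a set of sequences."""
    total = hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    # Assumption: report the sample standard deviation of the estimates.
    return stdev(estimates)


# Toy sequences, not taken from the Tortoise microvirus 30 proteome.
toy = ["MKKLA", "AKKE", "GGKKA"]
kk_error = bootstrap_error(toy, "KK")
ww_error = bootstrap_error(toy, "WW")
```

A dipeptide that is absent from every protein has a frequency of zero in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.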

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski