Amino acid dipeptide frequency for Tortoise microvirus 68

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
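As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides in a set of protein sequences, converts the counts to per mille, and compares a value with the uniform expectation of 2.5‰. The names `dipeptide_per_mille` and `classify`, the mapping of any non-standard letter to X, and the choice to count pairs only within each protein are assumptions made for illustration; the exact counting and normalisation used by Proteome-pI are not stated on this page.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only,
    and any non-standard letter is treated as X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"
```

With the values listed on this page, `classify(10.381)` (AlaAla) gives "over" and `classify(0.692)` (AlaCys) gives "under".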

All values are presented in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.381 ± 8.595
AlaCys: 0.692 ± 0.608
AlaAsp: 6.228 ± 1.929
AlaGlu: 3.46 ± 1.068
AlaPhe: 2.076 ± 0.849
AlaGly: 4.844 ± 2.007
AlaHis: 0.692 ± 0.686
AlaIle: 4.844 ± 2.319
AlaLys: 2.768 ± 0.926
AlaLeu: 2.076 ± 1.352
AlaMet: 2.076 ± 2.059
AlaAsn: 6.228 ± 3.714
AlaPro: 2.076 ± 0.841
AlaGln: 4.152 ± 0.809
AlaArg: 5.536 ± 1.387
AlaSer: 12.457 ± 3.906
AlaThr: 4.152 ± 1.322
AlaVal: 6.92 ± 2.164
AlaTrp: 1.384 ± 0.901
AlaTyr: 1.384 ± 0.587
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.692 ± 0.608
CysCys: 0.0 ± 0.0
CysAsp: 0.692 ± 0.451
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 1.384 ± 1.217
CysHis: 0.0 ± 0.0
CysIle: 0.692 ± 0.608
CysLys: 0.0 ± 0.0
CysLeu: 1.384 ± 0.587
CysMet: 0.692 ± 0.608
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.692 ± 0.608
CysSer: 1.384 ± 1.048
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.692 ± 0.608
CysTyr: 0.692 ± 0.608
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.768 ± 0.867
AspCys: 0.0 ± 0.0
AspAsp: 5.536 ± 2.233
AspGlu: 4.844 ± 2.239
AspPhe: 0.692 ± 0.451
AspGly: 0.692 ± 0.608
AspHis: 1.384 ± 0.901
AspIle: 4.152 ± 1.2
AspLys: 3.46 ± 1.109
AspLeu: 6.92 ± 3.238
AspMet: 0.692 ± 0.451
AspAsn: 1.384 ± 0.806
AspPro: 2.076 ± 0.78
AspGln: 2.768 ± 1.323
AspArg: 4.152 ± 0.996
AspSer: 6.92 ± 3.6
AspThr: 2.768 ± 1.125
AspVal: 6.228 ± 2.822
AspTrp: 1.384 ± 1.372
AspTyr: 5.536 ± 1.433
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.152 ± 1.679
GluCys: 0.0 ± 0.0
GluAsp: 1.384 ± 0.587
GluGlu: 1.384 ± 0.961
GluPhe: 2.768 ± 1.531
GluGly: 2.076 ± 1.717
GluHis: 3.46 ± 2.283
GluIle: 4.152 ± 0.996
GluLys: 2.076 ± 1.003
GluLeu: 2.076 ± 1.183
GluMet: 2.076 ± 0.737
GluAsn: 3.46 ± 1.691
GluPro: 0.692 ± 0.608
GluGln: 0.692 ± 0.451
GluArg: 2.076 ± 0.841
GluSer: 2.768 ± 0.837
GluThr: 5.536 ± 1.518
GluVal: 3.46 ± 1.046
GluTrp: 1.384 ± 0.621
GluTyr: 3.46 ± 1.666
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.152 ± 1.534
PheCys: 1.384 ± 0.587
PheAsp: 4.844 ± 1.509
PheGlu: 2.076 ± 1.36
PhePhe: 4.152 ± 2.25
PheGly: 4.844 ± 0.759
PheHis: 0.0 ± 0.0
PheIle: 1.384 ± 0.961
PheLys: 1.384 ± 0.587
PheLeu: 0.0 ± 0.0
PheMet: 1.384 ± 1.048
PheAsn: 4.152 ± 1.555
PhePro: 1.384 ± 0.806
PheGln: 1.384 ± 0.587
PheArg: 1.384 ± 0.901
PheSer: 2.768 ± 1.724
PheThr: 3.46 ± 1.046
PheVal: 4.152 ± 2.075
PheTrp: 0.692 ± 0.451
PheTyr: 2.076 ± 1.241
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.152 ± 2.628
GlyCys: 0.692 ± 0.608
GlyAsp: 6.228 ± 1.33
GlyGlu: 5.536 ± 2.036
GlyPhe: 3.46 ± 1.046
GlyGly: 5.536 ± 1.659
GlyHis: 0.692 ± 0.608
GlyIle: 4.152 ± 1.715
GlyLys: 2.076 ± 0.898
GlyLeu: 4.152 ± 0.996
GlyMet: 1.384 ± 1.047
GlyAsn: 2.076 ± 1.229
GlyPro: 0.692 ± 0.451
GlyGln: 1.384 ± 0.901
GlyArg: 2.076 ± 1.36
GlySer: 4.844 ± 2.859
GlyThr: 4.152 ± 2.704
GlyVal: 4.844 ± 1.965
GlyTrp: 2.076 ± 0.841
GlyTyr: 1.384 ± 0.587
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.768 ± 1.646
HisCys: 0.692 ± 0.608
HisAsp: 2.076 ± 0.841
HisGlu: 1.384 ± 1.217
HisPhe: 1.384 ± 0.587
HisGly: 0.692 ± 0.451
HisHis: 1.384 ± 0.901
HisIle: 0.0 ± 0.0
HisLys: 0.692 ± 0.608
HisLeu: 0.692 ± 0.451
HisMet: 0.692 ± 0.428
HisAsn: 0.692 ± 0.608
HisPro: 0.692 ± 0.608
HisGln: 0.692 ± 0.686
HisArg: 0.0 ± 0.0
HisSer: 1.384 ± 1.377
HisThr: 0.692 ± 0.451
HisVal: 1.384 ± 0.961
HisTrp: 0.692 ± 0.451
HisTyr: 0.692 ± 0.608
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.768 ± 0.635
IleCys: 0.692 ± 0.978
IleAsp: 4.152 ± 1.81
IleGlu: 2.076 ± 0.78
IlePhe: 3.46 ± 0.95
IleGly: 6.228 ± 0.949
IleHis: 1.384 ± 1.047
IleIle: 1.384 ± 0.587
IleLys: 3.46 ± 1.746
IleLeu: 2.768 ± 0.635
IleMet: 1.384 ± 1.217
IleAsn: 3.46 ± 0.849
IlePro: 2.768 ± 1.23
IleGln: 4.152 ± 1.773
IleArg: 2.768 ± 1.175
IleSer: 4.152 ± 0.633
IleThr: 3.46 ± 1.215
IleVal: 2.076 ± 0.501
IleTrp: 0.0 ± 0.0
IleTyr: 2.768 ± 1.208
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.768 ± 1.724
LysCys: 0.692 ± 0.608
LysAsp: 2.768 ± 1.428
LysGlu: 3.46 ± 1.32
LysPhe: 1.384 ± 1.716
LysGly: 3.46 ± 1.506
LysHis: 0.692 ± 0.686
LysIle: 4.152 ± 1.719
LysLys: 2.768 ± 1.876
LysLeu: 2.768 ± 1.688
LysMet: 0.0 ± 0.0
LysAsn: 2.768 ± 0.923
LysPro: 2.768 ± 1.612
LysGln: 0.692 ± 0.451
LysArg: 3.46 ± 1.746
LysSer: 4.844 ± 0.618
LysThr: 1.384 ± 0.901
LysVal: 0.692 ± 0.451
LysTrp: 0.692 ± 0.686
LysTyr: 2.768 ± 0.923
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.46 ± 2.253
LeuCys: 0.0 ± 0.0
LeuAsp: 3.46 ± 1.393
LeuGlu: 4.844 ± 1.529
LeuPhe: 2.076 ± 1.108
LeuGly: 2.768 ± 0.926
LeuHis: 0.0 ± 0.0
LeuIle: 2.768 ± 1.888
LeuLys: 3.46 ± 1.082
LeuLeu: 3.46 ± 1.195
LeuMet: 2.768 ± 1.646
LeuAsn: 3.46 ± 1.068
LeuPro: 6.228 ± 1.944
LeuGln: 3.46 ± 1.681
LeuArg: 5.536 ± 3.376
LeuSer: 6.228 ± 1.439
LeuThr: 3.46 ± 1.393
LeuVal: 4.152 ± 1.532
LeuTrp: 0.692 ± 0.451
LeuTyr: 5.536 ± 1.015
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.768 ± 1.503
MetCys: 0.0 ± 0.0
MetAsp: 2.076 ± 0.841
MetGlu: 0.692 ± 0.858
MetPhe: 1.384 ± 1.372
MetGly: 0.692 ± 0.686
MetHis: 0.0 ± 0.0
MetIle: 1.384 ± 1.252
MetLys: 1.384 ± 1.217
MetLeu: 0.692 ± 0.608
MetMet: 1.384 ± 1.372
MetAsn: 0.0 ± 0.0
MetPro: 1.384 ± 0.587
MetGln: 0.692 ± 0.686
MetArg: 0.692 ± 0.451
MetSer: 6.228 ± 1.935
MetThr: 0.692 ± 0.978
MetVal: 0.692 ± 0.451
MetTrp: 0.0 ± 0.0
MetTyr: 1.384 ± 1.252
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.152 ± 2.457
AsnCys: 0.0 ± 0.0
AsnAsp: 2.076 ± 1.408
AsnGlu: 1.384 ± 0.621
AsnPhe: 2.076 ± 2.229
AsnGly: 4.152 ± 1.134
AsnHis: 0.692 ± 0.858
AsnIle: 6.228 ± 1.503
AsnLys: 2.768 ± 1.128
AsnLeu: 4.152 ± 1.426
AsnMet: 1.384 ± 1.372
AsnAsn: 1.384 ± 1.372
AsnPro: 2.076 ± 0.501
AsnGln: 4.152 ± 1.055
AsnArg: 3.46 ± 1.094
AsnSer: 3.46 ± 1.7
AsnThr: 2.768 ± 0.867
AsnVal: 2.768 ± 1.302
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.692 ± 0.608
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.46 ± 1.407
ProCys: 0.692 ± 0.608
ProAsp: 3.46 ± 1.046
ProGlu: 1.384 ± 0.587
ProPhe: 2.076 ± 0.852
ProGly: 2.768 ± 1.175
ProHis: 1.384 ± 0.587
ProIle: 2.768 ± 0.635
ProLys: 1.384 ± 0.961
ProLeu: 3.46 ± 1.393
ProMet: 0.692 ± 0.451
ProAsn: 0.692 ± 0.451
ProPro: 0.692 ± 0.608
ProGln: 2.076 ± 1.352
ProArg: 0.0 ± 0.0
ProSer: 2.768 ± 1.302
ProThr: 2.768 ± 1.302
ProVal: 6.228 ± 1.409
ProTrp: 0.0 ± 0.0
ProTyr: 1.384 ± 0.901
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.152 ± 1.002
GlnCys: 0.0 ± 0.0
GlnAsp: 2.076 ± 0.841
GlnGlu: 2.076 ± 0.841
GlnPhe: 1.384 ± 0.806
GlnGly: 3.46 ± 1.091
GlnHis: 0.0 ± 0.0
GlnIle: 2.076 ± 1.352
GlnLys: 1.384 ± 0.901
GlnLeu: 4.152 ± 0.861
GlnMet: 0.692 ± 0.686
GlnAsn: 3.46 ± 1.215
GlnPro: 0.0 ± 0.0
GlnGln: 2.768 ± 0.867
GlnArg: 3.46 ± 1.7
GlnSer: 4.152 ± 1.275
GlnThr: 1.384 ± 0.621
GlnVal: 2.076 ± 1.352
GlnTrp: 0.692 ± 0.608
GlnTyr: 2.768 ± 1.288
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.768 ± 1.317
ArgCys: 0.692 ± 0.608
ArgAsp: 3.46 ± 0.822
ArgGlu: 2.768 ± 1.891
ArgPhe: 4.844 ± 1.557
ArgGly: 1.384 ± 0.587
ArgHis: 0.692 ± 0.608
ArgIle: 2.076 ± 1.102
ArgLys: 2.768 ± 1.876
ArgLeu: 6.92 ± 2.233
ArgMet: 2.076 ± 1.208
ArgAsn: 2.076 ± 1.11
ArgPro: 2.768 ± 1.175
ArgGln: 2.768 ± 1.724
ArgArg: 2.768 ± 1.724
ArgSer: 4.844 ± 1.015
ArgThr: 0.692 ± 0.451
ArgVal: 3.46 ± 1.094
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.152 ± 1.055
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 13.149 ± 6.36
SerCys: 1.384 ± 0.587
SerAsp: 3.46 ± 2.144
SerGlu: 4.152 ± 2.417
SerPhe: 4.844 ± 1.861
SerGly: 4.844 ± 1.148
SerHis: 2.768 ± 1.125
SerIle: 2.768 ± 1.425
SerLys: 6.92 ± 2.556
SerLeu: 7.612 ± 1.452
SerMet: 1.384 ± 1.372
SerAsn: 6.228 ± 1.719
SerPro: 3.46 ± 1.681
SerGln: 4.152 ± 1.275
SerArg: 8.304 ± 1.32
SerSer: 8.304 ± 1.498
SerThr: 4.844 ± 1.015
SerVal: 6.92 ± 3.797
SerTrp: 0.692 ± 0.451
SerTyr: 3.46 ± 0.909
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.844 ± 1.493
ThrCys: 0.692 ± 0.608
ThrAsp: 2.076 ± 1.352
ThrGlu: 3.46 ± 1.082
ThrPhe: 2.076 ± 1.352
ThrGly: 4.844 ± 1.242
ThrHis: 0.692 ± 0.978
ThrIle: 2.076 ± 1.352
ThrLys: 0.692 ± 0.451
ThrLeu: 4.152 ± 2.22
ThrMet: 0.0 ± 0.0
ThrAsn: 2.076 ± 1.229
ThrPro: 2.768 ± 1.125
ThrGln: 1.384 ± 1.047
ThrArg: 3.46 ± 1.681
ThrSer: 10.381 ± 2.884
ThrThr: 4.844 ± 1.455
ThrVal: 2.768 ± 1.302
ThrTrp: 1.384 ± 0.587
ThrTyr: 4.152 ± 1.241
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.228 ± 2.315
ValCys: 0.0 ± 0.0
ValAsp: 4.152 ± 2.293
ValGlu: 1.384 ± 0.961
ValPhe: 2.768 ± 1.262
ValGly: 4.152 ± 1.537
ValHis: 0.0 ± 0.0
ValIle: 4.152 ± 1.534
ValLys: 4.152 ± 1.798
ValLeu: 7.612 ± 2.851
ValMet: 2.076 ± 0.802
ValAsn: 4.152 ± 0.952
ValPro: 4.844 ± 2.068
ValGln: 1.384 ± 0.901
ValArg: 1.384 ± 1.956
ValSer: 3.46 ± 1.602
ValThr: 6.92 ± 0.933
ValVal: 3.46 ± 1.742
ValTrp: 0.692 ± 0.451
ValTyr: 1.384 ± 0.587
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.384 ± 0.587
TrpCys: 0.0 ± 0.0
TrpAsp: 0.692 ± 0.451
TrpGlu: 0.692 ± 0.608
TrpPhe: 0.692 ± 0.451
TrpGly: 0.0 ± 0.0
TrpHis: 0.692 ± 0.451
TrpIle: 1.384 ± 0.621
TrpLys: 0.692 ± 0.451
TrpLeu: 0.0 ± 0.0
TrpMet: 0.692 ± 0.451
TrpAsn: 0.0 ± 0.0
TrpPro: 1.384 ± 0.621
TrpGln: 1.384 ± 1.372
TrpArg: 0.0 ± 0.0
TrpSer: 2.076 ± 0.841
TrpThr: 1.384 ± 0.587
TrpVal: 0.692 ± 0.451
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.46 ± 1.091
TyrCys: 0.692 ± 0.608
TyrAsp: 3.46 ± 1.999
TyrGlu: 2.076 ± 0.501
TyrPhe: 3.46 ± 1.393
TyrGly: 2.768 ± 1.709
TyrHis: 2.768 ± 1.688
TyrIle: 2.768 ± 1.144
TyrLys: 0.692 ± 0.451
TyrLeu: 2.768 ± 1.23
TyrMet: 0.0 ± 0.0
TyrAsn: 2.076 ± 0.898
TyrPro: 1.384 ± 0.587
TyrGln: 2.076 ± 0.852
TyrArg: 2.768 ± 1.323
TyrSer: 6.228 ± 1.817
TyrThr: 3.46 ± 2.015
TyrVal: 2.076 ± 1.352
TyrTrp: 0.692 ± 0.451
TyrTyr: 4.152 ± 2.234
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1446 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
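One way such a protein-level bootstrap could work is sketched below in Python: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function names, the use of the sample standard deviation as the error measure, the fixed random seed, and the within-protein pair counting are all assumptions for illustration; the page does not describe the exact procedure used by Proteome-pI.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (in per mille) over overlapping pairs.

    Assumption: pairs are counted within each protein only.
    """
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each round draws len(proteins) proteins with replacement; the error is
    taken here as the sample standard deviation of the per-round estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so a dipeptide that is concentrated in a single protein shows a large error, which would be consistent with entries such as AlaAla (10.381 ± 8.595).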

The dipeptide statistics above (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski