Amino acid dipeptide frequency for Tortoise microvirus 86

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
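As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping residue pairs in a list of sequences and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the toy sequences, the use of one-letter codes (the table uses three-letter codes) and the choice not to count pairs across protein boundaries are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted with overlapping windows inside each
    protein and never across the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the proteome described here.
toy = ["MAAK", "AKA"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The toy input contains five dipeptides in total (MA, AA, AK from the first sequence and AK, KA from the second), so AK accounts for two fifths of them.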

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
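The comparison against the uniform expectation can be sketched as follows. The exact rule the site uses for the red and blue marking is not stated, so the simple greater-than / less-than comparison and the name `classify` are assumptions; the three observed values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids; counting Xaa as well gives 21

# Uniform expectation in per mille: 1000 / 400 = 2.5
expected_permille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table (in per mille).
examples = {"AlaGln": 9.353, "AlaAla": 7.015, "CysCys": 0.0}
for name, value in examples.items():
    print(name, value, classify(value))
```

With 441 combinations instead of 400, the expectation would drop to about 2.27‰, which does not change the labels of these three examples.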

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
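A minimal sketch of the unit conversion, using the AlaAla value from the table; the helper names are made up for this example.

```python
def permille_to_fraction(value_permille):
    """Convert per mille to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert per mille to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 7.015  # AlaAla, in per mille, from the table
print(permille_to_fraction(ala_ala))
print(permille_to_percent(ala_ala))
```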

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.015 ± 1.514
AlaCys: 0.779 ± 0.896
AlaAsp: 5.456 ± 2.243
AlaGlu: 6.235 ± 2.997
AlaPhe: 1.559 ± 0.92
AlaGly: 6.235 ± 1.392
AlaHis: 3.118 ± 1.238
AlaIle: 3.897 ± 1.42
AlaLys: 3.118 ± 1.65
AlaLeu: 5.456 ± 1.347
AlaMet: 2.338 ± 1.338
AlaAsn: 3.897 ± 1.446
AlaPro: 4.677 ± 1.858
AlaGln: 9.353 ± 4.017
AlaArg: 3.897 ± 1.607
AlaSer: 3.118 ± 2.314
AlaThr: 3.118 ± 1.1
AlaVal: 3.897 ± 0.501
AlaTrp: 3.118 ± 1.1
AlaTyr: 3.897 ± 0.86
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.779 ± 0.896
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 1.559 ± 1.229
CysIle: 0.0 ± 0.0
CysLys: 0.779 ± 0.833
CysLeu: 0.779 ± 0.896
CysMet: 0.0 ± 0.826
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.779 ± 0.504
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.779 ± 0.896
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.559 ± 1.007
AspCys: 0.779 ± 0.833
AspAsp: 3.897 ± 1.664
AspGlu: 2.338 ± 0.951
AspPhe: 3.897 ± 1.467
AspGly: 3.118 ± 2.015
AspHis: 0.0 ± 0.0
AspIle: 3.897 ± 1.82
AspLys: 2.338 ± 0.839
AspLeu: 8.574 ± 2.058
AspMet: 2.338 ± 0.848
AspAsn: 2.338 ± 1.49
AspPro: 3.897 ± 1.693
AspGln: 3.118 ± 0.956
AspArg: 2.338 ± 1.506
AspSer: 2.338 ± 0.877
AspThr: 3.118 ± 1.006
AspVal: 3.118 ± 0.506
AspTrp: 0.779 ± 0.755
AspTyr: 4.677 ± 1.944
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.456 ± 2.467
GluCys: 0.0 ± 0.0
GluAsp: 1.559 ± 0.752
GluGlu: 2.338 ± 1.506
GluPhe: 3.118 ± 2.166
GluGly: 1.559 ± 0.752
GluHis: 2.338 ± 1.18
GluIle: 2.338 ± 1.18
GluLys: 6.235 ± 2.568
GluLeu: 5.456 ± 2.011
GluMet: 0.0 ± 0.0
GluAsn: 3.118 ± 1.548
GluPro: 0.779 ± 0.896
GluGln: 0.779 ± 0.504
GluArg: 6.235 ± 3.566
GluSer: 3.897 ± 0.501
GluThr: 3.897 ± 1.889
GluVal: 3.118 ± 1.577
GluTrp: 0.779 ± 0.504
GluTyr: 4.677 ± 0.722
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.456 ± 2.096
PheCys: 0.0 ± 0.0
PheAsp: 3.118 ± 1.238
PheGlu: 1.559 ± 0.941
PhePhe: 1.559 ± 1.007
PheGly: 2.338 ± 1.511
PheHis: 1.559 ± 0.752
PheIle: 1.559 ± 1.007
PheLys: 2.338 ± 2.266
PheLeu: 0.779 ± 0.755
PheMet: 1.559 ± 0.752
PheAsn: 1.559 ± 0.92
PhePro: 0.0 ± 0.0
PheGln: 4.677 ± 0.844
PheArg: 3.118 ± 2.314
PheSer: 2.338 ± 0.972
PheThr: 1.559 ± 1.007
PheVal: 1.559 ± 0.752
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.897 ± 1.79
GlyCys: 0.0 ± 0.0
GlyAsp: 5.456 ± 2.392
GlyGlu: 6.235 ± 0.734
GlyPhe: 0.0 ± 0.0
GlyGly: 7.015 ± 1.77
GlyHis: 0.779 ± 0.504
GlyIle: 5.456 ± 2.054
GlyLys: 3.118 ± 1.28
GlyLeu: 5.456 ± 2.091
GlyMet: 0.0 ± 0.0
GlyAsn: 3.118 ± 2.015
GlyPro: 3.118 ± 1.368
GlyGln: 1.559 ± 0.898
GlyArg: 2.338 ± 0.563
GlySer: 6.235 ± 1.84
GlyThr: 4.677 ± 0.987
GlyVal: 3.118 ± 1.353
GlyTrp: 0.779 ± 0.504
GlyTyr: 3.118 ± 1.158
GlyXaa: 0.0 ± 0.0
His
HisAla: 3.897 ± 1.859
HisCys: 0.779 ± 0.504
HisAsp: 0.779 ± 0.504
HisGlu: 0.0 ± 0.0
HisPhe: 3.118 ± 1.353
HisGly: 2.338 ± 1.182
HisHis: 0.0 ± 0.0
HisIle: 1.559 ± 1.099
HisLys: 3.897 ± 1.488
HisLeu: 0.779 ± 0.504
HisMet: 1.559 ± 0.619
HisAsn: 0.0 ± 0.0
HisPro: 0.779 ± 0.504
HisGln: 0.779 ± 0.504
HisArg: 1.559 ± 0.752
HisSer: 0.779 ± 0.504
HisThr: 1.559 ± 1.064
HisVal: 0.779 ± 0.504
HisTrp: 0.0 ± 0.0
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.677 ± 0.731
IleCys: 0.779 ± 0.896
IleAsp: 3.897 ± 1.338
IleGlu: 1.559 ± 1.007
IlePhe: 2.338 ± 1.511
IleGly: 5.456 ± 1.135
IleHis: 0.0 ± 0.0
IleIle: 3.897 ± 1.664
IleLys: 2.338 ± 0.839
IleLeu: 1.559 ± 1.209
IleMet: 3.118 ± 1.162
IleAsn: 3.897 ± 0.841
IlePro: 3.897 ± 1.664
IleGln: 1.559 ± 1.064
IleArg: 1.559 ± 0.898
IleSer: 4.677 ± 1.789
IleThr: 3.118 ± 1.238
IleVal: 0.779 ± 0.504
IleTrp: 0.0 ± 0.0
IleTyr: 2.338 ± 1.506
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.456 ± 2.467
LysCys: 0.779 ± 0.896
LysAsp: 1.559 ± 0.619
LysGlu: 3.118 ± 1.548
LysPhe: 2.338 ± 1.511
LysGly: 4.677 ± 1.393
LysHis: 0.779 ± 0.504
LysIle: 3.897 ± 1.093
LysLys: 4.677 ± 1.635
LysLeu: 7.015 ± 2.396
LysMet: 2.338 ± 0.858
LysAsn: 4.677 ± 1.393
LysPro: 0.779 ± 0.504
LysGln: 0.779 ± 0.504
LysArg: 3.897 ± 2.23
LysSer: 1.559 ± 0.752
LysThr: 3.897 ± 1.263
LysVal: 5.456 ± 1.353
LysTrp: 0.0 ± 0.0
LysTyr: 2.338 ± 1.338
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.456 ± 1.349
LeuCys: 0.779 ± 0.833
LeuAsp: 5.456 ± 1.361
LeuGlu: 4.677 ± 0.988
LeuPhe: 1.559 ± 0.941
LeuGly: 6.235 ± 1.559
LeuHis: 0.779 ± 0.833
LeuIle: 4.677 ± 1.682
LeuLys: 4.677 ± 2.739
LeuLeu: 3.897 ± 1.455
LeuMet: 2.338 ± 1.338
LeuAsn: 3.897 ± 1.386
LeuPro: 5.456 ± 2.739
LeuGln: 4.677 ± 1.807
LeuArg: 8.574 ± 1.557
LeuSer: 8.574 ± 2.126
LeuThr: 1.559 ± 0.752
LeuVal: 4.677 ± 1.506
LeuTrp: 0.779 ± 0.854
LeuTyr: 1.559 ± 1.007
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.897 ± 1.803
MetCys: 0.0 ± 0.0
MetAsp: 0.779 ± 0.854
MetGlu: 1.559 ± 0.619
MetPhe: 0.0 ± 0.0
MetGly: 3.897 ± 1.693
MetHis: 2.338 ± 1.506
MetIle: 0.779 ± 0.504
MetLys: 2.338 ± 1.11
MetLeu: 0.779 ± 0.504
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 3.118 ± 1.353
MetGln: 2.338 ± 1.806
MetArg: 1.559 ± 0.941
MetSer: 3.118 ± 1.1
MetThr: 2.338 ± 1.754
MetVal: 1.559 ± 0.752
MetTrp: 0.0 ± 0.0
MetTyr: 1.559 ± 1.511
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.338 ± 0.839
AsnCys: 0.0 ± 0.0
AsnAsp: 2.338 ± 0.972
AsnGlu: 4.677 ± 2.533
AsnPhe: 0.779 ± 0.833
AsnGly: 2.338 ± 0.951
AsnHis: 0.0 ± 0.0
AsnIle: 3.118 ± 1.368
AsnLys: 0.779 ± 0.755
AsnLeu: 6.235 ± 1.392
AsnMet: 2.338 ± 1.511
AsnAsn: 0.779 ± 0.833
AsnPro: 4.677 ± 1.754
AsnGln: 0.779 ± 0.504
AsnArg: 4.677 ± 1.122
AsnSer: 6.235 ± 3.259
AsnThr: 3.897 ± 1.801
AsnVal: 1.559 ± 0.752
AsnTrp: 1.559 ± 0.941
AsnTyr: 3.118 ± 2.384
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.338 ± 1.678
ProCys: 0.0 ± 0.0
ProAsp: 3.118 ± 1.797
ProGlu: 1.559 ± 0.752
ProPhe: 2.338 ± 1.511
ProGly: 0.779 ± 0.504
ProHis: 1.559 ± 1.229
ProIle: 2.338 ± 1.511
ProLys: 3.118 ± 2.015
ProLeu: 4.677 ± 0.844
ProMet: 2.338 ± 0.563
ProAsn: 2.338 ± 0.839
ProPro: 0.779 ± 0.854
ProGln: 3.118 ± 1.006
ProArg: 3.118 ± 1.505
ProSer: 3.897 ± 1.607
ProThr: 4.677 ± 1.678
ProVal: 6.235 ± 1.912
ProTrp: 0.0 ± 0.0
ProTyr: 1.559 ± 1.007
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.677 ± 3.549
GlnCys: 0.779 ± 0.896
GlnAsp: 3.118 ± 1.567
GlnGlu: 4.677 ± 0.722
GlnPhe: 2.338 ± 1.49
GlnGly: 3.118 ± 1.567
GlnHis: 0.779 ± 0.504
GlnIle: 0.0 ± 0.0
GlnLys: 1.559 ± 0.619
GlnLeu: 3.897 ± 1.455
GlnMet: 3.118 ± 0.624
GlnAsn: 6.235 ± 1.397
GlnPro: 1.559 ± 0.898
GlnGln: 3.897 ± 1.652
GlnArg: 3.118 ± 2.017
GlnSer: 3.118 ± 1.368
GlnThr: 5.456 ± 2.475
GlnVal: 0.0 ± 0.0
GlnTrp: 0.779 ± 0.755
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.794 ± 1.47
ArgCys: 0.0 ± 0.0
ArgAsp: 3.118 ± 0.506
ArgGlu: 5.456 ± 2.015
ArgPhe: 1.559 ± 0.619
ArgGly: 3.118 ± 1.151
ArgHis: 0.779 ± 0.854
ArgIle: 3.897 ± 0.841
ArgLys: 3.897 ± 2.23
ArgLeu: 3.897 ± 1.093
ArgMet: 1.559 ± 1.064
ArgAsn: 3.118 ± 2.384
ArgPro: 2.338 ± 0.972
ArgGln: 3.118 ± 1.548
ArgArg: 2.338 ± 0.563
ArgSer: 2.338 ± 0.563
ArgThr: 3.118 ± 0.699
ArgVal: 2.338 ± 1.511
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.118 ± 1.505
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.677 ± 1.944
SerCys: 0.0 ± 0.0
SerAsp: 4.677 ± 2.371
SerGlu: 3.897 ± 1.708
SerPhe: 3.118 ± 1.353
SerGly: 4.677 ± 1.131
SerHis: 2.338 ± 1.18
SerIle: 3.897 ± 0.841
SerLys: 6.235 ± 1.612
SerLeu: 6.235 ± 1.304
SerMet: 1.559 ± 0.619
SerAsn: 3.897 ± 0.841
SerPro: 3.897 ± 1.79
SerGln: 3.118 ± 2.127
SerArg: 2.338 ± 0.972
SerSer: 3.897 ± 2.029
SerThr: 3.897 ± 1.094
SerVal: 2.338 ± 0.839
SerTrp: 0.0 ± 0.0
SerTyr: 3.897 ± 1.386
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.235 ± 3.419
ThrCys: 0.779 ± 0.896
ThrAsp: 3.897 ± 2.086
ThrGlu: 3.118 ± 1.053
ThrPhe: 3.118 ± 1.053
ThrGly: 3.118 ± 1.053
ThrHis: 0.779 ± 0.755
ThrIle: 2.338 ± 0.563
ThrLys: 3.118 ± 1.862
ThrLeu: 5.456 ± 1.794
ThrMet: 0.0 ± 0.0
ThrAsn: 3.118 ± 1.368
ThrPro: 5.456 ± 2.745
ThrGln: 1.559 ± 1.511
ThrArg: 2.338 ± 0.563
ThrSer: 4.677 ± 1.651
ThrThr: 3.897 ± 2.036
ThrVal: 3.118 ± 1.65
ThrTrp: 1.559 ± 0.92
ThrTyr: 1.559 ± 0.752
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.456 ± 1.244
ValCys: 0.0 ± 0.0
ValAsp: 1.559 ± 0.898
ValGlu: 2.338 ± 1.286
ValPhe: 3.118 ± 2.015
ValGly: 3.118 ± 0.506
ValHis: 3.118 ± 1.577
ValIle: 1.559 ± 0.898
ValLys: 3.897 ± 2.23
ValLeu: 2.338 ± 1.182
ValMet: 0.779 ± 0.504
ValAsn: 1.559 ± 0.752
ValPro: 3.897 ± 1.349
ValGln: 0.779 ± 0.755
ValArg: 1.559 ± 1.007
ValSer: 3.118 ± 2.166
ValThr: 3.118 ± 1.505
ValVal: 0.779 ± 0.504
ValTrp: 0.0 ± 0.0
ValTyr: 3.897 ± 1.252
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.559 ± 1.007
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 1.559 ± 0.619
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 1.559 ± 0.619
TrpMet: 0.0 ± 0.0
TrpAsn: 1.559 ± 1.511
TrpPro: 0.779 ± 0.504
TrpGln: 2.338 ± 1.9
TrpArg: 0.0 ± 0.0
TrpSer: 2.338 ± 0.877
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.118 ± 1.353
TyrCys: 0.0 ± 0.0
TyrAsp: 4.677 ± 0.844
TyrGlu: 0.779 ± 0.755
TyrPhe: 0.779 ± 0.504
TyrGly: 2.338 ± 1.49
TyrHis: 0.779 ± 0.833
TyrIle: 2.338 ± 0.972
TyrLys: 1.559 ± 0.92
TyrLeu: 5.456 ± 0.78
TyrMet: 3.897 ± 2.23
TyrAsn: 3.118 ± 0.506
TyrPro: 0.0 ± 0.0
TyrGln: 3.118 ± 1.238
TyrArg: 2.338 ± 1.083
TyrSer: 2.338 ± 0.877
TyrThr: 2.338 ± 1.506
TyrVal: 0.779 ± 0.833
TyrTrp: 1.559 ± 1.007
TyrTyr: 5.456 ± 1.353
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1284 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
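One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement and recompute the frequency each time. The sketch below does that and reports the spread as a population standard deviation. The choice of estimator, the function names, the fixed seed and the toy sequences are assumptions for illustration, not details confirmed by the source.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide (one-letter codes) in the sample."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not the five proteins of this proteome.
toy = ["MAAK", "AKAAG", "GGAK", "MKKA", "AAAA"]
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Under this scheme a dipeptide that occurs in none of the proteins gets an error of exactly zero, because every resample also contains zero copies of it.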

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski