Amino acid dipeptide frequency for Tortoise microvirus 34

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
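The calculation described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the database's own code: the function names `dipeptide_permille` and `classify` are hypothetical, overlapping pairs are counted within each protein only (an assumption), any unrecognised letter is folded into Xaa (an assumption), and the red/blue threshold is assumed to be the uniform expectation of 2.5‰.

```python
from collections import Counter
from itertools import product

# Three-letter codes in the table's column order; "X" maps to Xaa.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "X": "Xaa",
}

# Uniform expectation over the 400 standard dipeptides: 1/400 = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the standard 20 is treated as Xaa.
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping adjacent pairs, never spanning two proteins (assumption).
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


def classify(freq_permille):
    """Label a frequency relative to the assumed uniform expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


# Toy input (not the real proteome): four pairs in total, one of them LysLys.
freqs = dipeptide_permille(["MKKA", "AG"])
lyslys_fraction = freqs["LysLys"] * 1e-3  # per mille -> fraction
```

With this convention a table value such as 11.532‰ corresponds to a fraction of 0.011532, and `classify(11.532)` labels it as overrepresented relative to the uniform baseline.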

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.647 ± 1.611
AlaCys: 0.0 ± 0.0
AlaAsp: 4.119 ± 2.046
AlaGlu: 4.119 ± 1.482
AlaPhe: 3.295 ± 2.068
AlaGly: 5.766 ± 1.794
AlaHis: 1.647 ± 1.077
AlaIle: 2.471 ± 1.096
AlaLys: 2.471 ± 1.615
AlaLeu: 6.59 ± 1.79
AlaMet: 1.647 ± 0.75
AlaAsn: 3.295 ± 1.5
AlaPro: 3.295 ± 2.154
AlaGln: 3.295 ± 1.252
AlaArg: 4.942 ± 0.909
AlaSer: 5.766 ± 2.956
AlaThr: 3.295 ± 1.106
AlaVal: 5.766 ± 1.328
AlaTrp: 2.471 ± 0.589
AlaTyr: 1.647 ± 1.077
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 2.471 ± 2.319
CysAsp: 0.824 ± 0.773
CysGlu: 0.824 ± 0.538
CysPhe: 1.647 ± 0.651
CysGly: 2.471 ± 2.319
CysHis: 0.824 ± 0.538
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 1.647 ± 0.651
CysMet: 0.0 ± 0.0
CysAsn: 0.824 ± 0.773
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.824 ± 0.773
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 1.647 ± 1.546
CysTrp: 0.0 ± 0.0
CysTyr: 0.824 ± 0.773
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.942 ± 2.106
AspCys: 0.824 ± 0.773
AspAsp: 3.295 ± 1.529
AspGlu: 2.471 ± 2.241
AspPhe: 4.119 ± 1.944
AspGly: 7.414 ± 1.904
AspHis: 0.824 ± 0.538
AspIle: 2.471 ± 1.46
AspLys: 1.647 ± 1.343
AspLeu: 7.414 ± 3.322
AspMet: 2.471 ± 1.077
AspAsn: 0.0 ± 0.0
AspPro: 1.647 ± 1.077
AspGln: 3.295 ± 1.648
AspArg: 3.295 ± 2.237
AspSer: 4.942 ± 0.729
AspThr: 4.119 ± 1.944
AspVal: 3.295 ± 1.529
AspTrp: 0.0 ± 0.0
AspTyr: 5.766 ± 3.219
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.471 ± 1.324
GluCys: 0.824 ± 0.538
GluAsp: 3.295 ± 2.134
GluGlu: 4.942 ± 1.019
GluPhe: 2.471 ± 1.53
GluGly: 4.119 ± 2.162
GluHis: 0.0 ± 0.0
GluIle: 3.295 ± 2.237
GluLys: 5.766 ± 3.382
GluLeu: 3.295 ± 1.302
GluMet: 1.647 ± 1.189
GluAsn: 0.824 ± 0.773
GluPro: 1.647 ± 1.295
GluGln: 1.647 ± 1.546
GluArg: 3.295 ± 1.728
GluSer: 4.942 ± 5.752
GluThr: 1.647 ± 0.947
GluVal: 6.59 ± 4.37
GluTrp: 1.647 ± 1.077
GluTyr: 4.119 ± 1.938
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.647 ± 0.651
PheCys: 0.824 ± 0.773
PheAsp: 3.295 ± 1.459
PheGlu: 6.59 ± 1.404
PhePhe: 2.471 ± 0.589
PheGly: 3.295 ± 1.459
PheHis: 0.0 ± 0.0
PheIle: 0.0 ± 0.0
PheLys: 0.824 ± 0.773
PheLeu: 1.647 ± 0.651
PheMet: 0.0 ± 0.667
PheAsn: 0.0 ± 0.0
PhePro: 2.471 ± 0.911
PheGln: 0.824 ± 0.538
PheArg: 3.295 ± 1.648
PheSer: 2.471 ± 1.53
PheThr: 2.471 ± 0.911
PheVal: 1.647 ± 1.077
PheTrp: 0.824 ± 0.538
PheTyr: 3.295 ± 0.613
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.295 ± 1.085
GlyCys: 0.824 ± 0.773
GlyAsp: 6.59 ± 1.967
GlyGlu: 5.766 ± 3.347
GlyPhe: 3.295 ± 1.302
GlyGly: 8.237 ± 2.551
GlyHis: 2.471 ± 1.46
GlyIle: 1.647 ± 0.651
GlyLys: 5.766 ± 1.075
GlyLeu: 11.532 ± 4.16
GlyMet: 1.647 ± 1.611
GlyAsn: 1.647 ± 1.295
GlyPro: 0.824 ± 0.806
GlyGln: 1.647 ± 1.122
GlyArg: 4.942 ± 1.019
GlySer: 6.59 ± 2.999
GlyThr: 0.824 ± 0.538
GlyVal: 7.414 ± 2.226
GlyTrp: 0.824 ± 0.773
GlyTyr: 4.942 ± 0.729
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.824 ± 0.773
HisCys: 0.0 ± 0.0
HisAsp: 1.647 ± 0.75
HisGlu: 0.824 ± 0.806
HisPhe: 0.824 ± 0.538
HisGly: 1.647 ± 0.75
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 0.824 ± 0.538
HisMet: 1.647 ± 0.75
HisAsn: 2.471 ± 1.615
HisPro: 0.824 ± 0.538
HisGln: 1.647 ± 0.947
HisArg: 0.824 ± 0.538
HisSer: 0.824 ± 0.773
HisThr: 0.0 ± 0.0
HisVal: 2.471 ± 1.324
HisTrp: 0.0 ± 0.0
HisTyr: 1.647 ± 0.651
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.295 ± 1.085
IleCys: 1.647 ± 1.546
IleAsp: 0.824 ± 0.806
IleGlu: 4.942 ± 2.134
IlePhe: 0.0 ± 0.0
IleGly: 0.824 ± 0.806
IleHis: 0.0 ± 0.0
IleIle: 0.824 ± 1.182
IleLys: 0.824 ± 0.538
IleLeu: 3.295 ± 0.613
IleMet: 0.0 ± 0.0
IleAsn: 1.647 ± 1.611
IlePro: 1.647 ± 0.75
IleGln: 2.471 ± 1.317
IleArg: 4.119 ± 1.105
IleSer: 2.471 ± 0.911
IleThr: 0.824 ± 1.182
IleVal: 2.471 ± 1.027
IleTrp: 0.824 ± 0.538
IleTyr: 1.647 ± 1.077
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.119 ± 2.204
LysCys: 0.824 ± 0.773
LysAsp: 4.119 ± 1.366
LysGlu: 2.471 ± 1.324
LysPhe: 2.471 ± 1.615
LysGly: 4.119 ± 1.43
LysHis: 0.0 ± 0.0
LysIle: 0.0 ± 0.0
LysLys: 2.471 ± 2.319
LysLeu: 2.471 ± 2.319
LysMet: 2.471 ± 1.324
LysAsn: 4.942 ± 1.177
LysPro: 1.647 ± 1.077
LysGln: 0.824 ± 0.538
LysArg: 5.766 ± 3.382
LysSer: 6.59 ± 1.596
LysThr: 0.824 ± 0.806
LysVal: 2.471 ± 1.324
LysTrp: 0.0 ± 0.0
LysTyr: 2.471 ± 0.911
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.414 ± 2.133
LeuCys: 1.647 ± 0.651
LeuAsp: 4.119 ± 0.594
LeuGlu: 6.59 ± 3.966
LeuPhe: 2.471 ± 1.317
LeuGly: 6.59 ± 1.912
LeuHis: 1.647 ± 0.75
LeuIle: 0.824 ± 0.538
LeuLys: 4.119 ± 1.965
LeuLeu: 5.766 ± 1.794
LeuMet: 4.119 ± 2.495
LeuAsn: 3.295 ± 1.252
LeuPro: 6.59 ± 3.4
LeuGln: 4.942 ± 2.054
LeuArg: 8.237 ± 1.55
LeuSer: 7.414 ± 2.223
LeuThr: 6.59 ± 1.603
LeuVal: 4.119 ± 2.046
LeuTrp: 0.824 ± 0.538
LeuTyr: 1.647 ± 0.651
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.119 ± 4.029
MetCys: 0.0 ± 0.0
MetAsp: 2.471 ± 2.345
MetGlu: 2.471 ± 2.41
MetPhe: 0.0 ± 0.0
MetGly: 1.647 ± 0.651
MetHis: 1.647 ± 1.077
MetIle: 1.647 ± 0.651
MetLys: 3.295 ± 0.613
MetLeu: 3.295 ± 0.868
MetMet: 0.824 ± 0.538
MetAsn: 1.647 ± 1.295
MetPro: 0.824 ± 0.538
MetGln: 1.647 ± 1.295
MetArg: 1.647 ± 1.611
MetSer: 3.295 ± 0.613
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 0.0 ± 0.0
MetTyr: 0.824 ± 0.806
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.471 ± 1.46
AsnCys: 0.824 ± 0.773
AsnAsp: 1.647 ± 1.122
AsnGlu: 0.824 ± 0.538
AsnPhe: 0.0 ± 0.0
AsnGly: 4.119 ± 2.692
AsnHis: 0.0 ± 0.0
AsnIle: 2.471 ± 1.46
AsnLys: 0.824 ± 0.806
AsnLeu: 3.295 ± 2.237
AsnMet: 1.647 ± 0.782
AsnAsn: 3.295 ± 1.5
AsnPro: 4.942 ± 0.909
AsnGln: 0.824 ± 0.806
AsnArg: 2.471 ± 2.319
AsnSer: 2.471 ± 1.027
AsnThr: 1.647 ± 1.122
AsnVal: 3.295 ± 1.536
AsnTrp: 3.295 ± 0.613
AsnTyr: 0.824 ± 0.806
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.119 ± 2.078
ProCys: 1.647 ± 0.651
ProAsp: 1.647 ± 2.365
ProGlu: 2.471 ± 1.096
ProPhe: 1.647 ± 1.122
ProGly: 4.119 ± 0.993
ProHis: 0.824 ± 0.773
ProIle: 4.942 ± 1.475
ProLys: 2.471 ± 1.027
ProLeu: 3.295 ± 1.085
ProMet: 0.824 ± 0.538
ProAsn: 3.295 ± 1.5
ProPro: 1.647 ± 1.343
ProGln: 0.824 ± 0.538
ProArg: 2.471 ± 0.911
ProSer: 4.119 ± 2.204
ProThr: 1.647 ± 1.077
ProVal: 5.766 ± 2.237
ProTrp: 0.824 ± 0.538
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.647 ± 1.077
GlnCys: 0.0 ± 0.0
GlnAsp: 2.471 ± 1.096
GlnGlu: 0.824 ± 0.773
GlnPhe: 0.824 ± 0.538
GlnGly: 2.471 ± 2.241
GlnHis: 0.824 ± 0.806
GlnIle: 1.647 ± 1.611
GlnLys: 2.471 ± 1.324
GlnLeu: 0.824 ± 0.773
GlnMet: 1.647 ± 0.75
GlnAsn: 2.471 ± 1.46
GlnPro: 0.824 ± 0.538
GlnGln: 1.647 ± 0.75
GlnArg: 4.942 ± 1.694
GlnSer: 1.647 ± 1.611
GlnThr: 2.471 ± 1.615
GlnVal: 2.471 ± 0.589
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.471 ± 0.589
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.942 ± 1.822
ArgCys: 0.0 ± 0.0
ArgAsp: 3.295 ± 0.613
ArgGlu: 0.824 ± 0.773
ArgPhe: 3.295 ± 1.252
ArgGly: 6.59 ± 2.55
ArgHis: 0.824 ± 0.773
ArgIle: 4.119 ± 2.162
ArgLys: 5.766 ± 2.237
ArgLeu: 6.59 ± 1.911
ArgMet: 2.471 ± 1.313
ArgAsn: 2.471 ± 1.096
ArgPro: 6.59 ± 1.204
ArgGln: 0.824 ± 0.806
ArgArg: 7.414 ± 3.358
ArgSer: 4.942 ± 2.942
ArgThr: 2.471 ± 1.027
ArgVal: 7.414 ± 1.56
ArgTrp: 0.0 ± 0.0
ArgTyr: 4.942 ± 1.954
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.061 ± 1.944
SerCys: 0.824 ± 0.773
SerAsp: 4.942 ± 2.154
SerGlu: 3.295 ± 1.252
SerPhe: 5.766 ± 1.075
SerGly: 2.471 ± 0.589
SerHis: 2.471 ± 1.53
SerIle: 4.119 ± 3.338
SerLys: 4.942 ± 1.613
SerLeu: 8.237 ± 2.085
SerMet: 1.647 ± 1.295
SerAsn: 3.295 ± 1.5
SerPro: 4.942 ± 2.154
SerGln: 0.824 ± 0.538
SerArg: 6.59 ± 2.264
SerSer: 6.59 ± 2.152
SerThr: 4.119 ± 1.839
SerVal: 4.119 ± 0.594
SerTrp: 0.824 ± 0.806
SerTyr: 0.824 ± 0.538
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 1.647 ± 0.75
ThrCys: 0.0 ± 0.0
ThrAsp: 5.766 ± 2.211
ThrGlu: 3.295 ± 1.151
ThrPhe: 0.824 ± 0.538
ThrGly: 4.119 ± 2.177
ThrHis: 0.824 ± 0.538
ThrIle: 0.0 ± 0.0
ThrLys: 1.647 ± 0.75
ThrLeu: 3.295 ± 1.348
ThrMet: 1.647 ± 0.75
ThrAsn: 2.471 ± 1.077
ThrPro: 1.647 ± 0.651
ThrGln: 0.824 ± 0.538
ThrArg: 2.471 ± 1.615
ThrSer: 4.119 ± 1.316
ThrThr: 4.119 ± 1.944
ThrVal: 2.471 ± 1.615
ThrTrp: 0.0 ± 0.0
ThrTyr: 0.824 ± 0.773
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.414 ± 2.082
ValCys: 0.824 ± 0.773
ValAsp: 4.942 ± 0.909
ValGlu: 1.647 ± 0.651
ValPhe: 1.647 ± 1.077
ValGly: 4.942 ± 0.823
ValHis: 2.471 ± 0.911
ValIle: 1.647 ± 0.75
ValLys: 4.119 ± 1.361
ValLeu: 6.59 ± 1.887
ValMet: 2.471 ± 1.077
ValAsn: 2.471 ± 1.615
ValPro: 4.942 ± 2.109
ValGln: 3.295 ± 0.613
ValArg: 2.471 ± 1.096
ValSer: 6.59 ± 2.382
ValThr: 4.119 ± 1.944
ValVal: 5.766 ± 2.111
ValTrp: 0.824 ± 0.773
ValTyr: 1.647 ± 0.75
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.824 ± 0.806
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.824 ± 0.773
TrpHis: 0.824 ± 0.538
TrpIle: 0.0 ± 0.0
TrpLys: 0.824 ± 0.538
TrpLeu: 1.647 ± 0.75
TrpMet: 0.824 ± 0.538
TrpAsn: 0.0 ± 0.0
TrpPro: 0.824 ± 0.773
TrpGln: 0.0 ± 0.0
TrpArg: 0.824 ± 0.538
TrpSer: 4.119 ± 0.993
TrpThr: 0.824 ± 0.773
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.824 ± 0.538
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.647 ± 1.077
TyrCys: 0.824 ± 0.538
TyrAsp: 4.942 ± 1.177
TyrGlu: 3.295 ± 3.403
TyrPhe: 1.647 ± 1.077
TyrGly: 4.942 ± 1.613
TyrHis: 0.824 ± 0.773
TyrIle: 2.471 ± 0.911
TyrLys: 0.824 ± 0.806
TyrLeu: 7.414 ± 0.243
TyrMet: 0.824 ± 0.773
TyrAsn: 0.824 ± 0.538
TyrPro: 0.824 ± 0.538
TyrGln: 3.295 ± 1.302
TyrArg: 4.942 ± 1.019
TyrSer: 0.0 ± 0.0
TyrThr: 0.0 ± 0.0
TyrVal: 1.647 ± 1.546
TyrTrp: 0.0 ± 0.0
TyrTyr: 4.119 ± 3.134
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1215 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
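Protein-level bootstrapping can be illustrated with the following Python sketch. It is not the database's own implementation: the helper names `pair_permille` and `bootstrap_error` are hypothetical, one-letter residue codes are used for brevity, and reporting the standard deviation across replicates as the "±" value is an assumption, since the page does not state which spread measure is used.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins):
    """Per-mille frequency of each adjacent residue pair (one-letter codes)."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within a protein, never across protein boundaries.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of one pair's frequency over protein-level resamples.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the individual residue.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    values = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_permille(sample).get(pair, 0.0))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.pstdev(values)


# Toy proteome (not the real sequences) to show the call.
toy_proteins = ["MKKAGL", "MGLSA", "MSAKK", "MGLGL"]
gl_error = bootstrap_error(toy_proteins, "GL")
```

Because whole proteins are resampled, a dipeptide concentrated in a single protein tends to get a large error, which is consistent with table entries whose error is comparable to the value itself when only 4 proteins are available.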

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski