Amino acid dipepetide frequency for Tortoise microvirus 98

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.86AlaAla: 7.86 ± 1.69
0.605AlaCys: 0.605 ± 0.836
6.651AlaAsp: 6.651 ± 1.461
4.232AlaGlu: 4.232 ± 2.151
3.023AlaPhe: 3.023 ± 1.178
7.255AlaGly: 7.255 ± 2.455
2.418AlaHis: 2.418 ± 1.446
1.814AlaIle: 1.814 ± 0.53
3.628AlaLys: 3.628 ± 1.552
6.651AlaLeu: 6.651 ± 1.208
1.814AlaMet: 1.814 ± 1.318
6.046AlaAsn: 6.046 ± 1.097
6.651AlaPro: 6.651 ± 2.143
3.023AlaGln: 3.023 ± 1.435
3.628AlaArg: 3.628 ± 1.13
4.232AlaSer: 4.232 ± 1.587
5.441AlaThr: 5.441 ± 1.766
6.046AlaVal: 6.046 ± 2.039
0.605AlaTrp: 0.605 ± 0.544
3.023AlaTyr: 3.023 ± 1.572
0.0AlaXaa: 0.0 ± 0.0
Cys
1.814CysAla: 1.814 ± 1.601
0.605CysCys: 0.605 ± 0.874
0.605CysAsp: 0.605 ± 0.421
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.209CysGly: 1.209 ± 0.903
0.605CysHis: 0.605 ± 0.594
0.605CysIle: 0.605 ± 0.836
0.0CysLys: 0.0 ± 0.0
0.605CysLeu: 0.605 ± 0.874
1.209CysMet: 1.209 ± 0.903
0.605CysAsn: 0.605 ± 0.789
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.814CysArg: 1.814 ± 1.109
1.209CysSer: 1.209 ± 0.593
0.605CysThr: 0.605 ± 0.594
0.605CysVal: 0.605 ± 0.874
0.0CysTrp: 0.0 ± 0.0
0.605CysTyr: 0.605 ± 0.594
0.0CysXaa: 0.0 ± 0.0
Asp
3.023AspAla: 3.023 ± 1.646
1.209AspCys: 1.209 ± 0.593
4.232AspAsp: 4.232 ± 1.966
1.814AspGlu: 1.814 ± 0.955
1.814AspPhe: 1.814 ± 1.082
1.209AspGly: 1.209 ± 0.504
1.209AspHis: 1.209 ± 0.504
6.046AspIle: 6.046 ± 1.659
3.023AspLys: 3.023 ± 1.54
6.651AspLeu: 6.651 ± 2.471
1.209AspMet: 1.209 ± 0.955
1.209AspAsn: 1.209 ± 0.854
1.814AspPro: 1.814 ± 0.761
2.418AspGln: 2.418 ± 0.896
5.441AspArg: 5.441 ± 1.483
4.232AspSer: 4.232 ± 1.713
3.628AspThr: 3.628 ± 1.166
6.651AspVal: 6.651 ± 1.051
0.605AspTrp: 0.605 ± 0.594
4.837AspTyr: 4.837 ± 2.202
0.0AspXaa: 0.0 ± 0.0
Glu
4.232GluAla: 4.232 ± 1.978
0.0GluCys: 0.0 ± 0.0
3.628GluAsp: 3.628 ± 1.237
3.628GluGlu: 3.628 ± 1.814
1.209GluPhe: 1.209 ± 0.841
1.209GluGly: 1.209 ± 0.937
0.605GluHis: 0.605 ± 0.544
2.418GluIle: 2.418 ± 1.761
1.814GluLys: 1.814 ± 0.771
6.651GluLeu: 6.651 ± 0.971
2.418GluMet: 2.418 ± 0.675
3.023GluAsn: 3.023 ± 1.565
0.605GluPro: 0.605 ± 0.544
2.418GluGln: 2.418 ± 2.177
3.023GluArg: 3.023 ± 1.486
2.418GluSer: 2.418 ± 1.422
2.418GluThr: 2.418 ± 1.009
2.418GluVal: 2.418 ± 1.422
0.605GluTrp: 0.605 ± 0.805
4.232GluTyr: 4.232 ± 1.333
0.0GluXaa: 0.0 ± 0.0
Phe
3.628PheAla: 3.628 ± 1.235
0.0PheCys: 0.0 ± 0.0
3.023PheAsp: 3.023 ± 1.168
1.814PheGlu: 1.814 ± 0.753
1.814PhePhe: 1.814 ± 1.086
3.628PheGly: 3.628 ± 1.778
0.0PheHis: 0.0 ± 0.0
4.232PheIle: 4.232 ± 1.589
3.628PheLys: 3.628 ± 1.917
1.814PheLeu: 1.814 ± 0.861
1.814PheMet: 1.814 ± 0.998
4.232PheAsn: 4.232 ± 1.522
1.209PhePro: 1.209 ± 0.816
1.209PheGln: 1.209 ± 0.504
3.023PheArg: 3.023 ± 1.39
4.232PheSer: 4.232 ± 1.715
2.418PheThr: 2.418 ± 1.006
1.814PheVal: 1.814 ± 1.347
1.209PheTrp: 1.209 ± 0.892
1.209PheTyr: 1.209 ± 0.841
0.0PheXaa: 0.0 ± 0.0
Gly
6.651GlyAla: 6.651 ± 0.977
0.0GlyCys: 0.0 ± 0.0
1.814GlyAsp: 1.814 ± 0.839
3.628GlyGlu: 3.628 ± 1.385
3.023GlyPhe: 3.023 ± 0.941
3.628GlyGly: 3.628 ± 1.27
0.605GlyHis: 0.605 ± 0.421
4.837GlyIle: 4.837 ± 1.866
2.418GlyLys: 2.418 ± 1.188
7.86GlyLeu: 7.86 ± 2.011
3.023GlyMet: 3.023 ± 1.055
0.605GlyAsn: 0.605 ± 0.421
0.0GlyPro: 0.0 ± 0.0
0.605GlyGln: 0.605 ± 0.421
3.628GlyArg: 3.628 ± 0.909
6.651GlySer: 6.651 ± 1.803
4.837GlyThr: 4.837 ± 1.526
1.814GlyVal: 1.814 ± 1.632
0.0GlyTrp: 0.0 ± 0.0
2.418GlyTyr: 2.418 ± 1.188
0.0GlyXaa: 0.0 ± 0.0
His
1.209HisAla: 1.209 ± 0.816
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.209HisGlu: 1.209 ± 0.937
0.605HisPhe: 0.605 ± 0.421
1.814HisGly: 1.814 ± 0.53
0.0HisHis: 0.0 ± 0.0
1.209HisIle: 1.209 ± 0.708
0.0HisLys: 0.0 ± 0.0
0.605HisLeu: 0.605 ± 0.421
0.605HisMet: 0.605 ± 0.544
1.814HisAsn: 1.814 ± 1.109
0.605HisPro: 0.605 ± 0.421
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.814HisSer: 1.814 ± 1.188
0.605HisThr: 0.605 ± 0.544
0.0HisVal: 0.0 ± 0.0
0.605HisTrp: 0.605 ± 0.874
1.209HisTyr: 1.209 ± 0.708
0.0HisXaa: 0.0 ± 0.0
Ile
4.232IleAla: 4.232 ± 0.768
0.605IleCys: 0.605 ± 0.594
6.046IleAsp: 6.046 ± 3.141
3.023IleGlu: 3.023 ± 1.407
1.209IlePhe: 1.209 ± 0.892
4.232IleGly: 4.232 ± 1.495
1.209IleHis: 1.209 ± 0.593
3.023IleIle: 3.023 ± 1.103
1.814IleLys: 1.814 ± 0.53
3.628IleLeu: 3.628 ± 2.156
1.209IleMet: 1.209 ± 0.708
3.023IleAsn: 3.023 ± 1.181
7.255IlePro: 7.255 ± 1.495
1.209IleGln: 1.209 ± 0.708
3.023IleArg: 3.023 ± 1.02
7.255IleSer: 7.255 ± 1.864
1.814IleThr: 1.814 ± 1.262
1.814IleVal: 1.814 ± 1.183
0.605IleTrp: 0.605 ± 0.421
1.209IleTyr: 1.209 ± 0.504
0.0IleXaa: 0.0 ± 0.0
Lys
2.418LysAla: 2.418 ± 1.416
0.605LysCys: 0.605 ± 0.421
1.209LysAsp: 1.209 ± 0.978
2.418LysGlu: 2.418 ± 1.188
1.814LysPhe: 1.814 ± 1.188
1.209LysGly: 1.209 ± 0.776
1.209LysHis: 1.209 ± 0.708
1.814LysIle: 1.814 ± 0.809
3.023LysLys: 3.023 ± 1.485
3.023LysLeu: 3.023 ± 1.296
3.023LysMet: 3.023 ± 1.459
5.441LysAsn: 5.441 ± 0.683
1.814LysPro: 1.814 ± 1.241
1.209LysGln: 1.209 ± 0.708
3.023LysArg: 3.023 ± 2.379
1.209LysSer: 1.209 ± 1.222
4.232LysThr: 4.232 ± 1.913
1.209LysVal: 1.209 ± 0.918
0.605LysTrp: 0.605 ± 0.544
1.814LysTyr: 1.814 ± 1.781
0.0LysXaa: 0.0 ± 0.0
Leu
7.255LeuAla: 7.255 ± 3.307
1.814LeuCys: 1.814 ± 2.158
3.023LeuAsp: 3.023 ± 1.408
3.023LeuGlu: 3.023 ± 0.89
5.441LeuPhe: 5.441 ± 0.987
6.651LeuGly: 6.651 ± 2.255
1.209LeuHis: 1.209 ± 0.892
4.837LeuIle: 4.837 ± 2.53
1.814LeuLys: 1.814 ± 1.347
8.464LeuLeu: 8.464 ± 2.111
1.209LeuMet: 1.209 ± 0.504
6.046LeuAsn: 6.046 ± 1.913
7.255LeuPro: 7.255 ± 3.095
5.441LeuGln: 5.441 ± 1.257
4.232LeuArg: 4.232 ± 1.401
9.069LeuSer: 9.069 ± 2.164
4.837LeuThr: 4.837 ± 1.401
3.628LeuVal: 3.628 ± 1.13
1.209LeuTrp: 1.209 ± 0.593
2.418LeuTyr: 2.418 ± 0.902
0.0LeuXaa: 0.0 ± 0.0
Met
1.814MetAla: 1.814 ± 0.961
0.605MetCys: 0.605 ± 0.594
1.209MetAsp: 1.209 ± 0.504
2.418MetGlu: 2.418 ± 1.175
2.418MetPhe: 2.418 ± 0.914
1.814MetGly: 1.814 ± 0.753
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
3.628MetLys: 3.628 ± 2.714
2.418MetLeu: 2.418 ± 1.119
1.814MetMet: 1.814 ± 0.828
0.605MetAsn: 0.605 ± 0.421
0.0MetPro: 0.0 ± 0.0
3.023MetGln: 3.023 ± 1.435
1.814MetArg: 1.814 ± 0.817
1.814MetSer: 1.814 ± 1.781
2.418MetThr: 2.418 ± 0.946
1.814MetVal: 1.814 ± 0.861
0.605MetTrp: 0.605 ± 0.874
1.209MetTyr: 1.209 ± 0.776
0.0MetXaa: 0.0 ± 0.0
Asn
6.046AsnAla: 6.046 ± 2.182
0.605AsnCys: 0.605 ± 0.421
2.418AsnAsp: 2.418 ± 0.818
3.628AsnGlu: 3.628 ± 1.513
4.837AsnPhe: 4.837 ± 1.369
3.628AsnGly: 3.628 ± 1.828
0.605AsnHis: 0.605 ± 0.594
2.418AsnIle: 2.418 ± 1.613
1.814AsnLys: 1.814 ± 1.262
4.232AsnLeu: 4.232 ± 0.896
2.418AsnMet: 2.418 ± 0.896
6.651AsnAsn: 6.651 ± 1.247
1.814AsnPro: 1.814 ± 0.839
0.605AsnGln: 0.605 ± 0.594
3.023AsnArg: 3.023 ± 1.435
1.209AsnSer: 1.209 ± 0.776
4.232AsnThr: 4.232 ± 0.992
4.232AsnVal: 4.232 ± 1.625
0.605AsnTrp: 0.605 ± 0.594
0.605AsnTyr: 0.605 ± 0.421
0.0AsnXaa: 0.0 ± 0.0
Pro
3.628ProAla: 3.628 ± 1.58
1.209ProCys: 1.209 ± 0.978
3.628ProAsp: 3.628 ± 1.517
3.023ProGlu: 3.023 ± 2.599
2.418ProPhe: 2.418 ± 1.185
0.605ProGly: 0.605 ± 0.421
0.605ProHis: 0.605 ± 0.544
3.023ProIle: 3.023 ± 1.044
0.0ProLys: 0.0 ± 0.0
7.255ProLeu: 7.255 ± 2.169
0.0ProMet: 0.0 ± 0.0
2.418ProAsn: 2.418 ± 1.11
0.605ProPro: 0.605 ± 0.594
1.209ProGln: 1.209 ± 0.841
1.814ProArg: 1.814 ± 0.839
4.837ProSer: 4.837 ± 1.889
2.418ProThr: 2.418 ± 1.11
6.651ProVal: 6.651 ± 3.593
0.0ProTrp: 0.0 ± 0.0
1.814ProTyr: 1.814 ± 1.262
0.0ProXaa: 0.0 ± 0.0
Gln
1.814GlnAla: 1.814 ± 1.114
0.0GlnCys: 0.0 ± 0.0
1.814GlnAsp: 1.814 ± 1.114
1.209GlnGlu: 1.209 ± 1.088
1.814GlnPhe: 1.814 ± 0.961
1.209GlnGly: 1.209 ± 0.841
0.0GlnHis: 0.0 ± 0.0
1.209GlnIle: 1.209 ± 0.504
1.814GlnLys: 1.814 ± 1.114
1.209GlnLeu: 1.209 ± 0.989
1.814GlnMet: 1.814 ± 1.632
2.418GlnAsn: 2.418 ± 0.946
0.605GlnPro: 0.605 ± 0.594
1.209GlnGln: 1.209 ± 1.088
2.418GlnArg: 2.418 ± 0.918
6.651GlnSer: 6.651 ± 2.262
3.628GlnThr: 3.628 ± 0.994
1.209GlnVal: 1.209 ± 0.504
0.0GlnTrp: 0.0 ± 0.0
3.628GlnTyr: 3.628 ± 2.229
0.0GlnXaa: 0.0 ± 0.0
Arg
7.255ArgAla: 7.255 ± 2.184
0.605ArgCys: 0.605 ± 0.836
4.837ArgAsp: 4.837 ± 1.571
4.837ArgGlu: 4.837 ± 1.208
0.0ArgPhe: 0.0 ± 0.0
2.418ArgGly: 2.418 ± 1.678
0.605ArgHis: 0.605 ± 0.544
3.023ArgIle: 3.023 ± 1.473
3.628ArgLys: 3.628 ± 2.164
7.255ArgLeu: 7.255 ± 2.068
1.814ArgMet: 1.814 ± 0.572
0.605ArgAsn: 0.605 ± 0.544
4.232ArgPro: 4.232 ± 1.735
3.023ArgGln: 3.023 ± 0.848
2.418ArgArg: 2.418 ± 1.006
4.232ArgSer: 4.232 ± 1.478
3.023ArgThr: 3.023 ± 0.674
1.814ArgVal: 1.814 ± 1.086
0.605ArgTrp: 0.605 ± 0.594
4.232ArgTyr: 4.232 ± 1.665
0.0ArgXaa: 0.0 ± 0.0
Ser
5.441SerAla: 5.441 ± 2.059
1.209SerCys: 1.209 ± 1.187
6.046SerAsp: 6.046 ± 2.502
4.232SerGlu: 4.232 ± 1.644
6.046SerPhe: 6.046 ± 1.619
4.232SerGly: 4.232 ± 1.368
0.605SerHis: 0.605 ± 0.421
4.837SerIle: 4.837 ± 1.633
1.209SerLys: 1.209 ± 0.593
6.651SerLeu: 6.651 ± 1.864
1.209SerMet: 1.209 ± 1.094
4.232SerAsn: 4.232 ± 1.132
6.651SerPro: 6.651 ± 2.586
2.418SerGln: 2.418 ± 1.11
7.255SerArg: 7.255 ± 2.485
3.023SerSer: 3.023 ± 1.669
4.837SerThr: 4.837 ± 1.873
7.255SerVal: 7.255 ± 2.249
0.0SerTrp: 0.0 ± 0.0
3.628SerTyr: 3.628 ± 1.383
0.0SerXaa: 0.0 ± 0.0
Thr
6.046ThrAla: 6.046 ± 1.246
0.0ThrCys: 0.0 ± 0.0
3.628ThrAsp: 3.628 ± 1.145
2.418ThrGlu: 2.418 ± 1.238
4.232ThrPhe: 4.232 ± 1.102
4.232ThrGly: 4.232 ± 0.832
0.605ThrHis: 0.605 ± 0.544
3.628ThrIle: 3.628 ± 0.814
4.232ThrLys: 4.232 ± 1.761
6.651ThrLeu: 6.651 ± 2.561
1.814ThrMet: 1.814 ± 1.262
2.418ThrAsn: 2.418 ± 0.896
3.628ThrPro: 3.628 ± 1.452
1.209ThrGln: 1.209 ± 0.504
2.418ThrArg: 2.418 ± 1.26
6.046ThrSer: 6.046 ± 2.149
4.837ThrThr: 4.837 ± 1.023
1.814ThrVal: 1.814 ± 0.955
0.605ThrTrp: 0.605 ± 0.421
3.628ThrTyr: 3.628 ± 1.552
0.0ThrXaa: 0.0 ± 0.0
Val
5.441ValAla: 5.441 ± 1.755
1.814ValCys: 1.814 ± 1.715
3.628ValAsp: 3.628 ± 1.18
2.418ValGlu: 2.418 ± 1.119
0.605ValPhe: 0.605 ± 0.594
4.837ValGly: 4.837 ± 1.612
0.605ValHis: 0.605 ± 0.594
3.628ValIle: 3.628 ± 1.775
2.418ValLys: 2.418 ± 0.695
3.023ValLeu: 3.023 ± 1.572
1.209ValMet: 1.209 ± 0.964
2.418ValAsn: 2.418 ± 0.695
2.418ValPro: 2.418 ± 1.021
1.814ValGln: 1.814 ± 1.245
3.628ValArg: 3.628 ± 1.172
5.441ValSer: 5.441 ± 1.575
3.628ValThr: 3.628 ± 2.797
1.209ValVal: 1.209 ± 0.504
0.605ValTrp: 0.605 ± 0.874
3.023ValTyr: 3.023 ± 2.124
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.209TrpAsp: 1.209 ± 0.776
0.0TrpGlu: 0.0 ± 0.0
1.209TrpPhe: 1.209 ± 0.593
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.605TrpIle: 0.605 ± 0.874
0.605TrpLys: 0.605 ± 0.594
1.209TrpLeu: 1.209 ± 1.749
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.209TrpGln: 1.209 ± 1.088
1.209TrpArg: 1.209 ± 0.593
1.209TrpSer: 1.209 ± 0.593
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.605TrpTyr: 0.605 ± 0.874
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.837TyrAla: 4.837 ± 2.119
1.209TyrCys: 1.209 ± 1.187
3.628TyrAsp: 3.628 ± 1.552
0.0TyrGlu: 0.0 ± 0.0
2.418TyrPhe: 2.418 ± 0.886
3.023TyrGly: 3.023 ± 1.057
1.209TyrHis: 1.209 ± 0.593
4.232TyrIle: 4.232 ± 1.064
1.814TyrLys: 1.814 ± 2.414
3.023TyrLeu: 3.023 ± 1.676
1.209TyrMet: 1.209 ± 0.504
1.814TyrAsn: 1.814 ± 1.112
0.0TyrPro: 0.0 ± 0.0
2.418TyrGln: 2.418 ± 1.006
3.628TyrArg: 3.628 ± 1.13
4.837TyrSer: 4.837 ± 1.139
4.232TyrThr: 4.232 ± 2.213
1.814TyrVal: 1.814 ± 1.262
0.0TyrTrp: 0.0 ± 0.0
1.209TyrTyr: 1.209 ± 0.841
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (1655 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski