Amino acid dipeptide frequency for Tortoise microvirus 99

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
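The calculation described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the function names are hypothetical, dipeptides are assumed to be counted as overlapping adjacent pairs within each protein, and the over/under classification simply compares against the uniform 2.5‰ expectation (the threshold actually used for the red/blue colouring is not stated on this page).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)
EXPECTED_PER_MILLE = 1000.0 / 400   # uniform expectation: 0.25% = 2.5 per mille


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides.

    Assumption: overlapping adjacent pairs are counted within each protein
    and pooled over all proteins before normalising.
    """
    counts = Counter()
    for seq in proteins:
        # Map every symbol outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if per_mille > EXPECTED_PER_MILLE:
        return "over"
    if per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"
```

With the values from the table, `classify(14.525)` (SerSer) gives "over" and `classify(0.559)` (AlaTrp) gives "under".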

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.
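As a small worked example of the unit conversion, the helpers below (hypothetical names, written only for illustration) turn a per mille value into a fraction or a percentage.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


# SerSer is listed at 14.525 per mille, i.e. about 0.0145 as a fraction.
serser_fraction = per_mille_to_fraction(14.525)

# The uniform expectation of 2.5 per mille corresponds to 0.25 %.
expected_percent = per_mille_to_percent(2.5)
```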

Rows give the first residue of the dipeptide, columns the second; each cell is the frequency ± error in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 5.028 ± 2.907 | 1.676 ± 0.761 | 3.352 ± 1.281 | 5.028 ± 1.939 | 3.352 ± 1.233 | 5.028 ± 2.152 | 1.117 ± 0.551 | 3.352 ± 1.329 | 5.028 ± 2.172 | 7.263 ± 1.02 | 1.676 ± 1.276 | 3.911 ± 1.641 | 3.911 ± 1.406 | 2.793 ± 1.343 | 1.117 ± 0.703 | 11.173 ± 1.847 | 2.235 ± 1.059 | 4.469 ± 1.634 | 0.559 ± 0.405 | 5.028 ± 1.468 | 0.0 ± 0.0
Cys | 1.117 ± 1.091 | 1.117 ± 1.128 | 2.235 ± 1.132 | 0.0 ± 0.0 | 1.676 ± 1.31 | 0.559 ± 0.564 | 0.0 ± 0.0 | 0.559 ± 0.405 | 2.235 ± 1.399 | 1.117 ± 0.964 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.559 ± 0.405 | 2.235 ± 0.976 | 0.0 ± 0.0 | 1.117 ± 0.703 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.235 ± 1.815 | 0.0 ± 0.0
Asp | 5.587 ± 1.804 | 2.235 ± 1.102 | 6.145 ± 1.82 | 3.911 ± 1.043 | 4.469 ± 1.287 | 2.235 ± 1.266 | 0.559 ± 0.646 | 6.145 ± 2.499 | 2.793 ± 0.896 | 3.352 ± 1.103 | 1.117 ± 0.668 | 2.793 ± 0.975 | 1.117 ± 0.792 | 1.676 ± 1.276 | 2.235 ± 0.826 | 2.793 ± 1.493 | 5.587 ± 1.795 | 3.352 ± 1.81 | 0.559 ± 0.564 | 3.911 ± 1.28 | 0.0 ± 0.0
Glu | 7.821 ± 3.37 | 1.676 ± 1.844 | 1.117 ± 0.996 | 1.117 ± 1.034 | 1.676 ± 1.142 | 0.0 ± 0.0 | 0.559 ± 0.58 | 3.911 ± 1.435 | 1.676 ± 0.986 | 7.821 ± 1.515 | 0.559 ± 0.812 | 0.559 ± 0.812 | 2.235 ± 1.191 | 0.559 ± 0.517 | 3.911 ± 1.171 | 2.793 ± 1.684 | 4.469 ± 1.098 | 3.352 ± 1.085 | 0.559 ± 0.405 | 2.235 ± 1.032 | 0.0 ± 0.0
Phe | 2.235 ± 1.055 | 1.117 ± 0.996 | 3.911 ± 1.913 | 2.793 ± 0.923 | 3.352 ± 1.627 | 2.235 ± 0.579 | 0.559 ± 0.646 | 2.235 ± 1.132 | 3.352 ± 1.255 | 3.352 ± 1.222 | 3.352 ± 1.8 | 3.911 ± 1.851 | 3.911 ± 1.064 | 2.235 ± 0.679 | 2.235 ± 0.922 | 2.793 ± 0.814 | 3.352 ± 1.878 | 3.352 ± 1.626 | 0.559 ± 0.405 | 4.469 ± 0.952 | 0.0 ± 0.0
Gly | 3.352 ± 1.172 | 0.559 ± 0.405 | 2.235 ± 1.323 | 2.235 ± 1.369 | 5.028 ± 1.831 | 3.352 ± 1.412 | 1.676 ± 0.841 | 3.352 ± 1.167 | 3.911 ± 1.068 | 5.587 ± 1.792 | 1.117 ± 0.754 | 2.235 ± 1.62 | 0.0 ± 0.0 | 0.559 ± 0.517 | 1.676 ± 0.481 | 6.145 ± 1.235 | 2.793 ± 0.876 | 2.235 ± 1.074 | 0.0 ± 0.0 | 4.469 ± 0.871 | 0.0 ± 0.0
His | 2.793 ± 0.896 | 0.0 ± 0.0 | 0.559 ± 0.405 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.676 ± 1.107 | 0.0 ± 0.0 | 0.559 ± 0.405 | 0.559 ± 0.646 | 1.676 ± 0.788 | 0.0 ± 0.0 | 1.117 ± 0.792 | 1.676 ± 1.285 | 0.0 ± 0.0 | 1.117 ± 0.598 | 3.911 ± 1.618 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.559 ± 0.564 | 1.117 ± 0.996 | 0.0 ± 0.0
Ile | 3.352 ± 1.021 | 0.0 ± 0.0 | 3.352 ± 2.019 | 3.352 ± 1.085 | 2.793 ± 0.963 | 2.793 ± 1.659 | 1.117 ± 0.81 | 1.676 ± 0.936 | 1.117 ± 1.16 | 6.145 ± 2.539 | 0.559 ± 0.405 | 3.911 ± 1.121 | 3.352 ± 2.033 | 1.117 ± 0.913 | 4.469 ± 1.829 | 4.469 ± 2.23 | 1.117 ± 0.79 | 1.676 ± 1.244 | 0.559 ± 0.405 | 3.911 ± 1.643 | 0.0 ± 0.0
Lys | 1.676 ± 1.288 | 0.559 ± 0.58 | 2.793 ± 1.64 | 3.911 ± 1.226 | 1.676 ± 1.308 | 1.676 ± 0.481 | 0.0 ± 0.0 | 2.235 ± 1.691 | 1.117 ± 0.844 | 2.793 ± 1.24 | 0.559 ± 0.984 | 2.235 ± 1.128 | 1.117 ± 0.913 | 1.117 ± 0.853 | 3.911 ± 2.197 | 5.028 ± 1.568 | 3.352 ± 0.759 | 2.793 ± 0.942 | 0.559 ± 0.646 | 1.117 ± 1.128 | 0.0 ± 0.0
Leu | 7.821 ± 1.948 | 1.117 ± 1.133 | 6.704 ± 1.222 | 3.911 ± 1.218 | 3.352 ± 1.316 | 6.145 ± 1.806 | 1.117 ± 0.79 | 5.587 ± 1.766 | 3.911 ± 2.092 | 5.587 ± 2.0 | 2.235 ± 0.944 | 4.469 ± 2.244 | 6.704 ± 2.237 | 1.676 ± 1.215 | 6.704 ± 1.842 | 10.056 ± 2.048 | 6.704 ± 1.762 | 5.028 ± 1.58 | 0.559 ± 0.405 | 5.028 ± 1.498 | 0.0 ± 0.0
Met | 2.793 ± 1.097 | 0.559 ± 0.58 | 1.117 ± 0.471 | 1.676 ± 1.051 | 0.559 ± 0.826 | 2.235 ± 1.191 | 1.117 ± 0.754 | 0.0 ± 0.0 | 1.676 ± 1.201 | 2.235 ± 1.311 | 0.0 ± 0.0 | 0.559 ± 0.517 | 0.559 ± 0.517 | 0.559 ± 0.517 | 1.676 ± 0.71 | 4.469 ± 0.793 | 1.676 ± 0.603 | 0.559 ± 0.826 | 0.559 ± 0.405 | 1.676 ± 0.936 | 0.0 ± 0.0
Asn | 2.235 ± 1.074 | 0.0 ± 0.0 | 2.793 ± 1.493 | 0.559 ± 0.517 | 1.117 ± 0.996 | 3.911 ± 2.124 | 0.0 ± 0.0 | 2.235 ± 1.055 | 2.793 ± 0.876 | 5.028 ± 2.161 | 0.559 ± 0.564 | 1.676 ± 0.71 | 2.235 ± 1.014 | 3.352 ± 0.835 | 3.352 ± 2.03 | 4.469 ± 2.266 | 4.469 ± 1.698 | 1.117 ± 0.668 | 0.0 ± 0.0 | 2.793 ± 0.814 | 0.0 ± 0.0
Pro | 1.117 ± 0.79 | 0.0 ± 0.0 | 3.911 ± 1.502 | 1.676 ± 1.285 | 2.793 ± 0.876 | 0.559 ± 0.517 | 0.559 ± 0.517 | 1.676 ± 1.648 | 1.117 ± 0.853 | 7.821 ± 1.157 | 0.559 ± 0.517 | 0.559 ± 0.405 | 1.117 ± 0.831 | 2.793 ± 0.931 | 1.676 ± 0.761 | 7.263 ± 2.107 | 2.235 ± 1.062 | 3.352 ± 1.305 | 0.0 ± 0.0 | 2.235 ± 1.62 | 0.0 ± 0.0
Gln | 1.676 ± 1.551 | 0.0 ± 0.0 | 1.117 ± 0.471 | 1.676 ± 1.551 | 5.028 ± 1.008 | 1.117 ± 0.598 | 0.0 ± 0.0 | 1.117 ± 0.471 | 1.676 ± 1.551 | 1.117 ± 0.942 | 1.676 ± 0.481 | 1.117 ± 0.551 | 0.559 ± 0.405 | 4.469 ± 3.434 | 5.028 ± 1.529 | 3.911 ± 1.166 | 3.352 ± 1.088 | 2.235 ± 0.721 | 0.559 ± 0.517 | 0.559 ± 0.517 | 0.0 ± 0.0
Arg | 6.145 ± 1.296 | 0.559 ± 0.646 | 2.793 ± 1.299 | 2.793 ± 1.696 | 2.793 ± 0.814 | 3.911 ± 1.562 | 1.117 ± 0.942 | 1.676 ± 1.046 | 2.235 ± 1.428 | 7.821 ± 2.14 | 3.352 ± 1.313 | 3.352 ± 0.961 | 2.235 ± 1.102 | 2.793 ± 1.898 | 2.235 ± 0.79 | 5.587 ± 1.865 | 1.117 ± 0.471 | 2.235 ± 1.849 | 0.559 ± 0.517 | 3.911 ± 0.844 | 0.0 ± 0.0
Ser | 8.38 ± 1.981 | 2.235 ± 0.883 | 6.704 ± 1.377 | 7.263 ± 2.514 | 6.145 ± 1.779 | 8.38 ± 3.66 | 2.235 ± 0.936 | 5.587 ± 2.231 | 0.559 ± 0.405 | 9.497 ± 2.333 | 4.469 ± 1.262 | 5.028 ± 1.44 | 3.352 ± 1.362 | 3.911 ± 1.753 | 3.911 ± 0.928 | 14.525 ± 3.118 | 5.587 ± 1.684 | 2.235 ± 0.721 | 1.117 ± 0.598 | 3.352 ± 1.248 | 0.0 ± 0.0
Thr | 7.821 ± 2.592 | 0.559 ± 0.405 | 3.352 ± 1.198 | 1.117 ± 0.844 | 3.911 ± 1.894 | 2.793 ± 1.557 | 1.117 ± 0.996 | 2.793 ± 1.101 | 1.676 ± 0.788 | 3.352 ± 1.456 | 1.676 ± 0.863 | 1.117 ± 0.792 | 2.235 ± 1.122 | 3.911 ± 2.289 | 2.793 ± 1.231 | 3.911 ± 1.484 | 3.911 ± 2.216 | 3.352 ± 1.12 | 0.559 ± 0.405 | 3.911 ± 1.0 | 0.0 ± 0.0
Val | 3.911 ± 0.937 | 1.117 ± 1.049 | 3.911 ± 2.241 | 3.911 ± 1.814 | 2.235 ± 0.976 | 1.117 ± 0.674 | 0.559 ± 0.812 | 1.676 ± 1.064 | 2.235 ± 1.136 | 4.469 ± 1.439 | 1.117 ± 0.593 | 2.235 ± 1.393 | 5.587 ± 1.916 | 0.559 ± 0.826 | 2.793 ± 1.124 | 5.028 ± 1.547 | 1.117 ± 0.773 | 2.235 ± 0.897 | 1.117 ± 0.598 | 0.0 ± 0.0 | 0.0 ± 0.0
Trp | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.559 ± 0.646 | 1.117 ± 0.79 | 0.0 ± 0.0 | 1.676 ± 1.107 | 1.117 ± 0.81 | 0.0 ± 0.0 | 1.676 ± 1.131 | 0.0 ± 0.0 | 0.559 ± 0.405 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.235 ± 1.055 | 0.559 ± 0.405 | 0.559 ± 0.405 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Tyr | 2.235 ± 0.953 | 1.676 ± 1.244 | 4.469 ± 1.768 | 1.117 ± 0.754 | 2.235 ± 0.85 | 2.793 ± 0.992 | 2.235 ± 1.229 | 3.352 ± 2.024 | 1.117 ± 0.674 | 6.704 ± 1.659 | 1.117 ± 0.901 | 3.352 ± 1.166 | 1.117 ± 0.844 | 3.352 ± 0.961 | 3.911 ± 1.434 | 5.587 ± 1.751 | 1.117 ± 0.81 | 3.352 ± 2.242 | 1.117 ± 0.79 | 3.911 ± 2.321 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 8 proteins (1791 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
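Protein-level bootstrapping can be sketched as below. The exact procedure used by Proteome-pI is not described on this page, so this is only one plausible reading: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The function names, the fixed seed, and the use of the population standard deviation are assumptions made for illustration.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide among all adjacent residue pairs."""
    count = 0
    total = 0
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        total += len(pairs)
        count += pairs.count(dipeptide)
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) whole proteins with replacement,
    so the resampling unit is the protein, not the residue.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.pstdev(estimates)
```

Under this scheme a dipeptide that never occurs in any protein gets an error of 0.0, which matches the 0.0 ± 0.0 entries in the table.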

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski