Amino acid dipepetide frequency for Tortoise microvirus 47

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.758AlaAla: 11.758 ± 3.87
0.588AlaCys: 0.588 ± 0.485
1.176AlaAsp: 1.176 ± 0.533
7.643AlaGlu: 7.643 ± 0.985
4.115AlaPhe: 4.115 ± 1.836
9.994AlaGly: 9.994 ± 3.833
5.291AlaHis: 5.291 ± 1.266
2.939AlaIle: 2.939 ± 1.172
5.291AlaLys: 5.291 ± 0.958
3.527AlaLeu: 3.527 ± 1.435
1.176AlaMet: 1.176 ± 0.505
2.352AlaAsn: 2.352 ± 1.295
5.879AlaPro: 5.879 ± 1.583
2.939AlaGln: 2.939 ± 1.424
8.818AlaArg: 8.818 ± 2.886
5.291AlaSer: 5.291 ± 2.277
5.879AlaThr: 5.879 ± 2.288
7.643AlaVal: 7.643 ± 2.06
0.588AlaTrp: 0.588 ± 0.567
4.115AlaTyr: 4.115 ± 1.33
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.588CysCys: 0.588 ± 0.485
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.588CysPhe: 0.588 ± 0.567
1.176CysGly: 1.176 ± 0.769
0.0CysHis: 0.0 ± 0.0
1.176CysIle: 1.176 ± 0.63
1.176CysLys: 1.176 ± 0.769
0.588CysLeu: 0.588 ± 0.485
0.588CysMet: 0.588 ± 0.567
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.176CysArg: 1.176 ± 0.971
0.0CysSer: 0.0 ± 0.0
0.588CysThr: 0.588 ± 0.452
0.588CysVal: 0.588 ± 0.452
0.588CysTrp: 0.588 ± 0.485
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.879AspAla: 5.879 ± 1.102
0.0AspCys: 0.0 ± 0.0
4.115AspAsp: 4.115 ± 1.03
2.939AspGlu: 2.939 ± 1.793
2.352AspPhe: 2.352 ± 0.808
4.703AspGly: 4.703 ± 1.065
1.176AspHis: 1.176 ± 0.5
2.352AspIle: 2.352 ± 1.331
1.176AspLys: 1.176 ± 0.745
2.939AspLeu: 2.939 ± 0.862
1.764AspMet: 1.764 ± 0.939
0.588AspAsn: 0.588 ± 0.58
5.291AspPro: 5.291 ± 0.72
3.527AspGln: 3.527 ± 2.087
2.352AspArg: 2.352 ± 0.872
2.939AspSer: 2.939 ± 0.607
1.176AspThr: 1.176 ± 0.903
4.703AspVal: 4.703 ± 2.65
2.939AspTrp: 2.939 ± 0.862
1.764AspTyr: 1.764 ± 1.169
0.0AspXaa: 0.0 ± 0.0
Glu
5.879GluAla: 5.879 ± 2.147
0.588GluCys: 0.588 ± 0.485
1.764GluAsp: 1.764 ± 1.355
2.352GluGlu: 2.352 ± 0.872
1.764GluPhe: 1.764 ± 0.748
4.703GluGly: 4.703 ± 1.199
0.588GluHis: 0.588 ± 0.485
3.527GluIle: 3.527 ± 0.765
4.115GluLys: 4.115 ± 1.764
4.703GluLeu: 4.703 ± 0.805
2.352GluMet: 2.352 ± 0.641
5.291GluAsn: 5.291 ± 1.696
5.291GluPro: 5.291 ± 1.195
5.291GluGln: 5.291 ± 1.613
7.055GluArg: 7.055 ± 1.573
3.527GluSer: 3.527 ± 1.088
2.939GluThr: 2.939 ± 1.341
4.703GluVal: 4.703 ± 1.958
1.176GluTrp: 1.176 ± 1.274
0.588GluTyr: 0.588 ± 0.485
0.0GluXaa: 0.0 ± 0.0
Phe
2.352PheAla: 2.352 ± 1.005
1.176PheCys: 1.176 ± 0.646
2.352PheAsp: 2.352 ± 0.859
2.939PheGlu: 2.939 ± 1.14
0.588PhePhe: 0.588 ± 0.637
4.115PheGly: 4.115 ± 1.278
1.176PheHis: 1.176 ± 0.646
1.176PheIle: 1.176 ± 0.592
2.352PheLys: 2.352 ± 0.773
2.352PheLeu: 2.352 ± 0.641
1.176PheMet: 1.176 ± 0.769
2.352PheAsn: 2.352 ± 1.114
1.764PhePro: 1.764 ± 0.876
0.0PheGln: 0.0 ± 0.0
1.176PheArg: 1.176 ± 0.5
2.352PheSer: 2.352 ± 1.265
0.588PheThr: 0.588 ± 0.509
2.352PheVal: 2.352 ± 1.325
0.588PheTrp: 0.588 ± 0.452
2.939PheTyr: 2.939 ± 0.617
0.0PheXaa: 0.0 ± 0.0
Gly
8.818GlyAla: 8.818 ± 2.669
0.0GlyCys: 0.0 ± 0.0
4.115GlyAsp: 4.115 ± 1.232
6.467GlyGlu: 6.467 ± 2.109
2.352GlyPhe: 2.352 ± 1.779
7.643GlyGly: 7.643 ± 1.002
2.352GlyHis: 2.352 ± 1.031
3.527GlyIle: 3.527 ± 1.553
4.703GlyLys: 4.703 ± 0.912
3.527GlyLeu: 3.527 ± 0.762
1.764GlyMet: 1.764 ± 0.748
1.764GlyAsn: 1.764 ± 0.873
3.527GlyPro: 3.527 ± 1.353
5.291GlyGln: 5.291 ± 1.826
6.467GlyArg: 6.467 ± 0.842
4.703GlySer: 4.703 ± 2.914
6.467GlyThr: 6.467 ± 2.228
2.352GlyVal: 2.352 ± 0.728
2.352GlyTrp: 2.352 ± 1.423
1.764GlyTyr: 1.764 ± 0.876
0.0GlyXaa: 0.0 ± 0.0
His
1.176HisAla: 1.176 ± 1.274
1.176HisCys: 1.176 ± 0.891
2.352HisAsp: 2.352 ± 1.246
1.176HisGlu: 1.176 ± 0.971
0.588HisPhe: 0.588 ± 0.485
2.939HisGly: 2.939 ± 0.532
0.0HisHis: 0.0 ± 0.0
1.764HisIle: 1.764 ± 0.876
0.588HisLys: 0.588 ± 0.485
1.764HisLeu: 1.764 ± 0.666
0.0HisMet: 0.0 ± 0.0
0.588HisAsn: 0.588 ± 0.567
0.588HisPro: 0.588 ± 0.567
0.0HisGln: 0.0 ± 0.0
1.176HisArg: 1.176 ± 0.74
1.176HisSer: 1.176 ± 1.135
1.176HisThr: 1.176 ± 0.903
0.588HisVal: 0.588 ± 0.637
1.764HisTrp: 1.764 ± 0.993
0.588HisTyr: 0.588 ± 0.637
0.0HisXaa: 0.0 ± 0.0
Ile
3.527IleAla: 3.527 ± 1.203
0.0IleCys: 0.0 ± 0.0
1.764IleAsp: 1.764 ± 0.82
4.703IleGlu: 4.703 ± 1.919
2.352IlePhe: 2.352 ± 0.641
4.115IleGly: 4.115 ± 1.01
0.588IleHis: 0.588 ± 0.567
3.527IleIle: 3.527 ± 1.707
1.176IleLys: 1.176 ± 0.769
1.176IleLeu: 1.176 ± 0.5
1.764IleMet: 1.764 ± 1.051
2.352IleAsn: 2.352 ± 1.331
2.939IlePro: 2.939 ± 1.051
1.176IleGln: 1.176 ± 0.592
4.115IleArg: 4.115 ± 1.302
1.764IleSer: 1.764 ± 0.645
2.939IleThr: 2.939 ± 1.753
2.352IleVal: 2.352 ± 1.267
1.764IleTrp: 1.764 ± 0.82
1.176IleTyr: 1.176 ± 0.903
0.0IleXaa: 0.0 ± 0.0
Lys
5.291LysAla: 5.291 ± 3.042
0.0LysCys: 0.0 ± 0.0
0.588LysAsp: 0.588 ± 0.637
3.527LysGlu: 3.527 ± 0.883
1.176LysPhe: 1.176 ± 0.641
2.939LysGly: 2.939 ± 1.515
1.764LysHis: 1.764 ± 0.632
1.764LysIle: 1.764 ± 1.456
3.527LysLys: 3.527 ± 1.743
2.352LysLeu: 2.352 ± 1.001
2.352LysMet: 2.352 ± 0.877
3.527LysAsn: 3.527 ± 1.322
4.703LysPro: 4.703 ± 1.229
2.939LysGln: 2.939 ± 0.973
3.527LysArg: 3.527 ± 1.781
2.352LysSer: 2.352 ± 1.821
2.352LysThr: 2.352 ± 0.824
2.352LysVal: 2.352 ± 0.699
0.0LysTrp: 0.0 ± 0.0
2.939LysTyr: 2.939 ± 1.88
0.0LysXaa: 0.0 ± 0.0
Leu
9.406LeuAla: 9.406 ± 2.122
0.588LeuCys: 0.588 ± 0.452
4.703LeuAsp: 4.703 ± 0.512
4.703LeuGlu: 4.703 ± 1.632
1.764LeuPhe: 1.764 ± 0.551
5.291LeuGly: 5.291 ± 1.266
2.939LeuHis: 2.939 ± 0.717
0.588LeuIle: 0.588 ± 0.452
2.352LeuLys: 2.352 ± 1.222
2.352LeuLeu: 2.352 ± 1.226
1.176LeuMet: 1.176 ± 0.903
4.703LeuAsn: 4.703 ± 0.796
2.939LeuPro: 2.939 ± 1.686
2.939LeuGln: 2.939 ± 0.772
5.879LeuArg: 5.879 ± 1.413
4.115LeuSer: 4.115 ± 1.583
4.115LeuThr: 4.115 ± 1.232
1.764LeuVal: 1.764 ± 0.551
0.0LeuTrp: 0.0 ± 0.0
1.764LeuTyr: 1.764 ± 0.751
0.0LeuXaa: 0.0 ± 0.0
Met
2.939MetAla: 2.939 ± 0.77
1.176MetCys: 1.176 ± 0.646
1.176MetAsp: 1.176 ± 0.903
2.939MetGlu: 2.939 ± 1.065
1.764MetPhe: 1.764 ± 0.496
2.352MetGly: 2.352 ± 1.506
0.588MetHis: 0.588 ± 0.452
1.764MetIle: 1.764 ± 0.582
0.0MetLys: 0.0 ± 0.0
1.764MetLeu: 1.764 ± 1.169
0.588MetMet: 0.588 ± 0.485
1.764MetAsn: 1.764 ± 0.551
1.176MetPro: 1.176 ± 0.5
0.588MetGln: 0.588 ± 0.509
1.764MetArg: 1.764 ± 0.645
0.588MetSer: 0.588 ± 0.452
1.176MetThr: 1.176 ± 0.74
1.764MetVal: 1.764 ± 1.169
0.588MetTrp: 0.588 ± 0.567
0.588MetTyr: 0.588 ± 0.509
0.0MetXaa: 0.0 ± 0.0
Asn
4.115AsnAla: 4.115 ± 1.26
0.588AsnCys: 0.588 ± 0.452
1.764AsnAsp: 1.764 ± 0.582
1.176AsnGlu: 1.176 ± 0.74
1.764AsnPhe: 1.764 ± 0.582
4.115AsnGly: 4.115 ± 1.294
0.0AsnHis: 0.0 ± 0.0
4.115AsnIle: 4.115 ± 0.816
1.764AsnLys: 1.764 ± 0.871
5.291AsnLeu: 5.291 ± 1.761
1.176AsnMet: 1.176 ± 0.533
0.0AsnAsn: 0.0 ± 0.0
4.115AsnPro: 4.115 ± 1.302
1.764AsnGln: 1.764 ± 0.582
3.527AsnArg: 3.527 ± 1.858
1.764AsnSer: 1.764 ± 0.666
2.352AsnThr: 2.352 ± 1.232
3.527AsnVal: 3.527 ± 0.936
0.0AsnTrp: 0.0 ± 0.0
0.588AsnTyr: 0.588 ± 0.567
0.0AsnXaa: 0.0 ± 0.0
Pro
4.703ProAla: 4.703 ± 1.236
0.0ProCys: 0.0 ± 0.0
8.23ProAsp: 8.23 ± 2.233
4.115ProGlu: 4.115 ± 1.167
2.352ProPhe: 2.352 ± 1.226
4.115ProGly: 4.115 ± 2.775
1.764ProHis: 1.764 ± 0.751
1.764ProIle: 1.764 ± 0.645
2.352ProLys: 2.352 ± 1.048
5.879ProLeu: 5.879 ± 1.095
2.939ProMet: 2.939 ± 0.755
1.176ProAsn: 1.176 ± 0.616
4.115ProPro: 4.115 ± 1.295
2.352ProGln: 2.352 ± 0.877
3.527ProArg: 3.527 ± 0.546
3.527ProSer: 3.527 ± 1.458
3.527ProThr: 3.527 ± 1.304
3.527ProVal: 3.527 ± 1.295
1.176ProTrp: 1.176 ± 0.769
2.352ProTyr: 2.352 ± 0.773
0.0ProXaa: 0.0 ± 0.0
Gln
6.467GlnAla: 6.467 ± 0.9
0.588GlnCys: 0.588 ± 0.485
2.352GlnAsp: 2.352 ± 0.877
1.764GlnGlu: 1.764 ± 0.692
0.588GlnPhe: 0.588 ± 0.485
2.352GlnGly: 2.352 ± 1.001
0.0GlnHis: 0.0 ± 0.0
2.352GlnIle: 2.352 ± 0.95
2.352GlnLys: 2.352 ± 1.022
4.703GlnLeu: 4.703 ± 1.677
1.176GlnMet: 1.176 ± 1.274
0.588GlnAsn: 0.588 ± 0.58
2.939GlnPro: 2.939 ± 1.065
2.352GlnGln: 2.352 ± 1.005
2.939GlnArg: 2.939 ± 1.293
2.352GlnSer: 2.352 ± 0.859
4.703GlnThr: 4.703 ± 1.881
1.176GlnVal: 1.176 ± 0.5
1.176GlnTrp: 1.176 ± 0.744
2.939GlnTyr: 2.939 ± 0.651
0.0GlnXaa: 0.0 ± 0.0
Arg
6.467ArgAla: 6.467 ± 2.075
0.0ArgCys: 0.0 ± 0.0
2.352ArgAsp: 2.352 ± 0.778
5.879ArgGlu: 5.879 ± 1.304
5.291ArgPhe: 5.291 ± 1.101
3.527ArgGly: 3.527 ± 1.304
0.588ArgHis: 0.588 ± 0.509
4.703ArgIle: 4.703 ± 1.861
4.703ArgLys: 4.703 ± 1.889
6.467ArgLeu: 6.467 ± 1.619
1.176ArgMet: 1.176 ± 0.533
3.527ArgAsn: 3.527 ± 1.504
4.703ArgPro: 4.703 ± 0.757
5.291ArgGln: 5.291 ± 0.848
5.291ArgArg: 5.291 ± 2.197
3.527ArgSer: 3.527 ± 1.776
4.703ArgThr: 4.703 ± 0.805
2.352ArgVal: 2.352 ± 1.022
1.764ArgTrp: 1.764 ± 0.666
2.352ArgTyr: 2.352 ± 1.325
0.0ArgXaa: 0.0 ± 0.0
Ser
4.703SerAla: 4.703 ± 1.297
0.0SerCys: 0.0 ± 0.0
2.352SerAsp: 2.352 ± 0.824
4.703SerGlu: 4.703 ± 1.572
1.764SerPhe: 1.764 ± 0.582
5.291SerGly: 5.291 ± 2.196
0.588SerHis: 0.588 ± 0.452
2.939SerIle: 2.939 ± 1.064
4.703SerLys: 4.703 ± 1.495
3.527SerLeu: 3.527 ± 0.765
2.939SerMet: 2.939 ± 0.792
1.176SerAsn: 1.176 ± 0.63
3.527SerPro: 3.527 ± 1.897
1.176SerGln: 1.176 ± 1.018
1.764SerArg: 1.764 ± 0.953
2.352SerSer: 2.352 ± 1.456
2.939SerThr: 2.939 ± 1.096
2.939SerVal: 2.939 ± 1.139
0.0SerTrp: 0.0 ± 0.0
1.764SerTyr: 1.764 ± 0.876
0.0SerXaa: 0.0 ± 0.0
Thr
5.291ThrAla: 5.291 ± 1.581
0.0ThrCys: 0.0 ± 0.0
3.527ThrAsp: 3.527 ± 1.607
2.939ThrGlu: 2.939 ± 1.073
2.352ThrPhe: 2.352 ± 1.066
3.527ThrGly: 3.527 ± 1.53
0.0ThrHis: 0.0 ± 0.0
2.939ThrIle: 2.939 ± 1.142
2.939ThrLys: 2.939 ± 0.532
3.527ThrLeu: 3.527 ± 1.102
0.0ThrMet: 0.0 ± 0.0
4.703ThrAsn: 4.703 ± 1.807
1.176ThrPro: 1.176 ± 0.903
3.527ThrGln: 3.527 ± 1.641
4.703ThrArg: 4.703 ± 2.037
2.939ThrSer: 2.939 ± 1.254
5.879ThrThr: 5.879 ± 2.58
4.115ThrVal: 4.115 ± 0.659
0.588ThrTrp: 0.588 ± 0.567
2.352ThrTyr: 2.352 ± 1.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.115ValAla: 4.115 ± 0.846
0.588ValCys: 0.588 ± 0.452
5.879ValAsp: 5.879 ± 0.894
5.291ValGlu: 5.291 ± 1.32
1.176ValPhe: 1.176 ± 0.903
2.939ValGly: 2.939 ± 1.796
0.0ValHis: 0.0 ± 0.0
1.176ValIle: 1.176 ± 0.861
3.527ValLys: 3.527 ± 0.675
2.352ValLeu: 2.352 ± 0.699
2.352ValMet: 2.352 ± 0.699
3.527ValAsn: 3.527 ± 0.936
7.643ValPro: 7.643 ± 1.077
2.939ValGln: 2.939 ± 1.455
5.291ValArg: 5.291 ± 2.041
2.352ValSer: 2.352 ± 1.1
2.352ValThr: 2.352 ± 1.226
5.291ValVal: 5.291 ± 1.192
0.0ValTrp: 0.0 ± 0.0
1.176ValTyr: 1.176 ± 0.646
0.0ValXaa: 0.0 ± 0.0
Trp
1.176TrpAla: 1.176 ± 0.5
0.588TrpCys: 0.588 ± 0.58
2.939TrpAsp: 2.939 ± 1.585
0.588TrpGlu: 0.588 ± 0.567
0.0TrpPhe: 0.0 ± 0.0
2.939TrpGly: 2.939 ± 1.865
1.764TrpHis: 1.764 ± 1.084
0.588TrpIle: 0.588 ± 0.452
0.588TrpLys: 0.588 ± 0.509
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.176TrpAsn: 1.176 ± 0.903
0.588TrpPro: 0.588 ± 0.485
1.176TrpGln: 1.176 ± 0.971
2.352TrpArg: 2.352 ± 1.103
1.176TrpSer: 1.176 ± 0.5
0.588TrpThr: 0.588 ± 0.637
0.588TrpVal: 0.588 ± 0.485
0.588TrpTrp: 0.588 ± 0.637
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.939TyrAla: 2.939 ± 1.534
0.588TyrCys: 0.588 ± 0.485
1.176TyrAsp: 1.176 ± 0.646
2.352TyrGlu: 2.352 ± 1.226
1.176TyrPhe: 1.176 ± 0.971
1.764TyrGly: 1.764 ± 0.993
0.0TyrHis: 0.0 ± 0.0
0.588TyrIle: 0.588 ± 0.452
0.588TyrLys: 0.588 ± 0.485
5.291TyrLeu: 5.291 ± 2.299
0.0TyrMet: 0.0 ± 0.0
2.352TyrAsn: 2.352 ± 0.729
0.588TyrPro: 0.588 ± 0.485
0.588TyrGln: 0.588 ± 0.509
1.764TyrArg: 1.764 ± 0.666
2.352TyrSer: 2.352 ± 1.942
0.588TyrThr: 0.588 ± 0.452
5.291TyrVal: 5.291 ± 2.397
1.764TyrTrp: 1.764 ± 1.084
1.176TyrTyr: 1.176 ± 0.971
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1702 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski