Amino acid dipeptide frequency for Tortoise microvirus 70

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
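As an illustration of what such a frequency means, here is a minimal Python sketch that counts overlapping residue pairs and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter sequence encoding, the toy sequences, and the choice not to count pairs that span two proteins are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: pairs are counted within each protein only, never across
    the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not taken from this proteome.
toy = ["MAAGK", "AGA"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

A protein of length n contributes n - 1 overlapping pairs, so the two toy sequences give six pairs in total and the frequencies sum to 1000‰.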

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
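The uniform expectation can be checked with a few lines of Python. This is only a sketch: the function names are hypothetical, and the simple greater-than or less-than comparison is an assumption, since the page does not state the exact rule used for the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(expected_per_mille(20))            # 400 dipeptides
print(round(expected_per_mille(21), 3))  # 441 dipeptides, with Xaa
print(classify(5.646))                   # AlaAla value from the table
print(classify(0.627))                   # CysAla value from the table
```

With 20 amino acids the expectation is 2.5‰, so AlaAla (5.646‰) falls above it and CysAla (0.627‰) below it.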

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
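For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper names below are hypothetical and exist only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


gly_gly = 11.92  # GlyGly value from the table, in per mille
print(f"{per_mille_to_fraction(gly_gly):.5f}")
print(f"{per_mille_to_percent(gly_gly):.3f}%")
```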

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) ± error, grouped by first residue. Second residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.646 ± 2.403
AlaCys: 0.0 ± 0.0
AlaAsp: 1.255 ± 0.589
AlaGlu: 6.901 ± 1.676
AlaPhe: 6.901 ± 1.991
AlaGly: 9.41 ± 3.095
AlaHis: 2.509 ± 1.237
AlaIle: 2.509 ± 1.575
AlaLys: 2.509 ± 0.601
AlaLeu: 8.156 ± 2.385
AlaMet: 3.137 ± 1.086
AlaAsn: 3.764 ± 1.534
AlaPro: 7.528 ± 2.851
AlaGln: 3.137 ± 1.554
AlaArg: 6.901 ± 2.196
AlaSer: 6.901 ± 2.181
AlaThr: 3.137 ± 1.109
AlaVal: 6.274 ± 1.527
AlaTrp: 0.627 ± 0.539
AlaTyr: 3.137 ± 1.214
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.627 ± 0.551
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.627 ± 0.551
CysPhe: 0.0 ± 0.0
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.627 ± 0.465
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.627 ± 0.724
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.764 ± 1.105
AspCys: 0.0 ± 0.0
AspAsp: 6.274 ± 1.373
AspGlu: 3.764 ± 0.466
AspPhe: 4.391 ± 1.323
AspGly: 3.764 ± 0.805
AspHis: 1.255 ± 0.93
AspIle: 3.764 ± 1.477
AspLys: 2.509 ± 0.601
AspLeu: 3.764 ± 1.345
AspMet: 1.882 ± 0.9
AspAsn: 1.882 ± 1.032
AspPro: 3.764 ± 0.973
AspGln: 3.764 ± 1.534
AspArg: 8.156 ± 3.282
AspSer: 1.882 ± 0.985
AspThr: 3.764 ± 1.399
AspVal: 2.509 ± 0.958
AspTrp: 1.882 ± 1.977
AspTyr: 3.137 ± 0.724
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.528 ± 2.446
GluCys: 0.0 ± 0.0
GluAsp: 4.391 ± 1.199
GluGlu: 1.255 ± 0.511
GluPhe: 1.255 ± 1.071
GluGly: 4.391 ± 2.207
GluHis: 0.0 ± 0.0
GluIle: 5.019 ± 1.74
GluLys: 1.882 ± 0.816
GluLeu: 5.646 ± 1.973
GluMet: 1.882 ± 1.339
GluAsn: 0.0 ± 0.0
GluPro: 1.882 ± 0.914
GluGln: 3.137 ± 0.527
GluArg: 7.528 ± 1.891
GluSer: 1.882 ± 0.914
GluThr: 2.509 ± 1.226
GluVal: 4.391 ± 1.265
GluTrp: 1.255 ± 1.052
GluTyr: 1.882 ± 1.654
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.509 ± 0.958
PheCys: 0.0 ± 0.0
PheAsp: 2.509 ± 0.651
PheGlu: 1.882 ± 0.372
PhePhe: 1.882 ± 1.006
PheGly: 4.391 ± 2.167
PheHis: 1.255 ± 0.511
PheIle: 1.882 ± 0.904
PheLys: 1.882 ± 0.985
PheLeu: 3.137 ± 0.816
PheMet: 1.882 ± 1.396
PheAsn: 1.882 ± 0.553
PhePro: 0.627 ± 0.465
PheGln: 1.882 ± 0.985
PheArg: 3.137 ± 1.235
PheSer: 3.137 ± 1.629
PheThr: 0.0 ± 0.0
PheVal: 4.391 ± 1.567
PheTrp: 3.137 ± 1.482
PheTyr: 1.255 ± 0.583
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 7.528 ± 2.33
GlyCys: 0.0 ± 0.0
GlyAsp: 5.019 ± 1.536
GlyGlu: 5.646 ± 1.714
GlyPhe: 5.019 ± 2.153
GlyGly: 11.92 ± 5.041
GlyHis: 1.255 ± 0.583
GlyIle: 5.019 ± 2.084
GlyLys: 3.764 ± 2.099
GlyLeu: 1.255 ± 1.044
GlyMet: 0.627 ± 0.551
GlyAsn: 4.391 ± 1.761
GlyPro: 2.509 ± 0.583
GlyGln: 1.882 ± 0.904
GlyArg: 8.156 ± 1.667
GlySer: 2.509 ± 0.651
GlyThr: 2.509 ± 0.789
GlyVal: 4.391 ± 1.199
GlyTrp: 0.0 ± 0.0
GlyTyr: 5.019 ± 1.437
GlyXaa: 0.0 ± 0.0

His
HisAla: 3.137 ± 1.009
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 0.0 ± 0.0
HisPhe: 1.882 ± 1.273
HisGly: 0.627 ± 0.551
HisHis: 0.0 ± 0.0
HisIle: 1.882 ± 0.9
HisLys: 0.0 ± 0.0
HisLeu: 0.0 ± 0.0
HisMet: 0.0 ± 0.0
HisAsn: 1.255 ± 0.583
HisPro: 0.627 ± 0.465
HisGln: 1.255 ± 0.511
HisArg: 1.882 ± 0.372
HisSer: 1.255 ± 0.589
HisThr: 0.627 ± 0.539
HisVal: 1.882 ± 0.985
HisTrp: 1.882 ± 1.032
HisTyr: 0.627 ± 0.465
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.255 ± 0.788
IleCys: 0.627 ± 0.551
IleAsp: 1.882 ± 0.553
IleGlu: 2.509 ± 1.986
IlePhe: 1.255 ± 0.583
IleGly: 3.764 ± 0.841
IleHis: 1.255 ± 0.583
IleIle: 1.882 ± 0.9
IleLys: 3.137 ± 1.001
IleLeu: 2.509 ± 1.308
IleMet: 1.255 ± 0.993
IleAsn: 2.509 ± 1.308
IlePro: 3.764 ± 0.957
IleGln: 1.255 ± 1.103
IleArg: 3.137 ± 0.781
IleSer: 3.137 ± 1.009
IleThr: 1.255 ± 1.077
IleVal: 5.019 ± 2.115
IleTrp: 0.0 ± 0.0
IleTyr: 1.882 ± 0.9
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.137 ± 1.794
LysCys: 0.0 ± 0.0
LysAsp: 3.764 ± 1.292
LysGlu: 3.764 ± 2.188
LysPhe: 1.882 ± 0.553
LysGly: 1.255 ± 0.589
LysHis: 0.0 ± 0.0
LysIle: 1.882 ± 1.383
LysLys: 3.137 ± 1.31
LysLeu: 4.391 ± 1.172
LysMet: 0.0 ± 0.0
LysAsn: 1.882 ± 0.9
LysPro: 2.509 ± 1.474
LysGln: 4.391 ± 0.964
LysArg: 3.137 ± 1.685
LysSer: 3.137 ± 0.781
LysThr: 0.627 ± 0.465
LysVal: 1.882 ± 1.396
LysTrp: 0.627 ± 0.539
LysTyr: 4.391 ± 1.103
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.274 ± 1.551
LeuCys: 0.0 ± 0.0
LeuAsp: 10.665 ± 1.272
LeuGlu: 3.764 ± 0.841
LeuPhe: 0.627 ± 0.551
LeuGly: 5.019 ± 0.839
LeuHis: 0.0 ± 0.0
LeuIle: 0.627 ± 0.996
LeuLys: 3.764 ± 0.841
LeuLeu: 6.274 ± 1.217
LeuMet: 1.255 ± 0.93
LeuAsn: 5.019 ± 1.822
LeuPro: 3.137 ± 0.889
LeuGln: 3.137 ± 0.816
LeuArg: 6.901 ± 0.678
LeuSer: 4.391 ± 1.293
LeuThr: 5.019 ± 1.842
LeuVal: 3.764 ± 1.114
LeuTrp: 0.627 ± 0.465
LeuTyr: 1.882 ± 1.035
LeuXaa: 0.0 ± 0.0

Met
MetAla: 5.646 ± 0.757
MetCys: 0.0 ± 0.0
MetAsp: 3.764 ± 1.452
MetGlu: 1.882 ± 0.816
MetPhe: 0.627 ± 0.465
MetGly: 0.0 ± 0.0
MetHis: 0.0 ± 0.0
MetIle: 0.0 ± 0.0
MetLys: 0.627 ± 0.551
MetLeu: 0.627 ± 0.551
MetMet: 0.627 ± 0.724
MetAsn: 2.509 ± 0.651
MetPro: 3.137 ± 1.615
MetGln: 1.255 ± 0.788
MetArg: 3.137 ± 1.441
MetSer: 0.627 ± 0.551
MetThr: 1.255 ± 0.589
MetVal: 1.255 ± 0.993
MetTrp: 1.255 ± 0.589
MetTyr: 0.627 ± 0.539
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.255 ± 0.511
AsnCys: 0.0 ± 0.0
AsnAsp: 0.627 ± 0.465
AsnGlu: 4.391 ± 1.009
AsnPhe: 3.764 ± 0.805
AsnGly: 1.882 ± 0.979
AsnHis: 1.255 ± 0.583
AsnIle: 1.882 ± 1.199
AsnLys: 0.627 ± 0.539
AsnLeu: 2.509 ± 1.27
AsnMet: 1.882 ± 0.858
AsnAsn: 1.882 ± 1.616
AsnPro: 3.764 ± 0.864
AsnGln: 3.764 ± 0.957
AsnArg: 2.509 ± 0.718
AsnSer: 1.882 ± 0.9
AsnThr: 3.764 ± 2.41
AsnVal: 5.019 ± 0.614
AsnTrp: 0.627 ± 0.551
AsnTyr: 1.255 ± 0.589
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 8.156 ± 3.296
ProCys: 0.0 ± 0.0
ProAsp: 4.391 ± 1.01
ProGlu: 5.019 ± 2.274
ProPhe: 3.137 ± 1.665
ProGly: 1.882 ± 1.616
ProHis: 1.255 ± 0.583
ProIle: 3.764 ± 1.75
ProLys: 0.627 ± 0.996
ProLeu: 5.019 ± 1.328
ProMet: 1.255 ± 0.511
ProAsn: 1.255 ± 0.93
ProPro: 2.509 ± 1.275
ProGln: 1.255 ± 0.511
ProArg: 1.882 ± 0.914
ProSer: 2.509 ± 1.125
ProThr: 2.509 ± 0.651
ProVal: 8.156 ± 1.264
ProTrp: 0.627 ± 0.551
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.764 ± 1.006
GlnCys: 0.0 ± 0.0
GlnAsp: 1.255 ± 0.511
GlnGlu: 1.255 ± 1.103
GlnPhe: 0.627 ± 0.465
GlnGly: 6.901 ± 1.548
GlnHis: 1.255 ± 0.511
GlnIle: 1.882 ± 0.553
GlnLys: 2.509 ± 0.601
GlnLeu: 5.646 ± 0.952
GlnMet: 2.509 ± 1.575
GlnAsn: 0.627 ± 0.551
GlnPro: 1.882 ± 0.372
GlnGln: 2.509 ± 0.859
GlnArg: 2.509 ± 0.718
GlnSer: 1.255 ± 0.993
GlnThr: 2.509 ± 1.226
GlnVal: 2.509 ± 0.583
GlnTrp: 1.882 ± 1.035
GlnTyr: 1.255 ± 1.077
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 9.41 ± 2.899
ArgCys: 0.0 ± 0.0
ArgAsp: 3.137 ± 1.214
ArgGlu: 4.391 ± 1.671
ArgPhe: 2.509 ± 1.189
ArgGly: 4.391 ± 0.964
ArgHis: 0.627 ± 0.465
ArgIle: 3.764 ± 1.799
ArgLys: 6.274 ± 2.568
ArgLeu: 6.901 ± 0.678
ArgMet: 3.764 ± 1.739
ArgAsn: 6.274 ± 1.976
ArgPro: 3.137 ± 1.261
ArgGln: 4.391 ± 1.199
ArgArg: 6.901 ± 2.59
ArgSer: 2.509 ± 1.308
ArgThr: 2.509 ± 0.583
ArgVal: 3.137 ± 0.889
ArgTrp: 0.0 ± 0.0
ArgTyr: 5.019 ± 2.474
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.274 ± 2.634
SerCys: 0.0 ± 0.0
SerAsp: 5.019 ± 1.166
SerGlu: 0.627 ± 0.465
SerPhe: 0.627 ± 0.551
SerGly: 3.764 ± 1.613
SerHis: 1.255 ± 0.511
SerIle: 0.627 ± 0.539
SerLys: 1.882 ± 1.654
SerLeu: 5.019 ± 0.949
SerMet: 2.509 ± 0.747
SerAsn: 1.255 ± 0.511
SerPro: 6.274 ± 1.029
SerGln: 3.137 ± 1.744
SerArg: 1.882 ± 0.372
SerSer: 1.255 ± 1.077
SerThr: 3.764 ± 2.115
SerVal: 4.391 ± 1.837
SerTrp: 0.627 ± 0.539
SerTyr: 2.509 ± 0.958
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.391 ± 1.286
ThrCys: 0.627 ± 0.465
ThrAsp: 3.764 ± 0.86
ThrGlu: 1.255 ± 0.583
ThrPhe: 2.509 ± 1.115
ThrGly: 6.901 ± 3.252
ThrHis: 0.627 ± 0.539
ThrIle: 3.137 ± 1.686
ThrLys: 1.882 ± 0.985
ThrLeu: 1.882 ± 0.858
ThrMet: 0.0 ± 0.0
ThrAsn: 1.882 ± 0.816
ThrPro: 1.255 ± 0.93
ThrGln: 0.627 ± 0.539
ThrArg: 1.882 ± 0.979
ThrSer: 3.764 ± 2.115
ThrThr: 6.274 ± 1.294
ThrVal: 4.391 ± 1.837
ThrTrp: 1.882 ± 0.914
ThrTyr: 1.255 ± 0.583
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.764 ± 1.345
ValCys: 0.627 ± 0.724
ValAsp: 4.391 ± 1.68
ValGlu: 6.274 ± 1.342
ValPhe: 2.509 ± 0.958
ValGly: 5.646 ± 1.868
ValHis: 3.137 ± 0.889
ValIle: 2.509 ± 1.476
ValLys: 1.882 ± 0.985
ValLeu: 5.646 ± 1.132
ValMet: 2.509 ± 0.718
ValAsn: 1.882 ± 0.904
ValPro: 6.274 ± 1.65
ValGln: 0.0 ± 0.0
ValArg: 3.764 ± 0.805
ValSer: 6.274 ± 0.732
ValThr: 5.646 ± 2.353
ValVal: 5.019 ± 3.444
ValTrp: 1.882 ± 1.987
ValTyr: 0.0 ± 0.0
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.882 ± 0.979
TrpCys: 0.627 ± 0.551
TrpAsp: 2.509 ± 1.908
TrpGlu: 0.627 ± 0.996
TrpPhe: 0.627 ± 0.539
TrpGly: 0.627 ± 0.996
TrpHis: 1.882 ± 0.372
TrpIle: 0.0 ± 0.0
TrpLys: 5.019 ± 1.775
TrpLeu: 1.255 ± 0.993
TrpMet: 0.627 ± 0.471
TrpAsn: 1.255 ± 1.052
TrpPro: 0.627 ± 0.551
TrpGln: 0.627 ± 0.551
TrpArg: 1.882 ± 1.383
TrpSer: 0.627 ± 0.551
TrpThr: 0.627 ± 0.539
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 5.019 ± 1.002
TyrCys: 0.0 ± 0.0
TyrAsp: 1.255 ± 0.511
TyrGlu: 1.882 ± 0.816
TyrPhe: 0.627 ± 0.551
TyrGly: 1.882 ± 1.654
TyrHis: 0.0 ± 0.0
TyrIle: 1.255 ± 1.103
TyrLys: 1.882 ± 1.006
TyrLeu: 2.509 ± 1.167
TyrMet: 0.627 ± 0.539
TyrAsn: 3.137 ± 0.942
TyrPro: 0.0 ± 0.0
TyrGln: 3.137 ± 0.868
TyrArg: 3.137 ± 0.703
TyrSer: 3.764 ± 1.21
TyrThr: 1.882 ± 0.553
TyrVal: 0.627 ± 0.551
TyrTrp: 2.509 ± 1.659
TyrTyr: 1.255 ± 0.583
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 5 proteins (1595 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
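A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is an illustration under stated assumptions, not the Proteome-pI code: the function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are all assumptions; only the 100 resamples and the protein-level resampling come from the note above.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`.

    Each resample draws len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy proteome (one-letter codes), not the real sequences.
toy = ["MAAGK", "AGAAG", "KKGA"]
mean, err = bootstrap_error(toy, "AG")
print(f"AG: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps each protein's composition intact, which is why a proteome of only 5 proteins can show errors that are large relative to the values themselves.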

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski