Amino acid dipepetide frequency for Thiafora orthonairovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.096AlaAla: 3.096 ± 0.632
1.548AlaCys: 1.548 ± 0.643
1.376AlaAsp: 1.376 ± 0.315
2.064AlaGlu: 2.064 ± 0.121
3.44AlaPhe: 3.44 ± 1.079
2.58AlaGly: 2.58 ± 0.736
0.344AlaHis: 0.344 ± 0.181
2.58AlaIle: 2.58 ± 0.617
3.268AlaLys: 3.268 ± 0.328
3.612AlaLeu: 3.612 ± 0.192
1.376AlaMet: 1.376 ± 0.219
2.58AlaAsn: 2.58 ± 0.981
1.72AlaPro: 1.72 ± 1.446
1.892AlaGln: 1.892 ± 0.522
2.236AlaArg: 2.236 ± 0.739
3.096AlaSer: 3.096 ± 0.024
3.612AlaThr: 3.612 ± 1.099
3.784AlaVal: 3.784 ± 0.758
1.72AlaTrp: 1.72 ± 0.576
1.204AlaTyr: 1.204 ± 0.439
0.0AlaXaa: 0.0 ± 0.0
Cys
0.86CysAla: 0.86 ± 0.357
1.204CysCys: 1.204 ± 0.145
0.516CysAsp: 0.516 ± 0.364
1.032CysGlu: 1.032 ± 0.544
1.548CysPhe: 1.548 ± 0.012
1.032CysGly: 1.032 ± 0.727
1.032CysHis: 1.032 ± 0.445
1.892CysIle: 1.892 ± 0.49
2.064CysLys: 2.064 ± 1.168
1.72CysLeu: 1.72 ± 0.221
0.516CysMet: 0.516 ± 0.111
1.032CysAsn: 1.032 ± 0.56
1.72CysPro: 1.72 ± 1.021
0.688CysGln: 0.688 ± 0.195
3.096CysArg: 3.096 ± 0.665
1.548CysSer: 1.548 ± 0.806
2.064CysThr: 2.064 ± 1.747
1.204CysVal: 1.204 ± 0.371
0.516CysTrp: 0.516 ± 0.364
0.344CysTyr: 0.344 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
2.064AspAla: 2.064 ± 0.437
2.064AspCys: 2.064 ± 0.444
2.236AspAsp: 2.236 ± 0.368
4.472AspGlu: 4.472 ± 1.02
2.58AspPhe: 2.58 ± 0.413
3.612AspGly: 3.612 ± 0.661
0.688AspHis: 0.688 ± 0.138
3.784AspIle: 3.784 ± 0.499
2.924AspLys: 2.924 ± 0.471
5.16AspLeu: 5.16 ± 1.099
0.86AspMet: 0.86 ± 0.698
1.892AspAsn: 1.892 ± 0.652
2.064AspPro: 2.064 ± 0.268
1.032AspGln: 1.032 ± 0.39
2.236AspArg: 2.236 ± 0.473
4.644AspSer: 4.644 ± 1.399
3.612AspThr: 3.612 ± 0.575
3.44AspVal: 3.44 ± 0.468
0.688AspTrp: 0.688 ± 0.239
2.064AspTyr: 2.064 ± 0.121
0.0AspXaa: 0.0 ± 0.0
Glu
4.128GluAla: 4.128 ± 0.84
1.548GluCys: 1.548 ± 0.806
4.988GluAsp: 4.988 ± 0.484
5.504GluGlu: 5.504 ± 0.89
2.924GluPhe: 2.924 ± 0.521
3.612GluGly: 3.612 ± 0.284
1.376GluHis: 1.376 ± 0.39
2.58GluIle: 2.58 ± 0.836
5.16GluLys: 5.16 ± 0.712
6.02GluLeu: 6.02 ± 0.382
1.032GluMet: 1.032 ± 0.361
2.064GluAsn: 2.064 ± 0.202
1.548GluPro: 1.548 ± 0.012
2.236GluGln: 2.236 ± 0.861
3.44GluArg: 3.44 ± 0.733
4.3GluSer: 4.3 ± 1.456
4.128GluThr: 4.128 ± 0.565
5.676GluVal: 5.676 ± 0.074
0.516GluTrp: 0.516 ± 0.277
1.72GluTyr: 1.72 ± 0.519
0.0GluXaa: 0.0 ± 0.0
Phe
2.408PheAla: 2.408 ± 0.978
0.86PheCys: 0.86 ± 0.51
2.752PheAsp: 2.752 ± 0.331
3.096PheGlu: 3.096 ± 0.812
4.644PhePhe: 4.644 ± 1.34
2.408PheGly: 2.408 ± 0.646
0.86PheHis: 0.86 ± 0.801
3.44PheIle: 3.44 ± 0.499
4.128PheLys: 4.128 ± 1.457
4.816PheLeu: 4.816 ± 1.201
1.376PheMet: 1.376 ± 0.219
1.376PheAsn: 1.376 ± 0.459
2.064PhePro: 2.064 ± 0.467
1.032PheGln: 1.032 ± 0.101
0.688PheArg: 0.688 ± 0.363
4.3PheSer: 4.3 ± 1.519
2.236PheThr: 2.236 ± 0.368
1.376PheVal: 1.376 ± 0.351
0.172PheTrp: 0.172 ± 0.091
1.548PheTyr: 1.548 ± 0.547
0.0PheXaa: 0.0 ± 0.0
Gly
1.032GlyAla: 1.032 ± 0.39
1.204GlyCys: 1.204 ± 0.658
2.236GlyAsp: 2.236 ± 0.833
2.408GlyGlu: 2.408 ± 0.216
2.752GlyPhe: 2.752 ± 0.982
2.408GlyGly: 2.408 ± 1.707
0.516GlyHis: 0.516 ± 0.111
3.268GlyIle: 3.268 ± 0.104
3.784GlyLys: 3.784 ± 0.161
7.052GlyLeu: 7.052 ± 1.148
1.032GlyMet: 1.032 ± 0.445
3.44GlyAsn: 3.44 ± 0.198
2.236GlyPro: 2.236 ± 1.555
1.892GlyGln: 1.892 ± 0.355
3.096GlyArg: 3.096 ± 0.601
5.848GlySer: 5.848 ± 0.977
3.096GlyThr: 3.096 ± 1.093
3.44GlyVal: 3.44 ± 1.299
0.344GlyTrp: 0.344 ± 0.181
1.548GlyTyr: 1.548 ± 0.3
0.0GlyXaa: 0.0 ± 0.0
His
1.376HisAla: 1.376 ± 0.276
0.688HisCys: 0.688 ± 0.297
0.516HisAsp: 0.516 ± 0.364
1.204HisGlu: 1.204 ± 0.371
0.516HisPhe: 0.516 ± 0.111
1.204HisGly: 1.204 ± 1.073
1.032HisHis: 1.032 ± 0.445
1.376HisIle: 1.376 ± 0.52
0.516HisLys: 0.516 ± 0.277
3.44HisLeu: 3.44 ± 0.379
1.032HisMet: 1.032 ± 0.423
0.516HisAsn: 0.516 ± 0.111
1.72HisPro: 1.72 ± 0.847
0.688HisGln: 0.688 ± 0.448
1.376HisArg: 1.376 ± 0.086
1.548HisSer: 1.548 ± 0.806
0.516HisThr: 0.516 ± 0.596
1.548HisVal: 1.548 ± 0.338
0.344HisTrp: 0.344 ± 0.148
0.688HisTyr: 0.688 ± 0.138
0.0HisXaa: 0.0 ± 0.0
Ile
2.408IleAla: 2.408 ± 0.289
1.204IleCys: 1.204 ± 0.233
2.408IleAsp: 2.408 ± 0.289
4.644IleGlu: 4.644 ± 0.687
2.58IlePhe: 2.58 ± 0.232
1.892IleGly: 1.892 ± 0.187
1.204IleHis: 1.204 ± 0.343
4.3IleIle: 4.3 ± 0.646
6.02IleLys: 6.02 ± 0.794
6.364IleLeu: 6.364 ± 0.923
1.032IleMet: 1.032 ± 0.101
2.924IleAsn: 2.924 ± 0.822
2.752IlePro: 2.752 ± 1.04
2.924IleGln: 2.924 ± 0.304
4.128IleArg: 4.128 ± 0.243
5.504IleSer: 5.504 ± 0.563
3.956IleThr: 3.956 ± 0.553
4.3IleVal: 4.3 ± 0.257
0.516IleTrp: 0.516 ± 0.111
2.408IleTyr: 2.408 ± 0.289
0.0IleXaa: 0.0 ± 0.0
Lys
4.128LysAla: 4.128 ± 0.934
0.86LysCys: 0.86 ± 0.246
4.644LysAsp: 4.644 ± 0.562
7.224LysGlu: 7.224 ± 0.567
2.752LysPhe: 2.752 ± 0.142
3.268LysGly: 3.268 ± 1.023
2.064LysHis: 2.064 ± 0.121
6.02LysIle: 6.02 ± 0.958
5.676LysLys: 5.676 ± 0.658
9.976LysLeu: 9.976 ± 1.233
1.204LysMet: 1.204 ± 0.233
4.3LysAsn: 4.3 ± 0.14
2.58LysPro: 2.58 ± 0.555
2.408LysGln: 2.408 ± 0.353
3.268LysArg: 3.268 ± 0.642
5.676LysSer: 5.676 ± 0.962
4.644LysThr: 4.644 ± 0.3
4.128LysVal: 4.128 ± 1.015
1.72LysTrp: 1.72 ± 0.715
1.892LysTyr: 1.892 ± 0.726
0.0LysXaa: 0.0 ± 0.0
Leu
4.472LeuAla: 4.472 ± 0.736
2.752LeuCys: 2.752 ± 0.555
6.02LeuAsp: 6.02 ± 0.824
5.676LeuGlu: 5.676 ± 0.254
4.644LeuPhe: 4.644 ± 0.307
5.16LeuGly: 5.16 ± 1.054
2.58LeuHis: 2.58 ± 0.295
4.988LeuIle: 4.988 ± 0.477
9.116LeuLys: 9.116 ± 1.292
12.384LeuLeu: 12.384 ± 1.555
2.752LeuMet: 2.752 ± 0.438
5.16LeuAsn: 5.16 ± 1.382
3.44LeuPro: 3.44 ± 0.468
3.784LeuGln: 3.784 ± 0.161
3.44LeuArg: 3.44 ± 1.578
11.352LeuSer: 11.352 ± 1.398
9.46LeuThr: 9.46 ± 0.607
5.504LeuVal: 5.504 ± 0.49
0.516LeuTrp: 0.516 ± 0.364
2.924LeuTyr: 2.924 ± 0.61
0.0LeuXaa: 0.0 ± 0.0
Met
0.688MetAla: 0.688 ± 0.443
0.172MetCys: 0.172 ± 0.22
1.548MetAsp: 1.548 ± 0.316
0.86MetGlu: 0.86 ± 0.126
0.86MetPhe: 0.86 ± 0.205
1.548MetGly: 1.548 ± 0.527
0.688MetHis: 0.688 ± 0.195
1.548MetIle: 1.548 ± 0.012
2.064MetLys: 2.064 ± 0.572
3.268MetLeu: 3.268 ± 0.392
0.86MetMet: 0.86 ± 0.453
1.204MetAsn: 1.204 ± 0.439
0.172MetPro: 0.172 ± 0.22
0.688MetGln: 0.688 ± 0.138
0.344MetArg: 0.344 ± 0.221
1.204MetSer: 1.204 ± 0.712
0.688MetThr: 0.688 ± 0.363
1.72MetVal: 1.72 ± 0.747
0.344MetTrp: 0.344 ± 0.221
0.516MetTyr: 0.516 ± 0.111
0.0MetXaa: 0.0 ± 0.0
Asn
2.064AsnAla: 2.064 ± 0.467
1.892AsnCys: 1.892 ± 0.367
2.58AsnAsp: 2.58 ± 0.55
1.376AsnGlu: 1.376 ± 0.219
2.408AsnPhe: 2.408 ± 0.289
2.064AsnGly: 2.064 ± 0.66
1.204AsnHis: 1.204 ± 0.371
3.096AsnIle: 3.096 ± 0.303
4.3AsnLys: 4.3 ± 0.388
5.16AsnLeu: 5.16 ± 1.122
1.032AsnMet: 1.032 ± 0.362
1.548AsnAsn: 1.548 ± 0.816
1.892AsnPro: 1.892 ± 0.516
1.032AsnGln: 1.032 ± 0.39
2.58AsnArg: 2.58 ± 0.829
4.644AsnSer: 4.644 ± 1.184
2.408AsnThr: 2.408 ± 0.216
3.268AsnVal: 3.268 ± 0.392
1.032AsnTrp: 1.032 ± 0.101
2.064AsnTyr: 2.064 ± 0.121
0.0AsnXaa: 0.0 ± 0.0
Pro
1.548ProAla: 1.548 ± 0.635
0.516ProCys: 0.516 ± 0.364
2.924ProAsp: 2.924 ± 0.615
3.268ProGlu: 3.268 ± 0.436
0.172ProPhe: 0.172 ± 0.091
1.72ProGly: 1.72 ± 0.715
0.688ProHis: 0.688 ± 0.138
2.58ProIle: 2.58 ± 0.957
3.268ProLys: 3.268 ± 0.809
2.064ProLeu: 2.064 ± 0.585
0.86ProMet: 0.86 ± 0.453
1.72ProAsn: 1.72 ± 0.492
0.688ProPro: 0.688 ± 0.448
1.204ProGln: 1.204 ± 0.343
1.376ProArg: 1.376 ± 0.086
2.58ProSer: 2.58 ± 0.413
2.236ProThr: 2.236 ± 0.718
2.064ProVal: 2.064 ± 0.958
0.688ProTrp: 0.688 ± 0.297
1.548ProTyr: 1.548 ± 0.602
0.0ProXaa: 0.0 ± 0.0
Gln
2.064GlnAla: 2.064 ± 0.524
0.516GlnCys: 0.516 ± 0.111
1.032GlnAsp: 1.032 ± 0.423
2.752GlnGlu: 2.752 ± 0.142
1.204GlnPhe: 1.204 ± 0.442
2.236GlnGly: 2.236 ± 0.185
1.032GlnHis: 1.032 ± 0.554
2.236GlnIle: 2.236 ± 0.718
1.72GlnLys: 1.72 ± 0.335
3.956GlnLeu: 3.956 ± 0.348
1.204GlnMet: 1.204 ± 0.145
2.236GlnAsn: 2.236 ± 0.976
0.516GlnPro: 0.516 ± 0.212
2.064GlnGln: 2.064 ± 0.524
1.72GlnArg: 1.72 ± 0.586
1.892GlnSer: 1.892 ± 1.193
2.924GlnThr: 2.924 ± 0.874
2.236GlnVal: 2.236 ± 1.055
0.0GlnTrp: 0.0 ± 0.0
1.032GlnTyr: 1.032 ± 0.101
0.0GlnXaa: 0.0 ± 0.0
Arg
2.752ArgAla: 2.752 ± 0.461
1.204ArgCys: 1.204 ± 0.176
2.752ArgAsp: 2.752 ± 1.219
1.548ArgGlu: 1.548 ± 0.303
1.892ArgPhe: 1.892 ± 0.726
2.064ArgGly: 2.064 ± 1.421
0.86ArgHis: 0.86 ± 0.126
2.924ArgIle: 2.924 ± 0.466
4.472ArgLys: 4.472 ± 0.081
5.332ArgLeu: 5.332 ± 1.19
1.376ArgMet: 1.376 ± 0.622
2.408ArgAsn: 2.408 ± 1.041
1.032ArgPro: 1.032 ± 0.267
1.892ArgGln: 1.892 ± 0.153
2.58ArgArg: 2.58 ± 0.36
4.3ArgSer: 4.3 ± 1.182
2.408ArgThr: 2.408 ± 0.746
1.72ArgVal: 1.72 ± 0.411
0.344ArgTrp: 0.344 ± 0.221
1.72ArgTyr: 1.72 ± 0.636
0.0ArgXaa: 0.0 ± 0.0
Ser
3.268SerAla: 3.268 ± 0.809
2.58SerCys: 2.58 ± 0.738
4.816SerAsp: 4.816 ± 1.159
5.676SerGlu: 5.676 ± 0.358
3.612SerPhe: 3.612 ± 0.434
4.472SerGly: 4.472 ± 0.368
2.752SerHis: 2.752 ± 0.661
6.708SerIle: 6.708 ± 1.969
5.848SerLys: 5.848 ± 0.397
8.6SerLeu: 8.6 ± 1.827
0.86SerMet: 0.86 ± 0.246
3.784SerAsn: 3.784 ± 0.306
1.72SerPro: 1.72 ± 0.492
2.408SerGln: 2.408 ± 0.877
4.472SerArg: 4.472 ± 0.736
9.46SerSer: 9.46 ± 0.674
5.848SerThr: 5.848 ± 2.003
6.88SerVal: 6.88 ± 0.5
1.548SerTrp: 1.548 ± 1.094
2.236SerTyr: 2.236 ± 0.368
0.0SerXaa: 0.0 ± 0.0
Thr
3.784ThrAla: 3.784 ± 0.306
1.204ThrCys: 1.204 ± 1.24
3.612ThrAsp: 3.612 ± 0.26
4.988ThrGlu: 4.988 ± 1.039
2.064ThrPhe: 2.064 ± 0.534
4.472ThrGly: 4.472 ± 1.47
1.204ThrHis: 1.204 ± 0.176
4.472ThrIle: 4.472 ± 0.621
5.504ThrLys: 5.504 ± 0.417
4.816ThrLeu: 4.816 ± 0.578
0.344ThrMet: 0.344 ± 0.181
3.096ThrAsn: 3.096 ± 0.349
2.752ThrPro: 2.752 ± 0.172
2.064ThrGln: 2.064 ± 0.585
1.376ThrArg: 1.376 ± 0.086
5.848ThrSer: 5.848 ± 1.069
4.472ThrThr: 4.472 ± 1.681
5.848ThrVal: 5.848 ± 0.761
1.376ThrTrp: 1.376 ± 0.594
1.204ThrTyr: 1.204 ± 0.371
0.0ThrXaa: 0.0 ± 0.0
Val
3.44ValAla: 3.44 ± 1.381
1.892ValCys: 1.892 ± 0.685
3.612ValAsp: 3.612 ± 0.284
4.472ValGlu: 4.472 ± 0.582
3.096ValPhe: 3.096 ± 0.349
3.956ValGly: 3.956 ± 0.755
1.376ValHis: 1.376 ± 0.086
2.924ValIle: 2.924 ± 0.288
6.02ValLys: 6.02 ± 0.483
7.396ValLeu: 7.396 ± 0.728
1.032ValMet: 1.032 ± 0.756
3.956ValAsn: 3.956 ± 1.342
1.548ValPro: 1.548 ± 0.969
2.752ValGln: 2.752 ± 1.481
2.064ValArg: 2.064 ± 0.567
5.16ValSer: 5.16 ± 0.437
3.612ValThr: 3.612 ± 1.337
5.504ValVal: 5.504 ± 1.452
0.516ValTrp: 0.516 ± 0.111
1.204ValTyr: 1.204 ± 0.145
0.0ValXaa: 0.0 ± 0.0
Trp
0.344TrpAla: 0.344 ± 0.148
0.86TrpCys: 0.86 ± 0.649
0.86TrpAsp: 0.86 ± 0.246
0.86TrpGlu: 0.86 ± 0.126
0.516TrpPhe: 0.516 ± 0.277
0.86TrpGly: 0.86 ± 0.357
0.172TrpHis: 0.172 ± 0.264
0.86TrpIle: 0.86 ± 0.453
1.204TrpLys: 1.204 ± 0.371
1.548TrpLeu: 1.548 ± 0.3
0.344TrpMet: 0.344 ± 0.363
0.344TrpAsn: 0.344 ± 0.181
0.86TrpPro: 0.86 ± 0.51
0.516TrpGln: 0.516 ± 0.596
0.688TrpArg: 0.688 ± 0.52
1.032TrpSer: 1.032 ± 0.267
1.032TrpThr: 1.032 ± 0.56
0.344TrpVal: 0.344 ± 0.148
0.344TrpTrp: 0.344 ± 0.148
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.204TyrAla: 1.204 ± 0.371
0.86TyrCys: 0.86 ± 0.246
0.688TyrAsp: 0.688 ± 0.138
1.548TyrGlu: 1.548 ± 0.604
1.376TyrPhe: 1.376 ± 0.594
2.064TyrGly: 2.064 ± 0.414
0.172TyrHis: 0.172 ± 0.091
1.892TyrIle: 1.892 ± 0.187
1.376TyrLys: 1.376 ± 0.351
3.44TyrLeu: 3.44 ± 0.821
0.688TyrMet: 0.688 ± 0.343
2.064TyrAsn: 2.064 ± 0.572
0.688TyrPro: 0.688 ± 0.138
1.376TyrGln: 1.376 ± 0.715
1.376TyrArg: 1.376 ± 0.52
3.612TyrSer: 3.612 ± 0.26
1.548TyrThr: 1.548 ± 0.338
1.548TyrVal: 1.548 ± 0.012
0.344TyrTrp: 0.344 ± 0.221
1.032TyrTyr: 1.032 ± 0.101
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5815 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski