Amino acid dipepetide frequency for Tuhoko virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.058AlaAla: 5.058 ± 1.827
1.012AlaCys: 1.012 ± 0.512
2.428AlaAsp: 2.428 ± 0.615
2.832AlaGlu: 2.832 ± 0.909
2.225AlaPhe: 2.225 ± 0.78
4.046AlaGly: 4.046 ± 0.919
2.023AlaHis: 2.023 ± 0.827
5.058AlaIle: 5.058 ± 1.251
2.832AlaLys: 2.832 ± 0.792
7.688AlaLeu: 7.688 ± 2.041
1.012AlaMet: 1.012 ± 1.045
3.035AlaAsn: 3.035 ± 0.889
4.855AlaPro: 4.855 ± 1.316
3.439AlaGln: 3.439 ± 1.757
3.844AlaArg: 3.844 ± 2.322
4.046AlaSer: 4.046 ± 1.462
4.046AlaThr: 4.046 ± 1.237
4.653AlaVal: 4.653 ± 1.187
0.405AlaTrp: 0.405 ± 0.252
2.023AlaTyr: 2.023 ± 0.63
0.0AlaXaa: 0.0 ± 0.0
Cys
0.809CysAla: 0.809 ± 0.364
0.202CysCys: 0.202 ± 0.123
0.809CysAsp: 0.809 ± 0.367
0.607CysGlu: 0.607 ± 0.289
0.607CysPhe: 0.607 ± 0.253
0.809CysGly: 0.809 ± 0.39
0.202CysHis: 0.202 ± 0.257
2.225CysIle: 2.225 ± 0.253
0.607CysLys: 0.607 ± 0.498
1.416CysLeu: 1.416 ± 0.605
0.809CysMet: 0.809 ± 0.298
1.214CysAsn: 1.214 ± 0.55
1.821CysPro: 1.821 ± 0.906
0.405CysGln: 0.405 ± 0.246
1.214CysArg: 1.214 ± 0.521
1.214CysSer: 1.214 ± 0.964
2.023CysThr: 2.023 ± 0.462
1.618CysVal: 1.618 ± 1.05
0.0CysTrp: 0.0 ± 0.0
0.607CysTyr: 0.607 ± 0.369
0.0CysXaa: 0.0 ± 0.0
Asp
2.832AspAla: 2.832 ± 0.712
0.809AspCys: 0.809 ± 0.539
2.023AspAsp: 2.023 ± 0.287
3.844AspGlu: 3.844 ± 1.164
0.809AspPhe: 0.809 ± 0.343
1.821AspGly: 1.821 ± 0.502
2.023AspHis: 2.023 ± 0.847
3.035AspIle: 3.035 ± 0.69
3.642AspLys: 3.642 ± 0.757
7.89AspLeu: 7.89 ± 1.661
1.214AspMet: 1.214 ± 0.596
3.035AspAsn: 3.035 ± 0.354
3.439AspPro: 3.439 ± 0.812
2.63AspGln: 2.63 ± 0.227
1.618AspArg: 1.618 ± 0.388
5.058AspSer: 5.058 ± 0.672
2.63AspThr: 2.63 ± 0.886
1.618AspVal: 1.618 ± 0.8
0.202AspTrp: 0.202 ± 0.123
2.023AspTyr: 2.023 ± 0.853
0.0AspXaa: 0.0 ± 0.0
Glu
2.63GluAla: 2.63 ± 1.444
1.012GluCys: 1.012 ± 0.467
2.832GluAsp: 2.832 ± 0.933
5.26GluGlu: 5.26 ± 2.91
2.63GluPhe: 2.63 ± 0.691
2.428GluGly: 2.428 ± 0.399
0.809GluHis: 0.809 ± 0.26
5.665GluIle: 5.665 ± 1.265
2.225GluLys: 2.225 ± 0.358
6.474GluLeu: 6.474 ± 1.614
1.012GluMet: 1.012 ± 0.262
2.023GluAsn: 2.023 ± 0.391
1.821GluPro: 1.821 ± 1.116
2.428GluGln: 2.428 ± 0.965
2.832GluArg: 2.832 ± 0.448
3.439GluSer: 3.439 ± 0.817
2.428GluThr: 2.428 ± 0.476
1.214GluVal: 1.214 ± 0.747
0.607GluTrp: 0.607 ± 0.345
0.809GluTyr: 0.809 ± 0.304
0.0GluXaa: 0.0 ± 0.0
Phe
2.428PheAla: 2.428 ± 0.832
1.214PheCys: 1.214 ± 0.284
2.023PheAsp: 2.023 ± 0.668
1.416PheGlu: 1.416 ± 0.649
1.214PhePhe: 1.214 ± 0.353
1.012PheGly: 1.012 ± 0.485
1.012PheHis: 1.012 ± 0.464
2.63PheIle: 2.63 ± 0.784
1.821PheLys: 1.821 ± 0.7
2.63PheLeu: 2.63 ± 0.762
1.416PheMet: 1.416 ± 0.656
2.023PheAsn: 2.023 ± 0.492
1.618PhePro: 1.618 ± 0.628
1.214PheGln: 1.214 ± 0.284
1.012PheArg: 1.012 ± 0.454
2.225PheSer: 2.225 ± 0.814
3.035PheThr: 3.035 ± 0.55
1.416PheVal: 1.416 ± 0.343
0.0PheTrp: 0.0 ± 0.0
1.012PheTyr: 1.012 ± 0.467
0.0PheXaa: 0.0 ± 0.0
Gly
3.844GlyAla: 3.844 ± 1.479
0.405GlyCys: 0.405 ± 0.31
3.642GlyAsp: 3.642 ± 0.833
1.821GlyGlu: 1.821 ± 0.308
1.214GlyPhe: 1.214 ± 0.251
1.214GlyGly: 1.214 ± 0.687
1.012GlyHis: 1.012 ± 0.315
5.26GlyIle: 5.26 ± 1.147
2.225GlyLys: 2.225 ± 1.158
4.451GlyLeu: 4.451 ± 1.279
0.809GlyMet: 0.809 ± 0.518
2.832GlyAsn: 2.832 ± 0.671
1.214GlyPro: 1.214 ± 0.973
1.618GlyGln: 1.618 ± 0.461
2.63GlyArg: 2.63 ± 0.587
3.642GlySer: 3.642 ± 1.199
2.428GlyThr: 2.428 ± 0.858
2.832GlyVal: 2.832 ± 0.893
0.202GlyTrp: 0.202 ± 0.257
1.821GlyTyr: 1.821 ± 0.786
0.0GlyXaa: 0.0 ± 0.0
His
1.821HisAla: 1.821 ± 0.238
0.405HisCys: 0.405 ± 0.214
1.012HisAsp: 1.012 ± 0.43
0.405HisGlu: 0.405 ± 0.214
1.012HisPhe: 1.012 ± 0.379
1.416HisGly: 1.416 ± 0.704
1.012HisHis: 1.012 ± 0.303
1.618HisIle: 1.618 ± 0.396
1.012HisLys: 1.012 ± 0.254
2.832HisLeu: 2.832 ± 1.325
0.202HisMet: 0.202 ± 0.123
1.416HisAsn: 1.416 ± 0.697
0.809HisPro: 0.809 ± 0.364
0.607HisGln: 0.607 ± 0.272
1.214HisArg: 1.214 ± 0.428
1.618HisSer: 1.618 ± 0.59
1.618HisThr: 1.618 ± 0.44
0.809HisVal: 0.809 ± 0.351
0.202HisTrp: 0.202 ± 0.123
0.202HisTyr: 0.202 ± 0.123
0.0HisXaa: 0.0 ± 0.0
Ile
4.855IleAla: 4.855 ± 1.364
1.416IleCys: 1.416 ± 0.408
2.63IleAsp: 2.63 ± 1.022
5.665IleGlu: 5.665 ± 1.332
2.63IlePhe: 2.63 ± 0.616
3.237IleGly: 3.237 ± 1.055
0.607IleHis: 0.607 ± 0.369
7.89IleIle: 7.89 ± 1.758
4.046IleLys: 4.046 ± 1.653
7.081IleLeu: 7.081 ± 1.785
2.428IleMet: 2.428 ± 0.889
4.653IleAsn: 4.653 ± 0.982
4.653IlePro: 4.653 ± 1.149
4.451IleGln: 4.451 ± 1.407
4.855IleArg: 4.855 ± 0.724
7.283IleSer: 7.283 ± 1.294
6.069IleThr: 6.069 ± 0.855
4.046IleVal: 4.046 ± 1.156
1.618IleTrp: 1.618 ± 0.397
2.023IleTyr: 2.023 ± 0.793
0.0IleXaa: 0.0 ± 0.0
Lys
4.046LysAla: 4.046 ± 2.088
1.012LysCys: 1.012 ± 0.645
1.214LysAsp: 1.214 ± 0.38
3.237LysGlu: 3.237 ± 0.755
2.023LysPhe: 2.023 ± 0.583
2.63LysGly: 2.63 ± 0.974
1.012LysHis: 1.012 ± 0.464
3.237LysIle: 3.237 ± 1.252
3.237LysLys: 3.237 ± 0.84
7.485LysLeu: 7.485 ± 0.69
1.821LysMet: 1.821 ± 0.351
2.428LysAsn: 2.428 ± 0.478
1.821LysPro: 1.821 ± 0.504
1.618LysGln: 1.618 ± 0.875
2.63LysArg: 2.63 ± 0.521
5.867LysSer: 5.867 ± 1.188
1.618LysThr: 1.618 ± 0.796
2.63LysVal: 2.63 ± 1.182
0.405LysTrp: 0.405 ± 0.353
2.225LysTyr: 2.225 ± 0.841
0.0LysXaa: 0.0 ± 0.0
Leu
7.081LeuAla: 7.081 ± 1.918
2.023LeuCys: 2.023 ± 0.44
6.676LeuAsp: 6.676 ± 1.113
4.248LeuGlu: 4.248 ± 0.764
3.035LeuPhe: 3.035 ± 0.739
5.058LeuGly: 5.058 ± 0.811
2.225LeuHis: 2.225 ± 0.47
8.295LeuIle: 8.295 ± 1.403
6.271LeuLys: 6.271 ± 1.282
9.711LeuLeu: 9.711 ± 1.33
2.63LeuMet: 2.63 ± 0.625
5.462LeuAsn: 5.462 ± 1.078
6.271LeuPro: 6.271 ± 0.894
5.867LeuGln: 5.867 ± 1.158
4.855LeuArg: 4.855 ± 1.153
9.711LeuSer: 9.711 ± 1.692
8.497LeuThr: 8.497 ± 2.094
3.237LeuVal: 3.237 ± 0.824
0.809LeuTrp: 0.809 ± 0.364
3.642LeuTyr: 3.642 ± 0.985
0.0LeuXaa: 0.0 ± 0.0
Met
2.023MetAla: 2.023 ± 1.151
0.607MetCys: 0.607 ± 0.29
1.416MetAsp: 1.416 ± 0.932
1.618MetGlu: 1.618 ± 0.5
0.607MetPhe: 0.607 ± 0.285
0.607MetGly: 0.607 ± 0.253
0.607MetHis: 0.607 ± 0.285
1.821MetIle: 1.821 ± 0.925
1.012MetLys: 1.012 ± 0.494
2.63MetLeu: 2.63 ± 0.219
0.405MetMet: 0.405 ± 0.542
1.012MetAsn: 1.012 ± 0.437
0.809MetPro: 0.809 ± 0.504
0.809MetGln: 0.809 ± 0.661
1.821MetArg: 1.821 ± 0.549
2.023MetSer: 2.023 ± 0.859
2.023MetThr: 2.023 ± 0.384
1.416MetVal: 1.416 ± 0.368
0.405MetTrp: 0.405 ± 0.241
0.809MetTyr: 0.809 ± 0.773
0.0MetXaa: 0.0 ± 0.0
Asn
3.844AsnAla: 3.844 ± 1.082
0.607AsnCys: 0.607 ± 0.54
2.225AsnAsp: 2.225 ± 0.322
1.618AsnGlu: 1.618 ± 0.363
1.618AsnPhe: 1.618 ± 0.663
2.023AsnGly: 2.023 ± 0.718
1.618AsnHis: 1.618 ± 0.405
3.642AsnIle: 3.642 ± 0.68
2.63AsnLys: 2.63 ± 0.574
5.462AsnLeu: 5.462 ± 1.046
1.416AsnMet: 1.416 ± 0.838
2.023AsnAsn: 2.023 ± 0.528
3.844AsnPro: 3.844 ± 1.713
3.844AsnGln: 3.844 ± 0.674
1.416AsnArg: 1.416 ± 0.711
4.451AsnSer: 4.451 ± 0.621
3.439AsnThr: 3.439 ± 1.196
2.63AsnVal: 2.63 ± 1.022
1.214AsnTrp: 1.214 ± 0.576
2.225AsnTyr: 2.225 ± 1.04
0.0AsnXaa: 0.0 ± 0.0
Pro
3.237ProAla: 3.237 ± 0.757
0.405ProCys: 0.405 ± 0.399
3.844ProAsp: 3.844 ± 0.707
2.428ProGlu: 2.428 ± 0.703
2.63ProPhe: 2.63 ± 0.892
1.416ProGly: 1.416 ± 0.407
0.202ProHis: 0.202 ± 0.123
3.642ProIle: 3.642 ± 1.249
3.439ProLys: 3.439 ± 0.703
5.058ProLeu: 5.058 ± 0.55
0.809ProMet: 0.809 ± 0.249
3.035ProAsn: 3.035 ± 0.808
4.046ProPro: 4.046 ± 1.761
2.023ProGln: 2.023 ± 1.574
2.832ProArg: 2.832 ± 0.993
6.474ProSer: 6.474 ± 1.798
2.832ProThr: 2.832 ± 0.319
3.439ProVal: 3.439 ± 1.306
0.405ProTrp: 0.405 ± 0.512
2.832ProTyr: 2.832 ± 0.551
0.0ProXaa: 0.0 ± 0.0
Gln
3.642GlnAla: 3.642 ± 0.467
0.809GlnCys: 0.809 ± 0.482
2.023GlnAsp: 2.023 ± 0.699
2.832GlnGlu: 2.832 ± 0.848
1.618GlnPhe: 1.618 ± 0.396
2.63GlnGly: 2.63 ± 1.073
1.214GlnHis: 1.214 ± 0.57
4.451GlnIle: 4.451 ± 1.77
1.416GlnLys: 1.416 ± 0.532
7.283GlnLeu: 7.283 ± 1.468
1.012GlnMet: 1.012 ± 0.49
2.63GlnAsn: 2.63 ± 0.479
3.035GlnPro: 3.035 ± 1.701
2.63GlnGln: 2.63 ± 1.127
1.618GlnArg: 1.618 ± 0.439
3.844GlnSer: 3.844 ± 1.556
3.035GlnThr: 3.035 ± 0.904
3.237GlnVal: 3.237 ± 0.489
0.405GlnTrp: 0.405 ± 0.241
0.202GlnTyr: 0.202 ± 0.123
0.0GlnXaa: 0.0 ± 0.0
Arg
2.63ArgAla: 2.63 ± 0.798
0.607ArgCys: 0.607 ± 0.56
1.821ArgAsp: 1.821 ± 0.613
1.618ArgGlu: 1.618 ± 0.668
1.821ArgPhe: 1.821 ± 0.721
2.63ArgGly: 2.63 ± 0.57
0.809ArgHis: 0.809 ± 0.507
4.046ArgIle: 4.046 ± 0.726
3.237ArgLys: 3.237 ± 0.776
5.058ArgLeu: 5.058 ± 1.507
1.214ArgMet: 1.214 ± 0.724
2.63ArgAsn: 2.63 ± 0.325
2.023ArgPro: 2.023 ± 0.531
1.821ArgGln: 1.821 ± 0.311
2.832ArgArg: 2.832 ± 0.827
3.035ArgSer: 3.035 ± 1.284
1.618ArgThr: 1.618 ± 0.621
3.844ArgVal: 3.844 ± 0.711
0.405ArgTrp: 0.405 ± 0.252
1.214ArgTyr: 1.214 ± 0.371
0.0ArgXaa: 0.0 ± 0.0
Ser
4.855SerAla: 4.855 ± 1.109
3.237SerCys: 3.237 ± 0.966
7.485SerAsp: 7.485 ± 1.005
4.451SerGlu: 4.451 ± 0.516
1.416SerPhe: 1.416 ± 0.646
3.844SerGly: 3.844 ± 1.197
0.607SerHis: 0.607 ± 0.249
6.069SerIle: 6.069 ± 1.304
4.248SerLys: 4.248 ± 0.899
7.283SerLeu: 7.283 ± 0.76
2.023SerMet: 2.023 ± 0.602
5.058SerAsn: 5.058 ± 0.807
3.439SerPro: 3.439 ± 0.482
5.26SerGln: 5.26 ± 0.92
2.832SerArg: 2.832 ± 0.627
8.901SerSer: 8.901 ± 1.737
5.26SerThr: 5.26 ± 1.095
5.462SerVal: 5.462 ± 0.503
0.607SerTrp: 0.607 ± 0.369
4.248SerTyr: 4.248 ± 0.93
0.0SerXaa: 0.0 ± 0.0
Thr
4.248ThrAla: 4.248 ± 2.152
1.214ThrCys: 1.214 ± 0.77
3.642ThrAsp: 3.642 ± 0.535
3.439ThrGlu: 3.439 ± 0.556
1.618ThrPhe: 1.618 ± 0.5
3.642ThrGly: 3.642 ± 0.743
0.607ThrHis: 0.607 ± 0.283
5.665ThrIle: 5.665 ± 1.126
2.832ThrLys: 2.832 ± 0.88
5.26ThrLeu: 5.26 ± 0.967
1.416ThrMet: 1.416 ± 0.368
2.832ThrAsn: 2.832 ± 1.13
4.248ThrPro: 4.248 ± 0.809
3.642ThrGln: 3.642 ± 0.54
2.225ThrArg: 2.225 ± 0.733
5.867ThrSer: 5.867 ± 1.265
4.451ThrThr: 4.451 ± 1.093
3.642ThrVal: 3.642 ± 2.357
1.012ThrTrp: 1.012 ± 0.514
1.416ThrTyr: 1.416 ± 0.746
0.0ThrXaa: 0.0 ± 0.0
Val
3.439ValAla: 3.439 ± 0.735
1.416ValCys: 1.416 ± 0.343
2.428ValAsp: 2.428 ± 0.497
1.618ValGlu: 1.618 ± 0.413
1.821ValPhe: 1.821 ± 0.448
2.428ValGly: 2.428 ± 0.655
2.428ValHis: 2.428 ± 0.544
4.046ValIle: 4.046 ± 1.605
3.035ValLys: 3.035 ± 0.88
5.26ValLeu: 5.26 ± 1.124
1.618ValMet: 1.618 ± 0.414
1.821ValAsn: 1.821 ± 0.516
3.439ValPro: 3.439 ± 1.096
2.225ValGln: 2.225 ± 0.515
1.618ValArg: 1.618 ± 0.621
3.844ValSer: 3.844 ± 1.128
3.844ValThr: 3.844 ± 1.431
3.035ValVal: 3.035 ± 1.44
1.214ValTrp: 1.214 ± 0.371
2.023ValTyr: 2.023 ± 0.828
0.0ValXaa: 0.0 ± 0.0
Trp
1.012TrpAla: 1.012 ± 0.241
0.405TrpCys: 0.405 ± 0.376
0.405TrpAsp: 0.405 ± 0.246
0.202TrpGlu: 0.202 ± 0.256
0.202TrpPhe: 0.202 ± 0.123
0.607TrpGly: 0.607 ± 0.283
0.202TrpHis: 0.202 ± 0.123
1.618TrpIle: 1.618 ± 0.529
0.809TrpLys: 0.809 ± 0.282
1.012TrpLeu: 1.012 ± 0.241
0.607TrpMet: 0.607 ± 0.345
0.405TrpAsn: 0.405 ± 0.351
0.405TrpPro: 0.405 ± 0.246
0.0TrpGln: 0.0 ± 0.0
0.405TrpArg: 0.405 ± 0.241
0.809TrpSer: 0.809 ± 0.335
0.809TrpThr: 0.809 ± 0.364
0.202TrpVal: 0.202 ± 0.123
0.0TrpTrp: 0.0 ± 0.0
0.202TrpTyr: 0.202 ± 0.123
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.225TyrAla: 2.225 ± 0.617
0.405TyrCys: 0.405 ± 0.241
1.821TyrAsp: 1.821 ± 0.477
1.416TyrGlu: 1.416 ± 0.598
1.618TyrPhe: 1.618 ± 0.46
1.618TyrGly: 1.618 ± 0.71
1.012TyrHis: 1.012 ± 0.51
2.225TyrIle: 2.225 ± 0.517
1.618TyrLys: 1.618 ± 0.243
3.642TyrLeu: 3.642 ± 1.437
0.405TyrMet: 0.405 ± 0.241
2.225TyrAsn: 2.225 ± 0.849
1.416TyrPro: 1.416 ± 0.407
3.035TyrGln: 3.035 ± 0.591
0.607TyrArg: 0.607 ± 0.29
3.237TyrSer: 3.237 ± 0.688
1.214TyrThr: 1.214 ± 0.579
1.618TyrVal: 1.618 ± 0.44
0.202TyrTrp: 0.202 ± 0.123
3.035TyrTyr: 3.035 ± 0.996
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4944 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski