Amino acid dipeptide frequency for Klamath virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (e.g. 2.56‰ = 0.00256 = 0.256%).
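As a rough illustration of how such per mille frequencies could be computed, here is a minimal Python sketch. It is not the database's actual code: the function name `dipeptide_per_mille`, the use of one-letter residue codes (with `X` standing for Xaa), and the choice to count overlapping adjacent pairs within each protein separately are all assumptions made for the example. The page does not state the exact counting convention.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (assumption: input uses one-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Expected frequency if all 400 standard dipeptides were equally likely, in per mille.
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs (20 standard + X)."""
    counts = Counter()
    for seq in proteins:
        # Map any non-standard or unknown residue to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Count overlapping adjacent pairs; pairs never span two proteins here.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


if __name__ == "__main__":
    # Toy sequences, not Klamath virus data.
    freqs = dipeptide_per_mille(["AAAC", "ACB"])
    for pair, value in freqs.items():
        if value > UNIFORM_EXPECTATION:
            print(pair, round(value, 3), "overrepresented")
```

Comparing each value against `UNIFORM_EXPECTATION` mirrors the red/blue marking described above, under the simplifying assumption that the expectation is the uniform 2.5‰.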

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.56 ± 1.267
AlaCys: 0.768 ± 0.323
AlaAsp: 4.352 ± 1.695
AlaGlu: 2.56 ± 1.163
AlaPhe: 1.28 ± 0.579
AlaGly: 3.84 ± 1.308
AlaHis: 1.28 ± 0.498
AlaIle: 4.352 ± 0.772
AlaLys: 2.304 ± 0.659
AlaLeu: 4.096 ± 1.161
AlaMet: 0.512 ± 0.313
AlaAsn: 1.536 ± 0.653
AlaPro: 2.304 ± 1.666
AlaGln: 2.304 ± 0.584
AlaArg: 4.608 ± 1.224
AlaSer: 2.816 ± 1.569
AlaThr: 2.304 ± 0.779
AlaVal: 3.328 ± 0.593
AlaTrp: 0.768 ± 0.781
AlaTyr: 1.536 ± 0.665
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.28 ± 0.345
CysCys: 0.0 ± 0.0
CysAsp: 0.512 ± 0.284
CysGlu: 0.512 ± 0.584
CysPhe: 1.024 ± 0.417
CysGly: 0.512 ± 0.284
CysHis: 0.768 ± 0.323
CysIle: 0.512 ± 0.295
CysLys: 1.28 ± 1.157
CysLeu: 1.28 ± 0.345
CysMet: 0.768 ± 0.934
CysAsn: 0.768 ± 0.418
CysPro: 0.768 ± 0.459
CysGln: 0.256 ± 0.147
CysArg: 1.536 ± 1.107
CysSer: 0.768 ± 0.326
CysThr: 1.28 ± 0.407
CysVal: 1.024 ± 0.627
CysTrp: 0.768 ± 0.323
CysTyr: 1.024 ± 0.589
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.072 ± 1.164
AspCys: 0.768 ± 0.323
AspAsp: 2.56 ± 0.668
AspGlu: 3.328 ± 0.927
AspPhe: 2.304 ± 1.084
AspGly: 1.792 ± 0.829
AspHis: 1.024 ± 0.59
AspIle: 1.536 ± 1.049
AspLys: 2.56 ± 1.588
AspLeu: 8.449 ± 1.144
AspMet: 1.536 ± 0.738
AspAsn: 3.072 ± 0.584
AspPro: 3.84 ± 0.828
AspGln: 2.304 ± 0.674
AspArg: 2.816 ± 0.695
AspSer: 4.352 ± 1.38
AspThr: 3.072 ± 0.887
AspVal: 1.792 ± 0.86
AspTrp: 1.536 ± 0.386
AspTyr: 2.56 ± 0.819
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.072 ± 1.236
GluCys: 0.512 ± 0.493
GluAsp: 3.328 ± 0.686
GluGlu: 2.304 ± 1.142
GluPhe: 2.56 ± 0.744
GluGly: 4.352 ± 0.892
GluHis: 1.28 ± 0.594
GluIle: 3.84 ± 1.303
GluLys: 3.072 ± 0.767
GluLeu: 6.144 ± 1.38
GluMet: 1.536 ± 0.615
GluAsn: 1.792 ± 0.872
GluPro: 1.792 ± 0.872
GluGln: 0.512 ± 0.295
GluArg: 2.304 ± 0.724
GluSer: 6.144 ± 1.629
GluThr: 5.376 ± 1.019
GluVal: 4.096 ± 1.116
GluTrp: 0.256 ± 0.368
GluTyr: 1.28 ± 0.498
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.56 ± 1.125
PheCys: 1.792 ± 0.81
PheAsp: 1.28 ± 1.051
PheGlu: 1.28 ± 0.62
PhePhe: 2.304 ± 0.601
PheGly: 2.048 ± 0.888
PheHis: 1.536 ± 0.667
PheIle: 1.024 ± 0.419
PheLys: 3.328 ± 0.962
PheLeu: 6.144 ± 2.425
PheMet: 0.512 ± 0.327
PheAsn: 2.304 ± 1.036
PhePro: 3.328 ± 1.158
PheGln: 1.792 ± 0.496
PheArg: 3.328 ± 0.862
PheSer: 3.328 ± 0.803
PheThr: 1.536 ± 0.416
PheVal: 3.328 ± 0.865
PheTrp: 1.28 ± 0.422
PheTyr: 2.304 ± 0.643
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.304 ± 0.751
GlyCys: 1.024 ± 0.419
GlyAsp: 4.096 ± 1.021
GlyGlu: 1.792 ± 0.889
GlyPhe: 2.56 ± 0.776
GlyGly: 3.84 ± 1.585
GlyHis: 0.768 ± 0.326
GlyIle: 4.096 ± 1.637
GlyLys: 3.328 ± 1.115
GlyLeu: 9.473 ± 1.662
GlyMet: 1.536 ± 0.53
GlyAsn: 1.792 ± 0.456
GlyPro: 3.328 ± 1.803
GlyGln: 3.328 ± 1.793
GlyArg: 4.096 ± 0.654
GlySer: 5.12 ± 1.133
GlyThr: 3.072 ± 0.797
GlyVal: 3.328 ± 0.739
GlyTrp: 0.768 ± 0.326
GlyTyr: 2.048 ± 0.747
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.768 ± 0.326
HisCys: 0.512 ± 0.313
HisAsp: 0.512 ± 0.356
HisGlu: 1.536 ± 0.444
HisPhe: 1.28 ± 0.345
HisGly: 1.024 ± 0.388
HisHis: 0.768 ± 0.442
HisIle: 1.28 ± 0.43
HisLys: 0.768 ± 0.326
HisLeu: 3.072 ± 0.832
HisMet: 0.512 ± 0.493
HisAsn: 0.768 ± 0.668
HisPro: 2.304 ± 0.788
HisGln: 0.768 ± 0.442
HisArg: 2.304 ± 1.084
HisSer: 1.024 ± 0.619
HisThr: 0.768 ± 0.418
HisVal: 1.792 ± 0.737
HisTrp: 0.768 ± 0.58
HisTyr: 1.28 ± 0.422
HisXaa: 0.0 ± 0.0
Ile
IleAla: 1.792 ± 0.846
IleCys: 1.024 ± 0.675
IleAsp: 3.328 ± 0.964
IleGlu: 3.584 ± 0.946
IlePhe: 2.56 ± 0.921
IleGly: 3.584 ± 0.991
IleHis: 1.28 ± 0.594
IleIle: 3.072 ± 0.731
IleLys: 2.56 ± 0.921
IleLeu: 7.168 ± 1.703
IleMet: 0.512 ± 0.295
IleAsn: 2.304 ± 1.595
IlePro: 5.632 ± 1.048
IleGln: 2.048 ± 0.612
IleArg: 5.12 ± 1.528
IleSer: 4.352 ± 1.626
IleThr: 1.792 ± 0.52
IleVal: 2.56 ± 0.765
IleTrp: 0.768 ± 0.837
IleTyr: 2.048 ± 0.74
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.048 ± 0.69
LysCys: 0.256 ± 0.368
LysAsp: 2.304 ± 1.086
LysGlu: 3.072 ± 0.921
LysPhe: 1.536 ± 0.701
LysGly: 3.072 ± 1.642
LysHis: 0.512 ± 0.533
LysIle: 2.816 ± 0.974
LysLys: 2.816 ± 0.416
LysLeu: 5.632 ± 0.943
LysMet: 1.792 ± 0.573
LysAsn: 1.536 ± 0.94
LysPro: 1.792 ± 0.52
LysGln: 1.28 ± 0.858
LysArg: 3.84 ± 0.655
LysSer: 4.608 ± 0.654
LysThr: 2.816 ± 0.588
LysVal: 4.608 ± 1.222
LysTrp: 1.024 ± 0.419
LysTyr: 2.304 ± 0.982
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.888 ± 1.172
LeuCys: 2.048 ± 0.359
LeuAsp: 7.168 ± 1.676
LeuGlu: 7.937 ± 0.998
LeuPhe: 4.608 ± 0.818
LeuGly: 10.753 ± 1.59
LeuHis: 3.072 ± 0.748
LeuIle: 7.424 ± 1.758
LeuLys: 5.376 ± 0.899
LeuLeu: 13.569 ± 1.712
LeuMet: 3.072 ± 0.812
LeuAsn: 5.12 ± 0.953
LeuPro: 6.144 ± 1.326
LeuGln: 3.84 ± 1.12
LeuArg: 8.193 ± 2.206
LeuSer: 8.961 ± 0.876
LeuThr: 7.424 ± 1.593
LeuVal: 4.352 ± 1.883
LeuTrp: 1.28 ± 1.085
LeuTyr: 2.304 ± 0.772
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.048 ± 0.895
MetCys: 0.768 ± 0.483
MetAsp: 0.768 ± 0.628
MetGlu: 2.304 ± 0.836
MetPhe: 1.792 ± 0.904
MetGly: 1.28 ± 0.594
MetHis: 0.256 ± 0.147
MetIle: 1.536 ± 0.576
MetLys: 0.512 ± 0.313
MetLeu: 1.536 ± 0.74
MetMet: 0.768 ± 0.323
MetAsn: 1.024 ± 0.393
MetPro: 0.512 ± 0.493
MetGln: 1.536 ± 0.371
MetArg: 0.512 ± 0.464
MetSer: 1.536 ± 0.475
MetThr: 1.28 ± 0.726
MetVal: 2.56 ± 0.972
MetTrp: 0.256 ± 0.314
MetTyr: 0.768 ± 0.377
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.304 ± 1.337
AsnCys: 1.024 ± 0.639
AsnAsp: 1.024 ± 0.627
AsnGlu: 0.256 ± 0.453
AsnPhe: 2.304 ± 1.29
AsnGly: 1.28 ± 0.574
AsnHis: 1.28 ± 0.397
AsnIle: 2.048 ± 0.632
AsnLys: 1.28 ± 0.429
AsnLeu: 4.864 ± 1.105
AsnMet: 0.256 ± 0.147
AsnAsn: 2.048 ± 0.631
AsnPro: 3.584 ± 1.408
AsnGln: 2.56 ± 0.814
AsnArg: 2.56 ± 1.074
AsnSer: 3.584 ± 0.601
AsnThr: 2.048 ± 0.888
AsnVal: 1.792 ± 0.864
AsnTrp: 1.024 ± 0.419
AsnTyr: 2.304 ± 0.848
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.816 ± 0.796
ProCys: 0.512 ± 0.313
ProAsp: 4.608 ± 1.137
ProGlu: 4.096 ± 1.095
ProPhe: 3.584 ± 1.152
ProGly: 2.304 ± 1.786
ProHis: 0.768 ± 0.58
ProIle: 3.84 ± 1.033
ProLys: 1.28 ± 0.533
ProLeu: 5.888 ± 1.577
ProMet: 0.768 ± 0.377
ProAsn: 1.792 ± 0.606
ProPro: 2.816 ± 2.093
ProGln: 2.56 ± 1.651
ProArg: 4.096 ± 0.826
ProSer: 5.376 ± 1.285
ProThr: 3.328 ± 1.409
ProVal: 2.56 ± 1.033
ProTrp: 1.28 ± 0.513
ProTyr: 2.816 ± 1.426
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.536 ± 0.318
GlnCys: 1.28 ± 0.345
GlnAsp: 1.536 ± 0.923
GlnGlu: 1.536 ± 1.047
GlnPhe: 2.048 ± 0.444
GlnGly: 2.56 ± 0.876
GlnHis: 0.768 ± 0.442
GlnIle: 2.048 ± 0.563
GlnLys: 1.792 ± 0.71
GlnLeu: 3.328 ± 1.006
GlnMet: 0.768 ± 0.668
GlnAsn: 1.28 ± 0.58
GlnPro: 1.536 ± 0.935
GlnGln: 0.512 ± 0.468
GlnArg: 1.536 ± 0.475
GlnSer: 4.096 ± 1.178
GlnThr: 2.304 ± 0.907
GlnVal: 2.56 ± 1.154
GlnTrp: 0.512 ± 0.284
GlnTyr: 1.28 ± 0.724
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.584 ± 1.627
ArgCys: 0.512 ± 0.464
ArgAsp: 4.352 ± 0.596
ArgGlu: 4.608 ± 1.29
ArgPhe: 4.096 ± 1.218
ArgGly: 4.352 ± 0.986
ArgHis: 2.048 ± 0.641
ArgIle: 2.304 ± 0.587
ArgLys: 4.096 ± 0.898
ArgLeu: 7.424 ± 1.33
ArgMet: 0.768 ± 0.323
ArgAsn: 3.072 ± 1.187
ArgPro: 3.328 ± 0.945
ArgGln: 1.28 ± 0.345
ArgArg: 3.072 ± 1.127
ArgSer: 4.352 ± 0.757
ArgThr: 3.072 ± 1.005
ArgVal: 5.12 ± 1.507
ArgTrp: 1.536 ± 0.677
ArgTyr: 1.792 ± 0.758
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.376 ± 1.249
SerCys: 1.28 ± 0.504
SerAsp: 4.608 ± 1.256
SerGlu: 6.4 ± 1.842
SerPhe: 3.84 ± 0.797
SerGly: 4.608 ± 0.796
SerHis: 1.536 ± 0.667
SerIle: 5.12 ± 1.505
SerLys: 4.608 ± 0.521
SerLeu: 9.729 ± 2.106
SerMet: 2.048 ± 1.225
SerAsn: 3.328 ± 0.606
SerPro: 4.608 ± 1.447
SerGln: 2.56 ± 1.327
SerArg: 6.4 ± 0.673
SerSer: 7.937 ± 3.444
SerThr: 4.096 ± 1.181
SerVal: 3.072 ± 0.905
SerTrp: 0.768 ± 0.323
SerTyr: 2.048 ± 0.769
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.048 ± 0.865
ThrCys: 0.256 ± 0.314
ThrAsp: 2.048 ± 0.743
ThrGlu: 2.56 ± 0.876
ThrPhe: 2.56 ± 0.706
ThrGly: 3.072 ± 1.465
ThrHis: 3.072 ± 1.329
ThrIle: 4.608 ± 1.074
ThrLys: 2.304 ± 1.176
ThrLeu: 5.632 ± 1.348
ThrMet: 2.56 ± 0.369
ThrAsn: 1.792 ± 0.373
ThrPro: 3.84 ± 1.137
ThrGln: 1.28 ± 0.43
ThrArg: 1.792 ± 0.53
ThrSer: 4.864 ± 1.991
ThrThr: 3.584 ± 1.389
ThrVal: 2.56 ± 1.054
ThrTrp: 0.768 ± 0.391
ThrTyr: 1.792 ± 0.579
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.56 ± 1.608
ValCys: 1.792 ± 0.882
ValAsp: 3.584 ± 0.691
ValGlu: 2.048 ± 0.838
ValPhe: 2.048 ± 0.641
ValGly: 4.864 ± 0.949
ValHis: 0.512 ± 0.295
ValIle: 3.072 ± 1.022
ValLys: 3.84 ± 0.713
ValLeu: 8.449 ± 0.99
ValMet: 0.512 ± 0.295
ValAsn: 1.792 ± 0.652
ValPro: 3.072 ± 0.767
ValGln: 2.816 ± 1.379
ValArg: 2.56 ± 0.945
ValSer: 5.376 ± 1.218
ValThr: 2.048 ± 0.7
ValVal: 3.584 ± 1.247
ValTrp: 1.28 ± 0.345
ValTyr: 2.048 ± 0.422
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.024 ± 0.393
TrpCys: 0.512 ± 0.284
TrpAsp: 1.024 ± 0.59
TrpGlu: 1.536 ± 0.653
TrpPhe: 1.024 ± 0.33
TrpGly: 1.024 ± 0.827
TrpHis: 0.768 ± 0.668
TrpIle: 0.512 ± 0.464
TrpLys: 1.024 ± 0.419
TrpLeu: 1.536 ± 1.103
TrpMet: 0.768 ± 0.781
TrpAsn: 0.768 ± 0.442
TrpPro: 0.256 ± 0.147
TrpGln: 0.0 ± 0.0
TrpArg: 0.768 ± 1.195
TrpSer: 2.304 ± 0.724
TrpThr: 0.768 ± 0.58
TrpVal: 1.536 ± 0.616
TrpTrp: 0.768 ± 0.431
TrpTyr: 0.256 ± 0.381
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.768 ± 0.582
TyrCys: 0.256 ± 0.368
TyrAsp: 1.28 ± 0.397
TyrGlu: 2.304 ± 0.869
TyrPhe: 1.024 ± 0.464
TyrGly: 1.536 ± 0.524
TyrHis: 0.512 ± 0.284
TyrIle: 2.048 ± 0.686
TyrLys: 1.536 ± 0.444
TyrLeu: 5.12 ± 1.421
TyrMet: 1.792 ± 1.02
TyrAsn: 1.536 ± 0.887
TyrPro: 2.304 ± 0.657
TyrGln: 1.28 ± 0.921
TyrArg: 3.328 ± 0.823
TyrSer: 3.072 ± 0.982
TyrThr: 1.024 ± 0.619
TyrVal: 2.304 ± 0.972
TyrTrp: 0.768 ± 0.459
TyrTyr: 0.512 ± 0.295
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (3907 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
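One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those 100 estimates is reported as the error. This is only an assumed reading, not the database's documented procedure. The names `pair_counts` and `bootstrap_sd`, the fixed random seed, and the use of the population standard deviation are all choices made for the example.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_sd(proteins, dipeptide, rounds=100, seed=0):
    """Estimate the spread (per mille) of one dipeptide frequency.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: population standard deviation of the bootstrap estimates.
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    # Toy sequences, not Klamath virus data.
    toy = ["MKLLSA", "MLLGLLR", "MSSLEA"]
    print(round(bootstrap_sd(toy, "LL"), 3))
```

With only 8 proteins in the proteome, each resample can differ substantially from the original set, which fits the relatively large errors reported for many dipeptides.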

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski