Amino acid dipepetide frequency for BtMf-AlphaCoV/GD2012

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.023AlaAla: 6.023 ± 1.627
3.17AlaCys: 3.17 ± 0.579
2.642AlaAsp: 2.642 ± 0.947
2.642AlaGlu: 2.642 ± 0.72
4.755AlaPhe: 4.755 ± 0.899
4.121AlaGly: 4.121 ± 0.696
1.268AlaHis: 1.268 ± 0.397
4.65AlaIle: 4.65 ± 0.974
3.699AlaLys: 3.699 ± 1.102
6.763AlaLeu: 6.763 ± 0.512
1.585AlaMet: 1.585 ± 0.426
3.699AlaAsn: 3.699 ± 0.876
2.325AlaPro: 2.325 ± 1.392
1.374AlaGln: 1.374 ± 0.551
3.276AlaArg: 3.276 ± 0.703
5.495AlaSer: 5.495 ± 1.023
3.593AlaThr: 3.593 ± 0.892
7.397AlaVal: 7.397 ± 1.292
0.845AlaTrp: 0.845 ± 0.353
3.91AlaTyr: 3.91 ± 1.127
0.0AlaXaa: 0.0 ± 0.0
Cys
2.113CysAla: 2.113 ± 0.577
0.951CysCys: 0.951 ± 0.367
2.113CysAsp: 2.113 ± 1.099
0.528CysGlu: 0.528 ± 0.174
2.431CysPhe: 2.431 ± 0.695
2.748CysGly: 2.748 ± 0.987
0.423CysHis: 0.423 ± 0.385
1.057CysIle: 1.057 ± 0.586
2.113CysLys: 2.113 ± 0.746
2.113CysLeu: 2.113 ± 0.617
0.317CysMet: 0.317 ± 0.548
2.219CysAsn: 2.219 ± 0.738
0.845CysPro: 0.845 ± 0.361
1.057CysGln: 1.057 ± 0.349
1.268CysArg: 1.268 ± 0.631
2.536CysSer: 2.536 ± 0.756
2.325CysThr: 2.325 ± 0.843
3.593CysVal: 3.593 ± 0.965
0.528CysTrp: 0.528 ± 0.275
1.902CysTyr: 1.902 ± 0.989
0.0CysXaa: 0.0 ± 0.0
Asp
4.861AspAla: 4.861 ± 1.6
1.585AspCys: 1.585 ± 0.523
1.796AspAsp: 1.796 ± 0.591
2.219AspGlu: 2.219 ± 0.724
4.121AspPhe: 4.121 ± 0.529
5.706AspGly: 5.706 ± 1.516
1.162AspHis: 1.162 ± 0.399
2.431AspIle: 2.431 ± 0.824
2.431AspLys: 2.431 ± 0.476
5.072AspLeu: 5.072 ± 0.382
1.057AspMet: 1.057 ± 0.375
2.748AspAsn: 2.748 ± 0.66
2.008AspPro: 2.008 ± 0.822
1.057AspGln: 1.057 ± 0.357
1.057AspArg: 1.057 ± 0.418
3.382AspSer: 3.382 ± 0.281
2.219AspThr: 2.219 ± 0.361
5.178AspVal: 5.178 ± 0.766
0.845AspTrp: 0.845 ± 0.485
2.853AspTyr: 2.853 ± 0.701
0.0AspXaa: 0.0 ± 0.0
Glu
2.642GluAla: 2.642 ± 1.102
1.691GluCys: 1.691 ± 0.629
1.691GluAsp: 1.691 ± 0.558
1.268GluGlu: 1.268 ± 0.51
3.065GluPhe: 3.065 ± 0.897
2.748GluGly: 2.748 ± 0.721
1.479GluHis: 1.479 ± 1.084
2.008GluIle: 2.008 ± 0.693
1.902GluLys: 1.902 ± 0.579
3.276GluLeu: 3.276 ± 2.083
0.528GluMet: 0.528 ± 0.273
1.902GluAsn: 1.902 ± 0.355
2.536GluPro: 2.536 ± 0.41
1.691GluGln: 1.691 ± 1.075
1.796GluArg: 1.796 ± 0.56
1.162GluSer: 1.162 ± 0.503
2.325GluThr: 2.325 ± 0.688
3.593GluVal: 3.593 ± 0.633
0.528GluTrp: 0.528 ± 0.282
1.268GluTyr: 1.268 ± 0.307
0.0GluXaa: 0.0 ± 0.0
Phe
4.227PheAla: 4.227 ± 1.041
1.691PheCys: 1.691 ± 0.74
4.227PheAsp: 4.227 ± 0.94
2.748PheGlu: 2.748 ± 0.66
2.748PhePhe: 2.748 ± 0.762
4.016PheGly: 4.016 ± 1.013
0.74PheHis: 0.74 ± 0.586
2.642PheIle: 2.642 ± 0.904
4.016PheLys: 4.016 ± 1.021
3.804PheLeu: 3.804 ± 0.658
1.374PheMet: 1.374 ± 0.499
4.227PheAsn: 4.227 ± 1.061
0.634PhePro: 0.634 ± 0.252
1.268PheGln: 1.268 ± 0.452
1.268PheArg: 1.268 ± 0.712
3.487PheSer: 3.487 ± 0.924
3.17PheThr: 3.17 ± 0.965
7.186PheVal: 7.186 ± 1.866
1.268PheTrp: 1.268 ± 0.334
3.382PheTyr: 3.382 ± 0.574
0.0PheXaa: 0.0 ± 0.0
Gly
4.967GlyAla: 4.967 ± 1.241
2.219GlyCys: 2.219 ± 0.73
4.967GlyAsp: 4.967 ± 1.207
2.113GlyGlu: 2.113 ± 0.606
4.016GlyPhe: 4.016 ± 0.706
5.072GlyGly: 5.072 ± 0.762
0.423GlyHis: 0.423 ± 0.14
2.536GlyIle: 2.536 ± 0.769
3.382GlyLys: 3.382 ± 1.305
5.495GlyLeu: 5.495 ± 0.664
1.057GlyMet: 1.057 ± 0.555
4.121GlyAsn: 4.121 ± 1.68
2.008GlyPro: 2.008 ± 0.66
1.057GlyGln: 1.057 ± 0.357
2.219GlyArg: 2.219 ± 0.714
6.129GlySer: 6.129 ± 0.475
4.016GlyThr: 4.016 ± 0.771
8.877GlyVal: 8.877 ± 1.981
0.634GlyTrp: 0.634 ± 0.237
2.959GlyTyr: 2.959 ± 0.825
0.0GlyXaa: 0.0 ± 0.0
His
1.796HisAla: 1.796 ± 0.812
0.634HisCys: 0.634 ± 0.33
0.634HisAsp: 0.634 ± 0.394
1.162HisGlu: 1.162 ± 0.936
1.057HisPhe: 1.057 ± 0.969
1.057HisGly: 1.057 ± 0.715
0.211HisHis: 0.211 ± 0.365
0.74HisIle: 0.74 ± 0.265
0.74HisLys: 0.74 ± 0.385
1.479HisLeu: 1.479 ± 0.815
0.106HisMet: 0.106 ± 0.055
1.268HisAsn: 1.268 ± 0.469
0.423HisPro: 0.423 ± 0.491
0.423HisGln: 0.423 ± 0.14
0.528HisArg: 0.528 ± 0.823
1.162HisSer: 1.162 ± 0.436
1.374HisThr: 1.374 ± 0.578
2.325HisVal: 2.325 ± 0.64
0.317HisTrp: 0.317 ± 0.122
0.845HisTyr: 0.845 ± 0.43
0.0HisXaa: 0.0 ± 0.0
Ile
3.276IleAla: 3.276 ± 0.954
1.162IleCys: 1.162 ± 0.721
2.642IleAsp: 2.642 ± 1.106
1.691IleGlu: 1.691 ± 0.427
2.642IlePhe: 2.642 ± 0.904
2.642IleGly: 2.642 ± 1.102
0.423IleHis: 0.423 ± 0.14
1.796IleIle: 1.796 ± 1.73
2.536IleLys: 2.536 ± 0.758
3.487IleLeu: 3.487 ± 1.24
1.268IleMet: 1.268 ± 0.542
2.325IleAsn: 2.325 ± 0.487
2.536IlePro: 2.536 ± 1.144
1.796IleGln: 1.796 ± 0.654
1.796IleArg: 1.796 ± 0.993
3.065IleSer: 3.065 ± 1.062
3.382IleThr: 3.382 ± 1.105
5.389IleVal: 5.389 ± 0.909
0.317IleTrp: 0.317 ± 0.337
1.057IleTyr: 1.057 ± 0.317
0.0IleXaa: 0.0 ± 0.0
Lys
3.804LysAla: 3.804 ± 1.242
2.008LysCys: 2.008 ± 0.784
3.065LysAsp: 3.065 ± 1.159
2.113LysGlu: 2.113 ± 1.605
3.065LysPhe: 3.065 ± 0.704
3.17LysGly: 3.17 ± 1.564
2.113LysHis: 2.113 ± 1.099
2.113LysIle: 2.113 ± 0.399
1.374LysLys: 1.374 ± 1.377
4.65LysLeu: 4.65 ± 1.247
1.057LysMet: 1.057 ± 0.354
2.431LysAsn: 2.431 ± 0.78
3.276LysPro: 3.276 ± 0.577
1.691LysGln: 1.691 ± 0.446
2.536LysArg: 2.536 ± 0.957
3.593LysSer: 3.593 ± 1.809
3.065LysThr: 3.065 ± 0.698
4.016LysVal: 4.016 ± 1.088
0.423LysTrp: 0.423 ± 0.255
2.219LysTyr: 2.219 ± 0.487
0.0LysXaa: 0.0 ± 0.0
Leu
6.235LeuAla: 6.235 ± 1.507
2.959LeuCys: 2.959 ± 0.537
3.699LeuAsp: 3.699 ± 0.442
3.91LeuGlu: 3.91 ± 1.266
4.65LeuPhe: 4.65 ± 1.482
5.495LeuGly: 5.495 ± 0.763
1.585LeuHis: 1.585 ± 0.942
3.593LeuIle: 3.593 ± 1.738
4.227LeuLys: 4.227 ± 1.607
7.397LeuLeu: 7.397 ± 2.991
1.268LeuMet: 1.268 ± 0.724
4.967LeuAsn: 4.967 ± 0.812
4.121LeuPro: 4.121 ± 2.166
3.804LeuGln: 3.804 ± 1.17
3.065LeuArg: 3.065 ± 0.582
7.292LeuSer: 7.292 ± 1.548
4.544LeuThr: 4.544 ± 0.803
6.023LeuVal: 6.023 ± 1.602
1.162LeuTrp: 1.162 ± 0.855
4.333LeuTyr: 4.333 ± 1.146
0.0LeuXaa: 0.0 ± 0.0
Met
1.162MetAla: 1.162 ± 0.639
1.057MetCys: 1.057 ± 0.55
1.162MetAsp: 1.162 ± 0.471
0.634MetGlu: 0.634 ± 0.217
1.479MetPhe: 1.479 ± 0.321
1.374MetGly: 1.374 ± 0.261
0.211MetHis: 0.211 ± 0.11
0.845MetIle: 0.845 ± 0.315
0.423MetLys: 0.423 ± 0.516
2.642MetLeu: 2.642 ± 1.37
0.423MetMet: 0.423 ± 0.22
0.74MetAsn: 0.74 ± 0.448
0.845MetPro: 0.845 ± 0.306
0.634MetGln: 0.634 ± 0.476
1.268MetArg: 1.268 ± 0.71
1.374MetSer: 1.374 ± 0.499
0.951MetThr: 0.951 ± 0.312
1.479MetVal: 1.479 ± 0.366
0.106MetTrp: 0.106 ± 0.055
0.951MetTyr: 0.951 ± 0.284
0.0MetXaa: 0.0 ± 0.0
Asn
3.699AsnAla: 3.699 ± 1.419
2.325AsnCys: 2.325 ± 0.843
1.691AsnAsp: 1.691 ± 0.712
2.113AsnGlu: 2.113 ± 0.394
2.536AsnPhe: 2.536 ± 0.625
5.918AsnGly: 5.918 ± 1.323
0.74AsnHis: 0.74 ± 0.265
3.382AsnIle: 3.382 ± 0.61
2.431AsnLys: 2.431 ± 0.703
3.804AsnLeu: 3.804 ± 0.649
1.162AsnMet: 1.162 ± 0.427
2.642AsnAsn: 2.642 ± 0.594
2.008AsnPro: 2.008 ± 0.524
1.374AsnGln: 1.374 ± 1.206
1.585AsnArg: 1.585 ± 0.413
5.178AsnSer: 5.178 ± 2.282
3.382AsnThr: 3.382 ± 0.827
6.446AsnVal: 6.446 ± 1.443
0.845AsnTrp: 0.845 ± 0.665
2.219AsnTyr: 2.219 ± 0.548
0.0AsnXaa: 0.0 ± 0.0
Pro
2.748ProAla: 2.748 ± 0.721
1.057ProCys: 1.057 ± 0.418
2.113ProAsp: 2.113 ± 0.514
2.113ProGlu: 2.113 ± 0.631
1.691ProPhe: 1.691 ± 0.427
2.642ProGly: 2.642 ± 0.72
0.951ProHis: 0.951 ± 0.424
1.268ProIle: 1.268 ± 0.399
2.219ProLys: 2.219 ± 2.163
3.804ProLeu: 3.804 ± 0.853
0.634ProMet: 0.634 ± 0.217
2.008ProAsn: 2.008 ± 1.212
2.748ProPro: 2.748 ± 0.503
1.796ProGln: 1.796 ± 1.718
1.374ProArg: 1.374 ± 0.818
3.065ProSer: 3.065 ± 1.291
2.642ProThr: 2.642 ± 1.268
4.544ProVal: 4.544 ± 1.219
0.423ProTrp: 0.423 ± 0.14
0.845ProTyr: 0.845 ± 0.439
0.0ProXaa: 0.0 ± 0.0
Gln
2.219GlnAla: 2.219 ± 0.555
0.634GlnCys: 0.634 ± 0.476
1.479GlnAsp: 1.479 ± 0.328
0.951GlnGlu: 0.951 ± 0.369
1.374GlnPhe: 1.374 ± 0.346
1.585GlnGly: 1.585 ± 0.492
0.528GlnHis: 0.528 ± 0.317
0.951GlnIle: 0.951 ± 0.39
1.374GlnLys: 1.374 ± 0.481
4.016GlnLeu: 4.016 ± 1.323
0.74GlnMet: 0.74 ± 0.257
1.162GlnAsn: 1.162 ± 0.533
1.902GlnPro: 1.902 ± 1.082
1.585GlnGln: 1.585 ± 1.244
1.796GlnArg: 1.796 ± 1.01
2.325GlnSer: 2.325 ± 1.066
1.585GlnThr: 1.585 ± 0.487
2.219GlnVal: 2.219 ± 1.767
0.317GlnTrp: 0.317 ± 0.122
1.057GlnTyr: 1.057 ± 0.727
0.0GlnXaa: 0.0 ± 0.0
Arg
2.642ArgAla: 2.642 ± 0.688
1.585ArgCys: 1.585 ± 0.482
1.796ArgAsp: 1.796 ± 1.117
1.057ArgGlu: 1.057 ± 0.72
2.325ArgPhe: 2.325 ± 0.69
2.219ArgGly: 2.219 ± 1.201
1.057ArgHis: 1.057 ± 1.505
1.479ArgIle: 1.479 ± 0.853
2.431ArgLys: 2.431 ± 1.502
3.065ArgLeu: 3.065 ± 1.601
0.845ArgMet: 0.845 ± 0.223
1.902ArgAsn: 1.902 ± 0.534
1.268ArgPro: 1.268 ± 0.465
1.057ArgGln: 1.057 ± 0.521
1.268ArgArg: 1.268 ± 0.383
2.431ArgSer: 2.431 ± 2.095
2.536ArgThr: 2.536 ± 1.091
4.016ArgVal: 4.016 ± 0.496
0.423ArgTrp: 0.423 ± 0.284
1.902ArgTyr: 1.902 ± 0.55
0.0ArgXaa: 0.0 ± 0.0
Ser
4.967SerAla: 4.967 ± 0.913
1.374SerCys: 1.374 ± 0.55
4.227SerAsp: 4.227 ± 1.014
2.748SerGlu: 2.748 ± 0.863
4.65SerPhe: 4.65 ± 1.293
4.861SerGly: 4.861 ± 0.761
1.374SerHis: 1.374 ± 0.388
3.487SerIle: 3.487 ± 1.058
4.544SerLys: 4.544 ± 0.983
4.861SerLeu: 4.861 ± 1.095
1.691SerMet: 1.691 ± 0.629
3.699SerAsn: 3.699 ± 1.739
1.902SerPro: 1.902 ± 0.498
2.431SerGln: 2.431 ± 1.947
2.325SerArg: 2.325 ± 3.026
4.967SerSer: 4.967 ± 1.638
5.178SerThr: 5.178 ± 1.216
7.714SerVal: 7.714 ± 1.723
0.845SerTrp: 0.845 ± 0.298
2.853SerTyr: 2.853 ± 0.868
0.0SerXaa: 0.0 ± 0.0
Thr
3.804ThrAla: 3.804 ± 0.546
1.691ThrCys: 1.691 ± 0.74
3.487ThrAsp: 3.487 ± 0.836
2.325ThrGlu: 2.325 ± 0.889
3.699ThrPhe: 3.699 ± 1.022
3.17ThrGly: 3.17 ± 1.094
0.528ThrHis: 0.528 ± 0.568
3.276ThrIle: 3.276 ± 0.652
2.536ThrLys: 2.536 ± 1.128
5.706ThrLeu: 5.706 ± 0.848
1.585ThrMet: 1.585 ± 0.824
2.853ThrAsn: 2.853 ± 0.787
3.17ThrPro: 3.17 ± 1.939
1.902ThrGln: 1.902 ± 0.565
2.959ThrArg: 2.959 ± 0.47
4.438ThrSer: 4.438 ± 1.001
5.072ThrThr: 5.072 ± 0.802
6.129ThrVal: 6.129 ± 1.398
0.317ThrTrp: 0.317 ± 0.165
2.325ThrTyr: 2.325 ± 0.533
0.0ThrXaa: 0.0 ± 0.0
Val
8.348ValAla: 8.348 ± 0.866
3.487ValCys: 3.487 ± 1.176
6.658ValAsp: 6.658 ± 0.99
4.544ValGlu: 4.544 ± 1.107
4.967ValPhe: 4.967 ± 0.599
6.023ValGly: 6.023 ± 1.196
1.585ValHis: 1.585 ± 0.686
5.284ValIle: 5.284 ± 0.961
6.869ValLys: 6.869 ± 2.777
7.503ValLeu: 7.503 ± 1.312
1.796ValMet: 1.796 ± 0.46
6.763ValAsn: 6.763 ± 1.474
4.544ValPro: 4.544 ± 0.816
2.748ValGln: 2.748 ± 0.818
3.487ValArg: 3.487 ± 2.13
6.869ValSer: 6.869 ± 1.409
5.918ValThr: 5.918 ± 0.974
11.519ValVal: 11.519 ± 1.897
1.057ValTrp: 1.057 ± 0.442
3.699ValTyr: 3.699 ± 0.751
0.0ValXaa: 0.0 ± 0.0
Trp
1.057TrpAla: 1.057 ± 0.757
0.211TrpCys: 0.211 ± 0.128
0.845TrpAsp: 0.845 ± 0.315
0.211TrpGlu: 0.211 ± 0.11
0.845TrpPhe: 0.845 ± 0.442
0.211TrpGly: 0.211 ± 0.128
0.423TrpHis: 0.423 ± 0.284
0.317TrpIle: 0.317 ± 0.236
0.317TrpLys: 0.317 ± 0.304
1.585TrpLeu: 1.585 ± 0.697
0.211TrpMet: 0.211 ± 0.11
0.845TrpAsn: 0.845 ± 0.665
0.634TrpPro: 0.634 ± 0.334
0.106TrpGln: 0.106 ± 0.055
0.74TrpArg: 0.74 ± 0.232
0.528TrpSer: 0.528 ± 0.275
0.951TrpThr: 0.951 ± 0.667
0.951TrpVal: 0.951 ± 0.335
0.106TrpTrp: 0.106 ± 0.055
0.951TrpTyr: 0.951 ± 0.37
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.748TyrAla: 2.748 ± 0.84
1.691TyrCys: 1.691 ± 0.721
2.853TyrAsp: 2.853 ± 1.21
2.008TyrGlu: 2.008 ± 0.766
2.008TyrPhe: 2.008 ± 0.466
3.487TyrGly: 3.487 ± 0.715
1.057TyrHis: 1.057 ± 0.486
1.374TyrIle: 1.374 ± 0.42
2.219TyrLys: 2.219 ± 0.504
3.804TyrLeu: 3.804 ± 0.75
0.951TyrMet: 0.951 ± 0.949
2.959TyrAsn: 2.959 ± 1.15
0.951TyrPro: 0.951 ± 0.495
0.951TyrGln: 0.951 ± 0.383
1.691TyrArg: 1.691 ± 0.978
2.219TyrSer: 2.219 ± 0.723
2.748TyrThr: 2.748 ± 0.744
5.072TyrVal: 5.072 ± 1.087
0.74TyrTrp: 0.74 ± 0.585
2.748TyrTyr: 2.748 ± 0.647
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (9464 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski