Amino acid dipepetide frequency for Flavobacterium phage vB_FspS_sniff9-2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.837AlaAla: 0.837 ± 0.358
0.753AlaCys: 0.753 ± 0.257
1.758AlaAsp: 1.758 ± 0.336
2.846AlaGlu: 2.846 ± 0.641
2.093AlaPhe: 2.093 ± 0.435
2.009AlaGly: 2.009 ± 0.436
0.419AlaHis: 0.419 ± 0.184
3.684AlaIle: 3.684 ± 0.48
5.19AlaLys: 5.19 ± 0.676
5.19AlaLeu: 5.19 ± 0.562
1.507AlaMet: 1.507 ± 0.501
4.604AlaAsn: 4.604 ± 0.66
0.67AlaPro: 0.67 ± 0.193
2.344AlaGln: 2.344 ± 0.551
1.256AlaArg: 1.256 ± 0.365
3.014AlaSer: 3.014 ± 0.577
4.102AlaThr: 4.102 ± 0.652
3.181AlaVal: 3.181 ± 0.632
0.67AlaTrp: 0.67 ± 0.213
1.925AlaTyr: 1.925 ± 0.41
0.0AlaXaa: 0.0 ± 0.0
Cys
0.335CysAla: 0.335 ± 0.227
0.251CysCys: 0.251 ± 0.131
1.088CysAsp: 1.088 ± 0.329
1.172CysGlu: 1.172 ± 0.325
0.753CysPhe: 0.753 ± 0.244
1.005CysGly: 1.005 ± 0.26
0.167CysHis: 0.167 ± 0.108
0.67CysIle: 0.67 ± 0.261
0.921CysLys: 0.921 ± 0.299
1.591CysLeu: 1.591 ± 0.388
0.084CysMet: 0.084 ± 0.092
0.251CysAsn: 0.251 ± 0.152
0.586CysPro: 0.586 ± 0.223
0.335CysGln: 0.335 ± 0.151
0.335CysArg: 0.335 ± 0.161
0.837CysSer: 0.837 ± 0.245
0.67CysThr: 0.67 ± 0.194
0.586CysVal: 0.586 ± 0.194
0.084CysTrp: 0.084 ± 0.087
0.586CysTyr: 0.586 ± 0.226
0.0CysXaa: 0.0 ± 0.0
Asp
3.516AspAla: 3.516 ± 0.465
0.921AspCys: 0.921 ± 0.241
2.177AspAsp: 2.177 ± 0.437
4.521AspGlu: 4.521 ± 0.761
3.516AspPhe: 3.516 ± 0.553
2.595AspGly: 2.595 ± 0.492
0.586AspHis: 0.586 ± 0.222
3.6AspIle: 3.6 ± 0.526
6.111AspLys: 6.111 ± 1.0
5.19AspLeu: 5.19 ± 0.562
1.591AspMet: 1.591 ± 0.371
4.688AspAsn: 4.688 ± 0.704
0.335AspPro: 0.335 ± 0.147
0.419AspGln: 0.419 ± 0.196
1.591AspArg: 1.591 ± 0.264
3.6AspSer: 3.6 ± 0.531
3.684AspThr: 3.684 ± 0.597
2.846AspVal: 2.846 ± 0.576
0.67AspTrp: 0.67 ± 0.209
3.014AspTyr: 3.014 ± 0.513
0.0AspXaa: 0.0 ± 0.0
Glu
3.014GluAla: 3.014 ± 0.784
0.921GluCys: 0.921 ± 0.32
3.349GluAsp: 3.349 ± 0.648
4.27GluGlu: 4.27 ± 0.602
5.023GluPhe: 5.023 ± 0.729
2.009GluGly: 2.009 ± 0.313
0.921GluHis: 0.921 ± 0.235
7.618GluIle: 7.618 ± 0.791
6.697GluLys: 6.697 ± 0.946
7.367GluLeu: 7.367 ± 0.814
1.758GluMet: 1.758 ± 0.38
6.697GluAsn: 6.697 ± 0.866
2.093GluPro: 2.093 ± 0.366
3.516GluGln: 3.516 ± 0.463
2.512GluArg: 2.512 ± 0.597
3.6GluSer: 3.6 ± 0.489
4.186GluThr: 4.186 ± 0.544
3.767GluVal: 3.767 ± 0.536
0.335GluTrp: 0.335 ± 0.158
3.6GluTyr: 3.6 ± 0.678
0.0GluXaa: 0.0 ± 0.0
Phe
2.093PheAla: 2.093 ± 0.396
0.67PheCys: 0.67 ± 0.199
3.851PheAsp: 3.851 ± 0.487
3.851PheGlu: 3.851 ± 0.553
2.344PhePhe: 2.344 ± 0.402
2.93PheGly: 2.93 ± 0.627
0.586PheHis: 0.586 ± 0.228
3.6PheIle: 3.6 ± 0.624
4.353PheLys: 4.353 ± 0.631
3.6PheLeu: 3.6 ± 0.62
1.256PheMet: 1.256 ± 0.272
4.856PheAsn: 4.856 ± 0.854
1.172PhePro: 1.172 ± 0.303
1.591PheGln: 1.591 ± 0.444
0.921PheArg: 0.921 ± 0.309
3.767PheSer: 3.767 ± 0.595
4.437PheThr: 4.437 ± 0.669
2.595PheVal: 2.595 ± 0.488
0.502PheTrp: 0.502 ± 0.158
1.842PheTyr: 1.842 ± 0.401
0.0PheXaa: 0.0 ± 0.0
Gly
3.181GlyAla: 3.181 ± 0.571
0.419GlyCys: 0.419 ± 0.178
2.679GlyAsp: 2.679 ± 0.491
2.177GlyGlu: 2.177 ± 0.421
2.763GlyPhe: 2.763 ± 0.661
2.093GlyGly: 2.093 ± 0.623
0.586GlyHis: 0.586 ± 0.245
3.516GlyIle: 3.516 ± 0.431
3.349GlyLys: 3.349 ± 0.594
3.684GlyLeu: 3.684 ± 0.496
1.339GlyMet: 1.339 ± 0.335
4.772GlyAsn: 4.772 ± 0.579
0.0GlyPro: 0.0 ± 0.0
1.674GlyGln: 1.674 ± 0.36
1.674GlyArg: 1.674 ± 0.335
2.93GlySer: 2.93 ± 0.471
4.437GlyThr: 4.437 ± 0.717
2.93GlyVal: 2.93 ± 0.366
0.251GlyTrp: 0.251 ± 0.13
2.26GlyTyr: 2.26 ± 0.427
0.0GlyXaa: 0.0 ± 0.0
His
0.251HisAla: 0.251 ± 0.134
0.335HisCys: 0.335 ± 0.176
0.921HisAsp: 0.921 ± 0.267
0.586HisGlu: 0.586 ± 0.22
1.005HisPhe: 1.005 ± 0.267
0.753HisGly: 0.753 ± 0.273
0.419HisHis: 0.419 ± 0.225
1.172HisIle: 1.172 ± 0.292
1.088HisLys: 1.088 ± 0.349
1.005HisLeu: 1.005 ± 0.293
0.084HisMet: 0.084 ± 0.088
1.088HisAsn: 1.088 ± 0.296
0.502HisPro: 0.502 ± 0.223
0.502HisGln: 0.502 ± 0.163
0.586HisArg: 0.586 ± 0.238
1.088HisSer: 1.088 ± 0.291
0.753HisThr: 0.753 ± 0.185
0.586HisVal: 0.586 ± 0.251
0.0HisTrp: 0.0 ± 0.0
0.586HisTyr: 0.586 ± 0.199
0.0HisXaa: 0.0 ± 0.0
Ile
4.604IleAla: 4.604 ± 0.581
0.921IleCys: 0.921 ± 0.288
5.19IleAsp: 5.19 ± 0.607
7.786IleGlu: 7.786 ± 0.741
2.93IlePhe: 2.93 ± 0.499
3.432IleGly: 3.432 ± 0.519
1.256IleHis: 1.256 ± 0.409
5.525IleIle: 5.525 ± 0.767
8.707IleLys: 8.707 ± 0.799
6.697IleLeu: 6.697 ± 0.768
1.088IleMet: 1.088 ± 0.315
6.53IleAsn: 6.53 ± 0.753
2.344IlePro: 2.344 ± 0.424
2.679IleGln: 2.679 ± 0.545
1.674IleArg: 1.674 ± 0.471
5.442IleSer: 5.442 ± 0.831
4.772IleThr: 4.772 ± 0.788
4.772IleVal: 4.772 ± 0.65
0.753IleTrp: 0.753 ± 0.291
3.181IleTyr: 3.181 ± 0.499
0.0IleXaa: 0.0 ± 0.0
Lys
4.772LysAla: 4.772 ± 0.591
1.339LysCys: 1.339 ± 0.341
4.939LysAsp: 4.939 ± 0.637
8.79LysGlu: 8.79 ± 1.197
3.432LysPhe: 3.432 ± 0.43
3.767LysGly: 3.767 ± 0.612
2.009LysHis: 2.009 ± 0.462
7.786LysIle: 7.786 ± 0.751
7.869LysLys: 7.869 ± 0.952
8.288LysLeu: 8.288 ± 0.836
3.6LysMet: 3.6 ± 0.543
5.86LysAsn: 5.86 ± 0.613
2.93LysPro: 2.93 ± 0.491
5.442LysGln: 5.442 ± 0.889
3.767LysArg: 3.767 ± 0.631
5.442LysSer: 5.442 ± 0.513
6.195LysThr: 6.195 ± 0.785
5.442LysVal: 5.442 ± 0.682
1.256LysTrp: 1.256 ± 0.347
4.102LysTyr: 4.102 ± 0.606
0.0LysXaa: 0.0 ± 0.0
Leu
4.018LeuAla: 4.018 ± 0.735
0.502LeuCys: 0.502 ± 0.255
5.86LeuAsp: 5.86 ± 0.595
6.279LeuGlu: 6.279 ± 0.91
3.767LeuPhe: 3.767 ± 0.592
3.684LeuGly: 3.684 ± 0.491
0.837LeuHis: 0.837 ± 0.244
7.618LeuIle: 7.618 ± 0.847
9.962LeuLys: 9.962 ± 0.793
6.949LeuLeu: 6.949 ± 0.724
2.009LeuMet: 2.009 ± 0.462
7.869LeuAsn: 7.869 ± 0.856
4.018LeuPro: 4.018 ± 0.52
4.186LeuGln: 4.186 ± 0.654
2.763LeuArg: 2.763 ± 0.472
5.776LeuSer: 5.776 ± 0.656
6.111LeuThr: 6.111 ± 0.693
4.688LeuVal: 4.688 ± 0.637
1.005LeuTrp: 1.005 ± 0.294
3.265LeuTyr: 3.265 ± 0.476
0.0LeuXaa: 0.0 ± 0.0
Met
1.674MetAla: 1.674 ± 0.428
0.084MetCys: 0.084 ± 0.084
0.921MetAsp: 0.921 ± 0.279
1.925MetGlu: 1.925 ± 0.392
1.256MetPhe: 1.256 ± 0.281
0.921MetGly: 0.921 ± 0.259
0.167MetHis: 0.167 ± 0.123
1.423MetIle: 1.423 ± 0.386
2.93MetLys: 2.93 ± 0.59
1.758MetLeu: 1.758 ± 0.345
0.335MetMet: 0.335 ± 0.177
1.339MetAsn: 1.339 ± 0.366
1.005MetPro: 1.005 ± 0.281
1.339MetGln: 1.339 ± 0.304
1.088MetArg: 1.088 ± 0.243
1.591MetSer: 1.591 ± 0.374
0.921MetThr: 0.921 ± 0.255
0.921MetVal: 0.921 ± 0.201
0.335MetTrp: 0.335 ± 0.141
0.419MetTyr: 0.419 ± 0.182
0.0MetXaa: 0.0 ± 0.0
Asn
4.437AsnAla: 4.437 ± 0.636
0.753AsnCys: 0.753 ± 0.238
4.27AsnAsp: 4.27 ± 0.832
6.028AsnGlu: 6.028 ± 0.828
3.851AsnPhe: 3.851 ± 0.639
4.353AsnGly: 4.353 ± 0.628
0.921AsnHis: 0.921 ± 0.283
5.274AsnIle: 5.274 ± 0.584
9.209AsnLys: 9.209 ± 1.264
7.535AsnLeu: 7.535 ± 0.998
1.088AsnMet: 1.088 ± 0.332
4.856AsnAsn: 4.856 ± 1.173
2.26AsnPro: 2.26 ± 0.391
2.93AsnGln: 2.93 ± 0.51
2.093AsnArg: 2.093 ± 0.444
5.107AsnSer: 5.107 ± 0.701
4.186AsnThr: 4.186 ± 0.591
5.107AsnVal: 5.107 ± 0.562
0.837AsnTrp: 0.837 ± 0.284
4.772AsnTyr: 4.772 ± 0.609
0.0AsnXaa: 0.0 ± 0.0
Pro
1.339ProAla: 1.339 ± 0.292
0.335ProCys: 0.335 ± 0.138
1.172ProAsp: 1.172 ± 0.313
2.009ProGlu: 2.009 ± 0.534
1.591ProPhe: 1.591 ± 0.352
0.0ProGly: 0.0 ± 0.0
0.167ProHis: 0.167 ± 0.098
1.758ProIle: 1.758 ± 0.333
1.674ProLys: 1.674 ± 0.379
3.432ProLeu: 3.432 ± 0.505
0.67ProMet: 0.67 ± 0.227
2.595ProAsn: 2.595 ± 0.43
0.586ProPro: 0.586 ± 0.315
1.088ProGln: 1.088 ± 0.285
0.335ProArg: 0.335 ± 0.214
2.093ProSer: 2.093 ± 0.424
2.177ProThr: 2.177 ± 0.454
1.088ProVal: 1.088 ± 0.292
0.0ProTrp: 0.0 ± 0.0
1.423ProTyr: 1.423 ± 0.327
0.0ProXaa: 0.0 ± 0.0
Gln
2.009GlnAla: 2.009 ± 0.568
0.586GlnCys: 0.586 ± 0.245
1.339GlnAsp: 1.339 ± 0.368
1.925GlnGlu: 1.925 ± 0.451
1.339GlnPhe: 1.339 ± 0.293
2.428GlnGly: 2.428 ± 0.441
0.837GlnHis: 0.837 ± 0.314
4.102GlnIle: 4.102 ± 0.662
4.186GlnLys: 4.186 ± 0.813
4.186GlnLeu: 4.186 ± 0.658
1.256GlnMet: 1.256 ± 0.419
2.428GlnAsn: 2.428 ± 0.507
1.339GlnPro: 1.339 ± 0.341
2.177GlnGln: 2.177 ± 0.69
1.674GlnArg: 1.674 ± 0.373
2.512GlnSer: 2.512 ± 0.398
2.177GlnThr: 2.177 ± 0.444
1.925GlnVal: 1.925 ± 0.435
0.419GlnTrp: 0.419 ± 0.193
1.674GlnTyr: 1.674 ± 0.362
0.0GlnXaa: 0.0 ± 0.0
Arg
0.753ArgAla: 0.753 ± 0.314
0.502ArgCys: 0.502 ± 0.187
1.256ArgAsp: 1.256 ± 0.304
2.093ArgGlu: 2.093 ± 0.441
1.172ArgPhe: 1.172 ± 0.327
1.172ArgGly: 1.172 ± 0.31
0.419ArgHis: 0.419 ± 0.179
2.763ArgIle: 2.763 ± 0.577
2.763ArgLys: 2.763 ± 0.427
3.767ArgLeu: 3.767 ± 0.512
0.586ArgMet: 0.586 ± 0.208
2.512ArgAsn: 2.512 ± 0.488
0.251ArgPro: 0.251 ± 0.151
1.005ArgGln: 1.005 ± 0.267
0.921ArgArg: 0.921 ± 0.227
2.26ArgSer: 2.26 ± 0.468
2.26ArgThr: 2.26 ± 0.401
2.177ArgVal: 2.177 ± 0.38
0.084ArgTrp: 0.084 ± 0.099
1.507ArgTyr: 1.507 ± 0.368
0.0ArgXaa: 0.0 ± 0.0
Ser
2.177SerAla: 2.177 ± 0.409
0.586SerCys: 0.586 ± 0.254
4.018SerAsp: 4.018 ± 0.716
5.274SerGlu: 5.274 ± 0.682
4.102SerPhe: 4.102 ± 0.67
4.604SerGly: 4.604 ± 0.706
0.67SerHis: 0.67 ± 0.19
5.86SerIle: 5.86 ± 0.725
5.525SerLys: 5.525 ± 0.746
5.19SerLeu: 5.19 ± 0.668
1.339SerMet: 1.339 ± 0.367
5.19SerAsn: 5.19 ± 0.784
1.256SerPro: 1.256 ± 0.331
2.595SerGln: 2.595 ± 0.497
1.842SerArg: 1.842 ± 0.337
3.516SerSer: 3.516 ± 0.675
2.846SerThr: 2.846 ± 0.523
4.604SerVal: 4.604 ± 0.748
0.502SerTrp: 0.502 ± 0.234
1.591SerTyr: 1.591 ± 0.333
0.0SerXaa: 0.0 ± 0.0
Thr
3.935ThrAla: 3.935 ± 0.661
0.837ThrCys: 0.837 ± 0.283
4.521ThrAsp: 4.521 ± 0.61
4.939ThrGlu: 4.939 ± 0.852
4.353ThrPhe: 4.353 ± 0.764
3.349ThrGly: 3.349 ± 0.686
0.753ThrHis: 0.753 ± 0.206
5.944ThrIle: 5.944 ± 0.781
5.023ThrLys: 5.023 ± 0.596
6.028ThrLeu: 6.028 ± 0.67
1.005ThrMet: 1.005 ± 0.27
4.521ThrAsn: 4.521 ± 0.799
1.842ThrPro: 1.842 ± 0.47
2.763ThrGln: 2.763 ± 0.58
1.591ThrArg: 1.591 ± 0.258
3.516ThrSer: 3.516 ± 0.614
4.772ThrThr: 4.772 ± 0.969
1.591ThrVal: 1.591 ± 0.351
0.67ThrTrp: 0.67 ± 0.192
2.009ThrTyr: 2.009 ± 0.417
0.0ThrXaa: 0.0 ± 0.0
Val
3.098ValAla: 3.098 ± 0.456
0.753ValCys: 0.753 ± 0.276
2.846ValAsp: 2.846 ± 0.456
3.265ValGlu: 3.265 ± 0.6
2.595ValPhe: 2.595 ± 0.481
3.265ValGly: 3.265 ± 0.381
0.586ValHis: 0.586 ± 0.2
4.018ValIle: 4.018 ± 0.459
5.609ValLys: 5.609 ± 0.64
4.856ValLeu: 4.856 ± 0.585
1.256ValMet: 1.256 ± 0.308
4.856ValAsn: 4.856 ± 0.805
1.088ValPro: 1.088 ± 0.275
2.26ValGln: 2.26 ± 0.367
1.674ValArg: 1.674 ± 0.451
4.102ValSer: 4.102 ± 0.691
2.093ValThr: 2.093 ± 0.38
3.098ValVal: 3.098 ± 0.59
0.753ValTrp: 0.753 ± 0.25
2.093ValTyr: 2.093 ± 0.379
0.0ValXaa: 0.0 ± 0.0
Trp
0.502TrpAla: 0.502 ± 0.185
0.167TrpCys: 0.167 ± 0.121
0.753TrpAsp: 0.753 ± 0.285
0.753TrpGlu: 0.753 ± 0.232
0.419TrpPhe: 0.419 ± 0.168
0.335TrpGly: 0.335 ± 0.147
0.167TrpHis: 0.167 ± 0.1
0.921TrpIle: 0.921 ± 0.315
1.088TrpLys: 1.088 ± 0.377
1.256TrpLeu: 1.256 ± 0.306
0.084TrpMet: 0.084 ± 0.086
0.753TrpAsn: 0.753 ± 0.312
0.0TrpPro: 0.0 ± 0.0
0.335TrpGln: 0.335 ± 0.124
0.419TrpArg: 0.419 ± 0.155
0.419TrpSer: 0.419 ± 0.205
0.335TrpThr: 0.335 ± 0.159
0.251TrpVal: 0.251 ± 0.176
0.0TrpTrp: 0.0 ± 0.0
0.502TrpTyr: 0.502 ± 0.225
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.507TyrAla: 1.507 ± 0.243
0.67TyrCys: 0.67 ± 0.196
2.26TyrAsp: 2.26 ± 0.388
2.846TyrGlu: 2.846 ± 0.436
2.512TyrPhe: 2.512 ± 0.468
2.009TyrGly: 2.009 ± 0.323
0.837TyrHis: 0.837 ± 0.251
3.432TyrIle: 3.432 ± 0.522
4.772TyrLys: 4.772 ± 0.643
3.6TyrLeu: 3.6 ± 0.526
0.335TyrMet: 0.335 ± 0.168
3.6TyrAsn: 3.6 ± 0.506
1.256TyrPro: 1.256 ± 0.384
1.339TyrGln: 1.339 ± 0.282
1.507TyrArg: 1.507 ± 0.462
2.679TyrSer: 2.679 ± 0.485
2.846TyrThr: 2.846 ± 0.589
2.009TyrVal: 2.009 ± 0.424
0.335TyrTrp: 0.335 ± 0.13
2.177TyrTyr: 2.177 ± 0.491
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (11946 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski