Amino acid dipepetide frequency for Flavobacterium phage FPSV-S27

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.329AlaAla: 8.329 ± 1.216
0.661AlaCys: 0.661 ± 0.369
3.702AlaAsp: 3.702 ± 0.717
3.437AlaGlu: 3.437 ± 0.744
3.57AlaPhe: 3.57 ± 0.561
6.478AlaGly: 6.478 ± 1.299
0.793AlaHis: 0.793 ± 0.344
5.156AlaIle: 5.156 ± 0.881
5.685AlaLys: 5.685 ± 0.842
7.271AlaLeu: 7.271 ± 1.019
1.586AlaMet: 1.586 ± 0.518
2.644AlaAsn: 2.644 ± 0.497
1.454AlaPro: 1.454 ± 0.418
3.437AlaGln: 3.437 ± 0.792
1.586AlaArg: 1.586 ± 0.465
4.892AlaSer: 4.892 ± 1.015
3.966AlaThr: 3.966 ± 0.624
5.288AlaVal: 5.288 ± 1.197
0.925AlaTrp: 0.925 ± 0.358
2.247AlaTyr: 2.247 ± 0.536
0.0AlaXaa: 0.0 ± 0.0
Cys
0.264CysAla: 0.264 ± 0.173
0.0CysCys: 0.0 ± 0.0
0.264CysAsp: 0.264 ± 0.182
0.661CysGlu: 0.661 ± 0.31
0.397CysPhe: 0.397 ± 0.324
0.661CysGly: 0.661 ± 0.246
0.0CysHis: 0.0 ± 0.0
1.322CysIle: 1.322 ± 0.331
0.397CysLys: 0.397 ± 0.21
0.925CysLeu: 0.925 ± 0.356
0.264CysMet: 0.264 ± 0.164
0.661CysAsn: 0.661 ± 0.289
0.0CysPro: 0.0 ± 0.0
0.397CysGln: 0.397 ± 0.198
0.132CysArg: 0.132 ± 0.116
0.397CysSer: 0.397 ± 0.272
0.264CysThr: 0.264 ± 0.167
0.529CysVal: 0.529 ± 0.295
0.397CysTrp: 0.397 ± 0.258
0.397CysTyr: 0.397 ± 0.185
0.0CysXaa: 0.0 ± 0.0
Asp
4.759AspAla: 4.759 ± 0.877
0.397AspCys: 0.397 ± 0.196
3.041AspAsp: 3.041 ± 0.701
6.081AspGlu: 6.081 ± 1.057
4.627AspPhe: 4.627 ± 0.9
4.892AspGly: 4.892 ± 0.73
0.925AspHis: 0.925 ± 0.333
5.288AspIle: 5.288 ± 0.863
5.024AspLys: 5.024 ± 0.582
5.156AspLeu: 5.156 ± 0.743
1.058AspMet: 1.058 ± 0.38
3.173AspAsn: 3.173 ± 0.479
2.776AspPro: 2.776 ± 0.446
0.661AspGln: 0.661 ± 0.245
0.529AspArg: 0.529 ± 0.318
1.983AspSer: 1.983 ± 0.479
3.041AspThr: 3.041 ± 0.701
4.231AspVal: 4.231 ± 0.82
0.925AspTrp: 0.925 ± 0.35
3.173AspTyr: 3.173 ± 0.695
0.0AspXaa: 0.0 ± 0.0
Glu
5.817GluAla: 5.817 ± 1.044
0.397GluCys: 0.397 ± 0.254
3.437GluAsp: 3.437 ± 0.542
4.098GluGlu: 4.098 ± 0.692
3.702GluPhe: 3.702 ± 0.62
3.834GluGly: 3.834 ± 0.619
0.661GluHis: 0.661 ± 0.305
8.065GluIle: 8.065 ± 1.265
4.627GluLys: 4.627 ± 1.127
5.817GluLeu: 5.817 ± 0.878
1.19GluMet: 1.19 ± 0.474
5.288GluAsn: 5.288 ± 0.763
1.19GluPro: 1.19 ± 0.375
2.115GluGln: 2.115 ± 0.444
1.983GluArg: 1.983 ± 0.452
2.512GluSer: 2.512 ± 0.574
2.909GluThr: 2.909 ± 0.452
5.949GluVal: 5.949 ± 0.718
0.529GluTrp: 0.529 ± 0.269
2.115GluTyr: 2.115 ± 0.452
0.0GluXaa: 0.0 ± 0.0
Phe
2.512PheAla: 2.512 ± 0.633
0.397PheCys: 0.397 ± 0.212
5.288PheAsp: 5.288 ± 0.817
3.437PheGlu: 3.437 ± 0.689
2.115PhePhe: 2.115 ± 0.542
3.305PheGly: 3.305 ± 0.79
0.529PheHis: 0.529 ± 0.336
3.57PheIle: 3.57 ± 0.552
5.42PheLys: 5.42 ± 1.032
4.495PheLeu: 4.495 ± 0.823
0.925PheMet: 0.925 ± 0.269
4.627PheAsn: 4.627 ± 0.626
0.661PhePro: 0.661 ± 0.308
1.322PheGln: 1.322 ± 0.358
1.719PheArg: 1.719 ± 0.479
4.098PheSer: 4.098 ± 0.802
3.702PheThr: 3.702 ± 0.8
2.38PheVal: 2.38 ± 0.541
0.397PheTrp: 0.397 ± 0.184
2.38PheTyr: 2.38 ± 0.508
0.0PheXaa: 0.0 ± 0.0
Gly
4.627GlyAla: 4.627 ± 1.316
0.529GlyCys: 0.529 ± 0.22
4.363GlyAsp: 4.363 ± 0.969
3.702GlyGlu: 3.702 ± 0.667
5.156GlyPhe: 5.156 ± 0.665
5.42GlyGly: 5.42 ± 1.603
0.793GlyHis: 0.793 ± 0.363
3.966GlyIle: 3.966 ± 0.63
4.892GlyLys: 4.892 ± 0.693
6.346GlyLeu: 6.346 ± 0.772
1.19GlyMet: 1.19 ± 0.412
2.512GlyAsn: 2.512 ± 0.604
0.132GlyPro: 0.132 ± 0.141
2.115GlyGln: 2.115 ± 0.562
2.115GlyArg: 2.115 ± 0.566
4.627GlySer: 4.627 ± 0.663
3.966GlyThr: 3.966 ± 0.63
6.478GlyVal: 6.478 ± 0.934
0.397GlyTrp: 0.397 ± 0.233
2.247GlyTyr: 2.247 ± 0.562
0.0GlyXaa: 0.0 ± 0.0
His
0.264HisAla: 0.264 ± 0.157
0.132HisCys: 0.132 ± 0.117
0.397HisAsp: 0.397 ± 0.188
0.793HisGlu: 0.793 ± 0.29
0.793HisPhe: 0.793 ± 0.319
0.925HisGly: 0.925 ± 0.387
0.0HisHis: 0.0 ± 0.0
1.058HisIle: 1.058 ± 0.522
0.925HisLys: 0.925 ± 0.292
0.925HisLeu: 0.925 ± 0.322
0.132HisMet: 0.132 ± 0.125
0.397HisAsn: 0.397 ± 0.23
0.661HisPro: 0.661 ± 0.235
0.264HisGln: 0.264 ± 0.17
0.397HisArg: 0.397 ± 0.264
0.397HisSer: 0.397 ± 0.241
0.397HisThr: 0.397 ± 0.172
1.058HisVal: 1.058 ± 0.387
0.264HisTrp: 0.264 ± 0.158
0.397HisTyr: 0.397 ± 0.213
0.0HisXaa: 0.0 ± 0.0
Ile
5.42IleAla: 5.42 ± 0.981
0.793IleCys: 0.793 ± 0.282
5.553IleAsp: 5.553 ± 0.728
7.403IleGlu: 7.403 ± 0.961
3.966IlePhe: 3.966 ± 0.586
5.024IleGly: 5.024 ± 1.336
0.529IleHis: 0.529 ± 0.238
4.892IleIle: 4.892 ± 0.845
8.197IleLys: 8.197 ± 1.048
5.949IleLeu: 5.949 ± 0.983
0.925IleMet: 0.925 ± 0.276
6.742IleAsn: 6.742 ± 0.861
3.173IlePro: 3.173 ± 0.581
2.909IleGln: 2.909 ± 0.566
2.776IleArg: 2.776 ± 0.589
5.42IleSer: 5.42 ± 0.721
3.702IleThr: 3.702 ± 0.68
3.437IleVal: 3.437 ± 0.553
0.529IleTrp: 0.529 ± 0.298
2.38IleTyr: 2.38 ± 0.432
0.0IleXaa: 0.0 ± 0.0
Lys
7.668LysAla: 7.668 ± 1.5
0.661LysCys: 0.661 ± 0.317
5.288LysAsp: 5.288 ± 0.745
6.61LysGlu: 6.61 ± 1.11
2.909LysPhe: 2.909 ± 0.55
4.759LysGly: 4.759 ± 0.934
1.322LysHis: 1.322 ± 0.419
7.536LysIle: 7.536 ± 0.919
8.197LysLys: 8.197 ± 1.228
4.098LysLeu: 4.098 ± 0.594
1.322LysMet: 1.322 ± 0.328
5.553LysAsn: 5.553 ± 0.666
2.512LysPro: 2.512 ± 0.634
3.305LysGln: 3.305 ± 0.86
3.57LysArg: 3.57 ± 0.76
3.834LysSer: 3.834 ± 0.84
6.081LysThr: 6.081 ± 1.048
4.892LysVal: 4.892 ± 0.823
0.925LysTrp: 0.925 ± 0.25
2.644LysTyr: 2.644 ± 0.457
0.0LysXaa: 0.0 ± 0.0
Leu
3.834LeuAla: 3.834 ± 0.855
0.397LeuCys: 0.397 ± 0.212
4.892LeuAsp: 4.892 ± 0.84
6.081LeuGlu: 6.081 ± 0.919
3.702LeuPhe: 3.702 ± 0.627
5.156LeuGly: 5.156 ± 0.747
1.19LeuHis: 1.19 ± 0.31
5.685LeuIle: 5.685 ± 0.817
7.271LeuLys: 7.271 ± 1.055
6.478LeuLeu: 6.478 ± 0.876
1.058LeuMet: 1.058 ± 0.345
5.553LeuAsn: 5.553 ± 0.91
3.834LeuPro: 3.834 ± 0.72
5.156LeuGln: 5.156 ± 0.692
3.041LeuArg: 3.041 ± 0.687
5.685LeuSer: 5.685 ± 0.511
5.949LeuThr: 5.949 ± 0.95
4.098LeuVal: 4.098 ± 0.486
0.793LeuTrp: 0.793 ± 0.338
3.173LeuTyr: 3.173 ± 0.643
0.0LeuXaa: 0.0 ± 0.0
Met
1.454MetAla: 1.454 ± 0.378
0.132MetCys: 0.132 ± 0.148
0.925MetAsp: 0.925 ± 0.312
0.661MetGlu: 0.661 ± 0.313
0.397MetPhe: 0.397 ± 0.195
1.058MetGly: 1.058 ± 0.401
0.0MetHis: 0.0 ± 0.0
0.793MetIle: 0.793 ± 0.377
1.586MetLys: 1.586 ± 0.443
1.851MetLeu: 1.851 ± 0.558
0.132MetMet: 0.132 ± 0.133
0.793MetAsn: 0.793 ± 0.304
0.397MetPro: 0.397 ± 0.2
0.793MetGln: 0.793 ± 0.262
0.793MetArg: 0.793 ± 0.338
1.851MetSer: 1.851 ± 0.591
0.793MetThr: 0.793 ± 0.314
0.925MetVal: 0.925 ± 0.415
0.0MetTrp: 0.0 ± 0.0
0.264MetTyr: 0.264 ± 0.168
0.0MetXaa: 0.0 ± 0.0
Asn
4.495AsnAla: 4.495 ± 0.741
0.661AsnCys: 0.661 ± 0.247
5.024AsnAsp: 5.024 ± 0.765
3.173AsnGlu: 3.173 ± 0.753
4.892AsnPhe: 4.892 ± 0.92
4.627AsnGly: 4.627 ± 0.792
0.793AsnHis: 0.793 ± 0.276
5.817AsnIle: 5.817 ± 0.674
4.231AsnLys: 4.231 ± 0.601
4.892AsnLeu: 4.892 ± 0.766
1.454AsnMet: 1.454 ± 0.471
5.156AsnAsn: 5.156 ± 1.029
2.644AsnPro: 2.644 ± 0.593
1.586AsnGln: 1.586 ± 0.497
2.776AsnArg: 2.776 ± 0.605
3.305AsnSer: 3.305 ± 0.515
3.437AsnThr: 3.437 ± 0.531
5.553AsnVal: 5.553 ± 0.932
0.793AsnTrp: 0.793 ± 0.226
2.909AsnTyr: 2.909 ± 0.823
0.0AsnXaa: 0.0 ± 0.0
Pro
2.115ProAla: 2.115 ± 0.552
0.397ProCys: 0.397 ± 0.209
1.983ProAsp: 1.983 ± 0.492
2.115ProGlu: 2.115 ± 0.493
1.058ProPhe: 1.058 ± 0.391
0.0ProGly: 0.0 ± 0.0
0.397ProHis: 0.397 ± 0.215
2.38ProIle: 2.38 ± 0.467
3.041ProLys: 3.041 ± 0.606
1.719ProLeu: 1.719 ± 0.475
0.397ProMet: 0.397 ± 0.203
2.909ProAsn: 2.909 ± 0.626
0.661ProPro: 0.661 ± 0.316
0.661ProGln: 0.661 ± 0.238
1.454ProArg: 1.454 ± 0.466
1.851ProSer: 1.851 ± 0.539
1.851ProThr: 1.851 ± 0.483
2.38ProVal: 2.38 ± 0.551
0.397ProTrp: 0.397 ± 0.23
1.19ProTyr: 1.19 ± 0.396
0.0ProXaa: 0.0 ± 0.0
Gln
2.115GlnAla: 2.115 ± 0.658
0.529GlnCys: 0.529 ± 0.255
2.38GlnAsp: 2.38 ± 0.402
1.851GlnGlu: 1.851 ± 0.447
1.586GlnPhe: 1.586 ± 0.429
1.983GlnGly: 1.983 ± 0.403
0.397GlnHis: 0.397 ± 0.193
3.041GlnIle: 3.041 ± 0.592
1.851GlnLys: 1.851 ± 0.449
2.512GlnLeu: 2.512 ± 0.451
0.925GlnMet: 0.925 ± 0.284
4.627GlnAsn: 4.627 ± 0.858
0.793GlnPro: 0.793 ± 0.275
1.19GlnGln: 1.19 ± 0.328
2.512GlnArg: 2.512 ± 0.58
2.247GlnSer: 2.247 ± 0.613
2.909GlnThr: 2.909 ± 0.605
1.454GlnVal: 1.454 ± 0.589
0.397GlnTrp: 0.397 ± 0.19
1.454GlnTyr: 1.454 ± 0.408
0.0GlnXaa: 0.0 ± 0.0
Arg
2.38ArgAla: 2.38 ± 0.461
0.529ArgCys: 0.529 ± 0.271
2.776ArgAsp: 2.776 ± 0.704
2.512ArgGlu: 2.512 ± 0.612
1.983ArgPhe: 1.983 ± 0.417
1.851ArgGly: 1.851 ± 0.566
0.264ArgHis: 0.264 ± 0.212
2.38ArgIle: 2.38 ± 0.503
2.512ArgLys: 2.512 ± 0.611
3.041ArgLeu: 3.041 ± 0.864
0.397ArgMet: 0.397 ± 0.263
2.247ArgAsn: 2.247 ± 0.622
1.454ArgPro: 1.454 ± 0.413
1.851ArgGln: 1.851 ± 0.486
1.454ArgArg: 1.454 ± 0.408
0.793ArgSer: 0.793 ± 0.297
1.983ArgThr: 1.983 ± 0.464
2.115ArgVal: 2.115 ± 0.528
0.264ArgTrp: 0.264 ± 0.169
1.586ArgTyr: 1.586 ± 0.49
0.0ArgXaa: 0.0 ± 0.0
Ser
4.892SerAla: 4.892 ± 0.765
0.529SerCys: 0.529 ± 0.276
2.776SerAsp: 2.776 ± 0.612
3.702SerGlu: 3.702 ± 0.631
3.437SerPhe: 3.437 ± 0.743
3.966SerGly: 3.966 ± 0.573
0.264SerHis: 0.264 ± 0.179
3.437SerIle: 3.437 ± 0.769
4.363SerLys: 4.363 ± 0.569
5.949SerLeu: 5.949 ± 0.756
0.925SerMet: 0.925 ± 0.337
3.305SerAsn: 3.305 ± 0.655
1.851SerPro: 1.851 ± 0.464
2.512SerGln: 2.512 ± 0.485
1.851SerArg: 1.851 ± 0.491
3.173SerSer: 3.173 ± 0.887
3.57SerThr: 3.57 ± 0.511
5.156SerVal: 5.156 ± 0.892
0.925SerTrp: 0.925 ± 0.309
1.851SerTyr: 1.851 ± 0.554
0.0SerXaa: 0.0 ± 0.0
Thr
5.949ThrAla: 5.949 ± 1.194
0.397ThrCys: 0.397 ± 0.211
4.363ThrAsp: 4.363 ± 0.788
3.57ThrGlu: 3.57 ± 0.583
2.115ThrPhe: 2.115 ± 0.454
4.231ThrGly: 4.231 ± 0.57
0.264ThrHis: 0.264 ± 0.167
5.553ThrIle: 5.553 ± 1.015
4.892ThrLys: 4.892 ± 1.073
4.759ThrLeu: 4.759 ± 1.017
0.264ThrMet: 0.264 ± 0.179
3.966ThrAsn: 3.966 ± 0.712
1.719ThrPro: 1.719 ± 0.484
3.173ThrGln: 3.173 ± 0.469
1.719ThrArg: 1.719 ± 0.45
3.834ThrSer: 3.834 ± 0.888
2.776ThrThr: 2.776 ± 0.559
3.173ThrVal: 3.173 ± 0.677
0.264ThrTrp: 0.264 ± 0.201
2.909ThrTyr: 2.909 ± 0.501
0.0ThrXaa: 0.0 ± 0.0
Val
3.57ValAla: 3.57 ± 0.693
0.264ValCys: 0.264 ± 0.194
2.115ValAsp: 2.115 ± 0.438
3.834ValGlu: 3.834 ± 0.675
3.966ValPhe: 3.966 ± 0.864
4.495ValGly: 4.495 ± 0.772
1.058ValHis: 1.058 ± 0.396
7.007ValIle: 7.007 ± 0.924
6.081ValLys: 6.081 ± 0.835
5.156ValLeu: 5.156 ± 1.02
0.661ValMet: 0.661 ± 0.294
4.231ValAsn: 4.231 ± 0.724
1.851ValPro: 1.851 ± 0.502
2.38ValGln: 2.38 ± 0.541
2.512ValArg: 2.512 ± 0.557
4.627ValSer: 4.627 ± 0.768
5.156ValThr: 5.156 ± 0.843
3.57ValVal: 3.57 ± 0.67
0.925ValTrp: 0.925 ± 0.374
2.115ValTyr: 2.115 ± 0.446
0.0ValXaa: 0.0 ± 0.0
Trp
0.661TrpAla: 0.661 ± 0.226
0.132TrpCys: 0.132 ± 0.121
1.058TrpAsp: 1.058 ± 0.378
0.0TrpGlu: 0.0 ± 0.0
0.529TrpPhe: 0.529 ± 0.241
0.397TrpGly: 0.397 ± 0.187
0.0TrpHis: 0.0 ± 0.0
0.661TrpIle: 0.661 ± 0.24
0.793TrpLys: 0.793 ± 0.287
1.851TrpLeu: 1.851 ± 0.426
0.0TrpMet: 0.0 ± 0.0
0.661TrpAsn: 0.661 ± 0.393
0.0TrpPro: 0.0 ± 0.0
0.529TrpGln: 0.529 ± 0.27
0.397TrpArg: 0.397 ± 0.198
0.793TrpSer: 0.793 ± 0.367
0.661TrpThr: 0.661 ± 0.314
1.322TrpVal: 1.322 ± 0.282
0.0TrpTrp: 0.0 ± 0.0
0.132TrpTyr: 0.132 ± 0.111
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.983TyrAla: 1.983 ± 0.425
0.397TyrCys: 0.397 ± 0.263
2.115TyrAsp: 2.115 ± 0.452
2.512TyrGlu: 2.512 ± 0.463
2.38TyrPhe: 2.38 ± 0.654
2.512TyrGly: 2.512 ± 0.706
0.397TyrHis: 0.397 ± 0.198
2.115TyrIle: 2.115 ± 0.596
3.702TyrLys: 3.702 ± 0.594
3.834TyrLeu: 3.834 ± 0.828
0.661TyrMet: 0.661 ± 0.246
3.173TyrAsn: 3.173 ± 0.492
1.19TyrPro: 1.19 ± 0.357
0.529TyrGln: 0.529 ± 0.249
1.058TyrArg: 1.058 ± 0.335
2.115TyrSer: 2.115 ± 0.511
2.776TyrThr: 2.776 ± 0.556
1.586TyrVal: 1.586 ± 0.384
0.529TyrTrp: 0.529 ± 0.257
0.793TyrTyr: 0.793 ± 0.399
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 27 proteins (7565 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski