Amino acid dipepetide frequency for Enterococcus phage phiFL2A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.128AlaAla: 4.128 ± 0.79
0.188AlaCys: 0.188 ± 0.156
3.472AlaAsp: 3.472 ± 0.631
4.597AlaGlu: 4.597 ± 0.815
2.815AlaPhe: 2.815 ± 0.498
3.284AlaGly: 3.284 ± 0.615
1.032AlaHis: 1.032 ± 0.322
5.442AlaIle: 5.442 ± 0.899
5.536AlaLys: 5.536 ± 0.705
6.662AlaLeu: 6.662 ± 0.864
1.501AlaMet: 1.501 ± 0.312
3.753AlaAsn: 3.753 ± 0.633
1.314AlaPro: 1.314 ± 0.297
2.158AlaGln: 2.158 ± 0.42
2.064AlaArg: 2.064 ± 0.466
3.941AlaSer: 3.941 ± 0.716
5.723AlaThr: 5.723 ± 0.881
4.316AlaVal: 4.316 ± 0.636
0.469AlaTrp: 0.469 ± 0.185
3.472AlaTyr: 3.472 ± 0.526
0.0AlaXaa: 0.0 ± 0.0
Cys
0.188CysAla: 0.188 ± 0.126
0.094CysCys: 0.094 ± 0.102
0.094CysAsp: 0.094 ± 0.091
0.188CysGlu: 0.188 ± 0.116
0.094CysPhe: 0.094 ± 0.102
0.375CysGly: 0.375 ± 0.231
0.188CysHis: 0.188 ± 0.135
0.375CysIle: 0.375 ± 0.179
0.657CysLys: 0.657 ± 0.249
0.563CysLeu: 0.563 ± 0.265
0.188CysMet: 0.188 ± 0.125
0.563CysAsn: 0.563 ± 0.191
0.188CysPro: 0.188 ± 0.123
0.281CysGln: 0.281 ± 0.165
0.281CysArg: 0.281 ± 0.161
0.375CysSer: 0.375 ± 0.22
0.188CysThr: 0.188 ± 0.133
0.563CysVal: 0.563 ± 0.222
0.0CysTrp: 0.0 ± 0.0
0.281CysTyr: 0.281 ± 0.163
0.0CysXaa: 0.0 ± 0.0
Asp
3.565AspAla: 3.565 ± 0.633
0.375AspCys: 0.375 ± 0.227
4.035AspAsp: 4.035 ± 0.693
6.005AspGlu: 6.005 ± 0.917
2.909AspPhe: 2.909 ± 0.593
4.691AspGly: 4.691 ± 0.588
0.469AspHis: 0.469 ± 0.241
3.565AspIle: 3.565 ± 0.443
5.067AspLys: 5.067 ± 0.526
5.723AspLeu: 5.723 ± 0.869
0.938AspMet: 0.938 ± 0.342
3.19AspAsn: 3.19 ± 0.578
1.126AspPro: 1.126 ± 0.284
1.501AspGln: 1.501 ± 0.42
2.252AspArg: 2.252 ± 0.563
3.096AspSer: 3.096 ± 0.716
3.002AspThr: 3.002 ± 0.535
3.378AspVal: 3.378 ± 0.446
1.126AspTrp: 1.126 ± 0.311
2.627AspTyr: 2.627 ± 0.365
0.0AspXaa: 0.0 ± 0.0
Glu
4.41GluAla: 4.41 ± 0.719
0.938GluCys: 0.938 ± 0.262
3.378GluAsp: 3.378 ± 0.681
5.723GluGlu: 5.723 ± 1.01
3.002GluPhe: 3.002 ± 0.506
4.035GluGly: 4.035 ± 0.675
1.22GluHis: 1.22 ± 0.292
6.286GluIle: 6.286 ± 0.697
6.474GluLys: 6.474 ± 1.019
7.6GluLeu: 7.6 ± 0.828
1.501GluMet: 1.501 ± 0.375
4.597GluAsn: 4.597 ± 0.745
2.815GluPro: 2.815 ± 0.498
3.565GluGln: 3.565 ± 0.51
3.941GluArg: 3.941 ± 0.758
2.533GluSer: 2.533 ± 0.431
4.597GluThr: 4.597 ± 0.603
5.723GluVal: 5.723 ± 0.751
1.314GluTrp: 1.314 ± 0.419
3.096GluTyr: 3.096 ± 0.537
0.0GluXaa: 0.0 ± 0.0
Phe
2.158PheAla: 2.158 ± 0.368
0.188PheCys: 0.188 ± 0.122
3.472PheAsp: 3.472 ± 0.648
3.659PheGlu: 3.659 ± 0.659
2.158PhePhe: 2.158 ± 0.492
2.439PheGly: 2.439 ± 0.405
0.375PheHis: 0.375 ± 0.173
2.439PheIle: 2.439 ± 0.486
3.002PheLys: 3.002 ± 0.439
2.627PheLeu: 2.627 ± 0.512
1.501PheMet: 1.501 ± 0.4
2.627PheAsn: 2.627 ± 0.369
0.751PhePro: 0.751 ± 0.263
1.407PheGln: 1.407 ± 0.372
1.501PheArg: 1.501 ± 0.382
2.439PheSer: 2.439 ± 0.488
2.439PheThr: 2.439 ± 0.424
2.815PheVal: 2.815 ± 0.695
0.563PheTrp: 0.563 ± 0.24
1.22PheTyr: 1.22 ± 0.342
0.0PheXaa: 0.0 ± 0.0
Gly
4.222GlyAla: 4.222 ± 0.692
0.094GlyCys: 0.094 ± 0.086
3.096GlyAsp: 3.096 ± 0.535
4.128GlyGlu: 4.128 ± 0.671
3.378GlyPhe: 3.378 ± 0.521
3.096GlyGly: 3.096 ± 0.531
0.563GlyHis: 0.563 ± 0.236
5.723GlyIle: 5.723 ± 0.79
5.63GlyLys: 5.63 ± 0.65
5.16GlyLeu: 5.16 ± 0.805
2.064GlyMet: 2.064 ± 0.602
4.035GlyAsn: 4.035 ± 0.624
0.657GlyPro: 0.657 ± 0.3
2.346GlyGln: 2.346 ± 0.475
2.064GlyArg: 2.064 ± 0.441
3.565GlySer: 3.565 ± 0.758
4.222GlyThr: 4.222 ± 0.777
3.659GlyVal: 3.659 ± 0.523
0.938GlyTrp: 0.938 ± 0.337
3.753GlyTyr: 3.753 ± 0.674
0.0GlyXaa: 0.0 ± 0.0
His
0.375HisAla: 0.375 ± 0.166
0.094HisCys: 0.094 ± 0.093
0.657HisAsp: 0.657 ± 0.274
0.657HisGlu: 0.657 ± 0.22
0.375HisPhe: 0.375 ± 0.16
0.657HisGly: 0.657 ± 0.217
0.281HisHis: 0.281 ± 0.197
0.751HisIle: 0.751 ± 0.276
0.844HisLys: 0.844 ± 0.263
1.314HisLeu: 1.314 ± 0.348
0.281HisMet: 0.281 ± 0.164
0.563HisAsn: 0.563 ± 0.206
0.281HisPro: 0.281 ± 0.149
0.563HisGln: 0.563 ± 0.246
0.844HisArg: 0.844 ± 0.292
0.657HisSer: 0.657 ± 0.232
0.469HisThr: 0.469 ± 0.173
1.22HisVal: 1.22 ± 0.368
0.094HisTrp: 0.094 ± 0.102
1.126HisTyr: 1.126 ± 0.326
0.0HisXaa: 0.0 ± 0.0
Ile
4.973IleAla: 4.973 ± 1.057
0.375IleCys: 0.375 ± 0.194
5.16IleAsp: 5.16 ± 0.677
5.254IleGlu: 5.254 ± 0.771
2.533IlePhe: 2.533 ± 0.584
4.41IleGly: 4.41 ± 0.732
0.938IleHis: 0.938 ± 0.289
3.659IleIle: 3.659 ± 0.628
6.849IleLys: 6.849 ± 0.601
6.38IleLeu: 6.38 ± 0.701
1.501IleMet: 1.501 ± 0.397
4.316IleAsn: 4.316 ± 0.698
2.346IlePro: 2.346 ± 0.428
2.815IleGln: 2.815 ± 0.507
1.595IleArg: 1.595 ± 0.462
5.067IleSer: 5.067 ± 0.881
3.847IleThr: 3.847 ± 0.681
4.41IleVal: 4.41 ± 0.579
0.844IleTrp: 0.844 ± 0.237
2.815IleTyr: 2.815 ± 0.517
0.0IleXaa: 0.0 ± 0.0
Lys
5.817LysAla: 5.817 ± 0.781
0.281LysCys: 0.281 ± 0.168
3.941LysAsp: 3.941 ± 0.635
8.82LysGlu: 8.82 ± 0.979
2.533LysPhe: 2.533 ± 0.465
4.597LysGly: 4.597 ± 0.503
0.938LysHis: 0.938 ± 0.373
6.099LysIle: 6.099 ± 0.973
8.82LysLys: 8.82 ± 1.401
5.442LysLeu: 5.442 ± 0.644
3.096LysMet: 3.096 ± 0.506
5.911LysAsn: 5.911 ± 0.684
2.064LysPro: 2.064 ± 0.409
4.035LysGln: 4.035 ± 0.693
3.565LysArg: 3.565 ± 0.727
4.691LysSer: 4.691 ± 0.71
5.817LysThr: 5.817 ± 0.85
5.067LysVal: 5.067 ± 0.76
1.032LysTrp: 1.032 ± 0.39
3.941LysTyr: 3.941 ± 0.656
0.0LysXaa: 0.0 ± 0.0
Leu
6.286LeuAla: 6.286 ± 0.686
0.281LeuCys: 0.281 ± 0.139
5.63LeuAsp: 5.63 ± 0.741
6.474LeuGlu: 6.474 ± 0.84
3.378LeuPhe: 3.378 ± 0.66
5.254LeuGly: 5.254 ± 0.933
1.032LeuHis: 1.032 ± 0.297
4.222LeuIle: 4.222 ± 0.813
6.662LeuLys: 6.662 ± 0.749
6.755LeuLeu: 6.755 ± 0.97
2.064LeuMet: 2.064 ± 0.411
6.193LeuAsn: 6.193 ± 0.718
2.627LeuPro: 2.627 ± 0.588
3.096LeuGln: 3.096 ± 0.514
3.847LeuArg: 3.847 ± 0.629
6.849LeuSer: 6.849 ± 0.841
5.348LeuThr: 5.348 ± 0.586
5.536LeuVal: 5.536 ± 0.607
0.657LeuTrp: 0.657 ± 0.242
2.627LeuTyr: 2.627 ± 0.484
0.0LeuXaa: 0.0 ± 0.0
Met
2.627MetAla: 2.627 ± 0.441
0.094MetCys: 0.094 ± 0.117
1.877MetAsp: 1.877 ± 0.472
1.032MetGlu: 1.032 ± 0.306
0.563MetPhe: 0.563 ± 0.203
1.501MetGly: 1.501 ± 0.452
0.188MetHis: 0.188 ± 0.142
1.501MetIle: 1.501 ± 0.345
2.627MetLys: 2.627 ± 0.517
1.97MetLeu: 1.97 ± 0.434
0.188MetMet: 0.188 ± 0.15
3.19MetAsn: 3.19 ± 0.855
0.844MetPro: 0.844 ± 0.221
1.501MetGln: 1.501 ± 0.334
1.407MetArg: 1.407 ± 0.416
2.158MetSer: 2.158 ± 0.388
1.126MetThr: 1.126 ± 0.29
1.126MetVal: 1.126 ± 0.265
0.0MetTrp: 0.0 ± 0.0
0.375MetTyr: 0.375 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
4.035AsnAla: 4.035 ± 0.653
0.281AsnCys: 0.281 ± 0.136
2.909AsnAsp: 2.909 ± 0.743
5.254AsnGlu: 5.254 ± 0.916
1.314AsnPhe: 1.314 ± 0.48
5.348AsnGly: 5.348 ± 0.754
1.032AsnHis: 1.032 ± 0.266
5.723AsnIle: 5.723 ± 0.779
5.16AsnLys: 5.16 ± 0.72
4.597AsnLeu: 4.597 ± 0.984
1.314AsnMet: 1.314 ± 0.244
3.941AsnAsn: 3.941 ± 0.78
2.158AsnPro: 2.158 ± 0.421
3.284AsnGln: 3.284 ± 0.605
2.815AsnArg: 2.815 ± 0.46
5.067AsnSer: 5.067 ± 1.305
3.096AsnThr: 3.096 ± 0.501
5.16AsnVal: 5.16 ± 0.885
0.281AsnTrp: 0.281 ± 0.153
2.439AsnTyr: 2.439 ± 0.393
0.0AsnXaa: 0.0 ± 0.0
Pro
2.158ProAla: 2.158 ± 0.449
0.281ProCys: 0.281 ± 0.142
2.064ProAsp: 2.064 ± 0.48
3.659ProGlu: 3.659 ± 0.606
1.783ProPhe: 1.783 ± 0.468
1.126ProGly: 1.126 ± 0.323
0.188ProHis: 0.188 ± 0.128
1.501ProIle: 1.501 ± 0.334
1.97ProLys: 1.97 ± 0.391
1.595ProLeu: 1.595 ± 0.358
0.938ProMet: 0.938 ± 0.275
2.252ProAsn: 2.252 ± 0.411
0.657ProPro: 0.657 ± 0.245
1.314ProGln: 1.314 ± 0.354
0.657ProArg: 0.657 ± 0.284
1.595ProSer: 1.595 ± 0.291
1.407ProThr: 1.407 ± 0.427
1.126ProVal: 1.126 ± 0.287
0.188ProTrp: 0.188 ± 0.111
0.751ProTyr: 0.751 ± 0.288
0.0ProXaa: 0.0 ± 0.0
Gln
3.847GlnAla: 3.847 ± 0.613
0.094GlnCys: 0.094 ± 0.086
1.97GlnAsp: 1.97 ± 0.462
2.346GlnGlu: 2.346 ± 0.508
1.501GlnPhe: 1.501 ± 0.358
1.877GlnGly: 1.877 ± 0.483
0.281GlnHis: 0.281 ± 0.152
3.472GlnIle: 3.472 ± 0.699
3.659GlnLys: 3.659 ± 0.551
4.316GlnLeu: 4.316 ± 0.716
1.314GlnMet: 1.314 ± 0.351
3.19GlnAsn: 3.19 ± 0.58
1.032GlnPro: 1.032 ± 0.3
2.346GlnGln: 2.346 ± 0.431
1.501GlnArg: 1.501 ± 0.367
2.815GlnSer: 2.815 ± 0.44
2.439GlnThr: 2.439 ± 0.406
2.627GlnVal: 2.627 ± 0.506
0.094GlnTrp: 0.094 ± 0.084
1.877GlnTyr: 1.877 ± 0.462
0.0GlnXaa: 0.0 ± 0.0
Arg
2.158ArgAla: 2.158 ± 0.438
0.281ArgCys: 0.281 ± 0.164
2.252ArgAsp: 2.252 ± 0.615
2.439ArgGlu: 2.439 ± 0.557
1.689ArgPhe: 1.689 ± 0.403
2.064ArgGly: 2.064 ± 0.446
0.657ArgHis: 0.657 ± 0.306
3.284ArgIle: 3.284 ± 0.474
3.378ArgLys: 3.378 ± 0.648
3.565ArgLeu: 3.565 ± 0.764
1.877ArgMet: 1.877 ± 0.356
2.252ArgAsn: 2.252 ± 0.473
1.126ArgPro: 1.126 ± 0.359
2.064ArgGln: 2.064 ± 0.492
1.783ArgArg: 1.783 ± 0.435
1.595ArgSer: 1.595 ± 0.412
1.689ArgThr: 1.689 ± 0.371
2.439ArgVal: 2.439 ± 0.565
0.188ArgTrp: 0.188 ± 0.12
1.407ArgTyr: 1.407 ± 0.351
0.0ArgXaa: 0.0 ± 0.0
Ser
4.41SerAla: 4.41 ± 0.881
0.563SerCys: 0.563 ± 0.22
3.659SerAsp: 3.659 ± 0.732
3.659SerGlu: 3.659 ± 0.528
2.533SerPhe: 2.533 ± 0.498
4.504SerGly: 4.504 ± 0.602
0.281SerHis: 0.281 ± 0.17
5.067SerIle: 5.067 ± 0.627
5.348SerLys: 5.348 ± 0.716
5.067SerLeu: 5.067 ± 0.897
2.158SerMet: 2.158 ± 0.6
4.035SerAsn: 4.035 ± 1.445
1.407SerPro: 1.407 ± 0.377
3.002SerGln: 3.002 ± 0.501
2.252SerArg: 2.252 ± 0.444
4.973SerSer: 4.973 ± 1.083
4.879SerThr: 4.879 ± 0.592
2.909SerVal: 2.909 ± 0.522
0.657SerTrp: 0.657 ± 0.224
1.97SerTyr: 1.97 ± 0.413
0.0SerXaa: 0.0 ± 0.0
Thr
3.378ThrAla: 3.378 ± 0.452
0.375ThrCys: 0.375 ± 0.285
5.067ThrAsp: 5.067 ± 0.715
4.222ThrGlu: 4.222 ± 0.646
2.627ThrPhe: 2.627 ± 0.485
4.879ThrGly: 4.879 ± 0.56
0.563ThrHis: 0.563 ± 0.241
3.378ThrIle: 3.378 ± 0.484
5.63ThrLys: 5.63 ± 0.725
5.442ThrLeu: 5.442 ± 0.581
1.407ThrMet: 1.407 ± 0.323
3.941ThrAsn: 3.941 ± 0.595
1.877ThrPro: 1.877 ± 0.424
2.439ThrGln: 2.439 ± 0.464
1.314ThrArg: 1.314 ± 0.313
4.128ThrSer: 4.128 ± 0.757
6.193ThrThr: 6.193 ± 1.849
3.565ThrVal: 3.565 ± 0.785
1.22ThrTrp: 1.22 ± 0.361
2.158ThrTyr: 2.158 ± 0.464
0.0ThrXaa: 0.0 ± 0.0
Val
4.316ValAla: 4.316 ± 0.745
0.375ValCys: 0.375 ± 0.149
3.753ValAsp: 3.753 ± 0.517
4.973ValGlu: 4.973 ± 0.73
2.439ValPhe: 2.439 ± 0.485
4.504ValGly: 4.504 ± 0.668
0.469ValHis: 0.469 ± 0.188
4.504ValIle: 4.504 ± 0.622
5.067ValLys: 5.067 ± 0.655
4.879ValLeu: 4.879 ± 0.811
0.938ValMet: 0.938 ± 0.233
4.035ValAsn: 4.035 ± 0.818
2.064ValPro: 2.064 ± 0.409
2.533ValGln: 2.533 ± 0.443
2.346ValArg: 2.346 ± 0.526
4.597ValSer: 4.597 ± 0.71
4.128ValThr: 4.128 ± 0.733
3.472ValVal: 3.472 ± 0.483
0.281ValTrp: 0.281 ± 0.165
1.783ValTyr: 1.783 ± 0.493
0.0ValXaa: 0.0 ± 0.0
Trp
0.563TrpAla: 0.563 ± 0.267
0.0TrpCys: 0.0 ± 0.0
0.751TrpAsp: 0.751 ± 0.286
0.751TrpGlu: 0.751 ± 0.271
0.375TrpPhe: 0.375 ± 0.186
0.751TrpGly: 0.751 ± 0.253
0.281TrpHis: 0.281 ± 0.199
0.469TrpIle: 0.469 ± 0.223
0.938TrpLys: 0.938 ± 0.42
1.595TrpLeu: 1.595 ± 0.492
0.188TrpMet: 0.188 ± 0.12
0.657TrpAsn: 0.657 ± 0.241
0.188TrpPro: 0.188 ± 0.131
0.657TrpGln: 0.657 ± 0.213
0.844TrpArg: 0.844 ± 0.242
0.375TrpSer: 0.375 ± 0.163
0.563TrpThr: 0.563 ± 0.25
0.375TrpVal: 0.375 ± 0.141
0.0TrpTrp: 0.0 ± 0.0
0.188TrpTyr: 0.188 ± 0.119
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.252TyrAla: 2.252 ± 0.412
0.469TyrCys: 0.469 ± 0.215
1.407TyrAsp: 1.407 ± 0.364
2.909TyrGlu: 2.909 ± 0.457
1.783TyrPhe: 1.783 ± 0.347
3.002TyrGly: 3.002 ± 0.614
1.126TyrHis: 1.126 ± 0.317
2.627TyrIle: 2.627 ± 0.459
3.284TyrLys: 3.284 ± 0.52
3.565TyrLeu: 3.565 ± 0.598
0.844TyrMet: 0.844 ± 0.309
1.97TyrAsn: 1.97 ± 0.49
1.689TyrPro: 1.689 ± 0.622
1.689TyrGln: 1.689 ± 0.497
1.314TyrArg: 1.314 ± 0.414
2.815TyrSer: 2.815 ± 0.469
2.627TyrThr: 2.627 ± 0.523
1.97TyrVal: 1.97 ± 0.468
0.563TyrTrp: 0.563 ± 0.214
1.22TyrTyr: 1.22 ± 0.383
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (10659 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski