Amino acid dipepetide frequency for Lactococcus phage P118

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.248AlaAla: 6.248 ± 1.724
0.175AlaCys: 0.175 ± 0.091
3.095AlaAsp: 3.095 ± 0.564
2.628AlaGlu: 2.628 ± 0.47
2.277AlaPhe: 2.277 ± 0.39
5.08AlaGly: 5.08 ± 1.482
0.876AlaHis: 0.876 ± 0.251
5.197AlaIle: 5.197 ± 0.982
5.314AlaLys: 5.314 ± 0.609
5.372AlaLeu: 5.372 ± 1.033
2.628AlaMet: 2.628 ± 0.637
3.27AlaAsn: 3.27 ± 0.429
1.927AlaPro: 1.927 ± 0.38
1.927AlaGln: 1.927 ± 0.404
2.452AlaArg: 2.452 ± 0.467
4.204AlaSer: 4.204 ± 0.687
4.263AlaThr: 4.263 ± 0.664
4.379AlaVal: 4.379 ± 0.522
0.876AlaTrp: 0.876 ± 0.238
2.686AlaTyr: 2.686 ± 0.435
0.0AlaXaa: 0.0 ± 0.0
Cys
0.175CysAla: 0.175 ± 0.088
0.058CysCys: 0.058 ± 0.066
0.292CysAsp: 0.292 ± 0.139
0.409CysGlu: 0.409 ± 0.155
0.175CysPhe: 0.175 ± 0.11
0.35CysGly: 0.35 ± 0.16
0.058CysHis: 0.058 ± 0.067
0.701CysIle: 0.701 ± 0.248
0.467CysLys: 0.467 ± 0.15
0.35CysLeu: 0.35 ± 0.128
0.058CysMet: 0.058 ± 0.049
0.117CysAsn: 0.117 ± 0.081
0.292CysPro: 0.292 ± 0.124
0.058CysGln: 0.058 ± 0.063
0.234CysArg: 0.234 ± 0.108
0.234CysSer: 0.234 ± 0.115
0.175CysThr: 0.175 ± 0.107
0.292CysVal: 0.292 ± 0.119
0.058CysTrp: 0.058 ± 0.053
0.526CysTyr: 0.526 ± 0.185
0.0CysXaa: 0.0 ± 0.0
Asp
3.503AspAla: 3.503 ± 0.395
0.234AspCys: 0.234 ± 0.104
4.438AspAsp: 4.438 ± 0.7
4.438AspGlu: 4.438 ± 0.552
2.978AspPhe: 2.978 ± 0.403
4.671AspGly: 4.671 ± 0.589
0.817AspHis: 0.817 ± 0.215
5.314AspIle: 5.314 ± 0.412
5.197AspLys: 5.197 ± 0.721
5.956AspLeu: 5.956 ± 0.66
2.102AspMet: 2.102 ± 0.416
4.671AspAsn: 4.671 ± 0.462
2.219AspPro: 2.219 ± 0.403
1.927AspGln: 1.927 ± 0.347
1.693AspArg: 1.693 ± 0.325
4.146AspSer: 4.146 ± 0.512
4.554AspThr: 4.554 ± 0.704
4.087AspVal: 4.087 ± 0.409
0.934AspTrp: 0.934 ± 0.174
3.503AspTyr: 3.503 ± 0.614
0.0AspXaa: 0.0 ± 0.0
Glu
3.971GluAla: 3.971 ± 0.504
0.409GluCys: 0.409 ± 0.147
4.846GluAsp: 4.846 ± 0.549
5.956GluGlu: 5.956 ± 0.972
1.869GluPhe: 1.869 ± 0.332
4.087GluGly: 4.087 ± 0.469
1.401GluHis: 1.401 ± 0.35
3.854GluIle: 3.854 ± 0.554
4.087GluLys: 4.087 ± 0.581
4.379GluLeu: 4.379 ± 0.528
1.81GluMet: 1.81 ± 0.368
3.387GluAsn: 3.387 ± 0.584
1.46GluPro: 1.46 ± 0.276
1.985GluGln: 1.985 ± 0.349
2.628GluArg: 2.628 ± 0.484
3.211GluSer: 3.211 ± 0.444
3.562GluThr: 3.562 ± 0.393
4.671GluVal: 4.671 ± 0.577
1.051GluTrp: 1.051 ± 0.241
3.387GluTyr: 3.387 ± 0.574
0.0GluXaa: 0.0 ± 0.0
Phe
1.927PheAla: 1.927 ± 0.313
0.292PheCys: 0.292 ± 0.132
3.795PheAsp: 3.795 ± 0.453
2.044PheGlu: 2.044 ± 0.403
1.518PhePhe: 1.518 ± 0.277
3.328PheGly: 3.328 ± 0.447
0.584PheHis: 0.584 ± 0.237
2.452PheIle: 2.452 ± 0.327
3.679PheLys: 3.679 ± 0.509
2.336PheLeu: 2.336 ± 0.502
0.701PheMet: 0.701 ± 0.179
3.27PheAsn: 3.27 ± 0.47
0.993PhePro: 0.993 ± 0.226
0.642PheGln: 0.642 ± 0.182
1.343PheArg: 1.343 ± 0.26
1.81PheSer: 1.81 ± 0.358
2.978PheThr: 2.978 ± 0.505
1.927PheVal: 1.927 ± 0.354
0.35PheTrp: 0.35 ± 0.115
1.518PheTyr: 1.518 ± 0.293
0.0PheXaa: 0.0 ± 0.0
Gly
4.438GlyAla: 4.438 ± 1.183
0.234GlyCys: 0.234 ± 0.115
3.62GlyAsp: 3.62 ± 0.558
3.328GlyGlu: 3.328 ± 0.439
2.744GlyPhe: 2.744 ± 0.513
3.445GlyGly: 3.445 ± 0.553
1.168GlyHis: 1.168 ± 0.247
5.255GlyIle: 5.255 ± 0.859
4.846GlyLys: 4.846 ± 0.705
5.372GlyLeu: 5.372 ± 1.229
1.752GlyMet: 1.752 ± 0.351
4.087GlyAsn: 4.087 ± 0.53
1.401GlyPro: 1.401 ± 0.216
1.927GlyGln: 1.927 ± 0.441
2.744GlyArg: 2.744 ± 0.446
5.022GlySer: 5.022 ± 0.772
4.73GlyThr: 4.73 ± 0.657
4.554GlyVal: 4.554 ± 0.533
1.051GlyTrp: 1.051 ± 0.228
3.679GlyTyr: 3.679 ± 0.524
0.0GlyXaa: 0.0 ± 0.0
His
0.526HisAla: 0.526 ± 0.134
0.117HisCys: 0.117 ± 0.087
1.168HisAsp: 1.168 ± 0.262
0.817HisGlu: 0.817 ± 0.234
0.35HisPhe: 0.35 ± 0.165
1.46HisGly: 1.46 ± 0.287
0.292HisHis: 0.292 ± 0.138
1.168HisIle: 1.168 ± 0.276
1.343HisLys: 1.343 ± 0.279
1.109HisLeu: 1.109 ± 0.332
0.526HisMet: 0.526 ± 0.179
1.168HisAsn: 1.168 ± 0.32
0.642HisPro: 0.642 ± 0.192
0.409HisGln: 0.409 ± 0.143
0.526HisArg: 0.526 ± 0.176
1.285HisSer: 1.285 ± 0.296
1.168HisThr: 1.168 ± 0.323
1.109HisVal: 1.109 ± 0.284
0.058HisTrp: 0.058 ± 0.053
0.993HisTyr: 0.993 ± 0.256
0.0HisXaa: 0.0 ± 0.0
Ile
4.788IleAla: 4.788 ± 0.631
0.292IleCys: 0.292 ± 0.126
6.598IleAsp: 6.598 ± 0.471
4.379IleGlu: 4.379 ± 0.541
1.577IlePhe: 1.577 ± 0.26
4.73IleGly: 4.73 ± 1.15
1.46IleHis: 1.46 ± 0.281
5.781IleIle: 5.781 ± 0.637
6.657IleLys: 6.657 ± 0.507
5.197IleLeu: 5.197 ± 0.836
1.985IleMet: 1.985 ± 0.361
3.971IleAsn: 3.971 ± 0.484
2.686IlePro: 2.686 ± 0.385
2.511IleGln: 2.511 ± 0.357
2.219IleArg: 2.219 ± 0.33
5.43IleSer: 5.43 ± 0.635
5.314IleThr: 5.314 ± 0.68
5.255IleVal: 5.255 ± 0.543
0.642IleTrp: 0.642 ± 0.287
2.628IleTyr: 2.628 ± 0.403
0.0IleXaa: 0.0 ± 0.0
Lys
4.496LysAla: 4.496 ± 0.484
0.35LysCys: 0.35 ± 0.17
5.547LysAsp: 5.547 ± 0.824
6.598LysGlu: 6.598 ± 0.932
3.679LysPhe: 3.679 ± 0.604
4.613LysGly: 4.613 ± 0.664
1.693LysHis: 1.693 ± 0.405
5.781LysIle: 5.781 ± 0.586
5.722LysLys: 5.722 ± 0.859
7.065LysLeu: 7.065 ± 0.525
3.153LysMet: 3.153 ± 0.387
3.62LysAsn: 3.62 ± 0.41
2.569LysPro: 2.569 ± 0.521
2.861LysGln: 2.861 ± 0.498
3.27LysArg: 3.27 ± 0.544
4.204LysSer: 4.204 ± 0.535
6.014LysThr: 6.014 ± 0.461
4.613LysVal: 4.613 ± 0.475
1.226LysTrp: 1.226 ± 0.29
3.328LysTyr: 3.328 ± 0.504
0.0LysXaa: 0.0 ± 0.0
Leu
6.423LeuAla: 6.423 ± 1.342
0.409LeuCys: 0.409 ± 0.169
4.438LeuAsp: 4.438 ± 0.546
4.73LeuGlu: 4.73 ± 0.565
2.803LeuPhe: 2.803 ± 0.434
5.43LeuGly: 5.43 ± 0.946
0.876LeuHis: 0.876 ± 0.223
5.43LeuIle: 5.43 ± 0.982
6.832LeuLys: 6.832 ± 0.896
5.547LeuLeu: 5.547 ± 0.644
1.693LeuMet: 1.693 ± 0.301
4.496LeuAsn: 4.496 ± 0.459
2.16LeuPro: 2.16 ± 0.322
3.095LeuGln: 3.095 ± 0.453
3.328LeuArg: 3.328 ± 0.463
6.89LeuSer: 6.89 ± 0.762
5.314LeuThr: 5.314 ± 0.504
5.022LeuVal: 5.022 ± 0.572
0.642LeuTrp: 0.642 ± 0.217
3.153LeuTyr: 3.153 ± 0.648
0.0LeuXaa: 0.0 ± 0.0
Met
2.336MetAla: 2.336 ± 0.599
0.175MetCys: 0.175 ± 0.099
2.044MetAsp: 2.044 ± 0.283
1.927MetGlu: 1.927 ± 0.412
1.051MetPhe: 1.051 ± 0.237
1.693MetGly: 1.693 ± 0.641
0.35MetHis: 0.35 ± 0.141
1.985MetIle: 1.985 ± 0.31
3.679MetLys: 3.679 ± 0.397
2.044MetLeu: 2.044 ± 0.409
0.584MetMet: 0.584 ± 0.227
1.927MetAsn: 1.927 ± 0.37
0.876MetPro: 0.876 ± 0.267
0.759MetGln: 0.759 ± 0.237
0.993MetArg: 0.993 ± 0.256
2.16MetSer: 2.16 ± 0.289
1.518MetThr: 1.518 ± 0.263
1.927MetVal: 1.927 ± 0.382
0.234MetTrp: 0.234 ± 0.109
0.759MetTyr: 0.759 ± 0.235
0.0MetXaa: 0.0 ± 0.0
Asn
4.087AsnAla: 4.087 ± 0.49
0.292AsnCys: 0.292 ± 0.159
3.912AsnAsp: 3.912 ± 0.528
3.095AsnGlu: 3.095 ± 0.459
2.511AsnPhe: 2.511 ± 0.33
5.138AsnGly: 5.138 ± 0.559
0.934AsnHis: 0.934 ± 0.261
5.606AsnIle: 5.606 ± 0.383
6.131AsnLys: 6.131 ± 0.755
4.554AsnLeu: 4.554 ± 0.483
2.16AsnMet: 2.16 ± 0.281
3.503AsnAsn: 3.503 ± 0.559
2.628AsnPro: 2.628 ± 0.518
2.394AsnGln: 2.394 ± 0.405
2.219AsnArg: 2.219 ± 0.473
3.211AsnSer: 3.211 ± 0.442
3.503AsnThr: 3.503 ± 0.515
3.854AsnVal: 3.854 ± 0.484
0.701AsnTrp: 0.701 ± 0.177
2.16AsnTyr: 2.16 ± 0.402
0.0AsnXaa: 0.0 ± 0.0
Pro
1.635ProAla: 1.635 ± 0.347
0.117ProCys: 0.117 ± 0.1
2.16ProAsp: 2.16 ± 0.415
2.277ProGlu: 2.277 ± 0.408
1.285ProPhe: 1.285 ± 0.271
1.577ProGly: 1.577 ± 0.261
0.467ProHis: 0.467 ± 0.142
2.16ProIle: 2.16 ± 0.277
2.803ProLys: 2.803 ± 0.566
2.861ProLeu: 2.861 ± 0.444
1.051ProMet: 1.051 ± 0.2
2.511ProAsn: 2.511 ± 0.395
0.817ProPro: 0.817 ± 0.259
1.226ProGln: 1.226 ± 0.297
1.051ProArg: 1.051 ± 0.241
2.803ProSer: 2.803 ± 0.422
2.628ProThr: 2.628 ± 0.443
2.16ProVal: 2.16 ± 0.388
0.467ProTrp: 0.467 ± 0.181
1.343ProTyr: 1.343 ± 0.283
0.0ProXaa: 0.0 ± 0.0
Gln
2.16GlnAla: 2.16 ± 0.399
0.058GlnCys: 0.058 ± 0.07
2.102GlnAsp: 2.102 ± 0.555
1.518GlnGlu: 1.518 ± 0.308
1.168GlnPhe: 1.168 ± 0.198
2.044GlnGly: 2.044 ± 0.305
0.642GlnHis: 0.642 ± 0.254
1.577GlnIle: 1.577 ± 0.31
1.927GlnLys: 1.927 ± 0.335
3.036GlnLeu: 3.036 ± 0.563
0.934GlnMet: 0.934 ± 0.235
1.985GlnAsn: 1.985 ± 0.325
1.168GlnPro: 1.168 ± 0.313
1.752GlnGln: 1.752 ± 0.317
1.051GlnArg: 1.051 ± 0.272
2.394GlnSer: 2.394 ± 0.338
2.569GlnThr: 2.569 ± 0.337
2.102GlnVal: 2.102 ± 0.409
0.292GlnTrp: 0.292 ± 0.111
2.102GlnTyr: 2.102 ± 0.418
0.0GlnXaa: 0.0 ± 0.0
Arg
2.336ArgAla: 2.336 ± 0.452
0.234ArgCys: 0.234 ± 0.145
2.219ArgAsp: 2.219 ± 0.327
1.985ArgGlu: 1.985 ± 0.41
1.693ArgPhe: 1.693 ± 0.286
1.635ArgGly: 1.635 ± 0.244
0.584ArgHis: 0.584 ± 0.197
3.211ArgIle: 3.211 ± 0.377
3.387ArgLys: 3.387 ± 0.479
3.153ArgLeu: 3.153 ± 0.524
1.285ArgMet: 1.285 ± 0.275
2.336ArgAsn: 2.336 ± 0.442
1.577ArgPro: 1.577 ± 0.261
1.693ArgGln: 1.693 ± 0.28
1.46ArgArg: 1.46 ± 0.394
2.219ArgSer: 2.219 ± 0.365
2.219ArgThr: 2.219 ± 0.479
2.219ArgVal: 2.219 ± 0.361
0.35ArgTrp: 0.35 ± 0.108
1.285ArgTyr: 1.285 ± 0.29
0.0ArgXaa: 0.0 ± 0.0
Ser
3.562SerAla: 3.562 ± 0.618
0.234SerCys: 0.234 ± 0.121
4.146SerAsp: 4.146 ± 0.495
3.679SerGlu: 3.679 ± 0.42
2.452SerPhe: 2.452 ± 0.495
5.022SerGly: 5.022 ± 0.449
0.876SerHis: 0.876 ± 0.254
4.905SerIle: 4.905 ± 0.714
5.489SerLys: 5.489 ± 0.596
5.197SerLeu: 5.197 ± 0.732
1.46SerMet: 1.46 ± 0.39
4.613SerAsn: 4.613 ± 0.52
2.452SerPro: 2.452 ± 0.423
1.869SerGln: 1.869 ± 0.37
2.219SerArg: 2.219 ± 0.367
4.087SerSer: 4.087 ± 0.622
4.788SerThr: 4.788 ± 0.591
4.029SerVal: 4.029 ± 0.576
1.051SerTrp: 1.051 ± 0.273
2.686SerTyr: 2.686 ± 0.459
0.0SerXaa: 0.0 ± 0.0
Thr
5.138ThrAla: 5.138 ± 0.782
0.817ThrCys: 0.817 ± 0.253
4.204ThrAsp: 4.204 ± 0.438
4.788ThrGlu: 4.788 ± 0.753
2.978ThrPhe: 2.978 ± 0.468
4.379ThrGly: 4.379 ± 0.516
1.109ThrHis: 1.109 ± 0.254
5.372ThrIle: 5.372 ± 0.706
4.204ThrLys: 4.204 ± 0.494
5.372ThrLeu: 5.372 ± 0.477
1.693ThrMet: 1.693 ± 0.281
3.912ThrAsn: 3.912 ± 0.465
3.095ThrPro: 3.095 ± 0.463
1.869ThrGln: 1.869 ± 0.405
2.511ThrArg: 2.511 ± 0.379
3.795ThrSer: 3.795 ± 0.547
4.73ThrThr: 4.73 ± 0.576
5.138ThrVal: 5.138 ± 0.677
0.759ThrTrp: 0.759 ± 0.207
2.511ThrTyr: 2.511 ± 0.453
0.0ThrXaa: 0.0 ± 0.0
Val
3.737ValAla: 3.737 ± 0.708
0.175ValCys: 0.175 ± 0.113
4.73ValAsp: 4.73 ± 0.493
3.795ValGlu: 3.795 ± 0.53
2.277ValPhe: 2.277 ± 0.387
3.795ValGly: 3.795 ± 0.504
0.993ValHis: 0.993 ± 0.289
4.73ValIle: 4.73 ± 0.483
5.08ValLys: 5.08 ± 0.595
4.846ValLeu: 4.846 ± 0.783
1.752ValMet: 1.752 ± 0.25
4.846ValAsn: 4.846 ± 0.5
2.511ValPro: 2.511 ± 0.419
1.927ValGln: 1.927 ± 0.304
2.569ValArg: 2.569 ± 0.385
3.912ValSer: 3.912 ± 0.428
5.606ValThr: 5.606 ± 0.627
4.554ValVal: 4.554 ± 0.374
0.526ValTrp: 0.526 ± 0.165
2.394ValTyr: 2.394 ± 0.464
0.0ValXaa: 0.0 ± 0.0
Trp
0.934TrpAla: 0.934 ± 0.21
0.0TrpCys: 0.0 ± 0.0
1.226TrpAsp: 1.226 ± 0.309
0.759TrpGlu: 0.759 ± 0.217
0.526TrpPhe: 0.526 ± 0.164
0.642TrpGly: 0.642 ± 0.235
0.234TrpHis: 0.234 ± 0.119
0.467TrpIle: 0.467 ± 0.175
0.292TrpLys: 0.292 ± 0.153
1.168TrpLeu: 1.168 ± 0.19
0.409TrpMet: 0.409 ± 0.176
0.934TrpAsn: 0.934 ± 0.233
0.292TrpPro: 0.292 ± 0.18
0.35TrpGln: 0.35 ± 0.124
0.409TrpArg: 0.409 ± 0.138
1.051TrpSer: 1.051 ± 0.224
0.759TrpThr: 0.759 ± 0.198
0.584TrpVal: 0.584 ± 0.178
0.175TrpTrp: 0.175 ± 0.1
0.701TrpTyr: 0.701 ± 0.228
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.452TyrAla: 2.452 ± 0.452
0.409TyrCys: 0.409 ± 0.184
2.803TyrAsp: 2.803 ± 0.626
2.861TyrGlu: 2.861 ± 0.472
1.577TyrPhe: 1.577 ± 0.313
2.277TyrGly: 2.277 ± 0.408
0.817TyrHis: 0.817 ± 0.222
3.153TyrIle: 3.153 ± 0.532
3.27TyrLys: 3.27 ± 0.54
3.854TyrLeu: 3.854 ± 0.598
1.226TyrMet: 1.226 ± 0.278
3.971TyrAsn: 3.971 ± 0.633
1.635TyrPro: 1.635 ± 0.331
1.401TyrGln: 1.401 ± 0.36
2.16TyrArg: 2.16 ± 0.395
2.744TyrSer: 2.744 ± 0.551
1.985TyrThr: 1.985 ± 0.426
2.277TyrVal: 2.277 ± 0.359
0.467TyrTrp: 0.467 ± 0.143
2.452TyrTyr: 2.452 ± 0.51
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 80 proteins (17127 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski