Amino acid dipepetide frequency for Pectobacterium phage MA13

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.842AlaAla: 11.842 ± 1.629
0.619AlaCys: 0.619 ± 0.267
6.628AlaAsp: 6.628 ± 0.831
6.716AlaGlu: 6.716 ± 0.885
3.446AlaPhe: 3.446 ± 0.551
7.511AlaGly: 7.511 ± 0.798
1.944AlaHis: 1.944 ± 0.482
3.535AlaIle: 3.535 ± 0.623
5.921AlaLys: 5.921 ± 0.876
8.13AlaLeu: 8.13 ± 0.789
3.888AlaMet: 3.888 ± 0.738
3.8AlaAsn: 3.8 ± 0.56
2.828AlaPro: 2.828 ± 0.375
4.949AlaGln: 4.949 ± 0.775
4.772AlaArg: 4.772 ± 0.777
5.479AlaSer: 5.479 ± 0.646
5.656AlaThr: 5.656 ± 0.731
6.274AlaVal: 6.274 ± 0.846
1.237AlaTrp: 1.237 ± 0.32
3.446AlaTyr: 3.446 ± 0.461
0.0AlaXaa: 0.0 ± 0.0
Cys
0.53CysAla: 0.53 ± 0.247
0.088CysCys: 0.088 ± 0.099
0.619CysAsp: 0.619 ± 0.235
0.265CysGlu: 0.265 ± 0.132
0.353CysPhe: 0.353 ± 0.194
0.795CysGly: 0.795 ± 0.248
0.353CysHis: 0.353 ± 0.152
0.177CysIle: 0.177 ± 0.094
0.265CysLys: 0.265 ± 0.145
0.972CysLeu: 0.972 ± 0.341
0.265CysMet: 0.265 ± 0.152
0.442CysAsn: 0.442 ± 0.246
0.619CysPro: 0.619 ± 0.288
0.353CysGln: 0.353 ± 0.165
0.265CysArg: 0.265 ± 0.146
0.442CysSer: 0.442 ± 0.209
0.177CysThr: 0.177 ± 0.125
0.972CysVal: 0.972 ± 0.309
0.0CysTrp: 0.0 ± 0.0
0.53CysTyr: 0.53 ± 0.295
0.0CysXaa: 0.0 ± 0.0
Asp
5.302AspAla: 5.302 ± 0.733
0.619AspCys: 0.619 ± 0.182
4.242AspAsp: 4.242 ± 0.618
3.8AspGlu: 3.8 ± 0.565
2.474AspPhe: 2.474 ± 0.424
5.125AspGly: 5.125 ± 0.932
0.619AspHis: 0.619 ± 0.289
3.27AspIle: 3.27 ± 0.594
4.507AspLys: 4.507 ± 0.484
4.772AspLeu: 4.772 ± 0.768
1.856AspMet: 1.856 ± 0.36
2.563AspAsn: 2.563 ± 0.485
2.828AspPro: 2.828 ± 0.538
2.121AspGln: 2.121 ± 0.475
2.474AspArg: 2.474 ± 0.466
4.507AspSer: 4.507 ± 0.616
3.888AspThr: 3.888 ± 0.657
5.479AspVal: 5.479 ± 0.856
1.06AspTrp: 1.06 ± 0.306
3.093AspTyr: 3.093 ± 0.532
0.0AspXaa: 0.0 ± 0.0
Glu
5.656GluAla: 5.656 ± 0.693
0.353GluCys: 0.353 ± 0.146
3.446GluAsp: 3.446 ± 0.631
3.977GluGlu: 3.977 ± 0.608
2.033GluPhe: 2.033 ± 0.402
4.153GluGly: 4.153 ± 0.551
1.237GluHis: 1.237 ± 0.305
2.121GluIle: 2.121 ± 0.471
3.8GluLys: 3.8 ± 0.538
5.921GluLeu: 5.921 ± 0.841
2.209GluMet: 2.209 ± 0.411
1.767GluAsn: 1.767 ± 0.447
2.033GluPro: 2.033 ± 0.58
4.419GluGln: 4.419 ± 0.737
3.977GluArg: 3.977 ± 0.568
4.242GluSer: 4.242 ± 0.565
3.005GluThr: 3.005 ± 0.497
3.8GluVal: 3.8 ± 0.569
0.972GluTrp: 0.972 ± 0.392
1.944GluTyr: 1.944 ± 0.567
0.0GluXaa: 0.0 ± 0.0
Phe
2.474PheAla: 2.474 ± 0.343
0.353PheCys: 0.353 ± 0.178
2.563PheAsp: 2.563 ± 0.455
1.767PheGlu: 1.767 ± 0.359
1.502PhePhe: 1.502 ± 0.332
2.828PheGly: 2.828 ± 0.491
0.353PheHis: 0.353 ± 0.178
2.033PheIle: 2.033 ± 0.484
2.033PheLys: 2.033 ± 0.426
2.474PheLeu: 2.474 ± 0.434
1.326PheMet: 1.326 ± 0.284
1.502PheAsn: 1.502 ± 0.331
1.237PhePro: 1.237 ± 0.309
1.856PheGln: 1.856 ± 0.383
1.944PheArg: 1.944 ± 0.316
2.209PheSer: 2.209 ± 0.391
2.474PheThr: 2.474 ± 0.681
2.386PheVal: 2.386 ± 0.497
0.442PheTrp: 0.442 ± 0.184
1.237PheTyr: 1.237 ± 0.359
0.0PheXaa: 0.0 ± 0.0
Gly
6.981GlyAla: 6.981 ± 1.067
1.06GlyCys: 1.06 ± 0.378
5.744GlyAsp: 5.744 ± 0.816
4.507GlyGlu: 4.507 ± 0.778
2.563GlyPhe: 2.563 ± 0.552
7.423GlyGly: 7.423 ± 1.06
1.326GlyHis: 1.326 ± 0.358
3.712GlyIle: 3.712 ± 0.558
4.684GlyLys: 4.684 ± 0.626
6.274GlyLeu: 6.274 ± 0.773
2.828GlyMet: 2.828 ± 0.68
2.828GlyAsn: 2.828 ± 0.622
2.121GlyPro: 2.121 ± 0.447
2.651GlyGln: 2.651 ± 0.665
5.125GlyArg: 5.125 ± 0.606
5.214GlySer: 5.214 ± 0.501
5.302GlyThr: 5.302 ± 0.799
5.214GlyVal: 5.214 ± 0.595
0.972GlyTrp: 0.972 ± 0.285
2.651GlyTyr: 2.651 ± 0.678
0.0GlyXaa: 0.0 ± 0.0
His
1.944HisAla: 1.944 ± 0.466
0.088HisCys: 0.088 ± 0.079
1.326HisAsp: 1.326 ± 0.324
1.326HisGlu: 1.326 ± 0.366
0.53HisPhe: 0.53 ± 0.186
1.856HisGly: 1.856 ± 0.519
0.265HisHis: 0.265 ± 0.185
0.795HisIle: 0.795 ± 0.254
0.795HisLys: 0.795 ± 0.174
1.767HisLeu: 1.767 ± 0.336
0.795HisMet: 0.795 ± 0.244
0.442HisAsn: 0.442 ± 0.193
0.972HisPro: 0.972 ± 0.318
0.884HisGln: 0.884 ± 0.288
1.149HisArg: 1.149 ± 0.352
0.884HisSer: 0.884 ± 0.285
0.795HisThr: 0.795 ± 0.252
0.53HisVal: 0.53 ± 0.201
0.265HisTrp: 0.265 ± 0.123
0.884HisTyr: 0.884 ± 0.208
0.0HisXaa: 0.0 ± 0.0
Ile
4.507IleAla: 4.507 ± 0.786
0.353IleCys: 0.353 ± 0.184
2.828IleAsp: 2.828 ± 0.511
2.121IleGlu: 2.121 ± 0.313
1.326IlePhe: 1.326 ± 0.371
3.446IleGly: 3.446 ± 0.675
1.06IleHis: 1.06 ± 0.245
2.033IleIle: 2.033 ± 0.473
2.651IleLys: 2.651 ± 0.409
2.298IleLeu: 2.298 ± 0.548
1.414IleMet: 1.414 ± 0.378
1.502IleAsn: 1.502 ± 0.361
1.944IlePro: 1.944 ± 0.315
1.944IleGln: 1.944 ± 0.397
3.181IleArg: 3.181 ± 0.569
2.121IleSer: 2.121 ± 0.483
2.298IleThr: 2.298 ± 0.525
2.298IleVal: 2.298 ± 0.348
0.619IleTrp: 0.619 ± 0.235
1.502IleTyr: 1.502 ± 0.31
0.0IleXaa: 0.0 ± 0.0
Lys
7.865LysAla: 7.865 ± 1.075
0.177LysCys: 0.177 ± 0.124
4.507LysAsp: 4.507 ± 0.808
2.916LysGlu: 2.916 ± 0.485
1.414LysPhe: 1.414 ± 0.311
4.684LysGly: 4.684 ± 0.693
1.237LysHis: 1.237 ± 0.358
2.121LysIle: 2.121 ± 0.626
4.065LysLys: 4.065 ± 0.676
7.07LysLeu: 7.07 ± 0.775
2.033LysMet: 2.033 ± 0.434
1.591LysAsn: 1.591 ± 0.343
3.005LysPro: 3.005 ± 0.545
2.828LysGln: 2.828 ± 0.581
3.535LysArg: 3.535 ± 0.678
2.828LysSer: 2.828 ± 0.516
2.386LysThr: 2.386 ± 0.405
3.977LysVal: 3.977 ± 0.512
0.884LysTrp: 0.884 ± 0.328
2.739LysTyr: 2.739 ± 0.453
0.0LysXaa: 0.0 ± 0.0
Leu
9.544LeuAla: 9.544 ± 1.103
0.884LeuCys: 0.884 ± 0.308
5.391LeuAsp: 5.391 ± 0.555
5.125LeuGlu: 5.125 ± 0.773
3.446LeuPhe: 3.446 ± 0.476
6.716LeuGly: 6.716 ± 1.086
1.856LeuHis: 1.856 ± 0.338
3.181LeuIle: 3.181 ± 0.597
4.949LeuLys: 4.949 ± 0.831
6.981LeuLeu: 6.981 ± 0.931
2.033LeuMet: 2.033 ± 0.4
4.507LeuAsn: 4.507 ± 0.513
4.684LeuPro: 4.684 ± 0.539
3.535LeuGln: 3.535 ± 0.62
3.623LeuArg: 3.623 ± 0.514
5.744LeuSer: 5.744 ± 0.658
3.623LeuThr: 3.623 ± 0.596
4.86LeuVal: 4.86 ± 0.674
0.795LeuTrp: 0.795 ± 0.205
2.828LeuTyr: 2.828 ± 0.544
0.0LeuXaa: 0.0 ± 0.0
Met
3.358MetAla: 3.358 ± 0.534
0.265MetCys: 0.265 ± 0.154
2.033MetAsp: 2.033 ± 0.417
1.414MetGlu: 1.414 ± 0.294
1.149MetPhe: 1.149 ± 0.343
1.944MetGly: 1.944 ± 0.449
0.972MetHis: 0.972 ± 0.249
0.53MetIle: 0.53 ± 0.185
2.121MetLys: 2.121 ± 0.57
3.446MetLeu: 3.446 ± 0.559
0.972MetMet: 0.972 ± 0.303
1.591MetAsn: 1.591 ± 0.539
1.237MetPro: 1.237 ± 0.295
2.033MetGln: 2.033 ± 0.486
2.121MetArg: 2.121 ± 0.428
2.121MetSer: 2.121 ± 0.392
1.944MetThr: 1.944 ± 0.491
1.856MetVal: 1.856 ± 0.316
0.619MetTrp: 0.619 ± 0.237
0.707MetTyr: 0.707 ± 0.238
0.0MetXaa: 0.0 ± 0.0
Asn
3.623AsnAla: 3.623 ± 0.54
0.353AsnCys: 0.353 ± 0.142
2.209AsnAsp: 2.209 ± 0.313
2.386AsnGlu: 2.386 ± 0.393
1.767AsnPhe: 1.767 ± 0.382
3.093AsnGly: 3.093 ± 0.462
0.353AsnHis: 0.353 ± 0.167
1.679AsnIle: 1.679 ± 0.395
3.005AsnLys: 3.005 ± 0.543
3.181AsnLeu: 3.181 ± 0.518
1.767AsnMet: 1.767 ± 0.414
1.591AsnAsn: 1.591 ± 0.404
2.474AsnPro: 2.474 ± 0.348
2.209AsnGln: 2.209 ± 0.661
2.563AsnArg: 2.563 ± 0.638
1.502AsnSer: 1.502 ± 0.438
2.121AsnThr: 2.121 ± 0.36
2.916AsnVal: 2.916 ± 0.485
0.353AsnTrp: 0.353 ± 0.149
1.502AsnTyr: 1.502 ± 0.279
0.0AsnXaa: 0.0 ± 0.0
Pro
4.772ProAla: 4.772 ± 0.986
0.265ProCys: 0.265 ± 0.143
3.005ProAsp: 3.005 ± 0.441
3.888ProGlu: 3.888 ± 0.638
1.149ProPhe: 1.149 ± 0.296
3.623ProGly: 3.623 ± 0.537
0.442ProHis: 0.442 ± 0.214
1.856ProIle: 1.856 ± 0.326
2.739ProLys: 2.739 ± 0.579
3.623ProLeu: 3.623 ± 0.586
0.884ProMet: 0.884 ± 0.238
1.414ProAsn: 1.414 ± 0.287
1.767ProPro: 1.767 ± 0.743
2.033ProGln: 2.033 ± 0.424
1.149ProArg: 1.149 ± 0.288
2.209ProSer: 2.209 ± 0.356
2.386ProThr: 2.386 ± 0.546
3.8ProVal: 3.8 ± 0.627
0.53ProTrp: 0.53 ± 0.175
1.237ProTyr: 1.237 ± 0.339
0.0ProXaa: 0.0 ± 0.0
Gln
5.214GlnAla: 5.214 ± 1.068
0.353GlnCys: 0.353 ± 0.201
1.944GlnAsp: 1.944 ± 0.365
3.181GlnGlu: 3.181 ± 0.472
1.237GlnPhe: 1.237 ± 0.261
3.181GlnGly: 3.181 ± 0.478
1.237GlnHis: 1.237 ± 0.317
2.121GlnIle: 2.121 ± 0.557
2.474GlnLys: 2.474 ± 0.435
4.684GlnLeu: 4.684 ± 0.676
2.298GlnMet: 2.298 ± 0.647
1.856GlnAsn: 1.856 ± 0.447
1.679GlnPro: 1.679 ± 0.498
3.181GlnGln: 3.181 ± 0.847
2.651GlnArg: 2.651 ± 0.537
2.474GlnSer: 2.474 ± 0.533
2.209GlnThr: 2.209 ± 0.413
3.005GlnVal: 3.005 ± 0.428
0.53GlnTrp: 0.53 ± 0.19
2.033GlnTyr: 2.033 ± 0.387
0.0GlnXaa: 0.0 ± 0.0
Arg
4.242ArgAla: 4.242 ± 0.536
0.442ArgCys: 0.442 ± 0.186
3.005ArgAsp: 3.005 ± 0.521
3.888ArgGlu: 3.888 ± 0.378
3.093ArgPhe: 3.093 ± 0.463
3.712ArgGly: 3.712 ± 0.535
1.502ArgHis: 1.502 ± 0.343
3.093ArgIle: 3.093 ± 0.531
3.093ArgLys: 3.093 ± 0.654
3.977ArgLeu: 3.977 ± 0.52
1.856ArgMet: 1.856 ± 0.469
3.005ArgAsn: 3.005 ± 0.479
2.121ArgPro: 2.121 ± 0.548
2.298ArgGln: 2.298 ± 0.629
3.27ArgArg: 3.27 ± 0.43
2.828ArgSer: 2.828 ± 0.529
3.005ArgThr: 3.005 ± 0.479
3.535ArgVal: 3.535 ± 0.625
1.237ArgTrp: 1.237 ± 0.364
1.767ArgTyr: 1.767 ± 0.395
0.0ArgXaa: 0.0 ± 0.0
Ser
6.186SerAla: 6.186 ± 0.876
0.353SerCys: 0.353 ± 0.174
4.242SerAsp: 4.242 ± 0.759
3.712SerGlu: 3.712 ± 0.654
1.767SerPhe: 1.767 ± 0.338
5.479SerGly: 5.479 ± 0.452
0.53SerHis: 0.53 ± 0.197
2.563SerIle: 2.563 ± 0.389
3.977SerLys: 3.977 ± 0.693
4.595SerLeu: 4.595 ± 0.676
1.856SerMet: 1.856 ± 0.356
2.563SerAsn: 2.563 ± 0.438
3.093SerPro: 3.093 ± 0.651
2.121SerGln: 2.121 ± 0.49
3.535SerArg: 3.535 ± 0.478
2.033SerSer: 2.033 ± 0.44
3.181SerThr: 3.181 ± 0.489
3.358SerVal: 3.358 ± 0.452
0.707SerTrp: 0.707 ± 0.255
2.298SerTyr: 2.298 ± 0.529
0.0SerXaa: 0.0 ± 0.0
Thr
4.949ThrAla: 4.949 ± 0.565
0.795ThrCys: 0.795 ± 0.302
2.916ThrAsp: 2.916 ± 0.44
2.828ThrGlu: 2.828 ± 0.508
1.591ThrPhe: 1.591 ± 0.279
4.949ThrGly: 4.949 ± 0.794
1.06ThrHis: 1.06 ± 0.31
1.767ThrIle: 1.767 ± 0.445
3.446ThrLys: 3.446 ± 0.525
4.949ThrLeu: 4.949 ± 0.79
1.149ThrMet: 1.149 ± 0.299
1.502ThrAsn: 1.502 ± 0.444
3.535ThrPro: 3.535 ± 0.534
1.944ThrGln: 1.944 ± 0.396
3.005ThrArg: 3.005 ± 0.31
3.8ThrSer: 3.8 ± 0.618
3.358ThrThr: 3.358 ± 0.553
4.419ThrVal: 4.419 ± 0.919
0.795ThrTrp: 0.795 ± 0.258
1.591ThrTyr: 1.591 ± 0.465
0.0ThrXaa: 0.0 ± 0.0
Val
5.656ValAla: 5.656 ± 0.763
0.353ValCys: 0.353 ± 0.141
4.33ValAsp: 4.33 ± 0.597
3.535ValGlu: 3.535 ± 0.41
2.033ValPhe: 2.033 ± 0.407
5.744ValGly: 5.744 ± 0.823
1.237ValHis: 1.237 ± 0.298
2.563ValIle: 2.563 ± 0.521
3.8ValLys: 3.8 ± 0.69
4.949ValLeu: 4.949 ± 0.618
1.767ValMet: 1.767 ± 0.459
3.535ValAsn: 3.535 ± 0.558
2.739ValPro: 2.739 ± 0.577
4.242ValGln: 4.242 ± 0.581
4.242ValArg: 4.242 ± 0.602
4.419ValSer: 4.419 ± 0.482
3.977ValThr: 3.977 ± 0.697
4.684ValVal: 4.684 ± 0.652
0.884ValTrp: 0.884 ± 0.246
2.121ValTyr: 2.121 ± 0.381
0.0ValXaa: 0.0 ± 0.0
Trp
1.326TrpAla: 1.326 ± 0.255
0.088TrpCys: 0.088 ± 0.093
0.795TrpAsp: 0.795 ± 0.251
0.972TrpGlu: 0.972 ± 0.302
1.149TrpPhe: 1.149 ± 0.182
0.177TrpGly: 0.177 ± 0.119
0.265TrpHis: 0.265 ± 0.139
0.619TrpIle: 0.619 ± 0.291
0.972TrpLys: 0.972 ± 0.326
1.767TrpLeu: 1.767 ± 0.449
0.265TrpMet: 0.265 ± 0.149
0.442TrpAsn: 0.442 ± 0.16
0.442TrpPro: 0.442 ± 0.164
0.53TrpGln: 0.53 ± 0.17
0.442TrpArg: 0.442 ± 0.189
0.972TrpSer: 0.972 ± 0.243
1.06TrpThr: 1.06 ± 0.353
0.53TrpVal: 0.53 ± 0.235
0.795TrpTrp: 0.795 ± 0.205
0.353TrpTyr: 0.353 ± 0.167
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.651TyrAla: 2.651 ± 0.317
0.619TyrCys: 0.619 ± 0.25
2.828TyrAsp: 2.828 ± 0.545
2.739TyrGlu: 2.739 ± 0.474
0.972TyrPhe: 0.972 ± 0.252
2.739TyrGly: 2.739 ± 0.561
0.442TyrHis: 0.442 ± 0.196
1.679TyrIle: 1.679 ± 0.44
2.563TyrLys: 2.563 ± 0.432
2.474TyrLeu: 2.474 ± 0.521
0.884TyrMet: 0.884 ± 0.359
2.209TyrAsn: 2.209 ± 0.323
1.414TyrPro: 1.414 ± 0.322
1.414TyrGln: 1.414 ± 0.32
2.033TyrArg: 2.033 ± 0.38
2.121TyrSer: 2.121 ± 0.33
1.502TyrThr: 1.502 ± 0.37
2.916TyrVal: 2.916 ± 0.543
0.265TyrTrp: 0.265 ± 0.151
1.06TyrTyr: 1.06 ± 0.387
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 33 proteins (11317 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski