Amino acid dipeptide frequency for Pseudomonas phage Epa5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
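To make the counting concrete, here is a minimal Python sketch of how dipeptide frequencies and the uniform expectation could be computed. It is not the code used by Proteome-pI: the names `dipeptide_permille` and `classify`, the one-letter alphabet, and the choice to count overlapping pairs within each protein are assumptions made for illustration.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (assumed encoding)

# Uniform expectation over the 400 standard dipeptides, in per mille.
UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein (assumption: pairs do not
        # span protein boundaries).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(value_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"
```

With these definitions, a value such as 6.816‰ is labelled "over" and 0.401‰ is labelled "under", relative to the 2.5‰ uniform expectation.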

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
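As a small illustration of the unit conversion, the following Python sketch turns a per mille value into a fraction and into a percentage. The helper names are hypothetical; the AlaGlu value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (10 per mille = 1 percent)."""
    return value_permille / 10.0


ala_glu = 11.227  # AlaGlu frequency from the table, in per mille
print(permille_to_fraction(ala_glu))  # fraction of all dipeptides
print(permille_to_percent(ala_glu))   # the same value in percent
```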

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.816 ± 1.283
AlaCys: 0.401 ± 0.273
AlaAsp: 4.411 ± 0.596
AlaGlu: 11.227 ± 2.57
AlaPhe: 3.609 ± 1.205
AlaGly: 8.821 ± 1.983
AlaHis: 1.604 ± 0.523
AlaIle: 3.609 ± 0.994
AlaLys: 6.816 ± 2.272
AlaLeu: 9.623 ± 1.338
AlaMet: 3.609 ± 0.607
AlaAsn: 2.807 ± 1.439
AlaPro: 4.01 ± 0.658
AlaGln: 2.406 ± 0.975
AlaArg: 4.411 ± 1.208
AlaSer: 5.613 ± 1.343
AlaThr: 5.613 ± 2.144
AlaVal: 5.613 ± 0.915
AlaTrp: 0.802 ± 0.283
AlaTyr: 3.208 ± 0.975
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.401 ± 0.36
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.802 ± 0.283
CysPhe: 0.0 ± 0.0
CysGly: 1.604 ± 0.811
CysHis: 0.401 ± 0.273
CysIle: 0.401 ± 0.273
CysLys: 0.0 ± 0.0
CysLeu: 0.401 ± 0.474
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.401 ± 0.273
CysGln: 0.0 ± 0.0
CysArg: 1.203 ± 0.492
CysSer: 1.203 ± 0.481
CysThr: 0.401 ± 0.279
CysVal: 0.802 ± 0.283
CysTrp: 0.401 ± 0.273
CysTyr: 0.401 ± 0.279
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.613 ± 1.45
AspCys: 0.802 ± 0.283
AspAsp: 2.807 ± 0.503
AspGlu: 6.415 ± 1.543
AspPhe: 2.406 ± 0.908
AspGly: 5.613 ± 1.716
AspHis: 0.802 ± 0.283
AspIle: 3.609 ± 1.437
AspLys: 5.213 ± 1.096
AspLeu: 6.014 ± 0.84
AspMet: 1.203 ± 0.342
AspAsn: 2.807 ± 0.859
AspPro: 4.01 ± 1.163
AspGln: 2.807 ± 0.602
AspArg: 2.807 ± 1.022
AspSer: 2.807 ± 1.232
AspThr: 2.807 ± 1.247
AspVal: 4.01 ± 0.881
AspTrp: 1.604 ± 0.566
AspTyr: 1.604 ± 0.643
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.618 ± 1.555
GluCys: 0.401 ± 0.273
GluAsp: 6.816 ± 1.871
GluGlu: 4.411 ± 0.306
GluPhe: 3.609 ± 0.832
GluGly: 4.812 ± 0.857
GluHis: 1.604 ± 0.729
GluIle: 3.609 ± 1.745
GluLys: 4.01 ± 1.188
GluLeu: 6.415 ± 1.309
GluMet: 2.807 ± 0.883
GluAsn: 2.807 ± 0.916
GluPro: 2.807 ± 0.875
GluGln: 2.005 ± 1.438
GluArg: 5.213 ± 1.353
GluSer: 3.609 ± 0.565
GluThr: 2.807 ± 0.683
GluVal: 4.812 ± 1.409
GluTrp: 0.0 ± 0.0
GluTyr: 1.604 ± 0.866
GluXaa: 0.0 ± 0.0
Phe
PheAla: 5.213 ± 0.951
PheCys: 0.0 ± 0.0
PheAsp: 2.005 ± 0.511
PheGlu: 2.807 ± 1.217
PhePhe: 0.401 ± 0.456
PheGly: 4.01 ± 1.59
PheHis: 0.802 ± 0.546
PheIle: 0.0 ± 0.0
PheLys: 2.005 ± 0.66
PheLeu: 3.208 ± 0.914
PheMet: 1.203 ± 0.777
PheAsn: 2.005 ± 0.482
PhePro: 0.802 ± 0.521
PheGln: 1.604 ± 1.023
PheArg: 2.807 ± 0.272
PheSer: 2.005 ± 0.516
PheThr: 2.807 ± 1.747
PheVal: 2.807 ± 0.762
PheTrp: 0.401 ± 0.273
PheTyr: 0.802 ± 0.546
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.812 ± 1.588
GlyCys: 0.0 ± 0.0
GlyAsp: 8.019 ± 0.976
GlyGlu: 4.411 ± 1.067
GlyPhe: 1.604 ± 1.092
GlyGly: 7.618 ± 1.297
GlyHis: 1.203 ± 0.492
GlyIle: 4.01 ± 1.749
GlyLys: 6.415 ± 2.307
GlyLeu: 8.42 ± 2.436
GlyMet: 0.802 ± 0.496
GlyAsn: 3.609 ± 0.88
GlyPro: 4.01 ± 0.687
GlyGln: 6.014 ± 2.03
GlyArg: 4.812 ± 0.657
GlySer: 5.213 ± 1.422
GlyThr: 6.415 ± 3.138
GlyVal: 6.415 ± 1.15
GlyTrp: 0.802 ± 0.283
GlyTyr: 1.604 ± 0.566
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.005 ± 0.598
HisCys: 0.0 ± 0.0
HisAsp: 0.802 ± 0.559
HisGlu: 1.604 ± 0.729
HisPhe: 1.203 ± 0.481
HisGly: 1.203 ± 0.565
HisHis: 0.0 ± 0.0
HisIle: 1.604 ± 0.729
HisLys: 2.406 ± 0.961
HisLeu: 2.406 ± 1.256
HisMet: 0.401 ± 0.279
HisAsn: 0.401 ± 0.273
HisPro: 1.203 ± 0.481
HisGln: 0.401 ± 0.279
HisArg: 0.802 ± 0.49
HisSer: 0.802 ± 0.552
HisThr: 0.802 ± 0.446
HisVal: 0.401 ± 0.273
HisTrp: 0.0 ± 0.0
HisTyr: 0.802 ± 0.283
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.812 ± 1.616
IleCys: 0.401 ± 0.273
IleAsp: 3.208 ± 1.215
IleGlu: 4.411 ± 1.154
IlePhe: 2.005 ± 0.647
IleGly: 2.807 ± 0.836
IleHis: 1.203 ± 0.492
IleIle: 2.406 ± 0.592
IleLys: 2.406 ± 1.215
IleLeu: 1.604 ± 0.762
IleMet: 1.203 ± 0.44
IleAsn: 1.604 ± 0.724
IlePro: 1.604 ± 0.724
IleGln: 2.807 ± 0.416
IleArg: 2.807 ± 1.674
IleSer: 3.208 ± 0.676
IleThr: 2.807 ± 1.507
IleVal: 2.406 ± 0.685
IleTrp: 0.401 ± 0.279
IleTyr: 0.802 ± 0.559
IleXaa: 0.0 ± 0.0
Lys
LysAla: 8.42 ± 2.859
LysCys: 0.401 ± 0.273
LysAsp: 3.609 ± 1.087
LysGlu: 4.01 ± 0.796
LysPhe: 1.604 ± 0.855
LysGly: 4.411 ± 1.674
LysHis: 2.005 ± 1.048
LysIle: 1.604 ± 0.756
LysLys: 3.609 ± 1.344
LysLeu: 5.613 ± 0.938
LysMet: 2.406 ± 0.961
LysAsn: 2.406 ± 0.924
LysPro: 0.802 ± 0.446
LysGln: 0.0 ± 0.0
LysArg: 2.807 ± 0.915
LysSer: 1.604 ± 0.602
LysThr: 4.411 ± 1.296
LysVal: 5.613 ± 0.99
LysTrp: 1.203 ± 0.53
LysTyr: 4.01 ± 0.591
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.821 ± 2.339
LeuCys: 2.005 ± 0.902
LeuAsp: 7.618 ± 0.847
LeuGlu: 4.411 ± 1.038
LeuPhe: 2.005 ± 0.808
LeuGly: 6.014 ± 0.921
LeuHis: 1.203 ± 0.819
LeuIle: 3.208 ± 0.811
LeuLys: 5.213 ± 0.838
LeuLeu: 6.816 ± 2.242
LeuMet: 3.208 ± 0.792
LeuAsn: 4.01 ± 1.045
LeuPro: 7.618 ± 1.754
LeuGln: 6.014 ± 1.305
LeuArg: 4.812 ± 1.294
LeuSer: 4.411 ± 0.872
LeuThr: 4.01 ± 1.401
LeuVal: 4.812 ± 0.606
LeuTrp: 1.203 ± 0.492
LeuTyr: 1.203 ± 0.837
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.609 ± 0.701
MetCys: 0.401 ± 0.273
MetAsp: 1.604 ± 0.428
MetGlu: 0.401 ± 0.279
MetPhe: 0.401 ± 0.36
MetGly: 2.406 ± 1.256
MetHis: 0.401 ± 0.36
MetIle: 0.401 ± 0.273
MetLys: 1.203 ± 0.392
MetLeu: 2.005 ± 0.398
MetMet: 1.203 ± 0.492
MetAsn: 0.401 ± 0.273
MetPro: 0.802 ± 0.552
MetGln: 2.406 ± 0.961
MetArg: 2.807 ± 0.817
MetSer: 2.406 ± 0.948
MetThr: 1.203 ± 0.492
MetVal: 2.005 ± 0.74
MetTrp: 0.401 ± 0.279
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.609 ± 1.045
AsnCys: 0.401 ± 0.273
AsnAsp: 2.005 ± 0.88
AsnGlu: 2.406 ± 0.728
AsnPhe: 1.203 ± 0.583
AsnGly: 4.812 ± 1.58
AsnHis: 0.401 ± 0.279
AsnIle: 2.807 ± 0.642
AsnLys: 0.802 ± 0.559
AsnLeu: 4.411 ± 0.96
AsnMet: 0.401 ± 0.328
AsnAsn: 0.0 ± 0.0
AsnPro: 1.604 ± 0.614
AsnGln: 0.802 ± 0.434
AsnArg: 2.406 ± 0.668
AsnSer: 0.401 ± 0.273
AsnThr: 0.802 ± 0.553
AsnVal: 5.613 ± 1.481
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.802 ± 0.362
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.014 ± 1.363
ProCys: 0.0 ± 0.0
ProAsp: 5.213 ± 1.767
ProGlu: 4.411 ± 0.849
ProPhe: 3.208 ± 0.656
ProGly: 4.812 ± 1.572
ProHis: 0.401 ± 0.273
ProIle: 2.406 ± 0.425
ProLys: 2.807 ± 1.012
ProLeu: 2.807 ± 0.878
ProMet: 1.203 ± 0.639
ProAsn: 2.005 ± 0.74
ProPro: 0.802 ± 0.464
ProGln: 0.802 ± 0.648
ProArg: 3.208 ± 1.276
ProSer: 2.005 ± 0.902
ProThr: 2.005 ± 0.909
ProVal: 1.604 ± 0.396
ProTrp: 0.802 ± 0.546
ProTyr: 1.604 ± 0.476
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.01 ± 0.596
GlnCys: 0.0 ± 0.0
GlnAsp: 1.604 ± 0.602
GlnGlu: 4.411 ± 1.207
GlnPhe: 1.203 ± 0.632
GlnGly: 3.208 ± 0.659
GlnHis: 0.802 ± 0.283
GlnIle: 1.203 ± 0.562
GlnLys: 2.807 ± 1.177
GlnLeu: 5.213 ± 1.591
GlnMet: 0.802 ± 0.283
GlnAsn: 0.802 ± 0.636
GlnPro: 2.005 ± 0.834
GlnGln: 2.807 ± 0.898
GlnArg: 1.203 ± 0.392
GlnSer: 1.604 ± 0.495
GlnThr: 2.807 ± 0.55
GlnVal: 4.411 ± 1.136
GlnTrp: 0.802 ± 0.468
GlnTyr: 1.604 ± 0.569
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.613 ± 1.46
ArgCys: 0.802 ± 0.559
ArgAsp: 2.005 ± 0.74
ArgGlu: 1.604 ± 0.441
ArgPhe: 1.604 ± 0.809
ArgGly: 5.613 ± 1.613
ArgHis: 3.208 ± 1.182
ArgIle: 3.208 ± 0.636
ArgLys: 3.609 ± 1.087
ArgLeu: 4.812 ± 0.886
ArgMet: 2.406 ± 0.462
ArgAsn: 0.802 ± 0.434
ArgPro: 2.807 ± 0.97
ArgGln: 3.609 ± 0.738
ArgArg: 6.014 ± 1.428
ArgSer: 4.411 ± 1.488
ArgThr: 2.406 ± 1.055
ArgVal: 2.406 ± 0.668
ArgTrp: 0.401 ± 0.279
ArgTyr: 2.406 ± 0.783
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.807 ± 1.052
SerCys: 1.203 ± 0.342
SerAsp: 2.406 ± 0.756
SerGlu: 3.609 ± 1.093
SerPhe: 4.812 ± 1.271
SerGly: 5.213 ± 0.309
SerHis: 0.0 ± 0.0
SerIle: 3.208 ± 1.01
SerLys: 3.609 ± 0.98
SerLeu: 3.609 ± 1.065
SerMet: 1.203 ± 0.98
SerAsn: 1.203 ± 0.583
SerPro: 2.406 ± 0.616
SerGln: 0.401 ± 0.279
SerArg: 2.807 ± 0.889
SerSer: 1.604 ± 1.028
SerThr: 1.203 ± 1.001
SerVal: 4.01 ± 0.765
SerTrp: 1.203 ± 0.492
SerTyr: 2.807 ± 0.416
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.213 ± 1.56
ThrCys: 0.802 ± 0.521
ThrAsp: 4.01 ± 1.597
ThrGlu: 2.005 ± 0.74
ThrPhe: 2.005 ± 0.902
ThrGly: 4.812 ± 1.352
ThrHis: 1.203 ± 0.635
ThrIle: 2.807 ± 0.969
ThrLys: 2.406 ± 0.801
ThrLeu: 5.613 ± 2.207
ThrMet: 0.802 ± 0.553
ThrAsn: 2.005 ± 1.12
ThrPro: 4.01 ± 1.058
ThrGln: 1.604 ± 0.945
ThrArg: 2.005 ± 0.748
ThrSer: 2.005 ± 1.254
ThrThr: 2.807 ± 1.402
ThrVal: 2.807 ± 0.272
ThrTrp: 0.802 ± 0.446
ThrTyr: 1.604 ± 0.637
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.613 ± 1.154
ValCys: 0.401 ± 0.273
ValAsp: 5.613 ± 1.061
ValGlu: 5.213 ± 1.123
ValPhe: 2.406 ± 0.701
ValGly: 4.812 ± 0.935
ValHis: 1.604 ± 0.748
ValIle: 4.01 ± 1.319
ValLys: 2.406 ± 0.78
ValLeu: 4.812 ± 0.883
ValMet: 0.401 ± 0.279
ValAsn: 3.208 ± 0.489
ValPro: 4.812 ± 0.606
ValGln: 5.613 ± 0.47
ValArg: 3.609 ± 1.167
ValSer: 2.406 ± 0.608
ValThr: 4.411 ± 2.128
ValVal: 4.01 ± 1.364
ValTrp: 0.401 ± 0.279
ValTyr: 0.802 ± 0.546
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.005 ± 0.74
TrpCys: 0.0 ± 0.0
TrpAsp: 0.802 ± 0.283
TrpGlu: 1.203 ± 0.819
TrpPhe: 0.401 ± 0.273
TrpGly: 0.401 ± 0.279
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 1.604 ± 0.566
TrpLeu: 1.604 ± 0.729
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 1.203 ± 0.688
TrpSer: 0.401 ± 0.279
TrpThr: 0.401 ± 0.279
TrpVal: 0.802 ± 0.546
TrpTrp: 0.802 ± 0.546
TrpTyr: 0.802 ± 0.362
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.005 ± 0.736
TyrCys: 0.401 ± 0.36
TyrAsp: 1.203 ± 0.492
TyrGlu: 2.005 ± 1.048
TyrPhe: 2.005 ± 0.666
TyrGly: 3.208 ± 1.026
TyrHis: 0.401 ± 0.456
TyrIle: 0.802 ± 0.721
TyrLys: 1.203 ± 0.837
TyrLeu: 3.208 ± 1.188
TyrMet: 0.802 ± 0.283
TyrAsn: 2.807 ± 0.624
TyrPro: 1.604 ± 0.81
TyrGln: 1.203 ± 0.53
TyrArg: 2.005 ± 0.479
TyrSer: 2.005 ± 0.74
TyrThr: 0.401 ± 0.273
TyrVal: 1.203 ± 0.562
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2495 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
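Bootstrapping at the protein level means resampling whole proteins with replacement and recomputing the statistic on each resample. The Python sketch below illustrates the idea for a single dipeptide. It is not the Proteome-pI implementation: the names `count_pairs` and `bootstrap_error`, the fixed seed, and the use of the standard deviation of the resampled frequencies as the reported error are all assumptions.

```python
import random
import statistics


def count_pairs(proteins, dipeptide):
    """Count occurrences of one dipeptide and all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return hits, total


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Spread (in per mille) of a dipeptide frequency over protein resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the proteome, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        hits, total = count_pairs(sample, dipeptide)
        if total:
            estimates.append(1000.0 * hits / total)
    # stdev needs at least two values; report 0.0 otherwise.
    return statistics.stdev(estimates) if len(estimates) > 1 else 0.0
```

With only 6 proteins in this proteome, each resample contains few distinct proteins, which may explain why some errors are large relative to the values they accompany.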

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski