Amino acid dipepetide frequency for Pseudomonas phage JBD68

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.852AlaAla: 14.852 ± 1.727
1.204AlaCys: 1.204 ± 0.3
6.985AlaAsp: 6.985 ± 0.837
7.145AlaGlu: 7.145 ± 0.649
2.408AlaPhe: 2.408 ± 0.509
10.116AlaGly: 10.116 ± 1.049
1.285AlaHis: 1.285 ± 0.329
4.817AlaIle: 4.817 ± 0.607
4.656AlaLys: 4.656 ± 0.587
12.203AlaLeu: 12.203 ± 1.217
4.094AlaMet: 4.094 ± 0.532
3.131AlaAsn: 3.131 ± 0.596
5.459AlaPro: 5.459 ± 0.866
4.335AlaGln: 4.335 ± 0.612
7.466AlaArg: 7.466 ± 0.875
6.583AlaSer: 6.583 ± 0.63
6.503AlaThr: 6.503 ± 0.805
6.744AlaVal: 6.744 ± 0.561
1.846AlaTrp: 1.846 ± 0.385
2.328AlaTyr: 2.328 ± 0.577
0.0AlaXaa: 0.0 ± 0.0
Cys
0.562CysAla: 0.562 ± 0.223
0.08CysCys: 0.08 ± 0.082
0.723CysAsp: 0.723 ± 0.196
0.562CysGlu: 0.562 ± 0.222
0.161CysPhe: 0.161 ± 0.124
0.562CysGly: 0.562 ± 0.24
0.08CysHis: 0.08 ± 0.075
0.321CysIle: 0.321 ± 0.165
0.241CysLys: 0.241 ± 0.126
1.124CysLeu: 1.124 ± 0.229
0.241CysMet: 0.241 ± 0.143
0.321CysAsn: 0.321 ± 0.183
0.401CysPro: 0.401 ± 0.189
0.562CysGln: 0.562 ± 0.23
1.044CysArg: 1.044 ± 0.293
0.723CysSer: 0.723 ± 0.298
0.562CysThr: 0.562 ± 0.216
0.963CysVal: 0.963 ± 0.305
0.241CysTrp: 0.241 ± 0.183
0.161CysTyr: 0.161 ± 0.104
0.0CysXaa: 0.0 ± 0.0
Asp
6.021AspAla: 6.021 ± 0.666
0.482AspCys: 0.482 ± 0.208
3.854AspAsp: 3.854 ± 0.698
4.094AspGlu: 4.094 ± 0.863
1.606AspPhe: 1.606 ± 0.347
6.101AspGly: 6.101 ± 0.778
1.204AspHis: 1.204 ± 0.339
2.087AspIle: 2.087 ± 0.375
1.525AspLys: 1.525 ± 0.417
6.503AspLeu: 6.503 ± 0.712
0.883AspMet: 0.883 ± 0.267
0.723AspAsn: 0.723 ± 0.207
2.81AspPro: 2.81 ± 0.591
2.248AspGln: 2.248 ± 0.459
3.613AspArg: 3.613 ± 0.599
2.81AspSer: 2.81 ± 0.417
2.408AspThr: 2.408 ± 0.401
4.094AspVal: 4.094 ± 0.539
1.365AspTrp: 1.365 ± 0.354
1.365AspTyr: 1.365 ± 0.396
0.0AspXaa: 0.0 ± 0.0
Glu
7.386GluAla: 7.386 ± 0.579
0.562GluCys: 0.562 ± 0.24
2.248GluAsp: 2.248 ± 0.334
4.255GluGlu: 4.255 ± 0.65
2.569GluPhe: 2.569 ± 0.351
3.532GluGly: 3.532 ± 0.549
1.044GluHis: 1.044 ± 0.278
4.014GluIle: 4.014 ± 0.526
1.927GluLys: 1.927 ± 0.334
6.824GluLeu: 6.824 ± 0.726
2.168GluMet: 2.168 ± 0.388
1.204GluAsn: 1.204 ± 0.325
2.81GluPro: 2.81 ± 0.532
3.211GluGln: 3.211 ± 0.611
6.744GluArg: 6.744 ± 0.723
3.532GluSer: 3.532 ± 0.493
2.73GluThr: 2.73 ± 0.414
5.058GluVal: 5.058 ± 0.679
1.204GluTrp: 1.204 ± 0.411
1.525GluTyr: 1.525 ± 0.366
0.0GluXaa: 0.0 ± 0.0
Phe
3.854PheAla: 3.854 ± 0.569
0.241PheCys: 0.241 ± 0.136
1.606PheAsp: 1.606 ± 0.346
2.007PheGlu: 2.007 ± 0.315
0.562PhePhe: 0.562 ± 0.206
2.649PheGly: 2.649 ± 0.471
0.401PheHis: 0.401 ± 0.193
0.963PheIle: 0.963 ± 0.24
1.124PheLys: 1.124 ± 0.28
2.168PheLeu: 2.168 ± 0.44
0.642PheMet: 0.642 ± 0.199
1.204PheAsn: 1.204 ± 0.403
1.044PhePro: 1.044 ± 0.25
1.204PheGln: 1.204 ± 0.344
1.044PheArg: 1.044 ± 0.276
2.168PheSer: 2.168 ± 0.473
2.328PheThr: 2.328 ± 0.338
2.97PheVal: 2.97 ± 0.53
0.401PheTrp: 0.401 ± 0.165
1.124PheTyr: 1.124 ± 0.261
0.0PheXaa: 0.0 ± 0.0
Gly
8.59GlyAla: 8.59 ± 0.851
0.642GlyCys: 0.642 ± 0.219
4.335GlyAsp: 4.335 ± 0.534
5.62GlyGlu: 5.62 ± 0.663
3.452GlyPhe: 3.452 ± 0.467
7.627GlyGly: 7.627 ± 1.112
1.285GlyHis: 1.285 ± 0.341
3.292GlyIle: 3.292 ± 0.512
4.094GlyLys: 4.094 ± 0.604
7.145GlyLeu: 7.145 ± 0.636
2.73GlyMet: 2.73 ± 0.475
3.051GlyAsn: 3.051 ± 0.487
2.97GlyPro: 2.97 ± 0.382
3.452GlyGln: 3.452 ± 0.517
5.861GlyArg: 5.861 ± 0.707
5.379GlySer: 5.379 ± 0.634
5.058GlyThr: 5.058 ± 0.826
6.423GlyVal: 6.423 ± 0.588
1.525GlyTrp: 1.525 ± 0.342
1.686GlyTyr: 1.686 ± 0.394
0.0GlyXaa: 0.0 ± 0.0
His
1.846HisAla: 1.846 ± 0.315
0.241HisCys: 0.241 ± 0.134
0.883HisAsp: 0.883 ± 0.225
1.124HisGlu: 1.124 ± 0.31
0.401HisPhe: 0.401 ± 0.162
1.285HisGly: 1.285 ± 0.271
0.321HisHis: 0.321 ± 0.144
0.482HisIle: 0.482 ± 0.205
0.803HisLys: 0.803 ± 0.223
0.883HisLeu: 0.883 ± 0.262
0.161HisMet: 0.161 ± 0.112
0.321HisAsn: 0.321 ± 0.142
0.883HisPro: 0.883 ± 0.213
0.723HisGln: 0.723 ± 0.243
1.365HisArg: 1.365 ± 0.321
0.803HisSer: 0.803 ± 0.225
0.723HisThr: 0.723 ± 0.307
1.365HisVal: 1.365 ± 0.334
0.321HisTrp: 0.321 ± 0.15
0.482HisTyr: 0.482 ± 0.189
0.0HisXaa: 0.0 ± 0.0
Ile
6.182IleAla: 6.182 ± 0.607
0.321IleCys: 0.321 ± 0.14
3.292IleAsp: 3.292 ± 0.485
2.89IleGlu: 2.89 ± 0.428
1.285IlePhe: 1.285 ± 0.376
4.576IleGly: 4.576 ± 0.558
0.642IleHis: 0.642 ± 0.247
1.365IleIle: 1.365 ± 0.317
1.365IleLys: 1.365 ± 0.33
2.248IleLeu: 2.248 ± 0.443
1.124IleMet: 1.124 ± 0.293
2.087IleAsn: 2.087 ± 0.496
3.372IlePro: 3.372 ± 0.446
1.445IleGln: 1.445 ± 0.316
3.372IleArg: 3.372 ± 0.369
2.81IleSer: 2.81 ± 0.57
3.372IleThr: 3.372 ± 0.428
2.649IleVal: 2.649 ± 0.515
0.161IleTrp: 0.161 ± 0.122
1.365IleTyr: 1.365 ± 0.282
0.0IleXaa: 0.0 ± 0.0
Lys
4.737LysAla: 4.737 ± 0.857
0.642LysCys: 0.642 ± 0.235
1.846LysAsp: 1.846 ± 0.428
1.927LysGlu: 1.927 ± 0.434
0.963LysPhe: 0.963 ± 0.238
3.292LysGly: 3.292 ± 0.416
1.124LysHis: 1.124 ± 0.337
1.525LysIle: 1.525 ± 0.339
2.328LysLys: 2.328 ± 0.451
2.97LysLeu: 2.97 ± 0.464
0.803LysMet: 0.803 ± 0.222
0.642LysAsn: 0.642 ± 0.207
2.489LysPro: 2.489 ± 0.371
1.124LysGln: 1.124 ± 0.282
2.97LysArg: 2.97 ± 0.579
2.569LysSer: 2.569 ± 0.556
2.248LysThr: 2.248 ± 0.408
2.569LysVal: 2.569 ± 0.589
0.321LysTrp: 0.321 ± 0.204
1.124LysTyr: 1.124 ± 0.28
0.0LysXaa: 0.0 ± 0.0
Leu
9.794LeuAla: 9.794 ± 0.816
0.883LeuCys: 0.883 ± 0.297
5.299LeuAsp: 5.299 ± 0.617
6.182LeuGlu: 6.182 ± 0.641
2.248LeuPhe: 2.248 ± 0.391
6.985LeuGly: 6.985 ± 0.735
1.285LeuHis: 1.285 ± 0.268
4.175LeuIle: 4.175 ± 0.472
3.372LeuLys: 3.372 ± 0.639
7.225LeuLeu: 7.225 ± 0.915
2.81LeuMet: 2.81 ± 0.529
4.094LeuAsn: 4.094 ± 0.603
4.978LeuPro: 4.978 ± 0.713
4.817LeuGln: 4.817 ± 0.7
6.985LeuArg: 6.985 ± 0.753
5.941LeuSer: 5.941 ± 0.622
6.021LeuThr: 6.021 ± 0.686
6.503LeuVal: 6.503 ± 0.596
0.723LeuTrp: 0.723 ± 0.232
1.445LeuTyr: 1.445 ± 0.266
0.0LeuXaa: 0.0 ± 0.0
Met
3.693MetAla: 3.693 ± 0.505
0.241MetCys: 0.241 ± 0.118
1.686MetAsp: 1.686 ± 0.361
1.606MetGlu: 1.606 ± 0.318
0.161MetPhe: 0.161 ± 0.107
1.204MetGly: 1.204 ± 0.347
0.08MetHis: 0.08 ± 0.081
1.124MetIle: 1.124 ± 0.322
1.124MetLys: 1.124 ± 0.327
2.408MetLeu: 2.408 ± 0.386
0.562MetMet: 0.562 ± 0.236
0.963MetAsn: 0.963 ± 0.286
1.285MetPro: 1.285 ± 0.296
1.285MetGln: 1.285 ± 0.28
2.007MetArg: 2.007 ± 0.33
2.087MetSer: 2.087 ± 0.397
2.168MetThr: 2.168 ± 0.346
1.285MetVal: 1.285 ± 0.216
0.482MetTrp: 0.482 ± 0.231
0.642MetTyr: 0.642 ± 0.261
0.0MetXaa: 0.0 ± 0.0
Asn
2.89AsnAla: 2.89 ± 0.365
0.241AsnCys: 0.241 ± 0.157
1.686AsnAsp: 1.686 ± 0.307
1.124AsnGlu: 1.124 ± 0.401
1.204AsnPhe: 1.204 ± 0.382
3.372AsnGly: 3.372 ± 0.57
0.562AsnHis: 0.562 ± 0.175
0.803AsnIle: 0.803 ± 0.278
1.606AsnLys: 1.606 ± 0.334
3.211AsnLeu: 3.211 ± 0.624
0.642AsnMet: 0.642 ± 0.222
0.723AsnAsn: 0.723 ± 0.283
2.408AsnPro: 2.408 ± 0.419
0.963AsnGln: 0.963 ± 0.312
2.168AsnArg: 2.168 ± 0.5
1.846AsnSer: 1.846 ± 0.327
1.686AsnThr: 1.686 ± 0.427
1.846AsnVal: 1.846 ± 0.358
0.963AsnTrp: 0.963 ± 0.314
0.803AsnTyr: 0.803 ± 0.279
0.0AsnXaa: 0.0 ± 0.0
Pro
5.7ProAla: 5.7 ± 0.874
0.161ProCys: 0.161 ± 0.122
3.372ProAsp: 3.372 ± 0.604
3.854ProGlu: 3.854 ± 0.513
1.686ProPhe: 1.686 ± 0.404
4.817ProGly: 4.817 ± 0.564
0.723ProHis: 0.723 ± 0.208
2.569ProIle: 2.569 ± 0.512
1.525ProLys: 1.525 ± 0.437
4.817ProLeu: 4.817 ± 0.501
1.686ProMet: 1.686 ± 0.355
2.007ProAsn: 2.007 ± 0.537
3.211ProPro: 3.211 ± 0.547
1.606ProGln: 1.606 ± 0.356
2.489ProArg: 2.489 ± 0.386
3.292ProSer: 3.292 ± 0.537
2.81ProThr: 2.81 ± 0.396
2.81ProVal: 2.81 ± 0.503
1.204ProTrp: 1.204 ± 0.304
1.285ProTyr: 1.285 ± 0.316
0.0ProXaa: 0.0 ± 0.0
Gln
5.218GlnAla: 5.218 ± 0.742
0.723GlnCys: 0.723 ± 0.238
1.044GlnAsp: 1.044 ± 0.323
2.89GlnGlu: 2.89 ± 0.511
1.525GlnPhe: 1.525 ± 0.328
2.649GlnGly: 2.649 ± 0.377
0.883GlnHis: 0.883 ± 0.212
2.73GlnIle: 2.73 ± 0.443
1.204GlnLys: 1.204 ± 0.375
3.854GlnLeu: 3.854 ± 0.567
1.606GlnMet: 1.606 ± 0.299
1.606GlnAsn: 1.606 ± 0.411
2.73GlnPro: 2.73 ± 0.514
3.292GlnGln: 3.292 ± 0.679
2.73GlnArg: 2.73 ± 0.586
1.766GlnSer: 1.766 ± 0.452
1.846GlnThr: 1.846 ± 0.313
2.649GlnVal: 2.649 ± 0.396
0.321GlnTrp: 0.321 ± 0.153
1.044GlnTyr: 1.044 ± 0.243
0.0GlnXaa: 0.0 ± 0.0
Arg
6.663ArgAla: 6.663 ± 0.661
0.883ArgCys: 0.883 ± 0.305
3.934ArgAsp: 3.934 ± 0.551
5.299ArgGlu: 5.299 ± 0.6
2.168ArgPhe: 2.168 ± 0.401
5.62ArgGly: 5.62 ± 0.635
0.883ArgHis: 0.883 ± 0.285
3.934ArgIle: 3.934 ± 0.473
3.452ArgLys: 3.452 ± 0.699
6.824ArgLeu: 6.824 ± 0.721
1.124ArgMet: 1.124 ± 0.331
1.766ArgAsn: 1.766 ± 0.314
3.613ArgPro: 3.613 ± 0.618
2.89ArgGln: 2.89 ± 0.469
6.182ArgArg: 6.182 ± 0.815
3.372ArgSer: 3.372 ± 0.452
3.693ArgThr: 3.693 ± 0.538
4.817ArgVal: 4.817 ± 0.774
1.606ArgTrp: 1.606 ± 0.415
2.007ArgTyr: 2.007 ± 0.403
0.0ArgXaa: 0.0 ± 0.0
Ser
7.306SerAla: 7.306 ± 0.754
0.562SerCys: 0.562 ± 0.198
3.693SerAsp: 3.693 ± 0.504
3.211SerGlu: 3.211 ± 0.486
1.686SerPhe: 1.686 ± 0.343
6.503SerGly: 6.503 ± 0.683
0.562SerHis: 0.562 ± 0.221
3.613SerIle: 3.613 ± 0.471
2.408SerLys: 2.408 ± 0.44
5.058SerLeu: 5.058 ± 0.667
1.525SerMet: 1.525 ± 0.318
2.408SerAsn: 2.408 ± 0.468
2.489SerPro: 2.489 ± 0.484
2.649SerGln: 2.649 ± 0.5
3.773SerArg: 3.773 ± 0.539
4.656SerSer: 4.656 ± 0.682
2.81SerThr: 2.81 ± 0.512
4.496SerVal: 4.496 ± 0.618
0.883SerTrp: 0.883 ± 0.242
1.606SerTyr: 1.606 ± 0.29
0.0SerXaa: 0.0 ± 0.0
Thr
6.824ThrAla: 6.824 ± 0.692
0.401ThrCys: 0.401 ± 0.151
2.168ThrAsp: 2.168 ± 0.372
3.292ThrGlu: 3.292 ± 0.464
2.248ThrPhe: 2.248 ± 0.499
4.897ThrGly: 4.897 ± 0.621
1.204ThrHis: 1.204 ± 0.286
2.408ThrIle: 2.408 ± 0.501
1.846ThrLys: 1.846 ± 0.326
6.021ThrLeu: 6.021 ± 0.76
0.562ThrMet: 0.562 ± 0.202
1.445ThrAsn: 1.445 ± 0.273
2.649ThrPro: 2.649 ± 0.493
2.007ThrGln: 2.007 ± 0.354
2.489ThrArg: 2.489 ± 0.437
4.094ThrSer: 4.094 ± 0.463
2.489ThrThr: 2.489 ± 0.587
5.138ThrVal: 5.138 ± 0.625
1.766ThrTrp: 1.766 ± 0.383
1.606ThrTyr: 1.606 ± 0.35
0.0ThrXaa: 0.0 ± 0.0
Val
7.466ValAla: 7.466 ± 0.784
0.642ValCys: 0.642 ± 0.268
5.138ValAsp: 5.138 ± 0.672
5.62ValGlu: 5.62 ± 0.755
2.007ValPhe: 2.007 ± 0.368
5.138ValGly: 5.138 ± 0.632
1.124ValHis: 1.124 ± 0.284
3.532ValIle: 3.532 ± 0.519
2.489ValLys: 2.489 ± 0.468
5.299ValLeu: 5.299 ± 0.556
1.766ValMet: 1.766 ± 0.365
1.686ValAsn: 1.686 ± 0.293
3.854ValPro: 3.854 ± 0.66
3.051ValGln: 3.051 ± 0.631
5.058ValArg: 5.058 ± 0.769
4.416ValSer: 4.416 ± 0.822
4.416ValThr: 4.416 ± 0.687
4.737ValVal: 4.737 ± 0.629
1.124ValTrp: 1.124 ± 0.38
1.846ValTyr: 1.846 ± 0.449
0.0ValXaa: 0.0 ± 0.0
Trp
1.606TrpAla: 1.606 ± 0.47
0.161TrpCys: 0.161 ± 0.117
0.723TrpAsp: 0.723 ± 0.261
0.803TrpGlu: 0.803 ± 0.257
0.562TrpPhe: 0.562 ± 0.182
1.124TrpGly: 1.124 ± 0.266
0.161TrpHis: 0.161 ± 0.111
1.204TrpIle: 1.204 ± 0.312
0.321TrpLys: 0.321 ± 0.187
2.89TrpLeu: 2.89 ± 0.598
0.642TrpMet: 0.642 ± 0.235
0.482TrpAsn: 0.482 ± 0.223
0.883TrpPro: 0.883 ± 0.264
0.723TrpGln: 0.723 ± 0.256
0.883TrpArg: 0.883 ± 0.239
0.963TrpSer: 0.963 ± 0.31
0.723TrpThr: 0.723 ± 0.345
1.365TrpVal: 1.365 ± 0.298
0.241TrpTrp: 0.241 ± 0.153
0.482TrpTyr: 0.482 ± 0.225
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.97TyrAla: 2.97 ± 0.447
0.321TyrCys: 0.321 ± 0.162
1.285TyrAsp: 1.285 ± 0.278
1.044TyrGlu: 1.044 ± 0.268
0.803TyrPhe: 0.803 ± 0.288
2.168TyrGly: 2.168 ± 0.319
0.562TyrHis: 0.562 ± 0.254
0.963TyrIle: 0.963 ± 0.276
0.803TyrLys: 0.803 ± 0.283
2.087TyrLeu: 2.087 ± 0.403
0.241TyrMet: 0.241 ± 0.16
0.883TyrAsn: 0.883 ± 0.213
1.285TyrPro: 1.285 ± 0.331
0.723TyrGln: 0.723 ± 0.199
2.569TyrArg: 2.569 ± 0.476
2.087TyrSer: 2.087 ± 0.396
0.963TyrThr: 0.963 ± 0.314
1.927TyrVal: 1.927 ± 0.406
0.241TyrTrp: 0.241 ± 0.12
0.401TyrTyr: 0.401 ± 0.189
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (12457 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski