Amino acid dipepetide frequency for Pseudoalteromonas phage PH1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.297AlaAla: 7.297 ± 1.176
0.602AlaCys: 0.602 ± 0.248
4.062AlaAsp: 4.062 ± 0.472
5.642AlaGlu: 5.642 ± 0.692
2.558AlaPhe: 2.558 ± 0.441
5.115AlaGly: 5.115 ± 0.733
1.58AlaHis: 1.58 ± 0.312
4.514AlaIle: 4.514 ± 0.672
6.244AlaLys: 6.244 ± 1.158
7.372AlaLeu: 7.372 ± 0.831
2.407AlaMet: 2.407 ± 0.571
4.062AlaAsn: 4.062 ± 0.468
2.332AlaPro: 2.332 ± 0.298
3.987AlaGln: 3.987 ± 0.605
4.288AlaArg: 4.288 ± 0.616
5.868AlaSer: 5.868 ± 0.803
5.341AlaThr: 5.341 ± 0.698
5.341AlaVal: 5.341 ± 0.693
1.429AlaTrp: 1.429 ± 0.409
2.483AlaTyr: 2.483 ± 0.47
0.0AlaXaa: 0.0 ± 0.0
Cys
0.602CysAla: 0.602 ± 0.212
0.15CysCys: 0.15 ± 0.113
0.376CysAsp: 0.376 ± 0.214
0.752CysGlu: 0.752 ± 0.237
0.15CysPhe: 0.15 ± 0.102
0.527CysGly: 0.527 ± 0.205
0.15CysHis: 0.15 ± 0.099
0.527CysIle: 0.527 ± 0.199
0.978CysLys: 0.978 ± 0.269
0.677CysLeu: 0.677 ± 0.243
0.527CysMet: 0.527 ± 0.164
0.451CysAsn: 0.451 ± 0.221
0.828CysPro: 0.828 ± 0.275
0.226CysGln: 0.226 ± 0.137
0.301CysArg: 0.301 ± 0.147
0.602CysSer: 0.602 ± 0.223
0.828CysThr: 0.828 ± 0.279
0.677CysVal: 0.677 ± 0.277
0.226CysTrp: 0.226 ± 0.165
0.226CysTyr: 0.226 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
5.266AspAla: 5.266 ± 0.711
0.677AspCys: 0.677 ± 0.221
3.235AspAsp: 3.235 ± 0.545
5.341AspGlu: 5.341 ± 0.785
3.31AspPhe: 3.31 ± 0.391
4.815AspGly: 4.815 ± 0.632
0.677AspHis: 0.677 ± 0.234
4.514AspIle: 4.514 ± 0.534
3.686AspLys: 3.686 ± 0.554
5.341AspLeu: 5.341 ± 0.531
1.805AspMet: 1.805 ± 0.344
2.934AspAsn: 2.934 ± 0.481
2.407AspPro: 2.407 ± 0.404
2.031AspGln: 2.031 ± 0.4
2.031AspArg: 2.031 ± 0.487
3.686AspSer: 3.686 ± 0.432
4.89AspThr: 4.89 ± 0.584
2.934AspVal: 2.934 ± 0.442
0.602AspTrp: 0.602 ± 0.201
2.708AspTyr: 2.708 ± 0.5
0.0AspXaa: 0.0 ± 0.0
Glu
6.47GluAla: 6.47 ± 0.493
0.602GluCys: 0.602 ± 0.216
4.438GluAsp: 4.438 ± 0.633
5.793GluGlu: 5.793 ± 0.704
3.31GluPhe: 3.31 ± 0.354
5.341GluGly: 5.341 ± 0.534
1.73GluHis: 1.73 ± 0.587
3.837GluIle: 3.837 ± 0.463
2.934GluLys: 2.934 ± 0.423
8.35GluLeu: 8.35 ± 0.932
1.354GluMet: 1.354 ± 0.351
2.708GluAsn: 2.708 ± 0.48
1.655GluPro: 1.655 ± 0.415
3.611GluGln: 3.611 ± 0.421
3.31GluArg: 3.31 ± 0.603
3.686GluSer: 3.686 ± 0.28
3.31GluThr: 3.31 ± 0.429
5.642GluVal: 5.642 ± 0.683
1.053GluTrp: 1.053 ± 0.3
3.611GluTyr: 3.611 ± 0.453
0.0GluXaa: 0.0 ± 0.0
Phe
2.708PheAla: 2.708 ± 0.325
0.752PheCys: 0.752 ± 0.267
3.46PheAsp: 3.46 ± 0.44
3.16PheGlu: 3.16 ± 0.392
0.677PhePhe: 0.677 ± 0.236
3.009PheGly: 3.009 ± 0.4
0.226PheHis: 0.226 ± 0.13
2.106PheIle: 2.106 ± 0.361
3.536PheLys: 3.536 ± 0.417
2.859PheLeu: 2.859 ± 0.598
0.677PheMet: 0.677 ± 0.289
2.332PheAsn: 2.332 ± 0.361
0.978PhePro: 0.978 ± 0.273
1.354PheGln: 1.354 ± 0.36
1.279PheArg: 1.279 ± 0.345
2.708PheSer: 2.708 ± 0.59
2.332PheThr: 2.332 ± 0.486
2.031PheVal: 2.031 ± 0.501
0.752PheTrp: 0.752 ± 0.247
1.053PheTyr: 1.053 ± 0.277
0.0PheXaa: 0.0 ± 0.0
Gly
4.138GlyAla: 4.138 ± 0.787
0.677GlyCys: 0.677 ± 0.216
4.739GlyAsp: 4.739 ± 0.523
4.89GlyGlu: 4.89 ± 0.588
2.332GlyPhe: 2.332 ± 0.399
3.987GlyGly: 3.987 ± 0.6
1.128GlyHis: 1.128 ± 0.298
4.363GlyIle: 4.363 ± 0.572
4.589GlyLys: 4.589 ± 0.678
4.514GlyLeu: 4.514 ± 0.486
1.73GlyMet: 1.73 ± 0.293
3.987GlyAsn: 3.987 ± 0.568
1.58GlyPro: 1.58 ± 0.291
1.956GlyGln: 1.956 ± 0.425
3.009GlyArg: 3.009 ± 0.522
3.912GlySer: 3.912 ± 0.674
4.589GlyThr: 4.589 ± 0.637
4.664GlyVal: 4.664 ± 0.59
0.828GlyTrp: 0.828 ± 0.22
2.934GlyTyr: 2.934 ± 0.405
0.0GlyXaa: 0.0 ± 0.0
His
1.429HisAla: 1.429 ± 0.304
0.301HisCys: 0.301 ± 0.133
0.677HisAsp: 0.677 ± 0.194
1.053HisGlu: 1.053 ± 0.294
0.602HisPhe: 0.602 ± 0.163
1.354HisGly: 1.354 ± 0.295
0.226HisHis: 0.226 ± 0.126
0.978HisIle: 0.978 ± 0.292
1.505HisLys: 1.505 ± 0.358
1.505HisLeu: 1.505 ± 0.319
0.527HisMet: 0.527 ± 0.216
1.505HisAsn: 1.505 ± 0.363
0.602HisPro: 0.602 ± 0.218
0.677HisGln: 0.677 ± 0.315
0.828HisArg: 0.828 ± 0.235
1.354HisSer: 1.354 ± 0.336
1.053HisThr: 1.053 ± 0.3
0.978HisVal: 0.978 ± 0.319
0.226HisTrp: 0.226 ± 0.165
1.053HisTyr: 1.053 ± 0.343
0.0HisXaa: 0.0 ± 0.0
Ile
5.341IleAla: 5.341 ± 0.631
0.451IleCys: 0.451 ± 0.187
3.761IleAsp: 3.761 ± 0.627
4.514IleGlu: 4.514 ± 0.484
1.279IlePhe: 1.279 ± 0.257
2.633IleGly: 2.633 ± 0.384
0.903IleHis: 0.903 ± 0.254
2.407IleIle: 2.407 ± 0.428
3.46IleLys: 3.46 ± 0.391
4.213IleLeu: 4.213 ± 0.664
1.58IleMet: 1.58 ± 0.489
2.558IleAsn: 2.558 ± 0.39
1.956IlePro: 1.956 ± 0.344
2.859IleGln: 2.859 ± 0.354
2.633IleArg: 2.633 ± 0.5
3.686IleSer: 3.686 ± 0.666
4.213IleThr: 4.213 ± 0.618
2.633IleVal: 2.633 ± 0.491
0.527IleTrp: 0.527 ± 0.169
1.956IleTyr: 1.956 ± 0.331
0.0IleXaa: 0.0 ± 0.0
Lys
5.416LysAla: 5.416 ± 0.852
0.451LysCys: 0.451 ± 0.17
3.46LysAsp: 3.46 ± 0.462
4.664LysGlu: 4.664 ± 0.58
3.009LysPhe: 3.009 ± 0.593
3.761LysGly: 3.761 ± 0.482
1.58LysHis: 1.58 ± 0.322
2.783LysIle: 2.783 ± 0.467
4.138LysLys: 4.138 ± 0.603
7.974LysLeu: 7.974 ± 0.808
1.881LysMet: 1.881 ± 0.382
2.407LysAsn: 2.407 ± 0.493
2.332LysPro: 2.332 ± 0.384
2.934LysGln: 2.934 ± 0.527
4.062LysArg: 4.062 ± 0.501
3.837LysSer: 3.837 ± 0.527
4.213LysThr: 4.213 ± 0.507
4.815LysVal: 4.815 ± 0.697
1.053LysTrp: 1.053 ± 0.312
2.859LysTyr: 2.859 ± 0.577
0.0LysXaa: 0.0 ± 0.0
Leu
7.372LeuAla: 7.372 ± 0.691
0.978LeuCys: 0.978 ± 0.279
7.147LeuAsp: 7.147 ± 0.62
6.77LeuGlu: 6.77 ± 0.77
2.257LeuPhe: 2.257 ± 0.355
6.394LeuGly: 6.394 ± 0.615
1.429LeuHis: 1.429 ± 0.324
3.31LeuIle: 3.31 ± 0.458
5.943LeuLys: 5.943 ± 0.731
4.589LeuLeu: 4.589 ± 0.794
2.407LeuMet: 2.407 ± 0.378
4.438LeuAsn: 4.438 ± 0.53
3.686LeuPro: 3.686 ± 0.419
4.363LeuGln: 4.363 ± 0.627
3.009LeuArg: 3.009 ± 0.421
6.47LeuSer: 6.47 ± 0.609
4.89LeuThr: 4.89 ± 0.526
6.394LeuVal: 6.394 ± 0.749
0.903LeuTrp: 0.903 ± 0.365
3.235LeuTyr: 3.235 ± 0.505
0.0LeuXaa: 0.0 ± 0.0
Met
2.633MetAla: 2.633 ± 0.554
0.301MetCys: 0.301 ± 0.158
1.204MetAsp: 1.204 ± 0.275
1.128MetGlu: 1.128 ± 0.314
1.429MetPhe: 1.429 ± 0.299
1.279MetGly: 1.279 ± 0.358
1.204MetHis: 1.204 ± 0.279
1.204MetIle: 1.204 ± 0.374
1.881MetLys: 1.881 ± 0.392
2.558MetLeu: 2.558 ± 0.358
0.677MetMet: 0.677 ± 0.193
0.752MetAsn: 0.752 ± 0.249
0.903MetPro: 0.903 ± 0.331
0.978MetGln: 0.978 ± 0.349
1.429MetArg: 1.429 ± 0.35
2.558MetSer: 2.558 ± 0.436
1.73MetThr: 1.73 ± 0.275
1.279MetVal: 1.279 ± 0.411
0.451MetTrp: 0.451 ± 0.166
0.978MetTyr: 0.978 ± 0.252
0.0MetXaa: 0.0 ± 0.0
Asn
3.686AsnAla: 3.686 ± 0.548
0.602AsnCys: 0.602 ± 0.237
3.46AsnAsp: 3.46 ± 0.507
3.385AsnGlu: 3.385 ± 0.403
2.257AsnPhe: 2.257 ± 0.379
2.934AsnGly: 2.934 ± 0.472
0.752AsnHis: 0.752 ± 0.214
3.686AsnIle: 3.686 ± 0.48
3.686AsnLys: 3.686 ± 0.625
3.536AsnLeu: 3.536 ± 0.537
1.73AsnMet: 1.73 ± 0.255
2.859AsnAsn: 2.859 ± 0.416
2.407AsnPro: 2.407 ± 0.424
0.978AsnGln: 0.978 ± 0.257
1.805AsnArg: 1.805 ± 0.367
3.385AsnSer: 3.385 ± 0.457
2.934AsnThr: 2.934 ± 0.414
2.483AsnVal: 2.483 ± 0.391
0.828AsnTrp: 0.828 ± 0.194
2.407AsnTyr: 2.407 ± 0.427
0.0AsnXaa: 0.0 ± 0.0
Pro
2.934ProAla: 2.934 ± 0.501
0.226ProCys: 0.226 ± 0.127
2.483ProAsp: 2.483 ± 0.374
3.235ProGlu: 3.235 ± 0.509
0.828ProPhe: 0.828 ± 0.249
1.73ProGly: 1.73 ± 0.327
0.828ProHis: 0.828 ± 0.22
1.881ProIle: 1.881 ± 0.411
1.805ProLys: 1.805 ± 0.426
2.031ProLeu: 2.031 ± 0.34
1.053ProMet: 1.053 ± 0.3
2.633ProAsn: 2.633 ± 0.372
1.279ProPro: 1.279 ± 0.296
1.204ProGln: 1.204 ± 0.376
1.354ProArg: 1.354 ± 0.242
2.106ProSer: 2.106 ± 0.355
2.483ProThr: 2.483 ± 0.459
2.332ProVal: 2.332 ± 0.425
0.301ProTrp: 0.301 ± 0.149
1.505ProTyr: 1.505 ± 0.306
0.0ProXaa: 0.0 ± 0.0
Gln
3.761GlnAla: 3.761 ± 0.609
0.301GlnCys: 0.301 ± 0.167
2.031GlnAsp: 2.031 ± 0.37
2.182GlnGlu: 2.182 ± 0.533
1.354GlnPhe: 1.354 ± 0.327
3.46GlnGly: 3.46 ± 0.446
0.903GlnHis: 0.903 ± 0.221
1.73GlnIle: 1.73 ± 0.405
2.106GlnLys: 2.106 ± 0.43
4.815GlnLeu: 4.815 ± 0.563
0.978GlnMet: 0.978 ± 0.256
1.805GlnAsn: 1.805 ± 0.361
0.903GlnPro: 0.903 ± 0.332
2.257GlnGln: 2.257 ± 0.554
2.182GlnArg: 2.182 ± 0.386
2.633GlnSer: 2.633 ± 0.521
1.73GlnThr: 1.73 ± 0.359
3.536GlnVal: 3.536 ± 0.444
0.677GlnTrp: 0.677 ± 0.212
1.655GlnTyr: 1.655 ± 0.349
0.0GlnXaa: 0.0 ± 0.0
Arg
3.686ArgAla: 3.686 ± 0.525
0.451ArgCys: 0.451 ± 0.188
2.407ArgAsp: 2.407 ± 0.355
3.084ArgGlu: 3.084 ± 0.489
1.881ArgPhe: 1.881 ± 0.334
2.708ArgGly: 2.708 ± 0.425
0.527ArgHis: 0.527 ± 0.186
3.009ArgIle: 3.009 ± 0.621
3.31ArgLys: 3.31 ± 0.545
4.363ArgLeu: 4.363 ± 0.547
1.128ArgMet: 1.128 ± 0.365
1.805ArgAsn: 1.805 ± 0.404
1.881ArgPro: 1.881 ± 0.312
1.881ArgGln: 1.881 ± 0.456
3.084ArgArg: 3.084 ± 0.63
2.407ArgSer: 2.407 ± 0.403
2.483ArgThr: 2.483 ± 0.475
2.934ArgVal: 2.934 ± 0.405
0.677ArgTrp: 0.677 ± 0.257
2.483ArgTyr: 2.483 ± 0.375
0.0ArgXaa: 0.0 ± 0.0
Ser
5.191SerAla: 5.191 ± 0.627
0.527SerCys: 0.527 ± 0.185
4.213SerAsp: 4.213 ± 0.633
5.191SerGlu: 5.191 ± 0.54
2.934SerPhe: 2.934 ± 0.321
5.191SerGly: 5.191 ± 0.691
1.204SerHis: 1.204 ± 0.342
3.837SerIle: 3.837 ± 0.526
5.115SerLys: 5.115 ± 0.554
4.815SerLeu: 4.815 ± 0.611
1.956SerMet: 1.956 ± 0.351
3.084SerAsn: 3.084 ± 0.431
1.956SerPro: 1.956 ± 0.336
2.182SerGln: 2.182 ± 0.347
2.934SerArg: 2.934 ± 0.466
3.16SerSer: 3.16 ± 0.549
4.815SerThr: 4.815 ± 0.648
2.708SerVal: 2.708 ± 0.37
0.527SerTrp: 0.527 ± 0.167
2.332SerTyr: 2.332 ± 0.505
0.0SerXaa: 0.0 ± 0.0
Thr
6.018ThrAla: 6.018 ± 0.853
0.301ThrCys: 0.301 ± 0.135
3.686ThrAsp: 3.686 ± 0.456
3.686ThrGlu: 3.686 ± 0.597
3.31ThrPhe: 3.31 ± 0.359
3.987ThrGly: 3.987 ± 0.537
0.978ThrHis: 0.978 ± 0.275
3.009ThrIle: 3.009 ± 0.515
4.514ThrLys: 4.514 ± 0.589
6.018ThrLeu: 6.018 ± 0.846
1.279ThrMet: 1.279 ± 0.308
3.31ThrAsn: 3.31 ± 0.452
2.633ThrPro: 2.633 ± 0.382
2.859ThrGln: 2.859 ± 0.56
2.934ThrArg: 2.934 ± 0.387
3.31ThrSer: 3.31 ± 0.523
4.664ThrThr: 4.664 ± 0.591
4.438ThrVal: 4.438 ± 0.571
0.226ThrTrp: 0.226 ± 0.145
2.407ThrTyr: 2.407 ± 0.387
0.0ThrXaa: 0.0 ± 0.0
Val
5.416ValAla: 5.416 ± 0.663
0.602ValCys: 0.602 ± 0.239
4.965ValAsp: 4.965 ± 0.671
5.04ValGlu: 5.04 ± 0.64
2.106ValPhe: 2.106 ± 0.393
3.009ValGly: 3.009 ± 0.439
1.429ValHis: 1.429 ± 0.327
3.611ValIle: 3.611 ± 0.429
4.213ValLys: 4.213 ± 0.516
6.093ValLeu: 6.093 ± 0.599
1.354ValMet: 1.354 ± 0.286
2.332ValAsn: 2.332 ± 0.457
2.182ValPro: 2.182 ± 0.402
2.633ValGln: 2.633 ± 0.376
2.708ValArg: 2.708 ± 0.495
4.589ValSer: 4.589 ± 0.521
4.138ValThr: 4.138 ± 0.514
5.341ValVal: 5.341 ± 0.765
1.204ValTrp: 1.204 ± 0.262
2.031ValTyr: 2.031 ± 0.546
0.0ValXaa: 0.0 ± 0.0
Trp
0.527TrpAla: 0.527 ± 0.188
0.075TrpCys: 0.075 ± 0.066
0.978TrpAsp: 0.978 ± 0.314
0.752TrpGlu: 0.752 ± 0.252
0.677TrpPhe: 0.677 ± 0.229
0.752TrpGly: 0.752 ± 0.274
0.527TrpHis: 0.527 ± 0.144
0.677TrpIle: 0.677 ± 0.294
0.677TrpLys: 0.677 ± 0.252
0.978TrpLeu: 0.978 ± 0.263
0.376TrpMet: 0.376 ± 0.177
0.903TrpAsn: 0.903 ± 0.269
0.301TrpPro: 0.301 ± 0.146
0.677TrpGln: 0.677 ± 0.2
0.978TrpArg: 0.978 ± 0.346
0.828TrpSer: 0.828 ± 0.183
0.451TrpThr: 0.451 ± 0.2
1.279TrpVal: 1.279 ± 0.306
0.0TrpTrp: 0.0 ± 0.0
0.677TrpTyr: 0.677 ± 0.252
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.633TyrAla: 2.633 ± 0.433
0.752TyrCys: 0.752 ± 0.304
2.106TyrAsp: 2.106 ± 0.389
2.483TyrGlu: 2.483 ± 0.481
1.73TyrPhe: 1.73 ± 0.4
2.558TyrGly: 2.558 ± 0.335
0.451TyrHis: 0.451 ± 0.217
1.655TyrIle: 1.655 ± 0.378
3.31TyrLys: 3.31 ± 0.607
3.385TyrLeu: 3.385 ± 0.486
1.053TyrMet: 1.053 ± 0.24
2.934TyrAsn: 2.934 ± 0.382
1.429TyrPro: 1.429 ± 0.303
1.354TyrGln: 1.354 ± 0.366
2.031TyrArg: 2.031 ± 0.396
3.084TyrSer: 3.084 ± 0.388
2.483TyrThr: 2.483 ± 0.39
2.558TyrVal: 2.558 ± 0.374
0.602TyrTrp: 0.602 ± 0.232
2.031TyrTyr: 2.031 ± 0.511
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (13294 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski