Amino acid dipeptide frequency for Pseudomonas phage PollyC

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are presented in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions (e.g. 17.251‰ = 0.017251).
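The calculation described above can be sketched as follows. This is a minimal illustration, not the code used by Proteome-pI: the function names, the one-letter-to-three-letter mapping, and the toy sequences are assumptions made for the example. It also assumes that dipeptides are counted as overlapping pairs within each protein and normalized by the total number of pairs; the database may use a different normalization.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
STANDARD = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}
# Uniform expectation over 400 dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard residues is mapped to Xaa.
        names = [STANDARD.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(names, names[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000 * n / total for pair, n in counts.items()}


def label(freq_permille):
    """Classify a frequency against the uniform 2.5 per mille expectation."""
    if freq_permille > EXPECTED_PERMILLE:
        return "over"
    if freq_permille < EXPECTED_PERMILLE:
        return "under"
    return "expected"


toy = ["MAAK", "MAL"]  # made-up sequences, not PollyC proteins
freqs = dipeptide_permille(toy)
```

Applied to values from the table, `label(17.251)` classifies AlaAla as overrepresented and `label(0.078)` classifies CysCys as underrepresented, which corresponds to the red and blue marking described above.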

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.251 ± 2.072
AlaCys: 0.777 ± 0.293
AlaAsp: 7.771 ± 0.784
AlaGlu: 7.227 ± 0.908
AlaPhe: 3.186 ± 0.487
AlaGly: 9.946 ± 0.917
AlaHis: 1.787 ± 0.35
AlaIle: 4.352 ± 0.502
AlaLys: 7.46 ± 0.879
AlaLeu: 11.034 ± 0.845
AlaMet: 2.953 ± 0.476
AlaAsn: 4.895 ± 1.272
AlaPro: 3.808 ± 0.471
AlaGln: 5.284 ± 0.978
AlaArg: 5.828 ± 0.634
AlaSer: 4.895 ± 0.749
AlaThr: 7.537 ± 1.047
AlaVal: 7.926 ± 1.139
AlaTrp: 1.088 ± 0.292
AlaTyr: 2.564 ± 0.415
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.166 ± 0.357
CysCys: 0.078 ± 0.082
CysAsp: 0.466 ± 0.19
CysGlu: 0.078 ± 0.076
CysPhe: 0.466 ± 0.202
CysGly: 0.466 ± 0.217
CysHis: 0.233 ± 0.143
CysIle: 0.466 ± 0.215
CysLys: 0.622 ± 0.202
CysLeu: 1.166 ± 0.31
CysMet: 0.389 ± 0.175
CysAsn: 0.311 ± 0.163
CysPro: 0.622 ± 0.22
CysGln: 0.311 ± 0.204
CysArg: 0.622 ± 0.245
CysSer: 0.155 ± 0.098
CysThr: 0.777 ± 0.276
CysVal: 1.01 ± 0.32
CysTrp: 0.233 ± 0.118
CysTyr: 0.078 ± 0.09
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.76 ± 0.67
AspCys: 0.622 ± 0.247
AspAsp: 3.652 ± 0.752
AspGlu: 3.419 ± 0.47
AspPhe: 2.02 ± 0.373
AspGly: 5.284 ± 0.677
AspHis: 1.088 ± 0.275
AspIle: 3.031 ± 0.414
AspLys: 2.642 ± 0.365
AspLeu: 4.973 ± 0.606
AspMet: 2.098 ± 0.442
AspAsn: 2.253 ± 0.491
AspPro: 3.264 ± 0.486
AspGln: 2.098 ± 0.58
AspArg: 3.419 ± 0.428
AspSer: 4.118 ± 0.586
AspThr: 2.797 ± 0.421
AspVal: 4.74 ± 0.559
AspTrp: 0.622 ± 0.244
AspTyr: 1.865 ± 0.365
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.76 ± 0.756
GluCys: 0.389 ± 0.169
GluAsp: 1.554 ± 0.339
GluGlu: 3.108 ± 0.653
GluPhe: 2.253 ± 0.614
GluGly: 3.264 ± 0.492
GluHis: 1.632 ± 0.497
GluIle: 2.564 ± 0.487
GluLys: 2.797 ± 0.488
GluLeu: 5.828 ± 0.731
GluMet: 1.787 ± 0.382
GluAsn: 1.865 ± 0.413
GluPro: 1.632 ± 0.372
GluGln: 3.108 ± 0.445
GluArg: 3.963 ± 0.853
GluSer: 2.409 ± 0.384
GluThr: 2.564 ± 0.472
GluVal: 3.963 ± 0.55
GluTrp: 1.01 ± 0.346
GluTyr: 2.098 ± 0.473
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.264 ± 0.442
PheCys: 0.699 ± 0.326
PheAsp: 2.875 ± 0.511
PheGlu: 1.243 ± 0.334
PhePhe: 1.01 ± 0.241
PheGly: 2.642 ± 0.354
PheHis: 0.622 ± 0.223
PheIle: 1.399 ± 0.39
PheLys: 2.02 ± 0.446
PheLeu: 3.186 ± 0.746
PheMet: 0.777 ± 0.256
PheAsn: 1.476 ± 0.372
PhePro: 1.554 ± 0.389
PheGln: 1.865 ± 0.316
PheArg: 1.865 ± 0.321
PheSer: 2.098 ± 0.4
PheThr: 1.71 ± 0.316
PheVal: 1.943 ± 0.428
PheTrp: 0.311 ± 0.164
PheTyr: 1.088 ± 0.3
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.858 ± 1.055
GlyCys: 1.088 ± 0.295
GlyAsp: 4.352 ± 0.524
GlyGlu: 3.652 ± 0.554
GlyPhe: 2.797 ± 0.55
GlyGly: 7.693 ± 1.05
GlyHis: 1.243 ± 0.285
GlyIle: 3.341 ± 0.519
GlyLys: 4.507 ± 0.783
GlyLeu: 4.895 ± 0.663
GlyMet: 2.875 ± 0.762
GlyAsn: 3.73 ± 0.473
GlyPro: 2.253 ± 0.502
GlyGln: 4.118 ± 0.526
GlyArg: 4.041 ± 0.561
GlySer: 5.517 ± 0.645
GlyThr: 5.517 ± 0.663
GlyVal: 5.906 ± 0.707
GlyTrp: 1.088 ± 0.276
GlyTyr: 2.253 ± 0.514
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.409 ± 0.444
HisCys: 0.389 ± 0.169
HisAsp: 1.554 ± 0.451
HisGlu: 0.855 ± 0.295
HisPhe: 0.699 ± 0.296
HisGly: 1.71 ± 0.406
HisHis: 0.233 ± 0.132
HisIle: 0.544 ± 0.219
HisLys: 1.166 ± 0.291
HisLeu: 1.554 ± 0.376
HisMet: 1.01 ± 0.262
HisAsn: 0.389 ± 0.224
HisPro: 1.399 ± 0.333
HisGln: 0.699 ± 0.247
HisArg: 0.622 ± 0.241
HisSer: 1.632 ± 0.485
HisThr: 1.01 ± 0.266
HisVal: 1.01 ± 0.288
HisTrp: 0.389 ± 0.136
HisTyr: 0.777 ± 0.21
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.585 ± 0.69
IleCys: 0.233 ± 0.132
IleAsp: 2.02 ± 0.35
IleGlu: 1.865 ± 0.351
IlePhe: 1.321 ± 0.396
IleGly: 3.341 ± 0.507
IleHis: 0.855 ± 0.272
IleIle: 1.943 ± 0.373
IleLys: 2.564 ± 0.572
IleLeu: 3.73 ± 0.528
IleMet: 1.166 ± 0.296
IleAsn: 1.943 ± 0.417
IlePro: 1.632 ± 0.402
IleGln: 2.642 ± 0.44
IleArg: 2.875 ± 0.47
IleSer: 2.176 ± 0.434
IleThr: 2.72 ± 0.483
IleVal: 3.885 ± 0.582
IleTrp: 0.622 ± 0.253
IleTyr: 1.476 ± 0.302
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.75 ± 0.699
LysCys: 0.544 ± 0.221
LysAsp: 3.186 ± 0.446
LysGlu: 2.875 ± 0.385
LysPhe: 2.176 ± 0.438
LysGly: 3.73 ± 0.584
LysHis: 1.166 ± 0.244
LysIle: 2.176 ± 0.54
LysLys: 2.331 ± 0.531
LysLeu: 3.497 ± 0.533
LysMet: 1.476 ± 0.391
LysAsn: 2.02 ± 0.457
LysPro: 2.331 ± 0.344
LysGln: 2.253 ± 0.553
LysArg: 3.341 ± 0.578
LysSer: 2.642 ± 0.366
LysThr: 2.953 ± 0.548
LysVal: 4.274 ± 0.742
LysTrp: 0.466 ± 0.188
LysTyr: 1.399 ± 0.373
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.646 ± 1.126
LeuCys: 0.932 ± 0.34
LeuAsp: 5.595 ± 0.578
LeuGlu: 4.662 ± 0.653
LeuPhe: 1.71 ± 0.35
LeuGly: 6.372 ± 0.868
LeuHis: 1.554 ± 0.399
LeuIle: 2.953 ± 0.548
LeuLys: 3.497 ± 0.471
LeuLeu: 7.46 ± 0.981
LeuMet: 2.642 ± 0.587
LeuAsn: 2.487 ± 0.37
LeuPro: 4.973 ± 0.645
LeuGln: 3.808 ± 0.635
LeuArg: 5.673 ± 0.855
LeuSer: 5.75 ± 0.628
LeuThr: 4.818 ± 0.666
LeuVal: 5.828 ± 0.71
LeuTrp: 0.777 ± 0.201
LeuTyr: 2.176 ± 0.339
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.574 ± 0.602
MetCys: 0.078 ± 0.077
MetAsp: 2.098 ± 0.701
MetGlu: 1.632 ± 0.355
MetPhe: 1.01 ± 0.301
MetGly: 1.71 ± 0.481
MetHis: 0.777 ± 0.209
MetIle: 1.088 ± 0.316
MetLys: 1.71 ± 0.355
MetLeu: 3.497 ± 0.662
MetMet: 1.321 ± 0.515
MetAsn: 0.544 ± 0.234
MetPro: 1.554 ± 0.413
MetGln: 1.476 ± 0.369
MetArg: 1.865 ± 0.411
MetSer: 1.71 ± 0.371
MetThr: 1.632 ± 0.361
MetVal: 1.476 ± 0.285
MetTrp: 0.389 ± 0.159
MetTyr: 0.777 ± 0.273
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.507 ± 1.095
AsnCys: 0.311 ± 0.193
AsnAsp: 2.253 ± 0.321
AsnGlu: 2.72 ± 0.465
AsnPhe: 1.554 ± 0.306
AsnGly: 4.118 ± 0.874
AsnHis: 0.777 ± 0.25
AsnIle: 1.71 ± 0.397
AsnLys: 1.943 ± 0.494
AsnLeu: 3.031 ± 0.563
AsnMet: 1.243 ± 0.314
AsnAsn: 1.943 ± 0.57
AsnPro: 2.253 ± 0.336
AsnGln: 1.321 ± 0.44
AsnArg: 1.71 ± 0.335
AsnSer: 2.253 ± 0.472
AsnThr: 2.176 ± 0.625
AsnVal: 2.253 ± 0.434
AsnTrp: 1.01 ± 0.314
AsnTyr: 1.243 ± 0.3
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.74 ± 0.801
ProCys: 0.389 ± 0.21
ProAsp: 3.73 ± 0.535
ProGlu: 2.487 ± 0.509
ProPhe: 1.476 ± 0.379
ProGly: 3.419 ± 0.431
ProHis: 0.932 ± 0.221
ProIle: 2.331 ± 0.376
ProLys: 1.554 ± 0.395
ProLeu: 3.031 ± 0.505
ProMet: 1.243 ± 0.249
ProAsn: 1.321 ± 0.332
ProPro: 0.777 ± 0.27
ProGln: 2.02 ± 0.362
ProArg: 1.399 ± 0.35
ProSer: 2.564 ± 0.337
ProThr: 3.264 ± 0.5
ProVal: 3.264 ± 0.515
ProTrp: 0.699 ± 0.208
ProTyr: 1.166 ± 0.331
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 7.071 ± 0.925
GlnCys: 0.544 ± 0.209
GlnAsp: 2.72 ± 0.525
GlnGlu: 2.253 ± 0.39
GlnPhe: 1.943 ± 0.333
GlnGly: 2.953 ± 0.569
GlnHis: 1.554 ± 0.448
GlnIle: 2.564 ± 0.51
GlnLys: 1.554 ± 0.321
GlnLeu: 4.118 ± 0.625
GlnMet: 1.554 ± 0.447
GlnAsn: 1.476 ± 0.33
GlnPro: 1.243 ± 0.222
GlnGln: 3.574 ± 1.078
GlnArg: 2.953 ± 0.502
GlnSer: 2.176 ± 0.539
GlnThr: 2.02 ± 0.556
GlnVal: 3.808 ± 0.607
GlnTrp: 0.466 ± 0.209
GlnTyr: 1.243 ± 0.47
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.439 ± 1.274
ArgCys: 0.311 ± 0.157
ArgAsp: 4.662 ± 0.509
ArgGlu: 3.963 ± 0.714
ArgPhe: 1.865 ± 0.39
ArgGly: 4.352 ± 0.56
ArgHis: 1.632 ± 0.42
ArgIle: 2.797 ± 0.461
ArgLys: 3.341 ± 0.637
ArgLeu: 4.973 ± 0.72
ArgMet: 1.476 ± 0.305
ArgAsn: 2.02 ± 0.368
ArgPro: 2.253 ± 0.565
ArgGln: 2.72 ± 0.523
ArgArg: 3.574 ± 0.583
ArgSer: 2.564 ± 0.487
ArgThr: 3.341 ± 0.488
ArgVal: 3.341 ± 0.515
ArgTrp: 1.243 ± 0.301
ArgTyr: 2.02 ± 0.356
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.994 ± 0.656
SerCys: 0.777 ± 0.284
SerAsp: 2.72 ± 0.259
SerGlu: 2.02 ± 0.429
SerPhe: 1.71 ± 0.326
SerGly: 5.051 ± 0.747
SerHis: 0.699 ± 0.267
SerIle: 2.642 ± 0.526
SerLys: 2.797 ± 0.524
SerLeu: 4.507 ± 0.53
SerMet: 1.787 ± 0.326
SerAsn: 2.797 ± 0.479
SerPro: 2.487 ± 0.448
SerGln: 2.564 ± 0.42
SerArg: 3.341 ± 0.384
SerSer: 3.341 ± 0.56
SerThr: 3.574 ± 0.548
SerVal: 3.808 ± 0.621
SerTrp: 0.777 ± 0.222
SerTyr: 1.243 ± 0.438
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.838 ± 1.033
ThrCys: 0.311 ± 0.121
ThrAsp: 2.487 ± 0.385
ThrGlu: 3.264 ± 0.534
ThrPhe: 1.399 ± 0.313
ThrGly: 4.895 ± 0.681
ThrHis: 1.01 ± 0.326
ThrIle: 3.108 ± 0.549
ThrLys: 2.72 ± 0.361
ThrLeu: 4.662 ± 0.542
ThrMet: 1.321 ± 0.364
ThrAsn: 2.953 ± 0.578
ThrPro: 3.031 ± 0.379
ThrGln: 2.72 ± 0.541
ThrArg: 3.108 ± 0.479
ThrSer: 3.885 ± 0.533
ThrThr: 3.419 ± 0.59
ThrVal: 4.196 ± 0.704
ThrTrp: 1.01 ± 0.293
ThrTyr: 1.554 ± 0.378
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.994 ± 0.631
ValCys: 0.622 ± 0.209
ValAsp: 4.818 ± 0.64
ValGlu: 5.206 ± 0.642
ValPhe: 2.875 ± 0.374
ValGly: 4.895 ± 0.546
ValHis: 0.932 ± 0.279
ValIle: 2.953 ± 0.497
ValLys: 4.041 ± 0.593
ValLeu: 5.051 ± 0.756
ValMet: 1.943 ± 0.342
ValAsn: 3.885 ± 0.617
ValPro: 3.497 ± 0.552
ValGln: 3.031 ± 0.546
ValArg: 4.196 ± 0.573
ValSer: 3.574 ± 0.431
ValThr: 4.352 ± 0.953
ValVal: 5.517 ± 0.714
ValTrp: 1.01 ± 0.281
ValTyr: 2.564 ± 0.523
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.399 ± 0.395
TrpCys: 0.233 ± 0.184
TrpAsp: 0.699 ± 0.287
TrpGlu: 0.622 ± 0.196
TrpPhe: 1.166 ± 0.276
TrpGly: 1.321 ± 0.411
TrpHis: 0.466 ± 0.177
TrpIle: 0.466 ± 0.157
TrpLys: 0.389 ± 0.165
TrpLeu: 1.865 ± 0.406
TrpMet: 0.233 ± 0.127
TrpAsn: 0.622 ± 0.223
TrpPro: 0.155 ± 0.107
TrpGln: 0.466 ± 0.149
TrpArg: 1.321 ± 0.36
TrpSer: 0.311 ± 0.162
TrpThr: 0.389 ± 0.167
TrpVal: 1.476 ± 0.33
TrpTrp: 0.078 ± 0.068
TrpTyr: 0.078 ± 0.078
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.564 ± 0.427
TyrCys: 0.389 ± 0.167
TyrAsp: 1.554 ± 0.368
TyrGlu: 1.399 ± 0.326
TyrPhe: 1.01 ± 0.263
TyrGly: 2.564 ± 0.361
TyrHis: 0.855 ± 0.268
TyrIle: 1.166 ± 0.376
TyrLys: 0.932 ± 0.281
TyrLeu: 2.253 ± 0.479
TyrMet: 0.544 ± 0.179
TyrAsn: 1.399 ± 0.408
TyrPro: 1.088 ± 0.413
TyrGln: 1.71 ± 0.405
TyrArg: 2.176 ± 0.426
TyrSer: 1.865 ± 0.421
TyrThr: 1.321 ± 0.298
TyrVal: 2.331 ± 0.448
TyrTrp: 0.466 ± 0.22
TyrTyr: 0.932 ± 0.303
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (12870 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
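A protein-level bootstrap of this kind can be sketched as below. This is an illustrative sketch under stated assumptions, not the Proteome-pI implementation: the function names and the toy sequences are hypothetical, and it assumes that the reported error is the standard deviation of the frequency across resampled proteomes, which the page does not state explicitly.

```python
import random
import statistics


def pair_fraction(proteins, pair):
    """Fraction of all overlapping residue pairs equal to `pair` (e.g. "AA")."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread (standard deviation, in per mille) of the pair frequency
    over proteomes resampled at the protein level."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(1000 * pair_fraction(sample, pair))
    return statistics.stdev(estimates)


toy = ["MAAK", "MAL", "MKAAAL"]  # made-up sequences, not PollyC proteins
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome.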

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski