Amino acid dipepetide frequency for Pseudomonas phage KNP

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.902AlaAla: 10.902 ± 1.11
0.869AlaCys: 0.869 ± 0.264
6.794AlaAsp: 6.794 ± 1.011
6.241AlaGlu: 6.241 ± 0.769
2.133AlaPhe: 2.133 ± 0.48
7.742AlaGly: 7.742 ± 0.782
1.106AlaHis: 1.106 ± 0.308
5.214AlaIle: 5.214 ± 0.548
5.609AlaLys: 5.609 ± 0.516
8.769AlaLeu: 8.769 ± 1.175
3.081AlaMet: 3.081 ± 0.527
4.424AlaAsn: 4.424 ± 0.68
3.476AlaPro: 3.476 ± 0.528
4.582AlaGln: 4.582 ± 0.532
5.767AlaArg: 5.767 ± 0.62
5.846AlaSer: 5.846 ± 0.889
6.557AlaThr: 6.557 ± 0.725
5.53AlaVal: 5.53 ± 0.738
1.264AlaTrp: 1.264 ± 0.327
2.528AlaTyr: 2.528 ± 0.371
0.0AlaXaa: 0.0 ± 0.0
Cys
1.264CysAla: 1.264 ± 0.26
0.079CysCys: 0.079 ± 0.069
0.474CysAsp: 0.474 ± 0.294
0.316CysGlu: 0.316 ± 0.146
0.553CysPhe: 0.553 ± 0.227
0.632CysGly: 0.632 ± 0.248
0.316CysHis: 0.316 ± 0.149
0.316CysIle: 0.316 ± 0.148
0.632CysLys: 0.632 ± 0.272
0.948CysLeu: 0.948 ± 0.289
0.158CysMet: 0.158 ± 0.12
0.474CysAsn: 0.474 ± 0.186
0.395CysPro: 0.395 ± 0.219
0.79CysGln: 0.79 ± 0.23
0.711CysArg: 0.711 ± 0.365
0.237CysSer: 0.237 ± 0.118
0.079CysThr: 0.079 ± 0.075
0.948CysVal: 0.948 ± 0.31
0.0CysTrp: 0.0 ± 0.0
0.316CysTyr: 0.316 ± 0.148
0.0CysXaa: 0.0 ± 0.0
Asp
5.372AspAla: 5.372 ± 0.72
0.711AspCys: 0.711 ± 0.332
4.424AspAsp: 4.424 ± 0.682
4.029AspGlu: 4.029 ± 0.722
3.002AspPhe: 3.002 ± 0.544
6.004AspGly: 6.004 ± 0.628
1.422AspHis: 1.422 ± 0.418
3.713AspIle: 3.713 ± 0.467
3.713AspLys: 3.713 ± 0.487
4.819AspLeu: 4.819 ± 0.584
1.738AspMet: 1.738 ± 0.329
2.133AspAsn: 2.133 ± 0.302
3.081AspPro: 3.081 ± 0.671
2.37AspGln: 2.37 ± 0.409
3.002AspArg: 3.002 ± 0.502
3.634AspSer: 3.634 ± 0.547
3.081AspThr: 3.081 ± 0.351
4.503AspVal: 4.503 ± 0.667
1.343AspTrp: 1.343 ± 0.297
1.58AspTyr: 1.58 ± 0.316
0.0AspXaa: 0.0 ± 0.0
Glu
7.9GluAla: 7.9 ± 0.909
0.553GluCys: 0.553 ± 0.228
4.029GluAsp: 4.029 ± 0.435
3.476GluGlu: 3.476 ± 0.712
3.397GluPhe: 3.397 ± 0.569
4.661GluGly: 4.661 ± 0.682
1.659GluHis: 1.659 ± 0.391
3.239GluIle: 3.239 ± 0.361
2.528GluLys: 2.528 ± 0.356
4.977GluLeu: 4.977 ± 0.735
2.133GluMet: 2.133 ± 0.453
2.054GluAsn: 2.054 ± 0.337
1.896GluPro: 1.896 ± 0.363
3.476GluGln: 3.476 ± 0.539
4.503GluArg: 4.503 ± 0.575
3.792GluSer: 3.792 ± 0.543
4.108GluThr: 4.108 ± 0.678
4.029GluVal: 4.029 ± 0.802
1.264GluTrp: 1.264 ± 0.392
2.528GluTyr: 2.528 ± 0.43
0.0GluXaa: 0.0 ± 0.0
Phe
2.923PheAla: 2.923 ± 0.493
0.553PheCys: 0.553 ± 0.202
2.844PheAsp: 2.844 ± 0.511
1.659PheGlu: 1.659 ± 0.328
1.264PhePhe: 1.264 ± 0.307
2.923PheGly: 2.923 ± 0.467
1.027PheHis: 1.027 ± 0.23
1.58PheIle: 1.58 ± 0.338
1.422PheLys: 1.422 ± 0.318
2.528PheLeu: 2.528 ± 0.383
1.027PheMet: 1.027 ± 0.3
1.896PheAsn: 1.896 ± 0.407
2.37PhePro: 2.37 ± 0.499
1.185PheGln: 1.185 ± 0.354
2.133PheArg: 2.133 ± 0.327
2.449PheSer: 2.449 ± 0.561
2.923PheThr: 2.923 ± 0.453
3.002PheVal: 3.002 ± 0.559
0.474PheTrp: 0.474 ± 0.197
0.948PheTyr: 0.948 ± 0.248
0.0PheXaa: 0.0 ± 0.0
Gly
6.399GlyAla: 6.399 ± 0.9
0.553GlyCys: 0.553 ± 0.208
5.609GlyAsp: 5.609 ± 0.5
4.898GlyGlu: 4.898 ± 0.548
3.318GlyPhe: 3.318 ± 0.516
6.399GlyGly: 6.399 ± 0.771
1.659GlyHis: 1.659 ± 0.362
4.819GlyIle: 4.819 ± 0.658
4.266GlyLys: 4.266 ± 0.584
6.162GlyLeu: 6.162 ± 0.748
1.738GlyMet: 1.738 ± 0.313
2.37GlyAsn: 2.37 ± 0.491
2.37GlyPro: 2.37 ± 0.444
2.923GlyGln: 2.923 ± 0.501
4.898GlyArg: 4.898 ± 0.581
6.478GlySer: 6.478 ± 1.137
4.898GlyThr: 4.898 ± 0.759
4.977GlyVal: 4.977 ± 0.527
1.659GlyTrp: 1.659 ± 0.374
3.476GlyTyr: 3.476 ± 0.561
0.0GlyXaa: 0.0 ± 0.0
His
1.422HisAla: 1.422 ± 0.309
0.395HisCys: 0.395 ± 0.201
1.422HisAsp: 1.422 ± 0.336
0.948HisGlu: 0.948 ± 0.299
0.711HisPhe: 0.711 ± 0.267
1.501HisGly: 1.501 ± 0.32
0.158HisHis: 0.158 ± 0.108
1.501HisIle: 1.501 ± 0.383
1.185HisLys: 1.185 ± 0.315
1.738HisLeu: 1.738 ± 0.402
1.027HisMet: 1.027 ± 0.268
0.79HisAsn: 0.79 ± 0.258
0.711HisPro: 0.711 ± 0.332
0.711HisGln: 0.711 ± 0.264
0.553HisArg: 0.553 ± 0.182
1.106HisSer: 1.106 ± 0.268
0.869HisThr: 0.869 ± 0.3
1.106HisVal: 1.106 ± 0.285
0.711HisTrp: 0.711 ± 0.258
0.79HisTyr: 0.79 ± 0.184
0.0HisXaa: 0.0 ± 0.0
Ile
5.609IleAla: 5.609 ± 0.585
0.711IleCys: 0.711 ± 0.275
3.476IleAsp: 3.476 ± 0.546
4.187IleGlu: 4.187 ± 0.595
1.027IlePhe: 1.027 ± 0.278
3.634IleGly: 3.634 ± 0.469
0.79IleHis: 0.79 ± 0.256
2.212IleIle: 2.212 ± 0.407
3.397IleLys: 3.397 ± 0.549
3.792IleLeu: 3.792 ± 0.49
0.711IleMet: 0.711 ± 0.239
2.291IleAsn: 2.291 ± 0.548
1.975IlePro: 1.975 ± 0.485
2.449IleGln: 2.449 ± 0.456
3.318IleArg: 3.318 ± 0.488
2.528IleSer: 2.528 ± 0.445
2.923IleThr: 2.923 ± 0.462
3.397IleVal: 3.397 ± 0.505
0.632IleTrp: 0.632 ± 0.245
1.738IleTyr: 1.738 ± 0.286
0.0IleXaa: 0.0 ± 0.0
Lys
5.925LysAla: 5.925 ± 0.777
0.553LysCys: 0.553 ± 0.225
3.634LysAsp: 3.634 ± 0.473
3.634LysGlu: 3.634 ± 0.571
2.607LysPhe: 2.607 ± 0.393
5.056LysGly: 5.056 ± 0.541
1.185LysHis: 1.185 ± 0.377
2.528LysIle: 2.528 ± 0.436
3.239LysLys: 3.239 ± 0.516
5.53LysLeu: 5.53 ± 0.779
1.343LysMet: 1.343 ± 0.313
1.817LysAsn: 1.817 ± 0.333
2.765LysPro: 2.765 ± 0.49
2.212LysGln: 2.212 ± 0.371
3.002LysArg: 3.002 ± 0.538
2.607LysSer: 2.607 ± 0.437
3.081LysThr: 3.081 ± 0.445
4.74LysVal: 4.74 ± 0.628
0.711LysTrp: 0.711 ± 0.31
1.343LysTyr: 1.343 ± 0.248
0.0LysXaa: 0.0 ± 0.0
Leu
7.663LeuAla: 7.663 ± 0.835
0.474LeuCys: 0.474 ± 0.191
4.661LeuAsp: 4.661 ± 0.57
5.293LeuGlu: 5.293 ± 0.769
1.975LeuPhe: 1.975 ± 0.503
4.977LeuGly: 4.977 ± 0.636
1.58LeuHis: 1.58 ± 0.264
4.898LeuIle: 4.898 ± 0.618
6.399LeuLys: 6.399 ± 0.677
5.609LeuLeu: 5.609 ± 0.719
2.449LeuMet: 2.449 ± 0.434
4.029LeuAsn: 4.029 ± 0.715
3.081LeuPro: 3.081 ± 0.538
4.424LeuGln: 4.424 ± 0.559
5.135LeuArg: 5.135 ± 0.539
5.135LeuSer: 5.135 ± 0.66
4.029LeuThr: 4.029 ± 0.544
4.898LeuVal: 4.898 ± 0.717
1.106LeuTrp: 1.106 ± 0.289
2.212LeuTyr: 2.212 ± 0.461
0.0LeuXaa: 0.0 ± 0.0
Met
2.844MetAla: 2.844 ± 0.418
0.237MetCys: 0.237 ± 0.141
1.817MetAsp: 1.817 ± 0.414
1.817MetGlu: 1.817 ± 0.45
0.474MetPhe: 0.474 ± 0.237
1.659MetGly: 1.659 ± 0.315
0.474MetHis: 0.474 ± 0.166
1.501MetIle: 1.501 ± 0.358
1.027MetLys: 1.027 ± 0.312
2.607MetLeu: 2.607 ± 0.413
0.158MetMet: 0.158 ± 0.103
1.185MetAsn: 1.185 ± 0.27
1.106MetPro: 1.106 ± 0.28
0.869MetGln: 0.869 ± 0.284
1.027MetArg: 1.027 ± 0.244
2.291MetSer: 2.291 ± 0.446
2.449MetThr: 2.449 ± 0.394
1.422MetVal: 1.422 ± 0.348
0.158MetTrp: 0.158 ± 0.1
0.316MetTyr: 0.316 ± 0.149
0.0MetXaa: 0.0 ± 0.0
Asn
3.634AsnAla: 3.634 ± 0.457
0.474AsnCys: 0.474 ± 0.156
1.896AsnAsp: 1.896 ± 0.45
3.081AsnGlu: 3.081 ± 0.612
1.501AsnPhe: 1.501 ± 0.342
4.977AsnGly: 4.977 ± 0.84
0.395AsnHis: 0.395 ± 0.181
1.659AsnIle: 1.659 ± 0.284
1.106AsnLys: 1.106 ± 0.29
3.081AsnLeu: 3.081 ± 0.388
0.869AsnMet: 0.869 ± 0.298
1.975AsnAsn: 1.975 ± 0.4
2.765AsnPro: 2.765 ± 0.457
1.896AsnGln: 1.896 ± 0.398
2.212AsnArg: 2.212 ± 0.44
3.239AsnSer: 3.239 ± 0.787
1.58AsnThr: 1.58 ± 0.307
2.449AsnVal: 2.449 ± 0.547
0.395AsnTrp: 0.395 ± 0.133
1.659AsnTyr: 1.659 ± 0.432
0.0AsnXaa: 0.0 ± 0.0
Pro
3.555ProAla: 3.555 ± 0.447
0.316ProCys: 0.316 ± 0.212
3.239ProAsp: 3.239 ± 0.454
4.582ProGlu: 4.582 ± 0.574
1.501ProPhe: 1.501 ± 0.321
1.975ProGly: 1.975 ± 0.37
1.264ProHis: 1.264 ± 0.354
1.58ProIle: 1.58 ± 0.411
2.291ProLys: 2.291 ± 0.432
2.449ProLeu: 2.449 ± 0.453
1.027ProMet: 1.027 ± 0.302
1.975ProAsn: 1.975 ± 0.386
1.027ProPro: 1.027 ± 0.255
1.659ProGln: 1.659 ± 0.327
1.896ProArg: 1.896 ± 0.421
2.607ProSer: 2.607 ± 0.515
1.58ProThr: 1.58 ± 0.297
2.844ProVal: 2.844 ± 0.458
0.553ProTrp: 0.553 ± 0.282
1.264ProTyr: 1.264 ± 0.345
0.0ProXaa: 0.0 ± 0.0
Gln
6.794GlnAla: 6.794 ± 0.754
0.316GlnCys: 0.316 ± 0.148
2.133GlnAsp: 2.133 ± 0.348
3.081GlnGlu: 3.081 ± 0.603
2.291GlnPhe: 2.291 ± 0.324
3.95GlnGly: 3.95 ± 0.529
0.711GlnHis: 0.711 ± 0.244
1.738GlnIle: 1.738 ± 0.421
2.212GlnLys: 2.212 ± 0.649
4.108GlnLeu: 4.108 ± 0.604
1.027GlnMet: 1.027 ± 0.249
1.58GlnAsn: 1.58 ± 0.339
1.185GlnPro: 1.185 ± 0.268
2.923GlnGln: 2.923 ± 0.648
2.607GlnArg: 2.607 ± 0.492
2.607GlnSer: 2.607 ± 0.589
1.58GlnThr: 1.58 ± 0.341
2.291GlnVal: 2.291 ± 0.346
0.711GlnTrp: 0.711 ± 0.246
2.054GlnTyr: 2.054 ± 0.383
0.0GlnXaa: 0.0 ± 0.0
Arg
5.214ArgAla: 5.214 ± 0.448
0.553ArgCys: 0.553 ± 0.203
3.239ArgAsp: 3.239 ± 0.394
4.266ArgGlu: 4.266 ± 0.575
2.37ArgPhe: 2.37 ± 0.444
4.819ArgGly: 4.819 ± 0.483
1.027ArgHis: 1.027 ± 0.348
3.002ArgIle: 3.002 ± 0.438
3.871ArgLys: 3.871 ± 0.636
4.424ArgLeu: 4.424 ± 0.576
1.58ArgMet: 1.58 ± 0.31
1.975ArgAsn: 1.975 ± 0.317
1.896ArgPro: 1.896 ± 0.357
3.002ArgGln: 3.002 ± 0.556
3.16ArgArg: 3.16 ± 0.376
4.266ArgSer: 4.266 ± 0.58
2.054ArgThr: 2.054 ± 0.448
3.871ArgVal: 3.871 ± 0.503
0.711ArgTrp: 0.711 ± 0.223
2.054ArgTyr: 2.054 ± 0.357
0.0ArgXaa: 0.0 ± 0.0
Ser
5.767SerAla: 5.767 ± 1.017
0.632SerCys: 0.632 ± 0.264
4.898SerAsp: 4.898 ± 0.571
3.634SerGlu: 3.634 ± 0.445
2.844SerPhe: 2.844 ± 0.397
6.794SerGly: 6.794 ± 0.977
1.896SerHis: 1.896 ± 0.301
2.923SerIle: 2.923 ± 0.473
2.923SerLys: 2.923 ± 0.382
4.74SerLeu: 4.74 ± 0.5
0.948SerMet: 0.948 ± 0.291
2.765SerAsn: 2.765 ± 0.376
2.212SerPro: 2.212 ± 0.302
2.844SerGln: 2.844 ± 0.637
3.318SerArg: 3.318 ± 0.543
3.95SerSer: 3.95 ± 0.6
2.528SerThr: 2.528 ± 0.455
3.95SerVal: 3.95 ± 0.512
0.711SerTrp: 0.711 ± 0.17
2.212SerTyr: 2.212 ± 0.51
0.0SerXaa: 0.0 ± 0.0
Thr
4.108ThrAla: 4.108 ± 0.708
0.553ThrCys: 0.553 ± 0.202
3.555ThrAsp: 3.555 ± 0.446
3.713ThrGlu: 3.713 ± 0.49
2.765ThrPhe: 2.765 ± 0.602
4.424ThrGly: 4.424 ± 0.624
0.553ThrHis: 0.553 ± 0.212
3.713ThrIle: 3.713 ± 0.613
5.056ThrLys: 5.056 ± 0.486
3.397ThrLeu: 3.397 ± 0.579
1.185ThrMet: 1.185 ± 0.326
2.449ThrAsn: 2.449 ± 0.444
2.449ThrPro: 2.449 ± 0.359
1.975ThrGln: 1.975 ± 0.385
2.37ThrArg: 2.37 ± 0.384
3.081ThrSer: 3.081 ± 0.448
2.923ThrThr: 2.923 ± 0.756
4.503ThrVal: 4.503 ± 0.558
0.474ThrTrp: 0.474 ± 0.215
1.343ThrTyr: 1.343 ± 0.27
0.0ThrXaa: 0.0 ± 0.0
Val
6.399ValAla: 6.399 ± 0.827
0.711ValCys: 0.711 ± 0.222
2.607ValAsp: 2.607 ± 0.418
4.661ValGlu: 4.661 ± 0.62
1.817ValPhe: 1.817 ± 0.364
4.424ValGly: 4.424 ± 0.693
0.711ValHis: 0.711 ± 0.251
3.002ValIle: 3.002 ± 0.673
3.713ValLys: 3.713 ± 0.395
5.609ValLeu: 5.609 ± 0.567
1.896ValMet: 1.896 ± 0.396
2.528ValAsn: 2.528 ± 0.628
3.16ValPro: 3.16 ± 0.671
2.923ValGln: 2.923 ± 0.366
5.056ValArg: 5.056 ± 0.628
3.871ValSer: 3.871 ± 0.534
4.424ValThr: 4.424 ± 0.803
4.108ValVal: 4.108 ± 0.751
1.106ValTrp: 1.106 ± 0.273
2.528ValTyr: 2.528 ± 0.509
0.0ValXaa: 0.0 ± 0.0
Trp
1.343TrpAla: 1.343 ± 0.262
0.0TrpCys: 0.0 ± 0.0
1.027TrpAsp: 1.027 ± 0.286
0.632TrpGlu: 0.632 ± 0.229
0.553TrpPhe: 0.553 ± 0.192
0.632TrpGly: 0.632 ± 0.184
0.632TrpHis: 0.632 ± 0.208
0.553TrpIle: 0.553 ± 0.179
0.869TrpLys: 0.869 ± 0.199
1.817TrpLeu: 1.817 ± 0.353
0.395TrpMet: 0.395 ± 0.139
0.553TrpAsn: 0.553 ± 0.158
0.237TrpPro: 0.237 ± 0.142
0.79TrpGln: 0.79 ± 0.25
1.185TrpArg: 1.185 ± 0.334
1.185TrpSer: 1.185 ± 0.365
0.79TrpThr: 0.79 ± 0.272
0.948TrpVal: 0.948 ± 0.275
0.158TrpTrp: 0.158 ± 0.103
0.237TrpTyr: 0.237 ± 0.161
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.397TyrAla: 3.397 ± 0.588
0.395TyrCys: 0.395 ± 0.156
1.896TyrAsp: 1.896 ± 0.419
2.054TyrGlu: 2.054 ± 0.389
1.027TyrPhe: 1.027 ± 0.244
2.686TyrGly: 2.686 ± 0.409
0.79TyrHis: 0.79 ± 0.226
1.106TyrIle: 1.106 ± 0.291
2.054TyrLys: 2.054 ± 0.311
2.923TyrLeu: 2.923 ± 0.417
0.869TyrMet: 0.869 ± 0.261
1.58TyrAsn: 1.58 ± 0.284
1.027TyrPro: 1.027 ± 0.241
1.896TyrGln: 1.896 ± 0.372
1.58TyrArg: 1.58 ± 0.238
1.58TyrSer: 1.58 ± 0.38
2.133TyrThr: 2.133 ± 0.431
1.738TyrVal: 1.738 ± 0.309
0.395TyrTrp: 0.395 ± 0.177
0.632TyrTyr: 0.632 ± 0.206
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (12659 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski