Amino acid dipepetide frequency for Pseudomonas phage Zuri

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.386AlaAla: 10.386 ± 1.575
0.584AlaCys: 0.584 ± 0.189
4.922AlaAsp: 4.922 ± 0.413
6.173AlaGlu: 6.173 ± 0.626
2.628AlaPhe: 2.628 ± 0.3
6.507AlaGly: 6.507 ± 0.902
1.084AlaHis: 1.084 ± 0.182
5.047AlaIle: 5.047 ± 0.506
4.964AlaLys: 4.964 ± 0.509
8.3AlaLeu: 8.3 ± 0.501
2.336AlaMet: 2.336 ± 0.297
4.797AlaAsn: 4.797 ± 0.438
4.338AlaPro: 4.338 ± 0.614
6.215AlaGln: 6.215 ± 0.673
4.171AlaArg: 4.171 ± 0.515
5.547AlaSer: 5.547 ± 0.716
5.255AlaThr: 5.255 ± 0.563
6.298AlaVal: 6.298 ± 0.471
1.335AlaTrp: 1.335 ± 0.26
3.253AlaTyr: 3.253 ± 0.302
0.042AlaXaa: 0.042 ± 0.047
Cys
0.626CysAla: 0.626 ± 0.152
0.25CysCys: 0.25 ± 0.122
0.375CysAsp: 0.375 ± 0.139
0.834CysGlu: 0.834 ± 0.252
0.25CysPhe: 0.25 ± 0.112
0.542CysGly: 0.542 ± 0.153
0.501CysHis: 0.501 ± 0.19
0.375CysIle: 0.375 ± 0.128
0.542CysLys: 0.542 ± 0.188
0.918CysLeu: 0.918 ± 0.271
0.125CysMet: 0.125 ± 0.085
0.417CysAsn: 0.417 ± 0.145
0.417CysPro: 0.417 ± 0.17
0.542CysGln: 0.542 ± 0.2
0.375CysArg: 0.375 ± 0.141
0.542CysSer: 0.542 ± 0.163
0.542CysThr: 0.542 ± 0.176
0.209CysVal: 0.209 ± 0.102
0.042CysTrp: 0.042 ± 0.04
0.167CysTyr: 0.167 ± 0.088
0.0CysXaa: 0.0 ± 0.0
Asp
5.214AspAla: 5.214 ± 0.505
0.626AspCys: 0.626 ± 0.211
3.587AspAsp: 3.587 ± 0.511
3.712AspGlu: 3.712 ± 0.395
1.96AspPhe: 1.96 ± 0.292
4.463AspGly: 4.463 ± 0.41
1.126AspHis: 1.126 ± 0.258
3.754AspIle: 3.754 ± 0.495
2.628AspLys: 2.628 ± 0.344
5.214AspLeu: 5.214 ± 0.57
2.002AspMet: 2.002 ± 0.332
2.503AspAsn: 2.503 ± 0.319
3.504AspPro: 3.504 ± 0.358
3.212AspGln: 3.212 ± 0.368
3.42AspArg: 3.42 ± 0.379
2.628AspSer: 2.628 ± 0.464
2.878AspThr: 2.878 ± 0.349
3.879AspVal: 3.879 ± 0.438
0.834AspTrp: 0.834 ± 0.165
2.211AspTyr: 2.211 ± 0.296
0.0AspXaa: 0.0 ± 0.0
Glu
6.757GluAla: 6.757 ± 0.866
0.459GluCys: 0.459 ± 0.154
3.253GluAsp: 3.253 ± 0.427
5.047GluGlu: 5.047 ± 0.598
2.753GluPhe: 2.753 ± 0.332
4.463GluGly: 4.463 ± 0.524
1.46GluHis: 1.46 ± 0.323
4.338GluIle: 4.338 ± 0.556
3.921GluLys: 3.921 ± 0.578
6.382GluLeu: 6.382 ± 0.52
2.211GluMet: 2.211 ± 0.384
2.294GluAsn: 2.294 ± 0.383
2.586GluPro: 2.586 ± 0.348
3.087GluGln: 3.087 ± 0.418
3.128GluArg: 3.128 ± 0.416
3.087GluSer: 3.087 ± 0.341
3.17GluThr: 3.17 ± 0.275
5.005GluVal: 5.005 ± 0.61
0.709GluTrp: 0.709 ± 0.14
1.71GluTyr: 1.71 ± 0.254
0.0GluXaa: 0.0 ± 0.0
Phe
2.544PheAla: 2.544 ± 0.326
0.292PheCys: 0.292 ± 0.108
2.377PheAsp: 2.377 ± 0.264
1.877PheGlu: 1.877 ± 0.3
1.251PhePhe: 1.251 ± 0.286
2.795PheGly: 2.795 ± 0.321
0.626PheHis: 0.626 ± 0.199
1.668PheIle: 1.668 ± 0.29
2.169PheLys: 2.169 ± 0.346
3.545PheLeu: 3.545 ± 0.424
1.585PheMet: 1.585 ± 0.262
2.169PheAsn: 2.169 ± 0.391
1.502PhePro: 1.502 ± 0.181
1.627PheGln: 1.627 ± 0.236
1.835PheArg: 1.835 ± 0.256
1.96PheSer: 1.96 ± 0.241
2.211PheThr: 2.211 ± 0.303
2.044PheVal: 2.044 ± 0.282
0.334PheTrp: 0.334 ± 0.153
1.043PheTyr: 1.043 ± 0.18
0.0PheXaa: 0.0 ± 0.0
Gly
4.546GlyAla: 4.546 ± 0.547
0.375GlyCys: 0.375 ± 0.112
3.462GlyAsp: 3.462 ± 0.346
4.254GlyGlu: 4.254 ± 0.42
2.669GlyPhe: 2.669 ± 0.407
4.129GlyGly: 4.129 ± 0.482
1.335GlyHis: 1.335 ± 0.239
3.545GlyIle: 3.545 ± 0.427
4.463GlyLys: 4.463 ± 0.442
5.339GlyLeu: 5.339 ± 0.641
2.127GlyMet: 2.127 ± 0.263
3.67GlyAsn: 3.67 ± 0.396
1.71GlyPro: 1.71 ± 0.277
3.796GlyGln: 3.796 ± 0.438
3.587GlyArg: 3.587 ± 0.449
4.588GlySer: 4.588 ± 0.486
4.672GlyThr: 4.672 ± 0.591
4.213GlyVal: 4.213 ± 0.407
1.335GlyTrp: 1.335 ± 0.245
2.669GlyTyr: 2.669 ± 0.467
0.0GlyXaa: 0.0 ± 0.0
His
1.418HisAla: 1.418 ± 0.212
0.417HisCys: 0.417 ± 0.163
1.001HisAsp: 1.001 ± 0.225
1.126HisGlu: 1.126 ± 0.26
1.001HisPhe: 1.001 ± 0.219
1.293HisGly: 1.293 ± 0.257
0.709HisHis: 0.709 ± 0.232
1.585HisIle: 1.585 ± 0.329
1.168HisLys: 1.168 ± 0.233
1.919HisLeu: 1.919 ± 0.285
0.459HisMet: 0.459 ± 0.133
0.626HisAsn: 0.626 ± 0.189
1.084HisPro: 1.084 ± 0.252
1.126HisGln: 1.126 ± 0.35
0.792HisArg: 0.792 ± 0.223
0.876HisSer: 0.876 ± 0.194
0.751HisThr: 0.751 ± 0.144
1.502HisVal: 1.502 ± 0.336
0.334HisTrp: 0.334 ± 0.111
0.918HisTyr: 0.918 ± 0.188
0.0HisXaa: 0.0 ± 0.0
Ile
4.755IleAla: 4.755 ± 0.398
0.584IleCys: 0.584 ± 0.18
3.879IleAsp: 3.879 ± 0.427
3.253IleGlu: 3.253 ± 0.355
1.585IlePhe: 1.585 ± 0.299
3.337IleGly: 3.337 ± 0.467
1.418IleHis: 1.418 ± 0.266
3.003IleIle: 3.003 ± 0.529
3.045IleLys: 3.045 ± 0.399
3.42IleLeu: 3.42 ± 0.561
1.001IleMet: 1.001 ± 0.202
2.753IleAsn: 2.753 ± 0.332
2.419IlePro: 2.419 ± 0.323
2.961IleGln: 2.961 ± 0.396
3.087IleArg: 3.087 ± 0.375
3.087IleSer: 3.087 ± 0.324
4.296IleThr: 4.296 ± 0.465
3.17IleVal: 3.17 ± 0.397
0.751IleTrp: 0.751 ± 0.226
1.543IleTyr: 1.543 ± 0.242
0.0IleXaa: 0.0 ± 0.0
Lys
6.84LysAla: 6.84 ± 0.84
0.292LysCys: 0.292 ± 0.117
3.921LysAsp: 3.921 ± 0.434
3.629LysGlu: 3.629 ± 0.62
2.461LysPhe: 2.461 ± 0.336
3.545LysGly: 3.545 ± 0.326
1.043LysHis: 1.043 ± 0.233
2.628LysIle: 2.628 ± 0.361
4.088LysLys: 4.088 ± 0.55
5.756LysLeu: 5.756 ± 0.612
1.752LysMet: 1.752 ± 0.232
2.711LysAsn: 2.711 ± 0.404
2.419LysPro: 2.419 ± 0.384
2.92LysGln: 2.92 ± 0.402
3.045LysArg: 3.045 ± 0.39
2.753LysSer: 2.753 ± 0.336
3.212LysThr: 3.212 ± 0.355
4.213LysVal: 4.213 ± 0.426
0.709LysTrp: 0.709 ± 0.199
1.794LysTyr: 1.794 ± 0.298
0.0LysXaa: 0.0 ± 0.0
Leu
8.926LeuAla: 8.926 ± 0.735
0.834LeuCys: 0.834 ± 0.236
6.048LeuAsp: 6.048 ± 0.575
5.756LeuGlu: 5.756 ± 0.535
2.92LeuPhe: 2.92 ± 0.277
5.631LeuGly: 5.631 ± 0.446
1.627LeuHis: 1.627 ± 0.252
4.129LeuIle: 4.129 ± 0.382
5.339LeuLys: 5.339 ± 0.494
7.508LeuLeu: 7.508 ± 0.626
2.127LeuMet: 2.127 ± 0.247
4.838LeuAsn: 4.838 ± 0.503
4.38LeuPro: 4.38 ± 0.331
3.379LeuGln: 3.379 ± 0.406
4.254LeuArg: 4.254 ± 0.451
5.506LeuSer: 5.506 ± 0.455
5.923LeuThr: 5.923 ± 0.469
5.673LeuVal: 5.673 ± 0.583
1.001LeuTrp: 1.001 ± 0.216
2.377LeuTyr: 2.377 ± 0.323
0.0LeuXaa: 0.0 ± 0.0
Met
3.128MetAla: 3.128 ± 0.375
0.292MetCys: 0.292 ± 0.125
1.919MetAsp: 1.919 ± 0.357
2.086MetGlu: 2.086 ± 0.275
0.918MetPhe: 0.918 ± 0.193
1.835MetGly: 1.835 ± 0.311
0.501MetHis: 0.501 ± 0.156
1.293MetIle: 1.293 ± 0.22
1.335MetLys: 1.335 ± 0.315
2.92MetLeu: 2.92 ± 0.329
0.667MetMet: 0.667 ± 0.172
1.627MetAsn: 1.627 ± 0.254
1.126MetPro: 1.126 ± 0.192
1.084MetGln: 1.084 ± 0.203
0.918MetArg: 0.918 ± 0.192
2.169MetSer: 2.169 ± 0.303
1.251MetThr: 1.251 ± 0.166
2.086MetVal: 2.086 ± 0.335
0.125MetTrp: 0.125 ± 0.082
1.251MetTyr: 1.251 ± 0.217
0.0MetXaa: 0.0 ± 0.0
Asn
4.213AsnAla: 4.213 ± 0.601
0.167AsnCys: 0.167 ± 0.082
2.211AsnAsp: 2.211 ± 0.333
2.628AsnGlu: 2.628 ± 0.376
1.752AsnPhe: 1.752 ± 0.299
3.253AsnGly: 3.253 ± 0.425
1.168AsnHis: 1.168 ± 0.195
2.169AsnIle: 2.169 ± 0.268
3.545AsnLys: 3.545 ± 0.442
4.63AsnLeu: 4.63 ± 0.509
1.251AsnMet: 1.251 ± 0.204
2.294AsnAsn: 2.294 ± 0.32
3.128AsnPro: 3.128 ± 0.545
4.088AsnGln: 4.088 ± 0.494
2.419AsnArg: 2.419 ± 0.274
2.753AsnSer: 2.753 ± 0.321
2.503AsnThr: 2.503 ± 0.384
2.461AsnVal: 2.461 ± 0.34
0.626AsnTrp: 0.626 ± 0.189
1.46AsnTyr: 1.46 ± 0.212
0.0AsnXaa: 0.0 ± 0.0
Pro
5.214ProAla: 5.214 ± 0.51
0.292ProCys: 0.292 ± 0.144
3.045ProAsp: 3.045 ± 0.308
3.796ProGlu: 3.796 ± 0.53
1.668ProPhe: 1.668 ± 0.323
2.753ProGly: 2.753 ± 0.294
0.584ProHis: 0.584 ± 0.203
1.543ProIle: 1.543 ± 0.286
2.795ProLys: 2.795 ± 0.445
3.045ProLeu: 3.045 ± 0.274
1.335ProMet: 1.335 ± 0.22
2.086ProAsn: 2.086 ± 0.336
1.96ProPro: 1.96 ± 0.339
1.585ProGln: 1.585 ± 0.243
1.376ProArg: 1.376 ± 0.263
2.252ProSer: 2.252 ± 0.264
2.753ProThr: 2.753 ± 0.34
4.088ProVal: 4.088 ± 0.397
0.626ProTrp: 0.626 ± 0.199
1.001ProTyr: 1.001 ± 0.164
0.0ProXaa: 0.0 ± 0.0
Gln
5.547GlnAla: 5.547 ± 0.593
0.167GlnCys: 0.167 ± 0.093
2.419GlnAsp: 2.419 ± 0.376
3.629GlnGlu: 3.629 ± 0.359
1.96GlnPhe: 1.96 ± 0.314
3.212GlnGly: 3.212 ± 0.434
0.918GlnHis: 0.918 ± 0.194
2.795GlnIle: 2.795 ± 0.397
2.336GlnLys: 2.336 ± 0.346
5.047GlnLeu: 5.047 ± 0.429
2.211GlnMet: 2.211 ± 0.287
2.544GlnAsn: 2.544 ± 0.441
2.169GlnPro: 2.169 ± 0.484
3.67GlnGln: 3.67 ± 0.618
2.586GlnArg: 2.586 ± 0.4
2.336GlnSer: 2.336 ± 0.317
2.711GlnThr: 2.711 ± 0.318
3.754GlnVal: 3.754 ± 0.419
0.667GlnTrp: 0.667 ± 0.164
1.543GlnTyr: 1.543 ± 0.193
0.0GlnXaa: 0.0 ± 0.0
Arg
3.67ArgAla: 3.67 ± 0.282
0.751ArgCys: 0.751 ± 0.215
3.128ArgAsp: 3.128 ± 0.315
3.17ArgGlu: 3.17 ± 0.391
1.835ArgPhe: 1.835 ± 0.295
2.461ArgGly: 2.461 ± 0.31
0.876ArgHis: 0.876 ± 0.204
2.836ArgIle: 2.836 ± 0.286
2.503ArgLys: 2.503 ± 0.344
5.464ArgLeu: 5.464 ± 0.481
1.251ArgMet: 1.251 ± 0.196
2.753ArgAsn: 2.753 ± 0.353
1.502ArgPro: 1.502 ± 0.247
2.419ArgGln: 2.419 ± 0.401
2.377ArgArg: 2.377 ± 0.435
2.419ArgSer: 2.419 ± 0.296
3.128ArgThr: 3.128 ± 0.309
3.545ArgVal: 3.545 ± 0.327
0.459ArgTrp: 0.459 ± 0.145
1.627ArgTyr: 1.627 ± 0.22
0.0ArgXaa: 0.0 ± 0.0
Ser
5.005SerAla: 5.005 ± 0.511
0.667SerCys: 0.667 ± 0.201
3.379SerAsp: 3.379 ± 0.31
3.462SerGlu: 3.462 ± 0.407
1.418SerPhe: 1.418 ± 0.223
3.962SerGly: 3.962 ± 0.397
1.251SerHis: 1.251 ± 0.245
3.712SerIle: 3.712 ± 0.376
3.837SerLys: 3.837 ± 0.425
5.13SerLeu: 5.13 ± 0.7
1.794SerMet: 1.794 ± 0.347
2.252SerAsn: 2.252 ± 0.315
1.794SerPro: 1.794 ± 0.292
2.628SerGln: 2.628 ± 0.487
3.17SerArg: 3.17 ± 0.442
2.878SerSer: 2.878 ± 0.453
3.045SerThr: 3.045 ± 0.369
3.629SerVal: 3.629 ± 0.455
0.834SerTrp: 0.834 ± 0.187
1.835SerTyr: 1.835 ± 0.326
0.0SerXaa: 0.0 ± 0.0
Thr
5.756ThrAla: 5.756 ± 0.542
0.501ThrCys: 0.501 ± 0.157
3.67ThrAsp: 3.67 ± 0.41
4.088ThrGlu: 4.088 ± 0.526
2.377ThrPhe: 2.377 ± 0.324
4.338ThrGly: 4.338 ± 0.463
0.834ThrHis: 0.834 ± 0.168
3.128ThrIle: 3.128 ± 0.475
3.504ThrLys: 3.504 ± 0.359
4.713ThrLeu: 4.713 ± 0.574
1.084ThrMet: 1.084 ± 0.21
2.669ThrAsn: 2.669 ± 0.407
3.17ThrPro: 3.17 ± 0.35
2.711ThrGln: 2.711 ± 0.394
2.503ThrArg: 2.503 ± 0.27
3.003ThrSer: 3.003 ± 0.503
3.796ThrThr: 3.796 ± 0.546
4.964ThrVal: 4.964 ± 0.628
0.667ThrTrp: 0.667 ± 0.19
1.418ThrTyr: 1.418 ± 0.317
0.0ThrXaa: 0.0 ± 0.0
Val
6.09ValAla: 6.09 ± 0.556
0.626ValCys: 0.626 ± 0.193
3.962ValAsp: 3.962 ± 0.446
4.588ValGlu: 4.588 ± 0.47
2.252ValPhe: 2.252 ± 0.307
4.546ValGly: 4.546 ± 0.447
2.044ValHis: 2.044 ± 0.284
3.587ValIle: 3.587 ± 0.554
4.63ValLys: 4.63 ± 0.446
5.422ValLeu: 5.422 ± 0.388
1.877ValMet: 1.877 ± 0.205
3.17ValAsn: 3.17 ± 0.372
2.753ValPro: 2.753 ± 0.337
2.753ValGln: 2.753 ± 0.357
3.253ValArg: 3.253 ± 0.413
4.63ValSer: 4.63 ± 0.452
4.713ValThr: 4.713 ± 0.651
4.505ValVal: 4.505 ± 0.496
0.792ValTrp: 0.792 ± 0.236
2.211ValTyr: 2.211 ± 0.323
0.0ValXaa: 0.0 ± 0.0
Trp
1.126TrpAla: 1.126 ± 0.225
0.042TrpCys: 0.042 ± 0.039
0.918TrpAsp: 0.918 ± 0.223
0.792TrpGlu: 0.792 ± 0.178
0.792TrpPhe: 0.792 ± 0.199
0.792TrpGly: 0.792 ± 0.162
0.292TrpHis: 0.292 ± 0.104
0.709TrpIle: 0.709 ± 0.193
0.792TrpLys: 0.792 ± 0.208
1.084TrpLeu: 1.084 ± 0.231
0.375TrpMet: 0.375 ± 0.116
0.876TrpAsn: 0.876 ± 0.201
0.375TrpPro: 0.375 ± 0.135
0.542TrpGln: 0.542 ± 0.153
0.667TrpArg: 0.667 ± 0.145
0.626TrpSer: 0.626 ± 0.139
0.542TrpThr: 0.542 ± 0.167
1.001TrpVal: 1.001 ± 0.182
0.209TrpTrp: 0.209 ± 0.11
0.375TrpTyr: 0.375 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.419TyrAla: 2.419 ± 0.302
0.375TyrCys: 0.375 ± 0.134
1.877TyrAsp: 1.877 ± 0.19
1.919TyrGlu: 1.919 ± 0.299
1.001TyrPhe: 1.001 ± 0.239
2.628TyrGly: 2.628 ± 0.337
0.667TyrHis: 0.667 ± 0.208
1.502TyrIle: 1.502 ± 0.234
2.127TyrLys: 2.127 ± 0.454
2.461TyrLeu: 2.461 ± 0.328
0.918TyrMet: 0.918 ± 0.22
1.668TyrAsn: 1.668 ± 0.198
1.46TyrPro: 1.46 ± 0.316
1.877TyrGln: 1.877 ± 0.288
1.21TyrArg: 1.21 ± 0.207
2.044TyrSer: 2.044 ± 0.249
1.543TyrThr: 1.543 ± 0.281
2.169TyrVal: 2.169 ± 0.252
0.542TyrTrp: 0.542 ± 0.146
0.751TyrTyr: 0.751 ± 0.228
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.042XaaHis: 0.042 ± 0.047
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.209XaaXaa: 0.209 ± 0.234
Statistics based on 99 proteins (23976 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski