Amino acid dipepetide frequency for Pseudomonas phage Lana

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.146AlaAla: 13.146 ± 1.572
0.903AlaCys: 0.903 ± 0.199
6.826AlaAsp: 6.826 ± 0.436
7.584AlaGlu: 7.584 ± 0.791
2.781AlaPhe: 2.781 ± 0.357
7.765AlaGly: 7.765 ± 0.997
2.131AlaHis: 2.131 ± 0.309
5.092AlaIle: 5.092 ± 0.474
5.598AlaLys: 5.598 ± 0.605
9.318AlaLeu: 9.318 ± 0.666
3.034AlaMet: 3.034 ± 0.392
3.359AlaAsn: 3.359 ± 0.369
3.323AlaPro: 3.323 ± 0.362
5.056AlaGln: 5.056 ± 0.609
4.912AlaArg: 4.912 ± 0.562
5.49AlaSer: 5.49 ± 0.693
6.031AlaThr: 6.031 ± 0.58
6.465AlaVal: 6.465 ± 0.474
1.192AlaTrp: 1.192 ± 0.258
2.131AlaTyr: 2.131 ± 0.247
0.0AlaXaa: 0.0 ± 0.0
Cys
0.542CysAla: 0.542 ± 0.167
0.217CysCys: 0.217 ± 0.08
0.831CysAsp: 0.831 ± 0.162
0.722CysGlu: 0.722 ± 0.193
0.397CysPhe: 0.397 ± 0.115
0.903CysGly: 0.903 ± 0.23
0.253CysHis: 0.253 ± 0.101
0.361CysIle: 0.361 ± 0.116
0.722CysLys: 0.722 ± 0.132
1.12CysLeu: 1.12 ± 0.265
0.181CysMet: 0.181 ± 0.079
0.47CysAsn: 0.47 ± 0.115
0.722CysPro: 0.722 ± 0.173
0.65CysGln: 0.65 ± 0.171
0.795CysArg: 0.795 ± 0.195
0.433CysSer: 0.433 ± 0.132
0.578CysThr: 0.578 ± 0.158
0.614CysVal: 0.614 ± 0.148
0.253CysTrp: 0.253 ± 0.087
0.361CysTyr: 0.361 ± 0.126
0.0CysXaa: 0.0 ± 0.0
Asp
5.237AspAla: 5.237 ± 0.501
0.795AspCys: 0.795 ± 0.209
3.72AspAsp: 3.72 ± 0.435
4.623AspGlu: 4.623 ± 0.566
2.709AspPhe: 2.709 ± 0.334
5.49AspGly: 5.49 ± 0.532
1.589AspHis: 1.589 ± 0.317
2.998AspIle: 2.998 ± 0.339
4.226AspLys: 4.226 ± 0.591
5.706AspLeu: 5.706 ± 0.583
1.842AspMet: 1.842 ± 0.277
1.986AspAsn: 1.986 ± 0.226
2.925AspPro: 2.925 ± 0.31
1.914AspGln: 1.914 ± 0.249
2.709AspArg: 2.709 ± 0.331
3.07AspSer: 3.07 ± 0.308
3.9AspThr: 3.9 ± 0.349
3.973AspVal: 3.973 ± 0.378
1.047AspTrp: 1.047 ± 0.216
1.734AspTyr: 1.734 ± 0.228
0.0AspXaa: 0.0 ± 0.0
Glu
7.331GluAla: 7.331 ± 0.801
0.758GluCys: 0.758 ± 0.205
2.889GluAsp: 2.889 ± 0.323
4.984GluGlu: 4.984 ± 0.699
2.564GluPhe: 2.564 ± 0.338
4.876GluGly: 4.876 ± 0.371
1.445GluHis: 1.445 ± 0.237
3.575GluIle: 3.575 ± 0.416
3.648GluLys: 3.648 ± 0.423
6.14GluLeu: 6.14 ± 0.547
2.384GluMet: 2.384 ± 0.405
1.77GluAsn: 1.77 ± 0.293
2.853GluPro: 2.853 ± 0.46
3.395GluGln: 3.395 ± 0.356
3.287GluArg: 3.287 ± 0.421
3.034GluSer: 3.034 ± 0.317
3.395GluThr: 3.395 ± 0.301
5.201GluVal: 5.201 ± 0.444
0.939GluTrp: 0.939 ± 0.208
1.445GluTyr: 1.445 ± 0.251
0.0GluXaa: 0.0 ± 0.0
Phe
2.998PheAla: 2.998 ± 0.294
0.397PheCys: 0.397 ± 0.149
2.889PheAsp: 2.889 ± 0.285
2.095PheGlu: 2.095 ± 0.337
0.939PhePhe: 0.939 ± 0.182
2.781PheGly: 2.781 ± 0.292
0.686PheHis: 0.686 ± 0.181
1.517PheIle: 1.517 ± 0.271
2.059PheLys: 2.059 ± 0.283
2.203PheLeu: 2.203 ± 0.331
0.722PheMet: 0.722 ± 0.159
1.372PheAsn: 1.372 ± 0.223
1.156PhePro: 1.156 ± 0.251
0.939PheGln: 0.939 ± 0.152
1.914PheArg: 1.914 ± 0.251
2.817PheSer: 2.817 ± 0.378
2.348PheThr: 2.348 ± 0.266
2.239PheVal: 2.239 ± 0.269
0.289PheTrp: 0.289 ± 0.098
1.625PheTyr: 1.625 ± 0.22
0.0PheXaa: 0.0 ± 0.0
Gly
5.345GlyAla: 5.345 ± 0.362
0.939GlyCys: 0.939 ± 0.23
4.478GlyAsp: 4.478 ± 0.458
3.973GlyGlu: 3.973 ± 0.431
2.925GlyPhe: 2.925 ± 0.391
5.598GlyGly: 5.598 ± 0.659
1.625GlyHis: 1.625 ± 0.25
3.287GlyIle: 3.287 ± 0.288
4.587GlyLys: 4.587 ± 0.418
5.995GlyLeu: 5.995 ± 0.561
2.42GlyMet: 2.42 ± 0.252
2.817GlyAsn: 2.817 ± 0.454
1.806GlyPro: 1.806 ± 0.282
2.853GlyGln: 2.853 ± 0.363
3.359GlyArg: 3.359 ± 0.384
5.49GlySer: 5.49 ± 0.66
5.526GlyThr: 5.526 ± 0.749
5.201GlyVal: 5.201 ± 0.458
1.336GlyTrp: 1.336 ± 0.187
2.131GlyTyr: 2.131 ± 0.33
0.0GlyXaa: 0.0 ± 0.0
His
2.311HisAla: 2.311 ± 0.352
0.506HisCys: 0.506 ± 0.131
1.264HisAsp: 1.264 ± 0.268
1.047HisGlu: 1.047 ± 0.212
1.047HisPhe: 1.047 ± 0.232
1.734HisGly: 1.734 ± 0.332
0.758HisHis: 0.758 ± 0.218
0.722HisIle: 0.722 ± 0.147
0.831HisLys: 0.831 ± 0.207
2.022HisLeu: 2.022 ± 0.365
0.578HisMet: 0.578 ± 0.158
1.156HisAsn: 1.156 ± 0.22
1.336HisPro: 1.336 ± 0.208
0.795HisGln: 0.795 ± 0.158
1.372HisArg: 1.372 ± 0.226
1.228HisSer: 1.228 ± 0.237
0.758HisThr: 0.758 ± 0.15
1.011HisVal: 1.011 ± 0.152
0.542HisTrp: 0.542 ± 0.133
0.867HisTyr: 0.867 ± 0.179
0.0HisXaa: 0.0 ± 0.0
Ile
4.948IleAla: 4.948 ± 0.355
0.325IleCys: 0.325 ± 0.103
3.323IleAsp: 3.323 ± 0.469
3.864IleGlu: 3.864 ± 0.337
1.336IlePhe: 1.336 ± 0.232
2.998IleGly: 2.998 ± 0.365
1.264IleHis: 1.264 ± 0.253
1.625IleIle: 1.625 ± 0.249
2.564IleLys: 2.564 ± 0.326
3.684IleLeu: 3.684 ± 0.319
0.939IleMet: 0.939 ± 0.163
2.095IleAsn: 2.095 ± 0.34
2.275IlePro: 2.275 ± 0.279
2.239IleGln: 2.239 ± 0.23
2.673IleArg: 2.673 ± 0.354
2.492IleSer: 2.492 ± 0.314
4.587IleThr: 4.587 ± 0.977
2.528IleVal: 2.528 ± 0.271
0.578IleTrp: 0.578 ± 0.143
1.3IleTyr: 1.3 ± 0.241
0.0IleXaa: 0.0 ± 0.0
Lys
7.115LysAla: 7.115 ± 0.459
0.614LysCys: 0.614 ± 0.229
3.828LysAsp: 3.828 ± 0.484
4.298LysGlu: 4.298 ± 0.496
1.661LysPhe: 1.661 ± 0.278
3.72LysGly: 3.72 ± 0.463
1.372LysHis: 1.372 ± 0.194
2.311LysIle: 2.311 ± 0.321
3.467LysLys: 3.467 ± 0.5
5.598LysLeu: 5.598 ± 0.516
1.661LysMet: 1.661 ± 0.362
1.95LysAsn: 1.95 ± 0.315
2.42LysPro: 2.42 ± 0.323
2.095LysGln: 2.095 ± 0.313
2.564LysArg: 2.564 ± 0.32
2.492LysSer: 2.492 ± 0.342
3.323LysThr: 3.323 ± 0.363
3.937LysVal: 3.937 ± 0.46
0.65LysTrp: 0.65 ± 0.185
1.77LysTyr: 1.77 ± 0.215
0.0LysXaa: 0.0 ± 0.0
Leu
9.173LeuAla: 9.173 ± 0.923
0.903LeuCys: 0.903 ± 0.183
5.851LeuAsp: 5.851 ± 0.526
5.453LeuGlu: 5.453 ± 0.429
2.817LeuPhe: 2.817 ± 0.343
5.092LeuGly: 5.092 ± 0.401
1.697LeuHis: 1.697 ± 0.264
4.478LeuIle: 4.478 ± 0.406
5.598LeuLys: 5.598 ± 0.515
6.681LeuLeu: 6.681 ± 0.52
2.348LeuMet: 2.348 ± 0.342
3.684LeuAsn: 3.684 ± 0.411
4.262LeuPro: 4.262 ± 0.385
3.937LeuGln: 3.937 ± 0.452
5.128LeuArg: 5.128 ± 0.491
5.526LeuSer: 5.526 ± 0.471
6.284LeuThr: 6.284 ± 0.818
6.284LeuVal: 6.284 ± 0.524
0.939LeuTrp: 0.939 ± 0.168
1.986LeuTyr: 1.986 ± 0.256
0.0LeuXaa: 0.0 ± 0.0
Met
3.034MetAla: 3.034 ± 0.399
0.253MetCys: 0.253 ± 0.102
1.336MetAsp: 1.336 ± 0.241
1.661MetGlu: 1.661 ± 0.27
1.011MetPhe: 1.011 ± 0.209
1.553MetGly: 1.553 ± 0.219
0.433MetHis: 0.433 ± 0.155
1.481MetIle: 1.481 ± 0.217
1.986MetLys: 1.986 ± 0.272
1.914MetLeu: 1.914 ± 0.269
0.686MetMet: 0.686 ± 0.168
1.264MetAsn: 1.264 ± 0.224
1.156MetPro: 1.156 ± 0.214
1.156MetGln: 1.156 ± 0.175
1.445MetArg: 1.445 ± 0.223
1.986MetSer: 1.986 ± 0.331
2.564MetThr: 2.564 ± 0.368
1.878MetVal: 1.878 ± 0.239
0.47MetTrp: 0.47 ± 0.119
0.758MetTyr: 0.758 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
4.081AsnAla: 4.081 ± 0.409
0.325AsnCys: 0.325 ± 0.096
1.95AsnAsp: 1.95 ± 0.247
1.806AsnGlu: 1.806 ± 0.246
1.264AsnPhe: 1.264 ± 0.182
3.287AsnGly: 3.287 ± 0.502
0.831AsnHis: 0.831 ± 0.219
2.203AsnIle: 2.203 ± 0.304
1.661AsnLys: 1.661 ± 0.256
3.503AsnLeu: 3.503 ± 0.38
0.831AsnMet: 0.831 ± 0.159
1.625AsnAsn: 1.625 ± 0.255
1.986AsnPro: 1.986 ± 0.268
0.686AsnGln: 0.686 ± 0.139
1.372AsnArg: 1.372 ± 0.245
2.239AsnSer: 2.239 ± 0.392
2.564AsnThr: 2.564 ± 0.458
3.142AsnVal: 3.142 ± 0.31
0.65AsnTrp: 0.65 ± 0.117
1.047AsnTyr: 1.047 ± 0.18
0.0AsnXaa: 0.0 ± 0.0
Pro
4.406ProAla: 4.406 ± 0.452
0.47ProCys: 0.47 ± 0.139
2.961ProAsp: 2.961 ± 0.318
3.359ProGlu: 3.359 ± 0.439
1.192ProPhe: 1.192 ± 0.218
3.323ProGly: 3.323 ± 0.359
0.975ProHis: 0.975 ± 0.214
1.95ProIle: 1.95 ± 0.316
1.842ProLys: 1.842 ± 0.285
2.709ProLeu: 2.709 ± 0.422
1.192ProMet: 1.192 ± 0.219
1.228ProAsn: 1.228 ± 0.208
1.734ProPro: 1.734 ± 0.306
1.192ProGln: 1.192 ± 0.224
2.167ProArg: 2.167 ± 0.351
2.348ProSer: 2.348 ± 0.402
3.539ProThr: 3.539 ± 0.308
3.467ProVal: 3.467 ± 0.402
0.758ProTrp: 0.758 ± 0.17
1.372ProTyr: 1.372 ± 0.231
0.0ProXaa: 0.0 ± 0.0
Gln
4.551GlnAla: 4.551 ± 0.42
0.542GlnCys: 0.542 ± 0.161
1.589GlnAsp: 1.589 ± 0.218
2.203GlnGlu: 2.203 ± 0.266
1.336GlnPhe: 1.336 ± 0.209
2.817GlnGly: 2.817 ± 0.283
0.975GlnHis: 0.975 ± 0.193
2.275GlnIle: 2.275 ± 0.288
2.095GlnLys: 2.095 ± 0.431
5.201GlnLeu: 5.201 ± 0.544
1.192GlnMet: 1.192 ± 0.204
0.867GlnAsn: 0.867 ± 0.207
1.77GlnPro: 1.77 ± 0.255
1.734GlnGln: 1.734 ± 0.26
2.673GlnArg: 2.673 ± 0.396
1.842GlnSer: 1.842 ± 0.302
1.878GlnThr: 1.878 ± 0.318
3.756GlnVal: 3.756 ± 0.369
0.578GlnTrp: 0.578 ± 0.149
0.65GlnTyr: 0.65 ± 0.146
0.0GlnXaa: 0.0 ± 0.0
Arg
4.406ArgAla: 4.406 ± 0.525
0.433ArgCys: 0.433 ± 0.151
3.25ArgAsp: 3.25 ± 0.425
2.853ArgGlu: 2.853 ± 0.347
2.167ArgPhe: 2.167 ± 0.216
3.106ArgGly: 3.106 ± 0.356
1.409ArgHis: 1.409 ± 0.236
2.817ArgIle: 2.817 ± 0.412
3.034ArgLys: 3.034 ± 0.367
5.67ArgLeu: 5.67 ± 0.481
1.625ArgMet: 1.625 ± 0.239
1.842ArgAsn: 1.842 ± 0.239
2.131ArgPro: 2.131 ± 0.339
2.781ArgGln: 2.781 ± 0.389
2.853ArgArg: 2.853 ± 0.401
2.492ArgSer: 2.492 ± 0.301
2.925ArgThr: 2.925 ± 0.456
3.756ArgVal: 3.756 ± 0.633
0.939ArgTrp: 0.939 ± 0.24
1.697ArgTyr: 1.697 ± 0.261
0.0ArgXaa: 0.0 ± 0.0
Ser
6.031SerAla: 6.031 ± 0.849
0.65SerCys: 0.65 ± 0.188
3.359SerAsp: 3.359 ± 0.451
3.72SerGlu: 3.72 ± 0.447
2.311SerPhe: 2.311 ± 0.223
4.262SerGly: 4.262 ± 0.48
0.758SerHis: 0.758 ± 0.14
2.817SerIle: 2.817 ± 0.322
2.636SerLys: 2.636 ± 0.359
5.562SerLeu: 5.562 ± 0.667
1.589SerMet: 1.589 ± 0.247
2.636SerAsn: 2.636 ± 0.272
2.022SerPro: 2.022 ± 0.333
2.167SerGln: 2.167 ± 0.392
2.998SerArg: 2.998 ± 0.33
3.323SerSer: 3.323 ± 0.646
3.792SerThr: 3.792 ± 0.654
4.045SerVal: 4.045 ± 0.377
0.975SerTrp: 0.975 ± 0.173
1.842SerTyr: 1.842 ± 0.263
0.0SerXaa: 0.0 ± 0.0
Thr
6.826ThrAla: 6.826 ± 0.836
0.795ThrCys: 0.795 ± 0.232
4.262ThrAsp: 4.262 ± 0.407
4.37ThrGlu: 4.37 ± 0.453
2.167ThrPhe: 2.167 ± 0.233
4.37ThrGly: 4.37 ± 0.412
1.481ThrHis: 1.481 ± 0.248
3.214ThrIle: 3.214 ± 0.59
3.142ThrLys: 3.142 ± 0.339
6.212ThrLeu: 6.212 ± 0.662
1.336ThrMet: 1.336 ± 0.241
2.528ThrAsn: 2.528 ± 0.49
2.925ThrPro: 2.925 ± 0.349
2.275ThrGln: 2.275 ± 0.293
3.359ThrArg: 3.359 ± 0.547
4.478ThrSer: 4.478 ± 1.147
4.767ThrThr: 4.767 ± 0.872
5.165ThrVal: 5.165 ± 0.578
0.903ThrTrp: 0.903 ± 0.173
1.806ThrTyr: 1.806 ± 0.275
0.0ThrXaa: 0.0 ± 0.0
Val
7.223ValAla: 7.223 ± 0.586
0.722ValCys: 0.722 ± 0.178
4.298ValAsp: 4.298 ± 0.377
4.731ValGlu: 4.731 ± 0.373
1.842ValPhe: 1.842 ± 0.207
5.128ValGly: 5.128 ± 0.346
1.156ValHis: 1.156 ± 0.243
2.781ValIle: 2.781 ± 0.339
4.623ValLys: 4.623 ± 0.58
5.056ValLeu: 5.056 ± 0.399
2.239ValMet: 2.239 ± 0.308
2.636ValAsn: 2.636 ± 0.419
3.539ValPro: 3.539 ± 0.543
2.961ValGln: 2.961 ± 0.338
3.648ValArg: 3.648 ± 0.329
4.334ValSer: 4.334 ± 0.4
4.948ValThr: 4.948 ± 0.696
5.128ValVal: 5.128 ± 0.426
1.372ValTrp: 1.372 ± 0.295
2.275ValTyr: 2.275 ± 0.33
0.0ValXaa: 0.0 ± 0.0
Trp
1.192TrpAla: 1.192 ± 0.197
0.144TrpCys: 0.144 ± 0.083
1.156TrpAsp: 1.156 ± 0.183
1.011TrpGlu: 1.011 ± 0.174
0.433TrpPhe: 0.433 ± 0.11
1.011TrpGly: 1.011 ± 0.159
0.47TrpHis: 0.47 ± 0.136
0.686TrpIle: 0.686 ± 0.174
1.083TrpLys: 1.083 ± 0.234
1.481TrpLeu: 1.481 ± 0.285
0.65TrpMet: 0.65 ± 0.129
0.614TrpAsn: 0.614 ± 0.128
0.578TrpPro: 0.578 ± 0.151
0.614TrpGln: 0.614 ± 0.151
0.975TrpArg: 0.975 ± 0.21
0.722TrpSer: 0.722 ± 0.158
0.867TrpThr: 0.867 ± 0.206
1.011TrpVal: 1.011 ± 0.233
0.289TrpTrp: 0.289 ± 0.115
0.47TrpTyr: 0.47 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.239TyrAla: 2.239 ± 0.288
0.433TyrCys: 0.433 ± 0.135
2.131TyrAsp: 2.131 ± 0.264
1.553TyrGlu: 1.553 ± 0.3
1.083TyrPhe: 1.083 ± 0.197
1.878TyrGly: 1.878 ± 0.249
0.578TyrHis: 0.578 ± 0.151
1.445TyrIle: 1.445 ± 0.24
1.445TyrLys: 1.445 ± 0.186
2.384TyrLeu: 2.384 ± 0.302
0.542TyrMet: 0.542 ± 0.134
1.156TyrAsn: 1.156 ± 0.234
1.264TyrPro: 1.264 ± 0.252
1.083TyrGln: 1.083 ± 0.201
1.986TyrArg: 1.986 ± 0.272
1.661TyrSer: 1.661 ± 0.256
1.806TyrThr: 1.806 ± 0.289
1.842TyrVal: 1.842 ± 0.309
0.758TyrTrp: 0.758 ± 0.145
1.264TyrTyr: 1.264 ± 0.232
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 132 proteins (27690 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski