Amino acid dipepetide frequency for Pseudomonas phage phiAH14a

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.332AlaAla: 15.332 ± 1.646
1.175AlaCys: 1.175 ± 0.314
7.226AlaAsp: 7.226 ± 0.675
9.34AlaGlu: 9.34 ± 1.37
3.525AlaPhe: 3.525 ± 0.446
9.105AlaGly: 9.105 ± 0.923
1.469AlaHis: 1.469 ± 0.31
5.933AlaIle: 5.933 ± 0.604
6.109AlaLys: 6.109 ± 0.684
10.691AlaLeu: 10.691 ± 0.879
4.406AlaMet: 4.406 ± 0.483
3.29AlaAsn: 3.29 ± 0.474
3.936AlaPro: 3.936 ± 0.537
4.641AlaGln: 4.641 ± 0.701
6.051AlaArg: 6.051 ± 0.667
5.816AlaSer: 5.816 ± 0.59
6.344AlaThr: 6.344 ± 0.692
7.813AlaVal: 7.813 ± 0.798
2.232AlaTrp: 2.232 ± 0.406
3.466AlaTyr: 3.466 ± 0.516
0.0AlaXaa: 0.0 ± 0.0
Cys
1.527CysAla: 1.527 ± 0.407
0.294CysCys: 0.294 ± 0.155
0.94CysAsp: 0.94 ± 0.253
0.587CysGlu: 0.587 ± 0.154
0.47CysPhe: 0.47 ± 0.165
1.527CysGly: 1.527 ± 0.383
0.294CysHis: 0.294 ± 0.134
0.529CysIle: 0.529 ± 0.232
0.764CysLys: 0.764 ± 0.269
1.175CysLeu: 1.175 ± 0.339
0.294CysMet: 0.294 ± 0.152
0.646CysAsn: 0.646 ± 0.187
0.47CysPro: 0.47 ± 0.175
0.587CysGln: 0.587 ± 0.199
1.116CysArg: 1.116 ± 0.278
0.646CysSer: 0.646 ± 0.199
0.47CysThr: 0.47 ± 0.169
1.057CysVal: 1.057 ± 0.244
0.411CysTrp: 0.411 ± 0.162
0.411CysTyr: 0.411 ± 0.165
0.0CysXaa: 0.0 ± 0.0
Asp
5.581AspAla: 5.581 ± 0.542
1.116AspCys: 1.116 ± 0.238
3.407AspAsp: 3.407 ± 0.56
4.288AspGlu: 4.288 ± 0.586
1.469AspPhe: 1.469 ± 0.276
5.757AspGly: 5.757 ± 0.794
0.764AspHis: 0.764 ± 0.271
3.407AspIle: 3.407 ± 0.375
2.82AspLys: 2.82 ± 0.346
5.111AspLeu: 5.111 ± 0.465
1.234AspMet: 1.234 ± 0.272
1.997AspAsn: 1.997 ± 0.361
2.35AspPro: 2.35 ± 0.374
2.996AspGln: 2.996 ± 0.391
3.113AspArg: 3.113 ± 0.431
3.29AspSer: 3.29 ± 0.385
2.937AspThr: 2.937 ± 0.412
3.583AspVal: 3.583 ± 0.53
0.999AspTrp: 0.999 ± 0.3
1.997AspTyr: 1.997 ± 0.315
0.0AspXaa: 0.0 ± 0.0
Glu
8.165GluAla: 8.165 ± 0.88
0.705GluCys: 0.705 ± 0.23
3.29GluAsp: 3.29 ± 0.485
3.936GluGlu: 3.936 ± 0.434
2.056GluPhe: 2.056 ± 0.268
3.466GluGly: 3.466 ± 0.423
0.999GluHis: 0.999 ± 0.246
4.406GluIle: 4.406 ± 0.504
3.818GluLys: 3.818 ± 0.434
6.403GluLeu: 6.403 ± 0.68
1.116GluMet: 1.116 ± 0.287
1.704GluAsn: 1.704 ± 0.302
2.056GluPro: 2.056 ± 0.334
3.525GluGln: 3.525 ± 0.661
5.228GluArg: 5.228 ± 0.713
4.23GluSer: 4.23 ± 0.641
2.82GluThr: 2.82 ± 0.478
4.053GluVal: 4.053 ± 0.473
1.41GluTrp: 1.41 ± 0.31
2.115GluTyr: 2.115 ± 0.353
0.0GluXaa: 0.0 ± 0.0
Phe
3.172PheAla: 3.172 ± 0.416
0.411PheCys: 0.411 ± 0.181
1.704PheAsp: 1.704 ± 0.296
2.232PheGlu: 2.232 ± 0.404
0.705PhePhe: 0.705 ± 0.184
3.231PheGly: 3.231 ± 0.434
0.47PheHis: 0.47 ± 0.169
1.586PheIle: 1.586 ± 0.268
2.115PheLys: 2.115 ± 0.413
1.997PheLeu: 1.997 ± 0.313
0.47PheMet: 0.47 ± 0.175
1.469PheAsn: 1.469 ± 0.266
1.351PhePro: 1.351 ± 0.301
1.645PheGln: 1.645 ± 0.314
2.35PheArg: 2.35 ± 0.396
1.41PheSer: 1.41 ± 0.288
1.88PheThr: 1.88 ± 0.345
1.762PheVal: 1.762 ± 0.33
0.705PheTrp: 0.705 ± 0.23
0.822PheTyr: 0.822 ± 0.234
0.0PheXaa: 0.0 ± 0.0
Gly
8.4GlyAla: 8.4 ± 0.841
1.292GlyCys: 1.292 ± 0.297
4.112GlyAsp: 4.112 ± 0.475
5.228GlyGlu: 5.228 ± 0.656
3.583GlyPhe: 3.583 ± 0.536
6.462GlyGly: 6.462 ± 0.648
2.056GlyHis: 2.056 ± 0.443
3.818GlyIle: 3.818 ± 0.407
4.817GlyLys: 4.817 ± 0.571
6.756GlyLeu: 6.756 ± 0.655
2.174GlyMet: 2.174 ± 0.356
2.409GlyAsn: 2.409 ± 0.285
2.291GlyPro: 2.291 ± 0.379
3.995GlyGln: 3.995 ± 0.459
4.347GlyArg: 4.347 ± 0.532
4.758GlySer: 4.758 ± 0.559
3.29GlyThr: 3.29 ± 0.392
5.346GlyVal: 5.346 ± 0.701
1.645GlyTrp: 1.645 ± 0.29
2.056GlyTyr: 2.056 ± 0.333
0.0GlyXaa: 0.0 ± 0.0
His
2.115HisAla: 2.115 ± 0.37
0.411HisCys: 0.411 ± 0.157
0.881HisAsp: 0.881 ± 0.228
0.822HisGlu: 0.822 ± 0.263
0.47HisPhe: 0.47 ± 0.182
1.762HisGly: 1.762 ± 0.376
0.235HisHis: 0.235 ± 0.12
0.646HisIle: 0.646 ± 0.224
0.646HisLys: 0.646 ± 0.181
0.999HisLeu: 0.999 ± 0.279
0.352HisMet: 0.352 ± 0.13
0.352HisAsn: 0.352 ± 0.148
0.764HisPro: 0.764 ± 0.275
0.705HisGln: 0.705 ± 0.23
1.057HisArg: 1.057 ± 0.232
1.175HisSer: 1.175 ± 0.342
0.587HisThr: 0.587 ± 0.194
0.822HisVal: 0.822 ± 0.202
0.529HisTrp: 0.529 ± 0.182
0.47HisTyr: 0.47 ± 0.166
0.0HisXaa: 0.0 ± 0.0
Ile
6.521IleAla: 6.521 ± 0.726
0.764IleCys: 0.764 ± 0.18
3.877IleAsp: 3.877 ± 0.463
3.231IleGlu: 3.231 ± 0.385
0.999IlePhe: 0.999 ± 0.244
4.23IleGly: 4.23 ± 0.604
0.646IleHis: 0.646 ± 0.19
2.291IleIle: 2.291 ± 0.337
2.761IleLys: 2.761 ± 0.347
2.937IleLeu: 2.937 ± 0.398
0.529IleMet: 0.529 ± 0.236
2.35IleAsn: 2.35 ± 0.338
2.526IlePro: 2.526 ± 0.41
2.291IleGln: 2.291 ± 0.377
3.466IleArg: 3.466 ± 0.564
2.937IleSer: 2.937 ± 0.673
3.407IleThr: 3.407 ± 0.425
2.643IleVal: 2.643 ± 0.426
0.764IleTrp: 0.764 ± 0.24
1.527IleTyr: 1.527 ± 0.279
0.0IleXaa: 0.0 ± 0.0
Lys
6.344LysAla: 6.344 ± 0.76
0.587LysCys: 0.587 ± 0.196
3.29LysAsp: 3.29 ± 0.503
2.761LysGlu: 2.761 ± 0.397
1.234LysPhe: 1.234 ± 0.255
3.995LysGly: 3.995 ± 0.487
0.822LysHis: 0.822 ± 0.214
1.997LysIle: 1.997 ± 0.316
2.643LysLys: 2.643 ± 0.412
5.052LysLeu: 5.052 ± 0.564
1.116LysMet: 1.116 ± 0.242
1.527LysAsn: 1.527 ± 0.315
3.525LysPro: 3.525 ± 0.443
2.526LysGln: 2.526 ± 0.426
3.466LysArg: 3.466 ± 0.493
3.231LysSer: 3.231 ± 0.434
3.877LysThr: 3.877 ± 0.57
2.82LysVal: 2.82 ± 0.577
0.764LysTrp: 0.764 ± 0.26
1.057LysTyr: 1.057 ± 0.256
0.0LysXaa: 0.0 ± 0.0
Leu
9.869LeuAla: 9.869 ± 0.883
1.116LeuCys: 1.116 ± 0.263
5.052LeuAsp: 5.052 ± 0.496
5.639LeuGlu: 5.639 ± 0.418
2.291LeuPhe: 2.291 ± 0.289
5.581LeuGly: 5.581 ± 0.51
0.999LeuHis: 0.999 ± 0.292
4.7LeuIle: 4.7 ± 0.52
4.347LeuLys: 4.347 ± 0.638
5.169LeuLeu: 5.169 ± 0.491
1.704LeuMet: 1.704 ± 0.356
1.939LeuAsn: 1.939 ± 0.284
4.582LeuPro: 4.582 ± 0.707
3.29LeuGln: 3.29 ± 0.385
5.698LeuArg: 5.698 ± 0.647
5.052LeuSer: 5.052 ± 0.51
5.404LeuThr: 5.404 ± 0.617
5.052LeuVal: 5.052 ± 0.449
0.999LeuTrp: 0.999 ± 0.219
1.939LeuTyr: 1.939 ± 0.337
0.0LeuXaa: 0.0 ± 0.0
Met
2.702MetAla: 2.702 ± 0.385
0.235MetCys: 0.235 ± 0.108
1.057MetAsp: 1.057 ± 0.273
1.116MetGlu: 1.116 ± 0.3
0.646MetPhe: 0.646 ± 0.178
1.292MetGly: 1.292 ± 0.269
0.352MetHis: 0.352 ± 0.138
1.175MetIle: 1.175 ± 0.3
1.41MetLys: 1.41 ± 0.282
1.527MetLeu: 1.527 ± 0.283
0.587MetMet: 0.587 ± 0.179
1.234MetAsn: 1.234 ± 0.272
1.351MetPro: 1.351 ± 0.29
1.175MetGln: 1.175 ± 0.275
2.174MetArg: 2.174 ± 0.374
2.232MetSer: 2.232 ± 0.294
2.585MetThr: 2.585 ± 0.366
1.469MetVal: 1.469 ± 0.241
0.235MetTrp: 0.235 ± 0.114
0.411MetTyr: 0.411 ± 0.177
0.0MetXaa: 0.0 ± 0.0
Asn
3.936AsnAla: 3.936 ± 0.46
0.47AsnCys: 0.47 ± 0.168
1.586AsnAsp: 1.586 ± 0.268
1.939AsnGlu: 1.939 ± 0.351
1.175AsnPhe: 1.175 ± 0.26
2.937AsnGly: 2.937 ± 0.417
0.529AsnHis: 0.529 ± 0.224
1.527AsnIle: 1.527 ± 0.242
1.351AsnLys: 1.351 ± 0.277
2.643AsnLeu: 2.643 ± 0.374
0.999AsnMet: 0.999 ± 0.236
1.116AsnAsn: 1.116 ± 0.283
2.643AsnPro: 2.643 ± 0.315
1.292AsnGln: 1.292 ± 0.287
1.821AsnArg: 1.821 ± 0.27
1.997AsnSer: 1.997 ± 0.341
1.762AsnThr: 1.762 ± 0.331
1.88AsnVal: 1.88 ± 0.308
0.646AsnTrp: 0.646 ± 0.178
1.41AsnTyr: 1.41 ± 0.288
0.0AsnXaa: 0.0 ± 0.0
Pro
5.639ProAla: 5.639 ± 0.72
0.411ProCys: 0.411 ± 0.156
2.526ProAsp: 2.526 ± 0.372
3.701ProGlu: 3.701 ± 0.509
1.234ProPhe: 1.234 ± 0.254
4.112ProGly: 4.112 ± 0.57
0.764ProHis: 0.764 ± 0.21
1.939ProIle: 1.939 ± 0.34
2.467ProLys: 2.467 ± 0.541
3.466ProLeu: 3.466 ± 0.405
1.527ProMet: 1.527 ± 0.316
1.057ProAsn: 1.057 ± 0.304
3.055ProPro: 3.055 ± 0.481
1.351ProGln: 1.351 ± 0.287
2.056ProArg: 2.056 ± 0.339
2.937ProSer: 2.937 ± 0.394
2.526ProThr: 2.526 ± 0.392
4.112ProVal: 4.112 ± 0.578
0.94ProTrp: 0.94 ± 0.244
0.822ProTyr: 0.822 ± 0.219
0.0ProXaa: 0.0 ± 0.0
Gln
6.521GlnAla: 6.521 ± 0.915
0.94GlnCys: 0.94 ± 0.275
1.821GlnAsp: 1.821 ± 0.357
2.467GlnGlu: 2.467 ± 0.354
1.704GlnPhe: 1.704 ± 0.294
3.407GlnGly: 3.407 ± 0.505
0.646GlnHis: 0.646 ± 0.183
2.467GlnIle: 2.467 ± 0.308
1.704GlnLys: 1.704 ± 0.401
2.585GlnLeu: 2.585 ± 0.444
1.762GlnMet: 1.762 ± 0.317
1.234GlnAsn: 1.234 ± 0.367
2.056GlnPro: 2.056 ± 0.366
2.526GlnGln: 2.526 ± 0.487
3.29GlnArg: 3.29 ± 0.489
1.88GlnSer: 1.88 ± 0.373
2.467GlnThr: 2.467 ± 0.368
3.348GlnVal: 3.348 ± 0.453
0.411GlnTrp: 0.411 ± 0.154
1.057GlnTyr: 1.057 ± 0.217
0.0GlnXaa: 0.0 ± 0.0
Arg
6.579ArgAla: 6.579 ± 0.611
0.705ArgCys: 0.705 ± 0.22
4.288ArgAsp: 4.288 ± 0.616
3.936ArgGlu: 3.936 ± 0.479
2.232ArgPhe: 2.232 ± 0.505
4.171ArgGly: 4.171 ± 0.487
1.351ArgHis: 1.351 ± 0.318
2.526ArgIle: 2.526 ± 0.445
3.76ArgLys: 3.76 ± 0.564
5.992ArgLeu: 5.992 ± 0.507
1.292ArgMet: 1.292 ± 0.191
2.585ArgAsn: 2.585 ± 0.48
2.526ArgPro: 2.526 ± 0.474
2.761ArgGln: 2.761 ± 0.564
3.701ArgArg: 3.701 ± 0.484
3.407ArgSer: 3.407 ± 0.598
3.995ArgThr: 3.995 ± 0.47
3.936ArgVal: 3.936 ± 0.459
0.47ArgTrp: 0.47 ± 0.199
1.527ArgTyr: 1.527 ± 0.28
0.0ArgXaa: 0.0 ± 0.0
Ser
7.284SerAla: 7.284 ± 1.062
0.999SerCys: 0.999 ± 0.332
3.231SerAsp: 3.231 ± 0.483
3.877SerGlu: 3.877 ± 0.521
2.174SerPhe: 2.174 ± 0.371
5.111SerGly: 5.111 ± 0.605
0.764SerHis: 0.764 ± 0.216
3.76SerIle: 3.76 ± 0.467
2.937SerLys: 2.937 ± 0.396
4.935SerLeu: 4.935 ± 0.585
1.586SerMet: 1.586 ± 0.285
2.467SerAsn: 2.467 ± 0.355
2.937SerPro: 2.937 ± 0.482
2.291SerGln: 2.291 ± 0.462
2.585SerArg: 2.585 ± 0.382
3.583SerSer: 3.583 ± 0.512
2.409SerThr: 2.409 ± 0.363
4.23SerVal: 4.23 ± 0.537
0.822SerTrp: 0.822 ± 0.192
1.116SerTyr: 1.116 ± 0.241
0.0SerXaa: 0.0 ± 0.0
Thr
6.286ThrAla: 6.286 ± 0.733
0.822ThrCys: 0.822 ± 0.236
3.113ThrAsp: 3.113 ± 0.384
3.055ThrGlu: 3.055 ± 0.403
2.291ThrPhe: 2.291 ± 0.444
4.993ThrGly: 4.993 ± 0.543
0.705ThrHis: 0.705 ± 0.201
2.878ThrIle: 2.878 ± 0.446
2.585ThrLys: 2.585 ± 0.369
4.053ThrLeu: 4.053 ± 0.454
0.94ThrMet: 0.94 ± 0.235
2.232ThrAsn: 2.232 ± 0.364
3.818ThrPro: 3.818 ± 0.493
2.467ThrGln: 2.467 ± 0.336
2.937ThrArg: 2.937 ± 0.401
3.642ThrSer: 3.642 ± 0.555
3.583ThrThr: 3.583 ± 0.673
3.76ThrVal: 3.76 ± 0.382
0.999ThrTrp: 0.999 ± 0.263
1.41ThrTyr: 1.41 ± 0.284
0.0ThrXaa: 0.0 ± 0.0
Val
6.344ValAla: 6.344 ± 0.566
1.234ValCys: 1.234 ± 0.284
4.288ValAsp: 4.288 ± 0.586
4.347ValGlu: 4.347 ± 0.555
2.115ValPhe: 2.115 ± 0.368
4.7ValGly: 4.7 ± 0.499
0.999ValHis: 0.999 ± 0.317
3.466ValIle: 3.466 ± 0.421
3.466ValLys: 3.466 ± 0.381
4.935ValLeu: 4.935 ± 0.482
2.056ValMet: 2.056 ± 0.34
2.761ValAsn: 2.761 ± 0.357
2.878ValPro: 2.878 ± 0.395
1.88ValGln: 1.88 ± 0.34
3.701ValArg: 3.701 ± 0.474
3.877ValSer: 3.877 ± 0.335
4.406ValThr: 4.406 ± 0.648
4.7ValVal: 4.7 ± 0.513
0.822ValTrp: 0.822 ± 0.199
1.41ValTyr: 1.41 ± 0.291
0.0ValXaa: 0.0 ± 0.0
Trp
1.939TrpAla: 1.939 ± 0.321
0.176TrpCys: 0.176 ± 0.129
0.822TrpAsp: 0.822 ± 0.241
0.881TrpGlu: 0.881 ± 0.243
0.235TrpPhe: 0.235 ± 0.105
0.881TrpGly: 0.881 ± 0.26
0.352TrpHis: 0.352 ± 0.143
0.646TrpIle: 0.646 ± 0.176
1.116TrpLys: 1.116 ± 0.238
1.41TrpLeu: 1.41 ± 0.335
0.235TrpMet: 0.235 ± 0.118
0.646TrpAsn: 0.646 ± 0.237
0.822TrpPro: 0.822 ± 0.189
0.822TrpGln: 0.822 ± 0.231
1.762TrpArg: 1.762 ± 0.32
1.351TrpSer: 1.351 ± 0.382
0.822TrpThr: 0.822 ± 0.191
0.94TrpVal: 0.94 ± 0.226
0.47TrpTrp: 0.47 ± 0.185
0.411TrpTyr: 0.411 ± 0.142
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.642TyrAla: 3.642 ± 0.472
0.352TyrCys: 0.352 ± 0.141
1.704TyrAsp: 1.704 ± 0.46
2.115TyrGlu: 2.115 ± 0.343
0.881TyrPhe: 0.881 ± 0.201
2.115TyrGly: 2.115 ± 0.334
0.587TyrHis: 0.587 ± 0.175
0.999TyrIle: 0.999 ± 0.203
0.999TyrLys: 0.999 ± 0.249
2.585TyrLeu: 2.585 ± 0.358
0.529TyrMet: 0.529 ± 0.169
0.646TyrAsn: 0.646 ± 0.219
0.705TyrPro: 0.705 ± 0.225
1.527TyrGln: 1.527 ± 0.271
1.821TyrArg: 1.821 ± 0.335
1.645TyrSer: 1.645 ± 0.318
1.057TyrThr: 1.057 ± 0.292
1.057TyrVal: 1.057 ± 0.267
0.47TyrTrp: 0.47 ± 0.133
0.764TyrTyr: 0.764 ± 0.211
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 95 proteins (17024 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski