Amino acid dipeptide frequency for Planktothrix phage PaV-LD

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins (with equal probability), each of the 400 would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
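The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build the table: the function names (`to_three`, `dipeptide_per_mille`, `classify`) are hypothetical, pairs are assumed to be counted as overlapping windows within each protein (never across protein boundaries), and any letter outside the 20 standard one-letter codes is assumed to map to Xaa.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}

# Uniform expectation over the 400 standard dipeptides: 0.25 % = 2.5 per mille.
UNIFORM_PER_MILLE = 1000 / 400


def to_three(residue):
    """Map a one-letter code to three letters; anything else becomes Xaa (assumption)."""
    return ONE_TO_THREE.get(residue, "Xaa")


def dipeptide_per_mille(proteins):
    """Count overlapping adjacent pairs within each protein and return per mille values."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[to_three(first) + to_three(second)] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found (all sequences shorter than 2 residues)")
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    # Always return all 441 pairs so that absent dipeptides show up as 0.0.
    return {a + b: 1000 * counts[a + b] / total for a, b in product(names, repeat=2)}


def classify(value_per_mille):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if value_per_mille > UNIFORM_PER_MILLE:
        return "over"   # marked red in the table
    if value_per_mille < UNIFORM_PER_MILLE:
        return "under"  # marked blue in the table
    return "expected"
```

For example, `classify(4.575)` (AlaAla) gives `"over"` and `classify(0.598)` (AlaCys) gives `"under"`.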

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
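As a small worked example of the unit conversion, the following sketch turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and only serve the illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 % = 10 per mille)."""
    return value / 10


# AlaAla is listed as 4.575 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(4.575)  # about 0.004575
ala_ala_percent = per_mille_to_percent(4.575)    # about 0.4575 %
```

On this scale, the uniform expectation of 0.25% corresponds to 2.5‰.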

For more information, see the sequence space article on Wikipedia.

Second residue, in order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa (each block below is headed by the first residue; values are ‰ ± error)
Ala
AlaAla: 4.575 ± 0.395
AlaCys: 0.598 ± 0.131
AlaAsp: 3.097 ± 0.29
AlaGlu: 4.715 ± 0.406
AlaPhe: 2.428 ± 0.346
AlaGly: 4.399 ± 0.449
AlaHis: 0.95 ± 0.171
AlaIle: 6.228 ± 0.477
AlaLys: 5.243 ± 0.473
AlaLeu: 6.651 ± 0.742
AlaMet: 1.724 ± 0.267
AlaAsn: 3.097 ± 0.291
AlaPro: 2.252 ± 0.289
AlaGln: 3.061 ± 0.42
AlaArg: 2.815 ± 0.35
AlaSer: 4.715 ± 0.461
AlaThr: 4.293 ± 0.376
AlaVal: 4.434 ± 0.378
AlaTrp: 1.302 ± 0.208
AlaTyr: 1.724 ± 0.225
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.317 ± 0.117
CysCys: 0.141 ± 0.063
CysAsp: 0.774 ± 0.232
CysGlu: 0.739 ± 0.145
CysPhe: 0.493 ± 0.144
CysGly: 0.985 ± 0.183
CysHis: 0.246 ± 0.095
CysIle: 0.422 ± 0.114
CysLys: 0.633 ± 0.136
CysLeu: 0.739 ± 0.172
CysMet: 0.106 ± 0.063
CysAsn: 0.317 ± 0.109
CysPro: 0.422 ± 0.118
CysGln: 0.282 ± 0.093
CysArg: 0.704 ± 0.177
CysSer: 0.669 ± 0.171
CysThr: 0.317 ± 0.094
CysVal: 0.739 ± 0.19
CysTrp: 0.07 ± 0.044
CysTyr: 0.563 ± 0.139
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.117 ± 0.354
AspCys: 1.02 ± 0.222
AspAsp: 3.097 ± 0.321
AspGlu: 3.167 ± 0.405
AspPhe: 2.85 ± 0.324
AspGly: 3.836 ± 0.471
AspHis: 0.598 ± 0.159
AspIle: 4.047 ± 0.381
AspLys: 4.187 ± 0.444
AspLeu: 4.715 ± 0.526
AspMet: 0.985 ± 0.169
AspAsn: 2.815 ± 0.38
AspPro: 3.026 ± 0.31
AspGln: 2.182 ± 0.307
AspArg: 2.991 ± 0.291
AspSer: 4.469 ± 0.401
AspThr: 2.076 ± 0.338
AspVal: 3.132 ± 0.314
AspTrp: 1.302 ± 0.24
AspTyr: 2.604 ± 0.357
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.997 ± 0.471
GluCys: 0.457 ± 0.141
GluAsp: 4.434 ± 0.495
GluGlu: 5.138 ± 0.517
GluPhe: 2.921 ± 0.308
GluGly: 3.836 ± 0.385
GluHis: 0.915 ± 0.189
GluIle: 4.751 ± 0.356
GluLys: 5.701 ± 0.534
GluLeu: 7.284 ± 0.623
GluMet: 2.252 ± 0.284
GluAsn: 3.237 ± 0.37
GluPro: 2.639 ± 0.351
GluGln: 3.73 ± 0.418
GluArg: 3.202 ± 0.325
GluSer: 4.575 ± 0.355
GluThr: 4.117 ± 0.363
GluVal: 4.399 ± 0.413
GluTrp: 1.443 ± 0.255
GluTyr: 2.498 ± 0.309
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.498 ± 0.24
PheCys: 0.528 ± 0.135
PheAsp: 2.428 ± 0.261
PheGlu: 2.639 ± 0.323
PhePhe: 1.196 ± 0.216
PheGly: 2.991 ± 0.314
PheHis: 0.176 ± 0.083
PheIle: 1.971 ± 0.257
PheLys: 2.358 ± 0.306
PheLeu: 3.202 ± 0.373
PheMet: 0.457 ± 0.117
PheAsn: 2.041 ± 0.24
PhePro: 1.759 ± 0.271
PheGln: 1.443 ± 0.216
PheArg: 1.971 ± 0.253
PheSer: 3.132 ± 0.296
PheThr: 2.463 ± 0.291
PheVal: 2.287 ± 0.303
PheTrp: 0.563 ± 0.128
PheTyr: 0.985 ± 0.198
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.836 ± 0.402
GlyCys: 0.774 ± 0.184
GlyAsp: 4.223 ± 0.347
GlyGlu: 4.962 ± 0.413
GlyPhe: 2.921 ± 0.282
GlyGly: 5.63 ± 0.539
GlyHis: 0.739 ± 0.151
GlyIle: 4.786 ± 0.416
GlyLys: 5.419 ± 0.416
GlyLeu: 5.489 ± 0.459
GlyMet: 2.041 ± 0.274
GlyAsn: 3.343 ± 0.36
GlyPro: 0.387 ± 0.099
GlyGln: 2.569 ± 0.319
GlyArg: 3.343 ± 0.346
GlySer: 3.941 ± 0.541
GlyThr: 3.413 ± 0.45
GlyVal: 4.61 ± 0.449
GlyTrp: 1.126 ± 0.213
GlyTyr: 2.885 ± 0.313
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.739 ± 0.166
HisCys: 0.106 ± 0.059
HisAsp: 0.669 ± 0.175
HisGlu: 1.091 ± 0.205
HisPhe: 0.774 ± 0.226
HisGly: 0.845 ± 0.161
HisHis: 0.246 ± 0.1
HisIle: 0.704 ± 0.194
HisLys: 0.845 ± 0.189
HisLeu: 1.196 ± 0.202
HisMet: 0.106 ± 0.056
HisAsn: 0.633 ± 0.153
HisPro: 1.232 ± 0.209
HisGln: 0.669 ± 0.157
HisArg: 1.126 ± 0.198
HisSer: 0.774 ± 0.173
HisThr: 0.457 ± 0.108
HisVal: 0.598 ± 0.148
HisTrp: 0.352 ± 0.132
HisTyr: 0.422 ± 0.128
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.63 ± 0.429
IleCys: 0.669 ± 0.201
IleAsp: 4.082 ± 0.333
IleGlu: 5.877 ± 0.463
IlePhe: 2.217 ± 0.276
IleGly: 3.554 ± 0.413
IleHis: 0.809 ± 0.138
IleIle: 4.223 ± 0.426
IleLys: 5.982 ± 0.468
IleLeu: 4.575 ± 0.426
IleMet: 1.126 ± 0.226
IleAsn: 4.117 ± 0.412
IlePro: 3.237 ± 0.368
IleGln: 2.78 ± 0.373
IleArg: 3.097 ± 0.297
IleSer: 4.891 ± 0.458
IleThr: 3.765 ± 0.407
IleVal: 3.765 ± 0.364
IleTrp: 0.809 ± 0.189
IleTyr: 1.795 ± 0.257
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.44 ± 0.652
LysCys: 0.563 ± 0.132
LysAsp: 3.976 ± 0.447
LysGlu: 5.173 ± 0.467
LysPhe: 2.252 ± 0.259
LysGly: 4.117 ± 0.387
LysHis: 1.056 ± 0.204
LysIle: 4.926 ± 0.464
LysLys: 6.651 ± 0.532
LysLeu: 6.616 ± 0.543
LysMet: 1.548 ± 0.254
LysAsn: 3.413 ± 0.361
LysPro: 4.469 ± 0.552
LysGln: 3.273 ± 0.421
LysArg: 3.484 ± 0.409
LysSer: 5.841 ± 0.563
LysThr: 5.138 ± 0.545
LysVal: 3.871 ± 0.4
LysTrp: 1.126 ± 0.191
LysTyr: 2.287 ± 0.288
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.123 ± 0.592
LeuCys: 0.493 ± 0.146
LeuAsp: 5.278 ± 0.421
LeuGlu: 6.791 ± 0.613
LeuPhe: 2.71 ± 0.315
LeuGly: 5.63 ± 0.637
LeuHis: 1.126 ± 0.182
LeuIle: 6.44 ± 0.532
LeuLys: 6.334 ± 0.543
LeuLeu: 5.665 ± 0.451
LeuMet: 1.795 ± 0.237
LeuAsn: 5.032 ± 0.399
LeuPro: 3.695 ± 0.367
LeuGln: 3.167 ± 0.511
LeuArg: 3.695 ± 0.456
LeuSer: 6.897 ± 0.591
LeuThr: 6.123 ± 0.513
LeuVal: 4.962 ± 0.449
LeuTrp: 0.845 ± 0.17
LeuTyr: 1.795 ± 0.249
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.759 ± 0.247
MetCys: 0.176 ± 0.079
MetAsp: 1.337 ± 0.171
MetGlu: 1.654 ± 0.26
MetPhe: 0.528 ± 0.146
MetGly: 1.654 ± 0.234
MetHis: 0.282 ± 0.093
MetIle: 1.302 ± 0.226
MetLys: 1.724 ± 0.243
MetLeu: 1.302 ± 0.195
MetMet: 0.493 ± 0.17
MetAsn: 1.02 ± 0.206
MetPro: 0.88 ± 0.202
MetGln: 0.774 ± 0.141
MetArg: 0.809 ± 0.145
MetSer: 1.302 ± 0.217
MetThr: 1.548 ± 0.235
MetVal: 1.513 ± 0.216
MetTrp: 0.176 ± 0.072
MetTyr: 0.176 ± 0.083
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.991 ± 0.321
AsnCys: 0.739 ± 0.169
AsnAsp: 2.498 ± 0.317
AsnGlu: 2.287 ± 0.294
AsnPhe: 1.619 ± 0.243
AsnGly: 3.308 ± 0.334
AsnHis: 0.915 ± 0.214
AsnIle: 2.991 ± 0.326
AsnLys: 3.589 ± 0.315
AsnLeu: 4.856 ± 0.433
AsnMet: 0.598 ± 0.138
AsnAsn: 2.885 ± 0.322
AsnPro: 2.885 ± 0.395
AsnGln: 3.237 ± 0.323
AsnArg: 2.498 ± 0.296
AsnSer: 3.765 ± 0.369
AsnThr: 2.498 ± 0.297
AsnVal: 2.358 ± 0.283
AsnTrp: 0.95 ± 0.212
AsnTyr: 2.076 ± 0.305
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.322 ± 0.316
ProCys: 0.387 ± 0.137
ProAsp: 3.484 ± 0.409
ProGlu: 3.906 ± 0.484
ProPhe: 1.584 ± 0.212
ProGly: 1.971 ± 0.255
ProHis: 0.598 ± 0.132
ProIle: 2.322 ± 0.227
ProLys: 3.8 ± 0.51
ProLeu: 2.885 ± 0.287
ProMet: 0.528 ± 0.139
ProAsn: 2.182 ± 0.296
ProPro: 2.393 ± 0.378
ProGln: 2.252 ± 0.26
ProArg: 1.584 ± 0.258
ProSer: 3.519 ± 0.349
ProThr: 3.554 ± 0.335
ProVal: 3.554 ± 0.414
ProTrp: 0.457 ± 0.126
ProTyr: 1.372 ± 0.257
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.097 ± 0.312
GlnCys: 0.387 ± 0.107
GlnAsp: 1.83 ± 0.258
GlnGlu: 3.941 ± 0.502
GlnPhe: 1.478 ± 0.257
GlnGly: 2.885 ± 0.314
GlnHis: 0.528 ± 0.123
GlnIle: 3.273 ± 0.336
GlnLys: 4.012 ± 0.394
GlnLeu: 4.258 ± 0.452
GlnMet: 1.091 ± 0.184
GlnAsn: 1.654 ± 0.202
GlnPro: 1.513 ± 0.229
GlnGln: 2.358 ± 0.408
GlnArg: 1.9 ± 0.251
GlnSer: 2.815 ± 0.495
GlnThr: 3.026 ± 0.327
GlnVal: 2.463 ± 0.321
GlnTrp: 0.563 ± 0.143
GlnTyr: 1.337 ± 0.193
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.674 ± 0.316
ArgCys: 0.211 ± 0.086
ArgAsp: 2.111 ± 0.341
ArgGlu: 3.906 ± 0.348
ArgPhe: 2.147 ± 0.32
ArgGly: 2.534 ± 0.287
ArgHis: 1.091 ± 0.184
ArgIle: 3.66 ± 0.333
ArgLys: 3.343 ± 0.385
ArgLeu: 4.258 ± 0.44
ArgMet: 1.267 ± 0.21
ArgAsn: 2.322 ± 0.253
ArgPro: 1.9 ± 0.278
ArgGln: 2.076 ± 0.369
ArgArg: 2.393 ± 0.279
ArgSer: 3.202 ± 0.357
ArgThr: 2.358 ± 0.217
ArgVal: 3.308 ± 0.353
ArgTrp: 0.845 ± 0.23
ArgTyr: 1.971 ± 0.273
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.399 ± 0.479
SerCys: 0.845 ± 0.211
SerAsp: 4.469 ± 0.34
SerGlu: 4.645 ± 0.423
SerPhe: 2.815 ± 0.332
SerGly: 6.44 ± 0.635
SerHis: 0.915 ± 0.195
SerIle: 4.68 ± 0.403
SerLys: 4.434 ± 0.474
SerLeu: 6.158 ± 0.439
SerMet: 1.232 ± 0.185
SerAsn: 2.991 ± 0.327
SerPro: 3.554 ± 0.313
SerGln: 3.237 ± 0.39
SerArg: 3.66 ± 0.409
SerSer: 5.314 ± 0.421
SerThr: 4.047 ± 0.442
SerVal: 4.539 ± 0.408
SerTrp: 1.056 ± 0.182
SerTyr: 2.604 ± 0.304
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.082 ± 0.436
ThrCys: 0.387 ± 0.12
ThrAsp: 3.449 ± 0.343
ThrGlu: 3.554 ± 0.32
ThrPhe: 1.619 ± 0.235
ThrGly: 4.504 ± 0.467
ThrHis: 0.774 ± 0.151
ThrIle: 4.434 ± 0.395
ThrLys: 4.012 ± 0.379
ThrLeu: 5.771 ± 0.528
ThrMet: 0.845 ± 0.172
ThrAsn: 2.674 ± 0.307
ThrPro: 4.152 ± 0.468
ThrGln: 2.287 ± 0.235
ThrArg: 2.78 ± 0.339
ThrSer: 3.519 ± 0.379
ThrThr: 3.66 ± 0.436
ThrVal: 4.187 ± 0.357
ThrTrp: 0.774 ± 0.156
ThrTyr: 1.935 ± 0.364
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.61 ± 0.386
ValCys: 0.528 ± 0.151
ValAsp: 3.624 ± 0.358
ValGlu: 5.314 ± 0.415
ValPhe: 2.393 ± 0.282
ValGly: 4.223 ± 0.44
ValHis: 1.02 ± 0.182
ValIle: 3.554 ± 0.333
ValLys: 4.68 ± 0.588
ValLeu: 4.223 ± 0.438
ValMet: 1.337 ± 0.185
ValAsn: 3.308 ± 0.33
ValPro: 2.463 ± 0.355
ValGln: 2.287 ± 0.333
ValArg: 2.569 ± 0.29
ValSer: 5.138 ± 0.44
ValThr: 3.976 ± 0.524
ValVal: 3.765 ± 0.31
ValTrp: 0.809 ± 0.175
ValTyr: 1.513 ± 0.238
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.267 ± 0.209
TrpCys: 0.211 ± 0.104
TrpAsp: 0.845 ± 0.168
TrpGlu: 1.02 ± 0.179
TrpPhe: 0.704 ± 0.16
TrpGly: 1.196 ± 0.218
TrpHis: 0.211 ± 0.067
TrpIle: 0.704 ± 0.156
TrpLys: 0.985 ± 0.187
TrpLeu: 1.9 ± 0.262
TrpMet: 0.317 ± 0.107
TrpAsn: 0.598 ± 0.164
TrpPro: 0.106 ± 0.06
TrpGln: 0.845 ± 0.151
TrpArg: 0.915 ± 0.19
TrpSer: 0.845 ± 0.161
TrpThr: 1.056 ± 0.209
TrpVal: 1.091 ± 0.177
TrpTrp: 0.387 ± 0.104
TrpTyr: 0.317 ± 0.101
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.83 ± 0.25
TyrCys: 0.457 ± 0.124
TyrAsp: 1.513 ± 0.213
TyrGlu: 2.006 ± 0.3
TyrPhe: 1.443 ± 0.23
TyrGly: 1.9 ± 0.227
TyrHis: 0.387 ± 0.142
TyrIle: 1.584 ± 0.26
TyrLys: 2.322 ± 0.264
TyrLeu: 2.85 ± 0.36
TyrMet: 0.528 ± 0.143
TyrAsn: 1.865 ± 0.213
TyrPro: 1.689 ± 0.282
TyrGln: 1.9 ± 0.262
TyrArg: 2.041 ± 0.258
TyrSer: 2.71 ± 0.356
TyrThr: 1.513 ± 0.23
TyrVal: 1.759 ± 0.287
TyrTrp: 0.563 ± 0.152
TyrTyr: 1.619 ± 0.279
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 142 proteins (28419 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
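A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration only: the function names are hypothetical, the pair is given in one-letter codes (e.g. "AA" for AlaAla), and taking the standard deviation across the 100 resampled estimates as the reported error is an assumption, since the page does not state which summary statistic is used.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one one-letter pair (e.g. "AA") among all adjacent pairs, in per mille."""
    hits = 0
    total = 0
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            total += 1
            if first + second == pair:
                hits += 1
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement and measure the spread of the estimate.

    Assumption: the error is the standard deviation over the replicates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Each replicate draws as many proteins as the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the variation between proteins is what drives the error estimate.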

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski