Amino acid dipepetide frequency for Pseudomonas phage Epa33

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.714AlaAla: 17.714 ± 1.51
0.787AlaCys: 0.787 ± 0.296
8.611AlaAsp: 8.611 ± 0.661
9.152AlaGlu: 9.152 ± 0.691
3.297AlaPhe: 3.297 ± 0.304
9.841AlaGly: 9.841 ± 0.966
2.165AlaHis: 2.165 ± 0.355
5.806AlaIle: 5.806 ± 0.513
6.003AlaLys: 6.003 ± 0.75
10.333AlaLeu: 10.333 ± 0.947
3.395AlaMet: 3.395 ± 0.383
3.297AlaAsn: 3.297 ± 0.465
7.282AlaPro: 7.282 ± 0.735
7.086AlaGln: 7.086 ± 0.777
8.66AlaArg: 8.66 ± 0.728
6.249AlaSer: 6.249 ± 0.63
5.757AlaThr: 5.757 ± 0.647
8.316AlaVal: 8.316 ± 0.7
2.017AlaTrp: 2.017 ± 0.342
2.805AlaTyr: 2.805 ± 0.474
0.0AlaXaa: 0.0 ± 0.0
Cys
0.64CysAla: 0.64 ± 0.246
0.246CysCys: 0.246 ± 0.13
0.738CysAsp: 0.738 ± 0.239
0.443CysGlu: 0.443 ± 0.146
0.098CysPhe: 0.098 ± 0.079
0.738CysGly: 0.738 ± 0.228
0.098CysHis: 0.098 ± 0.08
0.295CysIle: 0.295 ± 0.133
0.295CysLys: 0.295 ± 0.138
0.689CysLeu: 0.689 ± 0.232
0.049CysMet: 0.049 ± 0.047
0.295CysAsn: 0.295 ± 0.119
0.443CysPro: 0.443 ± 0.188
0.492CysGln: 0.492 ± 0.197
0.492CysArg: 0.492 ± 0.151
0.492CysSer: 0.492 ± 0.176
0.295CysThr: 0.295 ± 0.153
0.344CysVal: 0.344 ± 0.131
0.197CysTrp: 0.197 ± 0.116
0.394CysTyr: 0.394 ± 0.16
0.0CysXaa: 0.0 ± 0.0
Asp
8.906AspAla: 8.906 ± 0.795
0.295AspCys: 0.295 ± 0.119
4.084AspAsp: 4.084 ± 0.481
3.789AspGlu: 3.789 ± 0.489
1.821AspPhe: 1.821 ± 0.336
5.068AspGly: 5.068 ± 0.576
0.935AspHis: 0.935 ± 0.255
2.854AspIle: 2.854 ± 0.316
2.067AspLys: 2.067 ± 0.298
4.822AspLeu: 4.822 ± 0.419
2.017AspMet: 2.017 ± 0.368
1.476AspAsn: 1.476 ± 0.257
3.149AspPro: 3.149 ± 0.467
3.149AspGln: 3.149 ± 0.413
4.084AspArg: 4.084 ± 0.462
3.395AspSer: 3.395 ± 0.408
3.494AspThr: 3.494 ± 0.491
4.133AspVal: 4.133 ± 0.495
0.984AspTrp: 0.984 ± 0.197
1.673AspTyr: 1.673 ± 0.355
0.0AspXaa: 0.0 ± 0.0
Glu
7.627GluAla: 7.627 ± 0.893
0.59GluCys: 0.59 ± 0.252
2.805GluAsp: 2.805 ± 0.444
3.346GluGlu: 3.346 ± 0.506
1.673GluPhe: 1.673 ± 0.429
4.478GluGly: 4.478 ± 0.663
1.23GluHis: 1.23 ± 0.279
3.051GluIle: 3.051 ± 0.484
2.362GluLys: 2.362 ± 0.37
5.806GluLeu: 5.806 ± 0.602
1.624GluMet: 1.624 ± 0.309
1.771GluAsn: 1.771 ± 0.261
2.952GluPro: 2.952 ± 0.42
4.527GluGln: 4.527 ± 0.48
5.855GluArg: 5.855 ± 0.539
3.789GluSer: 3.789 ± 0.483
2.46GluThr: 2.46 ± 0.377
3.444GluVal: 3.444 ± 0.397
0.738GluTrp: 0.738 ± 0.243
1.575GluTyr: 1.575 ± 0.33
0.0GluXaa: 0.0 ± 0.0
Phe
3.346PheAla: 3.346 ± 0.42
0.148PheCys: 0.148 ± 0.088
2.017PheAsp: 2.017 ± 0.337
1.821PheGlu: 1.821 ± 0.427
0.492PhePhe: 0.492 ± 0.127
2.706PheGly: 2.706 ± 0.521
0.492PheHis: 0.492 ± 0.159
1.132PheIle: 1.132 ± 0.251
1.132PheLys: 1.132 ± 0.204
1.968PheLeu: 1.968 ± 0.334
0.394PheMet: 0.394 ± 0.124
0.935PheAsn: 0.935 ± 0.248
1.821PhePro: 1.821 ± 0.31
1.329PheGln: 1.329 ± 0.187
2.116PheArg: 2.116 ± 0.316
1.968PheSer: 1.968 ± 0.264
1.427PheThr: 1.427 ± 0.364
2.165PheVal: 2.165 ± 0.414
0.492PheTrp: 0.492 ± 0.22
0.935PheTyr: 0.935 ± 0.289
0.0PheXaa: 0.0 ± 0.0
Gly
8.119GlyAla: 8.119 ± 0.724
0.738GlyCys: 0.738 ± 0.239
4.33GlyAsp: 4.33 ± 0.532
5.019GlyGlu: 5.019 ± 0.586
3.248GlyPhe: 3.248 ± 0.563
6.397GlyGly: 6.397 ± 0.697
1.132GlyHis: 1.132 ± 0.269
4.084GlyIle: 4.084 ± 0.518
3.297GlyLys: 3.297 ± 0.429
6.249GlyLeu: 6.249 ± 0.528
2.263GlyMet: 2.263 ± 0.277
2.313GlyAsn: 2.313 ± 0.344
2.755GlyPro: 2.755 ± 0.338
4.576GlyGln: 4.576 ± 0.523
5.708GlyArg: 5.708 ± 0.601
4.281GlySer: 4.281 ± 0.435
4.822GlyThr: 4.822 ± 0.711
5.117GlyVal: 5.117 ± 0.522
1.329GlyTrp: 1.329 ± 0.243
2.165GlyTyr: 2.165 ± 0.286
0.0GlyXaa: 0.0 ± 0.0
His
2.46HisAla: 2.46 ± 0.336
0.148HisCys: 0.148 ± 0.085
1.033HisAsp: 1.033 ± 0.275
1.329HisGlu: 1.329 ± 0.244
0.689HisPhe: 0.689 ± 0.219
1.968HisGly: 1.968 ± 0.332
0.689HisHis: 0.689 ± 0.263
0.59HisIle: 0.59 ± 0.184
0.246HisLys: 0.246 ± 0.088
1.771HisLeu: 1.771 ± 0.493
0.738HisMet: 0.738 ± 0.208
0.64HisAsn: 0.64 ± 0.207
0.984HisPro: 0.984 ± 0.28
0.836HisGln: 0.836 ± 0.171
1.87HisArg: 1.87 ± 0.289
1.23HisSer: 1.23 ± 0.348
1.033HisThr: 1.033 ± 0.247
1.279HisVal: 1.279 ± 0.245
0.394HisTrp: 0.394 ± 0.169
0.59HisTyr: 0.59 ± 0.2
0.0HisXaa: 0.0 ± 0.0
Ile
5.265IleAla: 5.265 ± 0.513
0.59IleCys: 0.59 ± 0.177
3.297IleAsp: 3.297 ± 0.355
3.444IleGlu: 3.444 ± 0.46
0.738IlePhe: 0.738 ± 0.214
3.297IleGly: 3.297 ± 0.479
1.033IleHis: 1.033 ± 0.26
1.427IleIle: 1.427 ± 0.416
1.968IleLys: 1.968 ± 0.296
2.706IleLeu: 2.706 ± 0.4
0.59IleMet: 0.59 ± 0.152
1.968IleAsn: 1.968 ± 0.32
2.116IlePro: 2.116 ± 0.286
1.279IleGln: 1.279 ± 0.208
3.346IleArg: 3.346 ± 0.405
1.968IleSer: 1.968 ± 0.354
2.854IleThr: 2.854 ± 0.391
2.362IleVal: 2.362 ± 0.377
0.738IleTrp: 0.738 ± 0.236
1.279IleTyr: 1.279 ± 0.3
0.0IleXaa: 0.0 ± 0.0
Lys
5.068LysAla: 5.068 ± 0.785
0.344LysCys: 0.344 ± 0.15
2.608LysAsp: 2.608 ± 0.419
2.214LysGlu: 2.214 ± 0.341
1.083LysPhe: 1.083 ± 0.19
2.706LysGly: 2.706 ± 0.372
0.836LysHis: 0.836 ± 0.22
1.132LysIle: 1.132 ± 0.292
2.165LysLys: 2.165 ± 0.381
3.198LysLeu: 3.198 ± 0.362
0.984LysMet: 0.984 ± 0.242
0.541LysAsn: 0.541 ± 0.164
2.805LysPro: 2.805 ± 0.376
2.017LysGln: 2.017 ± 0.388
2.657LysArg: 2.657 ± 0.452
2.017LysSer: 2.017 ± 0.402
2.214LysThr: 2.214 ± 0.375
2.755LysVal: 2.755 ± 0.301
0.492LysTrp: 0.492 ± 0.157
1.181LysTyr: 1.181 ± 0.249
0.0LysXaa: 0.0 ± 0.0
Leu
11.612LeuAla: 11.612 ± 0.942
0.443LeuCys: 0.443 ± 0.192
5.462LeuAsp: 5.462 ± 0.528
4.773LeuGlu: 4.773 ± 0.429
2.509LeuPhe: 2.509 ± 0.281
5.216LeuGly: 5.216 ± 0.61
2.017LeuHis: 2.017 ± 0.321
2.805LeuIle: 2.805 ± 0.459
3.051LeuLys: 3.051 ± 0.359
6.397LeuLeu: 6.397 ± 0.714
1.87LeuMet: 1.87 ± 0.321
3.198LeuAsn: 3.198 ± 0.372
4.281LeuPro: 4.281 ± 0.561
3.297LeuGln: 3.297 ± 0.366
6.446LeuArg: 6.446 ± 0.61
4.773LeuSer: 4.773 ± 0.539
4.527LeuThr: 4.527 ± 0.552
5.905LeuVal: 5.905 ± 0.717
1.033LeuTrp: 1.033 ± 0.229
1.624LeuTyr: 1.624 ± 0.32
0.0LeuXaa: 0.0 ± 0.0
Met
2.952MetAla: 2.952 ± 0.594
0.148MetCys: 0.148 ± 0.093
1.132MetAsp: 1.132 ± 0.244
0.64MetGlu: 0.64 ± 0.186
0.246MetPhe: 0.246 ± 0.115
2.067MetGly: 2.067 ± 0.34
0.541MetHis: 0.541 ± 0.146
0.984MetIle: 0.984 ± 0.179
1.23MetLys: 1.23 ± 0.223
2.263MetLeu: 2.263 ± 0.324
0.344MetMet: 0.344 ± 0.146
1.033MetAsn: 1.033 ± 0.205
2.017MetPro: 2.017 ± 0.267
1.279MetGln: 1.279 ± 0.246
1.624MetArg: 1.624 ± 0.37
1.673MetSer: 1.673 ± 0.305
1.624MetThr: 1.624 ± 0.228
1.132MetVal: 1.132 ± 0.233
0.295MetTrp: 0.295 ± 0.112
0.492MetTyr: 0.492 ± 0.156
0.0MetXaa: 0.0 ± 0.0
Asn
3.789AsnAla: 3.789 ± 0.53
0.344AsnCys: 0.344 ± 0.19
1.722AsnAsp: 1.722 ± 0.267
1.279AsnGlu: 1.279 ± 0.232
0.836AsnPhe: 0.836 ± 0.185
2.509AsnGly: 2.509 ± 0.313
0.541AsnHis: 0.541 ± 0.216
1.427AsnIle: 1.427 ± 0.34
1.083AsnLys: 1.083 ± 0.207
2.509AsnLeu: 2.509 ± 0.415
1.279AsnMet: 1.279 ± 0.28
0.689AsnAsn: 0.689 ± 0.22
1.968AsnPro: 1.968 ± 0.344
1.575AsnGln: 1.575 ± 0.323
2.067AsnArg: 2.067 ± 0.272
2.165AsnSer: 2.165 ± 0.309
1.87AsnThr: 1.87 ± 0.284
1.476AsnVal: 1.476 ± 0.265
0.64AsnTrp: 0.64 ± 0.2
0.738AsnTyr: 0.738 ± 0.222
0.0AsnXaa: 0.0 ± 0.0
Pro
8.316ProAla: 8.316 ± 0.669
0.394ProCys: 0.394 ± 0.171
4.133ProAsp: 4.133 ± 0.558
3.936ProGlu: 3.936 ± 0.401
1.279ProPhe: 1.279 ± 0.264
4.084ProGly: 4.084 ± 0.712
0.738ProHis: 0.738 ± 0.216
2.165ProIle: 2.165 ± 0.503
1.87ProLys: 1.87 ± 0.356
3.395ProLeu: 3.395 ± 0.57
1.427ProMet: 1.427 ± 0.253
1.575ProAsn: 1.575 ± 0.327
3.838ProPro: 3.838 ± 1.092
2.313ProGln: 2.313 ± 0.45
2.559ProArg: 2.559 ± 0.387
3.051ProSer: 3.051 ± 0.408
3.494ProThr: 3.494 ± 0.401
3.789ProVal: 3.789 ± 0.43
0.984ProTrp: 0.984 ± 0.191
1.673ProTyr: 1.673 ± 0.343
0.0ProXaa: 0.0 ± 0.0
Gln
8.07GlnAla: 8.07 ± 0.69
0.197GlnCys: 0.197 ± 0.111
2.263GlnAsp: 2.263 ± 0.335
3.051GlnGlu: 3.051 ± 0.445
1.821GlnPhe: 1.821 ± 0.246
3.887GlnGly: 3.887 ± 0.501
1.722GlnHis: 1.722 ± 0.317
2.706GlnIle: 2.706 ± 0.521
1.083GlnLys: 1.083 ± 0.245
3.198GlnLeu: 3.198 ± 0.413
1.033GlnMet: 1.033 ± 0.278
1.23GlnAsn: 1.23 ± 0.347
3.002GlnPro: 3.002 ± 0.387
4.379GlnGln: 4.379 ± 0.64
3.198GlnArg: 3.198 ± 0.454
2.165GlnSer: 2.165 ± 0.348
1.821GlnThr: 1.821 ± 0.291
3.444GlnVal: 3.444 ± 0.369
0.984GlnTrp: 0.984 ± 0.26
1.722GlnTyr: 1.722 ± 0.286
0.0GlnXaa: 0.0 ± 0.0
Arg
8.07ArgAla: 8.07 ± 0.616
0.689ArgCys: 0.689 ± 0.202
4.232ArgAsp: 4.232 ± 0.651
4.527ArgGlu: 4.527 ± 0.652
3.002ArgPhe: 3.002 ± 0.427
4.822ArgGly: 4.822 ± 0.64
1.525ArgHis: 1.525 ± 0.344
3.395ArgIle: 3.395 ± 0.43
3.248ArgLys: 3.248 ± 0.385
7.135ArgLeu: 7.135 ± 0.639
1.525ArgMet: 1.525 ± 0.205
2.362ArgAsn: 2.362 ± 0.357
3.444ArgPro: 3.444 ± 0.525
4.182ArgGln: 4.182 ± 0.565
5.855ArgArg: 5.855 ± 0.639
3.543ArgSer: 3.543 ± 0.672
3.346ArgThr: 3.346 ± 0.442
5.265ArgVal: 5.265 ± 0.412
1.23ArgTrp: 1.23 ± 0.29
1.771ArgTyr: 1.771 ± 0.385
0.0ArgXaa: 0.0 ± 0.0
Ser
6.84SerAla: 6.84 ± 0.63
0.246SerCys: 0.246 ± 0.112
3.444SerAsp: 3.444 ± 0.575
3.494SerGlu: 3.494 ± 0.487
1.378SerPhe: 1.378 ± 0.243
5.265SerGly: 5.265 ± 0.741
1.181SerHis: 1.181 ± 0.325
1.821SerIle: 1.821 ± 0.285
2.116SerLys: 2.116 ± 0.301
4.921SerLeu: 4.921 ± 0.481
1.181SerMet: 1.181 ± 0.217
1.821SerAsn: 1.821 ± 0.279
3.051SerPro: 3.051 ± 0.522
2.214SerGln: 2.214 ± 0.32
3.297SerArg: 3.297 ± 0.366
3.002SerSer: 3.002 ± 0.411
3.248SerThr: 3.248 ± 0.459
3.641SerVal: 3.641 ± 0.34
0.886SerTrp: 0.886 ± 0.181
1.624SerTyr: 1.624 ± 0.358
0.0SerXaa: 0.0 ± 0.0
Thr
6.987ThrAla: 6.987 ± 0.735
0.246ThrCys: 0.246 ± 0.125
3.149ThrAsp: 3.149 ± 0.376
2.608ThrGlu: 2.608 ± 0.372
2.165ThrPhe: 2.165 ± 0.321
5.068ThrGly: 5.068 ± 0.519
0.886ThrHis: 0.886 ± 0.247
2.165ThrIle: 2.165 ± 0.344
1.722ThrLys: 1.722 ± 0.257
4.084ThrLeu: 4.084 ± 0.398
0.984ThrMet: 0.984 ± 0.205
1.624ThrAsn: 1.624 ± 0.281
3.641ThrPro: 3.641 ± 0.341
1.968ThrGln: 1.968 ± 0.283
3.592ThrArg: 3.592 ± 0.486
2.952ThrSer: 2.952 ± 0.416
3.002ThrThr: 3.002 ± 0.557
4.576ThrVal: 4.576 ± 0.605
0.886ThrTrp: 0.886 ± 0.178
1.427ThrTyr: 1.427 ± 0.239
0.0ThrXaa: 0.0 ± 0.0
Val
8.414ValAla: 8.414 ± 0.608
0.689ValCys: 0.689 ± 0.25
4.675ValAsp: 4.675 ± 0.507
4.724ValGlu: 4.724 ± 0.54
1.427ValPhe: 1.427 ± 0.374
4.576ValGly: 4.576 ± 0.414
1.771ValHis: 1.771 ± 0.341
3.051ValIle: 3.051 ± 0.439
2.559ValLys: 2.559 ± 0.364
5.757ValLeu: 5.757 ± 0.55
1.329ValMet: 1.329 ± 0.234
1.771ValAsn: 1.771 ± 0.262
3.543ValPro: 3.543 ± 0.593
3.051ValGln: 3.051 ± 0.498
5.462ValArg: 5.462 ± 0.649
3.641ValSer: 3.641 ± 0.473
3.936ValThr: 3.936 ± 0.589
4.576ValVal: 4.576 ± 0.539
0.935ValTrp: 0.935 ± 0.287
0.984ValTyr: 0.984 ± 0.273
0.0ValXaa: 0.0 ± 0.0
Trp
1.821TrpAla: 1.821 ± 0.294
0.197TrpCys: 0.197 ± 0.111
0.836TrpAsp: 0.836 ± 0.206
0.836TrpGlu: 0.836 ± 0.191
0.148TrpPhe: 0.148 ± 0.097
1.181TrpGly: 1.181 ± 0.309
0.541TrpHis: 0.541 ± 0.118
0.246TrpIle: 0.246 ± 0.127
0.689TrpLys: 0.689 ± 0.18
2.165TrpLeu: 2.165 ± 0.415
0.344TrpMet: 0.344 ± 0.124
0.59TrpAsn: 0.59 ± 0.146
0.64TrpPro: 0.64 ± 0.171
0.59TrpGln: 0.59 ± 0.164
1.378TrpArg: 1.378 ± 0.242
0.541TrpSer: 0.541 ± 0.2
0.984TrpThr: 0.984 ± 0.232
1.427TrpVal: 1.427 ± 0.399
0.246TrpTrp: 0.246 ± 0.103
0.443TrpTyr: 0.443 ± 0.166
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.559TyrAla: 2.559 ± 0.42
0.295TyrCys: 0.295 ± 0.149
1.624TyrAsp: 1.624 ± 0.225
1.279TyrGlu: 1.279 ± 0.326
0.886TyrPhe: 0.886 ± 0.24
2.067TyrGly: 2.067 ± 0.349
0.443TyrHis: 0.443 ± 0.16
1.132TyrIle: 1.132 ± 0.214
0.836TyrLys: 0.836 ± 0.213
1.919TyrLeu: 1.919 ± 0.293
0.295TyrMet: 0.295 ± 0.167
1.378TyrAsn: 1.378 ± 0.247
1.279TyrPro: 1.279 ± 0.243
0.836TyrGln: 0.836 ± 0.202
2.903TyrArg: 2.903 ± 0.428
1.771TyrSer: 1.771 ± 0.402
1.525TyrThr: 1.525 ± 0.27
1.722TyrVal: 1.722 ± 0.377
0.344TyrTrp: 0.344 ± 0.171
0.344TyrTyr: 0.344 ± 0.11
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (20324 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski