Amino acid dipeptide frequency for Arthrobacter phage Powerpuff

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs in the proteins.
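As a rough illustration of the idea, the sketch below counts adjacent residue pairs with a sliding window of width two and reports them in per mille. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, one-letter codes are used instead of the three-letter codes of the table, and the normalization (division by the total number of counted pairs) is an assumption, since the page does not state which denominator it uses.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along each protein.
        # Pairs never span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of counted pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences), not the phage proteome.
toy = ["MAAL", "AAK"]
freqs = dipeptide_frequencies(toy)
```

The order of the two residues matters here: `"AL"` and `"LA"` are counted separately, which matches the table, where AlaLeu and LeuAla have different values.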

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely to occur in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
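The expected value under this uniform assumption, and a simple over/under comparison against it, can be sketched as follows. The `classify` helper is hypothetical: the page does not say exactly how the red and blue marking is decided (for example, whether the error margin is taken into account), so a plain comparison with the expected value is assumed here. The three example values are taken from the table.

```python
N_STANDARD = 20

# Expected frequency in per mille if every dipeptide were equally likely.
expected_permille = 1000.0 / N_STANDARD ** 2          # 400 dipeptides
expected_permille_with_xaa = 1000.0 / (N_STANDARD + 1) ** 2  # 441 dipeptides


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
examples = {"AlaAla": 20.18, "HisAla": 2.374, "CysCys": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
```

With 441 dipeptides instead of 400, the uniform expectation drops slightly, to about 2.27‰.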

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
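A small worked conversion, using the AlaAla value from the table. The last step, turning the fraction into an approximate count, rests on an assumption: the page does not state the denominator, and the 13480 amino acids from the statistics line are used here only because the smallest non-zero table values (0.074, 0.148, 0.223) are consistent with 1, 2 and 3 occurrences out of 13480.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


ala_ala = 20.18  # AlaAla, per mille, from the table

fraction = permille_to_fraction(ala_ala)
percent = permille_to_percent(ala_ala)

# Assumption: the denominator is the total number of amino acids
# reported for this proteome; the page does not confirm this.
ASSUMED_TOTAL = 13480
approx_count = round(fraction * ASSUMED_TOTAL)
```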

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 20.18 ± 2.921
AlaCys: 0.223 ± 0.115
AlaAsp: 9.496 ± 1.17
AlaGlu: 9.125 ± 0.886
AlaPhe: 4.08 ± 0.576
AlaGly: 9.645 ± 0.994
AlaHis: 2.597 ± 0.454
AlaIle: 5.713 ± 0.768
AlaLys: 7.048 ± 0.752
AlaLeu: 13.206 ± 1.07
AlaMet: 2.893 ± 0.375
AlaAsn: 2.522 ± 0.582
AlaPro: 6.9 ± 0.794
AlaGln: 3.709 ± 0.743
AlaArg: 9.941 ± 1.124
AlaSer: 7.419 ± 0.701
AlaThr: 6.529 ± 0.647
AlaVal: 9.645 ± 1.005
AlaTrp: 1.781 ± 0.322
AlaTyr: 3.487 ± 0.52
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.297 ± 0.126
CysCys: 0.0 ± 0.0
CysAsp: 0.371 ± 0.157
CysGlu: 0.445 ± 0.165
CysPhe: 0.223 ± 0.116
CysGly: 0.594 ± 0.193
CysHis: 0.074 ± 0.079
CysIle: 0.223 ± 0.139
CysLys: 0.223 ± 0.121
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.074 ± 0.057
CysPro: 0.519 ± 0.227
CysGln: 0.371 ± 0.164
CysArg: 0.594 ± 0.191
CysSer: 0.148 ± 0.098
CysThr: 0.223 ± 0.123
CysVal: 0.074 ± 0.076
CysTrp: 0.148 ± 0.109
CysTyr: 0.297 ± 0.147
CysXaa: 0.0 ± 0.0
Asp
AspAla: 11.054 ± 1.006
AspCys: 0.297 ± 0.139
AspAsp: 4.006 ± 0.547
AspGlu: 4.006 ± 0.629
AspPhe: 2.374 ± 0.367
AspGly: 5.564 ± 0.721
AspHis: 1.261 ± 0.352
AspIle: 1.706 ± 0.346
AspLys: 2.374 ± 0.506
AspLeu: 6.751 ± 0.756
AspMet: 0.594 ± 0.169
AspAsn: 1.632 ± 0.391
AspPro: 3.932 ± 0.607
AspGln: 1.261 ± 0.251
AspArg: 2.968 ± 0.535
AspSer: 3.19 ± 0.467
AspThr: 3.858 ± 0.422
AspVal: 4.748 ± 0.496
AspTrp: 1.113 ± 0.28
AspTyr: 1.558 ± 0.373
AspXaa: 0.0 ± 0.0
Glu
GluAla: 10.238 ± 1.107
GluCys: 0.148 ± 0.101
GluAsp: 4.006 ± 0.537
GluGlu: 5.267 ± 0.686
GluPhe: 1.558 ± 0.296
GluGly: 4.229 ± 0.452
GluHis: 1.558 ± 0.329
GluIle: 3.932 ± 0.509
GluLys: 2.226 ± 0.408
GluLeu: 5.935 ± 0.71
GluMet: 1.187 ± 0.259
GluAsn: 1.113 ± 0.275
GluPro: 2.597 ± 0.502
GluGln: 0.964 ± 0.206
GluArg: 5.342 ± 1.019
GluSer: 3.339 ± 0.515
GluThr: 3.635 ± 0.619
GluVal: 4.674 ± 0.631
GluTrp: 1.558 ± 0.299
GluTyr: 1.632 ± 0.39
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.116 ± 0.368
PheCys: 0.297 ± 0.157
PheAsp: 2.671 ± 0.568
PheGlu: 2.077 ± 0.347
PhePhe: 0.742 ± 0.216
PheGly: 2.968 ± 0.458
PheHis: 0.594 ± 0.196
PheIle: 1.558 ± 0.344
PheLys: 1.41 ± 0.308
PheLeu: 2.077 ± 0.409
PheMet: 0.742 ± 0.218
PheAsn: 0.371 ± 0.179
PhePro: 1.41 ± 0.23
PheGln: 1.039 ± 0.267
PheArg: 1.855 ± 0.379
PheSer: 1.706 ± 0.262
PheThr: 2.448 ± 0.487
PheVal: 2.077 ± 0.434
PheTrp: 0.519 ± 0.207
PheTyr: 1.039 ± 0.233
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 9.348 ± 0.84
GlyCys: 0.371 ± 0.162
GlyAsp: 5.342 ± 0.759
GlyGlu: 5.416 ± 0.542
GlyPhe: 2.745 ± 0.487
GlyGly: 7.122 ± 0.848
GlyHis: 1.706 ± 0.295
GlyIle: 3.561 ± 0.872
GlyLys: 4.822 ± 0.591
GlyLeu: 7.048 ± 0.899
GlyMet: 1.558 ± 0.289
GlyAsn: 2.448 ± 0.567
GlyPro: 3.487 ± 0.526
GlyGln: 1.781 ± 0.394
GlyArg: 5.787 ± 0.677
GlySer: 3.932 ± 0.632
GlyThr: 4.674 ± 0.836
GlyVal: 5.935 ± 0.657
GlyTrp: 2.003 ± 0.393
GlyTyr: 3.635 ± 0.457
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.374 ± 0.424
HisCys: 0.223 ± 0.115
HisAsp: 1.187 ± 0.262
HisGlu: 1.335 ± 0.333
HisPhe: 0.89 ± 0.265
HisGly: 1.632 ± 0.307
HisHis: 0.594 ± 0.198
HisIle: 0.297 ± 0.125
HisLys: 0.445 ± 0.179
HisLeu: 2.226 ± 0.402
HisMet: 0.223 ± 0.115
HisAsn: 0.445 ± 0.148
HisPro: 1.484 ± 0.301
HisGln: 0.297 ± 0.132
HisArg: 1.113 ± 0.233
HisSer: 1.039 ± 0.302
HisThr: 1.187 ± 0.268
HisVal: 1.113 ± 0.255
HisTrp: 0.223 ± 0.134
HisTyr: 0.371 ± 0.175
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.342 ± 0.66
IleCys: 0.148 ± 0.103
IleAsp: 2.3 ± 0.372
IleGlu: 3.635 ± 0.601
IlePhe: 1.113 ± 0.312
IleGly: 4.08 ± 0.514
IleHis: 0.89 ± 0.219
IleIle: 1.855 ± 0.43
IleLys: 1.558 ± 0.403
IleLeu: 3.042 ± 0.54
IleMet: 0.742 ± 0.197
IleAsn: 1.335 ± 0.352
IlePro: 2.448 ± 0.531
IleGln: 2.522 ± 0.563
IleArg: 3.635 ± 0.37
IleSer: 2.003 ± 0.436
IleThr: 2.522 ± 0.45
IleVal: 3.264 ± 0.588
IleTrp: 0.89 ± 0.244
IleTyr: 0.594 ± 0.233
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.79 ± 0.892
LysCys: 0.148 ± 0.104
LysAsp: 1.929 ± 0.445
LysGlu: 2.893 ± 0.49
LysPhe: 1.039 ± 0.293
LysGly: 3.264 ± 0.448
LysHis: 1.039 ± 0.29
LysIle: 2.374 ± 0.339
LysLys: 2.003 ± 0.439
LysLeu: 3.19 ± 0.408
LysMet: 0.964 ± 0.253
LysAsn: 1.039 ± 0.263
LysPro: 2.226 ± 0.371
LysGln: 1.113 ± 0.252
LysArg: 3.042 ± 0.418
LysSer: 1.558 ± 0.312
LysThr: 2.968 ± 0.529
LysVal: 2.893 ± 0.438
LysTrp: 0.816 ± 0.248
LysTyr: 0.519 ± 0.181
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.612 ± 1.502
LeuCys: 0.519 ± 0.199
LeuAsp: 5.638 ± 0.686
LeuGlu: 3.264 ± 0.435
LeuPhe: 2.3 ± 0.378
LeuGly: 7.493 ± 0.873
LeuHis: 1.929 ± 0.416
LeuIle: 4.229 ± 0.527
LeuLys: 2.448 ± 0.492
LeuLeu: 6.38 ± 0.815
LeuMet: 1.261 ± 0.352
LeuAsn: 2.745 ± 0.329
LeuPro: 4.303 ± 0.523
LeuGln: 2.893 ± 0.546
LeuArg: 7.196 ± 0.9
LeuSer: 4.674 ± 0.64
LeuThr: 6.529 ± 0.767
LeuVal: 5.935 ± 0.537
LeuTrp: 1.632 ± 0.282
LeuTyr: 1.335 ± 0.301
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.968 ± 0.429
MetCys: 0.148 ± 0.103
MetAsp: 1.41 ± 0.276
MetGlu: 0.742 ± 0.191
MetPhe: 0.223 ± 0.119
MetGly: 1.187 ± 0.36
MetHis: 0.074 ± 0.064
MetIle: 0.89 ± 0.288
MetLys: 1.261 ± 0.339
MetLeu: 1.484 ± 0.302
MetMet: 0.445 ± 0.219
MetAsn: 0.297 ± 0.124
MetPro: 1.41 ± 0.305
MetGln: 0.148 ± 0.094
MetArg: 1.261 ± 0.233
MetSer: 1.706 ± 0.353
MetThr: 1.855 ± 0.347
MetVal: 0.89 ± 0.278
MetTrp: 0.0 ± 0.0
MetTyr: 0.297 ± 0.173
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.968 ± 0.587
AsnCys: 0.0 ± 0.0
AsnAsp: 1.187 ± 0.26
AsnGlu: 0.816 ± 0.242
AsnPhe: 0.668 ± 0.227
AsnGly: 3.116 ± 0.465
AsnHis: 0.297 ± 0.122
AsnIle: 0.742 ± 0.191
AsnLys: 0.519 ± 0.183
AsnLeu: 2.151 ± 0.468
AsnMet: 0.519 ± 0.196
AsnAsn: 0.519 ± 0.29
AsnPro: 1.706 ± 0.287
AsnGln: 0.519 ± 0.178
AsnArg: 1.335 ± 0.249
AsnSer: 0.964 ± 0.288
AsnThr: 1.558 ± 0.358
AsnVal: 2.3 ± 0.62
AsnTrp: 0.668 ± 0.22
AsnTyr: 0.371 ± 0.157
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.677 ± 1.069
ProCys: 0.594 ± 0.201
ProAsp: 4.451 ± 0.607
ProGlu: 4.155 ± 0.618
ProPhe: 2.077 ± 0.432
ProGly: 4.08 ± 0.573
ProHis: 0.816 ± 0.217
ProIle: 1.781 ± 0.298
ProLys: 2.745 ± 0.561
ProLeu: 4.303 ± 0.703
ProMet: 0.964 ± 0.241
ProAsn: 1.039 ± 0.283
ProPro: 1.781 ± 0.449
ProGln: 1.558 ± 0.275
ProArg: 3.116 ± 0.418
ProSer: 2.3 ± 0.405
ProThr: 2.819 ± 0.398
ProVal: 4.006 ± 0.464
ProTrp: 0.89 ± 0.242
ProTyr: 0.816 ± 0.263
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.671 ± 0.588
GlnCys: 0.223 ± 0.114
GlnAsp: 1.41 ± 0.372
GlnGlu: 1.558 ± 0.322
GlnPhe: 0.594 ± 0.171
GlnGly: 1.929 ± 0.443
GlnHis: 0.668 ± 0.171
GlnIle: 2.003 ± 0.42
GlnLys: 1.781 ± 0.365
GlnLeu: 1.41 ± 0.319
GlnMet: 0.964 ± 0.298
GlnAsn: 0.594 ± 0.202
GlnPro: 0.668 ± 0.214
GlnGln: 0.519 ± 0.188
GlnArg: 2.3 ± 0.502
GlnSer: 1.484 ± 0.302
GlnThr: 2.448 ± 0.554
GlnVal: 1.484 ± 0.321
GlnTrp: 0.297 ± 0.155
GlnTyr: 0.445 ± 0.163
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.642 ± 0.983
ArgCys: 0.223 ± 0.105
ArgAsp: 4.822 ± 0.623
ArgGlu: 5.416 ± 0.808
ArgPhe: 2.226 ± 0.434
ArgGly: 4.6 ± 0.634
ArgHis: 1.261 ± 0.317
ArgIle: 3.19 ± 0.469
ArgLys: 3.339 ± 0.508
ArgLeu: 7.938 ± 0.98
ArgMet: 1.632 ± 0.341
ArgAsn: 1.558 ± 0.28
ArgPro: 3.042 ± 0.507
ArgGln: 1.781 ± 0.364
ArgArg: 6.9 ± 0.966
ArgSer: 4.6 ± 0.726
ArgThr: 4.526 ± 0.692
ArgVal: 5.045 ± 0.66
ArgTrp: 1.41 ± 0.306
ArgTyr: 2.448 ± 0.402
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.232 ± 0.584
SerCys: 0.297 ± 0.161
SerAsp: 2.671 ± 0.397
SerGlu: 3.561 ± 0.48
SerPhe: 2.374 ± 0.41
SerGly: 6.232 ± 0.85
SerHis: 0.297 ± 0.161
SerIle: 2.745 ± 0.458
SerLys: 1.706 ± 0.468
SerLeu: 3.19 ± 0.422
SerMet: 1.187 ± 0.275
SerAsn: 1.484 ± 0.374
SerPro: 2.893 ± 0.442
SerGln: 1.039 ± 0.285
SerArg: 3.413 ± 0.469
SerSer: 2.597 ± 0.433
SerThr: 5.267 ± 0.846
SerVal: 3.561 ± 0.49
SerTrp: 0.964 ± 0.254
SerTyr: 2.003 ± 0.437
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.977 ± 1.297
ThrCys: 0.445 ± 0.204
ThrAsp: 3.709 ± 0.461
ThrGlu: 4.303 ± 0.544
ThrPhe: 2.151 ± 0.367
ThrGly: 6.158 ± 0.814
ThrHis: 0.742 ± 0.235
ThrIle: 2.819 ± 0.527
ThrLys: 2.671 ± 0.418
ThrLeu: 6.158 ± 0.549
ThrMet: 1.039 ± 0.283
ThrAsn: 0.964 ± 0.287
ThrPro: 4.08 ± 0.541
ThrGln: 1.113 ± 0.341
ThrArg: 4.08 ± 0.631
ThrSer: 4.006 ± 0.488
ThrThr: 4.303 ± 0.542
ThrVal: 5.416 ± 0.788
ThrTrp: 0.519 ± 0.176
ThrTyr: 1.781 ± 0.458
ThrXaa: 0.0 ± 0.0
Val
ValAla: 10.09 ± 0.903
ValCys: 0.445 ± 0.185
ValAsp: 5.193 ± 0.71
ValGlu: 4.229 ± 0.708
ValPhe: 2.151 ± 0.36
ValGly: 4.451 ± 0.743
ValHis: 1.41 ± 0.302
ValIle: 2.597 ± 0.366
ValLys: 2.448 ± 0.433
ValLeu: 6.158 ± 0.638
ValMet: 1.113 ± 0.225
ValAsn: 1.558 ± 0.361
ValPro: 3.858 ± 0.578
ValGln: 1.781 ± 0.294
ValArg: 6.158 ± 0.742
ValSer: 4.526 ± 0.558
ValThr: 3.561 ± 0.543
ValVal: 5.342 ± 0.637
ValTrp: 1.484 ± 0.353
ValTyr: 2.226 ± 0.34
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.929 ± 0.365
TrpCys: 0.074 ± 0.065
TrpAsp: 1.113 ± 0.241
TrpGlu: 1.484 ± 0.333
TrpPhe: 0.223 ± 0.129
TrpGly: 1.41 ± 0.314
TrpHis: 0.371 ± 0.15
TrpIle: 0.964 ± 0.263
TrpLys: 0.742 ± 0.207
TrpLeu: 0.816 ± 0.246
TrpMet: 0.148 ± 0.086
TrpAsn: 0.742 ± 0.303
TrpPro: 0.668 ± 0.215
TrpGln: 0.594 ± 0.209
TrpArg: 2.151 ± 0.425
TrpSer: 1.41 ± 0.434
TrpThr: 1.632 ± 0.317
TrpVal: 0.445 ± 0.142
TrpTrp: 0.148 ± 0.094
TrpTyr: 0.594 ± 0.187
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.635 ± 0.456
TyrCys: 0.074 ± 0.086
TyrAsp: 1.335 ± 0.281
TyrGlu: 1.41 ± 0.347
TyrPhe: 0.964 ± 0.211
TyrGly: 2.968 ± 0.455
TyrHis: 0.519 ± 0.156
TyrIle: 0.742 ± 0.199
TyrLys: 1.039 ± 0.34
TyrLeu: 1.929 ± 0.4
TyrMet: 0.445 ± 0.158
TyrAsn: 0.445 ± 0.177
TyrPro: 1.558 ± 0.331
TyrGln: 0.445 ± 0.197
TyrArg: 1.41 ± 0.308
TyrSer: 1.113 ± 0.279
TyrThr: 2.893 ± 0.462
TyrVal: 1.929 ± 0.475
TyrTrp: 0.594 ± 0.198
TyrTyr: 0.742 ± 0.294
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 70 proteins (13480 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski