Amino acid dipepetide frequency for Arthrobacter phage DrManhattan

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.887AlaAla: 16.887 ± 1.599
0.458AlaCys: 0.458 ± 0.184
7.412AlaAsp: 7.412 ± 0.903
9.017AlaGlu: 9.017 ± 0.977
3.439AlaPhe: 3.439 ± 0.465
8.558AlaGly: 8.558 ± 0.76
1.91AlaHis: 1.91 ± 0.332
4.967AlaIle: 4.967 ± 0.673
5.96AlaLys: 5.96 ± 0.913
12.149AlaLeu: 12.149 ± 1.27
3.209AlaMet: 3.209 ± 0.448
2.98AlaAsn: 2.98 ± 0.442
7.106AlaPro: 7.106 ± 0.972
4.203AlaGln: 4.203 ± 0.532
9.322AlaArg: 9.322 ± 1.069
6.266AlaSer: 6.266 ± 0.71
5.884AlaThr: 5.884 ± 0.639
8.864AlaVal: 8.864 ± 0.986
2.063AlaTrp: 2.063 ± 0.387
3.362AlaTyr: 3.362 ± 0.455
0.0AlaXaa: 0.0 ± 0.0
Cys
0.611CysAla: 0.611 ± 0.263
0.153CysCys: 0.153 ± 0.118
0.382CysAsp: 0.382 ± 0.168
0.611CysGlu: 0.611 ± 0.234
0.229CysPhe: 0.229 ± 0.128
0.611CysGly: 0.611 ± 0.17
0.229CysHis: 0.229 ± 0.139
0.153CysIle: 0.153 ± 0.098
0.0CysLys: 0.0 ± 0.0
0.153CysLeu: 0.153 ± 0.115
0.0CysMet: 0.0 ± 0.0
0.382CysAsn: 0.382 ± 0.178
0.688CysPro: 0.688 ± 0.234
0.229CysGln: 0.229 ± 0.135
0.535CysArg: 0.535 ± 0.244
0.076CysSer: 0.076 ± 0.082
0.229CysThr: 0.229 ± 0.134
0.382CysVal: 0.382 ± 0.214
0.076CysTrp: 0.076 ± 0.072
0.306CysTyr: 0.306 ± 0.173
0.0CysXaa: 0.0 ± 0.0
Asp
8.252AspAla: 8.252 ± 0.923
0.306AspCys: 0.306 ± 0.14
4.661AspAsp: 4.661 ± 0.591
4.661AspGlu: 4.661 ± 0.583
1.681AspPhe: 1.681 ± 0.316
6.953AspGly: 6.953 ± 0.755
1.681AspHis: 1.681 ± 0.343
1.452AspIle: 1.452 ± 0.335
1.681AspLys: 1.681 ± 0.415
7.336AspLeu: 7.336 ± 1.121
0.535AspMet: 0.535 ± 0.2
1.605AspAsn: 1.605 ± 0.354
3.973AspPro: 3.973 ± 0.483
1.299AspGln: 1.299 ± 0.332
3.897AspArg: 3.897 ± 0.699
3.668AspSer: 3.668 ± 0.425
3.439AspThr: 3.439 ± 0.451
4.585AspVal: 4.585 ± 0.497
0.993AspTrp: 0.993 ± 0.245
2.14AspTyr: 2.14 ± 0.408
0.0AspXaa: 0.0 ± 0.0
Glu
9.628GluAla: 9.628 ± 1.142
0.611GluCys: 0.611 ± 0.247
3.286GluAsp: 3.286 ± 0.547
4.508GluGlu: 4.508 ± 0.668
1.91GluPhe: 1.91 ± 0.413
4.738GluGly: 4.738 ± 0.523
1.452GluHis: 1.452 ± 0.34
3.591GluIle: 3.591 ± 0.545
2.369GluLys: 2.369 ± 0.404
5.043GluLeu: 5.043 ± 0.777
1.299GluMet: 1.299 ± 0.327
1.834GluAsn: 1.834 ± 0.4
3.439GluPro: 3.439 ± 0.659
0.917GluGln: 0.917 ± 0.269
5.272GluArg: 5.272 ± 0.807
2.598GluSer: 2.598 ± 0.457
4.661GluThr: 4.661 ± 0.627
4.814GluVal: 4.814 ± 0.751
1.757GluTrp: 1.757 ± 0.341
1.223GluTyr: 1.223 ± 0.313
0.0GluXaa: 0.0 ± 0.0
Phe
3.209PheAla: 3.209 ± 0.532
0.458PheCys: 0.458 ± 0.174
3.056PheAsp: 3.056 ± 0.445
2.369PheGlu: 2.369 ± 0.407
0.841PhePhe: 0.841 ± 0.238
2.904PheGly: 2.904 ± 0.46
0.382PheHis: 0.382 ± 0.202
1.299PheIle: 1.299 ± 0.304
1.146PheLys: 1.146 ± 0.29
2.445PheLeu: 2.445 ± 0.452
0.841PheMet: 0.841 ± 0.231
0.764PheAsn: 0.764 ± 0.224
1.452PhePro: 1.452 ± 0.309
1.146PheGln: 1.146 ± 0.414
1.605PheArg: 1.605 ± 0.336
1.91PheSer: 1.91 ± 0.344
2.292PheThr: 2.292 ± 0.459
2.063PheVal: 2.063 ± 0.461
0.306PheTrp: 0.306 ± 0.158
0.917PheTyr: 0.917 ± 0.264
0.0PheXaa: 0.0 ± 0.0
Gly
7.947GlyAla: 7.947 ± 0.984
0.382GlyCys: 0.382 ± 0.196
5.654GlyAsp: 5.654 ± 0.578
5.425GlyGlu: 5.425 ± 0.598
3.362GlyPhe: 3.362 ± 0.52
6.037GlyGly: 6.037 ± 0.763
1.375GlyHis: 1.375 ± 0.286
3.591GlyIle: 3.591 ± 0.638
4.661GlyLys: 4.661 ± 0.657
8.1GlyLeu: 8.1 ± 0.714
1.757GlyMet: 1.757 ± 0.38
3.056GlyAsn: 3.056 ± 0.62
4.05GlyPro: 4.05 ± 0.589
1.834GlyGln: 1.834 ± 0.445
5.884GlyArg: 5.884 ± 0.684
4.508GlySer: 4.508 ± 0.75
5.425GlyThr: 5.425 ± 1.038
5.807GlyVal: 5.807 ± 0.591
1.987GlyTrp: 1.987 ± 0.501
3.744GlyTyr: 3.744 ± 0.461
0.0GlyXaa: 0.0 ± 0.0
His
1.987HisAla: 1.987 ± 0.347
0.076HisCys: 0.076 ± 0.068
1.834HisAsp: 1.834 ± 0.574
1.375HisGlu: 1.375 ± 0.327
0.917HisPhe: 0.917 ± 0.295
1.681HisGly: 1.681 ± 0.391
0.458HisHis: 0.458 ± 0.165
0.764HisIle: 0.764 ± 0.249
0.458HisLys: 0.458 ± 0.167
1.223HisLeu: 1.223 ± 0.349
0.229HisMet: 0.229 ± 0.126
0.306HisAsn: 0.306 ± 0.124
1.299HisPro: 1.299 ± 0.288
0.458HisGln: 0.458 ± 0.169
1.375HisArg: 1.375 ± 0.294
0.611HisSer: 0.611 ± 0.215
1.07HisThr: 1.07 ± 0.293
1.223HisVal: 1.223 ± 0.326
0.382HisTrp: 0.382 ± 0.196
0.229HisTyr: 0.229 ± 0.129
0.0HisXaa: 0.0 ± 0.0
Ile
4.05IleAla: 4.05 ± 0.577
0.076IleCys: 0.076 ± 0.072
2.292IleAsp: 2.292 ± 0.442
2.904IleGlu: 2.904 ± 0.406
1.605IlePhe: 1.605 ± 0.348
3.515IleGly: 3.515 ± 0.79
0.993IleHis: 0.993 ± 0.253
1.07IleIle: 1.07 ± 0.33
1.375IleLys: 1.375 ± 0.348
3.515IleLeu: 3.515 ± 0.537
0.535IleMet: 0.535 ± 0.2
1.223IleAsn: 1.223 ± 0.294
2.292IlePro: 2.292 ± 0.389
1.987IleGln: 1.987 ± 0.537
4.126IleArg: 4.126 ± 0.46
2.063IleSer: 2.063 ± 0.482
2.98IleThr: 2.98 ± 0.434
2.904IleVal: 2.904 ± 0.506
0.688IleTrp: 0.688 ± 0.224
1.146IleTyr: 1.146 ± 0.352
0.0IleXaa: 0.0 ± 0.0
Lys
5.807LysAla: 5.807 ± 0.802
0.076LysCys: 0.076 ± 0.076
2.063LysAsp: 2.063 ± 0.458
2.522LysGlu: 2.522 ± 0.475
1.07LysPhe: 1.07 ± 0.25
3.897LysGly: 3.897 ± 0.623
1.07LysHis: 1.07 ± 0.308
1.987LysIle: 1.987 ± 0.367
1.987LysLys: 1.987 ± 0.678
3.133LysLeu: 3.133 ± 0.577
0.993LysMet: 0.993 ± 0.303
1.299LysAsn: 1.299 ± 0.315
2.598LysPro: 2.598 ± 0.437
0.917LysGln: 0.917 ± 0.389
3.209LysArg: 3.209 ± 0.483
1.987LysSer: 1.987 ± 0.417
2.598LysThr: 2.598 ± 0.474
3.209LysVal: 3.209 ± 0.487
0.993LysTrp: 0.993 ± 0.283
0.535LysTyr: 0.535 ± 0.191
0.0LysXaa: 0.0 ± 0.0
Leu
12.99LeuAla: 12.99 ± 1.154
0.458LeuCys: 0.458 ± 0.182
7.259LeuAsp: 7.259 ± 0.701
3.668LeuGlu: 3.668 ± 0.545
2.216LeuPhe: 2.216 ± 0.448
7.794LeuGly: 7.794 ± 0.884
1.528LeuHis: 1.528 ± 0.4
4.126LeuIle: 4.126 ± 0.46
2.445LeuLys: 2.445 ± 0.516
6.724LeuLeu: 6.724 ± 0.64
1.146LeuMet: 1.146 ± 0.332
2.445LeuAsn: 2.445 ± 0.327
4.738LeuPro: 4.738 ± 0.619
2.98LeuGln: 2.98 ± 0.577
7.488LeuArg: 7.488 ± 0.859
5.196LeuSer: 5.196 ± 0.712
6.495LeuThr: 6.495 ± 0.73
5.043LeuVal: 5.043 ± 0.618
1.299LeuTrp: 1.299 ± 0.258
1.91LeuTyr: 1.91 ± 0.407
0.0LeuXaa: 0.0 ± 0.0
Met
3.133MetAla: 3.133 ± 0.506
0.153MetCys: 0.153 ± 0.104
0.993MetAsp: 0.993 ± 0.249
0.764MetGlu: 0.764 ± 0.225
0.306MetPhe: 0.306 ± 0.155
1.223MetGly: 1.223 ± 0.487
0.229MetHis: 0.229 ± 0.148
1.605MetIle: 1.605 ± 0.424
0.993MetLys: 0.993 ± 0.309
1.375MetLeu: 1.375 ± 0.333
0.229MetMet: 0.229 ± 0.112
0.841MetAsn: 0.841 ± 0.279
1.07MetPro: 1.07 ± 0.306
0.535MetGln: 0.535 ± 0.188
1.223MetArg: 1.223 ± 0.323
1.452MetSer: 1.452 ± 0.312
2.216MetThr: 2.216 ± 0.545
1.07MetVal: 1.07 ± 0.25
0.0MetTrp: 0.0 ± 0.0
0.076MetTyr: 0.076 ± 0.079
0.0MetXaa: 0.0 ± 0.0
Asn
3.439AsnAla: 3.439 ± 0.475
0.0AsnCys: 0.0 ± 0.0
2.063AsnAsp: 2.063 ± 0.394
0.611AsnGlu: 0.611 ± 0.234
0.535AsnPhe: 0.535 ± 0.197
3.591AsnGly: 3.591 ± 0.504
0.382AsnHis: 0.382 ± 0.229
1.07AsnIle: 1.07 ± 0.323
0.764AsnLys: 0.764 ± 0.213
2.674AsnLeu: 2.674 ± 0.349
1.146AsnMet: 1.146 ± 0.26
0.764AsnAsn: 0.764 ± 0.238
2.445AsnPro: 2.445 ± 0.43
0.841AsnGln: 0.841 ± 0.289
1.223AsnArg: 1.223 ± 0.334
1.223AsnSer: 1.223 ± 0.274
2.063AsnThr: 2.063 ± 0.511
1.987AsnVal: 1.987 ± 0.585
0.382AsnTrp: 0.382 ± 0.141
0.611AsnTyr: 0.611 ± 0.188
0.0AsnXaa: 0.0 ± 0.0
Pro
8.023ProAla: 8.023 ± 1.083
0.458ProCys: 0.458 ± 0.213
3.744ProAsp: 3.744 ± 0.483
4.432ProGlu: 4.432 ± 0.701
2.063ProPhe: 2.063 ± 0.347
5.578ProGly: 5.578 ± 0.755
0.917ProHis: 0.917 ± 0.268
2.445ProIle: 2.445 ± 0.516
2.98ProLys: 2.98 ± 0.539
3.668ProLeu: 3.668 ± 0.582
1.223ProMet: 1.223 ± 0.313
0.993ProAsn: 0.993 ± 0.195
1.681ProPro: 1.681 ± 0.634
1.07ProGln: 1.07 ± 0.302
2.598ProArg: 2.598 ± 0.436
2.827ProSer: 2.827 ± 0.444
2.98ProThr: 2.98 ± 0.584
4.279ProVal: 4.279 ± 0.637
0.841ProTrp: 0.841 ± 0.254
1.146ProTyr: 1.146 ± 0.334
0.0ProXaa: 0.0 ± 0.0
Gln
2.827GlnAla: 2.827 ± 0.463
0.382GlnCys: 0.382 ± 0.148
0.993GlnAsp: 0.993 ± 0.276
1.605GlnGlu: 1.605 ± 0.36
0.993GlnPhe: 0.993 ± 0.336
1.681GlnGly: 1.681 ± 0.418
0.688GlnHis: 0.688 ± 0.226
1.605GlnIle: 1.605 ± 0.336
1.605GlnLys: 1.605 ± 0.391
1.757GlnLeu: 1.757 ± 0.327
1.146GlnMet: 1.146 ± 0.43
0.841GlnAsn: 0.841 ± 0.306
0.764GlnPro: 0.764 ± 0.219
0.229GlnGln: 0.229 ± 0.126
2.14GlnArg: 2.14 ± 0.41
1.91GlnSer: 1.91 ± 0.326
2.522GlnThr: 2.522 ± 0.51
2.063GlnVal: 2.063 ± 0.408
0.535GlnTrp: 0.535 ± 0.231
0.306GlnTyr: 0.306 ± 0.139
0.0GlnXaa: 0.0 ± 0.0
Arg
7.259ArgAla: 7.259 ± 0.879
0.306ArgCys: 0.306 ± 0.151
5.578ArgAsp: 5.578 ± 0.707
5.654ArgGlu: 5.654 ± 0.841
2.292ArgPhe: 2.292 ± 0.417
4.508ArgGly: 4.508 ± 0.633
0.993ArgHis: 0.993 ± 0.28
2.598ArgIle: 2.598 ± 0.591
3.744ArgLys: 3.744 ± 0.508
7.718ArgLeu: 7.718 ± 1.041
1.299ArgMet: 1.299 ± 0.301
2.063ArgAsn: 2.063 ± 0.405
3.209ArgPro: 3.209 ± 0.511
2.445ArgGln: 2.445 ± 0.49
5.731ArgArg: 5.731 ± 0.988
3.821ArgSer: 3.821 ± 0.788
3.591ArgThr: 3.591 ± 0.69
5.807ArgVal: 5.807 ± 0.571
1.146ArgTrp: 1.146 ± 0.264
2.751ArgTyr: 2.751 ± 0.605
0.0ArgXaa: 0.0 ± 0.0
Ser
6.648SerAla: 6.648 ± 0.741
0.153SerCys: 0.153 ± 0.117
2.904SerAsp: 2.904 ± 0.427
2.522SerGlu: 2.522 ± 0.422
1.987SerPhe: 1.987 ± 0.425
6.113SerGly: 6.113 ± 0.786
0.535SerHis: 0.535 ± 0.224
2.445SerIle: 2.445 ± 0.521
2.522SerLys: 2.522 ± 0.505
4.05SerLeu: 4.05 ± 0.573
1.07SerMet: 1.07 ± 0.249
1.375SerAsn: 1.375 ± 0.286
3.591SerPro: 3.591 ± 0.661
1.223SerGln: 1.223 ± 0.255
3.362SerArg: 3.362 ± 0.599
3.056SerSer: 3.056 ± 0.48
4.203SerThr: 4.203 ± 0.552
4.661SerVal: 4.661 ± 0.588
0.917SerTrp: 0.917 ± 0.229
1.452SerTyr: 1.452 ± 0.345
0.0SerXaa: 0.0 ± 0.0
Thr
7.412ThrAla: 7.412 ± 0.72
0.611ThrCys: 0.611 ± 0.38
3.744ThrAsp: 3.744 ± 0.415
5.043ThrGlu: 5.043 ± 0.689
2.522ThrPhe: 2.522 ± 0.392
5.884ThrGly: 5.884 ± 0.797
0.764ThrHis: 0.764 ± 0.226
1.987ThrIle: 1.987 ± 0.356
2.598ThrLys: 2.598 ± 0.381
6.419ThrLeu: 6.419 ± 0.762
0.764ThrMet: 0.764 ± 0.218
1.375ThrAsn: 1.375 ± 0.291
4.126ThrPro: 4.126 ± 0.68
1.146ThrGln: 1.146 ± 0.295
5.043ThrArg: 5.043 ± 0.704
4.279ThrSer: 4.279 ± 0.636
4.661ThrThr: 4.661 ± 0.852
4.432ThrVal: 4.432 ± 0.634
0.611ThrTrp: 0.611 ± 0.317
1.299ThrTyr: 1.299 ± 0.265
0.0ThrXaa: 0.0 ± 0.0
Val
9.399ValAla: 9.399 ± 0.936
0.688ValCys: 0.688 ± 0.259
3.973ValAsp: 3.973 ± 0.52
4.126ValGlu: 4.126 ± 0.706
2.292ValPhe: 2.292 ± 0.42
5.12ValGly: 5.12 ± 0.689
1.452ValHis: 1.452 ± 0.31
2.598ValIle: 2.598 ± 0.486
2.904ValLys: 2.904 ± 0.387
6.877ValLeu: 6.877 ± 0.675
1.146ValMet: 1.146 ± 0.306
2.369ValAsn: 2.369 ± 0.523
3.821ValPro: 3.821 ± 0.575
2.14ValGln: 2.14 ± 0.408
5.12ValArg: 5.12 ± 0.67
4.585ValSer: 4.585 ± 0.549
3.668ValThr: 3.668 ± 0.487
5.349ValVal: 5.349 ± 0.72
1.375ValTrp: 1.375 ± 0.319
2.445ValTyr: 2.445 ± 0.486
0.0ValXaa: 0.0 ± 0.0
Trp
1.757TrpAla: 1.757 ± 0.286
0.0TrpCys: 0.0 ± 0.0
1.223TrpAsp: 1.223 ± 0.311
1.452TrpGlu: 1.452 ± 0.309
0.229TrpPhe: 0.229 ± 0.15
1.375TrpGly: 1.375 ± 0.339
0.306TrpHis: 0.306 ± 0.181
0.841TrpIle: 0.841 ± 0.226
0.611TrpLys: 0.611 ± 0.214
1.223TrpLeu: 1.223 ± 0.3
0.306TrpMet: 0.306 ± 0.152
0.688TrpAsn: 0.688 ± 0.228
0.153TrpPro: 0.153 ± 0.111
0.535TrpGln: 0.535 ± 0.193
1.605TrpArg: 1.605 ± 0.278
1.605TrpSer: 1.605 ± 0.443
1.605TrpThr: 1.605 ± 0.324
0.993TrpVal: 0.993 ± 0.242
0.153TrpTrp: 0.153 ± 0.098
0.458TrpTyr: 0.458 ± 0.193
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.751TyrAla: 2.751 ± 0.447
0.229TyrCys: 0.229 ± 0.153
1.681TyrAsp: 1.681 ± 0.384
1.681TyrGlu: 1.681 ± 0.347
0.917TyrPhe: 0.917 ± 0.259
2.827TyrGly: 2.827 ± 0.529
0.611TyrHis: 0.611 ± 0.205
1.146TyrIle: 1.146 ± 0.38
1.223TyrLys: 1.223 ± 0.347
2.369TyrLeu: 2.369 ± 0.392
0.535TyrMet: 0.535 ± 0.206
0.611TyrAsn: 0.611 ± 0.352
1.452TyrPro: 1.452 ± 0.297
0.458TyrGln: 0.458 ± 0.178
1.757TyrArg: 1.757 ± 0.389
1.146TyrSer: 1.146 ± 0.314
2.14TyrThr: 2.14 ± 0.409
1.91TyrVal: 1.91 ± 0.468
0.611TyrTrp: 0.611 ± 0.231
0.535TyrTyr: 0.535 ± 0.208
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (13088 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski