Amino acid dipepetide frequency for Thermobifida phage P318

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.506AlaAla: 11.506 ± 1.389
0.719AlaCys: 0.719 ± 0.272
5.609AlaAsp: 5.609 ± 0.606
7.623AlaGlu: 7.623 ± 0.832
3.236AlaPhe: 3.236 ± 0.73
6.184AlaGly: 6.184 ± 0.637
0.863AlaHis: 0.863 ± 0.217
4.89AlaIle: 4.89 ± 0.587
3.883AlaLys: 3.883 ± 0.55
7.335AlaLeu: 7.335 ± 1.193
1.654AlaMet: 1.654 ± 0.419
2.733AlaAsn: 2.733 ± 0.479
4.459AlaPro: 4.459 ± 0.72
2.948AlaGln: 2.948 ± 0.389
6.76AlaArg: 6.76 ± 0.721
5.178AlaSer: 5.178 ± 0.536
6.256AlaThr: 6.256 ± 0.842
6.975AlaVal: 6.975 ± 0.974
1.654AlaTrp: 1.654 ± 0.367
3.596AlaTyr: 3.596 ± 0.546
0.0AlaXaa: 0.0 ± 0.0
Cys
0.719CysAla: 0.719 ± 0.208
0.072CysCys: 0.072 ± 0.07
0.36CysAsp: 0.36 ± 0.137
0.216CysGlu: 0.216 ± 0.149
0.36CysPhe: 0.36 ± 0.134
1.079CysGly: 1.079 ± 0.399
0.288CysHis: 0.288 ± 0.132
0.288CysIle: 0.288 ± 0.154
0.216CysLys: 0.216 ± 0.115
0.072CysLeu: 0.072 ± 0.075
0.072CysMet: 0.072 ± 0.074
0.216CysAsn: 0.216 ± 0.12
0.719CysPro: 0.719 ± 0.287
0.144CysGln: 0.144 ± 0.099
0.647CysArg: 0.647 ± 0.266
0.36CysSer: 0.36 ± 0.144
0.216CysThr: 0.216 ± 0.109
1.007CysVal: 1.007 ± 0.292
0.144CysTrp: 0.144 ± 0.092
0.288CysTyr: 0.288 ± 0.129
0.0CysXaa: 0.0 ± 0.0
Asp
5.25AspAla: 5.25 ± 0.662
0.288AspCys: 0.288 ± 0.179
4.674AspAsp: 4.674 ± 0.885
3.955AspGlu: 3.955 ± 0.597
1.798AspPhe: 1.798 ± 0.294
5.178AspGly: 5.178 ± 0.882
1.007AspHis: 1.007 ± 0.313
2.445AspIle: 2.445 ± 0.487
2.876AspLys: 2.876 ± 0.601
5.393AspLeu: 5.393 ± 0.433
0.791AspMet: 0.791 ± 0.22
1.654AspAsn: 1.654 ± 0.335
4.099AspPro: 4.099 ± 0.629
1.798AspGln: 1.798 ± 0.458
3.667AspArg: 3.667 ± 0.627
3.308AspSer: 3.308 ± 0.431
2.373AspThr: 2.373 ± 0.314
5.178AspVal: 5.178 ± 0.801
1.51AspTrp: 1.51 ± 0.311
2.229AspTyr: 2.229 ± 0.446
0.0AspXaa: 0.0 ± 0.0
Glu
7.263GluAla: 7.263 ± 0.925
0.36GluCys: 0.36 ± 0.189
3.452GluAsp: 3.452 ± 0.563
5.25GluGlu: 5.25 ± 0.989
2.733GluPhe: 2.733 ± 0.394
5.825GluGly: 5.825 ± 0.662
1.294GluHis: 1.294 ± 0.408
3.739GluIle: 3.739 ± 0.393
3.667GluLys: 3.667 ± 0.423
5.034GluLeu: 5.034 ± 0.665
1.654GluMet: 1.654 ± 0.322
2.589GluAsn: 2.589 ± 0.427
2.445GluPro: 2.445 ± 0.396
2.445GluGln: 2.445 ± 0.445
4.243GluArg: 4.243 ± 0.52
4.89GluSer: 4.89 ± 0.558
5.321GluThr: 5.321 ± 0.572
6.688GluVal: 6.688 ± 0.778
1.438GluTrp: 1.438 ± 0.343
2.805GluTyr: 2.805 ± 0.556
0.0GluXaa: 0.0 ± 0.0
Phe
3.236PheAla: 3.236 ± 0.475
0.36PheCys: 0.36 ± 0.158
2.014PheAsp: 2.014 ± 0.351
2.085PheGlu: 2.085 ± 0.317
1.007PhePhe: 1.007 ± 0.282
3.452PheGly: 3.452 ± 0.39
0.791PheHis: 0.791 ± 0.203
1.438PheIle: 1.438 ± 0.33
1.51PheLys: 1.51 ± 0.287
3.38PheLeu: 3.38 ± 0.571
0.36PheMet: 0.36 ± 0.137
1.654PheAsn: 1.654 ± 0.313
1.582PhePro: 1.582 ± 0.313
1.007PheGln: 1.007 ± 0.301
2.085PheArg: 2.085 ± 0.339
1.726PheSer: 1.726 ± 0.319
1.222PheThr: 1.222 ± 0.269
2.661PheVal: 2.661 ± 0.502
0.288PheTrp: 0.288 ± 0.169
1.151PheTyr: 1.151 ± 0.288
0.0PheXaa: 0.0 ± 0.0
Gly
4.962GlyAla: 4.962 ± 0.696
0.503GlyCys: 0.503 ± 0.18
5.681GlyAsp: 5.681 ± 0.553
6.544GlyGlu: 6.544 ± 0.689
2.301GlyPhe: 2.301 ± 0.388
7.191GlyGly: 7.191 ± 0.656
1.726GlyHis: 1.726 ± 0.398
4.602GlyIle: 4.602 ± 0.652
4.459GlyLys: 4.459 ± 0.552
4.818GlyLeu: 4.818 ± 0.6
2.229GlyMet: 2.229 ± 0.352
4.387GlyAsn: 4.387 ± 0.484
3.739GlyPro: 3.739 ± 0.538
2.157GlyGln: 2.157 ± 0.426
5.969GlyArg: 5.969 ± 0.648
5.106GlySer: 5.106 ± 0.76
4.387GlyThr: 4.387 ± 0.538
7.335GlyVal: 7.335 ± 0.771
1.366GlyTrp: 1.366 ± 0.409
3.164GlyTyr: 3.164 ± 0.474
0.0GlyXaa: 0.0 ± 0.0
His
1.007HisAla: 1.007 ± 0.237
0.503HisCys: 0.503 ± 0.158
1.366HisAsp: 1.366 ± 0.327
1.366HisGlu: 1.366 ± 0.289
0.575HisPhe: 0.575 ± 0.229
1.51HisGly: 1.51 ± 0.372
0.503HisHis: 0.503 ± 0.316
1.222HisIle: 1.222 ± 0.315
0.575HisLys: 0.575 ± 0.163
1.726HisLeu: 1.726 ± 0.366
0.216HisMet: 0.216 ± 0.123
0.36HisAsn: 0.36 ± 0.144
0.935HisPro: 0.935 ± 0.341
0.935HisGln: 0.935 ± 0.286
1.294HisArg: 1.294 ± 0.357
0.791HisSer: 0.791 ± 0.24
0.863HisThr: 0.863 ± 0.258
1.654HisVal: 1.654 ± 0.342
0.288HisTrp: 0.288 ± 0.148
0.575HisTyr: 0.575 ± 0.2
0.0HisXaa: 0.0 ± 0.0
Ile
4.243IleAla: 4.243 ± 0.76
0.216IleCys: 0.216 ± 0.114
3.092IleAsp: 3.092 ± 0.545
3.164IleGlu: 3.164 ± 0.439
1.007IlePhe: 1.007 ± 0.253
4.674IleGly: 4.674 ± 0.841
0.935IleHis: 0.935 ± 0.283
2.517IleIle: 2.517 ± 0.388
2.014IleLys: 2.014 ± 0.478
3.883IleLeu: 3.883 ± 0.509
0.935IleMet: 0.935 ± 0.263
2.229IleAsn: 2.229 ± 0.36
3.38IlePro: 3.38 ± 0.525
1.294IleGln: 1.294 ± 0.281
4.746IleArg: 4.746 ± 0.594
4.243IleSer: 4.243 ± 0.53
3.308IleThr: 3.308 ± 0.55
4.746IleVal: 4.746 ± 0.583
1.007IleTrp: 1.007 ± 0.251
0.791IleTyr: 0.791 ± 0.209
0.0IleXaa: 0.0 ± 0.0
Lys
5.609LysAla: 5.609 ± 1.004
0.36LysCys: 0.36 ± 0.186
1.222LysAsp: 1.222 ± 0.355
2.661LysGlu: 2.661 ± 0.512
1.582LysPhe: 1.582 ± 0.377
3.739LysGly: 3.739 ± 0.478
0.791LysHis: 0.791 ± 0.298
2.589LysIle: 2.589 ± 0.469
2.876LysLys: 2.876 ± 0.454
3.811LysLeu: 3.811 ± 0.51
1.222LysMet: 1.222 ± 0.238
1.654LysAsn: 1.654 ± 0.286
1.942LysPro: 1.942 ± 0.383
1.222LysGln: 1.222 ± 0.297
2.661LysArg: 2.661 ± 0.43
3.092LysSer: 3.092 ± 0.515
2.157LysThr: 2.157 ± 0.416
3.02LysVal: 3.02 ± 0.507
0.863LysTrp: 0.863 ± 0.202
1.654LysTyr: 1.654 ± 0.329
0.0LysXaa: 0.0 ± 0.0
Leu
7.982LeuAla: 7.982 ± 0.718
0.503LeuCys: 0.503 ± 0.183
4.027LeuAsp: 4.027 ± 0.434
6.616LeuGlu: 6.616 ± 0.815
2.445LeuPhe: 2.445 ± 0.445
5.393LeuGly: 5.393 ± 0.799
1.582LeuHis: 1.582 ± 0.378
3.883LeuIle: 3.883 ± 0.518
4.099LeuLys: 4.099 ± 0.538
4.818LeuLeu: 4.818 ± 0.62
1.726LeuMet: 1.726 ± 0.404
2.157LeuAsn: 2.157 ± 0.395
4.53LeuPro: 4.53 ± 0.534
2.445LeuGln: 2.445 ± 0.355
5.465LeuArg: 5.465 ± 0.401
5.609LeuSer: 5.609 ± 0.66
4.818LeuThr: 4.818 ± 0.658
4.962LeuVal: 4.962 ± 0.426
1.151LeuTrp: 1.151 ± 0.338
2.229LeuTyr: 2.229 ± 0.371
0.0LeuXaa: 0.0 ± 0.0
Met
2.517MetAla: 2.517 ± 0.414
0.144MetCys: 0.144 ± 0.104
1.438MetAsp: 1.438 ± 0.287
1.366MetGlu: 1.366 ± 0.279
0.431MetPhe: 0.431 ± 0.188
0.935MetGly: 0.935 ± 0.237
0.072MetHis: 0.072 ± 0.071
1.51MetIle: 1.51 ± 0.272
1.079MetLys: 1.079 ± 0.26
1.726MetLeu: 1.726 ± 0.312
0.503MetMet: 0.503 ± 0.206
0.719MetAsn: 0.719 ± 0.229
1.366MetPro: 1.366 ± 0.229
0.431MetGln: 0.431 ± 0.179
1.726MetArg: 1.726 ± 0.307
1.582MetSer: 1.582 ± 0.284
1.438MetThr: 1.438 ± 0.353
1.582MetVal: 1.582 ± 0.324
0.072MetTrp: 0.072 ± 0.054
0.36MetTyr: 0.36 ± 0.125
0.0MetXaa: 0.0 ± 0.0
Asn
3.38AsnAla: 3.38 ± 0.566
0.216AsnCys: 0.216 ± 0.133
1.582AsnAsp: 1.582 ± 0.349
2.014AsnGlu: 2.014 ± 0.352
1.366AsnPhe: 1.366 ± 0.254
3.236AsnGly: 3.236 ± 0.587
0.935AsnHis: 0.935 ± 0.273
2.517AsnIle: 2.517 ± 0.427
1.366AsnLys: 1.366 ± 0.389
2.445AsnLeu: 2.445 ± 0.502
0.647AsnMet: 0.647 ± 0.22
1.151AsnAsn: 1.151 ± 0.239
1.942AsnPro: 1.942 ± 0.264
0.863AsnGln: 0.863 ± 0.223
3.164AsnArg: 3.164 ± 0.475
2.805AsnSer: 2.805 ± 0.491
1.798AsnThr: 1.798 ± 0.288
2.805AsnVal: 2.805 ± 0.389
0.431AsnTrp: 0.431 ± 0.172
1.079AsnTyr: 1.079 ± 0.263
0.0AsnXaa: 0.0 ± 0.0
Pro
4.243ProAla: 4.243 ± 0.552
0.503ProCys: 0.503 ± 0.206
3.452ProAsp: 3.452 ± 0.635
4.243ProGlu: 4.243 ± 0.437
1.582ProPhe: 1.582 ± 0.27
5.393ProGly: 5.393 ± 0.801
1.222ProHis: 1.222 ± 0.295
3.38ProIle: 3.38 ± 0.637
2.373ProLys: 2.373 ± 0.526
5.034ProLeu: 5.034 ± 0.607
0.719ProMet: 0.719 ± 0.24
1.87ProAsn: 1.87 ± 0.376
2.085ProPro: 2.085 ± 0.547
1.654ProGln: 1.654 ± 0.458
2.085ProArg: 2.085 ± 0.294
2.589ProSer: 2.589 ± 0.541
3.164ProThr: 3.164 ± 0.535
4.746ProVal: 4.746 ± 0.557
0.791ProTrp: 0.791 ± 0.235
1.151ProTyr: 1.151 ± 0.372
0.0ProXaa: 0.0 ± 0.0
Gln
3.955GlnAla: 3.955 ± 0.475
0.36GlnCys: 0.36 ± 0.168
1.726GlnAsp: 1.726 ± 0.374
2.301GlnGlu: 2.301 ± 0.414
1.222GlnPhe: 1.222 ± 0.245
1.942GlnGly: 1.942 ± 0.356
0.647GlnHis: 0.647 ± 0.223
1.51GlnIle: 1.51 ± 0.354
1.366GlnLys: 1.366 ± 0.311
2.661GlnLeu: 2.661 ± 0.5
1.007GlnMet: 1.007 ± 0.209
1.654GlnAsn: 1.654 ± 0.364
1.079GlnPro: 1.079 ± 0.288
1.222GlnGln: 1.222 ± 0.325
2.229GlnArg: 2.229 ± 0.439
1.079GlnSer: 1.079 ± 0.276
1.151GlnThr: 1.151 ± 0.282
2.157GlnVal: 2.157 ± 0.444
0.288GlnTrp: 0.288 ± 0.155
1.007GlnTyr: 1.007 ± 0.329
0.0GlnXaa: 0.0 ± 0.0
Arg
5.753ArgAla: 5.753 ± 0.842
0.431ArgCys: 0.431 ± 0.163
4.027ArgAsp: 4.027 ± 0.547
6.4ArgGlu: 6.4 ± 0.839
3.452ArgPhe: 3.452 ± 0.543
5.25ArgGly: 5.25 ± 0.552
1.079ArgHis: 1.079 ± 0.232
3.452ArgIle: 3.452 ± 0.432
2.517ArgLys: 2.517 ± 0.475
6.256ArgLeu: 6.256 ± 0.606
2.373ArgMet: 2.373 ± 0.405
2.733ArgAsn: 2.733 ± 0.421
3.236ArgPro: 3.236 ± 0.552
2.301ArgGln: 2.301 ± 0.557
5.465ArgArg: 5.465 ± 0.513
3.883ArgSer: 3.883 ± 0.515
2.876ArgThr: 2.876 ± 0.405
4.818ArgVal: 4.818 ± 0.694
1.51ArgTrp: 1.51 ± 0.382
2.085ArgTyr: 2.085 ± 0.37
0.0ArgXaa: 0.0 ± 0.0
Ser
6.616SerAla: 6.616 ± 0.669
0.503SerCys: 0.503 ± 0.18
3.452SerAsp: 3.452 ± 0.546
4.099SerGlu: 4.099 ± 0.489
2.157SerPhe: 2.157 ± 0.366
5.969SerGly: 5.969 ± 0.607
1.438SerHis: 1.438 ± 0.338
2.373SerIle: 2.373 ± 0.368
2.014SerLys: 2.014 ± 0.461
4.459SerLeu: 4.459 ± 0.622
1.798SerMet: 1.798 ± 0.314
2.085SerAsn: 2.085 ± 0.333
3.164SerPro: 3.164 ± 0.507
2.373SerGln: 2.373 ± 0.467
5.106SerArg: 5.106 ± 0.784
4.099SerSer: 4.099 ± 0.613
3.596SerThr: 3.596 ± 0.449
4.962SerVal: 4.962 ± 0.559
0.863SerTrp: 0.863 ± 0.312
1.726SerTyr: 1.726 ± 0.321
0.0SerXaa: 0.0 ± 0.0
Thr
4.387ThrAla: 4.387 ± 0.614
0.36ThrCys: 0.36 ± 0.151
2.661ThrAsp: 2.661 ± 0.481
3.955ThrGlu: 3.955 ± 0.445
1.294ThrPhe: 1.294 ± 0.258
5.897ThrGly: 5.897 ± 0.592
1.079ThrHis: 1.079 ± 0.3
3.596ThrIle: 3.596 ± 0.387
2.373ThrLys: 2.373 ± 0.531
4.171ThrLeu: 4.171 ± 0.641
1.222ThrMet: 1.222 ± 0.22
2.301ThrAsn: 2.301 ± 0.35
4.602ThrPro: 4.602 ± 0.667
1.366ThrGln: 1.366 ± 0.352
2.589ThrArg: 2.589 ± 0.478
3.236ThrSer: 3.236 ± 0.458
2.589ThrThr: 2.589 ± 0.373
4.602ThrVal: 4.602 ± 0.661
1.151ThrTrp: 1.151 ± 0.3
2.014ThrTyr: 2.014 ± 0.408
0.0ThrXaa: 0.0 ± 0.0
Val
7.191ValAla: 7.191 ± 0.864
0.503ValCys: 0.503 ± 0.168
6.544ValAsp: 6.544 ± 0.793
4.89ValGlu: 4.89 ± 0.684
3.236ValPhe: 3.236 ± 0.557
5.178ValGly: 5.178 ± 0.533
1.294ValHis: 1.294 ± 0.275
4.315ValIle: 4.315 ± 0.645
3.667ValLys: 3.667 ± 0.477
5.465ValLeu: 5.465 ± 0.614
1.438ValMet: 1.438 ± 0.309
1.87ValAsn: 1.87 ± 0.394
4.315ValPro: 4.315 ± 0.558
2.445ValGln: 2.445 ± 0.317
6.328ValArg: 6.328 ± 0.648
5.681ValSer: 5.681 ± 0.608
5.25ValThr: 5.25 ± 0.629
6.472ValVal: 6.472 ± 0.893
1.654ValTrp: 1.654 ± 0.331
2.733ValTyr: 2.733 ± 0.49
0.0ValXaa: 0.0 ± 0.0
Trp
1.294TrpAla: 1.294 ± 0.305
0.144TrpCys: 0.144 ± 0.08
1.51TrpAsp: 1.51 ± 0.325
1.654TrpGlu: 1.654 ± 0.354
0.216TrpPhe: 0.216 ± 0.114
1.798TrpGly: 1.798 ± 0.334
0.216TrpHis: 0.216 ± 0.094
1.222TrpIle: 1.222 ± 0.301
0.575TrpLys: 0.575 ± 0.184
1.007TrpLeu: 1.007 ± 0.281
0.0TrpMet: 0.0 ± 0.0
0.863TrpAsn: 0.863 ± 0.26
0.935TrpPro: 0.935 ± 0.293
0.791TrpGln: 0.791 ± 0.244
1.007TrpArg: 1.007 ± 0.253
1.151TrpSer: 1.151 ± 0.254
1.151TrpThr: 1.151 ± 0.323
1.294TrpVal: 1.294 ± 0.253
0.216TrpTrp: 0.216 ± 0.105
0.647TrpTyr: 0.647 ± 0.208
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.948TyrAla: 2.948 ± 0.429
0.431TyrCys: 0.431 ± 0.179
1.582TyrAsp: 1.582 ± 0.393
2.733TyrGlu: 2.733 ± 0.55
1.007TyrPhe: 1.007 ± 0.286
2.805TyrGly: 2.805 ± 0.467
0.503TyrHis: 0.503 ± 0.217
0.935TyrIle: 0.935 ± 0.244
1.222TyrLys: 1.222 ± 0.332
2.948TyrLeu: 2.948 ± 0.428
0.36TyrMet: 0.36 ± 0.133
0.863TyrAsn: 0.863 ± 0.272
1.726TyrPro: 1.726 ± 0.402
0.719TyrGln: 0.719 ± 0.201
2.733TyrArg: 2.733 ± 0.397
2.445TyrSer: 2.445 ± 0.618
1.51TyrThr: 1.51 ± 0.35
2.805TyrVal: 2.805 ± 0.491
1.007TyrTrp: 1.007 ± 0.212
1.151TyrTyr: 1.151 ± 0.319
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (13907 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski