Amino acid dipepetide frequency for Gordonia phage Dmitri

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.703AlaAla: 20.703 ± 2.137
0.471AlaCys: 0.471 ± 0.229
8.671AlaAsp: 8.671 ± 0.918
8.335AlaGlu: 8.335 ± 0.96
2.958AlaPhe: 2.958 ± 0.556
10.889AlaGly: 10.889 ± 1.408
2.285AlaHis: 2.285 ± 0.385
5.109AlaIle: 5.109 ± 0.533
5.31AlaLys: 5.31 ± 0.799
9.814AlaLeu: 9.814 ± 0.799
3.899AlaMet: 3.899 ± 0.474
2.756AlaAsn: 2.756 ± 0.748
5.915AlaPro: 5.915 ± 0.684
5.512AlaGln: 5.512 ± 1.094
8.94AlaArg: 8.94 ± 0.936
5.512AlaSer: 5.512 ± 0.733
8.604AlaThr: 8.604 ± 0.904
7.461AlaVal: 7.461 ± 0.742
2.218AlaTrp: 2.218 ± 0.41
2.42AlaTyr: 2.42 ± 0.352
0.0AlaXaa: 0.0 ± 0.0
Cys
0.739CysAla: 0.739 ± 0.24
0.269CysCys: 0.269 ± 0.126
0.874CysAsp: 0.874 ± 0.257
0.202CysGlu: 0.202 ± 0.148
0.202CysPhe: 0.202 ± 0.119
0.807CysGly: 0.807 ± 0.274
0.134CysHis: 0.134 ± 0.086
0.202CysIle: 0.202 ± 0.106
0.134CysLys: 0.134 ± 0.107
0.538CysLeu: 0.538 ± 0.216
0.134CysMet: 0.134 ± 0.099
0.269CysAsn: 0.269 ± 0.124
0.403CysPro: 0.403 ± 0.195
0.403CysGln: 0.403 ± 0.166
0.941CysArg: 0.941 ± 0.24
0.471CysSer: 0.471 ± 0.195
0.269CysThr: 0.269 ± 0.129
0.739CysVal: 0.739 ± 0.245
0.269CysTrp: 0.269 ± 0.14
0.067CysTyr: 0.067 ± 0.073
0.0CysXaa: 0.0 ± 0.0
Asp
7.327AspAla: 7.327 ± 0.652
0.672AspCys: 0.672 ± 0.229
5.243AspAsp: 5.243 ± 0.788
5.377AspGlu: 5.377 ± 0.661
1.68AspPhe: 1.68 ± 0.401
6.587AspGly: 6.587 ± 0.627
1.949AspHis: 1.949 ± 0.393
2.756AspIle: 2.756 ± 0.438
1.68AspLys: 1.68 ± 0.359
7.058AspLeu: 7.058 ± 0.914
1.008AspMet: 1.008 ± 0.208
2.017AspAsn: 2.017 ± 0.371
4.772AspPro: 4.772 ± 0.703
1.949AspGln: 1.949 ± 0.45
4.772AspArg: 4.772 ± 0.772
3.226AspSer: 3.226 ± 0.489
3.092AspThr: 3.092 ± 0.423
4.84AspVal: 4.84 ± 0.726
0.941AspTrp: 0.941 ± 0.242
1.075AspTyr: 1.075 ± 0.26
0.0AspXaa: 0.0 ± 0.0
Glu
6.856GluAla: 6.856 ± 0.897
0.336GluCys: 0.336 ± 0.153
3.294GluAsp: 3.294 ± 0.485
2.017GluGlu: 2.017 ± 0.448
2.151GluPhe: 2.151 ± 0.439
3.697GluGly: 3.697 ± 0.555
1.008GluHis: 1.008 ± 0.286
3.428GluIle: 3.428 ± 0.538
1.882GluLys: 1.882 ± 0.388
5.714GluLeu: 5.714 ± 0.702
1.882GluMet: 1.882 ± 0.265
1.546GluAsn: 1.546 ± 0.256
2.285GluPro: 2.285 ± 0.407
3.092GluGln: 3.092 ± 0.423
3.966GluArg: 3.966 ± 0.562
3.428GluSer: 3.428 ± 0.506
3.294GluThr: 3.294 ± 0.489
3.563GluVal: 3.563 ± 0.642
1.21GluTrp: 1.21 ± 0.279
1.546GluTyr: 1.546 ± 0.264
0.0GluXaa: 0.0 ± 0.0
Phe
2.958PheAla: 2.958 ± 0.387
0.269PheCys: 0.269 ± 0.13
2.554PheAsp: 2.554 ± 0.573
2.017PheGlu: 2.017 ± 0.488
0.471PhePhe: 0.471 ± 0.155
2.689PheGly: 2.689 ± 0.543
0.538PheHis: 0.538 ± 0.188
1.008PheIle: 1.008 ± 0.296
0.538PheLys: 0.538 ± 0.168
2.017PheLeu: 2.017 ± 0.399
0.134PheMet: 0.134 ± 0.091
0.941PheAsn: 0.941 ± 0.284
1.075PhePro: 1.075 ± 0.374
0.605PheGln: 0.605 ± 0.158
1.21PheArg: 1.21 ± 0.326
1.008PheSer: 1.008 ± 0.245
1.815PheThr: 1.815 ± 0.377
1.949PheVal: 1.949 ± 0.313
0.672PheTrp: 0.672 ± 0.181
0.605PheTyr: 0.605 ± 0.173
0.0PheXaa: 0.0 ± 0.0
Gly
7.932GlyAla: 7.932 ± 0.858
0.403GlyCys: 0.403 ± 0.182
6.655GlyAsp: 6.655 ± 0.743
3.563GlyGlu: 3.563 ± 0.453
2.89GlyPhe: 2.89 ± 0.526
7.461GlyGly: 7.461 ± 0.839
1.815GlyHis: 1.815 ± 0.357
3.495GlyIle: 3.495 ± 0.56
2.89GlyLys: 2.89 ± 0.446
7.73GlyLeu: 7.73 ± 0.841
1.613GlyMet: 1.613 ± 0.325
2.353GlyAsn: 2.353 ± 0.391
4.638GlyPro: 4.638 ± 0.49
3.899GlyGln: 3.899 ± 0.64
6.251GlyArg: 6.251 ± 0.615
5.31GlySer: 5.31 ± 0.682
6.184GlyThr: 6.184 ± 0.794
7.125GlyVal: 7.125 ± 0.608
1.68GlyTrp: 1.68 ± 0.294
2.689GlyTyr: 2.689 ± 0.325
0.0GlyXaa: 0.0 ± 0.0
His
2.756HisAla: 2.756 ± 0.429
0.134HisCys: 0.134 ± 0.102
1.21HisAsp: 1.21 ± 0.289
1.008HisGlu: 1.008 ± 0.28
0.269HisPhe: 0.269 ± 0.166
1.613HisGly: 1.613 ± 0.388
0.538HisHis: 0.538 ± 0.17
0.874HisIle: 0.874 ± 0.201
0.538HisLys: 0.538 ± 0.169
2.151HisLeu: 2.151 ± 0.562
0.134HisMet: 0.134 ± 0.094
0.336HisAsn: 0.336 ± 0.156
1.143HisPro: 1.143 ± 0.298
0.471HisGln: 0.471 ± 0.166
1.479HisArg: 1.479 ± 0.336
1.143HisSer: 1.143 ± 0.258
1.546HisThr: 1.546 ± 0.283
0.807HisVal: 0.807 ± 0.223
0.336HisTrp: 0.336 ± 0.14
0.605HisTyr: 0.605 ± 0.222
0.0HisXaa: 0.0 ± 0.0
Ile
6.923IleAla: 6.923 ± 0.846
0.336IleCys: 0.336 ± 0.162
4.033IleAsp: 4.033 ± 0.542
3.025IleGlu: 3.025 ± 0.515
0.538IlePhe: 0.538 ± 0.268
3.294IleGly: 3.294 ± 0.433
0.471IleHis: 0.471 ± 0.197
1.479IleIle: 1.479 ± 0.415
1.008IleLys: 1.008 ± 0.259
2.554IleLeu: 2.554 ± 0.372
0.807IleMet: 0.807 ± 0.216
1.075IleAsn: 1.075 ± 0.272
2.084IlePro: 2.084 ± 0.333
1.949IleGln: 1.949 ± 0.406
2.689IleArg: 2.689 ± 0.389
2.285IleSer: 2.285 ± 0.48
3.63IleThr: 3.63 ± 0.538
2.487IleVal: 2.487 ± 0.428
0.538IleTrp: 0.538 ± 0.294
1.075IleTyr: 1.075 ± 0.35
0.0IleXaa: 0.0 ± 0.0
Lys
4.504LysAla: 4.504 ± 0.691
0.202LysCys: 0.202 ± 0.131
1.748LysAsp: 1.748 ± 0.422
1.479LysGlu: 1.479 ± 0.324
0.807LysPhe: 0.807 ± 0.301
2.218LysGly: 2.218 ± 0.385
0.605LysHis: 0.605 ± 0.223
1.143LysIle: 1.143 ± 0.218
1.479LysLys: 1.479 ± 0.349
3.159LysLeu: 3.159 ± 0.405
0.538LysMet: 0.538 ± 0.186
0.874LysAsn: 0.874 ± 0.227
1.143LysPro: 1.143 ± 0.304
0.941LysGln: 0.941 ± 0.239
1.748LysArg: 1.748 ± 0.348
2.017LysSer: 2.017 ± 0.361
3.092LysThr: 3.092 ± 0.579
2.554LysVal: 2.554 ± 0.427
0.941LysTrp: 0.941 ± 0.258
0.403LysTyr: 0.403 ± 0.156
0.0LysXaa: 0.0 ± 0.0
Leu
11.83LeuAla: 11.83 ± 1.001
0.672LeuCys: 0.672 ± 0.235
5.041LeuAsp: 5.041 ± 0.575
4.369LeuGlu: 4.369 ± 0.483
2.621LeuPhe: 2.621 ± 0.609
6.117LeuGly: 6.117 ± 0.952
1.815LeuHis: 1.815 ± 0.382
2.958LeuIle: 2.958 ± 0.499
2.218LeuLys: 2.218 ± 0.393
7.058LeuLeu: 7.058 ± 0.651
1.21LeuMet: 1.21 ± 0.332
1.748LeuAsn: 1.748 ± 0.293
5.176LeuPro: 5.176 ± 0.568
2.823LeuGln: 2.823 ± 0.401
5.646LeuArg: 5.646 ± 0.725
5.579LeuSer: 5.579 ± 0.584
5.982LeuThr: 5.982 ± 0.658
7.797LeuVal: 7.797 ± 0.845
1.68LeuTrp: 1.68 ± 0.438
1.075LeuTyr: 1.075 ± 0.251
0.0LeuXaa: 0.0 ± 0.0
Met
2.689MetAla: 2.689 ± 0.309
0.134MetCys: 0.134 ± 0.093
0.941MetAsp: 0.941 ± 0.223
0.672MetGlu: 0.672 ± 0.193
0.874MetPhe: 0.874 ± 0.302
1.815MetGly: 1.815 ± 0.445
0.336MetHis: 0.336 ± 0.159
0.672MetIle: 0.672 ± 0.183
0.672MetLys: 0.672 ± 0.197
1.882MetLeu: 1.882 ± 0.366
0.605MetMet: 0.605 ± 0.208
0.874MetAsn: 0.874 ± 0.263
0.874MetPro: 0.874 ± 0.203
0.874MetGln: 0.874 ± 0.301
1.412MetArg: 1.412 ± 0.265
2.218MetSer: 2.218 ± 0.419
1.949MetThr: 1.949 ± 0.265
1.277MetVal: 1.277 ± 0.321
0.336MetTrp: 0.336 ± 0.159
0.202MetTyr: 0.202 ± 0.1
0.0MetXaa: 0.0 ± 0.0
Asn
2.756AsnAla: 2.756 ± 0.451
0.202AsnCys: 0.202 ± 0.115
0.874AsnAsp: 0.874 ± 0.26
1.277AsnGlu: 1.277 ± 0.238
0.605AsnPhe: 0.605 ± 0.24
3.831AsnGly: 3.831 ± 0.652
0.672AsnHis: 0.672 ± 0.186
1.143AsnIle: 1.143 ± 0.322
0.739AsnLys: 0.739 ± 0.264
1.613AsnLeu: 1.613 ± 0.323
0.269AsnMet: 0.269 ± 0.138
0.672AsnAsn: 0.672 ± 0.229
2.621AsnPro: 2.621 ± 0.471
1.21AsnGln: 1.21 ± 0.247
1.479AsnArg: 1.479 ± 0.335
1.546AsnSer: 1.546 ± 0.339
1.21AsnThr: 1.21 ± 0.246
1.479AsnVal: 1.479 ± 0.287
0.807AsnTrp: 0.807 ± 0.221
0.538AsnTyr: 0.538 ± 0.172
0.0AsnXaa: 0.0 ± 0.0
Pro
6.991ProAla: 6.991 ± 0.799
0.605ProCys: 0.605 ± 0.186
4.504ProAsp: 4.504 ± 0.652
4.033ProGlu: 4.033 ± 0.467
1.546ProPhe: 1.546 ± 0.355
5.445ProGly: 5.445 ± 0.651
0.739ProHis: 0.739 ± 0.187
2.285ProIle: 2.285 ± 0.399
1.546ProLys: 1.546 ± 0.41
3.428ProLeu: 3.428 ± 0.438
0.605ProMet: 0.605 ± 0.192
1.613ProAsn: 1.613 ± 0.298
3.697ProPro: 3.697 ± 0.701
1.546ProGln: 1.546 ± 0.281
3.495ProArg: 3.495 ± 0.664
3.092ProSer: 3.092 ± 0.402
4.033ProThr: 4.033 ± 0.487
4.168ProVal: 4.168 ± 0.632
1.075ProTrp: 1.075 ± 0.199
1.075ProTyr: 1.075 ± 0.25
0.0ProXaa: 0.0 ± 0.0
Gln
4.369GlnAla: 4.369 ± 0.802
0.336GlnCys: 0.336 ± 0.159
1.68GlnAsp: 1.68 ± 0.299
1.68GlnGlu: 1.68 ± 0.347
0.941GlnPhe: 0.941 ± 0.23
2.823GlnGly: 2.823 ± 0.484
0.807GlnHis: 0.807 ± 0.244
1.748GlnIle: 1.748 ± 0.383
1.546GlnLys: 1.546 ± 0.33
4.302GlnLeu: 4.302 ± 0.523
0.941GlnMet: 0.941 ± 0.239
0.471GlnAsn: 0.471 ± 0.149
2.621GlnPro: 2.621 ± 0.333
1.815GlnGln: 1.815 ± 0.468
3.294GlnArg: 3.294 ± 0.494
2.151GlnSer: 2.151 ± 0.478
1.815GlnThr: 1.815 ± 0.347
3.159GlnVal: 3.159 ± 0.469
0.403GlnTrp: 0.403 ± 0.186
1.344GlnTyr: 1.344 ± 0.33
0.0GlnXaa: 0.0 ± 0.0
Arg
7.797ArgAla: 7.797 ± 0.708
0.874ArgCys: 0.874 ± 0.269
3.764ArgAsp: 3.764 ± 0.609
3.428ArgGlu: 3.428 ± 0.462
1.412ArgPhe: 1.412 ± 0.259
4.638ArgGly: 4.638 ± 0.533
1.412ArgHis: 1.412 ± 0.382
3.025ArgIle: 3.025 ± 0.526
2.689ArgLys: 2.689 ± 0.469
6.251ArgLeu: 6.251 ± 0.593
2.218ArgMet: 2.218 ± 0.37
1.815ArgAsn: 1.815 ± 0.347
3.294ArgPro: 3.294 ± 0.705
2.89ArgGln: 2.89 ± 0.44
7.394ArgArg: 7.394 ± 0.897
5.109ArgSer: 5.109 ± 0.67
4.705ArgThr: 4.705 ± 0.519
4.84ArgVal: 4.84 ± 0.578
1.412ArgTrp: 1.412 ± 0.337
1.748ArgTyr: 1.748 ± 0.382
0.0ArgXaa: 0.0 ± 0.0
Ser
7.663SerAla: 7.663 ± 0.69
0.134SerCys: 0.134 ± 0.094
3.697SerAsp: 3.697 ± 0.409
3.025SerGlu: 3.025 ± 0.397
1.21SerPhe: 1.21 ± 0.273
6.923SerGly: 6.923 ± 0.838
0.874SerHis: 0.874 ± 0.232
2.621SerIle: 2.621 ± 0.39
1.613SerLys: 1.613 ± 0.467
3.966SerLeu: 3.966 ± 0.514
1.412SerMet: 1.412 ± 0.319
1.412SerAsn: 1.412 ± 0.233
3.361SerPro: 3.361 ± 0.421
2.218SerGln: 2.218 ± 0.46
3.697SerArg: 3.697 ± 0.478
3.697SerSer: 3.697 ± 0.645
4.638SerThr: 4.638 ± 0.699
4.033SerVal: 4.033 ± 0.514
1.075SerTrp: 1.075 ± 0.327
1.143SerTyr: 1.143 ± 0.337
0.0SerXaa: 0.0 ± 0.0
Thr
9.276ThrAla: 9.276 ± 1.053
0.672ThrCys: 0.672 ± 0.237
4.907ThrAsp: 4.907 ± 0.75
3.899ThrGlu: 3.899 ± 0.563
1.68ThrPhe: 1.68 ± 0.4
7.125ThrGly: 7.125 ± 0.743
0.941ThrHis: 0.941 ± 0.223
3.563ThrIle: 3.563 ± 0.502
1.748ThrLys: 1.748 ± 0.322
5.31ThrLeu: 5.31 ± 0.513
1.277ThrMet: 1.277 ± 0.357
1.68ThrAsn: 1.68 ± 0.373
4.638ThrPro: 4.638 ± 0.58
2.017ThrGln: 2.017 ± 0.365
3.495ThrArg: 3.495 ± 0.544
3.428ThrSer: 3.428 ± 0.402
3.966ThrThr: 3.966 ± 0.39
5.982ThrVal: 5.982 ± 0.534
1.412ThrTrp: 1.412 ± 0.395
1.412ThrTyr: 1.412 ± 0.368
0.0ThrXaa: 0.0 ± 0.0
Val
9.276ValAla: 9.276 ± 0.769
0.739ValCys: 0.739 ± 0.231
5.176ValAsp: 5.176 ± 0.677
4.436ValGlu: 4.436 ± 0.582
1.143ValPhe: 1.143 ± 0.24
5.781ValGly: 5.781 ± 0.598
0.941ValHis: 0.941 ± 0.222
3.294ValIle: 3.294 ± 0.571
2.285ValLys: 2.285 ± 0.364
5.243ValLeu: 5.243 ± 0.759
1.344ValMet: 1.344 ± 0.227
2.084ValAsn: 2.084 ± 0.315
3.966ValPro: 3.966 ± 0.539
2.285ValGln: 2.285 ± 0.394
5.579ValArg: 5.579 ± 0.576
5.176ValSer: 5.176 ± 0.556
5.646ValThr: 5.646 ± 0.566
5.445ValVal: 5.445 ± 0.642
1.613ValTrp: 1.613 ± 0.377
1.344ValTyr: 1.344 ± 0.276
0.0ValXaa: 0.0 ± 0.0
Trp
1.815TrpAla: 1.815 ± 0.33
0.336TrpCys: 0.336 ± 0.132
1.748TrpAsp: 1.748 ± 0.404
0.807TrpGlu: 0.807 ± 0.341
0.605TrpPhe: 0.605 ± 0.195
1.344TrpGly: 1.344 ± 0.278
0.538TrpHis: 0.538 ± 0.212
0.739TrpIle: 0.739 ± 0.227
0.269TrpLys: 0.269 ± 0.177
1.815TrpLeu: 1.815 ± 0.357
0.941TrpMet: 0.941 ± 0.278
0.471TrpAsn: 0.471 ± 0.199
0.739TrpPro: 0.739 ± 0.226
0.874TrpGln: 0.874 ± 0.194
1.68TrpArg: 1.68 ± 0.303
1.143TrpSer: 1.143 ± 0.258
1.344TrpThr: 1.344 ± 0.421
1.412TrpVal: 1.412 ± 0.357
0.471TrpTrp: 0.471 ± 0.184
0.403TrpTyr: 0.403 ± 0.141
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.621TyrAla: 2.621 ± 0.337
0.269TyrCys: 0.269 ± 0.13
1.412TyrAsp: 1.412 ± 0.309
1.613TyrGlu: 1.613 ± 0.372
0.471TyrPhe: 0.471 ± 0.163
1.479TyrGly: 1.479 ± 0.337
0.672TyrHis: 0.672 ± 0.231
0.739TyrIle: 0.739 ± 0.257
0.807TyrLys: 0.807 ± 0.218
1.815TyrLeu: 1.815 ± 0.393
0.269TyrMet: 0.269 ± 0.191
0.538TyrAsn: 0.538 ± 0.15
1.075TyrPro: 1.075 ± 0.266
0.874TyrGln: 0.874 ± 0.263
1.546TyrArg: 1.546 ± 0.402
0.874TyrSer: 0.874 ± 0.237
1.613TyrThr: 1.613 ± 0.384
1.68TyrVal: 1.68 ± 0.351
0.403TyrTrp: 0.403 ± 0.191
0.672TyrTyr: 0.672 ± 0.241
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 63 proteins (14878 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski