Amino acid dipepetide frequency for Microbacterium phage Phinky

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.653AlaAla: 16.653 ± 1.317
0.897AlaCys: 0.897 ± 0.221
6.581AlaAsp: 6.581 ± 0.575
8.576AlaGlu: 8.576 ± 0.774
2.443AlaPhe: 2.443 ± 0.319
7.878AlaGly: 7.878 ± 0.762
1.745AlaHis: 1.745 ± 0.327
5.036AlaIle: 5.036 ± 0.548
3.49AlaLys: 3.49 ± 0.491
9.373AlaLeu: 9.373 ± 0.892
2.742AlaMet: 2.742 ± 0.408
3.789AlaAsn: 3.789 ± 0.493
5.185AlaPro: 5.185 ± 0.482
4.138AlaGln: 4.138 ± 0.462
7.329AlaArg: 7.329 ± 0.682
6.831AlaSer: 6.831 ± 0.79
7.977AlaThr: 7.977 ± 0.884
8.127AlaVal: 8.127 ± 0.883
2.144AlaTrp: 2.144 ± 0.356
3.54AlaTyr: 3.54 ± 0.426
0.0AlaXaa: 0.0 ± 0.0
Cys
0.748CysAla: 0.748 ± 0.224
0.1CysCys: 0.1 ± 0.07
0.399CysAsp: 0.399 ± 0.127
0.698CysGlu: 0.698 ± 0.172
0.249CysPhe: 0.249 ± 0.143
1.047CysGly: 1.047 ± 0.24
0.349CysHis: 0.349 ± 0.133
0.15CysIle: 0.15 ± 0.079
0.15CysLys: 0.15 ± 0.089
0.299CysLeu: 0.299 ± 0.12
0.0CysMet: 0.0 ± 0.0
0.249CysAsn: 0.249 ± 0.129
0.748CysPro: 0.748 ± 0.205
0.199CysGln: 0.199 ± 0.132
0.748CysArg: 0.748 ± 0.236
0.399CysSer: 0.399 ± 0.138
0.698CysThr: 0.698 ± 0.197
0.648CysVal: 0.648 ± 0.185
0.199CysTrp: 0.199 ± 0.092
0.449CysTyr: 0.449 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
7.578AspAla: 7.578 ± 0.751
0.598AspCys: 0.598 ± 0.142
4.138AspAsp: 4.138 ± 0.449
4.737AspGlu: 4.737 ± 0.61
1.994AspPhe: 1.994 ± 0.295
6.93AspGly: 6.93 ± 0.544
1.496AspHis: 1.496 ± 0.371
2.343AspIle: 2.343 ± 0.308
1.446AspLys: 1.446 ± 0.299
5.135AspLeu: 5.135 ± 0.558
1.097AspMet: 1.097 ± 0.274
1.795AspAsn: 1.795 ± 0.299
3.59AspPro: 3.59 ± 0.416
2.543AspGln: 2.543 ± 0.303
3.291AspArg: 3.291 ± 0.451
3.34AspSer: 3.34 ± 0.424
4.238AspThr: 4.238 ± 0.415
3.839AspVal: 3.839 ± 0.417
1.645AspTrp: 1.645 ± 0.274
1.645AspTyr: 1.645 ± 0.467
0.0AspXaa: 0.0 ± 0.0
Glu
7.13GluAla: 7.13 ± 0.68
0.349GluCys: 0.349 ± 0.146
3.689GluAsp: 3.689 ± 0.524
4.687GluGlu: 4.687 ± 0.665
2.044GluPhe: 2.044 ± 0.352
5.135GluGly: 5.135 ± 0.628
1.595GluHis: 1.595 ± 0.296
4.038GluIle: 4.038 ± 0.506
1.944GluLys: 1.944 ± 0.367
5.983GluLeu: 5.983 ± 0.549
1.745GluMet: 1.745 ± 0.35
1.895GluAsn: 1.895 ± 0.312
4.587GluPro: 4.587 ± 0.839
3.889GluGln: 3.889 ± 0.61
5.534GluArg: 5.534 ± 0.581
3.34GluSer: 3.34 ± 0.363
3.59GluThr: 3.59 ± 0.442
5.933GluVal: 5.933 ± 0.653
1.346GluTrp: 1.346 ± 0.266
1.595GluTyr: 1.595 ± 0.283
0.0GluXaa: 0.0 ± 0.0
Phe
3.49PheAla: 3.49 ± 0.403
0.349PheCys: 0.349 ± 0.127
2.543PheAsp: 2.543 ± 0.354
2.244PheGlu: 2.244 ± 0.304
0.698PhePhe: 0.698 ± 0.156
2.692PheGly: 2.692 ± 0.438
0.399PheHis: 0.399 ± 0.16
0.947PheIle: 0.947 ± 0.253
0.897PheLys: 0.897 ± 0.243
1.645PheLeu: 1.645 ± 0.305
0.698PheMet: 0.698 ± 0.218
0.349PheAsn: 0.349 ± 0.119
1.346PhePro: 1.346 ± 0.268
0.548PheGln: 0.548 ± 0.178
1.745PheArg: 1.745 ± 0.284
1.595PheSer: 1.595 ± 0.326
3.041PheThr: 3.041 ± 0.422
1.595PheVal: 1.595 ± 0.267
0.449PheTrp: 0.449 ± 0.168
0.698PheTyr: 0.698 ± 0.169
0.0PheXaa: 0.0 ± 0.0
Gly
9.074GlyAla: 9.074 ± 0.765
1.147GlyCys: 1.147 ± 0.328
5.335GlyAsp: 5.335 ± 0.585
5.285GlyGlu: 5.285 ± 0.69
3.041GlyPhe: 3.041 ± 0.39
7.18GlyGly: 7.18 ± 0.704
1.944GlyHis: 1.944 ± 0.35
3.939GlyIle: 3.939 ± 0.485
2.543GlyLys: 2.543 ± 0.375
5.734GlyLeu: 5.734 ± 0.662
1.994GlyMet: 1.994 ± 0.371
2.642GlyAsn: 2.642 ± 0.428
3.141GlyPro: 3.141 ± 0.474
3.291GlyGln: 3.291 ± 0.373
6.631GlyArg: 6.631 ± 0.758
5.335GlySer: 5.335 ± 0.525
6.332GlyThr: 6.332 ± 1.135
6.631GlyVal: 6.631 ± 0.597
1.645GlyTrp: 1.645 ± 0.306
2.892GlyTyr: 2.892 ± 0.39
0.0GlyXaa: 0.0 ± 0.0
His
2.094HisAla: 2.094 ± 0.342
0.199HisCys: 0.199 ± 0.093
1.446HisAsp: 1.446 ± 0.309
1.296HisGlu: 1.296 ± 0.305
0.449HisPhe: 0.449 ± 0.154
1.745HisGly: 1.745 ± 0.395
0.399HisHis: 0.399 ± 0.153
0.698HisIle: 0.698 ± 0.221
0.449HisLys: 0.449 ± 0.154
1.446HisLeu: 1.446 ± 0.28
0.349HisMet: 0.349 ± 0.138
0.648HisAsn: 0.648 ± 0.195
1.246HisPro: 1.246 ± 0.25
0.449HisGln: 0.449 ± 0.134
1.097HisArg: 1.097 ± 0.274
0.947HisSer: 0.947 ± 0.219
0.698HisThr: 0.698 ± 0.217
1.845HisVal: 1.845 ± 0.277
0.399HisTrp: 0.399 ± 0.139
0.648HisTyr: 0.648 ± 0.192
0.0HisXaa: 0.0 ± 0.0
Ile
5.734IleAla: 5.734 ± 0.534
0.299IleCys: 0.299 ± 0.11
4.487IleAsp: 4.487 ± 0.542
4.138IleGlu: 4.138 ± 0.479
1.147IlePhe: 1.147 ± 0.259
4.238IleGly: 4.238 ± 0.563
0.848IleHis: 0.848 ± 0.208
2.293IleIle: 2.293 ± 0.37
0.748IleLys: 0.748 ± 0.231
2.792IleLeu: 2.792 ± 0.398
0.748IleMet: 0.748 ± 0.198
1.346IleAsn: 1.346 ± 0.251
3.091IlePro: 3.091 ± 0.589
1.446IleGln: 1.446 ± 0.312
2.892IleArg: 2.892 ± 0.339
2.842IleSer: 2.842 ± 0.356
3.989IleThr: 3.989 ± 0.501
3.989IleVal: 3.989 ± 0.346
0.997IleTrp: 0.997 ± 0.228
0.997IleTyr: 0.997 ± 0.231
0.0IleXaa: 0.0 ± 0.0
Lys
3.091LysAla: 3.091 ± 0.448
0.1LysCys: 0.1 ± 0.071
1.645LysAsp: 1.645 ± 0.282
1.496LysGlu: 1.496 ± 0.306
0.798LysPhe: 0.798 ± 0.193
2.044LysGly: 2.044 ± 0.348
0.698LysHis: 0.698 ± 0.204
0.997LysIle: 0.997 ± 0.216
0.698LysLys: 0.698 ± 0.16
2.393LysLeu: 2.393 ± 0.37
0.648LysMet: 0.648 ± 0.176
1.047LysAsn: 1.047 ± 0.247
1.496LysPro: 1.496 ± 0.307
1.047LysGln: 1.047 ± 0.247
2.393LysArg: 2.393 ± 0.398
1.346LysSer: 1.346 ± 0.319
1.546LysThr: 1.546 ± 0.341
1.546LysVal: 1.546 ± 0.284
0.399LysTrp: 0.399 ± 0.137
0.449LysTyr: 0.449 ± 0.132
0.0LysXaa: 0.0 ± 0.0
Leu
8.825LeuAla: 8.825 ± 0.819
0.249LeuCys: 0.249 ± 0.092
4.986LeuAsp: 4.986 ± 0.577
5.883LeuGlu: 5.883 ± 0.623
1.496LeuPhe: 1.496 ± 0.253
6.631LeuGly: 6.631 ± 0.612
1.047LeuHis: 1.047 ± 0.237
3.889LeuIle: 3.889 ± 0.511
1.745LeuLys: 1.745 ± 0.36
5.534LeuLeu: 5.534 ± 0.487
1.147LeuMet: 1.147 ± 0.245
2.493LeuAsn: 2.493 ± 0.393
4.088LeuPro: 4.088 ± 0.448
2.393LeuGln: 2.393 ± 0.288
5.534LeuArg: 5.534 ± 0.57
4.936LeuSer: 4.936 ± 0.6
6.482LeuThr: 6.482 ± 0.654
4.537LeuVal: 4.537 ± 0.534
0.947LeuTrp: 0.947 ± 0.227
1.097LeuTyr: 1.097 ± 0.271
0.0LeuXaa: 0.0 ± 0.0
Met
2.443MetAla: 2.443 ± 0.401
0.15MetCys: 0.15 ± 0.081
0.798MetAsp: 0.798 ± 0.185
1.097MetGlu: 1.097 ± 0.254
0.449MetPhe: 0.449 ± 0.204
1.595MetGly: 1.595 ± 0.303
0.299MetHis: 0.299 ± 0.135
1.246MetIle: 1.246 ± 0.301
0.648MetLys: 0.648 ± 0.18
2.343MetLeu: 2.343 ± 0.437
0.399MetMet: 0.399 ± 0.145
0.548MetAsn: 0.548 ± 0.171
1.396MetPro: 1.396 ± 0.255
1.246MetGln: 1.246 ± 0.255
2.044MetArg: 2.044 ± 0.296
2.493MetSer: 2.493 ± 0.428
1.795MetThr: 1.795 ± 0.316
1.246MetVal: 1.246 ± 0.277
0.1MetTrp: 0.1 ± 0.058
0.1MetTyr: 0.1 ± 0.068
0.0MetXaa: 0.0 ± 0.0
Asn
3.49AsnAla: 3.49 ± 0.436
0.249AsnCys: 0.249 ± 0.106
1.595AsnAsp: 1.595 ± 0.341
1.745AsnGlu: 1.745 ± 0.313
0.748AsnPhe: 0.748 ± 0.248
3.49AsnGly: 3.49 ± 0.336
0.548AsnHis: 0.548 ± 0.182
0.897AsnIle: 0.897 ± 0.204
0.548AsnLys: 0.548 ± 0.165
2.194AsnLeu: 2.194 ± 0.342
0.648AsnMet: 0.648 ± 0.174
0.598AsnAsn: 0.598 ± 0.172
2.593AsnPro: 2.593 ± 0.572
1.197AsnGln: 1.197 ± 0.205
2.194AsnArg: 2.194 ± 0.31
1.994AsnSer: 1.994 ± 0.449
1.895AsnThr: 1.895 ± 0.325
2.044AsnVal: 2.044 ± 0.32
0.598AsnTrp: 0.598 ± 0.172
0.648AsnTyr: 0.648 ± 0.185
0.0AsnXaa: 0.0 ± 0.0
Pro
6.432ProAla: 6.432 ± 1.386
0.499ProCys: 0.499 ± 0.153
3.739ProAsp: 3.739 ± 0.478
5.135ProGlu: 5.135 ± 0.792
1.197ProPhe: 1.197 ± 0.234
5.484ProGly: 5.484 ± 0.665
0.698ProHis: 0.698 ± 0.179
2.593ProIle: 2.593 ± 0.343
1.197ProLys: 1.197 ± 0.272
3.141ProLeu: 3.141 ± 0.394
0.947ProMet: 0.947 ± 0.207
1.895ProAsn: 1.895 ± 0.394
3.39ProPro: 3.39 ± 0.46
2.493ProGln: 2.493 ± 0.73
2.892ProArg: 2.892 ± 0.379
2.792ProSer: 2.792 ± 0.358
4.288ProThr: 4.288 ± 0.8
4.437ProVal: 4.437 ± 0.534
1.246ProTrp: 1.246 ± 0.276
1.097ProTyr: 1.097 ± 0.293
0.0ProXaa: 0.0 ± 0.0
Gln
3.839GlnAla: 3.839 ± 0.451
0.548GlnCys: 0.548 ± 0.159
1.496GlnAsp: 1.496 ± 0.281
1.944GlnGlu: 1.944 ± 0.311
1.446GlnPhe: 1.446 ± 0.274
2.942GlnGly: 2.942 ± 0.81
0.897GlnHis: 0.897 ± 0.239
1.795GlnIle: 1.795 ± 0.374
0.598GlnLys: 0.598 ± 0.183
2.892GlnLeu: 2.892 ± 0.399
1.346GlnMet: 1.346 ± 0.223
1.197GlnAsn: 1.197 ± 0.223
2.144GlnPro: 2.144 ± 0.451
3.191GlnGln: 3.191 ± 1.153
2.942GlnArg: 2.942 ± 0.468
1.595GlnSer: 1.595 ± 0.272
2.144GlnThr: 2.144 ± 0.27
2.642GlnVal: 2.642 ± 0.416
0.897GlnTrp: 0.897 ± 0.258
0.748GlnTyr: 0.748 ± 0.181
0.0GlnXaa: 0.0 ± 0.0
Arg
7.279ArgAla: 7.279 ± 0.727
0.748ArgCys: 0.748 ± 0.246
3.59ArgAsp: 3.59 ± 0.466
4.238ArgGlu: 4.238 ± 0.513
2.293ArgPhe: 2.293 ± 0.312
4.986ArgGly: 4.986 ± 0.627
1.396ArgHis: 1.396 ± 0.275
4.338ArgIle: 4.338 ± 0.447
2.443ArgLys: 2.443 ± 0.366
4.886ArgLeu: 4.886 ± 0.444
2.742ArgMet: 2.742 ± 0.348
2.244ArgAsn: 2.244 ± 0.367
2.792ArgPro: 2.792 ± 0.447
2.244ArgGln: 2.244 ± 0.35
6.282ArgArg: 6.282 ± 0.722
3.889ArgSer: 3.889 ± 0.444
4.437ArgThr: 4.437 ± 0.505
5.135ArgVal: 5.135 ± 0.659
1.296ArgTrp: 1.296 ± 0.235
2.443ArgTyr: 2.443 ± 0.363
0.0ArgXaa: 0.0 ± 0.0
Ser
7.229SerAla: 7.229 ± 0.791
0.548SerCys: 0.548 ± 0.161
3.54SerAsp: 3.54 ± 0.375
4.786SerGlu: 4.786 ± 0.476
1.745SerPhe: 1.745 ± 0.261
4.786SerGly: 4.786 ± 0.608
1.047SerHis: 1.047 ± 0.189
3.64SerIle: 3.64 ± 0.382
1.496SerLys: 1.496 ± 0.269
4.587SerLeu: 4.587 ± 0.568
1.147SerMet: 1.147 ± 0.22
1.396SerAsn: 1.396 ± 0.318
2.892SerPro: 2.892 ± 0.473
1.396SerGln: 1.396 ± 0.264
3.39SerArg: 3.39 ± 0.355
3.191SerSer: 3.191 ± 0.612
4.737SerThr: 4.737 ± 0.534
4.138SerVal: 4.138 ± 0.531
0.748SerTrp: 0.748 ± 0.194
1.446SerTyr: 1.446 ± 0.356
0.0SerXaa: 0.0 ± 0.0
Thr
8.077ThrAla: 8.077 ± 0.813
0.399ThrCys: 0.399 ± 0.131
4.737ThrAsp: 4.737 ± 0.507
4.587ThrGlu: 4.587 ± 0.444
2.293ThrPhe: 2.293 ± 0.48
5.435ThrGly: 5.435 ± 0.588
1.147ThrHis: 1.147 ± 0.255
3.64ThrIle: 3.64 ± 0.69
2.044ThrLys: 2.044 ± 0.308
5.883ThrLeu: 5.883 ± 0.527
1.645ThrMet: 1.645 ± 0.27
2.144ThrAsn: 2.144 ± 0.429
5.435ThrPro: 5.435 ± 0.993
2.094ThrGln: 2.094 ± 0.308
3.64ThrArg: 3.64 ± 0.585
4.387ThrSer: 4.387 ± 0.784
5.335ThrThr: 5.335 ± 0.761
6.033ThrVal: 6.033 ± 0.518
1.446ThrTrp: 1.446 ± 0.244
1.745ThrTyr: 1.745 ± 0.361
0.0ThrXaa: 0.0 ± 0.0
Val
7.03ValAla: 7.03 ± 0.638
0.698ValCys: 0.698 ± 0.183
5.833ValAsp: 5.833 ± 0.489
4.487ValGlu: 4.487 ± 0.536
1.944ValPhe: 1.944 ± 0.333
7.229ValGly: 7.229 ± 0.689
1.595ValHis: 1.595 ± 0.376
4.437ValIle: 4.437 ± 0.468
1.695ValLys: 1.695 ± 0.32
4.737ValLeu: 4.737 ± 0.566
1.795ValMet: 1.795 ± 0.276
2.443ValAsn: 2.443 ± 0.302
4.038ValPro: 4.038 ± 1.184
1.795ValGln: 1.795 ± 0.317
5.634ValArg: 5.634 ± 0.512
4.038ValSer: 4.038 ± 0.404
5.335ValThr: 5.335 ± 0.526
5.235ValVal: 5.235 ± 0.524
1.795ValTrp: 1.795 ± 0.302
1.645ValTyr: 1.645 ± 0.259
0.0ValXaa: 0.0 ± 0.0
Trp
1.496TrpAla: 1.496 ± 0.321
0.249TrpCys: 0.249 ± 0.109
1.097TrpAsp: 1.097 ± 0.21
1.346TrpGlu: 1.346 ± 0.313
0.848TrpPhe: 0.848 ± 0.218
1.396TrpGly: 1.396 ± 0.223
0.199TrpHis: 0.199 ± 0.101
1.147TrpIle: 1.147 ± 0.23
0.648TrpLys: 0.648 ± 0.187
1.446TrpLeu: 1.446 ± 0.303
0.249TrpMet: 0.249 ± 0.097
0.499TrpAsn: 0.499 ± 0.143
0.848TrpPro: 0.848 ± 0.206
0.848TrpGln: 0.848 ± 0.198
1.496TrpArg: 1.496 ± 0.405
1.197TrpSer: 1.197 ± 0.252
1.895TrpThr: 1.895 ± 0.317
1.695TrpVal: 1.695 ± 0.277
0.548TrpTrp: 0.548 ± 0.231
0.299TrpTyr: 0.299 ± 0.092
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.742TyrAla: 2.742 ± 0.399
0.15TyrCys: 0.15 ± 0.088
1.745TyrAsp: 1.745 ± 0.316
1.645TyrGlu: 1.645 ± 0.303
0.648TyrPhe: 0.648 ± 0.184
2.892TyrGly: 2.892 ± 0.397
0.199TyrHis: 0.199 ± 0.102
0.997TyrIle: 0.997 ± 0.218
0.449TyrLys: 0.449 ± 0.168
1.496TyrLeu: 1.496 ± 0.244
0.299TyrMet: 0.299 ± 0.124
0.698TyrAsn: 0.698 ± 0.195
1.595TyrPro: 1.595 ± 0.317
0.798TyrGln: 0.798 ± 0.201
1.994TyrArg: 1.994 ± 0.402
1.396TyrSer: 1.396 ± 0.27
1.695TyrThr: 1.695 ± 0.274
2.144TyrVal: 2.144 ± 0.614
0.598TyrTrp: 0.598 ± 0.232
0.399TyrTyr: 0.399 ± 0.224
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 106 proteins (20058 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski