Amino acid dipepetide frequency for Microbacterium phage Pherbot

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.568AlaAla: 11.568 ± 1.341
0.703AlaCys: 0.703 ± 0.229
6.409AlaAsp: 6.409 ± 0.788
5.94AlaGlu: 5.94 ± 0.725
2.892AlaPhe: 2.892 ± 0.505
8.676AlaGly: 8.676 ± 1.018
1.954AlaHis: 1.954 ± 0.406
4.377AlaIle: 4.377 ± 0.767
6.018AlaLys: 6.018 ± 1.161
11.021AlaLeu: 11.021 ± 1.29
2.345AlaMet: 2.345 ± 0.333
2.579AlaAsn: 2.579 ± 0.38
3.83AlaPro: 3.83 ± 0.59
3.674AlaGln: 3.674 ± 0.503
6.175AlaArg: 6.175 ± 0.743
4.846AlaSer: 4.846 ± 0.63
6.878AlaThr: 6.878 ± 0.67
6.956AlaVal: 6.956 ± 0.892
1.798AlaTrp: 1.798 ± 0.399
2.892AlaTyr: 2.892 ± 0.664
0.0AlaXaa: 0.0 ± 0.0
Cys
0.547CysAla: 0.547 ± 0.241
0.0CysCys: 0.0 ± 0.0
0.391CysAsp: 0.391 ± 0.172
0.469CysGlu: 0.469 ± 0.185
0.156CysPhe: 0.156 ± 0.107
0.469CysGly: 0.469 ± 0.196
0.234CysHis: 0.234 ± 0.129
0.156CysIle: 0.156 ± 0.125
0.469CysLys: 0.469 ± 0.166
0.156CysLeu: 0.156 ± 0.112
0.0CysMet: 0.0 ± 0.0
0.078CysAsn: 0.078 ± 0.083
0.547CysPro: 0.547 ± 0.214
0.156CysGln: 0.156 ± 0.105
0.469CysArg: 0.469 ± 0.239
0.547CysSer: 0.547 ± 0.244
0.625CysThr: 0.625 ± 0.198
0.547CysVal: 0.547 ± 0.215
0.234CysTrp: 0.234 ± 0.117
0.234CysTyr: 0.234 ± 0.126
0.0CysXaa: 0.0 ± 0.0
Asp
4.846AspAla: 4.846 ± 0.651
0.547AspCys: 0.547 ± 0.191
4.69AspAsp: 4.69 ± 0.88
4.299AspGlu: 4.299 ± 1.09
2.657AspPhe: 2.657 ± 0.438
4.377AspGly: 4.377 ± 0.652
1.563AspHis: 1.563 ± 0.371
3.674AspIle: 3.674 ± 0.464
2.579AspLys: 2.579 ± 0.476
5.628AspLeu: 5.628 ± 0.532
1.485AspMet: 1.485 ± 0.338
1.329AspAsn: 1.329 ± 0.304
5.237AspPro: 5.237 ± 0.582
2.189AspGln: 2.189 ± 0.401
3.517AspArg: 3.517 ± 0.52
4.143AspSer: 4.143 ± 0.608
3.126AspThr: 3.126 ± 0.544
4.846AspVal: 4.846 ± 0.576
1.641AspTrp: 1.641 ± 0.381
2.423AspTyr: 2.423 ± 0.365
0.0AspXaa: 0.0 ± 0.0
Glu
7.894GluAla: 7.894 ± 0.717
0.469GluCys: 0.469 ± 0.229
5.237GluAsp: 5.237 ± 1.057
5.081GluGlu: 5.081 ± 1.238
1.329GluPhe: 1.329 ± 0.364
3.048GluGly: 3.048 ± 0.486
0.782GluHis: 0.782 ± 0.277
1.798GluIle: 1.798 ± 0.375
1.798GluLys: 1.798 ± 0.463
5.862GluLeu: 5.862 ± 0.705
1.407GluMet: 1.407 ± 0.347
2.11GluAsn: 2.11 ± 0.381
1.485GluPro: 1.485 ± 0.377
2.501GluGln: 2.501 ± 0.442
4.064GluArg: 4.064 ± 0.68
2.501GluSer: 2.501 ± 0.446
4.299GluThr: 4.299 ± 0.469
4.69GluVal: 4.69 ± 0.677
1.954GluTrp: 1.954 ± 0.413
1.876GluTyr: 1.876 ± 0.326
0.0GluXaa: 0.0 ± 0.0
Phe
2.814PheAla: 2.814 ± 0.427
0.234PheCys: 0.234 ± 0.13
1.876PheAsp: 1.876 ± 0.357
1.485PheGlu: 1.485 ± 0.308
0.782PhePhe: 0.782 ± 0.279
2.736PheGly: 2.736 ± 0.429
0.391PheHis: 0.391 ± 0.175
0.938PheIle: 0.938 ± 0.267
1.72PheLys: 1.72 ± 0.308
2.189PheLeu: 2.189 ± 0.433
0.703PheMet: 0.703 ± 0.3
1.485PheAsn: 1.485 ± 0.271
1.407PhePro: 1.407 ± 0.288
1.172PheGln: 1.172 ± 0.342
2.032PheArg: 2.032 ± 0.303
2.423PheSer: 2.423 ± 0.373
2.267PheThr: 2.267 ± 0.451
1.72PheVal: 1.72 ± 0.303
0.469PheTrp: 0.469 ± 0.245
0.782PheTyr: 0.782 ± 0.253
0.0PheXaa: 0.0 ± 0.0
Gly
6.487GlyAla: 6.487 ± 0.951
0.782GlyCys: 0.782 ± 0.259
5.549GlyAsp: 5.549 ± 0.615
4.377GlyGlu: 4.377 ± 0.642
2.579GlyPhe: 2.579 ± 0.45
7.113GlyGly: 7.113 ± 1.356
1.407GlyHis: 1.407 ± 0.425
4.299GlyIle: 4.299 ± 0.827
4.455GlyLys: 4.455 ± 0.756
6.487GlyLeu: 6.487 ± 0.745
1.72GlyMet: 1.72 ± 0.281
2.892GlyAsn: 2.892 ± 0.463
2.814GlyPro: 2.814 ± 0.426
4.455GlyGln: 4.455 ± 0.665
4.377GlyArg: 4.377 ± 0.51
6.018GlySer: 6.018 ± 0.824
7.035GlyThr: 7.035 ± 0.888
6.018GlyVal: 6.018 ± 0.806
1.251GlyTrp: 1.251 ± 0.262
2.345GlyTyr: 2.345 ± 0.442
0.0GlyXaa: 0.0 ± 0.0
His
1.485HisAla: 1.485 ± 0.351
0.0HisCys: 0.0 ± 0.0
0.938HisAsp: 0.938 ± 0.285
1.563HisGlu: 1.563 ± 0.398
0.625HisPhe: 0.625 ± 0.223
1.876HisGly: 1.876 ± 0.447
0.469HisHis: 0.469 ± 0.221
1.016HisIle: 1.016 ± 0.358
1.329HisLys: 1.329 ± 0.362
1.329HisLeu: 1.329 ± 0.374
0.313HisMet: 0.313 ± 0.19
0.625HisAsn: 0.625 ± 0.204
0.938HisPro: 0.938 ± 0.264
0.703HisGln: 0.703 ± 0.205
0.86HisArg: 0.86 ± 0.324
0.469HisSer: 0.469 ± 0.182
1.251HisThr: 1.251 ± 0.379
1.407HisVal: 1.407 ± 0.354
0.469HisTrp: 0.469 ± 0.14
1.094HisTyr: 1.094 ± 0.261
0.0HisXaa: 0.0 ± 0.0
Ile
4.299IleAla: 4.299 ± 0.529
0.391IleCys: 0.391 ± 0.164
3.439IleAsp: 3.439 ± 0.462
2.97IleGlu: 2.97 ± 0.56
0.86IlePhe: 0.86 ± 0.274
3.517IleGly: 3.517 ± 0.821
1.094IleHis: 1.094 ± 0.282
2.579IleIle: 2.579 ± 0.822
2.189IleLys: 2.189 ± 0.445
3.126IleLeu: 3.126 ± 0.479
1.251IleMet: 1.251 ± 0.309
1.954IleAsn: 1.954 ± 0.423
2.032IlePro: 2.032 ± 0.592
2.189IleGln: 2.189 ± 0.548
2.97IleArg: 2.97 ± 0.437
2.423IleSer: 2.423 ± 0.527
4.377IleThr: 4.377 ± 0.827
3.283IleVal: 3.283 ± 0.634
0.547IleTrp: 0.547 ± 0.195
1.251IleTyr: 1.251 ± 0.301
0.0IleXaa: 0.0 ± 0.0
Lys
5.784LysAla: 5.784 ± 0.942
0.156LysCys: 0.156 ± 0.108
2.032LysAsp: 2.032 ± 0.414
2.892LysGlu: 2.892 ± 0.506
1.407LysPhe: 1.407 ± 0.309
4.377LysGly: 4.377 ± 0.662
1.016LysHis: 1.016 ± 0.308
2.032LysIle: 2.032 ± 0.386
2.501LysLys: 2.501 ± 0.596
4.299LysLeu: 4.299 ± 0.666
1.563LysMet: 1.563 ± 0.343
0.938LysAsn: 0.938 ± 0.274
3.205LysPro: 3.205 ± 0.675
1.329LysGln: 1.329 ± 0.26
2.97LysArg: 2.97 ± 0.524
1.485LysSer: 1.485 ± 0.436
3.205LysThr: 3.205 ± 0.547
3.674LysVal: 3.674 ± 0.535
0.86LysTrp: 0.86 ± 0.276
0.703LysTyr: 0.703 ± 0.29
0.0LysXaa: 0.0 ± 0.0
Leu
9.379LeuAla: 9.379 ± 0.857
0.547LeuCys: 0.547 ± 0.267
4.768LeuAsp: 4.768 ± 0.625
4.612LeuGlu: 4.612 ± 0.601
1.641LeuPhe: 1.641 ± 0.306
8.129LeuGly: 8.129 ± 0.834
1.72LeuHis: 1.72 ± 0.358
3.986LeuIle: 3.986 ± 0.899
4.299LeuLys: 4.299 ± 0.58
7.816LeuLeu: 7.816 ± 0.846
1.798LeuMet: 1.798 ± 0.339
2.814LeuAsn: 2.814 ± 0.509
5.159LeuPro: 5.159 ± 0.705
2.736LeuGln: 2.736 ± 0.369
4.533LeuArg: 4.533 ± 0.641
4.846LeuSer: 4.846 ± 0.556
5.393LeuThr: 5.393 ± 0.637
7.035LeuVal: 7.035 ± 0.982
1.016LeuTrp: 1.016 ± 0.253
2.267LeuTyr: 2.267 ± 0.379
0.0LeuXaa: 0.0 ± 0.0
Met
3.126MetAla: 3.126 ± 0.388
0.234MetCys: 0.234 ± 0.137
1.641MetAsp: 1.641 ± 0.37
0.782MetGlu: 0.782 ± 0.315
0.625MetPhe: 0.625 ± 0.224
1.485MetGly: 1.485 ± 0.333
0.313MetHis: 0.313 ± 0.153
1.172MetIle: 1.172 ± 0.334
1.094MetLys: 1.094 ± 0.253
1.954MetLeu: 1.954 ± 0.338
0.703MetMet: 0.703 ± 0.249
1.016MetAsn: 1.016 ± 0.274
1.016MetPro: 1.016 ± 0.248
1.329MetGln: 1.329 ± 0.28
1.016MetArg: 1.016 ± 0.228
2.267MetSer: 2.267 ± 0.366
2.189MetThr: 2.189 ± 0.416
1.094MetVal: 1.094 ± 0.323
0.078MetTrp: 0.078 ± 0.068
0.234MetTyr: 0.234 ± 0.143
0.0MetXaa: 0.0 ± 0.0
Asn
2.579AsnAla: 2.579 ± 0.449
0.0AsnCys: 0.0 ± 0.0
2.11AsnAsp: 2.11 ± 0.466
2.11AsnGlu: 2.11 ± 0.376
0.703AsnPhe: 0.703 ± 0.208
3.126AsnGly: 3.126 ± 0.462
0.313AsnHis: 0.313 ± 0.156
1.407AsnIle: 1.407 ± 0.532
1.485AsnLys: 1.485 ± 0.352
2.736AsnLeu: 2.736 ± 0.409
0.547AsnMet: 0.547 ± 0.165
1.251AsnAsn: 1.251 ± 0.317
2.032AsnPro: 2.032 ± 0.36
1.172AsnGln: 1.172 ± 0.238
1.641AsnArg: 1.641 ± 0.4
2.032AsnSer: 2.032 ± 0.435
2.423AsnThr: 2.423 ± 0.396
2.267AsnVal: 2.267 ± 0.392
1.094AsnTrp: 1.094 ± 0.294
0.86AsnTyr: 0.86 ± 0.268
0.0AsnXaa: 0.0 ± 0.0
Pro
4.924ProAla: 4.924 ± 0.799
0.234ProCys: 0.234 ± 0.185
3.283ProAsp: 3.283 ± 0.519
2.736ProGlu: 2.736 ± 0.427
1.329ProPhe: 1.329 ± 0.259
4.299ProGly: 4.299 ± 0.577
0.86ProHis: 0.86 ± 0.256
1.876ProIle: 1.876 ± 0.34
2.267ProLys: 2.267 ± 0.401
3.439ProLeu: 3.439 ± 0.526
1.016ProMet: 1.016 ± 0.274
1.72ProAsn: 1.72 ± 0.375
1.485ProPro: 1.485 ± 0.328
2.501ProGln: 2.501 ± 0.38
2.501ProArg: 2.501 ± 0.746
3.595ProSer: 3.595 ± 0.572
3.361ProThr: 3.361 ± 0.564
4.455ProVal: 4.455 ± 0.57
0.86ProTrp: 0.86 ± 0.218
1.094ProTyr: 1.094 ± 0.284
0.0ProXaa: 0.0 ± 0.0
Gln
5.315GlnAla: 5.315 ± 0.85
0.078GlnCys: 0.078 ± 0.073
2.501GlnAsp: 2.501 ± 0.464
2.814GlnGlu: 2.814 ± 0.424
1.016GlnPhe: 1.016 ± 0.348
3.048GlnGly: 3.048 ± 0.511
0.938GlnHis: 0.938 ± 0.228
1.954GlnIle: 1.954 ± 0.398
1.251GlnLys: 1.251 ± 0.329
3.205GlnLeu: 3.205 ± 0.378
0.782GlnMet: 0.782 ± 0.197
1.485GlnAsn: 1.485 ± 0.408
1.798GlnPro: 1.798 ± 0.356
2.267GlnGln: 2.267 ± 0.458
3.126GlnArg: 3.126 ± 0.467
2.267GlnSer: 2.267 ± 0.388
2.501GlnThr: 2.501 ± 0.39
3.986GlnVal: 3.986 ± 0.493
0.86GlnTrp: 0.86 ± 0.268
1.016GlnTyr: 1.016 ± 0.232
0.0GlnXaa: 0.0 ± 0.0
Arg
5.002ArgAla: 5.002 ± 0.631
0.469ArgCys: 0.469 ± 0.219
3.752ArgAsp: 3.752 ± 0.496
2.97ArgGlu: 2.97 ± 0.452
2.267ArgPhe: 2.267 ± 0.394
4.299ArgGly: 4.299 ± 0.518
0.782ArgHis: 0.782 ± 0.264
3.283ArgIle: 3.283 ± 0.464
3.283ArgLys: 3.283 ± 0.683
6.097ArgLeu: 6.097 ± 0.705
1.485ArgMet: 1.485 ± 0.276
1.72ArgAsn: 1.72 ± 0.369
2.579ArgPro: 2.579 ± 0.565
2.189ArgGln: 2.189 ± 0.395
3.517ArgArg: 3.517 ± 0.682
3.126ArgSer: 3.126 ± 0.494
3.283ArgThr: 3.283 ± 0.748
5.471ArgVal: 5.471 ± 0.84
1.485ArgTrp: 1.485 ± 0.347
1.407ArgTyr: 1.407 ± 0.37
0.0ArgXaa: 0.0 ± 0.0
Ser
5.784SerAla: 5.784 ± 0.752
0.234SerCys: 0.234 ± 0.125
3.986SerAsp: 3.986 ± 0.534
2.501SerGlu: 2.501 ± 0.426
2.501SerPhe: 2.501 ± 0.443
5.549SerGly: 5.549 ± 0.718
1.407SerHis: 1.407 ± 0.396
2.267SerIle: 2.267 ± 0.49
2.892SerLys: 2.892 ± 0.478
3.595SerLeu: 3.595 ± 0.62
2.423SerMet: 2.423 ± 0.519
1.72SerAsn: 1.72 ± 0.31
2.423SerPro: 2.423 ± 0.406
2.892SerGln: 2.892 ± 0.543
3.439SerArg: 3.439 ± 0.465
4.221SerSer: 4.221 ± 0.522
3.595SerThr: 3.595 ± 0.528
4.143SerVal: 4.143 ± 0.532
1.016SerTrp: 1.016 ± 0.225
2.032SerTyr: 2.032 ± 0.403
0.0SerXaa: 0.0 ± 0.0
Thr
7.191ThrAla: 7.191 ± 1.063
0.469ThrCys: 0.469 ± 0.208
4.143ThrAsp: 4.143 ± 0.606
3.83ThrGlu: 3.83 ± 0.671
2.892ThrPhe: 2.892 ± 0.479
6.487ThrGly: 6.487 ± 0.624
0.938ThrHis: 0.938 ± 0.273
2.657ThrIle: 2.657 ± 0.483
2.97ThrLys: 2.97 ± 0.568
5.471ThrLeu: 5.471 ± 0.668
1.251ThrMet: 1.251 ± 0.382
1.641ThrAsn: 1.641 ± 0.321
3.986ThrPro: 3.986 ± 0.625
2.501ThrGln: 2.501 ± 0.441
3.752ThrArg: 3.752 ± 0.541
4.455ThrSer: 4.455 ± 0.63
4.143ThrThr: 4.143 ± 0.569
6.175ThrVal: 6.175 ± 0.77
1.798ThrTrp: 1.798 ± 0.412
2.345ThrTyr: 2.345 ± 0.385
0.0ThrXaa: 0.0 ± 0.0
Val
8.129ValAla: 8.129 ± 0.787
0.313ValCys: 0.313 ± 0.142
4.846ValAsp: 4.846 ± 0.601
5.002ValGlu: 5.002 ± 0.782
1.641ValPhe: 1.641 ± 0.372
4.846ValGly: 4.846 ± 0.898
1.329ValHis: 1.329 ± 0.378
4.533ValIle: 4.533 ± 0.601
2.892ValLys: 2.892 ± 0.476
6.175ValLeu: 6.175 ± 0.725
1.641ValMet: 1.641 ± 0.434
2.657ValAsn: 2.657 ± 0.456
3.986ValPro: 3.986 ± 0.595
3.83ValGln: 3.83 ± 0.564
4.377ValArg: 4.377 ± 0.589
4.455ValSer: 4.455 ± 0.745
6.175ValThr: 6.175 ± 0.764
5.862ValVal: 5.862 ± 0.93
1.876ValTrp: 1.876 ± 0.377
2.501ValTyr: 2.501 ± 0.488
0.0ValXaa: 0.0 ± 0.0
Trp
1.72TrpAla: 1.72 ± 0.356
0.156TrpCys: 0.156 ± 0.123
1.251TrpAsp: 1.251 ± 0.248
1.172TrpGlu: 1.172 ± 0.279
0.86TrpPhe: 0.86 ± 0.232
1.563TrpGly: 1.563 ± 0.398
0.938TrpHis: 0.938 ± 0.306
1.251TrpIle: 1.251 ± 0.309
0.703TrpLys: 0.703 ± 0.197
2.11TrpLeu: 2.11 ± 0.371
0.156TrpMet: 0.156 ± 0.109
0.703TrpAsn: 0.703 ± 0.206
0.86TrpPro: 0.86 ± 0.291
1.016TrpGln: 1.016 ± 0.269
1.094TrpArg: 1.094 ± 0.311
0.625TrpSer: 0.625 ± 0.239
1.563TrpThr: 1.563 ± 0.271
1.407TrpVal: 1.407 ± 0.327
0.782TrpTrp: 0.782 ± 0.286
0.86TrpTyr: 0.86 ± 0.245
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.657TyrAla: 2.657 ± 0.387
0.469TyrCys: 0.469 ± 0.23
2.032TyrAsp: 2.032 ± 0.398
2.579TyrGlu: 2.579 ± 0.464
0.86TyrPhe: 0.86 ± 0.205
3.439TyrGly: 3.439 ± 0.632
0.313TyrHis: 0.313 ± 0.182
1.485TyrIle: 1.485 ± 0.302
0.234TyrLys: 0.234 ± 0.133
1.798TyrLeu: 1.798 ± 0.398
0.703TyrMet: 0.703 ± 0.293
1.172TyrAsn: 1.172 ± 0.326
1.016TyrPro: 1.016 ± 0.261
1.407TyrGln: 1.407 ± 0.296
2.032TyrArg: 2.032 ± 0.448
1.876TyrSer: 1.876 ± 0.362
1.407TyrThr: 1.407 ± 0.444
2.032TyrVal: 2.032 ± 0.475
0.703TyrTrp: 0.703 ± 0.226
0.938TyrTyr: 0.938 ± 0.364
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (12795 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski