Amino acid dipeptide frequency for Microbacterium phage Bernstein

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs in the proteome.
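As an illustration, the counting step can be sketched in a few lines of Python. This is a minimal sketch and not the code used to build the table: the function name `dipeptide_permille` is hypothetical, sequences use one-letter codes rather than the three-letter codes shown in the table, pairs are counted within each protein only, and frequencies are normalised by the total number of pairs counted. The database may normalise differently.

```python
from collections import Counter

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not taken from the source): overlapping pairs are counted
    within each protein only, and the total number of counted pairs is the
    denominator.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the Bernstein proteome.
toy = ["MAAG", "MAG"]
freqs = dipeptide_permille(toy)
```

For the toy input there are five pairs in total (MA, AA, AG from the first sequence; MA, AG from the second), so the resulting frequencies sum to 1000‰.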

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
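The expected value and the over/under-representation labels can be reproduced with a short Python sketch. The function name `classify` and the use of the 400-dipeptide expectation as the colouring threshold are assumptions made for illustration; the source does not state the exact threshold used for the colours.

```python
# Expected per-mille frequency if every dipeptide were equally likely.
EXPECTED_20 = 1000.0 / (20 * 20)   # 2.5 per mille, i.e. 0.25%
EXPECTED_21 = 1000.0 / (21 * 21)   # about 2.268 per mille when Xaa is included

def classify(freq_permille, expected=EXPECTED_20):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    if freq_permille > expected:
        return "over"    # would be marked in red
    if freq_permille < expected:
        return "under"   # would be marked in blue
    return "expected"

# Values copied from the table on this page.
examples = {"AlaAla": 13.196, "GlyAsp": 6.677, "CysCys": 0.0, "TrpTrp": 0.318}
labels = {pair: classify(value) for pair, value in examples.items()}
```

Under these assumptions AlaAla and GlyAsp come out as overrepresented, while CysCys and TrpTrp come out as underrepresented.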

All values are given in per mille (‰) and must therefore be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Entries are grouped by first residue and listed as dipeptide: frequency ± error (‰). Second residues appear in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 13.196 ± 1.354
AlaCys: 0.318 ± 0.163
AlaAsp: 8.108 ± 0.823
AlaGlu: 7.154 ± 0.833
AlaPhe: 3.498 ± 0.51
AlaGly: 12.401 ± 0.704
AlaHis: 1.749 ± 0.403
AlaIle: 6.598 ± 1.104
AlaLys: 5.723 ± 0.863
AlaLeu: 9.618 ± 0.844
AlaMet: 2.385 ± 0.437
AlaAsn: 3.259 ± 0.506
AlaPro: 4.769 ± 0.435
AlaGln: 4.69 ± 0.607
AlaArg: 5.803 ± 0.606
AlaSer: 5.167 ± 0.689
AlaThr: 8.188 ± 0.924
AlaVal: 7.87 ± 0.956
AlaTrp: 2.226 ± 0.409
AlaTyr: 2.703 ± 0.454
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.318 ± 0.137
CysCys: 0.0 ± 0.0
CysAsp: 0.079 ± 0.075
CysGlu: 0.079 ± 0.078
CysPhe: 0.477 ± 0.167
CysGly: 0.477 ± 0.245
CysHis: 0.079 ± 0.088
CysIle: 0.079 ± 0.094
CysLys: 0.318 ± 0.153
CysLeu: 0.079 ± 0.08
CysMet: 0.079 ± 0.074
CysAsn: 0.159 ± 0.133
CysPro: 0.715 ± 0.267
CysGln: 0.159 ± 0.101
CysArg: 0.397 ± 0.18
CysSer: 0.397 ± 0.174
CysThr: 0.238 ± 0.155
CysVal: 0.318 ± 0.166
CysTrp: 0.079 ± 0.078
CysTyr: 0.238 ± 0.161
CysXaa: 0.0 ± 0.0

Asp
AspAla: 7.075 ± 0.546
AspCys: 0.079 ± 0.085
AspAsp: 3.577 ± 0.48
AspGlu: 3.975 ± 0.563
AspPhe: 3.418 ± 0.496
AspGly: 6.518 ± 0.71
AspHis: 1.908 ± 0.57
AspIle: 2.703 ± 0.409
AspLys: 1.59 ± 0.317
AspLeu: 7.472 ± 0.983
AspMet: 1.192 ± 0.318
AspAsn: 1.908 ± 0.408
AspPro: 4.134 ± 0.651
AspGln: 1.669 ± 0.32
AspArg: 3.18 ± 0.527
AspSer: 3.975 ± 0.646
AspThr: 3.18 ± 0.57
AspVal: 3.975 ± 0.598
AspTrp: 0.874 ± 0.22
AspTyr: 2.703 ± 0.382
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.234 ± 0.858
GluCys: 0.238 ± 0.119
GluAsp: 4.452 ± 0.671
GluGlu: 4.372 ± 0.606
GluPhe: 2.941 ± 0.504
GluGly: 4.69 ± 0.672
GluHis: 1.272 ± 0.325
GluIle: 0.874 ± 0.229
GluLys: 1.987 ± 0.429
GluLeu: 7.393 ± 0.748
GluMet: 1.033 ± 0.255
GluAsn: 1.987 ± 0.372
GluPro: 2.703 ± 0.54
GluGln: 2.464 ± 0.407
GluArg: 2.941 ± 0.532
GluSer: 2.146 ± 0.382
GluThr: 3.339 ± 0.433
GluVal: 5.962 ± 0.664
GluTrp: 1.51 ± 0.383
GluTyr: 1.033 ± 0.304
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.941 ± 0.407
PheCys: 0.0 ± 0.0
PheAsp: 2.067 ± 0.369
PheGlu: 2.146 ± 0.336
PhePhe: 1.033 ± 0.412
PheGly: 2.703 ± 0.601
PheHis: 0.556 ± 0.214
PheIle: 1.669 ± 0.398
PheLys: 1.272 ± 0.254
PheLeu: 2.862 ± 0.502
PheMet: 1.033 ± 0.307
PheAsn: 1.51 ± 0.344
PhePro: 1.908 ± 0.386
PheGln: 1.192 ± 0.291
PheArg: 3.1 ± 0.47
PheSer: 2.385 ± 0.328
PheThr: 2.703 ± 0.528
PheVal: 2.226 ± 0.342
PheTrp: 0.477 ± 0.212
PheTyr: 0.715 ± 0.24
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 8.347 ± 0.845
GlyCys: 0.715 ± 0.317
GlyAsp: 6.677 ± 0.663
GlyGlu: 5.246 ± 0.758
GlyPhe: 3.021 ± 0.517
GlyGly: 5.882 ± 0.617
GlyHis: 1.59 ± 0.306
GlyIle: 5.008 ± 0.834
GlyLys: 3.577 ± 0.544
GlyLeu: 6.041 ± 0.751
GlyMet: 2.146 ± 0.403
GlyAsn: 3.021 ± 0.541
GlyPro: 3.816 ± 0.73
GlyGln: 4.293 ± 0.8
GlyArg: 4.61 ± 0.648
GlySer: 4.452 ± 0.589
GlyThr: 5.723 ± 0.659
GlyVal: 6.439 ± 0.859
GlyTrp: 2.067 ± 0.399
GlyTyr: 3.498 ± 0.479
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.146 ± 0.487
HisCys: 0.159 ± 0.129
HisAsp: 1.749 ± 0.258
HisGlu: 1.113 ± 0.295
HisPhe: 1.192 ± 0.285
HisGly: 1.908 ± 0.421
HisHis: 0.715 ± 0.244
HisIle: 0.795 ± 0.238
HisLys: 0.397 ± 0.176
HisLeu: 1.669 ± 0.409
HisMet: 0.636 ± 0.175
HisAsn: 0.636 ± 0.214
HisPro: 0.715 ± 0.268
HisGln: 0.477 ± 0.178
HisArg: 0.954 ± 0.315
HisSer: 0.636 ± 0.22
HisThr: 1.272 ± 0.288
HisVal: 1.59 ± 0.355
HisTrp: 0.477 ± 0.249
HisTyr: 0.397 ± 0.164
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.2 ± 1.035
IleCys: 0.159 ± 0.104
IleAsp: 2.703 ± 0.492
IleGlu: 3.259 ± 0.436
IlePhe: 0.715 ± 0.223
IleGly: 3.1 ± 0.532
IleHis: 1.192 ± 0.336
IleIle: 1.749 ± 0.385
IleLys: 2.146 ± 0.383
IleLeu: 3.657 ± 0.503
IleMet: 0.795 ± 0.284
IleAsn: 2.226 ± 0.445
IlePro: 2.067 ± 0.426
IleGln: 1.59 ± 0.356
IleArg: 3.498 ± 0.498
IleSer: 2.305 ± 0.374
IleThr: 3.418 ± 0.512
IleVal: 3.736 ± 0.802
IleTrp: 0.477 ± 0.226
IleTyr: 1.431 ± 0.334
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.359 ± 0.843
LysCys: 0.556 ± 0.2
LysAsp: 1.749 ± 0.372
LysGlu: 2.385 ± 0.421
LysPhe: 0.874 ± 0.254
LysGly: 3.498 ± 0.466
LysHis: 0.318 ± 0.163
LysIle: 1.272 ± 0.335
LysLys: 2.941 ± 0.558
LysLeu: 3.259 ± 0.615
LysMet: 1.033 ± 0.262
LysAsn: 1.351 ± 0.338
LysPro: 2.941 ± 0.553
LysGln: 1.669 ± 0.321
LysArg: 3.18 ± 0.506
LysSer: 2.385 ± 0.542
LysThr: 2.544 ± 0.459
LysVal: 3.1 ± 0.519
LysTrp: 0.318 ± 0.175
LysTyr: 0.715 ± 0.209
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 9.777 ± 0.913
LeuCys: 0.238 ± 0.12
LeuAsp: 5.723 ± 0.818
LeuGlu: 6.041 ± 0.777
LeuPhe: 2.862 ± 0.431
LeuGly: 6.439 ± 0.586
LeuHis: 1.908 ± 0.431
LeuIle: 4.293 ± 0.55
LeuLys: 3.498 ± 0.569
LeuLeu: 6.2 ± 0.685
LeuMet: 1.749 ± 0.29
LeuAsn: 3.021 ± 0.578
LeuPro: 5.087 ± 0.637
LeuGln: 2.862 ± 0.461
LeuArg: 5.723 ± 0.652
LeuSer: 4.61 ± 0.421
LeuThr: 5.803 ± 0.492
LeuVal: 6.757 ± 0.72
LeuTrp: 1.351 ± 0.375
LeuTyr: 1.51 ± 0.383
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.418 ± 0.532
MetCys: 0.0 ± 0.0
MetAsp: 1.59 ± 0.295
MetGlu: 0.874 ± 0.249
MetPhe: 0.079 ± 0.071
MetGly: 1.272 ± 0.288
MetHis: 0.079 ± 0.079
MetIle: 0.477 ± 0.183
MetLys: 0.795 ± 0.211
MetLeu: 1.113 ± 0.336
MetMet: 0.238 ± 0.128
MetAsn: 1.51 ± 0.352
MetPro: 1.192 ± 0.316
MetGln: 0.874 ± 0.216
MetArg: 1.272 ± 0.32
MetSer: 1.272 ± 0.423
MetThr: 3.1 ± 0.495
MetVal: 1.192 ± 0.308
MetTrp: 0.318 ± 0.174
MetTyr: 0.397 ± 0.192
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.418 ± 0.6
AsnCys: 0.318 ± 0.139
AsnAsp: 1.272 ± 0.253
AsnGlu: 1.192 ± 0.261
AsnPhe: 1.033 ± 0.262
AsnGly: 4.69 ± 0.977
AsnHis: 0.874 ± 0.302
AsnIle: 1.113 ± 0.247
AsnLys: 2.067 ± 0.36
AsnLeu: 2.941 ± 0.506
AsnMet: 0.715 ± 0.208
AsnAsn: 1.59 ± 0.422
AsnPro: 2.703 ± 0.392
AsnGln: 1.113 ± 0.228
AsnArg: 2.464 ± 0.501
AsnSer: 2.862 ± 0.643
AsnThr: 1.828 ± 0.489
AsnVal: 1.987 ± 0.338
AsnTrp: 0.715 ± 0.226
AsnTyr: 0.954 ± 0.237
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 7.472 ± 0.865
ProCys: 0.556 ± 0.224
ProAsp: 3.657 ± 0.547
ProGlu: 3.975 ± 0.666
ProPhe: 2.226 ± 0.462
ProGly: 3.975 ± 0.637
ProHis: 1.192 ± 0.293
ProIle: 3.18 ± 0.482
ProLys: 2.862 ± 0.56
ProLeu: 3.339 ± 0.55
ProMet: 1.033 ± 0.216
ProAsn: 1.828 ± 0.352
ProPro: 1.749 ± 0.373
ProGln: 1.113 ± 0.31
ProArg: 2.226 ± 0.41
ProSer: 2.464 ± 0.429
ProThr: 3.259 ± 0.529
ProVal: 2.862 ± 0.504
ProTrp: 1.113 ± 0.258
ProTyr: 0.954 ± 0.262
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 5.644 ± 0.629
GlnCys: 0.318 ± 0.207
GlnAsp: 1.908 ± 0.369
GlnGlu: 1.828 ± 0.339
GlnPhe: 1.113 ± 0.314
GlnGly: 2.146 ± 0.522
GlnHis: 0.636 ± 0.204
GlnIle: 1.351 ± 0.257
GlnLys: 0.874 ± 0.292
GlnLeu: 3.498 ± 0.534
GlnMet: 0.636 ± 0.189
GlnAsn: 1.669 ± 0.389
GlnPro: 1.272 ± 0.237
GlnGln: 0.954 ± 0.31
GlnArg: 2.226 ± 0.315
GlnSer: 1.749 ± 0.369
GlnThr: 1.59 ± 0.298
GlnVal: 3.259 ± 0.507
GlnTrp: 0.715 ± 0.224
GlnTyr: 1.192 ± 0.292
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 6.359 ± 0.727
ArgCys: 0.318 ± 0.19
ArgAsp: 3.418 ± 0.574
ArgGlu: 3.816 ± 0.66
ArgPhe: 2.067 ± 0.365
ArgGly: 4.769 ± 0.746
ArgHis: 1.59 ± 0.402
ArgIle: 3.18 ± 0.443
ArgLys: 2.305 ± 0.316
ArgLeu: 5.882 ± 0.797
ArgMet: 1.113 ± 0.261
ArgAsn: 2.067 ± 0.349
ArgPro: 2.305 ± 0.448
ArgGln: 2.067 ± 0.334
ArgArg: 5.246 ± 0.728
ArgSer: 4.213 ± 0.573
ArgThr: 3.736 ± 0.467
ArgVal: 4.61 ± 0.477
ArgTrp: 1.351 ± 0.31
ArgTyr: 1.113 ± 0.267
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 5.246 ± 0.681
SerCys: 0.0 ± 0.0
SerAsp: 3.736 ± 0.548
SerGlu: 1.987 ± 0.444
SerPhe: 1.908 ± 0.378
SerGly: 5.723 ± 0.689
SerHis: 0.715 ± 0.287
SerIle: 2.305 ± 0.364
SerLys: 2.623 ± 0.467
SerLeu: 5.326 ± 0.745
SerMet: 1.431 ± 0.412
SerAsn: 1.908 ± 0.474
SerPro: 2.862 ± 0.454
SerGln: 1.908 ± 0.332
SerArg: 3.816 ± 0.467
SerSer: 3.021 ± 0.564
SerThr: 3.975 ± 0.444
SerVal: 2.703 ± 0.46
SerTrp: 1.272 ± 0.321
SerTyr: 0.715 ± 0.252
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.995 ± 0.771
ThrCys: 0.238 ± 0.159
ThrAsp: 4.849 ± 0.605
ThrGlu: 4.134 ± 0.476
ThrPhe: 2.305 ± 0.354
ThrGly: 6.2 ± 0.727
ThrHis: 1.431 ± 0.353
ThrIle: 4.293 ± 0.662
ThrLys: 3.736 ± 0.529
ThrLeu: 5.008 ± 0.732
ThrMet: 1.351 ± 0.306
ThrAsn: 1.828 ± 0.375
ThrPro: 4.054 ± 0.612
ThrGln: 1.272 ± 0.254
ThrArg: 3.418 ± 0.454
ThrSer: 3.259 ± 0.467
ThrThr: 4.849 ± 0.74
ThrVal: 4.531 ± 0.642
ThrTrp: 1.351 ± 0.301
ThrTyr: 1.272 ± 0.272
ThrXaa: 0.0 ± 0.0

Val
ValAla: 8.029 ± 0.666
ValCys: 0.397 ± 0.213
ValAsp: 4.928 ± 0.469
ValGlu: 4.452 ± 0.639
ValPhe: 2.464 ± 0.524
ValGly: 5.644 ± 0.625
ValHis: 1.431 ± 0.355
ValIle: 3.18 ± 0.583
ValLys: 2.385 ± 0.442
ValLeu: 6.359 ± 0.738
ValMet: 1.351 ± 0.35
ValAsn: 2.146 ± 0.398
ValPro: 3.895 ± 0.501
ValGln: 2.146 ± 0.391
ValArg: 4.452 ± 0.516
ValSer: 3.259 ± 0.483
ValThr: 4.61 ± 0.685
ValVal: 5.644 ± 0.666
ValTrp: 1.51 ± 0.342
ValTyr: 2.862 ± 0.437
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 2.305 ± 0.431
TrpCys: 0.079 ± 0.082
TrpAsp: 0.636 ± 0.238
TrpGlu: 0.954 ± 0.295
TrpPhe: 0.954 ± 0.24
TrpGly: 1.351 ± 0.265
TrpHis: 0.159 ± 0.127
TrpIle: 0.874 ± 0.27
TrpLys: 0.715 ± 0.184
TrpLeu: 1.908 ± 0.375
TrpMet: 0.318 ± 0.151
TrpAsn: 1.272 ± 0.341
TrpPro: 0.397 ± 0.176
TrpGln: 0.954 ± 0.329
TrpArg: 0.874 ± 0.254
TrpSer: 1.431 ± 0.317
TrpThr: 1.51 ± 0.287
TrpVal: 1.59 ± 0.302
TrpTrp: 0.318 ± 0.147
TrpTyr: 0.477 ± 0.168
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.941 ± 0.531
TyrCys: 0.079 ± 0.094
TyrAsp: 2.226 ± 0.399
TyrGlu: 1.59 ± 0.344
TyrPhe: 0.397 ± 0.142
TyrGly: 2.782 ± 0.519
TyrHis: 0.238 ± 0.146
TyrIle: 1.431 ± 0.382
TyrLys: 0.636 ± 0.222
TyrLeu: 1.828 ± 0.351
TyrMet: 0.715 ± 0.204
TyrAsn: 0.954 ± 0.277
TyrPro: 1.749 ± 0.355
TyrGln: 1.113 ± 0.287
TyrArg: 2.146 ± 0.46
TyrSer: 1.192 ± 0.413
TyrThr: 1.51 ± 0.303
TyrVal: 0.795 ± 0.265
TyrTrp: 0.556 ± 0.176
TyrTyr: 0.795 ± 0.329
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 68 proteins (12581 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
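A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the error is taken as the standard deviation over the replicates. The source does not specify these details, and the names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the spread of one dipeptide frequency (per mille).

    Assumptions: proteins are resampled with replacement as whole units,
    and the error is the standard deviation over the replicates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    replicates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        replicates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(replicates)

# Toy sequences in one-letter code, not taken from the Bernstein proteome.
toy = ["MAAGLA", "MKAAV", "MGGA", "MLLAA"]
err_aa = bootstrap_error(toy, "AA")
err_ww = bootstrap_error(toy, "WW")
```

A dipeptide that never occurs (here WW) has zero frequency in every replicate and therefore zero error, which matches the 0.0 ± 0.0 entries for the Xaa dipeptides in the table.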

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski