Amino acid dipeptide frequency for Microbacterium phage Sansa

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). Since this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones are marked in blue in the table.
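
The calculation described above can be sketched in Python. This is only an illustration, not the database's own code: it assumes overlapping two-residue windows that never span two proteins, normalization by the total number of windows, and one-letter codes with X standing in for Xaa. The names `dipeptide_per_mille` and `representation` are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter code
ALPHABET = STANDARD + "X"           # X plays the role of Xaa (assumption)
UNIFORM_PER_MILLE = 1000.0 / 400    # 0.25% = 2.5 per mille


def dipeptide_per_mille(proteins):
    """Return the frequency (in per mille) of all 441 dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Map every non-standard letter to X so the alphabet stays at 21 symbols.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows within one protein; windows never cross proteins.
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def representation(freq_per_mille):
    """Classify a frequency against the uniform 2.5 per mille expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"
```

For example, `representation(8.177)` classifies the AlaAla value from the table as overrepresented, and `representation(0.476)` classifies AlaCys as underrepresented. The database may normalize differently (for instance by the number of residues rather than windows), so the absolute values from this sketch need not match the table exactly.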

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
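
As a small worked example of this unit conversion, the following Python sketch turns a per mille value into a plain fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical and chosen only for illustration.

```python
UNIFORM_EXPECTATION_PER_MILLE = 1000.0 / 400  # 1/400 expressed in per mille (2.5)


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a fraction by multiplying by 10^-3."""
    return value_per_mille * 1e-3


def fold_over_uniform(value_per_mille):
    """Ratio of an observed frequency to the uniform 2.5 per mille expectation."""
    return value_per_mille / UNIFORM_EXPECTATION_PER_MILLE


# AlaAla is listed as 8.177 per mille, i.e. a fraction of about 0.008177,
# roughly 3.27 times the uniform expectation.
print(per_mille_to_fraction(8.177))
print(round(fold_over_uniform(8.177), 2))
```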

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.177 ± 0.758
AlaCys: 0.476 ± 0.188
AlaAsp: 5.478 ± 0.595
AlaGlu: 6.589 ± 0.713
AlaPhe: 2.779 ± 0.475
AlaGly: 8.336 ± 0.956
AlaHis: 1.905 ± 0.368
AlaIle: 5.716 ± 0.63
AlaLys: 4.366 ± 0.677
AlaLeu: 9.209 ± 0.994
AlaMet: 2.779 ± 0.478
AlaAsn: 3.414 ± 0.594
AlaPro: 3.89 ± 0.701
AlaGln: 3.414 ± 0.512
AlaArg: 5.319 ± 0.584
AlaSer: 5.319 ± 0.6
AlaThr: 7.066 ± 0.778
AlaVal: 7.542 ± 0.9
AlaTrp: 2.064 ± 0.477
AlaTyr: 2.461 ± 0.568
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.715 ± 0.234
CysCys: 0.0 ± 0.0
CysAsp: 0.476 ± 0.173
CysGlu: 0.556 ± 0.22
CysPhe: 0.079 ± 0.072
CysGly: 0.635 ± 0.268
CysHis: 0.238 ± 0.14
CysIle: 0.079 ± 0.091
CysLys: 0.318 ± 0.156
CysLeu: 0.318 ± 0.155
CysMet: 0.079 ± 0.065
CysAsn: 0.159 ± 0.118
CysPro: 0.556 ± 0.187
CysGln: 0.397 ± 0.239
CysArg: 0.238 ± 0.137
CysSer: 0.318 ± 0.155
CysThr: 0.318 ± 0.17
CysVal: 0.556 ± 0.21
CysTrp: 0.159 ± 0.094
CysTyr: 0.238 ± 0.134
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.763 ± 0.718
AspCys: 0.635 ± 0.255
AspAsp: 5.002 ± 0.882
AspGlu: 5.399 ± 1.347
AspPhe: 1.667 ± 0.28
AspGly: 3.89 ± 0.572
AspHis: 1.27 ± 0.357
AspIle: 3.493 ± 0.446
AspLys: 2.62 ± 0.487
AspLeu: 5.399 ± 0.695
AspMet: 1.429 ± 0.361
AspAsn: 2.064 ± 0.412
AspPro: 4.684 ± 0.53
AspGln: 1.905 ± 0.426
AspArg: 2.779 ± 0.566
AspSer: 3.414 ± 0.562
AspThr: 3.652 ± 0.497
AspVal: 3.414 ± 0.465
AspTrp: 1.35 ± 0.351
AspTyr: 2.54 ± 0.463
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.383 ± 0.752
GluCys: 0.635 ± 0.226
GluAsp: 5.16 ± 1.121
GluGlu: 5.954 ± 1.175
GluPhe: 2.461 ± 0.41
GluGly: 4.446 ± 0.619
GluHis: 1.667 ± 0.483
GluIle: 2.144 ± 0.426
GluLys: 2.461 ± 0.467
GluLeu: 5.081 ± 0.602
GluMet: 1.747 ± 0.379
GluAsn: 1.588 ± 0.318
GluPro: 2.144 ± 0.472
GluGln: 2.779 ± 0.592
GluArg: 4.049 ± 0.622
GluSer: 4.049 ± 0.573
GluThr: 4.446 ± 0.657
GluVal: 5.24 ± 0.926
GluTrp: 1.508 ± 0.328
GluTyr: 1.747 ± 0.365
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.223 ± 0.439
PheCys: 0.238 ± 0.139
PheAsp: 1.588 ± 0.299
PheGlu: 1.508 ± 0.298
PhePhe: 1.032 ± 0.271
PheGly: 3.573 ± 0.542
PheHis: 0.715 ± 0.253
PheIle: 1.508 ± 0.307
PheLys: 1.985 ± 0.384
PheLeu: 2.302 ± 0.445
PheMet: 0.953 ± 0.303
PheAsn: 1.032 ± 0.244
PhePro: 1.27 ± 0.316
PheGln: 1.111 ± 0.242
PheArg: 1.985 ± 0.354
PheSer: 1.985 ± 0.392
PheThr: 2.461 ± 0.408
PheVal: 1.588 ± 0.33
PheTrp: 0.476 ± 0.289
PheTyr: 0.794 ± 0.273
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.066 ± 1.091
GlyCys: 0.635 ± 0.232
GlyAsp: 4.366 ± 0.627
GlyGlu: 4.525 ± 0.643
GlyPhe: 3.334 ± 0.569
GlyGly: 5.557 ± 1.006
GlyHis: 1.429 ± 0.316
GlyIle: 5.002 ± 0.786
GlyLys: 5.24 ± 0.712
GlyLeu: 6.192 ± 1.077
GlyMet: 2.62 ± 0.47
GlyAsn: 3.176 ± 0.477
GlyPro: 3.017 ± 0.461
GlyGln: 4.049 ± 0.77
GlyArg: 5.954 ± 0.777
GlySer: 4.287 ± 0.663
GlyThr: 6.113 ± 0.711
GlyVal: 6.113 ± 0.732
GlyTrp: 1.35 ± 0.303
GlyTyr: 2.54 ± 0.429
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.985 ± 0.54
HisCys: 0.159 ± 0.114
HisAsp: 1.032 ± 0.303
HisGlu: 1.35 ± 0.448
HisPhe: 0.873 ± 0.234
HisGly: 1.826 ± 0.435
HisHis: 0.238 ± 0.159
HisIle: 0.873 ± 0.241
HisLys: 1.191 ± 0.281
HisLeu: 1.747 ± 0.433
HisMet: 0.397 ± 0.241
HisAsn: 0.715 ± 0.244
HisPro: 0.635 ± 0.236
HisGln: 0.318 ± 0.176
HisArg: 0.873 ± 0.195
HisSer: 0.715 ± 0.234
HisThr: 0.794 ± 0.253
HisVal: 1.191 ± 0.332
HisTrp: 0.318 ± 0.123
HisTyr: 0.794 ± 0.182
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.366 ± 0.684
IleCys: 0.159 ± 0.125
IleAsp: 4.446 ± 0.517
IleGlu: 3.97 ± 0.54
IlePhe: 0.635 ± 0.287
IleGly: 5.002 ± 1.034
IleHis: 1.032 ± 0.28
IleIle: 3.176 ± 0.747
IleLys: 2.223 ± 0.446
IleLeu: 2.54 ± 0.483
IleMet: 1.191 ± 0.272
IleAsn: 1.905 ± 0.414
IlePro: 2.144 ± 0.564
IleGln: 2.144 ± 0.382
IleArg: 2.779 ± 0.442
IleSer: 3.811 ± 0.661
IleThr: 3.573 ± 0.719
IleVal: 3.096 ± 0.532
IleTrp: 0.873 ± 0.237
IleTyr: 1.905 ± 0.431
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.272 ± 0.978
LysCys: 0.238 ± 0.14
LysAsp: 2.064 ± 0.463
LysGlu: 2.858 ± 0.525
LysPhe: 0.715 ± 0.221
LysGly: 3.89 ± 0.595
LysHis: 0.476 ± 0.196
LysIle: 1.826 ± 0.327
LysLys: 2.302 ± 0.499
LysLeu: 5.319 ± 0.598
LysMet: 1.27 ± 0.32
LysAsn: 1.032 ± 0.306
LysPro: 3.334 ± 0.561
LysGln: 1.429 ± 0.334
LysArg: 3.811 ± 0.61
LysSer: 2.699 ± 0.521
LysThr: 3.493 ± 0.528
LysVal: 4.287 ± 0.75
LysTrp: 1.35 ± 0.402
LysTyr: 1.35 ± 0.328
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.018 ± 0.818
LeuCys: 0.397 ± 0.182
LeuAsp: 5.24 ± 0.685
LeuGlu: 5.081 ± 0.753
LeuPhe: 2.461 ± 0.416
LeuGly: 6.589 ± 0.937
LeuHis: 0.953 ± 0.248
LeuIle: 4.843 ± 1.005
LeuLys: 4.605 ± 0.638
LeuLeu: 6.828 ± 0.762
LeuMet: 1.35 ± 0.27
LeuAsn: 2.779 ± 0.384
LeuPro: 4.922 ± 0.769
LeuGln: 2.144 ± 0.373
LeuArg: 4.763 ± 0.606
LeuSer: 5.557 ± 0.658
LeuThr: 6.034 ± 0.858
LeuVal: 7.304 ± 0.91
LeuTrp: 0.873 ± 0.294
LeuTyr: 2.144 ± 0.392
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.573 ± 0.365
MetCys: 0.238 ± 0.136
MetAsp: 1.508 ± 0.343
MetGlu: 1.429 ± 0.388
MetPhe: 0.873 ± 0.28
MetGly: 2.302 ± 0.476
MetHis: 0.238 ± 0.138
MetIle: 1.35 ± 0.388
MetLys: 1.111 ± 0.274
MetLeu: 2.302 ± 0.507
MetMet: 0.476 ± 0.185
MetAsn: 1.35 ± 0.297
MetPro: 1.508 ± 0.385
MetGln: 0.794 ± 0.232
MetArg: 1.27 ± 0.373
MetSer: 2.54 ± 0.447
MetThr: 1.667 ± 0.428
MetVal: 1.35 ± 0.269
MetTrp: 0.238 ± 0.145
MetTyr: 0.159 ± 0.103
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.096 ± 0.376
AsnCys: 0.159 ± 0.109
AsnAsp: 1.588 ± 0.394
AsnGlu: 1.508 ± 0.387
AsnPhe: 0.794 ± 0.303
AsnGly: 3.176 ± 0.548
AsnHis: 0.873 ± 0.267
AsnIle: 1.985 ± 0.431
AsnLys: 1.905 ± 0.376
AsnLeu: 2.779 ± 0.38
AsnMet: 0.873 ± 0.238
AsnAsn: 1.111 ± 0.32
AsnPro: 2.223 ± 0.363
AsnGln: 1.032 ± 0.294
AsnArg: 1.667 ± 0.357
AsnSer: 2.699 ± 0.551
AsnThr: 2.54 ± 0.469
AsnVal: 1.985 ± 0.524
AsnTrp: 0.476 ± 0.172
AsnTyr: 0.953 ± 0.229
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.446 ± 0.656
ProCys: 0.079 ± 0.091
ProAsp: 2.54 ± 0.514
ProGlu: 4.128 ± 0.756
ProPhe: 1.747 ± 0.359
ProGly: 4.128 ± 0.679
ProHis: 0.953 ± 0.308
ProIle: 1.747 ± 0.335
ProLys: 3.096 ± 0.53
ProLeu: 3.255 ± 0.437
ProMet: 1.35 ± 0.415
ProAsn: 1.35 ± 0.351
ProPro: 1.429 ± 0.323
ProGln: 1.508 ± 0.378
ProArg: 2.382 ± 0.542
ProSer: 3.255 ± 0.463
ProThr: 4.287 ± 0.671
ProVal: 3.652 ± 0.5
ProTrp: 0.715 ± 0.233
ProTyr: 0.953 ± 0.241
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.843 ± 0.669
GlnCys: 0.318 ± 0.168
GlnAsp: 2.144 ± 0.411
GlnGlu: 2.62 ± 0.504
GlnPhe: 0.079 ± 0.08
GlnGly: 2.461 ± 0.431
GlnHis: 0.794 ± 0.261
GlnIle: 1.826 ± 0.441
GlnLys: 1.747 ± 0.344
GlnLeu: 2.699 ± 0.392
GlnMet: 0.715 ± 0.236
GlnAsn: 1.35 ± 0.413
GlnPro: 1.032 ± 0.26
GlnGln: 0.715 ± 0.224
GlnArg: 2.62 ± 0.403
GlnSer: 2.144 ± 0.374
GlnThr: 2.064 ± 0.382
GlnVal: 2.382 ± 0.477
GlnTrp: 0.476 ± 0.211
GlnTyr: 1.35 ± 0.296
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.716 ± 0.742
ArgCys: 0.476 ± 0.203
ArgAsp: 3.89 ± 0.482
ArgGlu: 3.731 ± 0.66
ArgPhe: 1.826 ± 0.4
ArgGly: 4.208 ± 0.515
ArgHis: 0.715 ± 0.271
ArgIle: 3.017 ± 0.524
ArgLys: 3.176 ± 0.671
ArgLeu: 5.16 ± 0.756
ArgMet: 2.461 ± 0.435
ArgAsn: 2.302 ± 0.458
ArgPro: 2.699 ± 0.466
ArgGln: 1.111 ± 0.254
ArgArg: 3.573 ± 0.693
ArgSer: 3.493 ± 0.5
ArgThr: 3.493 ± 0.513
ArgVal: 4.763 ± 0.646
ArgTrp: 1.27 ± 0.379
ArgTyr: 2.54 ± 0.458
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.431 ± 0.747
SerCys: 0.397 ± 0.169
SerAsp: 3.176 ± 0.5
SerGlu: 3.334 ± 0.578
SerPhe: 2.699 ± 0.431
SerGly: 4.922 ± 0.641
SerHis: 1.191 ± 0.344
SerIle: 3.096 ± 0.467
SerLys: 3.573 ± 0.445
SerLeu: 5.24 ± 0.646
SerMet: 1.905 ± 0.472
SerAsn: 1.747 ± 0.358
SerPro: 2.144 ± 0.387
SerGln: 2.54 ± 0.524
SerArg: 3.97 ± 0.797
SerSer: 3.493 ± 0.499
SerThr: 4.287 ± 0.664
SerVal: 5.319 ± 0.677
SerTrp: 1.111 ± 0.304
SerTyr: 1.826 ± 0.415
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.716 ± 0.614
ThrCys: 0.159 ± 0.113
ThrAsp: 4.287 ± 0.673
ThrGlu: 3.414 ± 0.512
ThrPhe: 2.699 ± 0.445
ThrGly: 6.51 ± 0.755
ThrHis: 0.873 ± 0.247
ThrIle: 3.414 ± 0.428
ThrLys: 2.62 ± 0.47
ThrLeu: 5.875 ± 0.896
ThrMet: 1.191 ± 0.373
ThrAsn: 1.826 ± 0.349
ThrPro: 4.049 ± 0.478
ThrGln: 2.302 ± 0.391
ThrArg: 4.049 ± 0.531
ThrSer: 5.16 ± 0.9
ThrThr: 4.446 ± 0.655
ThrVal: 5.399 ± 0.572
ThrTrp: 1.588 ± 0.309
ThrTyr: 2.144 ± 0.401
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.177 ± 0.721
ValCys: 0.238 ± 0.155
ValAsp: 4.366 ± 0.702
ValGlu: 5.002 ± 0.731
ValPhe: 2.144 ± 0.39
ValGly: 6.51 ± 1.073
ValHis: 1.429 ± 0.364
ValIle: 3.731 ± 0.63
ValLys: 2.54 ± 0.463
ValLeu: 6.907 ± 0.722
ValMet: 2.223 ± 0.45
ValAsn: 2.144 ± 0.37
ValPro: 2.858 ± 0.514
ValGln: 3.255 ± 0.537
ValArg: 4.446 ± 0.655
ValSer: 4.208 ± 0.712
ValThr: 4.684 ± 0.584
ValVal: 5.399 ± 0.697
ValTrp: 1.667 ± 0.351
ValTyr: 1.985 ± 0.313
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.111 ± 0.232
TrpCys: 0.238 ± 0.183
TrpAsp: 1.27 ± 0.324
TrpGlu: 1.191 ± 0.341
TrpPhe: 0.794 ± 0.28
TrpGly: 1.35 ± 0.317
TrpHis: 0.635 ± 0.222
TrpIle: 1.032 ± 0.29
TrpLys: 1.27 ± 0.35
TrpLeu: 2.064 ± 0.422
TrpMet: 0.476 ± 0.176
TrpAsn: 0.794 ± 0.231
TrpPro: 0.794 ± 0.258
TrpGln: 0.476 ± 0.245
TrpArg: 0.953 ± 0.289
TrpSer: 0.953 ± 0.349
TrpThr: 0.953 ± 0.289
TrpVal: 1.27 ± 0.28
TrpTrp: 0.556 ± 0.265
TrpTyr: 1.032 ± 0.291
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.223 ± 0.381
TyrCys: 0.556 ± 0.229
TyrAsp: 1.747 ± 0.386
TyrGlu: 2.54 ± 0.351
TyrPhe: 0.635 ± 0.204
TyrGly: 3.255 ± 0.584
TyrHis: 0.397 ± 0.147
TyrIle: 1.111 ± 0.293
TyrLys: 1.667 ± 0.445
TyrLeu: 1.667 ± 0.344
TyrMet: 0.715 ± 0.27
TyrAsn: 1.588 ± 0.389
TyrPro: 1.667 ± 0.427
TyrGln: 0.953 ± 0.331
TyrArg: 2.144 ± 0.448
TyrSer: 2.382 ± 0.533
TyrThr: 1.35 ± 0.25
TyrVal: 2.064 ± 0.384
TyrTrp: 0.794 ± 0.219
TyrTyr: 1.032 ± 0.358
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 62 proteins (12597 amino acids)
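
Each table entry pairs a dipeptide name with its frequency and error, for example "AlaAla: 8.177 ± 0.758". A minimal Python sketch for reading such entries into numbers is given below, assuming three-letter residue codes and the "value ± error" layout seen in the table. The function name `parse_entry` is hypothetical and not part of any Proteome-pI tooling.

```python
import re

# Two three-letter residue codes, a colon, the value, "±", and the error.
ENTRY_PATTERN = re.compile(
    r"([A-Z][a-z]{2})([A-Z][a-z]{2}):\s*([0-9.]+)\s*±\s*([0-9.]+)"
)


def parse_entry(line):
    """Return (first, second, value, error) for a table entry, or None.

    Values and errors are in per mille, as in the table. The pattern is
    searched anywhere in the line, so a value fused in front of the
    dipeptide name does not prevent a match.
    """
    match = ENTRY_PATTERN.search(line)
    if match is None:
        return None
    first, second, value, error = match.groups()
    return first, second, float(value), float(error)


print(parse_entry("AlaAla: 8.177 ± 0.758"))
```

Lines that contain only a row label, such as "Ala", do not match the pattern and yield `None`, so they can be skipped when iterating over the table.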

Note: The error was estimated by bootstrapping (×100) at the protein level.
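
A protein-level bootstrap of this kind might look like the Python sketch below. It is an assumption-laden illustration rather than the database's procedure: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 replicates, and the sample standard deviation of those replicates is reported as the error. The function names, the fixed seed, and the choice of standard deviation are all hypothetical.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over overlapping windows."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency across bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement (protein-level bootstrap).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)
```

Resampling whole proteins instead of individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.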

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski