Amino acid dipeptide frequency for Klebsiella phage ST899-OXA48phi17.1

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
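The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own procedure: the function names (`dipeptide_permille`, `label`) are hypothetical, sequences are given in one-letter codes rather than the three-letter codes used in the table, overlapping pairs are counted within each protein, and the uniform value of 2.5‰ (1/400) is used as the "expected" threshold.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard residues, one-letter codes
UNIFORM_PERMILLE = 1000.0 / 400     # 0.25 % = 2.5 per mille


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein and
    normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Anything outside the 20 standard letters is treated as Xaa ("X").
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def label(freq_permille):
    """Compare one frequency with the uniform expectation of 2.5 per mille."""
    if freq_permille > UNIFORM_PERMILLE:
        return "over"
    if freq_permille < UNIFORM_PERMILLE:
        return "under"
    return "as expected"


# Toy input, not real phage proteins; "B" is mapped to Xaa.
freqs = dipeptide_permille(["AAC", "ACB"])
```

With the table values, `label(11.413)` (AlaAla) gives "over" and `label(0.557)` (CysAla) gives "under".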

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
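As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and chosen only for illustration.

```python
def permille_to_fraction(value_permille):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 11.413  # AlaAla value from the table, in per mille
print(f"{permille_to_fraction(ala_ala):.6f}")  # 0.011413
print(f"{permille_to_percent(ala_ala):.4f}")   # 1.1413
```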

For more information, see the sequence space article on Wikipedia.

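Residue order (each block below is headed by the first residue; entries give the second residue in this order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa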
Ala
AlaAla: 11.413 ± 1.325
AlaCys: 1.299 ± 0.366
AlaAsp: 5.753 ± 0.767
AlaGlu: 5.475 ± 0.758
AlaPhe: 2.876 ± 0.505
AlaGly: 8.073 ± 1.303
AlaHis: 1.021 ± 0.357
AlaIle: 5.66 ± 0.807
AlaLys: 5.103 ± 1.058
AlaLeu: 7.238 ± 0.795
AlaMet: 3.34 ± 0.506
AlaAsn: 3.433 ± 0.603
AlaPro: 2.505 ± 0.515
AlaGln: 3.712 ± 0.585
AlaArg: 4.361 ± 0.765
AlaSer: 8.073 ± 0.834
AlaThr: 5.753 ± 0.77
AlaVal: 5.939 ± 0.757
AlaTrp: 1.299 ± 0.343
AlaTyr: 3.526 ± 0.59
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.557 ± 0.157
CysCys: 0.0 ± 0.0
CysAsp: 0.557 ± 0.232
CysGlu: 0.464 ± 0.265
CysPhe: 0.557 ± 0.247
CysGly: 0.65 ± 0.237
CysHis: 0.093 ± 0.078
CysIle: 0.371 ± 0.188
CysLys: 0.65 ± 0.239
CysLeu: 0.742 ± 0.269
CysMet: 0.278 ± 0.121
CysAsn: 0.371 ± 0.192
CysPro: 0.65 ± 0.23
CysGln: 0.464 ± 0.199
CysArg: 0.65 ± 0.268
CysSer: 0.464 ± 0.181
CysThr: 0.65 ± 0.223
CysVal: 0.65 ± 0.257
CysTrp: 0.278 ± 0.132
CysTyr: 0.464 ± 0.168
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.217 ± 0.923
AspCys: 0.186 ± 0.144
AspAsp: 3.804 ± 0.922
AspGlu: 3.433 ± 0.714
AspPhe: 3.34 ± 0.595
AspGly: 6.031 ± 1.012
AspHis: 0.557 ± 0.262
AspIle: 3.712 ± 0.59
AspLys: 2.691 ± 0.5
AspLeu: 4.268 ± 0.578
AspMet: 1.299 ± 0.309
AspAsn: 2.876 ± 0.467
AspPro: 1.67 ± 0.337
AspGln: 1.763 ± 0.432
AspArg: 2.227 ± 0.402
AspSer: 3.433 ± 0.663
AspThr: 2.691 ± 0.603
AspVal: 4.454 ± 0.705
AspTrp: 0.742 ± 0.276
AspTyr: 2.32 ± 0.401
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.547 ± 0.786
GluCys: 0.371 ± 0.185
GluAsp: 2.413 ± 0.76
GluGlu: 3.062 ± 0.622
GluPhe: 3.248 ± 0.498
GluGly: 3.062 ± 0.52
GluHis: 1.299 ± 0.522
GluIle: 2.505 ± 0.489
GluLys: 4.547 ± 0.633
GluLeu: 4.64 ± 0.656
GluMet: 1.577 ± 0.305
GluAsn: 2.598 ± 0.501
GluPro: 1.856 ± 0.597
GluGln: 2.505 ± 0.468
GluArg: 2.969 ± 0.536
GluSer: 3.155 ± 0.647
GluThr: 3.526 ± 0.552
GluVal: 3.248 ± 0.54
GluTrp: 1.113 ± 0.373
GluTyr: 1.763 ± 0.437
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.248 ± 0.535
PheCys: 0.278 ± 0.18
PheAsp: 1.949 ± 0.47
PheGlu: 1.763 ± 0.379
PhePhe: 0.928 ± 0.323
PheGly: 2.505 ± 0.392
PheHis: 0.742 ± 0.263
PheIle: 2.041 ± 0.555
PheLys: 1.577 ± 0.509
PheLeu: 2.876 ± 0.648
PheMet: 0.65 ± 0.209
PheAsn: 2.784 ± 0.585
PhePro: 2.227 ± 0.497
PheGln: 1.299 ± 0.332
PheArg: 1.856 ± 0.356
PheSer: 2.784 ± 0.455
PheThr: 3.155 ± 0.532
PheVal: 2.876 ± 0.491
PheTrp: 0.557 ± 0.176
PheTyr: 1.856 ± 0.343
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.866 ± 1.145
GlyCys: 0.742 ± 0.278
GlyAsp: 5.289 ± 0.789
GlyGlu: 3.433 ± 0.852
GlyPhe: 3.062 ± 0.536
GlyGly: 8.258 ± 2.19
GlyHis: 0.557 ± 0.262
GlyIle: 5.475 ± 0.788
GlyLys: 4.268 ± 0.648
GlyLeu: 6.774 ± 0.924
GlyMet: 2.691 ± 0.471
GlyAsn: 2.784 ± 0.572
GlyPro: 1.763 ± 0.401
GlyGln: 3.804 ± 0.615
GlyArg: 4.268 ± 0.524
GlySer: 5.103 ± 0.71
GlyThr: 5.103 ± 1.083
GlyVal: 5.382 ± 0.71
GlyTrp: 1.299 ± 0.343
GlyTyr: 2.041 ± 0.458
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.392 ± 0.393
HisCys: 0.186 ± 0.111
HisAsp: 0.928 ± 0.219
HisGlu: 0.557 ± 0.247
HisPhe: 0.65 ± 0.23
HisGly: 1.021 ± 0.342
HisHis: 0.278 ± 0.178
HisIle: 1.299 ± 0.324
HisLys: 0.371 ± 0.209
HisLeu: 0.742 ± 0.33
HisMet: 0.835 ± 0.254
HisAsn: 0.371 ± 0.195
HisPro: 0.835 ± 0.259
HisGln: 0.742 ± 0.302
HisArg: 0.742 ± 0.311
HisSer: 1.206 ± 0.325
HisThr: 1.021 ± 0.275
HisVal: 0.557 ± 0.256
HisTrp: 0.093 ± 0.081
HisTyr: 0.928 ± 0.324
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.567 ± 0.608
IleCys: 0.371 ± 0.226
IleAsp: 4.083 ± 0.64
IleGlu: 4.732 ± 0.682
IlePhe: 2.041 ± 0.39
IleGly: 3.712 ± 0.662
IleHis: 1.392 ± 0.368
IleIle: 3.433 ± 0.579
IleLys: 2.969 ± 0.562
IleLeu: 3.155 ± 0.627
IleMet: 1.113 ± 0.323
IleAsn: 3.062 ± 0.526
IlePro: 3.526 ± 0.562
IleGln: 2.598 ± 0.431
IleArg: 3.433 ± 0.622
IleSer: 3.99 ± 0.518
IleThr: 4.64 ± 0.542
IleVal: 3.155 ± 0.523
IleTrp: 0.928 ± 0.302
IleTyr: 2.413 ± 0.59
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.567 ± 0.805
LysCys: 0.557 ± 0.28
LysAsp: 3.062 ± 0.503
LysGlu: 3.062 ± 0.548
LysPhe: 1.206 ± 0.365
LysGly: 3.99 ± 0.662
LysHis: 0.742 ± 0.259
LysIle: 2.413 ± 0.431
LysLys: 4.268 ± 0.645
LysLeu: 5.103 ± 0.96
LysMet: 1.763 ± 0.369
LysAsn: 1.856 ± 0.45
LysPro: 2.598 ± 0.456
LysGln: 2.227 ± 0.463
LysArg: 2.227 ± 0.586
LysSer: 3.804 ± 0.736
LysThr: 3.248 ± 0.519
LysVal: 3.99 ± 0.639
LysTrp: 0.557 ± 0.222
LysTyr: 1.485 ± 0.321
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.588 ± 0.846
LeuCys: 0.928 ± 0.298
LeuAsp: 4.361 ± 0.638
LeuGlu: 3.897 ± 0.637
LeuPhe: 2.134 ± 0.566
LeuGly: 3.897 ± 0.577
LeuHis: 0.65 ± 0.193
LeuIle: 5.753 ± 0.725
LeuLys: 4.268 ± 0.422
LeuLeu: 5.846 ± 0.792
LeuMet: 2.32 ± 0.558
LeuAsn: 4.732 ± 0.593
LeuPro: 3.248 ± 0.407
LeuGln: 2.969 ± 0.482
LeuArg: 3.712 ± 0.543
LeuSer: 8.815 ± 0.865
LeuThr: 5.475 ± 0.715
LeuVal: 5.846 ± 0.772
LeuTrp: 0.835 ± 0.235
LeuTyr: 2.041 ± 0.377
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.248 ± 0.752
MetCys: 0.278 ± 0.198
MetAsp: 1.67 ± 0.357
MetGlu: 0.928 ± 0.224
MetPhe: 0.464 ± 0.181
MetGly: 1.299 ± 0.293
MetHis: 0.371 ± 0.164
MetIle: 1.392 ± 0.293
MetLys: 1.856 ± 0.348
MetLeu: 2.876 ± 0.467
MetMet: 1.021 ± 0.293
MetAsn: 1.299 ± 0.371
MetPro: 1.392 ± 0.31
MetGln: 1.021 ± 0.395
MetArg: 1.67 ± 0.437
MetSer: 2.505 ± 0.438
MetThr: 1.763 ± 0.339
MetVal: 1.577 ± 0.391
MetTrp: 0.371 ± 0.203
MetTyr: 1.206 ± 0.282
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.825 ± 0.672
AsnCys: 0.557 ± 0.264
AsnAsp: 3.248 ± 0.444
AsnGlu: 1.67 ± 0.393
AsnPhe: 2.227 ± 0.395
AsnGly: 4.176 ± 0.547
AsnHis: 0.742 ± 0.258
AsnIle: 3.155 ± 0.512
AsnLys: 2.134 ± 0.546
AsnLeu: 2.505 ± 0.443
AsnMet: 1.392 ± 0.322
AsnAsn: 2.784 ± 0.493
AsnPro: 2.32 ± 0.362
AsnGln: 1.949 ± 0.481
AsnArg: 2.227 ± 0.453
AsnSer: 2.413 ± 0.455
AsnThr: 2.413 ± 0.451
AsnVal: 3.712 ± 0.601
AsnTrp: 1.113 ± 0.293
AsnTyr: 1.206 ± 0.381
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.804 ± 0.515
ProCys: 0.464 ± 0.164
ProAsp: 2.32 ± 0.554
ProGlu: 3.433 ± 0.578
ProPhe: 1.206 ± 0.344
ProGly: 3.34 ± 0.456
ProHis: 0.65 ± 0.23
ProIle: 1.67 ± 0.47
ProLys: 1.485 ± 0.385
ProLeu: 3.526 ± 0.655
ProMet: 1.021 ± 0.313
ProAsn: 1.577 ± 0.325
ProPro: 1.856 ± 0.386
ProGln: 1.206 ± 0.357
ProArg: 1.577 ± 0.327
ProSer: 2.784 ± 0.607
ProThr: 2.134 ± 0.408
ProVal: 2.598 ± 0.429
ProTrp: 0.464 ± 0.182
ProTyr: 2.041 ± 0.469
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.268 ± 0.563
GlnCys: 1.113 ± 0.37
GlnAsp: 1.577 ± 0.348
GlnGlu: 1.763 ± 0.328
GlnPhe: 1.949 ± 0.494
GlnGly: 2.505 ± 0.35
GlnHis: 0.278 ± 0.137
GlnIle: 2.134 ± 0.386
GlnLys: 3.433 ± 0.684
GlnLeu: 3.248 ± 0.509
GlnMet: 1.206 ± 0.301
GlnAsn: 2.691 ± 0.5
GlnPro: 1.206 ± 0.292
GlnGln: 2.134 ± 0.431
GlnArg: 2.041 ± 0.528
GlnSer: 3.155 ± 0.519
GlnThr: 1.856 ± 0.518
GlnVal: 1.763 ± 0.442
GlnTrp: 0.742 ± 0.247
GlnTyr: 1.021 ± 0.268
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.176 ± 0.617
ArgCys: 0.557 ± 0.198
ArgAsp: 2.598 ± 0.445
ArgGlu: 2.784 ± 0.514
ArgPhe: 2.134 ± 0.409
ArgGly: 2.784 ± 0.647
ArgHis: 1.299 ± 0.399
ArgIle: 3.804 ± 0.703
ArgLys: 2.876 ± 0.594
ArgLeu: 4.64 ± 0.7
ArgMet: 1.299 ± 0.339
ArgAsn: 1.299 ± 0.334
ArgPro: 1.113 ± 0.372
ArgGln: 2.041 ± 0.528
ArgArg: 3.897 ± 0.943
ArgSer: 3.712 ± 0.703
ArgThr: 2.32 ± 0.544
ArgVal: 3.34 ± 0.473
ArgTrp: 1.392 ± 0.388
ArgTyr: 1.856 ± 0.374
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.702 ± 0.682
SerCys: 0.65 ± 0.327
SerAsp: 2.969 ± 0.482
SerGlu: 3.897 ± 0.67
SerPhe: 3.155 ± 0.435
SerGly: 9.093 ± 1.249
SerHis: 1.021 ± 0.323
SerIle: 5.567 ± 0.639
SerLys: 2.969 ± 0.423
SerLeu: 5.846 ± 0.707
SerMet: 1.949 ± 0.4
SerAsn: 2.876 ± 0.611
SerPro: 2.227 ± 0.423
SerGln: 3.062 ± 0.498
SerArg: 3.526 ± 0.675
SerSer: 5.939 ± 0.688
SerThr: 4.547 ± 0.754
SerVal: 4.64 ± 0.686
SerTrp: 0.928 ± 0.288
SerTyr: 2.32 ± 0.381
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.031 ± 0.779
ThrCys: 0.371 ± 0.153
ThrAsp: 3.897 ± 0.639
ThrGlu: 2.876 ± 0.396
ThrPhe: 1.856 ± 0.372
ThrGly: 5.753 ± 0.842
ThrHis: 1.299 ± 0.322
ThrIle: 3.155 ± 0.532
ThrLys: 3.062 ± 0.723
ThrLeu: 5.567 ± 0.715
ThrMet: 0.742 ± 0.245
ThrAsn: 2.969 ± 0.543
ThrPro: 2.691 ± 0.407
ThrGln: 2.041 ± 0.426
ThrArg: 3.248 ± 0.606
ThrSer: 4.825 ± 0.755
ThrThr: 3.155 ± 0.518
ThrVal: 5.846 ± 0.858
ThrTrp: 1.113 ± 0.327
ThrTyr: 2.041 ± 0.47
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.31 ± 0.843
ValCys: 0.278 ± 0.198
ValAsp: 4.454 ± 0.614
ValGlu: 4.176 ± 0.735
ValPhe: 2.32 ± 0.466
ValGly: 3.34 ± 0.56
ValHis: 1.299 ± 0.348
ValIle: 4.454 ± 0.593
ValLys: 3.526 ± 0.45
ValLeu: 5.196 ± 0.634
ValMet: 2.413 ± 0.517
ValAsn: 4.268 ± 0.484
ValPro: 3.248 ± 0.431
ValGln: 2.041 ± 0.421
ValArg: 2.876 ± 0.521
ValSer: 5.753 ± 0.769
ValThr: 5.103 ± 0.74
ValVal: 4.361 ± 0.781
ValTrp: 0.371 ± 0.173
ValTyr: 2.227 ± 0.48
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.021 ± 0.291
TrpCys: 0.186 ± 0.111
TrpAsp: 0.928 ± 0.277
TrpGlu: 1.392 ± 0.5
TrpPhe: 0.835 ± 0.279
TrpGly: 1.577 ± 0.353
TrpHis: 0.186 ± 0.134
TrpIle: 0.371 ± 0.155
TrpLys: 0.557 ± 0.186
TrpLeu: 1.485 ± 0.4
TrpMet: 0.65 ± 0.225
TrpAsn: 0.186 ± 0.142
TrpPro: 0.371 ± 0.218
TrpGln: 0.464 ± 0.244
TrpArg: 0.65 ± 0.252
TrpSer: 1.021 ± 0.239
TrpThr: 1.577 ± 0.402
TrpVal: 1.021 ± 0.21
TrpTrp: 0.371 ± 0.172
TrpTyr: 0.278 ± 0.131
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.505 ± 0.62
TyrCys: 0.186 ± 0.11
TyrAsp: 1.856 ± 0.356
TyrGlu: 1.206 ± 0.257
TyrPhe: 1.763 ± 0.48
TyrGly: 3.897 ± 0.739
TyrHis: 0.278 ± 0.156
TyrIle: 1.949 ± 0.449
TyrLys: 1.113 ± 0.269
TyrLeu: 2.041 ± 0.424
TyrMet: 0.557 ± 0.188
TyrAsn: 1.949 ± 0.393
TyrPro: 1.856 ± 0.451
TyrGln: 2.041 ± 0.502
TyrArg: 1.763 ± 0.408
TyrSer: 1.949 ± 0.444
TyrThr: 2.413 ± 0.431
TyrVal: 3.155 ± 0.473
TyrTrp: 0.557 ± 0.235
TyrTyr: 1.113 ± 0.321
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (10778 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
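A protein-level bootstrap of this kind could look roughly like the sketch below. The page only states that bootstrapping (x100) was done at the protein level; everything else here is an assumption made for illustration: the function names are hypothetical, whole proteins are resampled with replacement, pairs are counted within each protein, and the reported error is taken to be the standard deviation of the resampled frequencies.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins):
    """Dipeptide frequencies in per mille (overlapping pairs within each protein)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {p: 1000.0 * n / total for p, n in counts.items()} if total else {}


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size fixed.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy sequences in one-letter codes, not the real phage proteome.
toy = ["MKAAL", "AAGSL", "MSLLT", "GAAKV"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the spread reflects how much the estimate depends on which proteins happen to be in the proteome.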

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski