Amino acid dipeptide frequency for Achromobacter phage phiAxp-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
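As a minimal sketch of how such frequencies can be computed, the function below counts overlapping residue pairs within each protein and reports them in per mille. It uses one-letter amino acid codes rather than the three-letter codes of the table, and it normalizes by the total number of counted dipeptides; both choices, as well as the names `dipeptide_frequencies` and `toy`, are assumptions made for illustration and not necessarily what the database does.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome: "MAAG" gives MA, AA, AG; "MAL" gives MA, AL.
toy = ["MAAG", "MAL"]
freqs = dipeptide_frequencies(toy)
print(freqs["MA"])  # 2 of 5 dipeptides
```

Counting per protein avoids creating artificial dipeptides at the junction between two unrelated sequences.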

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides were equally likely in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
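The expected value and the over/under-representation call can be sketched as follows. The comparison against the uniform expectation is an assumption based on the description above; the exact rule used for the colouring is not stated, and the function names are hypothetical.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per mille frequency if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_permille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_permille(alphabet_size)
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


print(uniform_expectation_permille(20))  # 400 dipeptides
print(uniform_expectation_permille(21))  # 441 dipeptides when Xaa is included
print(classify(11.558))  # AlaAla value from the table
print(classify(0.28))    # CysCys value from the table
```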

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions. For example, 11.558‰ corresponds to a fraction of 0.011558.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.558 ± 1.643
AlaCys: 0.911 ± 0.329
AlaAsp: 5.604 ± 0.527
AlaGlu: 6.865 ± 1.081
AlaPhe: 3.362 ± 0.539
AlaGly: 7.425 ± 0.869
AlaHis: 1.891 ± 0.383
AlaIle: 5.254 ± 0.508
AlaLys: 5.254 ± 0.711
AlaLeu: 8.126 ± 0.824
AlaMet: 3.713 ± 0.493
AlaAsn: 4.203 ± 0.467
AlaPro: 4.203 ± 0.671
AlaGln: 4.273 ± 0.771
AlaArg: 5.604 ± 0.656
AlaSer: 5.184 ± 0.654
AlaThr: 6.024 ± 0.835
AlaVal: 6.164 ± 0.662
AlaTrp: 1.331 ± 0.29
AlaTyr: 3.082 ± 0.493
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.63 ± 0.219
CysCys: 0.28 ± 0.152
CysAsp: 0.28 ± 0.163
CysGlu: 0.49 ± 0.183
CysPhe: 0.14 ± 0.097
CysGly: 0.771 ± 0.31
CysHis: 0.07 ± 0.076
CysIle: 0.42 ± 0.154
CysLys: 0.14 ± 0.1
CysLeu: 0.56 ± 0.186
CysMet: 0.21 ± 0.119
CysAsn: 0.28 ± 0.156
CysPro: 0.56 ± 0.167
CysGln: 0.35 ± 0.14
CysArg: 0.63 ± 0.279
CysSer: 0.28 ± 0.14
CysThr: 0.42 ± 0.185
CysVal: 0.911 ± 0.299
CysTrp: 0.28 ± 0.157
CysTyr: 0.35 ± 0.125
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.075 ± 0.811
AspCys: 0.7 ± 0.211
AspAsp: 3.292 ± 0.694
AspGlu: 3.713 ± 0.624
AspPhe: 1.541 ± 0.302
AspGly: 4.763 ± 0.619
AspHis: 0.7 ± 0.226
AspIle: 2.031 ± 0.461
AspLys: 2.942 ± 0.438
AspLeu: 3.642 ± 0.553
AspMet: 1.611 ± 0.351
AspAsn: 1.751 ± 0.334
AspPro: 2.242 ± 0.366
AspGln: 1.611 ± 0.249
AspArg: 3.362 ± 0.403
AspSer: 2.732 ± 0.334
AspThr: 3.853 ± 0.579
AspVal: 4.203 ± 0.475
AspTrp: 0.911 ± 0.174
AspTyr: 1.751 ± 0.363
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.324 ± 0.723
GluCys: 0.63 ± 0.21
GluAsp: 2.732 ± 0.427
GluGlu: 2.592 ± 0.455
GluPhe: 3.222 ± 0.552
GluGly: 3.642 ± 0.497
GluHis: 1.051 ± 0.218
GluIle: 3.853 ± 0.534
GluLys: 3.082 ± 0.521
GluLeu: 6.725 ± 0.733
GluMet: 2.592 ± 0.355
GluAsn: 2.242 ± 0.325
GluPro: 1.891 ± 0.338
GluGln: 3.642 ± 0.397
GluArg: 3.783 ± 0.454
GluSer: 3.783 ± 0.511
GluThr: 3.993 ± 0.484
GluVal: 2.732 ± 0.454
GluTrp: 1.261 ± 0.277
GluTyr: 2.452 ± 0.293
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.082 ± 0.521
PheCys: 0.28 ± 0.135
PheAsp: 3.572 ± 0.44
PheGlu: 3.012 ± 0.388
PhePhe: 1.331 ± 0.342
PheGly: 2.592 ± 0.386
PheHis: 0.56 ± 0.203
PheIle: 2.101 ± 0.416
PheLys: 1.261 ± 0.39
PheLeu: 2.802 ± 0.498
PheMet: 1.121 ± 0.278
PheAsn: 1.821 ± 0.411
PhePro: 1.751 ± 0.331
PheGln: 1.051 ± 0.228
PheArg: 2.171 ± 0.312
PheSer: 2.312 ± 0.462
PheThr: 2.101 ± 0.444
PheVal: 3.432 ± 0.537
PheTrp: 0.35 ± 0.154
PheTyr: 1.401 ± 0.299
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.374 ± 0.79
GlyCys: 0.981 ± 0.291
GlyAsp: 4.063 ± 0.501
GlyGlu: 4.203 ± 0.567
GlyPhe: 3.082 ± 0.409
GlyGly: 7.495 ± 1.884
GlyHis: 0.841 ± 0.203
GlyIle: 3.783 ± 0.508
GlyLys: 5.113 ± 0.621
GlyLeu: 5.184 ± 0.629
GlyMet: 3.012 ± 0.536
GlyAsn: 3.642 ± 0.539
GlyPro: 3.993 ± 0.644
GlyGln: 3.012 ± 0.441
GlyArg: 4.483 ± 0.706
GlySer: 5.113 ± 0.702
GlyThr: 4.063 ± 0.498
GlyVal: 6.094 ± 0.687
GlyTrp: 1.541 ± 0.295
GlyTyr: 3.713 ± 0.506
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.541 ± 0.336
HisCys: 0.14 ± 0.102
HisAsp: 0.7 ± 0.186
HisGlu: 0.7 ± 0.225
HisPhe: 0.7 ± 0.246
HisGly: 1.821 ± 0.382
HisHis: 0.07 ± 0.073
HisIle: 1.051 ± 0.251
HisLys: 0.63 ± 0.224
HisLeu: 0.911 ± 0.339
HisMet: 0.28 ± 0.12
HisAsn: 0.56 ± 0.167
HisPro: 1.191 ± 0.302
HisGln: 0.49 ± 0.201
HisArg: 1.191 ± 0.282
HisSer: 1.051 ± 0.26
HisThr: 1.191 ± 0.271
HisVal: 1.401 ± 0.364
HisTrp: 0.0 ± 0.0
HisTyr: 0.7 ± 0.247
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.234 ± 0.608
IleCys: 0.63 ± 0.232
IleAsp: 3.012 ± 0.506
IleGlu: 4.623 ± 0.491
IlePhe: 1.191 ± 0.223
IleGly: 3.993 ± 0.513
IleHis: 0.841 ± 0.217
IleIle: 2.802 ± 0.472
IleLys: 2.942 ± 0.396
IleLeu: 2.942 ± 0.488
IleMet: 0.911 ± 0.227
IleAsn: 2.872 ± 0.385
IlePro: 2.872 ± 0.415
IleGln: 2.382 ± 0.344
IleArg: 3.082 ± 0.395
IleSer: 2.802 ± 0.46
IleThr: 3.502 ± 0.552
IleVal: 3.572 ± 0.434
IleTrp: 0.28 ± 0.128
IleTyr: 1.541 ± 0.324
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.324 ± 0.699
LysCys: 0.63 ± 0.173
LysAsp: 2.942 ± 0.441
LysGlu: 2.171 ± 0.493
LysPhe: 2.171 ± 0.351
LysGly: 3.993 ± 0.599
LysHis: 1.261 ± 0.415
LysIle: 3.222 ± 0.374
LysLys: 2.242 ± 0.41
LysLeu: 4.553 ± 0.842
LysMet: 2.101 ± 0.444
LysAsn: 2.242 ± 0.506
LysPro: 2.312 ± 0.505
LysGln: 2.101 ± 0.447
LysArg: 3.222 ± 0.551
LysSer: 2.031 ± 0.355
LysThr: 3.432 ± 0.56
LysVal: 3.222 ± 0.449
LysTrp: 0.911 ± 0.328
LysTyr: 1.891 ± 0.401
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.915 ± 0.816
LeuCys: 0.35 ± 0.213
LeuAsp: 3.642 ± 0.497
LeuGlu: 4.693 ± 0.644
LeuPhe: 2.522 ± 0.333
LeuGly: 4.693 ± 0.451
LeuHis: 1.261 ± 0.336
LeuIle: 3.572 ± 0.506
LeuLys: 4.623 ± 0.706
LeuLeu: 4.553 ± 0.628
LeuMet: 2.452 ± 0.419
LeuAsn: 4.273 ± 0.486
LeuPro: 4.413 ± 0.633
LeuGln: 2.662 ± 0.503
LeuArg: 5.324 ± 0.62
LeuSer: 4.483 ± 0.506
LeuThr: 4.833 ± 0.553
LeuVal: 3.292 ± 0.375
LeuTrp: 0.981 ± 0.232
LeuTyr: 2.242 ± 0.397
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.082 ± 0.389
MetCys: 0.21 ± 0.129
MetAsp: 1.751 ± 0.422
MetGlu: 1.401 ± 0.263
MetPhe: 1.331 ± 0.235
MetGly: 1.961 ± 0.332
MetHis: 0.42 ± 0.16
MetIle: 1.961 ± 0.35
MetLys: 2.522 ± 0.437
MetLeu: 3.012 ± 0.402
MetMet: 0.841 ± 0.184
MetAsn: 2.101 ± 0.39
MetPro: 1.611 ± 0.308
MetGln: 1.191 ± 0.344
MetArg: 1.961 ± 0.37
MetSer: 2.522 ± 0.373
MetThr: 1.961 ± 0.334
MetVal: 1.681 ± 0.34
MetTrp: 0.21 ± 0.117
MetTyr: 0.841 ± 0.2
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.923 ± 0.54
AsnCys: 0.42 ± 0.211
AsnAsp: 2.452 ± 0.401
AsnGlu: 2.872 ± 0.352
AsnPhe: 1.541 ± 0.409
AsnGly: 5.814 ± 0.795
AsnHis: 0.35 ± 0.139
AsnIle: 2.312 ± 0.392
AsnLys: 2.382 ± 0.367
AsnLeu: 2.872 ± 0.371
AsnMet: 2.452 ± 0.459
AsnAsn: 2.592 ± 0.485
AsnPro: 3.152 ± 0.509
AsnGln: 1.471 ± 0.417
AsnArg: 2.662 ± 0.451
AsnSer: 2.171 ± 0.458
AsnThr: 2.522 ± 0.355
AsnVal: 3.292 ± 0.532
AsnTrp: 0.49 ± 0.149
AsnTyr: 1.471 ± 0.319
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.903 ± 0.756
ProCys: 0.56 ± 0.205
ProAsp: 3.572 ± 0.445
ProGlu: 3.432 ± 0.43
ProPhe: 1.891 ± 0.388
ProGly: 4.063 ± 0.619
ProHis: 1.051 ± 0.26
ProIle: 1.891 ± 0.336
ProLys: 2.522 ± 0.447
ProLeu: 2.592 ± 0.466
ProMet: 0.771 ± 0.194
ProAsn: 3.012 ± 0.625
ProPro: 2.101 ± 0.453
ProGln: 1.681 ± 0.412
ProArg: 1.961 ± 0.318
ProSer: 3.082 ± 0.443
ProThr: 2.802 ± 0.468
ProVal: 4.413 ± 0.657
ProTrp: 0.35 ± 0.14
ProTyr: 1.611 ± 0.358
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.973 ± 0.73
GlnCys: 0.07 ± 0.076
GlnAsp: 1.401 ± 0.378
GlnGlu: 1.541 ± 0.313
GlnPhe: 1.331 ± 0.347
GlnGly: 2.732 ± 0.488
GlnHis: 0.911 ± 0.243
GlnIle: 2.171 ± 0.634
GlnLys: 1.961 ± 0.316
GlnLeu: 3.642 ± 0.463
GlnMet: 0.841 ± 0.27
GlnAsn: 1.121 ± 0.384
GlnPro: 1.821 ± 0.333
GlnGln: 2.522 ± 1.077
GlnArg: 3.432 ± 0.443
GlnSer: 1.891 ± 0.444
GlnThr: 2.171 ± 0.4
GlnVal: 2.802 ± 0.553
GlnTrp: 0.911 ± 0.251
GlnTyr: 1.401 ± 0.306
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.464 ± 0.697
ArgCys: 0.42 ± 0.172
ArgAsp: 3.362 ± 0.451
ArgGlu: 4.623 ± 0.62
ArgPhe: 2.872 ± 0.364
ArgGly: 3.853 ± 0.431
ArgHis: 0.911 ± 0.29
ArgIle: 3.362 ± 0.522
ArgLys: 2.242 ± 0.441
ArgLeu: 3.853 ± 0.468
ArgMet: 2.732 ± 0.424
ArgAsn: 3.432 ± 0.531
ArgPro: 2.452 ± 0.605
ArgGln: 1.891 ± 0.313
ArgArg: 3.292 ± 0.539
ArgSer: 3.783 ± 0.551
ArgThr: 2.872 ± 0.445
ArgVal: 4.693 ± 0.508
ArgTrp: 0.911 ± 0.197
ArgTyr: 2.522 ± 0.493
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.324 ± 0.825
SerCys: 0.14 ± 0.091
SerAsp: 2.942 ± 0.456
SerGlu: 3.082 ± 0.416
SerPhe: 2.662 ± 0.418
SerGly: 4.973 ± 0.673
SerHis: 0.981 ± 0.233
SerIle: 3.292 ± 0.314
SerLys: 3.502 ± 0.422
SerLeu: 3.783 ± 0.522
SerMet: 1.891 ± 0.324
SerAsn: 2.802 ± 0.563
SerPro: 2.312 ± 0.47
SerGln: 2.101 ± 0.256
SerArg: 2.522 ± 0.414
SerSer: 3.152 ± 0.424
SerThr: 3.572 ± 0.547
SerVal: 3.713 ± 0.412
SerTrp: 0.771 ± 0.201
SerTyr: 2.592 ± 0.376
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.444 ± 0.943
ThrCys: 0.14 ± 0.101
ThrAsp: 2.522 ± 0.331
ThrGlu: 3.783 ± 0.608
ThrPhe: 2.592 ± 0.489
ThrGly: 4.973 ± 0.548
ThrHis: 0.841 ± 0.262
ThrIle: 3.222 ± 0.474
ThrLys: 2.452 ± 0.409
ThrLeu: 4.553 ± 0.48
ThrMet: 1.681 ± 0.315
ThrAsn: 2.872 ± 0.5
ThrPro: 3.783 ± 0.461
ThrGln: 2.592 ± 0.479
ThrArg: 3.432 ± 0.528
ThrSer: 3.292 ± 0.581
ThrThr: 3.713 ± 0.79
ThrVal: 4.973 ± 0.711
ThrTrp: 0.56 ± 0.201
ThrTyr: 2.031 ± 0.391
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.304 ± 0.629
ValCys: 0.0 ± 0.0
ValAsp: 3.642 ± 0.387
ValGlu: 5.324 ± 0.587
ValPhe: 2.872 ± 0.431
ValGly: 5.394 ± 0.638
ValHis: 1.191 ± 0.291
ValIle: 3.642 ± 0.391
ValLys: 4.063 ± 0.465
ValLeu: 5.043 ± 0.48
ValMet: 1.821 ± 0.302
ValAsn: 3.222 ± 0.462
ValPro: 2.942 ± 0.585
ValGln: 2.382 ± 0.427
ValArg: 4.203 ± 0.438
ValSer: 4.133 ± 0.459
ValThr: 4.623 ± 0.612
ValVal: 5.394 ± 0.68
ValTrp: 1.261 ± 0.234
ValTyr: 2.592 ± 0.444
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.331 ± 0.301
TrpCys: 0.21 ± 0.098
TrpAsp: 0.911 ± 0.258
TrpGlu: 0.49 ± 0.17
TrpPhe: 0.63 ± 0.19
TrpGly: 1.401 ± 0.354
TrpHis: 0.28 ± 0.154
TrpIle: 0.841 ± 0.179
TrpLys: 0.63 ± 0.21
TrpLeu: 0.7 ± 0.185
TrpMet: 0.28 ± 0.115
TrpAsn: 0.771 ± 0.22
TrpPro: 0.771 ± 0.292
TrpGln: 0.911 ± 0.259
TrpArg: 0.771 ± 0.254
TrpSer: 0.42 ± 0.151
TrpThr: 0.7 ± 0.226
TrpVal: 1.331 ± 0.249
TrpTrp: 0.35 ± 0.149
TrpTyr: 0.56 ± 0.187
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.222 ± 0.586
TyrCys: 0.35 ± 0.172
TyrAsp: 2.031 ± 0.415
TyrGlu: 1.611 ± 0.328
TyrPhe: 1.121 ± 0.214
TyrGly: 3.432 ± 0.608
TyrHis: 0.841 ± 0.281
TyrIle: 2.312 ± 0.35
TyrLys: 1.471 ± 0.569
TyrLeu: 2.732 ± 0.478
TyrMet: 1.191 ± 0.288
TyrAsn: 1.401 ± 0.316
TyrPro: 1.891 ± 0.401
TyrGln: 1.331 ± 0.295
TyrArg: 2.452 ± 0.455
TyrSer: 1.891 ± 0.45
TyrThr: 2.101 ± 0.286
TyrVal: 2.732 ± 0.389
TyrTrp: 0.63 ± 0.177
TyrTyr: 1.401 ± 0.455
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (14277 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
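A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resulting estimates is reported. Taking the sample standard deviation as the error, the one-letter codes, and the function names are assumptions for illustration; the exact procedure used by the database is not described here.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, pair):
    """Per mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, not single residues.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, pair))
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter codes.
toy = ["MAAG", "MAL", "GAAK"]
print(bootstrap_error(toy, "AA"))
```

Resampling at the protein level keeps each sequence intact, so the error reflects variation between proteins rather than between individual positions.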

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in UniProt
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski