Amino acid dipeptide frequency for Streptococcus phage 8140

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
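As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs in a list of protein sequences. The function name, the one-letter input alphabet, and the choice to normalise by the total number of dipeptides are assumptions made for this example; the page does not state the exact procedure used by the database.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Assumption: frequencies are normalised by the total number of dipeptides.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences): pairs are MK, KK, KL, AK, KK.
freqs = dipeptide_frequencies(["MKKL", "AKK"])
print(freqs["KK"])  # KK occurs 2 times among 5 pairs -> 400.0
```

Counting within each protein separately avoids creating artificial pairs between the last residue of one protein and the first residue of the next.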

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
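The comparison against the uniform expectation can be sketched as follows. The threshold is simply 1/400 expressed in per mille; the function name and the plain greater-than/less-than rule are assumptions for illustration, since the page does not say whether the colouring also takes the error estimate into account.

```python
N_STANDARD = 20
# Uniform expectation over 20 x 20 dipeptides: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / N_STANDARD ** 2

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"      # would be marked in red
    if freq_per_mille < expected:
        return "under"     # would be marked in blue
    return "expected"

# A few values taken from the table (in per mille).
examples = {"GluLeu": 9.259, "LysGlu": 8.305, "CysCys": 0.095, "TrpTrp": 0.191}
for pair, value in examples.items():
    # Ratio to the expectation shows how strong the deviation is.
    print(pair, classify(value), round(value / EXPECTED_PER_MILLE, 2))
```

If Xaa is included as a 21st residue, the same function can be called with `expected=1000.0 / 441`.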

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
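A minimal sketch of the unit conversion, using one value from the table; the helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

glu_leu = 9.259  # GluLeu frequency from the table, in per mille
print(per_mille_to_fraction(glu_leu))  # roughly 0.0093 as a fraction
print(per_mille_to_percent(glu_leu))   # roughly 0.93 %
```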

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.577 ± 0.89
AlaCys: 0.286 ± 0.166
AlaAsp: 6.109 ± 0.634
AlaGlu: 6.586 ± 0.729
AlaPhe: 2.864 ± 0.777
AlaGly: 4.391 ± 1.182
AlaHis: 0.477 ± 0.202
AlaIle: 4.773 ± 0.976
AlaLys: 5.441 ± 0.817
AlaLeu: 6.586 ± 0.871
AlaMet: 2.1 ± 0.478
AlaAsn: 3.246 ± 0.665
AlaPro: 1.814 ± 0.448
AlaGln: 2.577 ± 0.503
AlaArg: 2.482 ± 0.42
AlaSer: 1.909 ± 0.497
AlaThr: 4.009 ± 0.686
AlaVal: 5.823 ± 0.898
AlaTrp: 1.336 ± 0.513
AlaTyr: 1.623 ± 0.293
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.191 ± 0.102
CysCys: 0.095 ± 0.088
CysAsp: 0.477 ± 0.223
CysGlu: 0.668 ± 0.237
CysPhe: 0.095 ± 0.07
CysGly: 0.477 ± 0.311
CysHis: 0.191 ± 0.13
CysIle: 0.573 ± 0.284
CysLys: 0.859 ± 0.239
CysLeu: 0.382 ± 0.192
CysMet: 0.095 ± 0.122
CysAsn: 0.191 ± 0.177
CysPro: 0.286 ± 0.183
CysGln: 0.382 ± 0.183
CysArg: 0.477 ± 0.166
CysSer: 0.382 ± 0.172
CysThr: 0.286 ± 0.157
CysVal: 0.191 ± 0.121
CysTrp: 0.095 ± 0.088
CysTyr: 0.477 ± 0.222
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.818 ± 0.702
AspCys: 0.668 ± 0.256
AspAsp: 3.436 ± 0.672
AspGlu: 5.059 ± 0.936
AspPhe: 3.055 ± 0.506
AspGly: 4.677 ± 0.591
AspHis: 0.286 ± 0.175
AspIle: 5.441 ± 0.669
AspLys: 5.155 ± 0.799
AspLeu: 5.727 ± 0.814
AspMet: 1.909 ± 0.409
AspAsn: 3.15 ± 0.475
AspPro: 2.291 ± 0.454
AspGln: 1.718 ± 0.424
AspArg: 2.959 ± 0.629
AspSer: 3.723 ± 0.528
AspThr: 4.009 ± 0.475
AspVal: 3.818 ± 0.73
AspTrp: 1.623 ± 0.57
AspTyr: 3.15 ± 0.678
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.441 ± 0.782
GluCys: 0.668 ± 0.243
GluAsp: 4.486 ± 0.876
GluGlu: 6.205 ± 1.341
GluPhe: 3.627 ± 0.693
GluGly: 3.436 ± 0.47
GluHis: 1.241 ± 0.291
GluIle: 6.3 ± 0.794
GluLys: 6.205 ± 0.975
GluLeu: 9.259 ± 1.106
GluMet: 1.909 ± 0.499
GluAsn: 4.105 ± 0.598
GluPro: 1.814 ± 0.484
GluGln: 4.105 ± 0.665
GluArg: 5.25 ± 0.661
GluSer: 4.868 ± 0.662
GluThr: 3.436 ± 0.527
GluVal: 5.536 ± 0.64
GluTrp: 0.668 ± 0.244
GluTyr: 2.768 ± 0.498
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.577 ± 0.554
PheCys: 0.382 ± 0.192
PheAsp: 4.391 ± 0.547
PheGlu: 3.723 ± 0.699
PhePhe: 1.718 ± 0.399
PheGly: 2.291 ± 0.757
PheHis: 0.191 ± 0.136
PheIle: 2.577 ± 0.494
PheLys: 3.246 ± 0.544
PheLeu: 2.482 ± 0.426
PheMet: 1.336 ± 0.395
PheAsn: 2.482 ± 0.504
PhePro: 0.668 ± 0.268
PheGln: 1.814 ± 0.373
PheArg: 0.955 ± 0.277
PheSer: 3.723 ± 0.751
PheThr: 2.577 ± 0.505
PheVal: 1.336 ± 0.405
PheTrp: 0.477 ± 0.238
PheTyr: 2.195 ± 0.503
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.959 ± 0.613
GlyCys: 0.0 ± 0.0
GlyAsp: 3.723 ± 0.563
GlyGlu: 5.25 ± 0.774
GlyPhe: 2.959 ± 0.763
GlyGly: 4.773 ± 1.604
GlyHis: 0.477 ± 0.197
GlyIle: 4.105 ± 0.703
GlyLys: 5.25 ± 0.609
GlyLeu: 5.346 ± 1.333
GlyMet: 1.527 ± 0.354
GlyAsn: 4.2 ± 0.624
GlyPro: 0.859 ± 0.287
GlyGln: 3.341 ± 0.464
GlyArg: 3.627 ± 0.542
GlySer: 4.009 ± 0.897
GlyThr: 2.768 ± 0.42
GlyVal: 4.391 ± 0.645
GlyTrp: 0.668 ± 0.197
GlyTyr: 2.864 ± 0.55
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.955 ± 0.329
HisCys: 0.095 ± 0.096
HisAsp: 0.764 ± 0.317
HisGlu: 1.241 ± 0.279
HisPhe: 0.859 ± 0.235
HisGly: 0.668 ± 0.248
HisHis: 0.191 ± 0.137
HisIle: 0.286 ± 0.21
HisLys: 0.477 ± 0.186
HisLeu: 1.05 ± 0.295
HisMet: 0.286 ± 0.153
HisAsn: 1.05 ± 0.335
HisPro: 0.668 ± 0.208
HisGln: 0.764 ± 0.283
HisArg: 0.573 ± 0.18
HisSer: 1.336 ± 0.372
HisThr: 0.668 ± 0.234
HisVal: 0.764 ± 0.268
HisTrp: 0.191 ± 0.14
HisTyr: 0.477 ± 0.271
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.25 ± 0.914
IleCys: 0.286 ± 0.124
IleAsp: 4.105 ± 0.742
IleGlu: 6.109 ± 0.746
IlePhe: 3.055 ± 0.658
IleGly: 4.2 ± 1.129
IleHis: 0.382 ± 0.213
IleIle: 3.723 ± 0.683
IleLys: 6.968 ± 0.926
IleLeu: 4.677 ± 0.586
IleMet: 1.241 ± 0.425
IleAsn: 4.009 ± 0.6
IlePro: 1.814 ± 0.486
IleGln: 2.386 ± 0.419
IleArg: 2.768 ± 0.482
IleSer: 5.155 ± 0.756
IleThr: 4.391 ± 0.671
IleVal: 3.341 ± 0.525
IleTrp: 0.286 ± 0.166
IleTyr: 2.1 ± 0.572
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.964 ± 0.811
LysCys: 0.477 ± 0.222
LysAsp: 6.396 ± 0.638
LysGlu: 8.305 ± 0.804
LysPhe: 2.864 ± 0.489
LysGly: 3.818 ± 0.59
LysHis: 1.432 ± 0.38
LysIle: 6.109 ± 0.91
LysLys: 6.682 ± 0.671
LysLeu: 6.491 ± 0.76
LysMet: 2.673 ± 0.465
LysAsn: 4.868 ± 0.638
LysPro: 2.673 ± 0.644
LysGln: 3.723 ± 0.695
LysArg: 3.627 ± 0.496
LysSer: 3.914 ± 0.46
LysThr: 4.009 ± 0.563
LysVal: 6.396 ± 0.74
LysTrp: 0.764 ± 0.287
LysTyr: 3.341 ± 0.611
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.968 ± 1.234
LeuCys: 0.764 ± 0.303
LeuAsp: 6.014 ± 0.808
LeuGlu: 6.968 ± 0.988
LeuPhe: 2.768 ± 0.511
LeuGly: 6.396 ± 1.508
LeuHis: 1.432 ± 0.369
LeuIle: 4.2 ± 0.654
LeuLys: 7.064 ± 0.796
LeuLeu: 6.968 ± 1.009
LeuMet: 2.005 ± 0.383
LeuAsn: 3.627 ± 0.463
LeuPro: 2.1 ± 0.484
LeuGln: 3.246 ± 0.605
LeuArg: 4.009 ± 0.752
LeuSer: 5.155 ± 0.839
LeuThr: 4.582 ± 0.517
LeuVal: 3.627 ± 0.567
LeuTrp: 0.668 ± 0.316
LeuTyr: 2.673 ± 0.4
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.909 ± 0.402
MetCys: 0.191 ± 0.129
MetAsp: 1.623 ± 0.323
MetGlu: 1.909 ± 0.434
MetPhe: 0.859 ± 0.235
MetGly: 1.623 ± 0.451
MetHis: 0.382 ± 0.259
MetIle: 2.195 ± 0.463
MetLys: 2.864 ± 0.704
MetLeu: 1.527 ± 0.309
MetMet: 0.286 ± 0.164
MetAsn: 1.527 ± 0.376
MetPro: 0.955 ± 0.262
MetGln: 1.432 ± 0.435
MetArg: 1.145 ± 0.311
MetSer: 0.955 ± 0.293
MetThr: 1.814 ± 0.398
MetVal: 1.241 ± 0.3
MetTrp: 0.286 ± 0.159
MetTyr: 0.573 ± 0.174
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.677 ± 1.045
AsnCys: 0.286 ± 0.15
AsnAsp: 2.864 ± 0.547
AsnGlu: 3.723 ± 0.707
AsnPhe: 1.527 ± 0.442
AsnGly: 4.2 ± 0.543
AsnHis: 1.241 ± 0.414
AsnIle: 3.532 ± 0.588
AsnLys: 4.296 ± 0.562
AsnLeu: 3.723 ± 0.699
AsnMet: 1.05 ± 0.277
AsnAsn: 2.673 ± 0.627
AsnPro: 1.909 ± 0.418
AsnGln: 2.768 ± 0.547
AsnArg: 2.673 ± 0.601
AsnSer: 3.15 ± 0.667
AsnThr: 3.436 ± 0.624
AsnVal: 3.341 ± 0.412
AsnTrp: 0.859 ± 0.207
AsnTyr: 1.623 ± 0.351
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.005 ± 0.468
ProCys: 0.095 ± 0.087
ProAsp: 2.005 ± 0.432
ProGlu: 3.055 ± 0.451
ProPhe: 1.432 ± 0.436
ProGly: 1.241 ± 0.299
ProHis: 0.573 ± 0.235
ProIle: 1.623 ± 0.424
ProLys: 2.291 ± 0.433
ProLeu: 1.432 ± 0.341
ProMet: 0.477 ± 0.228
ProAsn: 1.05 ± 0.4
ProPro: 1.05 ± 0.358
ProGln: 1.336 ± 0.413
ProArg: 1.527 ± 0.344
ProSer: 1.05 ± 0.449
ProThr: 0.955 ± 0.324
ProVal: 1.909 ± 0.397
ProTrp: 0.477 ± 0.201
ProTyr: 1.623 ± 0.45
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.864 ± 0.529
GlnCys: 0.382 ± 0.189
GlnAsp: 2.1 ± 0.437
GlnGlu: 3.532 ± 0.676
GlnPhe: 1.145 ± 0.316
GlnGly: 2.195 ± 0.418
GlnHis: 0.286 ± 0.155
GlnIle: 3.723 ± 0.603
GlnLys: 3.818 ± 0.554
GlnLeu: 3.436 ± 0.509
GlnMet: 1.05 ± 0.258
GlnAsn: 2.005 ± 0.419
GlnPro: 1.05 ± 0.309
GlnGln: 2.005 ± 0.419
GlnArg: 1.909 ± 0.422
GlnSer: 2.768 ± 0.447
GlnThr: 2.386 ± 0.459
GlnVal: 4.582 ± 0.748
GlnTrp: 0.573 ± 0.24
GlnTyr: 1.145 ± 0.338
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.15 ± 0.411
ArgCys: 0.382 ± 0.159
ArgAsp: 2.291 ± 0.526
ArgGlu: 2.673 ± 0.539
ArgPhe: 2.005 ± 0.48
ArgGly: 2.1 ± 0.528
ArgHis: 0.668 ± 0.234
ArgIle: 3.15 ± 0.539
ArgLys: 4.105 ± 0.749
ArgLeu: 5.059 ± 0.753
ArgMet: 2.482 ± 0.511
ArgAsn: 2.482 ± 0.534
ArgPro: 0.764 ± 0.254
ArgGln: 2.482 ± 0.498
ArgArg: 2.482 ± 0.556
ArgSer: 2.673 ± 0.523
ArgThr: 2.577 ± 0.551
ArgVal: 2.864 ± 0.443
ArgTrp: 0.286 ± 0.185
ArgTyr: 1.814 ± 0.415
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.486 ± 1.022
SerCys: 0.286 ± 0.216
SerAsp: 3.818 ± 0.606
SerGlu: 3.723 ± 0.728
SerPhe: 2.005 ± 0.446
SerGly: 4.868 ± 0.759
SerHis: 1.145 ± 0.363
SerIle: 4.009 ± 0.611
SerLys: 4.486 ± 0.656
SerLeu: 5.25 ± 0.629
SerMet: 1.527 ± 0.437
SerAsn: 3.246 ± 0.622
SerPro: 1.432 ± 0.319
SerGln: 2.195 ± 0.456
SerArg: 2.959 ± 0.588
SerSer: 4.105 ± 0.861
SerThr: 3.818 ± 0.572
SerVal: 3.532 ± 0.669
SerTrp: 1.05 ± 0.414
SerTyr: 2.482 ± 0.625
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.582 ± 0.79
ThrCys: 0.191 ± 0.14
ThrAsp: 4.391 ± 0.614
ThrGlu: 3.436 ± 0.531
ThrPhe: 2.864 ± 0.564
ThrGly: 3.532 ± 0.582
ThrHis: 0.764 ± 0.349
ThrIle: 3.914 ± 0.71
ThrLys: 3.723 ± 0.737
ThrLeu: 4.582 ± 0.817
ThrMet: 0.668 ± 0.21
ThrAsn: 3.341 ± 0.488
ThrPro: 1.145 ± 0.352
ThrGln: 2.673 ± 0.485
ThrArg: 1.718 ± 0.375
ThrSer: 3.914 ± 0.596
ThrThr: 5.155 ± 0.868
ThrVal: 4.773 ± 0.855
ThrTrp: 0.764 ± 0.291
ThrTyr: 2.291 ± 0.466
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.155 ± 0.649
ValCys: 0.382 ± 0.199
ValAsp: 3.627 ± 0.654
ValGlu: 5.059 ± 0.705
ValPhe: 2.673 ± 0.442
ValGly: 5.536 ± 0.834
ValHis: 0.764 ± 0.265
ValIle: 2.864 ± 0.532
ValLys: 5.823 ± 0.704
ValLeu: 4.296 ± 0.54
ValMet: 1.432 ± 0.378
ValAsn: 3.914 ± 0.836
ValPro: 2.195 ± 0.422
ValGln: 1.718 ± 0.469
ValArg: 2.482 ± 0.43
ValSer: 4.773 ± 0.63
ValThr: 4.964 ± 0.676
ValVal: 4.773 ± 0.807
ValTrp: 0.573 ± 0.2
ValTyr: 3.246 ± 0.575
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.241 ± 0.384
TrpCys: 0.191 ± 0.109
TrpAsp: 0.573 ± 0.289
TrpGlu: 0.764 ± 0.27
TrpPhe: 0.955 ± 0.515
TrpGly: 0.573 ± 0.212
TrpHis: 0.191 ± 0.147
TrpIle: 0.668 ± 0.349
TrpLys: 1.05 ± 0.322
TrpLeu: 0.573 ± 0.318
TrpMet: 0.382 ± 0.173
TrpAsn: 0.859 ± 0.329
TrpPro: 0.191 ± 0.136
TrpGln: 0.477 ± 0.356
TrpArg: 0.859 ± 0.341
TrpSer: 0.382 ± 0.156
TrpThr: 0.764 ± 0.302
TrpVal: 1.05 ± 0.319
TrpTrp: 0.191 ± 0.121
TrpTyr: 0.191 ± 0.121
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.241 ± 0.262
TyrCys: 0.764 ± 0.35
TyrAsp: 2.482 ± 0.447
TyrGlu: 3.341 ± 0.734
TyrPhe: 1.814 ± 0.61
TyrGly: 2.1 ± 0.378
TyrHis: 1.05 ± 0.329
TyrIle: 2.577 ± 0.507
TyrLys: 3.723 ± 0.672
TyrLeu: 2.386 ± 0.396
TyrMet: 0.764 ± 0.35
TyrAsn: 1.527 ± 0.392
TyrPro: 1.527 ± 0.458
TyrGln: 1.814 ± 0.438
TyrArg: 2.005 ± 0.527
TyrSer: 2.482 ± 0.613
TyrThr: 1.814 ± 0.384
TyrVal: 2.959 ± 0.633
TyrTrp: 0.286 ± 0.147
TyrTyr: 1.145 ± 0.376
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (10477 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level
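A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported. Using the sample standard deviation as the error, the function names, and the normalisation by the total number of dipeptides are all assumptions for this example; the page only states that bootstrapping (×100) at the protein level was used.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency of one dipeptide in per mille over all overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (not single residues).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (hypothetical sequences, one-letter code).
proteome = ["MKKLAEEL", "MAEKKL", "MEELKK"]
print(pair_frequency(proteome, "KK"), bootstrap_error(proteome, "KK"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in the proteome.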

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski