Amino acid dipepetide frequency for Oenococcus phage phiOE33PA

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.806AlaAla: 4.806 ± 1.144
0.481AlaCys: 0.481 ± 0.207
3.845AlaAsp: 3.845 ± 0.556
3.605AlaGlu: 3.605 ± 0.841
3.845AlaPhe: 3.845 ± 0.53
5.127AlaGly: 5.127 ± 0.834
1.202AlaHis: 1.202 ± 0.352
5.767AlaIle: 5.767 ± 0.81
6.408AlaLys: 6.408 ± 1.215
6.488AlaLeu: 6.488 ± 0.658
1.522AlaMet: 1.522 ± 0.401
4.165AlaAsn: 4.165 ± 0.873
1.282AlaPro: 1.282 ± 0.307
2.403AlaGln: 2.403 ± 0.537
2.403AlaArg: 2.403 ± 0.414
5.447AlaSer: 5.447 ± 0.826
4.085AlaThr: 4.085 ± 0.508
3.925AlaVal: 3.925 ± 0.578
0.801AlaTrp: 0.801 ± 0.188
2.884AlaTyr: 2.884 ± 0.46
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.16CysAsp: 0.16 ± 0.107
0.16CysGlu: 0.16 ± 0.122
0.32CysPhe: 0.32 ± 0.165
0.16CysGly: 0.16 ± 0.117
0.24CysHis: 0.24 ± 0.175
0.401CysIle: 0.401 ± 0.183
0.32CysLys: 0.32 ± 0.178
1.282CysLeu: 1.282 ± 0.309
0.0CysMet: 0.0 ± 0.0
0.24CysAsn: 0.24 ± 0.128
0.401CysPro: 0.401 ± 0.212
0.08CysGln: 0.08 ± 0.084
0.24CysArg: 0.24 ± 0.14
0.32CysSer: 0.32 ± 0.152
0.08CysThr: 0.08 ± 0.083
0.32CysVal: 0.32 ± 0.177
0.0CysTrp: 0.0 ± 0.0
0.08CysTyr: 0.08 ± 0.088
0.0CysXaa: 0.0 ± 0.0
Asp
4.165AspAla: 4.165 ± 0.635
0.24AspCys: 0.24 ± 0.124
4.406AspAsp: 4.406 ± 0.62
3.605AspGlu: 3.605 ± 0.606
3.044AspPhe: 3.044 ± 0.702
4.326AspGly: 4.326 ± 0.507
1.202AspHis: 1.202 ± 0.316
4.966AspIle: 4.966 ± 0.831
4.726AspLys: 4.726 ± 0.585
6.008AspLeu: 6.008 ± 0.587
1.602AspMet: 1.602 ± 0.433
4.085AspAsn: 4.085 ± 0.545
2.403AspPro: 2.403 ± 0.471
3.124AspGln: 3.124 ± 0.43
1.602AspArg: 1.602 ± 0.373
4.245AspSer: 4.245 ± 0.586
4.005AspThr: 4.005 ± 0.698
4.005AspVal: 4.005 ± 0.49
1.041AspTrp: 1.041 ± 0.325
2.723AspTyr: 2.723 ± 0.522
0.0AspXaa: 0.0 ± 0.0
Glu
3.364GluAla: 3.364 ± 0.608
0.24GluCys: 0.24 ± 0.144
2.563GluAsp: 2.563 ± 0.413
3.444GluGlu: 3.444 ± 0.64
2.483GluPhe: 2.483 ± 0.512
1.362GluGly: 1.362 ± 0.404
0.881GluHis: 0.881 ± 0.241
3.364GluIle: 3.364 ± 0.62
5.847GluLys: 5.847 ± 0.809
4.406GluLeu: 4.406 ± 0.732
1.602GluMet: 1.602 ± 0.28
3.685GluAsn: 3.685 ± 0.57
0.961GluPro: 0.961 ± 0.303
2.483GluGln: 2.483 ± 0.55
1.922GluArg: 1.922 ± 0.39
3.364GluSer: 3.364 ± 0.469
2.563GluThr: 2.563 ± 0.44
3.044GluVal: 3.044 ± 0.46
0.481GluTrp: 0.481 ± 0.216
2.323GluTyr: 2.323 ± 0.538
0.0GluXaa: 0.0 ± 0.0
Phe
2.964PheAla: 2.964 ± 0.505
0.24PheCys: 0.24 ± 0.147
3.364PheAsp: 3.364 ± 0.547
1.522PheGlu: 1.522 ± 0.354
1.682PhePhe: 1.682 ± 0.269
2.483PheGly: 2.483 ± 0.419
1.041PheHis: 1.041 ± 0.306
2.723PheIle: 2.723 ± 0.571
2.884PheLys: 2.884 ± 0.514
3.284PheLeu: 3.284 ± 0.615
1.282PheMet: 1.282 ± 0.306
3.204PheAsn: 3.204 ± 0.386
1.282PhePro: 1.282 ± 0.283
1.522PheGln: 1.522 ± 0.333
1.442PheArg: 1.442 ± 0.307
3.444PheSer: 3.444 ± 0.432
2.403PheThr: 2.403 ± 0.434
3.204PheVal: 3.204 ± 0.593
0.16PheTrp: 0.16 ± 0.129
1.762PheTyr: 1.762 ± 0.321
0.0PheXaa: 0.0 ± 0.0
Gly
2.643GlyAla: 2.643 ± 0.599
0.24GlyCys: 0.24 ± 0.149
3.925GlyAsp: 3.925 ± 0.66
2.643GlyGlu: 2.643 ± 0.556
2.804GlyPhe: 2.804 ± 0.551
3.605GlyGly: 3.605 ± 0.836
1.442GlyHis: 1.442 ± 0.367
6.168GlyIle: 6.168 ± 1.117
5.767GlyLys: 5.767 ± 0.813
6.088GlyLeu: 6.088 ± 1.011
2.163GlyMet: 2.163 ± 0.429
2.723GlyAsn: 2.723 ± 0.43
0.24GlyPro: 0.24 ± 0.143
3.204GlyGln: 3.204 ± 0.554
1.522GlyArg: 1.522 ± 0.274
4.326GlySer: 4.326 ± 1.122
3.685GlyThr: 3.685 ± 0.682
2.884GlyVal: 2.884 ± 0.397
1.121GlyTrp: 1.121 ± 0.363
3.444GlyTyr: 3.444 ± 0.542
0.0GlyXaa: 0.0 ± 0.0
His
1.121HisAla: 1.121 ± 0.389
0.08HisCys: 0.08 ± 0.084
1.442HisAsp: 1.442 ± 0.287
0.481HisGlu: 0.481 ± 0.216
1.041HisPhe: 1.041 ± 0.367
1.522HisGly: 1.522 ± 0.375
0.481HisHis: 0.481 ± 0.182
1.362HisIle: 1.362 ± 0.314
1.522HisLys: 1.522 ± 0.384
1.842HisLeu: 1.842 ± 0.315
0.08HisMet: 0.08 ± 0.076
1.362HisAsn: 1.362 ± 0.336
0.32HisPro: 0.32 ± 0.197
0.641HisGln: 0.641 ± 0.255
1.041HisArg: 1.041 ± 0.36
1.762HisSer: 1.762 ± 0.362
0.801HisThr: 0.801 ± 0.247
0.561HisVal: 0.561 ± 0.209
0.16HisTrp: 0.16 ± 0.11
0.961HisTyr: 0.961 ± 0.404
0.0HisXaa: 0.0 ± 0.0
Ile
5.607IleAla: 5.607 ± 0.74
0.401IleCys: 0.401 ± 0.241
5.287IleAsp: 5.287 ± 0.857
3.765IleGlu: 3.765 ± 0.578
2.563IlePhe: 2.563 ± 0.379
4.005IleGly: 4.005 ± 0.773
1.362IleHis: 1.362 ± 0.382
5.367IleIle: 5.367 ± 0.929
6.488IleLys: 6.488 ± 0.872
5.447IleLeu: 5.447 ± 1.003
1.121IleMet: 1.121 ± 0.23
4.085IleAsn: 4.085 ± 0.501
2.804IlePro: 2.804 ± 0.547
3.444IleGln: 3.444 ± 0.519
2.163IleArg: 2.163 ± 0.47
5.687IleSer: 5.687 ± 0.774
6.008IleThr: 6.008 ± 0.881
4.326IleVal: 4.326 ± 0.612
0.881IleTrp: 0.881 ± 0.255
2.643IleTyr: 2.643 ± 0.549
0.0IleXaa: 0.0 ± 0.0
Lys
7.77LysAla: 7.77 ± 1.251
0.08LysCys: 0.08 ± 0.072
5.687LysAsp: 5.687 ± 0.845
4.966LysGlu: 4.966 ± 0.753
2.723LysPhe: 2.723 ± 0.493
3.845LysGly: 3.845 ± 0.748
1.762LysHis: 1.762 ± 0.461
5.207LysIle: 5.207 ± 0.847
6.168LysLys: 6.168 ± 0.888
6.408LysLeu: 6.408 ± 0.964
2.483LysMet: 2.483 ± 0.478
4.886LysAsn: 4.886 ± 0.582
2.643LysPro: 2.643 ± 0.499
5.447LysGln: 5.447 ± 0.816
2.964LysArg: 2.964 ± 0.573
5.287LysSer: 5.287 ± 0.654
6.408LysThr: 6.408 ± 0.719
5.127LysVal: 5.127 ± 0.799
1.121LysTrp: 1.121 ± 0.326
3.685LysTyr: 3.685 ± 0.609
0.0LysXaa: 0.0 ± 0.0
Leu
5.607LeuAla: 5.607 ± 0.656
0.561LeuCys: 0.561 ± 0.239
5.046LeuAsp: 5.046 ± 0.615
4.005LeuGlu: 4.005 ± 0.585
3.845LeuPhe: 3.845 ± 0.57
5.527LeuGly: 5.527 ± 0.847
1.362LeuHis: 1.362 ± 0.397
6.568LeuIle: 6.568 ± 1.113
7.69LeuLys: 7.69 ± 0.867
5.847LeuLeu: 5.847 ± 0.618
2.243LeuMet: 2.243 ± 0.503
7.369LeuAsn: 7.369 ± 0.81
2.563LeuPro: 2.563 ± 0.509
3.044LeuGln: 3.044 ± 0.545
3.284LeuArg: 3.284 ± 0.452
6.809LeuSer: 6.809 ± 0.922
7.209LeuThr: 7.209 ± 0.763
4.646LeuVal: 4.646 ± 0.476
0.561LeuTrp: 0.561 ± 0.187
2.563LeuTyr: 2.563 ± 0.437
0.0LeuXaa: 0.0 ± 0.0
Met
1.762MetAla: 1.762 ± 0.419
0.08MetCys: 0.08 ± 0.075
1.762MetAsp: 1.762 ± 0.429
1.041MetGlu: 1.041 ± 0.242
0.961MetPhe: 0.961 ± 0.277
0.32MetGly: 0.32 ± 0.163
0.24MetHis: 0.24 ± 0.126
1.282MetIle: 1.282 ± 0.345
2.163MetLys: 2.163 ± 0.489
2.323MetLeu: 2.323 ± 0.341
0.481MetMet: 0.481 ± 0.19
1.362MetAsn: 1.362 ± 0.309
1.522MetPro: 1.522 ± 0.445
0.641MetGln: 0.641 ± 0.261
0.881MetArg: 0.881 ± 0.414
1.522MetSer: 1.522 ± 0.334
1.682MetThr: 1.682 ± 0.384
1.362MetVal: 1.362 ± 0.357
0.401MetTrp: 0.401 ± 0.181
0.641MetTyr: 0.641 ± 0.235
0.0MetXaa: 0.0 ± 0.0
Asn
4.326AsnAla: 4.326 ± 0.682
0.401AsnCys: 0.401 ± 0.213
3.765AsnAsp: 3.765 ± 0.589
3.284AsnGlu: 3.284 ± 0.55
2.003AsnPhe: 2.003 ± 0.432
4.165AsnGly: 4.165 ± 0.651
1.442AsnHis: 1.442 ± 0.381
4.486AsnIle: 4.486 ± 0.539
5.847AsnLys: 5.847 ± 0.832
4.566AsnLeu: 4.566 ± 0.529
0.961AsnMet: 0.961 ± 0.283
4.326AsnAsn: 4.326 ± 0.847
3.204AsnPro: 3.204 ± 0.576
2.563AsnGln: 2.563 ± 0.608
2.003AsnArg: 2.003 ± 0.51
4.165AsnSer: 4.165 ± 0.728
3.364AsnThr: 3.364 ± 0.615
4.245AsnVal: 4.245 ± 0.555
0.481AsnTrp: 0.481 ± 0.196
2.003AsnTyr: 2.003 ± 0.522
0.0AsnXaa: 0.0 ± 0.0
Pro
2.243ProAla: 2.243 ± 0.439
0.16ProCys: 0.16 ± 0.123
2.243ProAsp: 2.243 ± 0.405
2.083ProGlu: 2.083 ± 0.45
1.121ProPhe: 1.121 ± 0.367
1.121ProGly: 1.121 ± 0.369
0.641ProHis: 0.641 ± 0.22
2.003ProIle: 2.003 ± 0.394
1.682ProLys: 1.682 ± 0.347
2.483ProLeu: 2.483 ± 0.484
0.481ProMet: 0.481 ± 0.14
1.682ProAsn: 1.682 ± 0.389
0.641ProPro: 0.641 ± 0.211
1.121ProGln: 1.121 ± 0.321
0.721ProArg: 0.721 ± 0.286
2.884ProSer: 2.884 ± 0.534
1.602ProThr: 1.602 ± 0.356
2.003ProVal: 2.003 ± 0.32
0.32ProTrp: 0.32 ± 0.194
1.602ProTyr: 1.602 ± 0.323
0.0ProXaa: 0.0 ± 0.0
Gln
3.605GlnAla: 3.605 ± 0.599
0.32GlnCys: 0.32 ± 0.167
2.003GlnAsp: 2.003 ± 0.493
2.003GlnGlu: 2.003 ± 0.471
1.602GlnPhe: 1.602 ± 0.393
2.003GlnGly: 2.003 ± 0.422
0.801GlnHis: 0.801 ± 0.268
3.845GlnIle: 3.845 ± 0.521
2.643GlnLys: 2.643 ± 0.65
5.367GlnLeu: 5.367 ± 0.768
0.721GlnMet: 0.721 ± 0.233
2.964GlnAsn: 2.964 ± 0.513
0.881GlnPro: 0.881 ± 0.291
1.842GlnGln: 1.842 ± 0.456
1.202GlnArg: 1.202 ± 0.287
3.925GlnSer: 3.925 ± 0.765
2.884GlnThr: 2.884 ± 0.554
2.884GlnVal: 2.884 ± 0.529
0.561GlnTrp: 0.561 ± 0.237
2.003GlnTyr: 2.003 ± 0.393
0.0GlnXaa: 0.0 ± 0.0
Arg
1.922ArgAla: 1.922 ± 0.414
0.16ArgCys: 0.16 ± 0.128
1.762ArgAsp: 1.762 ± 0.414
1.682ArgGlu: 1.682 ± 0.433
1.762ArgPhe: 1.762 ± 0.438
1.762ArgGly: 1.762 ± 0.386
0.481ArgHis: 0.481 ± 0.278
2.243ArgIle: 2.243 ± 0.568
2.884ArgLys: 2.884 ± 0.474
3.124ArgLeu: 3.124 ± 0.593
0.801ArgMet: 0.801 ± 0.263
1.682ArgAsn: 1.682 ± 0.487
0.561ArgPro: 0.561 ± 0.233
1.762ArgGln: 1.762 ± 0.34
1.202ArgArg: 1.202 ± 0.379
1.842ArgSer: 1.842 ± 0.382
1.522ArgThr: 1.522 ± 0.336
2.163ArgVal: 2.163 ± 0.486
0.801ArgTrp: 0.801 ± 0.23
1.362ArgTyr: 1.362 ± 0.289
0.0ArgXaa: 0.0 ± 0.0
Ser
4.646SerAla: 4.646 ± 0.938
0.0SerCys: 0.0 ± 0.0
5.767SerAsp: 5.767 ± 0.538
4.245SerGlu: 4.245 ± 0.563
3.124SerPhe: 3.124 ± 0.629
6.649SerGly: 6.649 ± 0.783
1.202SerHis: 1.202 ± 0.292
5.127SerIle: 5.127 ± 0.629
6.408SerLys: 6.408 ± 1.114
5.847SerLeu: 5.847 ± 0.697
1.922SerMet: 1.922 ± 0.411
4.165SerAsn: 4.165 ± 0.59
1.922SerPro: 1.922 ± 0.439
2.884SerGln: 2.884 ± 0.489
1.762SerArg: 1.762 ± 0.406
5.127SerSer: 5.127 ± 1.054
6.248SerThr: 6.248 ± 1.199
4.806SerVal: 4.806 ± 0.627
0.721SerTrp: 0.721 ± 0.265
3.204SerTyr: 3.204 ± 0.454
0.0SerXaa: 0.0 ± 0.0
Thr
6.248ThrAla: 6.248 ± 0.909
0.401ThrCys: 0.401 ± 0.162
5.046ThrAsp: 5.046 ± 0.677
3.525ThrGlu: 3.525 ± 0.419
2.403ThrPhe: 2.403 ± 0.423
5.127ThrGly: 5.127 ± 0.893
0.961ThrHis: 0.961 ± 0.249
4.406ThrIle: 4.406 ± 0.787
6.248ThrLys: 6.248 ± 0.802
4.646ThrLeu: 4.646 ± 0.653
1.362ThrMet: 1.362 ± 0.268
3.364ThrAsn: 3.364 ± 0.515
1.842ThrPro: 1.842 ± 0.367
2.643ThrGln: 2.643 ± 0.367
1.442ThrArg: 1.442 ± 0.372
4.726ThrSer: 4.726 ± 0.888
5.687ThrThr: 5.687 ± 0.832
3.765ThrVal: 3.765 ± 0.568
0.32ThrTrp: 0.32 ± 0.134
2.804ThrTyr: 2.804 ± 0.651
0.0ThrXaa: 0.0 ± 0.0
Val
4.165ValAla: 4.165 ± 0.501
0.401ValCys: 0.401 ± 0.183
4.726ValAsp: 4.726 ± 0.6
2.884ValGlu: 2.884 ± 0.542
2.403ValPhe: 2.403 ± 0.403
4.165ValGly: 4.165 ± 0.827
0.721ValHis: 0.721 ± 0.201
4.245ValIle: 4.245 ± 0.853
4.726ValLys: 4.726 ± 0.719
4.886ValLeu: 4.886 ± 0.442
0.721ValMet: 0.721 ± 0.261
3.525ValAsn: 3.525 ± 0.605
1.602ValPro: 1.602 ± 0.324
2.243ValGln: 2.243 ± 0.39
1.762ValArg: 1.762 ± 0.501
6.649ValSer: 6.649 ± 0.925
3.845ValThr: 3.845 ± 0.55
3.925ValVal: 3.925 ± 0.622
0.561ValTrp: 0.561 ± 0.215
1.682ValTyr: 1.682 ± 0.362
0.0ValXaa: 0.0 ± 0.0
Trp
0.721TrpAla: 0.721 ± 0.235
0.0TrpCys: 0.0 ± 0.0
0.401TrpAsp: 0.401 ± 0.152
0.481TrpGlu: 0.481 ± 0.179
0.481TrpPhe: 0.481 ± 0.202
0.561TrpGly: 0.561 ± 0.206
0.16TrpHis: 0.16 ± 0.096
0.961TrpIle: 0.961 ± 0.265
0.881TrpLys: 0.881 ± 0.278
1.602TrpLeu: 1.602 ± 0.369
0.16TrpMet: 0.16 ± 0.108
0.961TrpAsn: 0.961 ± 0.245
0.16TrpPro: 0.16 ± 0.125
0.401TrpGln: 0.401 ± 0.175
0.481TrpArg: 0.481 ± 0.196
0.801TrpSer: 0.801 ± 0.262
0.641TrpThr: 0.641 ± 0.172
0.481TrpVal: 0.481 ± 0.167
0.08TrpTrp: 0.08 ± 0.079
0.561TrpTyr: 0.561 ± 0.302
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.804TyrAla: 2.804 ± 0.385
0.32TyrCys: 0.32 ± 0.196
2.323TyrAsp: 2.323 ± 0.384
0.961TyrGlu: 0.961 ± 0.251
1.682TyrPhe: 1.682 ± 0.325
2.804TyrGly: 2.804 ± 0.442
0.961TyrHis: 0.961 ± 0.323
2.964TyrIle: 2.964 ± 0.673
3.444TyrLys: 3.444 ± 0.557
4.406TyrLeu: 4.406 ± 0.755
0.641TyrMet: 0.641 ± 0.222
1.762TyrAsn: 1.762 ± 0.43
1.602TyrPro: 1.602 ± 0.422
2.563TyrGln: 2.563 ± 0.472
1.442TyrArg: 1.442 ± 0.379
3.444TyrSer: 3.444 ± 0.493
2.243TyrThr: 2.243 ± 0.556
2.163TyrVal: 2.163 ± 0.399
0.401TyrTrp: 0.401 ± 0.194
2.323TyrTyr: 2.323 ± 0.452
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (12485 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski