Amino acid dipepetide frequency for Lactobacillus phage phiJB

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.064AlaAla: 6.064 ± 1.116
0.527AlaCys: 0.527 ± 0.227
5.624AlaAsp: 5.624 ± 1.227
3.779AlaGlu: 3.779 ± 0.58
2.461AlaPhe: 2.461 ± 0.417
5.976AlaGly: 5.976 ± 1.302
0.527AlaHis: 0.527 ± 0.222
5.097AlaIle: 5.097 ± 0.977
8.173AlaLys: 8.173 ± 0.788
7.03AlaLeu: 7.03 ± 1.145
2.021AlaMet: 2.021 ± 0.345
4.13AlaAsn: 4.13 ± 0.486
1.582AlaPro: 1.582 ± 0.375
4.218AlaGln: 4.218 ± 0.491
3.339AlaArg: 3.339 ± 0.5
5.976AlaSer: 5.976 ± 0.739
6.327AlaThr: 6.327 ± 0.703
4.482AlaVal: 4.482 ± 0.577
1.406AlaTrp: 1.406 ± 0.356
3.603AlaTyr: 3.603 ± 0.747
0.0AlaXaa: 0.0 ± 0.0
Cys
0.352CysAla: 0.352 ± 0.173
0.088CysCys: 0.088 ± 0.084
0.615CysAsp: 0.615 ± 0.272
0.264CysGlu: 0.264 ± 0.159
0.527CysPhe: 0.527 ± 0.238
0.791CysGly: 0.791 ± 0.316
0.264CysHis: 0.264 ± 0.139
0.439CysIle: 0.439 ± 0.198
0.439CysLys: 0.439 ± 0.201
0.527CysLeu: 0.527 ± 0.195
0.439CysMet: 0.439 ± 0.213
0.088CysAsn: 0.088 ± 0.084
0.439CysPro: 0.439 ± 0.269
0.439CysGln: 0.439 ± 0.27
0.088CysArg: 0.088 ± 0.085
0.527CysSer: 0.527 ± 0.222
0.352CysThr: 0.352 ± 0.212
0.615CysVal: 0.615 ± 0.212
0.0CysTrp: 0.0 ± 0.0
0.352CysTyr: 0.352 ± 0.157
0.0CysXaa: 0.0 ± 0.0
Asp
4.746AspAla: 4.746 ± 0.655
0.615AspCys: 0.615 ± 0.314
3.867AspAsp: 3.867 ± 0.52
3.779AspGlu: 3.779 ± 0.782
3.339AspPhe: 3.339 ± 0.465
6.591AspGly: 6.591 ± 0.981
1.406AspHis: 1.406 ± 0.377
3.867AspIle: 3.867 ± 0.504
3.691AspLys: 3.691 ± 0.525
5.8AspLeu: 5.8 ± 0.666
2.021AspMet: 2.021 ± 0.422
2.724AspAsn: 2.724 ± 0.515
3.076AspPro: 3.076 ± 0.518
4.043AspGln: 4.043 ± 0.79
2.724AspArg: 2.724 ± 0.617
4.921AspSer: 4.921 ± 0.989
3.955AspThr: 3.955 ± 0.9
3.427AspVal: 3.427 ± 0.533
1.318AspTrp: 1.318 ± 0.313
4.394AspTyr: 4.394 ± 0.605
0.0AspXaa: 0.0 ± 0.0
Glu
4.306GluAla: 4.306 ± 0.779
0.439GluCys: 0.439 ± 0.211
3.691GluAsp: 3.691 ± 0.828
3.427GluGlu: 3.427 ± 0.559
2.285GluPhe: 2.285 ± 0.344
2.549GluGly: 2.549 ± 0.39
0.791GluHis: 0.791 ± 0.298
3.955GluIle: 3.955 ± 0.676
3.164GluLys: 3.164 ± 0.556
5.888GluLeu: 5.888 ± 0.777
2.021GluMet: 2.021 ± 0.47
3.164GluAsn: 3.164 ± 0.597
1.055GluPro: 1.055 ± 0.353
2.285GluGln: 2.285 ± 0.452
2.197GluArg: 2.197 ± 0.37
3.427GluSer: 3.427 ± 0.42
2.549GluThr: 2.549 ± 0.598
3.515GluVal: 3.515 ± 0.529
0.879GluTrp: 0.879 ± 0.303
1.582GluTyr: 1.582 ± 0.38
0.0GluXaa: 0.0 ± 0.0
Phe
3.252PheAla: 3.252 ± 0.543
0.176PheCys: 0.176 ± 0.128
3.164PheAsp: 3.164 ± 0.593
2.197PheGlu: 2.197 ± 0.543
1.933PhePhe: 1.933 ± 0.503
2.549PheGly: 2.549 ± 0.348
0.791PheHis: 0.791 ± 0.227
2.549PheIle: 2.549 ± 0.529
4.043PheLys: 4.043 ± 0.931
2.285PheLeu: 2.285 ± 0.518
0.967PheMet: 0.967 ± 0.345
2.021PheAsn: 2.021 ± 0.381
1.318PhePro: 1.318 ± 0.32
0.879PheGln: 0.879 ± 0.345
1.758PheArg: 1.758 ± 0.327
2.461PheSer: 2.461 ± 0.448
2.549PheThr: 2.549 ± 0.412
2.021PheVal: 2.021 ± 0.503
0.703PheTrp: 0.703 ± 0.297
1.055PheTyr: 1.055 ± 0.314
0.0PheXaa: 0.0 ± 0.0
Gly
4.57GlyAla: 4.57 ± 0.71
0.615GlyCys: 0.615 ± 0.25
4.746GlyAsp: 4.746 ± 0.66
2.988GlyGlu: 2.988 ± 0.488
3.779GlyPhe: 3.779 ± 0.663
5.009GlyGly: 5.009 ± 0.913
1.758GlyHis: 1.758 ± 0.371
3.252GlyIle: 3.252 ± 0.517
6.503GlyLys: 6.503 ± 0.672
5.273GlyLeu: 5.273 ± 0.766
2.109GlyMet: 2.109 ± 0.5
5.009GlyAsn: 5.009 ± 0.995
1.758GlyPro: 1.758 ± 0.713
3.515GlyGln: 3.515 ± 0.597
1.933GlyArg: 1.933 ± 0.388
5.185GlySer: 5.185 ± 0.983
5.712GlyThr: 5.712 ± 0.709
4.833GlyVal: 4.833 ± 0.792
0.352GlyTrp: 0.352 ± 0.154
2.9GlyTyr: 2.9 ± 0.503
0.0GlyXaa: 0.0 ± 0.0
His
0.879HisAla: 0.879 ± 0.247
0.176HisCys: 0.176 ± 0.107
1.406HisAsp: 1.406 ± 0.425
0.791HisGlu: 0.791 ± 0.232
0.967HisPhe: 0.967 ± 0.26
1.318HisGly: 1.318 ± 0.351
0.352HisHis: 0.352 ± 0.226
0.967HisIle: 0.967 ± 0.253
1.055HisLys: 1.055 ± 0.344
0.791HisLeu: 0.791 ± 0.327
0.439HisMet: 0.439 ± 0.185
0.352HisAsn: 0.352 ± 0.179
1.318HisPro: 1.318 ± 0.329
1.055HisGln: 1.055 ± 0.286
1.23HisArg: 1.23 ± 0.326
1.055HisSer: 1.055 ± 0.336
1.23HisThr: 1.23 ± 0.423
1.406HisVal: 1.406 ± 0.288
0.088HisTrp: 0.088 ± 0.076
0.967HisTyr: 0.967 ± 0.336
0.0HisXaa: 0.0 ± 0.0
Ile
3.955IleAla: 3.955 ± 0.549
0.264IleCys: 0.264 ± 0.161
5.888IleAsp: 5.888 ± 0.615
3.164IleGlu: 3.164 ± 0.557
1.494IlePhe: 1.494 ± 0.324
4.043IleGly: 4.043 ± 0.828
0.879IleHis: 0.879 ± 0.285
2.988IleIle: 2.988 ± 0.593
5.361IleLys: 5.361 ± 0.896
3.955IleLeu: 3.955 ± 0.606
1.318IleMet: 1.318 ± 0.341
3.603IleAsn: 3.603 ± 0.686
1.582IlePro: 1.582 ± 0.342
1.933IleGln: 1.933 ± 0.471
3.076IleArg: 3.076 ± 0.621
4.394IleSer: 4.394 ± 0.56
4.57IleThr: 4.57 ± 0.71
3.691IleVal: 3.691 ± 0.58
1.055IleTrp: 1.055 ± 0.289
2.988IleTyr: 2.988 ± 0.443
0.0IleXaa: 0.0 ± 0.0
Lys
7.206LysAla: 7.206 ± 1.115
0.615LysCys: 0.615 ± 0.243
4.043LysAsp: 4.043 ± 0.746
5.449LysGlu: 5.449 ± 0.753
2.461LysPhe: 2.461 ± 0.539
6.415LysGly: 6.415 ± 0.969
1.67LysHis: 1.67 ± 0.439
3.867LysIle: 3.867 ± 0.693
7.03LysLys: 7.03 ± 1.219
6.24LysLeu: 6.24 ± 0.867
2.109LysMet: 2.109 ± 0.46
3.603LysAsn: 3.603 ± 0.512
2.549LysPro: 2.549 ± 0.522
3.076LysGln: 3.076 ± 0.615
3.603LysArg: 3.603 ± 0.69
5.097LysSer: 5.097 ± 0.646
5.185LysThr: 5.185 ± 0.765
5.185LysVal: 5.185 ± 0.796
1.318LysTrp: 1.318 ± 0.324
1.758LysTyr: 1.758 ± 0.397
0.0LysXaa: 0.0 ± 0.0
Leu
6.855LeuAla: 6.855 ± 0.919
0.703LeuCys: 0.703 ± 0.294
6.415LeuAsp: 6.415 ± 0.86
4.394LeuGlu: 4.394 ± 0.775
2.109LeuPhe: 2.109 ± 0.386
4.394LeuGly: 4.394 ± 0.599
0.791LeuHis: 0.791 ± 0.301
4.57LeuIle: 4.57 ± 0.759
8.349LeuLys: 8.349 ± 1.205
5.009LeuLeu: 5.009 ± 0.748
1.23LeuMet: 1.23 ± 0.344
3.691LeuAsn: 3.691 ± 0.597
2.021LeuPro: 2.021 ± 0.46
2.988LeuGln: 2.988 ± 0.374
2.109LeuArg: 2.109 ± 0.507
4.218LeuSer: 4.218 ± 0.705
5.449LeuThr: 5.449 ± 0.68
3.955LeuVal: 3.955 ± 0.675
1.23LeuTrp: 1.23 ± 0.478
2.812LeuTyr: 2.812 ± 0.604
0.0LeuXaa: 0.0 ± 0.0
Met
3.252MetAla: 3.252 ± 0.494
0.176MetCys: 0.176 ± 0.112
1.67MetAsp: 1.67 ± 0.444
1.142MetGlu: 1.142 ± 0.342
1.23MetPhe: 1.23 ± 0.412
1.142MetGly: 1.142 ± 0.442
0.615MetHis: 0.615 ± 0.25
1.933MetIle: 1.933 ± 0.543
1.23MetLys: 1.23 ± 0.413
1.846MetLeu: 1.846 ± 0.432
0.088MetMet: 0.088 ± 0.105
0.615MetAsn: 0.615 ± 0.208
1.406MetPro: 1.406 ± 0.325
1.846MetGln: 1.846 ± 0.408
1.23MetArg: 1.23 ± 0.311
2.636MetSer: 2.636 ± 0.433
1.142MetThr: 1.142 ± 0.298
1.142MetVal: 1.142 ± 0.31
0.176MetTrp: 0.176 ± 0.126
0.615MetTyr: 0.615 ± 0.19
0.0MetXaa: 0.0 ± 0.0
Asn
4.746AsnAla: 4.746 ± 0.734
0.967AsnCys: 0.967 ± 0.298
3.164AsnAsp: 3.164 ± 0.504
2.9AsnGlu: 2.9 ± 0.648
2.021AsnPhe: 2.021 ± 0.363
5.097AsnGly: 5.097 ± 0.727
0.791AsnHis: 0.791 ± 0.235
2.549AsnIle: 2.549 ± 0.455
2.549AsnLys: 2.549 ± 0.361
2.724AsnLeu: 2.724 ± 0.412
1.318AsnMet: 1.318 ± 0.361
1.23AsnAsn: 1.23 ± 0.347
2.988AsnPro: 2.988 ± 0.724
2.197AsnGln: 2.197 ± 0.464
2.021AsnArg: 2.021 ± 0.38
2.636AsnSer: 2.636 ± 0.43
2.636AsnThr: 2.636 ± 0.383
3.515AsnVal: 3.515 ± 0.468
0.879AsnTrp: 0.879 ± 0.238
2.373AsnTyr: 2.373 ± 0.518
0.0AsnXaa: 0.0 ± 0.0
Pro
3.339ProAla: 3.339 ± 0.649
0.088ProCys: 0.088 ± 0.096
2.636ProAsp: 2.636 ± 0.38
1.846ProGlu: 1.846 ± 0.498
0.879ProPhe: 0.879 ± 0.283
2.021ProGly: 2.021 ± 0.355
0.967ProHis: 0.967 ± 0.284
1.933ProIle: 1.933 ± 0.375
2.636ProLys: 2.636 ± 0.708
1.933ProLeu: 1.933 ± 0.422
0.879ProMet: 0.879 ± 0.289
1.846ProAsn: 1.846 ± 0.462
1.142ProPro: 1.142 ± 0.324
1.23ProGln: 1.23 ± 0.419
1.406ProArg: 1.406 ± 0.291
2.724ProSer: 2.724 ± 0.437
2.461ProThr: 2.461 ± 0.508
1.933ProVal: 1.933 ± 0.521
0.176ProTrp: 0.176 ± 0.134
1.758ProTyr: 1.758 ± 0.344
0.0ProXaa: 0.0 ± 0.0
Gln
3.691GlnAla: 3.691 ± 0.665
0.264GlnCys: 0.264 ± 0.144
2.636GlnAsp: 2.636 ± 0.334
2.197GlnGlu: 2.197 ± 0.422
1.582GlnPhe: 1.582 ± 0.354
2.988GlnGly: 2.988 ± 0.592
0.527GlnHis: 0.527 ± 0.232
2.9GlnIle: 2.9 ± 0.452
2.9GlnLys: 2.9 ± 0.514
3.515GlnLeu: 3.515 ± 0.801
1.494GlnMet: 1.494 ± 0.397
1.758GlnAsn: 1.758 ± 0.384
1.406GlnPro: 1.406 ± 0.381
1.758GlnGln: 1.758 ± 0.523
0.967GlnArg: 0.967 ± 0.259
2.988GlnSer: 2.988 ± 0.539
3.427GlnThr: 3.427 ± 0.641
3.515GlnVal: 3.515 ± 0.688
0.439GlnTrp: 0.439 ± 0.17
2.021GlnTyr: 2.021 ± 0.664
0.0GlnXaa: 0.0 ± 0.0
Arg
3.252ArgAla: 3.252 ± 0.588
0.088ArgCys: 0.088 ± 0.085
2.636ArgAsp: 2.636 ± 0.456
2.549ArgGlu: 2.549 ± 0.596
1.582ArgPhe: 1.582 ± 0.375
1.758ArgGly: 1.758 ± 0.448
0.615ArgHis: 0.615 ± 0.226
2.812ArgIle: 2.812 ± 0.55
3.427ArgLys: 3.427 ± 0.599
3.427ArgLeu: 3.427 ± 0.658
0.791ArgMet: 0.791 ± 0.245
1.582ArgAsn: 1.582 ± 0.422
1.318ArgPro: 1.318 ± 0.349
1.846ArgGln: 1.846 ± 0.422
1.406ArgArg: 1.406 ± 0.389
1.318ArgSer: 1.318 ± 0.3
3.779ArgThr: 3.779 ± 0.512
2.724ArgVal: 2.724 ± 0.529
0.879ArgTrp: 0.879 ± 0.247
1.67ArgTyr: 1.67 ± 0.332
0.0ArgXaa: 0.0 ± 0.0
Ser
6.415SerAla: 6.415 ± 0.755
0.264SerCys: 0.264 ± 0.183
5.185SerAsp: 5.185 ± 0.844
2.988SerGlu: 2.988 ± 0.613
2.9SerPhe: 2.9 ± 0.538
5.009SerGly: 5.009 ± 0.602
1.055SerHis: 1.055 ± 0.347
3.603SerIle: 3.603 ± 0.621
5.009SerLys: 5.009 ± 0.677
4.394SerLeu: 4.394 ± 0.614
1.67SerMet: 1.67 ± 0.345
3.164SerAsn: 3.164 ± 0.584
2.636SerPro: 2.636 ± 0.59
2.461SerGln: 2.461 ± 0.463
2.461SerArg: 2.461 ± 0.469
4.13SerSer: 4.13 ± 0.772
3.867SerThr: 3.867 ± 0.576
4.043SerVal: 4.043 ± 0.556
0.615SerTrp: 0.615 ± 0.246
3.603SerTyr: 3.603 ± 0.443
0.0SerXaa: 0.0 ± 0.0
Thr
5.712ThrAla: 5.712 ± 0.742
0.615ThrCys: 0.615 ± 0.245
5.009ThrAsp: 5.009 ± 0.837
3.076ThrGlu: 3.076 ± 0.411
2.812ThrPhe: 2.812 ± 0.482
6.064ThrGly: 6.064 ± 0.958
1.318ThrHis: 1.318 ± 0.339
4.921ThrIle: 4.921 ± 0.748
3.955ThrLys: 3.955 ± 0.525
5.185ThrLeu: 5.185 ± 0.863
1.758ThrMet: 1.758 ± 0.38
3.164ThrAsn: 3.164 ± 0.665
2.9ThrPro: 2.9 ± 0.599
2.724ThrGln: 2.724 ± 0.503
2.373ThrArg: 2.373 ± 0.47
4.57ThrSer: 4.57 ± 0.636
5.097ThrThr: 5.097 ± 0.713
4.921ThrVal: 4.921 ± 0.868
0.791ThrTrp: 0.791 ± 0.308
2.636ThrTyr: 2.636 ± 0.533
0.0ThrXaa: 0.0 ± 0.0
Val
5.537ValAla: 5.537 ± 1.01
0.352ValCys: 0.352 ± 0.156
4.043ValAsp: 4.043 ± 0.756
4.043ValGlu: 4.043 ± 0.623
1.67ValPhe: 1.67 ± 0.41
4.658ValGly: 4.658 ± 0.582
1.494ValHis: 1.494 ± 0.307
3.427ValIle: 3.427 ± 0.603
4.833ValLys: 4.833 ± 0.595
3.603ValLeu: 3.603 ± 0.488
1.406ValMet: 1.406 ± 0.306
4.13ValAsn: 4.13 ± 0.554
2.197ValPro: 2.197 ± 0.384
2.373ValGln: 2.373 ± 0.484
2.988ValArg: 2.988 ± 0.543
3.955ValSer: 3.955 ± 0.58
4.482ValThr: 4.482 ± 0.593
5.888ValVal: 5.888 ± 0.633
1.055ValTrp: 1.055 ± 0.261
2.285ValTyr: 2.285 ± 0.553
0.0ValXaa: 0.0 ± 0.0
Trp
1.142TrpAla: 1.142 ± 0.305
0.088TrpCys: 0.088 ± 0.084
0.879TrpAsp: 0.879 ± 0.253
0.527TrpGlu: 0.527 ± 0.19
0.703TrpPhe: 0.703 ± 0.269
0.352TrpGly: 0.352 ± 0.169
0.439TrpHis: 0.439 ± 0.179
1.494TrpIle: 1.494 ± 0.287
1.406TrpLys: 1.406 ± 0.359
1.055TrpLeu: 1.055 ± 0.243
0.176TrpMet: 0.176 ± 0.103
1.406TrpAsn: 1.406 ± 0.427
0.439TrpPro: 0.439 ± 0.233
0.615TrpGln: 0.615 ± 0.206
0.615TrpArg: 0.615 ± 0.237
0.967TrpSer: 0.967 ± 0.462
0.615TrpThr: 0.615 ± 0.21
0.703TrpVal: 0.703 ± 0.182
0.088TrpTrp: 0.088 ± 0.082
0.352TrpTyr: 0.352 ± 0.166
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.076TyrAla: 3.076 ± 0.394
0.527TyrCys: 0.527 ± 0.207
2.988TyrAsp: 2.988 ± 0.697
1.67TyrGlu: 1.67 ± 0.317
1.933TyrPhe: 1.933 ± 0.517
3.076TyrGly: 3.076 ± 0.603
0.791TyrHis: 0.791 ± 0.294
3.076TyrIle: 3.076 ± 0.632
2.724TyrLys: 2.724 ± 0.495
2.812TyrLeu: 2.812 ± 0.656
0.791TyrMet: 0.791 ± 0.271
2.021TyrAsn: 2.021 ± 0.489
0.703TyrPro: 0.703 ± 0.238
1.494TyrGln: 1.494 ± 0.326
2.021TyrArg: 2.021 ± 0.329
2.285TyrSer: 2.285 ± 0.4
4.218TyrThr: 4.218 ± 0.94
2.9TyrVal: 2.9 ± 0.603
0.615TyrTrp: 0.615 ± 0.208
2.285TyrTyr: 2.285 ± 0.545
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (11380 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski