Amino acid dipepetide frequency for Lactococcus phage phiLC3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.538AlaAla: 3.538 ± 0.873
0.303AlaCys: 0.303 ± 0.168
3.942AlaAsp: 3.942 ± 0.73
4.245AlaGlu: 4.245 ± 0.919
2.426AlaPhe: 2.426 ± 0.466
4.448AlaGly: 4.448 ± 1.099
1.112AlaHis: 1.112 ± 0.352
5.661AlaIle: 5.661 ± 0.976
5.458AlaLys: 5.458 ± 0.72
5.458AlaLeu: 5.458 ± 0.852
1.213AlaMet: 1.213 ± 0.345
3.134AlaAsn: 3.134 ± 0.672
1.213AlaPro: 1.213 ± 0.321
2.628AlaGln: 2.628 ± 0.612
2.527AlaArg: 2.527 ± 0.488
4.347AlaSer: 4.347 ± 0.73
4.852AlaThr: 4.852 ± 0.685
5.054AlaVal: 5.054 ± 0.749
1.112AlaTrp: 1.112 ± 0.346
1.921AlaTyr: 1.921 ± 0.474
0.0AlaXaa: 0.0 ± 0.0
Cys
0.303CysAla: 0.303 ± 0.167
0.101CysCys: 0.101 ± 0.098
0.606CysAsp: 0.606 ± 0.297
0.0CysGlu: 0.0 ± 0.0
0.202CysPhe: 0.202 ± 0.132
0.708CysGly: 0.708 ± 0.278
0.202CysHis: 0.202 ± 0.142
0.101CysIle: 0.101 ± 0.117
0.303CysLys: 0.303 ± 0.173
0.404CysLeu: 0.404 ± 0.213
0.0CysMet: 0.0 ± 0.0
0.303CysAsn: 0.303 ± 0.195
0.202CysPro: 0.202 ± 0.13
0.404CysGln: 0.404 ± 0.19
0.505CysArg: 0.505 ± 0.256
0.505CysSer: 0.505 ± 0.202
0.303CysThr: 0.303 ± 0.178
0.101CysVal: 0.101 ± 0.096
0.0CysTrp: 0.0 ± 0.0
0.606CysTyr: 0.606 ± 0.221
0.0CysXaa: 0.0 ± 0.0
Asp
2.83AspAla: 2.83 ± 0.595
0.404AspCys: 0.404 ± 0.22
3.336AspAsp: 3.336 ± 0.595
5.054AspGlu: 5.054 ± 0.784
3.74AspPhe: 3.74 ± 0.507
4.347AspGly: 4.347 ± 0.612
0.809AspHis: 0.809 ± 0.275
5.155AspIle: 5.155 ± 0.796
4.245AspLys: 4.245 ± 0.548
3.942AspLeu: 3.942 ± 0.698
2.022AspMet: 2.022 ± 0.42
3.74AspAsn: 3.74 ± 0.557
1.415AspPro: 1.415 ± 0.421
1.314AspGln: 1.314 ± 0.404
2.83AspArg: 2.83 ± 0.542
4.549AspSer: 4.549 ± 0.63
3.134AspThr: 3.134 ± 0.578
3.538AspVal: 3.538 ± 0.729
1.516AspTrp: 1.516 ± 0.364
2.224AspTyr: 2.224 ± 0.414
0.0AspXaa: 0.0 ± 0.0
Glu
4.953GluAla: 4.953 ± 0.57
0.505GluCys: 0.505 ± 0.229
2.83GluAsp: 2.83 ± 0.835
4.347GluGlu: 4.347 ± 0.813
3.336GluPhe: 3.336 ± 0.683
1.921GluGly: 1.921 ± 0.344
1.112GluHis: 1.112 ± 0.308
5.357GluIle: 5.357 ± 0.829
7.076GluLys: 7.076 ± 1.354
7.682GluLeu: 7.682 ± 0.863
2.426GluMet: 2.426 ± 0.516
3.841GluAsn: 3.841 ± 0.703
2.022GluPro: 2.022 ± 0.564
4.043GluGln: 4.043 ± 0.613
2.628GluArg: 2.628 ± 0.615
4.043GluSer: 4.043 ± 0.713
3.639GluThr: 3.639 ± 0.528
4.852GluVal: 4.852 ± 0.79
0.708GluTrp: 0.708 ± 0.339
2.123GluTyr: 2.123 ± 0.469
0.0GluXaa: 0.0 ± 0.0
Phe
2.325PheAla: 2.325 ± 0.442
0.404PheCys: 0.404 ± 0.179
3.032PheAsp: 3.032 ± 0.397
3.538PheGlu: 3.538 ± 0.668
2.022PhePhe: 2.022 ± 0.446
3.538PheGly: 3.538 ± 0.552
1.213PheHis: 1.213 ± 0.411
2.325PheIle: 2.325 ± 0.501
3.538PheLys: 3.538 ± 0.479
1.921PheLeu: 1.921 ± 0.46
0.809PheMet: 0.809 ± 0.285
2.628PheAsn: 2.628 ± 0.567
1.112PhePro: 1.112 ± 0.302
1.617PheGln: 1.617 ± 0.526
1.415PheArg: 1.415 ± 0.41
2.83PheSer: 2.83 ± 0.82
2.83PheThr: 2.83 ± 0.471
2.325PheVal: 2.325 ± 0.368
0.606PheTrp: 0.606 ± 0.276
1.617PheTyr: 1.617 ± 0.378
0.0PheXaa: 0.0 ± 0.0
Gly
3.538GlyAla: 3.538 ± 0.937
0.202GlyCys: 0.202 ± 0.156
3.538GlyAsp: 3.538 ± 0.591
3.74GlyGlu: 3.74 ± 0.516
3.74GlyPhe: 3.74 ± 0.686
5.256GlyGly: 5.256 ± 1.013
0.91GlyHis: 0.91 ± 0.269
7.379GlyIle: 7.379 ± 1.559
5.661GlyLys: 5.661 ± 0.823
5.357GlyLeu: 5.357 ± 0.747
1.112GlyMet: 1.112 ± 0.346
3.235GlyAsn: 3.235 ± 0.621
0.809GlyPro: 0.809 ± 0.401
3.032GlyGln: 3.032 ± 0.698
1.819GlyArg: 1.819 ± 0.414
3.538GlySer: 3.538 ± 0.984
5.357GlyThr: 5.357 ± 0.768
3.841GlyVal: 3.841 ± 0.691
0.91GlyTrp: 0.91 ± 0.299
2.931GlyTyr: 2.931 ± 0.554
0.0GlyXaa: 0.0 ± 0.0
His
1.112HisAla: 1.112 ± 0.374
0.0HisCys: 0.0 ± 0.0
0.809HisAsp: 0.809 ± 0.291
1.112HisGlu: 1.112 ± 0.344
0.505HisPhe: 0.505 ± 0.204
0.606HisGly: 0.606 ± 0.266
0.202HisHis: 0.202 ± 0.148
0.91HisIle: 0.91 ± 0.305
1.011HisLys: 1.011 ± 0.251
0.809HisLeu: 0.809 ± 0.325
0.202HisMet: 0.202 ± 0.137
0.606HisAsn: 0.606 ± 0.243
0.708HisPro: 0.708 ± 0.285
0.809HisGln: 0.809 ± 0.335
0.404HisArg: 0.404 ± 0.181
1.112HisSer: 1.112 ± 0.337
0.505HisThr: 0.505 ± 0.224
0.809HisVal: 0.809 ± 0.288
0.404HisTrp: 0.404 ± 0.196
0.606HisTyr: 0.606 ± 0.262
0.0HisXaa: 0.0 ± 0.0
Ile
5.054IleAla: 5.054 ± 0.783
0.303IleCys: 0.303 ± 0.187
6.267IleAsp: 6.267 ± 0.527
4.953IleGlu: 4.953 ± 0.721
1.819IlePhe: 1.819 ± 0.345
5.155IleGly: 5.155 ± 0.782
0.809IleHis: 0.809 ± 0.269
4.347IleIle: 4.347 ± 0.824
6.874IleLys: 6.874 ± 0.843
5.357IleLeu: 5.357 ± 0.773
1.415IleMet: 1.415 ± 0.391
4.347IleAsn: 4.347 ± 0.641
3.235IlePro: 3.235 ± 0.522
2.931IleGln: 2.931 ± 0.493
2.123IleArg: 2.123 ± 0.373
7.985IleSer: 7.985 ± 1.423
4.448IleThr: 4.448 ± 0.749
3.538IleVal: 3.538 ± 0.539
0.303IleTrp: 0.303 ± 0.157
2.123IleTyr: 2.123 ± 0.564
0.0IleXaa: 0.0 ± 0.0
Lys
6.469LysAla: 6.469 ± 0.72
0.404LysCys: 0.404 ± 0.211
4.65LysAsp: 4.65 ± 0.763
7.783LysGlu: 7.783 ± 1.12
2.426LysPhe: 2.426 ± 0.521
5.458LysGly: 5.458 ± 0.687
0.809LysHis: 0.809 ± 0.27
6.267LysIle: 6.267 ± 0.867
8.188LysLys: 8.188 ± 1.296
7.379LysLeu: 7.379 ± 0.809
1.617LysMet: 1.617 ± 0.375
5.661LysAsn: 5.661 ± 0.98
3.437LysPro: 3.437 ± 0.521
4.448LysGln: 4.448 ± 0.713
3.841LysArg: 3.841 ± 0.644
5.762LysSer: 5.762 ± 0.771
4.65LysThr: 4.65 ± 0.731
4.347LysVal: 4.347 ± 0.948
1.011LysTrp: 1.011 ± 0.355
3.538LysTyr: 3.538 ± 0.749
0.0LysXaa: 0.0 ± 0.0
Leu
5.458LeuAla: 5.458 ± 0.732
0.404LeuCys: 0.404 ± 0.203
4.852LeuAsp: 4.852 ± 0.557
5.155LeuGlu: 5.155 ± 0.782
3.639LeuPhe: 3.639 ± 0.647
5.256LeuGly: 5.256 ± 0.729
0.606LeuHis: 0.606 ± 0.236
5.458LeuIle: 5.458 ± 0.681
8.895LeuLys: 8.895 ± 1.27
5.661LeuLeu: 5.661 ± 0.844
2.426LeuMet: 2.426 ± 0.54
5.256LeuAsn: 5.256 ± 0.633
3.437LeuPro: 3.437 ± 0.505
2.527LeuGln: 2.527 ± 0.572
3.336LeuArg: 3.336 ± 0.566
7.076LeuSer: 7.076 ± 0.73
4.852LeuThr: 4.852 ± 0.77
3.841LeuVal: 3.841 ± 0.517
1.011LeuTrp: 1.011 ± 0.374
2.022LeuTyr: 2.022 ± 0.515
0.0LeuXaa: 0.0 ± 0.0
Met
2.123MetAla: 2.123 ± 0.398
0.404MetCys: 0.404 ± 0.208
1.213MetAsp: 1.213 ± 0.304
1.921MetGlu: 1.921 ± 0.494
0.404MetPhe: 0.404 ± 0.207
0.708MetGly: 0.708 ± 0.353
0.101MetHis: 0.101 ± 0.106
1.617MetIle: 1.617 ± 0.344
2.224MetLys: 2.224 ± 0.6
1.718MetLeu: 1.718 ± 0.413
0.606MetMet: 0.606 ± 0.255
1.112MetAsn: 1.112 ± 0.308
1.011MetPro: 1.011 ± 0.314
1.011MetGln: 1.011 ± 0.274
0.91MetArg: 0.91 ± 0.272
2.123MetSer: 2.123 ± 0.427
2.527MetThr: 2.527 ± 0.395
1.011MetVal: 1.011 ± 0.306
0.303MetTrp: 0.303 ± 0.152
1.011MetTyr: 1.011 ± 0.284
0.0MetXaa: 0.0 ± 0.0
Asn
3.639AsnAla: 3.639 ± 0.707
0.202AsnCys: 0.202 ± 0.139
3.134AsnAsp: 3.134 ± 0.602
2.931AsnGlu: 2.931 ± 0.505
3.235AsnPhe: 3.235 ± 0.496
4.549AsnGly: 4.549 ± 0.801
0.91AsnHis: 0.91 ± 0.424
3.942AsnIle: 3.942 ± 0.657
5.256AsnLys: 5.256 ± 0.627
4.751AsnLeu: 4.751 ± 1.038
0.91AsnMet: 0.91 ± 0.273
3.336AsnAsn: 3.336 ± 0.663
2.628AsnPro: 2.628 ± 0.485
3.134AsnGln: 3.134 ± 0.603
1.819AsnArg: 1.819 ± 0.495
3.336AsnSer: 3.336 ± 0.486
3.032AsnThr: 3.032 ± 0.471
4.043AsnVal: 4.043 ± 0.546
0.505AsnTrp: 0.505 ± 0.246
1.921AsnTyr: 1.921 ± 0.557
0.0AsnXaa: 0.0 ± 0.0
Pro
1.314ProAla: 1.314 ± 0.359
0.202ProCys: 0.202 ± 0.13
1.921ProAsp: 1.921 ± 0.509
2.426ProGlu: 2.426 ± 0.475
1.314ProPhe: 1.314 ± 0.33
1.011ProGly: 1.011 ± 0.32
0.202ProHis: 0.202 ± 0.136
2.83ProIle: 2.83 ± 0.459
2.729ProLys: 2.729 ± 0.596
3.74ProLeu: 3.74 ± 0.622
1.011ProMet: 1.011 ± 0.331
2.224ProAsn: 2.224 ± 0.572
0.708ProPro: 0.708 ± 0.258
2.123ProGln: 2.123 ± 0.433
1.112ProArg: 1.112 ± 0.341
2.123ProSer: 2.123 ± 0.363
1.921ProThr: 1.921 ± 0.416
2.527ProVal: 2.527 ± 0.518
0.202ProTrp: 0.202 ± 0.127
0.91ProTyr: 0.91 ± 0.34
0.0ProXaa: 0.0 ± 0.0
Gln
4.347GlnAla: 4.347 ± 0.554
0.101GlnCys: 0.101 ± 0.09
1.415GlnAsp: 1.415 ± 0.459
3.841GlnGlu: 3.841 ± 0.61
1.415GlnPhe: 1.415 ± 0.416
1.921GlnGly: 1.921 ± 0.411
0.303GlnHis: 0.303 ± 0.178
2.931GlnIle: 2.931 ± 0.538
4.549GlnLys: 4.549 ± 0.647
3.841GlnLeu: 3.841 ± 0.641
0.91GlnMet: 0.91 ± 0.34
2.325GlnAsn: 2.325 ± 0.519
1.415GlnPro: 1.415 ± 0.358
2.224GlnGln: 2.224 ± 0.461
1.921GlnArg: 1.921 ± 0.302
3.032GlnSer: 3.032 ± 0.543
2.527GlnThr: 2.527 ± 0.438
1.921GlnVal: 1.921 ± 0.439
0.606GlnTrp: 0.606 ± 0.224
1.516GlnTyr: 1.516 ± 0.468
0.0GlnXaa: 0.0 ± 0.0
Arg
2.83ArgAla: 2.83 ± 0.55
0.404ArgCys: 0.404 ± 0.265
2.628ArgAsp: 2.628 ± 0.575
1.819ArgGlu: 1.819 ± 0.391
1.718ArgPhe: 1.718 ± 0.415
1.011ArgGly: 1.011 ± 0.375
0.303ArgHis: 0.303 ± 0.17
3.235ArgIle: 3.235 ± 0.642
4.043ArgLys: 4.043 ± 0.811
4.043ArgLeu: 4.043 ± 0.763
0.809ArgMet: 0.809 ± 0.283
1.921ArgAsn: 1.921 ± 0.411
1.011ArgPro: 1.011 ± 0.377
1.415ArgGln: 1.415 ± 0.335
1.516ArgArg: 1.516 ± 0.595
2.325ArgSer: 2.325 ± 0.449
1.921ArgThr: 1.921 ± 0.421
1.314ArgVal: 1.314 ± 0.365
0.303ArgTrp: 0.303 ± 0.184
1.819ArgTyr: 1.819 ± 0.512
0.0ArgXaa: 0.0 ± 0.0
Ser
4.852SerAla: 4.852 ± 1.113
0.404SerCys: 0.404 ± 0.201
4.245SerAsp: 4.245 ± 0.55
3.538SerGlu: 3.538 ± 0.61
2.325SerPhe: 2.325 ± 0.384
7.682SerGly: 7.682 ± 1.386
0.606SerHis: 0.606 ± 0.268
4.852SerIle: 4.852 ± 0.632
5.155SerLys: 5.155 ± 0.562
5.863SerLeu: 5.863 ± 1.105
2.325SerMet: 2.325 ± 0.496
4.953SerAsn: 4.953 ± 0.978
1.921SerPro: 1.921 ± 0.497
2.426SerGln: 2.426 ± 0.534
2.325SerArg: 2.325 ± 0.583
5.762SerSer: 5.762 ± 0.989
5.054SerThr: 5.054 ± 0.84
4.751SerVal: 4.751 ± 0.704
0.708SerTrp: 0.708 ± 0.25
2.527SerTyr: 2.527 ± 0.555
0.0SerXaa: 0.0 ± 0.0
Thr
4.144ThrAla: 4.144 ± 0.819
0.202ThrCys: 0.202 ± 0.144
3.841ThrAsp: 3.841 ± 0.54
3.639ThrGlu: 3.639 ± 0.55
2.83ThrPhe: 2.83 ± 0.678
4.751ThrGly: 4.751 ± 0.885
0.91ThrHis: 0.91 ± 0.3
4.347ThrIle: 4.347 ± 0.761
4.347ThrLys: 4.347 ± 0.737
5.256ThrLeu: 5.256 ± 0.801
1.213ThrMet: 1.213 ± 0.305
2.628ThrAsn: 2.628 ± 0.582
2.628ThrPro: 2.628 ± 0.548
2.628ThrGln: 2.628 ± 0.42
1.314ThrArg: 1.314 ± 0.402
3.538ThrSer: 3.538 ± 0.768
3.942ThrThr: 3.942 ± 0.718
6.065ThrVal: 6.065 ± 0.749
1.011ThrTrp: 1.011 ± 0.262
2.325ThrTyr: 2.325 ± 0.717
0.0ThrXaa: 0.0 ± 0.0
Val
3.235ValAla: 3.235 ± 0.562
0.606ValCys: 0.606 ± 0.287
4.347ValAsp: 4.347 ± 0.641
5.863ValGlu: 5.863 ± 0.75
2.426ValPhe: 2.426 ± 0.522
4.347ValGly: 4.347 ± 0.672
1.011ValHis: 1.011 ± 0.318
4.043ValIle: 4.043 ± 0.689
4.953ValLys: 4.953 ± 0.712
4.751ValLeu: 4.751 ± 0.597
1.617ValMet: 1.617 ± 0.41
3.235ValAsn: 3.235 ± 0.674
2.123ValPro: 2.123 ± 0.466
1.617ValGln: 1.617 ± 0.355
1.617ValArg: 1.617 ± 0.439
4.245ValSer: 4.245 ± 0.809
3.841ValThr: 3.841 ± 0.785
4.751ValVal: 4.751 ± 0.728
0.708ValTrp: 0.708 ± 0.235
2.123ValTyr: 2.123 ± 0.508
0.0ValXaa: 0.0 ± 0.0
Trp
0.708TrpAla: 0.708 ± 0.304
0.202TrpCys: 0.202 ± 0.133
0.606TrpAsp: 0.606 ± 0.283
0.809TrpGlu: 0.809 ± 0.224
0.809TrpPhe: 0.809 ± 0.329
0.809TrpGly: 0.809 ± 0.352
0.101TrpHis: 0.101 ± 0.102
0.809TrpIle: 0.809 ± 0.266
0.708TrpLys: 0.708 ± 0.281
0.91TrpLeu: 0.91 ± 0.255
0.202TrpMet: 0.202 ± 0.153
1.112TrpAsn: 1.112 ± 0.307
0.101TrpPro: 0.101 ± 0.086
0.809TrpGln: 0.809 ± 0.199
0.505TrpArg: 0.505 ± 0.236
1.314TrpSer: 1.314 ± 0.411
0.606TrpThr: 0.606 ± 0.255
0.91TrpVal: 0.91 ± 0.316
0.202TrpTrp: 0.202 ± 0.147
0.303TrpTyr: 0.303 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.819TyrAla: 1.819 ± 0.455
0.101TyrCys: 0.101 ± 0.108
2.931TyrAsp: 2.931 ± 0.549
2.729TyrGlu: 2.729 ± 0.573
1.314TyrPhe: 1.314 ± 0.31
2.426TyrGly: 2.426 ± 0.454
1.011TyrHis: 1.011 ± 0.286
1.819TyrIle: 1.819 ± 0.359
2.931TyrLys: 2.931 ± 0.609
2.325TyrLeu: 2.325 ± 0.529
1.112TyrMet: 1.112 ± 0.272
1.617TyrAsn: 1.617 ± 0.408
1.415TyrPro: 1.415 ± 0.346
1.921TyrGln: 1.921 ± 0.38
1.921TyrArg: 1.921 ± 0.462
2.83TyrSer: 2.83 ± 0.545
1.617TyrThr: 1.617 ± 0.47
1.921TyrVal: 1.921 ± 0.38
0.404TyrTrp: 0.404 ± 0.194
1.213TyrTyr: 1.213 ± 0.323
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (9894 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski